How to Train Your Agent: Building Reliable Agents with RL — Kyle Corbitt, OpenPipe

How to Train Your Agent: Building Reliable Agents with RL — Kyle Corbitt, OpenPipe
Transcript
um hey everyone glad you're all here this is the reasoning and reinforcement learning track uh on the afternoon of the last day of the AI engineer world's fair glad you're all here glad you're sharing it with us today what I'm going to talk about is uh a very specific case study um that we did uh this case study I'm going to talk about lessons lear... Read More
Install to Summarize YouTube Videos and Get Transcripts
Explore YouTube Video Summarizer or Get YouTube Transcript Extractor
Read in Other Languages (beta)
Share This Summary 📚
Summarize YouTube Videos and Get Video Transcripts with 1-Click
Download browser extensions on:
Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator
Explore More Summaries from AI Engineer 📚

Small AI Teams with Huge Impact — Vik Paruchuri, Datalab
AI Engineer

The New Code — Sean Grove, OpenAI
AI Engineer

Your MCP Server is Bad (and you should feel bad) - Jeremiah Lowin, Prefect
AI Engineer

RL Environments at Scale – Will Brown, Prime Intellect
AI Engineer

The Next Unicorns: 7 Top AI startups from the HF0 Residency
AI Engineer

AI Engineer World’s Fair 2025 - Day 2 Keynotes & SWE Agents track
AI Engineer
Summarize YouTube Videos and Get Video Transcripts with 1-Click
Download browser extensions on:
Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator