Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 8: Reward Learning

Stanford CS224R Deep Reinforcement Learning | Spring 2025 | Lecture 8: Reward Learning
Transcript
All right, let's get started. Great. So for today, we're planning to finish offline reinforcement learning. And along the reward learning front, the key goals today are to figure out why is task specification hard. Why do naive methods for trying to specify rewards fail? And what are some methods for learning reward functions from human supervision... Read More
Install to Summarize YouTube Videos and Get Transcripts
Explore YouTube Video Summarizer or Get YouTube Transcript Extractor
Read in Other Languages (beta)
Share This Summary 📚
Summarize YouTube Videos and Get Video Transcripts with 1-Click
Download browser extensions on:
Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator
Explore More Summaries from Stanford Online 📚

Stanford CS224N NLP with Deep Learning | Winter 2021 | Lecture 16 - Social & Ethical Considerations
Stanford Online

Stanford CS229: Machine Learning | Summer 2019 | Lecture 20 - Variational Autoencoder
Stanford Online

Stanford AA228/CS238 Decision Making Under Uncertainty I Policy Gradient Estimation and Optimization
Stanford Online

Stanford Webinar - GPT-3 & Beyond
Stanford Online

Bayesian Networks 4 - Probabilistic Inference | Stanford CS221: AI (Autumn 2021)
Stanford Online
Summarize YouTube Videos and Get Video Transcripts with 1-Click
Download browser extensions on:
Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator