Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 6: Kernels, Triton

Stanford CS336 Language Modeling from Scratch | Spring 2025 | Lecture 6: Kernels, Triton
Transcript
Today, we're going to be going into details on writing high performance code for GPUs. So part of assignment 2 is going to be you're going to have to do a bunch of profiling, you will have to write your own Triton kernel for FlashAttention-2. You will need to make all of this stuff very high performance. And so in this lecture, we're going to drill... Read More
Install to Summarize YouTube Videos and Get Transcripts
Explore YouTube Video Summarizer or Get YouTube Transcript Extractor
Read in Other Languages (beta)
Share This Summary 📚
Summarize YouTube Videos and Get Video Transcripts with 1-Click
Download browser extensions on:
Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator
Explore More Summaries from Stanford Online 📚

Stanford CS229: Machine Learning | Summer 2019 | Lecture 20 - Variational Autoencoder
Stanford Online

Stanford CS224N NLP with Deep Learning | Winter 2021 | Lecture 16 - Social & Ethical Considerations
Stanford Online

Bayesian Networks 4 - Probabilistic Inference | Stanford CS221: AI (Autumn 2021)
Stanford Online

Stanford Webinar - GPT-3 & Beyond
Stanford Online

Stanford AA228/CS238 Decision Making Under Uncertainty I Policy Gradient Estimation and Optimization
Stanford Online
Summarize YouTube Videos and Get Video Transcripts with 1-Click
Download browser extensions on:
Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator