DeepSeek V3, SGLang, and the state of Open Model Inference in 2025 (Quantization, MoEs, Pricing)

DeepSeek V3, SGLang, and the state of Open Model Inference in 2025 (Quantization, MoEs, Pricing)
Transcript
hey everyone welcome back to the laden space podcast our first recording of 2025 I'm alesio partner and CTO at desable partners and I'm joined by my co-host swix founder of small AI hey and today we are here with a special double guest episode with Amir oh my God I don't know your last hagat that's close enough that is good go that's really good an... Read More
Install to Summarize YouTube Videos and Get Transcripts
Explore YouTube Video Summarizer or Get YouTube Transcript Extractor
Read in Other Languages (beta)
Share This Summary 📚
Summarize YouTube Videos and Get Video Transcripts with 1-Click
Download browser extensions on:
Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator
Explore More Summaries from Latent Space 📚

The End of Finetuning — with Jeremy Howard of Fast.ai
Latent Space - The AI Engineer Podcast (Video Podcast)

⚡️ARC-AGI-3: The Interactive Reasoning Benchmark
Latent Space

Language Agents: From Reasoning to Acting — with Shunyu Yao of OpenAI, Harrison Chase of LangGraph
Latent Space

The Origin and Future of RLHF: the secret ingredient for ChatGPT - with Nathan Lambert
Latent Space - The AI Engineer Podcast (Video Podcast)

LIVE from GTC: DGX Spark Insides First Look
Latent Space

Best of 2024 in Agents (from #1 on SWE-Bench Full, Prof. Graham Neubig of OpenHands/AllHands)
Latent Space
Summarize YouTube Videos and Get Video Transcripts with 1-Click
Download browser extensions on:
Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator