DeepSeek V3, SGLang, and the state of Open Model Inference in 2025 (Quantization, MoEs, Pricing)

DeepSeek V3, SGLang, and the state of Open Model Inference in 2025 (Quantization, MoEs, Pricing)
Transcript
hey everyone welcome back to the laden space podcast our first recording of 2025 I'm alesio partner and CTO at desable partners and I'm joined by my co-host swix founder of small AI hey and today we are here with a special double guest episode with Amir oh my God I don't know your last hagat that's close enough that is good go that's really good an... Read More
Install to Summarize YouTube Videos and Get Transcripts
Explore YouTube Video Summarizer or Get YouTube Transcript Extractor
Read in Other Languages (beta)
Share This Summary 📚
Summarize YouTube Videos and Get Video Transcripts with 1-Click
Download browser extensions on:
Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator
Explore More Summaries from Latent Space 📚

Agents @ Work: Lindy.ai (with live demo!)
Latent Space

The AI Coding Factory
Latent Space

Agent Engineering with Pydantic + Graphs — with Samuel Colvin, CEO of Pydantic Logfire
Latent Space

Why is everyone cloning Deep Research?
Latent Space

⚡️Accelerators @ 3x NVIDIA H200 perf, Made in the USA - Thomas Sohmers + Mitesh Agrawal, Positron AI
Latent Space

Outlasting Noam Shazeer, Crowdsourcing Chai AI w/ 1.4m DAU — with William Beauchamp, Chai Research
Latent Space
Summarize YouTube Videos and Get Video Transcripts with 1-Click
Download browser extensions on:
Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator