Products
Features
YouTube Video Summarizer
Summarize YouTube videos
Web & PDF Highlighter
Highlight web pages & PDFs
Chat with PDF
Ask any PDF questions with AI
Ask AI Clone
Chat with your highlights & memories
Audio Transcriber
Transcribe audio files to text
Glasp Reader
Read and highlight articles
Kindle Highlight Export
Export your Kindle highlights
Idea Hatch
Hatch ideas from your highlights
Integrations
Obsidian Plugin
Notion Integration
Pocket Integration
Instapaper Integration
Medium Integration
Readwise Integration
Snipd Integration
Hypothesis Integration
Apps & Extensions
Chrome Extension
Safari Extension
Edge Add-ons
Firefox Add-ons
iOS App
Android App
Discover
Discover
Ideas
Discover new ideas and insights
Articles
Curated articles and insights
Books
Book recommendations by great minds
Posts
Essays and notes from readers
Quotes
Inspiring quotes collection
Videos
Curated videos and summaries
Explore Glasp
Glasp Newsletter
Weekly insights and updates
Glasp Talk
Interview series with great minds
Glasp Blog
Latest news and articles
Glasp Use Cases
Learn how others use Glasp
Build & Support
Glasp API
Access Glasp's API for developers
MCP Connector
Connect Glasp to Claude & ChatGPT
Community
Glasp Reddit Community
Students
Student discount and benefits
FAQs
Frequently Asked Questions
AboutPricing
DashboardLog inSign up

Large Language Models (in 2023)

69.8K views
•
October 4, 2023
by
Hyung Won Chung
YouTube video player
Large Language Models (in 2023)

TL;DR

Large language models have unique abilities that emerge at certain scales, necessitating a change in perspective. Fundamental ideas and scaling are crucial, but scaling alone cannot solve all problems.

Transcript

yeah I will talk about large language models in 2023 why do we need that suffix in 2023 what we call large is misleading in the large models of today will be small models only only in a few years along with the change in the perception of scale many insights observations and conclusions we make along the way with the current large damage models wil... Read More

Key Insights

  • 🌥️ Large language models exhibit emergent abilities at certain scales, challenging traditional perspectives.
  • 🌥️ The perception of scale and "large" models is constantly changing and evolving.
  • ⚾ Insights based on first principles are more reliable and enduring than advanced ideas.
  • ⚖️ Scaling in language models is achieved through Transformer architecture and distributed computing.
  • 💡 Researchers must unlearn intuitions based on invalidated ideas and consider the potential of new ideas in the future.
  • 👻 Instruction fine-tuning allows tasks to be framed as natural language instructions for better model understanding.
  • ❓ RLHF (Reward Model and Policy Model Training) offers a paradigm shift in learning objective function, providing more scalable and expressive models.

Install to Summarize YouTube Videos and Get Transcripts

Explore YouTube Video Summarizer or Get YouTube Transcript Extractor

Questions & Answers

Q: Why is scale important in large language models?

Scale is important because it enables the emergence of unique abilities in large language models. Small models are often unable to solve certain tasks, but at a certain scale, they suddenly become capable of solving them.

Q: What is the significance of changing the perspective on large language models?

Changing the perspective allows researchers to view new ideas as possibilities for the future. Instead of assuming that an idea does not work, they can consider that it may not work with the current generation of models, but could be effective in the future.

Q: How does scaling impact the perception of size in language models?

Scaling changes the perception of what is considered "large" in language models. What may be considered large today will be considered small in a few years. This shift in perception is significant for researchers and practitioners in the field.

Q: Why is it important to unlearn intuitions based on invalidated ideas?

With advancements in large language models, many intuitions and ideas become outdated or contradictory. It is crucial to constantly update and unlearn these invalidated ideas to ensure continued progress and accuracy in the field.

Summary & Key Takeaways

  • Large language models exhibit emergent abilities at certain scales, leading to a change in perspective.

  • The perception of scale and what is considered "large" will change over time.

  • Insights based on first principles tend to be more stable and reliable than advanced ideas.

  • Large language models use the Transformer architecture, which involves scalable matrix multiplication.


Read in Other Languages (beta)

English

Share This Summary 📚

Summarize YouTube Videos and Get Video Transcripts with 1-Click

Download browser extensions on:

Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator

Explore More Summaries from Hyung Won Chung 📚

Cornell AI history lecture thumbnail
Cornell AI history lecture
Hyung Won Chung

Summarize YouTube Videos and Get Video Transcripts with 1-Click

Download browser extensions on:

Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator

Apps & Extensions

  • Chrome Extension
  • Safari Extension
  • Edge Add-ons
  • Firefox Add-ons
  • iOS App
  • Android App

Key Features

  • YouTube Video Summarizer
  • Web & PDF Summarizer
  • Web & PDF Highlighter
  • Chat with PDF
  • Ask AI Clone
  • Audio Transcriber
  • Glasp Reader
  • Kindle Highlight Export
  • Idea Hatch

Integrations

  • Obsidian Plugin
  • Notion Integration
  • Pocket Integration
  • Instapaper Integration
  • Medium Integration
  • Readwise Integration
  • Snipd Integration
  • Hypothesis Integration

More Features

  • APIs
  • MCP Connector
  • Blog & Post
  • Embed Links
  • Image Highlight
  • Personality Test
  • Quote Shots

Company

  • About us
  • Blog
  • Community
  • FAQs
  • Job Board
  • Newsletter
  • Pricing
Terms

•

Privacy

•

Guidelines

© 2026 Glasp Inc. All rights reserved.