Products
Features
YouTube Video Summarizer
Summarize YouTube videos
Web & PDF Highlighter
Highlight web pages & PDFs
Chat with PDF
Ask any PDF questions with AI
Ask AI Clone
Chat with your highlights & memories
Audio Transcriber
Transcribe audio files to text
Glasp Reader
Read and highlight articles
Kindle Highlight Export
Export your Kindle highlights
Idea Hatch
Hatch ideas from your highlights
Integrations
Obsidian Plugin
Notion Integration
Pocket Integration
Instapaper Integration
Medium Integration
Readwise Integration
Snipd Integration
Hypothesis Integration
Apps & Extensions
Chrome Extension
Safari Extension
Edge Add-ons
Firefox Add-ons
iOS App
Android App
Discover
Discover
Ideas
Discover new ideas and insights
Articles
Curated articles and insights
Books
Book recommendations by great minds
Posts
Essays and notes from readers
Quotes
Inspiring quotes collection
Videos
Curated videos and summaries
Explore Glasp
Glasp Newsletter
Weekly insights and updates
Glasp Talk
Interview series with great minds
Glasp Blog
Latest news and articles
Glasp Use Cases
Learn how others use Glasp
Build & Support
Glasp API
Access Glasp's API for developers
MCP Connector
Connect Glasp to Claude & ChatGPT
Community
Glasp Reddit Community
Students
Student discount and benefits
FAQs
Frequently Asked Questions
AboutPricing
DashboardLog inSign up

This AI Makes "Audio Deepfakes"!

617.7K views
•
April 8, 2020
by
Two Minute Papers
YouTube video player
This AI Makes "Audio Deepfakes"!

TL;DR

Deepfake technology has advanced to the point where it can convincingly animate video footage using synthesized audio, achieved through techniques like Tacotron 2 and Neural Voice Puppetry.

Transcript

Dear Fellow Scholars, this is Two Minute Papers with this guy's name that is impossible to pronounce. My name is Dr. Károly Zsolnai-Fehér, and indeed, it seems that pronouncing my name requires some advanced technology. So what was this? I promise to tell you in a moment, but to understand what happened here, first, let’s have a look at this deepfa... Read More

Key Insights

  • 😥 Deepfake technology has advanced to the point where it can accurately animate video footage using synthesized audio.
  • 👂 Tacotron 2 is an AI-based voice cloning technique that can synthesize new sentences in a person's voice using a 5-second sound sample.
  • 🙊 Neural Voice Puppetry combines Tacotron 2 with video footage to make the target subject appear as if they are speaking the synthesized audio.
  • 🎯 The deepfake techniques showcased in the video achieve superior quality and can generalize to multiple target subjects.
  • 🏃 The neural rendering part of the process runs in real time, further enhancing the realism of the animated video.
  • 🎯 The combination of multiple existing techniques enables joint video and audio synthesis for a target subject.
  • 🫵 Viewers are encouraged to try out the deepfake tool themselves and share their results.

Install to Summarize YouTube Videos and Get Transcripts

Explore YouTube Video Summarizer or Get YouTube Transcript Extractor

Questions & Answers

Q: How does deepfake technology animate video footage using synthesized audio?

Deepfake technology uses techniques like Tacotron 2 and Neural Voice Puppetry to translate mouth, head, and eye movements to a chosen target subject using just one photograph and synthesize new sentences in a person's voice using a sound sample. It then applies these gestures and motions to an intermediate 3D model and adapts it to the target subject using a neural renderer.

Q: Can deepfake technology synthesize sounds and consonants not heard in the original voice sample?

Yes, deepfake technology, such as Tacotron 2, is capable of synthesizing sounds and consonants that were not heard in the original voice sample. This is achieved through advanced AI techniques that infer and generate these sounds based on the given sample.

Q: How does Neural Voice Puppetry improve upon previous techniques?

Neural Voice Puppetry combines the synthesized audio from Tacotron 2 with video footage, animating it to make the target subject appear as if they are speaking the synthesized audio. This technique improves upon previous methods by achieving a higher level of realism and synchronization between the audio and video.

Q: Can anyone try out these deepfake techniques for themselves?

Yes, viewers are encouraged to try out these deepfake techniques themselves. The link to the tool is provided in the video description, and users can leave comments with their results.

Summary & Key Takeaways

  • Deepfake techniques can now transfer realistic mouth, head, and eye movements to a chosen target subject using just one photograph.

  • Tacotron 2 is an AI-based voice cloning technique that can synthesize new sentences in a person's voice using just a 5-second sound sample.

  • Neural Voice Puppetry combines Tacotron 2 with video footage, animating it to make the target subject appear as if they are speaking the synthesized audio.


Read in Other Languages (beta)

English

Share This Summary 📚

Summarize YouTube Videos and Get Video Transcripts with 1-Click

Download browser extensions on:

Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator

Explore More Summaries from Two Minute Papers 📚

This Adorable Baby T-Rex AI Learned To Dribble 🦖 thumbnail
This Adorable Baby T-Rex AI Learned To Dribble 🦖
Two Minute Papers
DeepMind’s New AI Makes Games From Scratch! thumbnail
DeepMind’s New AI Makes Games From Scratch!
Two Minute Papers
Is Visualizing Light Waves Possible? ☀️ thumbnail
Is Visualizing Light Waves Possible? ☀️
Two Minute Papers
Finally, Instant Monsters! 🐉 thumbnail
Finally, Instant Monsters! 🐉
Two Minute Papers
NVIDIA’s Robot AI Finally Enters The Real World! 🤖 thumbnail
NVIDIA’s Robot AI Finally Enters The Real World! 🤖
Two Minute Papers
How to Create Virtual Worlds with AI thumbnail
How to Create Virtual Worlds with AI
Two Minute Papers

Summarize YouTube Videos and Get Video Transcripts with 1-Click

Download browser extensions on:

Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator

Apps & Extensions

  • Chrome Extension
  • Safari Extension
  • Edge Add-ons
  • Firefox Add-ons
  • iOS App
  • Android App

Key Features

  • YouTube Video Summarizer
  • Web & PDF Summarizer
  • Web & PDF Highlighter
  • Chat with PDF
  • Ask AI Clone
  • Audio Transcriber
  • Glasp Reader
  • Kindle Highlight Export
  • Idea Hatch

Integrations

  • Obsidian Plugin
  • Notion Integration
  • Pocket Integration
  • Instapaper Integration
  • Medium Integration
  • Readwise Integration
  • Snipd Integration
  • Hypothesis Integration

More Features

  • APIs
  • MCP Connector
  • Blog & Post
  • Embed Links
  • Image Highlight
  • Personality Test
  • Quote Shots

Company

  • About us
  • Blog
  • Community
  • FAQs
  • Job Board
  • Newsletter
  • Pricing
Terms

•

Privacy

•

Guidelines

© 2026 Glasp Inc. All rights reserved.