How Shazam Works (Probably!) - Computerphile

TL;DR
Shazam is a mobile service that uses Fast Fourier Transform (FFT) to analyze audio and identify songs by matching frequency patterns.
Transcript
one thing i'd asked somebody a long time ago was how does this service shazam works and shazam for those who who don't know is a service where you can hold your smartphone up press a button it listens to a song and then it tells you who the song is by and and you know what the song is called and we had a discussion and you've done some research int... Read More
Key Insights
- ❓ Shazam's algorithm has evolved over the years, and the current version is likely more advanced than the original proposal.
- 👻 FFT plays a crucial role in Shazam's audio analysis process, allowing it to extract frequency information and create audio fingerprints.
- 🧡 Different instruments have specific frequency ranges, and Shazam can identify them by analyzing the frequency components of a song.
- ❓ Shazam's analysis focuses on frequencies between 100 Hz and 5000 Hz to capture the prominent components of a song.
- 👥 Matching audio fingerprints involves comparing and grouping similar frequency components to create a unique identification pattern.
- 🪪 The quality of the microphone used to record audio for identification does affect the accuracy of the results but does not hinder the overall identification process.
- 😒 Shazam's matching algorithm uses anchor points and tolerance for timing differences to find matches between audio fingerprints more efficiently.
Install to Summarize YouTube Videos and Get Transcripts
Explore YouTube Video Summarizer or Get YouTube Transcript Extractor
Questions & Answers
Q: How does Shazam work?
Shazam uses FFT to analyze songs and create audio fingerprints based on the frequency components of each 100 millisecond chunk. It then compares these fingerprints to a database for identification.
Q: Does Shazam only use beats per minute (BPM) for song identification?
BPM is an emergent property that Shazam detects, but it is not the primary method of identification. Shazam analyzes the volume levels of different frequencies using FFT to create more accurate audio fingerprints.
Q: How does FFT work in Shazam?
FFT involves slicing the audio into 100 millisecond chunks and analyzing the volume levels of each frequency within those chunks. By comparing and averaging these levels, FFT helps identify the prominent frequencies in the song.
Q: What is the significance of audio frequencies in Shazam's analysis?
Audio frequencies represent different elements and instruments within a song. By analyzing the specific frequencies and their volume levels, Shazam can identify the unique characteristics of a song and match it to its database.
Summary & Key Takeaways
-
Shazam uses FFT, a mathematical algorithm, to break down songs into frequency components and compare them to a database for identification.
-
The FFT process involves slicing the song into 100 millisecond chunks and analyzing the volume levels of each frequency within those chunks.
-
Shazam then groups similar frequencies together to determine the prominent frequencies in the song and uses them to create a unique audio fingerprint for identification.
Read in Other Languages (beta)
Share This Summary 📚
Summarize YouTube Videos and Get Video Transcripts with 1-Click
Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator
Explore More Summaries from Computerphile 📚






Summarize YouTube Videos and Get Video Transcripts with 1-Click
Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator