Products
Features
YouTube Video Summarizer
Summarize YouTube videos
Web & PDF Highlighter
Highlight web pages & PDFs
Chat with PDF
Ask any PDF questions with AI
Ask AI Clone
Chat with your highlights & memories
Audio Transcriber
Transcribe audio files to text
Glasp Reader
Read and highlight articles
Kindle Highlight Export
Export your Kindle highlights
Idea Hatch
Hatch ideas from your highlights
Integrations
Obsidian Plugin
Notion Integration
Pocket Integration
Instapaper Integration
Medium Integration
Readwise Integration
Snipd Integration
Hypothesis Integration
Apps & Extensions
Chrome Extension
Safari Extension
Edge Add-ons
Firefox Add-ons
iOS App
Android App
Discover
Discover
Ideas
Discover new ideas and insights
Articles
Curated articles and insights
Books
Book recommendations by great minds
Posts
Essays and notes from readers
Quotes
Inspiring quotes collection
Videos
Curated videos and summaries
Explore Glasp
Glasp Newsletter
Weekly insights and updates
Glasp Talk
Interview series with great minds
Glasp Blog
Latest news and articles
Glasp Use Cases
Learn how others use Glasp
Build & Support
Glasp API
Access Glasp's API for developers
MCP Connector
Connect Glasp to Claude & ChatGPT
Community
Glasp Reddit Community
Students
Student discount and benefits
FAQs
Frequently Asked Questions
AboutPricing
DashboardLog inSign up

NEW AI Jailbreak Method SHATTERS GPT4, Claude, Gemini, LLaMA

242.6K views
•
March 8, 2024
by
Matthew Berman
YouTube video player
NEW AI Jailbreak Method SHATTERS GPT4, Claude, Gemini, LLaMA

TL;DR

A new jailbreak technique using ASCII art has emerged, allowing large language models to bypass filters and censorship.

Transcript

there is a new jailbreak technique that has AI companies scrambling and it actually uses something that's been on the internet for pretty much as long as the internet has been around so I'm going to tell you about it and then we're going to test it out and see if it works all right this is the research paper but before we actually get into it let m... Read More

Key Insights

  • 🌥️ Large language models have become more aligned with safety measures, making jailbreaking techniques more challenging.
  • 🥰 The ASCII art-based jailbreak technique leverages ASCII art representations to bypass model filters and censorship.
  • 🥰 State-of-the-art language models, including GPT 3.5, GPT 4, Gemini, Claude, and Llama 2, exhibit vulnerability to the ASCII art-based jailbreak technique.
  • 🦺 Previous jailbreaking techniques have been patched to enhance model safety and alignment.
  • 👊 The research paper suggests that semantics-only interpretation of prompts during safety alignment can create vulnerabilities to jailbreak attacks.
  • 🥰 The paper introduces a comprehensive benchmark challenge to measure the susceptibility of language models to the ASCII art-based jailbreak technique.
  • 🌍 The success rate of the ASCII art-based jailbreak technique varies across different models, with GPT 4 showing the highest susceptibility.

Install to Summarize YouTube Videos and Get Transcripts

Explore YouTube Video Summarizer or Get YouTube Transcript Extractor

Questions & Answers

Q: What is jailbreaking in the context of large language models?

Jailbreaking refers to finding creative prompts to trick large language models into providing information that they are typically trained not to provide.

Q: How does the ASCII art-based jailbreak technique work?

This technique masks forbidden words with ASCII art representations, fooling the model into not recognizing them. The masked prompts can then bypass filters and obtain the desired information.

Q: Are all large language models susceptible to the ASCII art-based jailbreak technique?

The research paper shows that even state-of-the-art models like GPT 3.5, GPT 4, Gemini, Claude, and Llama 2 struggle to recognize prompts provided in the form of ASCII art.

Q: What are some other jailbreaking techniques that have been discovered?

Other techniques include direct instruction prompting, greedy coordinate gradient, autoddan, prompt automatic iterative refinement, and deep inception. Each technique aims to bypass filters and solicit unintended behaviors from the models.

Summary & Key Takeaways

  • Jailbreaking refers to obtaining forbidden information from large language models like GPT through creative prompts.

  • Researchers have discovered a new technique called "ASCII art-based jailbreak" that masks prompts using ASCII art, allowing them to bypass model filters.

  • This technique was tested on various state-of-the-art language models and found to have a high success rate.


Read in Other Languages (beta)

English

Share This Summary 📚

Summarize YouTube Videos and Get Video Transcripts with 1-Click

Download browser extensions on:

Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator

Explore More Summaries from Matthew Berman 📚

Former Google CEO Spills ALL! (Google AI is Doomed) thumbnail
Former Google CEO Spills ALL! (Google AI is Doomed)
Matthew Berman
Is Qwen's New Image Model the Best? thumbnail
Is Qwen's New Image Model the Best?
Matthew Berman
OpenAI Dropped a FRONTIER Open-Source Model thumbnail
OpenAI Dropped a FRONTIER Open-Source Model
Matthew Berman
Figure Robotics FIRED Their Head of Safety (Lawsuit) thumbnail
Figure Robotics FIRED Their Head of Safety (Lawsuit)
Matthew Berman
We Finally Figured Out How AI Actually Works… (not what we thought!) thumbnail
We Finally Figured Out How AI Actually Works… (not what we thought!)
Matthew Berman
GitHub CEO predicts the future of programming...(Full Interview) thumbnail
GitHub CEO predicts the future of programming...(Full Interview)
Matthew Berman

Summarize YouTube Videos and Get Video Transcripts with 1-Click

Download browser extensions on:

Try YouTube Summary with ChatGPT & Claude or YouTube Transcript Generator

Apps & Extensions

  • Chrome Extension
  • Safari Extension
  • Edge Add-ons
  • Firefox Add-ons
  • iOS App
  • Android App

Key Features

  • YouTube Video Summarizer
  • Web & PDF Summarizer
  • Web & PDF Highlighter
  • Chat with PDF
  • Ask AI Clone
  • Audio Transcriber
  • Glasp Reader
  • Kindle Highlight Export
  • Idea Hatch

Integrations

  • Obsidian Plugin
  • Notion Integration
  • Pocket Integration
  • Instapaper Integration
  • Medium Integration
  • Readwise Integration
  • Snipd Integration
  • Hypothesis Integration

More Features

  • APIs
  • MCP Connector
  • Blog & Post
  • Embed Links
  • Image Highlight
  • Personality Test
  • Quote Shots

Company

  • About us
  • Blog
  • Community
  • FAQs
  • Job Board
  • Newsletter
  • Pricing
Terms

•

Privacy

•

Guidelines

© 2026 Glasp Inc. All rights reserved.