AI

From Highlights to Deep Research: Building an AI-Powered Research Workflow

Google, OpenAI, and Perplexity all shipped "deep research" features in the past year. But these tools are only as good as what you feed them. The real competitive advantage isn't which AI you use; it's the quality of context you've built up over months of deliberate reading.

14 min read
Key Takeaways
    • Deep research tools need quality input: Google Gemini Deep Research, OpenAI's deep research, and Perplexity Pro can synthesize dozens of sources, but their output quality depends directly on the specificity and richness of your prompts and context.
  • Highlights are research primitives: Every passage you highlight while reading becomes a retrievable, searchable, AI-queryable unit of knowledge that compounds over time.
  • The five-step workflow (Capture, Organize, Analyze, Synthesize, Output) transforms scattered reading into structured research, with AI accelerating each stage.
  • Social research multiplies insight: Accessing what others highlight on the same sources reveals perspectives and patterns you would miss alone, a core principle of information foraging theory.
  • Traditional literature reviews take weeks; AI-augmented workflows take days: But only if your captured material is well-organized and richly annotated.
  • The best researchers combine breadth and depth: Daily lightweight highlighting builds breadth; periodic deep research sessions convert that breadth into depth.

The Rise of AI Deep Research Tools

In February 2025, Google launched Deep Research inside Gemini Advanced. The feature creates multi-step research plans, browses dozens of web sources autonomously, and produces comprehensive reports with citations. Within weeks, OpenAI followed with its own deep research capability in ChatGPT Pro, and Perplexity had already been refining its research-grade search for months.

These tools represent a genuine shift. A traditional literature review for an academic paper might require 40 to 80 hours of reading, note-taking, and synthesis. Google's Deep Research can produce a first-pass literature summary in under ten minutes. OpenAI's version can browse over 100 sources in a single session and produce reports that, according to early benchmarks, score in the top percentiles on research-style evaluations.

But here's what the launch announcements don't emphasize: these tools perform radically differently depending on the quality of context you provide.

A vague prompt like "research the impact of AI on education" yields a generic overview. A specific prompt informed by months of reading, with precise sub-questions, named researchers, and conceptual frameworks, produces something genuinely useful.

The gap between a mediocre deep research output and an excellent one isn't the model. It's the researcher's accumulated context.


Why Output Quality Depends on Input Quality

Pirolli and Card's information foraging theory (1999) established a principle that remains relevant in the AI era: the value of information retrieval depends on "information scent," the cues that guide a searcher toward high-value sources. Better scent trails lead to better foraging outcomes.

AI deep research tools follow the same logic. When you prompt Gemini Deep Research or ChatGPT's research mode, the model constructs a search plan based on your query. The richer your query, the better the plan. The more specific your sub-questions, the more targeted the sources it retrieves.

This creates an asymmetry. Researchers who read widely and annotate carefully over time can construct prompts with:

  • Specific terminology the model can use to find precise sources
  • Named researchers and papers that anchor the search
  • Conceptual frameworks that structure the analysis
  • Contradictions and open questions that push the model beyond surface-level summaries

Researchers who haven't done this groundwork get the equivalent of a well-written Wikipedia summary. Competent, but not competitive.

Andrej Karpathy described a version of this dynamic in his commentary on research workflows: the people who get the most out of AI tools are those who already know enough to evaluate, redirect, and refine the output. The tool accelerates; it doesn't replace the underlying knowledge.

This is why daily reading habits matter more in the AI era, not less. The highlights, notes, and annotations you accumulate become the raw material for prompts that produce genuinely differentiated research.


The Highlight-First Research Method

Most people think of research as a distinct activity: sit down, define a question, search for sources, read, take notes, write. This model assumes research is a project with a start date.

The highlight-first method inverts this. Research becomes a continuous process where daily reading habits feed future research projects. The workflow has five stages:

  1. Capture: Highlight and annotate as you read across the web, YouTube, PDFs, and books
  2. Organize: Tag, categorize, and connect highlights into topics and themes
  3. Analyze: Use AI to query your highlights, detect patterns, and surface connections
  4. Synthesize: Combine insights across sources into coherent arguments or frameworks
  5. Output: Transform synthesized insights into articles, reports, presentations, or further research questions

The key insight: stages 1 and 2 happen continuously during everyday reading. You don't need to be "doing research" to build your research corpus. Every article you read, every YouTube video you watch, every Kindle passage you mark is a potential input for future deep research.

This aligns with how productive researchers actually work. Studies of prolific academics show they maintain ongoing reading and annotation practices that feed multiple projects simultaneously. The "aha" moment in research often comes from connecting an idea encountered in casual reading with a question from a formal research project.


Step 1 - Capture Everything That Matters

The first step is creating a low-friction capture system. If highlighting requires more than two seconds of effort, you won't do it consistently. The best capture systems work across every surface where you encounter information.

Web articles and blogs. Glasp's web highlighter lets you highlight any passage on any webpage with a single click. The highlight is saved to your Glasp profile, searchable, and visible to others who read the same page. This is the primary capture mechanism for most researchers.

YouTube videos. YouTube Summary generates full transcripts and lets you highlight specific segments. For researchers working with video content (conference talks, lectures, interviews), this converts ephemeral audio into permanent, searchable text.

Books and Kindle. Kindle highlights can be imported directly into Glasp, combining your book annotations with your web highlights in a single searchable corpus. This solves the longstanding problem of Kindle notes being trapped in Amazon's ecosystem.

PDFs and academic papers. PDF annotation tools can feed into the same pipeline. The goal is a single repository where all your highlighted material lives, regardless of source format.

What to capture. Not everything deserves a highlight. Focus on:

  • Claims backed by specific data or studies
  • Definitions of concepts you might reference later
  • Counterarguments to positions you hold
  • Methodological descriptions you could adapt
  • Quotes that articulate an idea better than you could paraphrase

The discipline of deciding what to highlight is itself a form of active reading. Research on annotation (see our guide on how to annotate) consistently shows that selective highlighting improves comprehension and retention compared to passive reading.


Step 2 - Organize With Intent

Raw highlights without organization are a pile, not a system. The second step transforms captured material into something queryable and navigable.

Tags and topics. Assign tags to highlights as you create them. Effective tagging uses a mix of:

  • Topic tags: the subject matter (e.g., "information foraging," "systematic reviews," "LLM evaluation")
  • Project tags: the research project or article this might feed (e.g., "Q2-report," "thesis-chapter-3")
  • Type tags: the kind of insight (e.g., "methodology," "data-point," "counterargument," "definition")

Atomic notes. When a highlight sparks a thought, write a brief note alongside it. These notes are more valuable than the highlights themselves because they capture your interpretation, not just the source material. Even a single sentence of commentary ("This contradicts Smith's 2023 finding on retrieval practice") creates a connection point for future synthesis.

Collections and groupings. Group related highlights into collections organized by research question or theme. This creates pre-assembled source bundles you can feed directly into AI analysis tools.

The organizational investment pays compound returns. A well-tagged corpus of 500 highlights is dramatically more useful than 5,000 untagged ones. And the act of organizing forces you to re-engage with the material, which strengthens memory consolidation.

For deeper frameworks on organizing knowledge, see our guides on personal knowledge management and building a second brain.


Step 3 - Analyze With AI

This is where AI transforms the workflow. Once you have an organized corpus of highlights and notes, you can use AI to extract insights that would take hours to surface manually.

Querying your highlights. Glasp's AI chat lets you ask questions directly against your saved highlights. Instead of prompting an AI with zero context, you're prompting it with months or years of curated material. Example queries:

  • "What do my highlights say about the effectiveness of spaced repetition for adult learners?"
  • "Find contradictions across my sources on the impact of social media on attention span."
  • "Which of my highlighted studies use randomized controlled trials?"

Pattern detection. AI excels at finding patterns across large corpora that humans miss. Feed your highlights on a topic into an LLM and ask it to identify recurring themes, outlier positions, or gaps in the literature. This is the functional equivalent of the "coding" phase in qualitative research, but completed in minutes instead of days.

Citation mapping. When you've highlighted passages from multiple papers that cite each other, AI can help reconstruct the citation network and identify foundational papers you may have missed. This is particularly valuable for researchers entering a new field.

Critical evaluation. Ask the AI to evaluate the strength of evidence across your highlights. Which claims are supported by large-scale RCTs? Which rely on self-reported surveys? Which are purely theoretical? This kind of evidence stratification is tedious to do manually but straightforward for an LLM working with structured input.

The critical point: the AI is working with your curated, pre-filtered material, not the entire internet. This produces more focused, relevant analysis than a general-purpose deep research query.


Step 4 - Synthesize Across Sources

Synthesis is where research becomes original contribution. It's the process of combining insights from multiple sources into a new argument, framework, or perspective.

Cross-source connections. The most valuable research insights come from connecting ideas across domains. A finding from cognitive psychology combined with a case study from organizational behavior and a methodology from computer science produces something none of those fields would generate alone. Your cross-domain highlights make this kind of synthesis possible.

Framework construction. Use your analyzed highlights to build conceptual frameworks. For example, if your highlights on AI in education consistently cluster around three themes (cognitive offloading, personalized tutoring, and assessment transformation), that clustering itself is a framework worth articulating.

Gap identification. What questions do your sources raise but not answer? Where do different researchers disagree, and what evidence would resolve the disagreement? These gaps become your original research questions or the most valuable sections of a literature review.

Narrative threading. Research outputs need a narrative arc. Synthesis means deciding which insights to foreground, which to use as supporting evidence, and which to set aside. AI can suggest narrative structures, but the editorial judgment of which story to tell remains a human skill.

This stage benefits enormously from having highlights across many sources. If you've only read five articles on a topic, your synthesis options are limited. If you've highlighted 50 articles over six months of ongoing reading, the combinatorial possibilities are rich. This is the compound return on consistent highlighting.


Step 5 - Produce Research Outputs

The final step converts synthesized insights into deliverables. The format varies by context, but the workflow is consistent.

Blog posts and articles. For public-facing content, the synthesis from Step 4 provides the structure. Your highlights supply evidence, quotes, and citations. AI can help with drafting, but the argument architecture comes from your accumulated reading. You can export your highlights in multiple formats (Markdown, CSV, plain text) to feed directly into your writing tool of choice.

Academic papers and literature reviews. The organized, analyzed highlight corpus maps directly onto the structure of a literature review: themes, sub-themes, evidence evaluation, and gap identification. What traditionally takes weeks of note-card shuffling can be accelerated significantly when your highlights are already tagged, analyzed, and synthesized.

Reports and presentations. Business research often requires translating academic findings into actionable recommendations. The highlight-first workflow supports this by maintaining the connection between specific evidence and the conclusions drawn from it. Every claim in your report can link back to a highlighted source.

Further research questions. Sometimes the most valuable output isn't a finished document but a refined set of questions for the next round of investigation. The gaps identified in Step 4 become the starting prompts for AI deep research tools, creating a virtuous cycle.


Traditional vs AI-Powered Research Workflows

Understanding the difference between traditional and AI-augmented research clarifies where the real time savings occur.

StageTraditional WorkflowAI-Powered Workflow
Source discoveryDatabase searches, citation chasing, manual browsing (days)AI deep research scans 50+ sources in minutes, combined with months of organic discovery via highlights
Reading and annotationPrint, read, handwrite notes (weeks)Highlight directly on web/PDF/Kindle, notes attached to source (ongoing, low effort)
OrganizationPhysical note cards, spreadsheets, or reference managers (hours)Tags, topics, and collections with search and AI querying (minutes per session)
AnalysisManual coding, thematic analysis, evidence tables (days to weeks)AI pattern detection across highlight corpus (minutes to hours)
SynthesisOutlining, drafting, reorganizing (days)AI-assisted framework construction with human editorial control (hours)
WritingDrafting from notes, checking sources (days to weeks)AI-assisted drafting from exported highlights and synthesis (hours to days)
Total for a literature review40 to 80 hours10 to 20 hours (plus ongoing highlight accumulation)

The AI-powered workflow doesn't eliminate any stage. It compresses each one. And critically, it front-loads the investment: the daily habit of highlighting means that when you start a research project, you already have material to work with.

The researchers who benefit most are those who've been capturing highlights for months before a research question crystallizes. They don't start from zero. They start from a curated, annotated, searchable corpus that gives their AI tools better context than any single search session could provide.


Social Research and Community Highlights

Research has always been social. Citation networks, peer review, academic conferences, and journal clubs all exist because no individual can read everything relevant to their field. AI doesn't change this; it amplifies it.

Glasp's community adds a social dimension to the highlight-first workflow. When you read an article, you can see what other readers highlighted on the same page. This surfaces:

  • Passages you overlooked. Other readers often highlight sections you skimmed past. Their attention acts as a filter that catches what yours missed.
  • Alternative interpretations. Seeing which passages resonate with different people reveals how the same source supports multiple arguments.
  • Expert curation. Following researchers in your field gives you a continuously updated reading list filtered through informed judgment.
  • Consensus signals. When many readers independently highlight the same passage, it's a strong signal that the passage contains a key insight or controversial claim.

This connects to information foraging theory. Pirolli and Card's framework describes how people follow "information scent" to find relevant material. Community highlights amplify scent trails. Instead of relying solely on your own sense of what's important, you benefit from the collective attention of hundreds of readers.

For researchers, this social layer is particularly valuable during the source discovery phase. Rather than relying exclusively on database searches or AI-generated source lists, you can follow the highlighting activity of domain experts. If a respected researcher in your field highlights an article, it's likely worth reading.

The social dimension also helps with synthesis. Seeing how others interpret the same sources can challenge your assumptions and suggest alternative frameworks. This is the digital equivalent of a seminar discussion, but asynchronous and scalable.

For more on how collective reading enhances understanding, see our article on AI and learning.


Frequently Asked Questions

Do I need to use a specific AI deep research tool?

No. The workflow described here is tool-agnostic. Google Gemini Deep Research, OpenAI's deep research, Perplexity Pro, and other tools all benefit from better input context. The key variable is the quality and organization of your accumulated highlights, not which AI generates the output. That said, each tool has strengths: Gemini integrates well with Google's search index, OpenAI's version handles longer multi-step reasoning, and Perplexity provides real-time source citations.

How many highlights do I need before this workflow becomes useful?

There's no strict minimum, but the value inflects noticeably around 100 to 200 highlights on a given topic. At that scale, AI analysis starts surfacing non-obvious patterns and connections. Below that, manual review is usually sufficient. The important thing is consistency: 10 highlights per week for six months is more useful than 300 highlights in a single weekend binge, because the former reflects genuine ongoing engagement with the literature.

Can this workflow replace formal systematic review methodology?

Not entirely. Formal systematic reviews (Cochrane-style) require pre-registered protocols, exhaustive database searches, and standardized quality assessment. The AI-powered highlight workflow is best suited for narrative reviews, exploratory research, and knowledge synthesis for professional (non-academic) contexts. However, the organized highlight corpus can significantly accelerate the early stages of a systematic review, particularly the screening and data extraction phases.

How do I avoid confirmation bias when AI analyzes my highlights?

This is a real risk. Your highlights already reflect your reading choices and attention biases. AI analysis can amplify those biases. Mitigations include: explicitly asking the AI to identify counterarguments and gaps; following researchers with different perspectives on Glasp; periodically reviewing sources that challenge your existing framework; and using AI deep research tools to search specifically for evidence against your working hypothesis.

What's the difference between this and just using a reference manager like Zotero?

Reference managers organize papers. This workflow organizes insights. Zotero tracks which papers you've read and their metadata. A highlight-based workflow tracks which specific ideas, data points, and arguments you found valuable across all source types (not just academic papers). The two are complementary: Zotero for bibliographic management, highlights for knowledge management.


Conclusion

The launch of AI deep research tools has created a misconception: that anyone can now produce research-grade analysis by typing a question into a chat box. In reality, these tools have raised the floor (basic summaries are accessible to everyone) while also raising the ceiling (well-prepared researchers can produce significantly better outputs).

The differentiator isn't AI access. Everyone has that. It's the accumulated context you bring to the AI: the highlights, annotations, tags, notes, and conceptual frameworks built up through months of deliberate reading.

This is why building a daily highlighting habit matters. Every article you read, every YouTube video you annotate, every Kindle passage you mark is an investment in future research capability. The five-step workflow (Capture, Organize, Analyze, Synthesize, Output) gives that investment a structure and a return.

Start small. Install Glasp's web highlighter and begin highlighting the articles you're already reading. Tag them by topic. After a few weeks, try querying your highlights with AI. You'll be surprised at what emerges from material you thought you'd forgotten.

The researchers who will thrive in the AI era aren't those who adopt the newest tools fastest. They're the ones who've been quietly building the richest, most organized knowledge bases, one highlight at a time.

Start building your knowledge library

Highlight what matters as you read across the web. Save insights from articles, books, and YouTube videos in one place.

Get Started Free