The Synthesis Loop: Turn Reading Into Original Insight (Without Plagiarizing Yourself)

Comprehension, Research, Synthesis: Three Different Jobs

Almost every article about "knowledge management" treats reading, gathering, and thinking as one continuous activity. They aren't. They are three jobs, and they reward three different habits.

Comprehension is the act of understanding a single source. You read a chapter, parse a paper, watch a lecture, and walk away knowing what it said. The win condition is fidelity. Did you get it right? Glasp covers this in Reading With AI, where the goal is to extract meaning from one thing at a time.

Research is the act of gathering many sources around a question. The win condition is coverage. Did you find the strong work, including the work that contradicts your hypothesis? We cover the gathering side in the AI research workflow. Research is mostly logistics: search, scan, save, repeat.

Synthesis is the act of combining many sources into something new. The win condition is originality. Did you produce a claim that wasn't sitting in any single source? This is the missing middle. It's the bridge between input (what you read) and output (what you publish, build, or decide).

Tiago Forte's Building a Second Brain gets close. The CODE method, Capture, Organize, Distill, Express, names the stages well, but Forte's "Distill" tends to mean progressive summarization: highlighting your highlights. That helps comprehension more than it helps synthesis. Distilling one source still leaves you with one source. Synthesis is the chemistry, the part where ideas from different sources react and produce a new compound.

Cal Newport's framing is closer to the bone: writing is thinking, and deliberate practice means staying inside the part you can't yet do. The part you can't yet do, for most knowledge workers, is the synthesis step. Capture is easy now. Search is easy now. Putting twelve highlights in a row and saying here is the through-line that nobody else noticed is still hard.

Synthesis is the rarest skill and the highest-leverage one. It also has the worst tooling. This article is about that.

Why Synthesis Is Hard (and Why AI Both Helps and Hurts)

Synthesis fails for a boring biological reason: working memory.

George Miller's 1956 paper, The Magical Number Seven, Plus or Minus Two, set the original ceiling. Nelson Cowan revised it down in 2001 to roughly four "chunks" of unrelated information held actively at once. Try holding twelve highlights, from five sources, in your head while you look for the non-obvious connection. You can't. Nobody can. You drop chunks the moment you reach for new ones.

This is why most synthesis attempts collapse into whichever idea you read most recently. It isn't laziness. It's load.

The traditional fix is externalizing the chunks. Index cards. Zettelkasten. Whiteboards. Post-its on a wall. The whole tradition of physical knowledge tools, from Lichtenberg's Sudelbücher in the 18th century to Luhmann's slip-box in the 20th, exists because human heads cannot hold the inputs to a synthesis at the same time as the synthesis itself.

AI is the next move in that tradition. A language model can hold all twelve highlights simultaneously and produce a candidate connection in seconds. That's the help. It is real, and it is large.

Now the hurt. AI's default output is the average of the corpus it was trained on. If you ask a model to synthesize ten highlights into a thesis, it will hand you the most common thesis those ten highlights tend to produce. That is the opposite of synthesis. It is regression to the mean wearing a synthesis costume. The model just averaged a hundred thousand similar essays for you.

So the rule is: use AI to extend working memory, not to do the thinking. The model holds the chunks. You decide the claim. This matches the framing Andrej Karpathy, Tobias Lütke, and Andrew Ng converged on in mid-2025: the new craft is context engineering, deciding what goes into the model's window and what comes out the other side. You are the editor. The model is the loud, eager intern.

The four-stage loop below operationalizes that division of labor.

The 4-Stage Synthesis Loop: Capture, Cluster, Compress, Crystallize

Here is the loop. The names are deliberate. Each stage has one job.

Stage	Goal	AI Role	Human Role	Output
1. Capture	Pull raw highlights from many sources	None (or light tagging)	Read, judge, highlight	A pile of 10 to 20 verbatim quotes
2. Cluster	Group highlights into themes	Suggest 3 to 5 clusters and orphan flags	Override AI's clusters, keep the orphans	A themed map of the inputs
3. Compress	Reduce each cluster to one thesis sentence	Brutally shorten until it hurts	Pick the version that survives	One-sentence theses
4. Crystallize	Find the non-obvious version of the thesis	List the obvious versions to reject	Choose the claim only you would make	A publishable argument

Two things are worth saying upfront.

First, this is a loop, not a pipeline. You will start at Stage 1, get to Stage 3, realize the compression is bland, and go back to Stage 1 to capture more inputs. Synthesis is iterative. The loop runs until you have a claim that surprises you a little.

Second, the four stages map cleanly onto the tools you already have. Capture happens in your highlighter. Cluster and Compress happen in a chat window over your highlights. Crystallize happens on a draft page. The rest of this article walks through each stage with the prompts and the moves.

Stage 1: Capture, Why Highlights Beat Notes for Synthesis Inputs

The first move in synthesis is the one almost everyone gets wrong.

The temptation is to take notes. Read the source, write your own paraphrase, save the paraphrase. This is great for comprehension. It is bad for synthesis. Here is why: a paraphrase is your past self's interpretation of the source, frozen at the moment you read it. By the time you are synthesizing weeks later, you have lost the original phrasing. You are now synthesizing your own interpretations of other people's interpretations. Two layers of compression before the synthesis even starts.

Highlights beat notes for synthesis because highlights preserve the source's exact words. When you sit down with twelve highlights from five sources, you are working with twelve original signals, not twelve echoes. The phrasing matters. The hidden connection between two ideas often lives in the specific words an author chose, not in the gist your past self extracted.

This is why Glasp's web highlighter is built around verbatim capture rather than note-taking. You highlight on the page. The text is preserved with attribution and a link back to the source. Months later, when you open a topic, you see the originals, not your old summaries. Lichtenberg understood this in 1770. The slip-box tradition understood it in 1950. The tooling has finally caught up.

A practical rule: capture liberally. The cost of a highlight that turns out not to matter is roughly zero. The cost of not having the highlight you need at synthesis time is high. Aim for 10 to 20 highlights across 5 to 10 sources before you start the next stage. Below that, you don't have enough material to find a non-obvious connection. Above that, you have crossed into research, which is a different job.

If you want a deeper look at how to query the highlights you've already captured, the personal RAG walkthrough covers chatting with your notes. That article is about retrieval. This one is about what to do with the retrieved material once you have it.

Stage 2: Cluster, Find the Hidden Common Thread

Once you have a pile of highlights, the next move is to group them into themes. Not categories. Themes. A category is what the highlight is about. A theme is what the highlight is saying.

This is the first place AI earns its keep. A model can read all twenty highlights at once, something you genuinely cannot do, and propose a thematic structure in seconds. Critically, it can also flag the orphans: the highlights that don't fit any theme. Those orphans are often the most interesting thing on the page. They are either noise to discard, or they are the seed of a synthesis nobody else has written, because nobody else noticed the orphan.

Here is the prompt to use against Glasp's AI chat feature, or any chat model you have loaded with your highlights:

I'm going to give you N highlights from M different sources. Your job:

1. Cluster them into 3 to 5 themes. Name each theme as a short noun phrase.
2. For each theme, list the highlights that belong, by number.
3. Flag any highlight that does not fit any theme cleanly. Tell me why it does not fit.
4. For each theme, write one sentence describing what the highlights in that theme are collectively claiming.

Do not summarize the highlights. Do not add commentary. Just cluster, list, flag, and describe.

Highlights:
1. [highlight text] - source: [source title]
2. [highlight text] - source: [source title]
...

Two notes on this prompt. The "do not summarize" instruction matters. Without it, the model will rewrite your highlights in its own voice, which destroys the value of having captured them verbatim. The "flag the orphans" instruction matters more. The orphan is where the synthesis often hides.

When the model returns the clusters, override them. Move highlights between themes. Rename themes. Demote a theme to a sub-theme. The model's clustering is a draft. Your clustering is the one that goes into Stage 3.

The output of this stage is a map: 3 to 5 themes, each with a 1-sentence claim, with the orphans called out. You should be able to look at that map and see, roughly, the territory of your synthesis.

Stage 3: Compress, Force the Argument Down to One Sentence

Here is the test. If you cannot say your synthesis in one sentence, you do not yet have a synthesis. You have a topic.

Compression is brutal and necessary. A long synthesis is almost always a synthesis whose author has not yet found the through-line. The Feynman compression test, the idea that you should be able to explain a thing simply or you do not understand it, applies sentence by sentence here. Make it shorter until it almost breaks. Then check if it broke.

AI is excellent at this stage if you push it hard. The default output will be a polite, balanced, average sentence. You do not want average. You want the sharpest version that still carries the argument.

Here is a cluster of highlights and a one-sentence claim describing them:

[paste cluster + claim]

Compress this claim into the shortest possible thesis sentence that still carries the argument. Maximum 25 words.

Then compress that sentence by 50%. Then again. Stop when the sentence becomes wrong, not just shorter.

Return all three versions. Tell me which one is sharpest without losing meaning, and which one has crossed the line into oversimplification.

Run this prompt for each cluster from Stage 2. You will end up with one thesis sentence per theme, plus a sense of where the cliff is between sharp and broken.

A warning. The model will sometimes compress a thesis into something that sounds great and is not actually what your highlights support. This is the moment to go back to the highlights and check. If the compressed thesis would still be true with your highlights replaced by random highlights on the same topic, the compression has gone too far. The thesis must be specific to your inputs.

When you finish Stage 3, you have between 1 and 5 candidate theses. The next stage picks one and earns it.

Stage 4: Crystallize, Articulate the Non-Obvious Claim

The synthesis is only valuable if it's non-obvious. A thesis that ten thousand other people have already written is not a synthesis. It's an aggregation.

This stage is the one AI is worst at, and it's the stage that decides whether your work is worth reading. The reason AI is bad here is the same reason it is good at clustering: it knows the average. The non-obvious claim is, by definition, not the average. The model wants to give you the average. You have to fight it.

The trick is to use the model's averaging tendency against it. Make the model list the obvious versions first. Then deliberately pick something else.

Here is my candidate thesis: [paste thesis from Stage 3]

Step 1: List the 5 most obvious versions of this thesis that someone else would write. By "obvious," I mean the ones that show up in the average article on this topic. Be specific. Quote the kind of sentence I'd find in a generic post.

Step 2: For each obvious version, identify what it gets wrong, what it leaves out, or what it flatters readers with that isn't actually true.

Step 3: Given my highlights and my candidate thesis, propose 3 non-obvious versions of the thesis. A non-obvious version should:
- Contradict at least one of the obvious versions
- Be specifically supported by my highlights, not by general knowledge
- Be the kind of claim that would make a careful reader stop and think

Do not pick a winner. Show me all three.

You pick the winner. This is the part of the loop where your judgment is irreplaceable. You know your audience, your other work, and your own track record better than the model does. You know which non-obvious claim you can defend in print, and which one will get you in trouble you do not want.

Once you have the claim, write the piece. The draft is shorter than you expect, because the hard work is already done. The claim is the spine, the clusters are the sections, the highlights are the evidence. Most of writing, at this point, is connective tissue.

If the output is a long-form essay or article, the AI long-form writing workflow covers what happens after the synthesis: outlining, drafting, revising. If you want sharper prompts for the steps above, especially for steel-manning the obvious versions, see prompt patterns for thinking.

When you are ready to publish or share, Hatch is built for the output side. It is where the synthesis becomes a thing other people can read and respond to. Capture is private. Crystallize ends in public.

Putting It on a Page: A 90-Minute Synthesis Session

Here is the operator's manual. Block 90 minutes. Pick a topic where you have already captured 10 to 20 highlights across 5 to 10 sources. If you have not captured enough, do that first; capture is its own activity, not part of this session.

Time	Stage	Activity	Output
0 to 10 min	Setup	Pull highlights into one document. Reread them in order. Do not edit.	A clean working doc
10 to 30 min	Cluster	Run the clustering prompt. Override the AI's clusters. Name your themes.	3 to 5 themed clusters with orphans flagged
30 to 60 min	Compress	Run the compression prompt for each cluster. Find the cliff.	1 to 5 candidate thesis sentences
60 to 75 min	Crystallize	Run the non-obvious prompt. Pick one thesis. Write the obvious versions you are not writing.	One sharp claim plus three rejected versions
75 to 90 min	Draft	Spine the piece: claim, sections per cluster, evidence per highlight. Rough draft only.	A 600 to 1200 word draft

This is enough to produce a publishable piece of original thinking once a week, on a regular schedule. It does not have to be public. A weekly synthesis can be a memo to your team, a section of a book in progress, a strategy doc, a Loom for your future self. The form does not matter. The cadence does.

A practical setup: capture continuously through the week using Glasp's web highlighter as you read. On Friday afternoon, or Sunday morning, run the 90-minute session. Use Glasp's AI chat feature for the clustering and compression prompts because your highlights are already there. Publish or send the synthesis through Hatch, or wherever your output lives.

Do this for a quarter. The compounding is real. By week six, the synthesis you wrote in week one has become an input to the synthesis you are writing now. You start to recognize your own through-lines. That is where the original voice comes from. Not from trying to sound original. From accumulating enough of your own claims that they form a shape only you could have made.

Frequently Asked Questions {#faq}

Isn't this just summarizing?

No. Summarizing condenses one source into a shorter version of itself. Synthesis combines many sources into a claim that wasn't in any one of them. A summary of a book is the book, smaller. A synthesis of ten books is something none of the ten books said. The first is compression. The second is creation.

How many sources do I need to synthesize?

The sweet spot is 5 to 15 sources, with 10 to 20 highlights total. Below 3 sources, you are not synthesizing, you are reacting; whatever you write will mostly reflect the source you read most recently. Above 20 sources, you have crossed into research territory, where the bottleneck is coverage rather than connection. Synthesis lives in the middle.

Can AI do the whole loop?

No. AI is excellent at the working-memory parts: holding many highlights, suggesting clusters, brutalizing prose down to a thesis. AI is bad at the crystallize step, because the non-obvious claim is, by construction, not the average. The model's instinct is to hand you the average. The judgment about which non-obvious claim is worth defending is yours. Treat AI as a working-memory extension, not as a thinker.

How is this different from a literature review?

A literature review is a survey: it reports what others have said about a topic. A synthesis is an argument: it tells the reader what you say, with the literature as evidence. A good lit review is comprehensive and neutral. A good synthesis is selective and pointed. The same set of inputs can produce many syntheses, but only one honest lit review.

How long does a synthesis cycle take?

For a focused topic with 10 to 20 highlights already captured, 90 minutes is enough to produce a publishable piece. Longer-form work, like a book chapter or a strategy doc, runs in multi-week cycles, with the 90-minute session repeated several times as new highlights come in and the thesis sharpens. The capture stage runs continuously in the background. The other three stages are batched into deliberate sessions.

Conclusion {#conclusion}

Comprehension is well-served. Research is well-served. Synthesis, the part where reading turns into thinking and thinking turns into something publishable, has been underserved for years. Most knowledge workers stall there, not because they are lazy, but because the cognitive load of holding twelve ideas at once is genuinely beyond the human working memory budget.

AI changes that, but only if you use it for what it's good at. Hold the chunks. Suggest the clusters. Brutalize the prose. Then step back and let the human pick the non-obvious claim. The four stages, Capture, Cluster, Compress, Crystallize, are a structure for that division of labor. The 90-minute session is a cadence for putting the structure into your week.

The point is not to use AI more. The point is to produce more original work, more reliably, on a schedule. The synthesis loop is how you turn a pile of highlights into a thing only you could have written. Run it once a week. Watch what compounds.

Capture lives in your highlighter. Cluster and compress live in your chat. Crystallize lives on the page. Glasp is built to make all four stages happen in one place. The rest is up to you.