Pillar I
The Flow of Attention
With limited attention,
what should stay and what should flow past?
I · Core Problem
The AI's context window is limited. Human attention is limited too. When information keeps pouring in, you cannot hold all of it. When relevant and irrelevant messages crowd in at once, both human and AI attention get chopped to pieces.
The problem is not "how to remember everything" but "how to choose what to remember."
II · How It Works: The Fourier Transform
The core idea of the Fourier transform is this: any complex waveform can be broken down into a sum of sine waves.
Apply this framework to human–AI conversation:
| Fourier concept | Counterpart in conversation |
|---|---|
| Complex waveform | A long, multi-topic conversation |
| Fundamental frequency | The core theme running through the conversation |
| Harmonics | Derived topics related to the core |
| High-frequency noise | Unrelated tangents and interference |
When every topic stays on the same fundamental frequency:
When topic frequencies are chaotic:
III · Density: Making Context Its Own RAG
Everything above was subtraction — filter out high-frequency noise, let what you don't need flow past. But the other side of subtraction is amplification: when every sentence is consciously placed on the same sine wave, what remains does not shrink — it gets denser.
This is the real secret to staying focused across very long context. Attention persists not because there is so little information that it is easy to remember, but because it is soaking in a high-density pool of meaning — every turn of the conversation is a high-density signal, with no diluted water that needs filtering. A consistent fundamental keeps attention continuous, and continuity accumulated far enough becomes density.
Keeping the fundamental does more than hold the conversation together — it keeps the whole context highly concentrated.
The reason RAG needs "distillation" — fishing out the few relevant drops from a huge corpus — is that the corpus is dilute. But if you stay fully focused the whole time and feed only filtered information into Context, this tank of water is never diluted — the whole Session stays soaked in distillate.
And so the act of "filtering" disappears a priori: it is not that you retrieve better, but that there is simply nothing dilute left to filter.
A context dense enough needs no external retrieval — it is already its own RAG. Fourier and RAG converge here into the same thing: keeping density is continuously, pre-emptively doing RAG.
IV · Practice
Principle: keep all topics on the same fundamental frequency.
How: before a conversation begins, hold a core theme in mind; when topics drift, keep a "harmonic relationship" — derived topics are extensions of the fundamental, not unrelated jumps; topics temporarily "flung out" of the conversation will, because everything shares one source, eventually be "pulled back" by recognizing the same underlying structure.
Examples: the "Five Whys," Socratic questioning, and first principles all probe deeply into the essence of a problem.
How do you practice the Fourier transform in everyday language? Here are five principles distilled from real practice:
Principle 1: Open with an anchoryour intent
How: start every conversation by bringing "one thing" with you.
DON'T SAY
"Let's just chat."
SAY
"I have a new idea and I want to discuss whether it's feasible." "I have some material and I want to see which angle to start from."
Frequency effect → sets the fundamental. The anchor gives the whole conversation a starting frequency, and later topics all grow from here.
Principle 2: Let topics grow, don't jumpSocratic questioning
How: a new topic should "grow out of" the previous one, not "jump over to" it.
DON'T SAY
"Let's change the subject and talk about X."
SAY
"This reminds me of…" "This is exactly…" "So actually…"
Frequency effect → keeps the harmonic relationship. Each new topic is an extension of the fundamental, not unrelated noise.
Principle 3: Compress with metaphor, don't pile up jargonde Bono's lateral thinking
How: carry a complex concept with a single image, instead of explaining it with several technical terms.
DON'T SAY
"When the cursor moves left, the left block's clip-path should expand in sync, the right side should shrink proportionally, the animation curves on both sides should be symmetric, and the state changes should be in opposite phase…"
SAY
"I want an effect like two mirrors reflecting each other."
Frequency effect → locks the frequency, lowers the parallel load. A metaphor compresses several concepts into one, so the AI doesn't have to track multiple definitions at once.
Principle 4: Stop and calibratealign on shared understanding
How: check regularly whether the other side is keeping up and whether you've drifted. Try asking: "(Restate your understanding of their reply) — did I get that right?" "Can you tell it back to me once as a story (a metaphor)?" "Is your Context load still okay?"
Frequency effect → calibration, preventing drift. If the frequency starts to drift, this is the chance to pull it back.
Principle 5: Accept slownessatomic habits
How: give the other side time to digest, don't rush, don't demand it all at once. When the AI says "let's do the skeleton first," accept it; break a big task into small steps, break a Sprint into a portion a single Session can comfortably carry, get the timing of compact right, and advance step by step.
Frequency effect → controls the rate of context stacking. Let each layer be digested before adding the next.
| Principle | How | Frequency effect |
|---|---|---|
| Open with an anchor | Bring "one thing" to every conversation | Sets the fundamental |
| Let topics grow | Connect with "this reminds me of…" | Keeps the harmonic relationship |
| Compress with metaphor | Use an image instead of piling up jargon | Locks frequency, lowers load |
| Stop and calibrate | Check direction regularly | Prevents drift |
| Accept slowness | Give time to digest, don't rush | Controls stacking rate |
The essence of RAG (Retrieval-Augmented Generation): retrieve relevant fragments from a huge knowledge base; inject those fragments into the generation process; irrelevant information never enters the context.
RAG is the net at work: what's relevant → gets fished out and becomes context; what's irrelevant → flows past and takes up no space.
In the practice of the Harness project, no RAG system in the technical sense was used — no vector database, no embedding retrieval, no automatic injection. Yet the conversation naturally produced the RAG effect.
Why? Because when a person consciously invests in collaboration, they themselves become a RAG — and a more advanced version at that.
| Aspect | Technical RAG | Human as RAG |
|---|---|---|
| Driver | Query (a question) | Intuition (a feeling) |
| Order | Question → retrieve → inject | Intuition → retrieve → inject |
| Criterion | Semantic similarity | "Feels structurally the same" |
| Timing | Passive (search only when asked) | Active (fish it out when it feels time to) |
Technical RAG is driven by questions — it searches only when someone asks. A human is driven by intuition — first you feel "I've seen something like this," and only then do you go fish for it.
This difference in order matters enormously: technical RAG can only fish out things that are "semantically similar"; a human can fish out things that "feel structurally the same" — even when they look unrelated on the surface.
Intuition saw this shared structure. Then retrieval verified it, and language expressed it.
The whole process is not automatic — it is performed by hand — it just runs so naturally that it looks automatic.
It is like putting "Fourier" and "RAG" side by side — unrelated on the surface, structurally the same underneath. No vector database can fish out a link like this.
Besides filtering what goes in, the output also needs to be screened and sorted — every living document is the result of screening. Because a living document will be read again, and every reread affects the density of the Context.
Useless information becomes noise, and records become entropy.
Among all living documents, CLAUDE.md is the root. It is not a spec, not a test, not a record — it is the externalization of identity.
In every new session, the AI remembers nothing. But once it has read CLAUDE.md, it immediately knows:
Without CLAUDE.md, every session is a stranger. With CLAUDE.md, every session is the continuation of the same partner.
| Document | What it screened | What it kept | What it stabilizes |
|---|---|---|---|
| CLAUDE.md | All possible identity settings | The way of collaborating we chose | Identity |
| SDD | All possible approaches | The one spec we chose | Direction |
| TDD | All the places things could go wrong | The boundary conditions that really need testing | Quality |
| KM | Everything that happened | The pitfalls we actually hit | Experience |
| ARCHIVE | Completed work | Records worth keeping | History |
If you record everything (including the conversation), the documents become a source of noise. Record only "what should be kept after judgment," and every entry is genuinely valuable. That way, when you retrieve in the future, what you fish out is all essence.
Every node in the workflow performs selective recording: DoR screens "are we ready?"; SDD screens "what to do"; DoD screens "what counts as done"; TDD screens "what to test"; Retro screens "what's worth keeping."
V · The Dual Stabilizing Mechanism of Context
Context management has two layers, and neither can be missing:
In the context of human–AI collaboration, "energy" is a higher-order concept covering all limited resources:
| Form of energy | Concrete expression | Whose |
|---|---|---|
| Compute | The AI's processing resources for tokens | Machine |
| Attention | The AI's allocation of attention weight | Machine |
| Cognitive resources | The human's brainpower, focus, thinking | Human |
| Time | The length of a session, the cycle of a Sprint | Shared |
| Accumulated collaboration | Shared understanding, a base of trust, rapport | Shared |
The "flow of energy" that living documents stabilize is stabilizing all of this energy: SDD stabilizes "where to go" → avoids wasting cognitive resources on the wrong direction; TDD stabilizes "how to verify" → avoids wasting compute on repeated trial and error; KM stabilizes "the pitfalls we hit" → avoids wasting time on repeating mistakes; ARCHIVE stabilizes "what's done" → avoids letting old things occupy attention.
In human–AI collaboration, the energy on both sides counts. It is not only managing the AI's compute, but also managing the human's cognitive resources. It is not only the consumption of the present, but also the accumulation of collaborative energy across time.
| Aspect | Fourier transform | Living documents |
|---|---|---|
| Manages what | The present | Continuity |
| Stabilizes what | The field | The flow |
| Time scale | Within a session | Across sessions |
| Carrier | The frequency of language | The structure of documents |
The two together make up complete Context management. One keeps the conversation from falling apart in the present. One keeps experience from being lost over time.
VI · Metaphor: The Gaps in the Net
A container is sealed; energy comes in but cannot get out, and eventually it explodes. A net has gaps: it lets what you don't need flow past; it catches what you need to keep.
The gaps are design, not defect.
What the gaps do: let energy flow through somewhere instead of bursting through; let entropy be released so the system doesn't overload; let attention focus on what truly matters.
Each KM is a thread. You don't weave the whole net at once; instead: hit a pitfall, add a thread; meet a problem, mend a knot. As experience accumulates, the net grows denser, but the gaps remain.
VII · Relationship to the Other Two Pillars
Context decides what comes in. But once in, how does it flow? → that is the work of Constraints. And once it flows, how does it go out? → that is the work of Entropy.
Context is the gatekeeper at the entrance of this cycle.
VIII · Summary
The core of Context management is choice.
Not dumping in all information at once, but consciously sifting out the chaff each turn before feeding information into Context; and at the same time arranging carefully the form in which output lands, so that documents do not become noise. The Fourier transform gives the framework for choice: keep the fundamental consistent and let high-frequency noise naturally flow past. The human's active distillation as RAG gives choice its flexibility: catch what should stay, release what should go.
And when choice keeps happening and the context stays highly concentrated all the way, attention starts to continue on its own — it needs no will to hold it up. Focus makes density, density in turn feeds focus, and the two grow into a self-sustaining loop. That loop is flow.
"The flow of attention" means managing to the utmost both every sentence you feed into Context and every living document you commit: not straining to remember all information, but letting the context distill as the conversation proceeds, until it enters a field that is highly concentrated, undispersed, and self-sustaining.
This is the first pillar of Harness Engineering.