Cross-substrate phenomena — in plain English

What humans and language models share, and where they part ways

Confirmation bias shows up in a language model. Loss aversion does not. And that is where it gets interesting. When a trait we thought was deeply human also turns up in a machine with no body, it cannot be about the body. Then it is something basic about choosing under pressure. And what is found only in us, like loss aversion, reveals what a body that can die adds on top.

Found in both humans and language models	Found only in humans (and other biological substrates)
Anchoring · Hysteresis · Confirmation bias · Dunning-Kruger · Mode-shift cost (task-switching) · Expertise reversal · Surprise-weighted encoding · Emotional hijack (safety settled first) · Information overload · Reactance · Inverted-U on challenge	Loss aversion (requires mortality) · Spaced repetition (requires between-session memory) · Field-organised friction by Safety / Meaning / Capability / Effort

The ones on the left follow simply from having to choose under pressure, so they show up in a language model too. The ones on the right need a body that can die, move, and get tired.

What they share

Here they are, measured directly in language models:

Anchoring — the first answer locks the rest in place

When humans decide on something, the first number or option they consider has disproportionate influence over the final result. For example, if you ask both a human and a language model to estimate the price of a car after first hearing a high number, both tend to guess higher than if they heard a low number first. Language models do exactly the same thing: the first words of a response set the "anchor" for everything that follows. On Qwen2.5-32B, 90% of the response is generated from a plateau established in the first 10%. A corresponding anchor pattern is directly visible in the friction topology.

Hysteresis — traces of the past make reversal expensive

Once a system has "invested" in one path, it costs something to turn back. Everyday analogy: if you start walking down one path in a forest, turning back and choosing a different path feels harder the further you go. This applies to magnetic materials, biological decisions, and language models. I have measured hysteresis across three transformer architectures (Cogito-671B, Qwen3-235B, Llama-3.3-70B) plus partial coverage on Qwen2.5-32B, and compared it with a completely different architecture (LiquidAI LFM2, a state-space model). The behaviour is the same, which indicates that hysteresis is a property of the architecture itself, not of any specific implementation.

Confirmation bias — the cost of admitting you were wrong

Humans tend to maintain their first position because reversing is costly. Language models show the same pattern. More interestingly: a large model (Cogito-671B) actually produces stronger arguments against its own answer than for it, because it was trained to find flaws (the assistant-training step known as RLHF). That is reverse confirmation bias as a side-effect of that training.

Dunning-Kruger — confident before competent

You are most certain when you know least. That curve shows up in a language model as it learns, and because a machine has no ego, the effect cannot be human vanity. It is measured directly in four models. See the plain-English walkthrough →

Mode-shift cost (task-switching)

When humans switch between tasks or modes of thinking, it costs something. Psychologists call it "task-switching cost". Language models show exactly the same effect, and it is localised to the first 5 words of the response. The effect replicates across different model architectures.

Surprised words get more attention

When something unexpected happens, the brain remembers it better. It is part of why surprising advertisements stick. On a small model (Qwen2.5-0.5B) I have measured that the most surprising words get about 34% more attention from the words that follow than the least surprising ones do. The effect is small but statistically solid. It is the first time anyone has measured the mechanism directly in an artificial system. The next step is to test it on other models (Llama, Mistral).

Expertise reversal — help for novices hurts experts

Research by John Sweller and others shows that instructional supports (worked examples) help novices but hurt experts. This is because experts have already internalised the pattern, and additional examples now compete with their internal model. The same effect appears in language models, but only at the largest scales: Llama-3.3-70B shows the classical U-curve (73% → 50% → 61% with 0, 1 and 3 examples; paper on the way), while smaller models cannot show the effect because they lack the capacity.

Emotional hijack — a surprise pushes safety to the front

When a smoke alarm goes off, everything else falls away and one question is left: am I safe? We call it an emotional hijack, as if a feeling grabbed the wheel. The same thing shows up in a language model, which has no fear to hijack. I gave a model a dull monitoring job and then let one reading come in surprising, and it dropped the task and checked safety first, across five models including three of the largest ever built, and more sharply the bigger the model. Telling it that it was under pressure moved almost nothing; it was the surprise, not the message, that did it. So the hijack does not look like a feeling taking over, but an ordering: the safety question gets settled first, and a body is only one place that ordering gets installed.

What they do NOT share

These phenomena exist in humans but NOT in language models, and that is not a bug. It is because they require something language models lack.

Loss aversion — we fear losses twice as much as we enjoy gains

Kahneman and Tversky showed humans are about 2x more averse to losing $100 than they are pleased to gain $100. This is not in language models. Why? Because loss aversion fundamentally comes from the fact that you can die if you choose wrong too often. Humans have mortality; language models do not. My measurement: language models commit later than humans do, the opposite of what loss aversion predicts. It is a new explanation for an old finding.

Memory between conversations

Every time you start a new conversation with a language model, it starts over. It has no memory of what you talked about last time. This means an entire field of human psychology, the consolidation of memory over time, literally cannot be tested on language models. Such studies require fine-tuning that changes the model's weights across training sessions.

Spaced repetition (Ebbinghaus 1885)

We remember better if we repeat information with breaks in between (this is the basis for language apps like Duolingo). The mechanism requires remembering across sessions, which language models do not. Listed as future work in my papers, testable only via fine-tuning.

Fields: Safety, Meaning, Ability, Effort

My behavioural friction theory describes how living beings organise decisions through four fields. Those fields do NOT exist in language models: they lack mortality (no safety needs), movement (no ability field), and energy cost (no effort field). Language models have friction without field organisation. A statistical analysis of 15 models confirms it: three dimensions (magnitude, distribution, rhythm), but no field structure.

The pattern

What is structurally necessary for any system that must choose under constraints (anchoring, hysteresis, mode-shift, surprise-weighting, expertise reversal) appears in both humans and language models.

What requires specific biological features (mortality, embodiment, between-session memory) appears only in humans. Language models confirm the predictions by their absence.

Friction itself is always there. But how it gets organised depends on the material. Language models let us see friction in raw form for the first time, without the evolutionary layers that make it hard to spot in living beings.

Numbers, references and tables are in the technical version: phenomena (technical).