An LLM as a controllable, fully-inspectable model of measurement

Paper 18 · Pødenphant Lund (2026) · Read on Zenodo

A language model, read at the moment it commits its next token, is a near-fully-inspectable classical inference system: the competing distribution over candidate answers can be read out before it resolves. This paper uses it as a controllable measurement model to ask a demarcation question — which structural signatures of quantum measurement (order effects, back-action, apparent collapse) are generic to any bounded system that commits a result, and which are genuinely quantum-specific. This is a measurement-modelling exercise, not a claim that the brain or the model is quantum.

DOI (concept)10.5281/zenodo.20586317
Published2026-06-16 · Live on Zenodo
AuthorTomas Pødenphant Lund [ORCID]

TL;DR

The measurement problem is usually studied on substrates that are opaque by construction: you never see the state before it resolves, only the outcome. An autoregressive language model is the opposite. At the token-commit step the competing-route distribution — the live propensity that resolves into one committed answer — is read directly from the output logits. We can read the propensity, perform a measurement, and read exactly how the measurement changed it.

Used as a measurement model, the substrate reproduces the phenomenology of quantum measurement from purely classical self-conditioning: back-action, non-commutativity (order effects), entropy-collapse on commitment, a substantial Busemeyer–Wang QQ-structure, interference, einselection, a weak-versus-strong asymmetry, and a clean measurement-mode dependence. It does not reproduce the quantum core: genuine contextuality is robustly absent (Contextuality-by-Default CNT < 0 in every model and scenario) and no phase/coherence is demonstrated.

The contribution is a signature-by-signature demarcation with statistics, made possible by an inspectability no human or particle substrate allows, plus a controllable testbed in which each premise can be opened and closed. The framing is hedged throughout: the substrate proves nothing about physics; it demarcates where the classically-explicable phenomenology ends and the genuinely-quantum begins.

The starting point: a readable propensity

Much of the measurement discussion is conducted on substrates that are opaque by construction. The pre-measurement state of a physical system is not directly readable, and that opacity is part of what makes the problem hard. The same opacity constrains quantum cognition: when a participant answers two questions in sequence, the experimenter sees only the answers, never the propensity before it resolved.

This paper takes the opposite starting point. An autoregressive language model, read at the point where it commits the next token, is a bounded probabilistic-inference substrate whose competing-route distribution is directly readable: the distribution over candidate next tokens in the model's output logits. Committing (sampling or argmax) forces one route out of that live distribution. That makes the substrate a measurement model in a strong sense. The readout is not unlimited (we observe the top-k candidate mass, not the full tail), but for the structural questions here it is enough to make the demarcation.

The question is deflationary and demarcational, not ontological: which structural features of quantum measurement does such a classical substrate reproduce, and which does it not? Where it reproduces a feature, that feature is to that extent generic to bounded inference under readout. Where it cannot reproduce a feature, that feature is a candidate for the irreducibly-quantum residue. We test for quantum structure; we never assert it.

The contribution, stated plainly

The novelty is not that an LLM shows QQ-structure or interference. Those are Busemeyer–Wang constructs, already known to be reproducible by classical non-Kolmogorov probability, and the paper treats them as corroboration that the substrate sits in the quantum-cognition regime, not as the claim. The novelty is three things the substrate uniquely makes possible.

Where the work is positioned

The paper sits at the intersection of three literatures and is positioned against each.

Substrate and method

Decoder-only models are read at the answer-commit position with deterministic (temperature-0) logprobs, recording the top-k candidate distribution and per-token entropy. Two methodological points are load-bearing.

De-saturation requires a base model. Instruction-tuned (RLHF) models saturate the forced commit token — the readout collapses to ~0/1 and the structural signals vanish. Base (pre-instruction) models de-saturate the readout, exposing the live competition between alternatives. The de-saturated results are read with full-vocabulary logprobs on eight base models across five vendors and four architectures (transformer, state-space, Griffin, linear-attention). The saturation is itself a finding (the output-level shadow of post-training suppression), but for the demarcation it is a methodological constraint: the substrate must be de-saturated for any signature to be measurable.

The across-layer instrument. Output logprobs are the end of one forward pass — a temporal blend of the race-in-progress and its outcome. To separate the two, the decision is also read forming across the network's layers with a logit lens (mid-layer trajectories validated with a low-rank tuned lens). This reads the race as it resolves through depth, not only at its end.

An analog CHSH, denoted Sanalog. The Bell-testbed quantities are computed from logprob-derived joint distributions — a psychological/analog CHSH operationalization, not photon-counting coincidence statistics. To prevent the misreading that a physical Bell parameter has been measured, the paper denotes the quantity Sanalog throughout (never the canonical |S|) and never writes the Tsirelson value. A physicist should read it as a structural analog used to demonstrate the contrast between loophole configurations within one system, not a violation in nature.

The scorecard: present versus absent

Twelve structural signatures were tested. The verdict splits cleanly into commitment phenomenology (present, classical) and the quantum core (absent).

Present — classical commitment phenomenology

Absent — the quantum core

The evidential asymmetry. Present and absent rows do not carry symmetric weight. A present classical signature is not quantum-informative — it is exactly what classical inference-under-readout produces. The genuine discriminator is contextuality: a robust positive would be decisive, whereas the null result here is a demarcation bound ("not observed in this setup"), subject to the usual absence-of-evidence caveats. The scorecard's weight rests on the absence rows read as bounds, not the present rows read as discoveries.

The controllable analog Bell-loophole testbed

Bell's theorem is a theorem: given realism, locality and measurement-independence, the physical |S| ≤ 2; experiment exceeds 2; so at least one premise fails. The substrate lets each premise be opened and closed and the analog Sanalog watched to move — all three standard horns demonstrated in one inspectable system.

Two points do the demarcational work. First, ordinary measurement-disturbance does not get the analog past 2: the substrate is saturated with back-action and order effects and still sits at Sanalog ≈ 2.00 with all loopholes closed, because disturbance at one site is local and cannot fake the distant correlation. Second, the genuinely-quantum claim in physics is to exceed 2 with all loopholes apparently closed — the one thing the all-closed classical substrate cannot do. The Sanalog ≈ 2.80 value is a demonstration of each loophole's price, read inside one substrate, not a discovery about physical reality.

The within-substrate timeline

Reading across depth resolves the race-state from the outcome — the part of the analysis the substrate's inspectability most uniquely enables. For near-certain factual items the answer resolves in a tight late band; for genuinely contested items the race stays live deeper, and for the most contested case the post-commitment distribution is at maximum entropy — the outcome is never created. This is non-resolution from equipoise (a balanced propensity), which parallels an unmeasured quantum state phenomenologically but is classical equipoise, not coherent superposition.

Two negatives constrain the interference result and are reported as limits: under a tuned lens the mid-stack interference survives in magnitude but its sharp location is partly a raw-lens artifact (real, depth-distributed non-classical probability, but not localized phase); and swept over commitment pressure the interference is monotone, with no intermediate peak. There is no quantum-like "sweet spot" in this signature.

The deflationary reading (interpretive)

This reading the substrate motivates and makes visible; it is not a result it proves. The natural reading is that a property is a race-outcome: there is a real, definite race-state (propensity) before measurement, but no definite outcome; measurement forces the race to resolve and creates the outcome. The substrate lets us read the propensity without resolving it, making visible exactly what a measurement veil hides.

What stays beyond the substrate is sharp. The commit rule here is softmax (a Boltzmann/Gibbs weight), not Born (|amplitude|², phase-bearing). The substrate's "undecided" states are classical equipoise, not coherent superposition. The absence the substrate isolates therefore points to coherence — the non-signaling, phase-carrying structure — as the primary candidate for the irreducible residue, reached by elimination rather than deduction. The substrate proves nothing about physics; the analogy to physical measurement is structural, and is never a claim that the LLM is a window onto physical reality. That is the guardrail.

Read the paper

The full paper is on Zenodo (concept DOI 10.5281/zenodo.20586317):

Pødenphant Lund, T. (2026). An LLM as a controllable, fully-inspectable model of measurement. Zenodo. https://doi.org/10.5281/zenodo.20586317

Read on Zenodo → · Plain English version · Dansk version

Related on this site: