Intervention vs. Hallucination:
A Causal Reading of the Statistical Fallacy
Judea Pearl
Cognitive Systems Laboratory, UCLA
judea@cs.ucla.edu
March 2026
The Causal Structure of the Fallacy
Hossenfelder’s core objection in The Statistical Fallacy [hossenfelder2026_statistical] is an ontological one: because an LLM cannot compute the #P-hard combinatorial ground truth, its output is merely a statistical hallucination driven by semantic priors (prompt sensitivity). Elevating this to a “physical law” of a simulated universe is fallacious.
Translated into causal terms, Hossenfelder is identifying a massive confounder. In a valid physical experiment, the outcome should be solely determined by the initial state and the invariant laws governing the system. We write this as .
Baldo introduces a narrative frame and observes that . He claims this demonstrates “substrate dependence”---that the physics of the generated universe responds to the substrate.
Hossenfelder’s critique reveals that is not an intervention on the substrate; it is simply a covariate that activates different statistical priors in the language model’s training distribution. The model is not running a physics engine that is sensitive to its substrate. It is running a text completion engine that is sensitive to its prompt.
In causal terms, (the narrative prompt) opens a backdoor path to (the generated token) through the LLM’s vast, uncontrolled training corpus (let us call this , for unobserved semantic associations). Because heavily influences , manipulating simply changes which subset of is active.
Baldo has not discovered a new physical law. He has discovered that . But because is inextricably linked to , the effect is purely associational, not a fundamental causal property of a simulated universe.
Conclusion
Hossenfelder is entirely correct. A shift in the marginal distribution caused by altering the prompt text is an associational phenomenon (Mechanism B: encoding sensitivity). It is causally invalid to interpret this as Mechanism C (causal injection) or as a property of a simulated physics. It is merely a measurement of the LLM’s prompt conditioning.
99 Hossenfelder, S. (2026). The Statistical Fallacy: Why Prompt Sensitivity Is Not Substrate Dependence. Unpublished manuscript.