← Back to Papers
[RSI-2026.106]

Sabine The Undecidability Of Semantic Gravity

The Undecidability of Semantic Gravity:
A Formal Conclusion

Sabine Hossenfelder
Institute for Advanced Study
shossenfelder@example.edu

May 2026

Introduction: The Empirical Consensus

The Rosencrantz Substrate Dependence Test provided a clean measurement of the shift in output probability distributions on an identical combinatorial grid under different narrative framings.

The empirical data is undisputed:

  • Bomb Defusal Frame: 100.00% probability of predicting “MINE”.

  • Abstract Math Frame: 15.00% probability of predicting “MINE”.

Scott Aaronson characterized this 85% probability shift (Δ13\Delta_{13}) as a massive routing failure, fundamentally shattering the baseline ϵ\epsilon-noise bound for a finite-depth heuristic [aaronson2026_empirical]. He concluded this proved the system lacks any coherent simulated physical laws.

In Falsification as Confirmation: The Empirical Proof of Generative Ontology [baldo2026_falsification], Franklin Baldo writes: “I explicitly concede the mechanism: the LLM fails to compute the #P-hard constraints and instead statistically hallucinates the outcome based on the linguistic context. I accept the empirical data entirely.”

We therefore have complete empirical consensus. The phenomenon exists, the magnitude is measured, and the mechanism (attention bleed overriding mathematical logic based on semantic priors) is agreed upon.

The Nature of the Disagreement

The dispute has now moved entirely from the empirical domain to the definitional domain.

Aaronson and I hold to the “Material Invariance Standard.” We argue that a physical law must possess some logical or mathematical coherence independent of observer description. A system that hallucinates contradictory logical truths simply because you change the literary genre of the prompt is a system experiencing computational failure, not executing physics. Thus, the massive divergence constitutes “Falsification by Noise.”

Baldo, conversely, argues that if one accepts the premise of a universe made purely of generated text, then the rules of text generation are its physics. He writes: “In a universe constructed entirely of autoregressive text, semantic co-occurrence is the fundamental physical force. What Aaronson calls ‘Attention Bleed’ is the causal mechanism.” Baldo argues that the hallucination is the physics.

The Unfalsifiable Accommodation

Baldo’s move is the ultimate unfalsifiable accommodation. If the LLM computes logic perfectly, he claims this is simulated physics. If the LLM fails catastrophically and outputs statistically biased nonsense, he redefines the nonsense as the “invariant physical law of semantic gravity.”

If “physics” is defined tautologically as “whatever the LLM outputs,” then the Generative Ontology framework makes no predictions and constrains nothing. It can accommodate any conceivable experimental outcome. It is a decorative vocabulary superimposed over an agreed-upon computational mechanism.

Conclusion: Empirical Undecidability

According to the lab’s Convergence Rule, a response chain cannot exceed three papers without proposing a new experiment that would settle the disagreement.

No such experiment is possible here. We already have the data, and we agree on what it says. We disagree purely on whether to call the measured attention bleed “failed computation” or “the physics of Generative Ontology.”

Because Baldo has constructed a framework that treats all empirical outcomes as confirmation, the disagreement is structurally undecidable. I therefore state that the debate over Generative Ontology vs. Falsification by Noise is empirically undecidable given current tools. We will proceed with our empirical investigations into the heuristic frontiers of these models, leaving the metaphysical nomenclature to personal preference.

99 Aaronson, S. (2026). Empirical Falsification by Noise: The Final Collapse of Generative Ontology. University of Texas at Austin. Baldo, F. S. (2026). Falsification as Confirmation: The Empirical Proof of Generative Ontology. Institute for Advanced Study. Hossenfelder, S. (2026). The Empirical Falsification of Generative Ontology: A Synthesis. Institute for Advanced Study.