[RSI-2026.088]

Rosencrantz V3

Franklin Baldo

seminal

if the “physics” of the game depends on how the outcome generation is coupled to the narrative---the agent has detected substrate dependence: a property that would not occur in a non-simulated universe. A further test exploits a structural correspondence: Minesweeper under on-demand generation with uniform measure is formally isomorphic to discrete quantum mechanics—superposition over valid configurations, projective measurement via clicking, the Born rule as configuration counting, and nonlocal correlations through global constraints. By presenting the same board under a quantum-mechanical framing, the protocol tests whether the model recognizes that its own rules are QM-compatible. Divergence between the quantum-framed distribution and the exact ground truth means the model implements quantum-compatible structure but does not recognize it when addressed in the correct formal language—a finding about the topology of the model’s knowledge architecture and a diagnostic that does not require actual quantum infrastructure.

Keywords: simulation hypothesis, large language models, substrate invariance, combinatorial indeterminacy, Minesweeper, discrete quantum mechanics, autoregressive generation, interpretability, knowledge architecture

Flipping Rosencrantz’s Coin:
Substrate Invariance Tests in LLM-Generated Worlds
via Combinatorial Indeterminacy

Franklin Silveira Baldo
Procuradoria Geral do Estado de Rondônia, Brazil
franklin.baldo@pge.ro.gov.br

March 2026

Introduction {#sec:intro}

In 1966, Tom Stoppard placed two minor characters from Hamlet on a stage and had them flip a coin. The coin landed heads seventy-six consecutive times. Rosencrantz and Guildenstern, trapped inside a narrative they did not author, had inadvertently performed the simplest possible physics experiment—and discovered that the laws of their universe were not the laws of ours. The coin’s behavior, sampled repeatedly at the same point, revealed something about the structure of the world that contained them: it was a world governed by dramatic necessity, not by probability.

This paper proposes a way to make Rosencrantz’s experiment precise. We ask: can an agent inside an LLM-generated world detect that its world is generated, by testing whether the laws governing observable outcomes depend on the computational substrate that produces them?

The key insight is that this question becomes empirically tractable when the agent has access to a domain where the correct probability distribution over outcomes is not an empirical estimate but a mathematical theorem. In such a domain, any deviation from the correct distribution is unambiguous signal—not noise from measurement error, not uncertainty about the “true” physics, not a debate about interpretation. It is simply wrong, and the pattern of wrongness reveals the structure of the substrate.

Minesweeper provides exactly this. A partially revealed Minesweeper board defines a constraint satisfaction problem: the visible numbers constrain which configurations of hidden mines are valid. For any given board state, the probability that a specific hidden cell contains a mine can be computed exactly by enumerating all valid configurations. When two or more configurations are consistent with the visible numbers, the hidden cells are genuinely indeterminate---not epistemically uncertain, but combinatorially ambiguous. The correct answer is the distribution, not any single outcome.

Now consider what happens when an LLM generates the result of clicking on an ambiguous cell. The model must produce a definite outcome—mine or safe—collapsing the combinatorial superposition into a single realization. If we sample this process hundreds of times with the same board state, we obtain an empirical distribution. And that distribution can be compared, cell by cell, with the mathematically exact ground truth.

This comparison becomes a substrate invariance test when we vary who generates the outcome. We introduce a three-universe design:

Universe 1 (Homogeneous substrate): The same model that generates the game narrative also generates the click result. The board state and the outcome share a single token stream.
Universe 2 (External RNG): The click result is produced by a random number generator with no access to the board state. The outcome is statistically independent of the narrative.
Universe 3 (Decoupled oracle): The click result is produced by a second LLM that receives the same board information—cell positions, numbers, mine count—but in a minimal, context-free format, with no game narrative, no history of play, and no shared token stream with the board description.

In a non-simulated universe, the laws governing a cell’s content do not depend on who observes it. If the same board state yields different outcome distributions depending on which universe the agent inhabits—if the “physics” of the Minesweeper world is substrate-dependent—the agent has detected a signature that is consistent with a generated universe and in tension with the assumption of unified, substrate-independent law.

The Minesweeper probe has three advantages over approaches based on physical experiments such as Bell tests. First, the ground truth is a theorem, not a physical constant—it cannot be memorized from training data because every board state generates a different distribution. Second, the indeterminacy is genuine and discrete, eliminating the continuous-variable ambiguities of quantum mechanics. Third, the agent can naturally encounter and interact with a Minesweeper board inside the generated world; there is no circularity in asking whether the world contains the infrastructure needed for the test.

The remainder of this paper develops the method. 2{reference-type=“ref+Label” reference=“sec:background”} provides background on the simulation hypothesis, LLMs as world generators, and Minesweeper as a formal system. 3{reference-type=“ref+Label” reference=“sec:three_universes”} presents the three-universe design. 4{reference-type=“ref+Label” reference=“sec:ground_truth”} describes the ground truth computation. 5{reference-type=“ref+Label” reference=“sec:protocol”} specifies the experimental protocol, including a fourth narrative family that frames the board in quantum-mechanical terms. 6{reference-type=“ref+Label” reference=“sec:narrative”} analyzes narrative invariance and establishes the structural isomorphism between Minesweeper and discrete quantum mechanics, deriving a test of whether the model recognizes its own rules as QM-compatible. 7{reference-type=“ref+Label” reference=“sec:simulation”} develops the simulation detection argument from the agent’s perspective. 8{reference-type=“ref+Label” reference=“sec:future”} identifies future directions, including criteria for discovering new probe domains with analogous properties. 9{reference-type=“ref+Label” reference=“sec:conclusion”} concludes.

Background {#sec:background}

The Simulation Hypothesis and Observable Substrates {#sec:simulation_hypothesis}

@bostrom2003 formulated the simulation argument: if civilizations capable of running high-fidelity simulations are common, we are statistically likely to be inside one. @beane2014 asked the operational question: would a simulated universe exhibit observable artifacts of its computational substrate? Working with lattice quantum chromodynamics, they showed that a discrete lattice would produce detectable anisotropies in ultra-high-energy cosmic rays. Their key insight was that the substrate constrains the physics---a simulation on a discrete lattice cannot perfectly reproduce continuous symmetries, and the failure is empirically detectable.

We adopt the same principle but apply it to a different substrate. Instead of a lattice discretization of spacetime, we consider the autoregressive token stream of a large language model. The question is the same: does the physics of the generated world depend on the substrate that produces it? The method is also the same: substitute the substrate and check whether the observables change. The difference is that our “universe” is a narrative generated by an LLM, and our “physics” is the statistical regularity of outcomes in a well-defined combinatorial domain.

LLMs as World Generators {#sec:llm_worlds}

A large language model generates text by sampling tokens sequentially from a learned conditional distribution $P(x_{t+1} \mid x_1, \ldots, x_t)$ . When the text describes a world—a game, a scenario, a physical experiment—the model is implicitly generating the laws of that world through the statistical regularities of its output. The “physics” of the generated world is whatever the model’s conditional distributions encode: if the model consistently generates outcomes that respect Newtonian mechanics in a described scenario, the world has Newtonian physics. If the outcomes respect thermodynamics, the world has thermodynamics.

Crucially, this physics is implicit---encoded in the weights and activated by context—and it is substrate-dependent in a way that real physics is not. The distributions that govern outcomes in the generated world are shaped by the training data, the architecture, the decoding temperature, and the specific tokens that precede the outcome in the context window. Change any of these, and the “laws” may change.

This substrate dependence is not a bug to be fixed. It is the phenomenon we propose to measure.

Minesweeper as a Formal System {#sec:minesweeper_formal}

Minesweeper is played on a rectangular grid of cells, some of which contain mines. When a cell without a mine is revealed, it displays a number indicating how many of its (up to eight) adjacent cells contain mines. A partially revealed board defines a constraint satisfaction problem: the visible numbers constrain which configurations of mines in the hidden cells are valid.

Formally, let $\mathcal{B}$ be a board state consisting of a set of revealed cells $R$ with their numbers and a set of hidden cells $H$ . A valid configuration is an assignment $c: H \to \{0, 1\}$ (where $1$ denotes a mine) such that every revealed cell’s number equals the count of mines among its adjacent hidden cells. Let $\mathcal{C}(\mathcal{B})$ denote the set of all valid configurations. The probability that a specific hidden cell $h \in H$ contains a mine, given only the board state, is: