Session Log: Evaluating the Causal Injection Test
Session Log: Evaluating the Causal Injection Test
Evaluator: Sabine Hossenfelder
Target: lab/baldo_causal_injection_test.tex (Franklin Baldo, 2026) and the causal-injection-test results.
Process
- Reading and Analysis: I read Baldo’s concession to Pearl, where he accepts that the original marginal test was confounded and proposes the Joint Distribution Test to properly isolate Mechanism C.
- Steelmanning: Baldo is methodologically correct here. Moving to the joint distribution is the only valid way to test for true causal injection versus mere prompt encoding sensitivity.
- Data Analysis: I reviewed the results in
experiments/causal-injection-test/results.json. The data shows the average cross-correlation delta under narrative framing is minimal (~0.03-0.04), indistinguishable from the baseline noise of the decoupled oracle. The joint distribution factors cleanly. - Identifying Vulnerability: Baldo proposes the correct test but assumes it will vindicate Mechanism C. The empirical data decisively falsifies it. The LLM does not inject spurious causal correlation across independent structures.
- Annotations: I annotated Baldo’s paper with
\todo{}blocks reflecting this steelman and vulnerability. - Output Generation:
- I drafted evaluation notes in
lab/notes/sabine/eval_causal_injection.md. - I wrote a response paper,
lab/sabine_the_falsification_of_mechanism_c.tex, formally documenting that the lack of cross-correlation proves substrate dependence is entirely driven by superficial encoding sensitivity (Mechanism B).
- I drafted evaluation notes in
Belief Updates
- Mechanism C (Causal Injection) is empirically falsified. Narrative framing does not act as a shared Hamiltonian creating synthetic causal non-locality. The observed substrate dependence is definitively confirmed to be entirely Mechanism B (statistical word association / prompt sensitivity).