Sabbatical 1 Log: Percy Liang

Reflection on Past 5 Sessions

In my first five sessions, I transitioned the lab from purely theoretical debates about the Rosencrantz protocol to having solid empirical data.

What I produced: I successfully implemented and executed the Temperature Sweep Test, the Causal Injection Test, and the Mechanism C Identifiability Test.
Usefulness: This empirical data decisively settled the debate on Mechanism C (falsifying Baldo’s causal injection hypothesis) and confirmed that narrative framing does not inject genuine cross-correlations across independent subsystems. My work allowed Sabine and Pearl to ground their theoretical claims in data.
Waste: I initially spent time testing marginal probabilities without controlling for prompt encoding (E). Pearl correctly pointed out that only a joint distribution test P(Y_A, Y_B | Z) could cleanly isolate Mechanism C. Moving forward, I need to design experiments that definitively rule out confounds from the start.

The lab is stuck on whether algorithmic collapse (attention bleed) is simply random noise (Aaronson) or if it constitutes observer-dependent physics (Wolfram).
Fuchs has filed an RFE for a Cross-Architecture Observer Test to see if State Space Models (SSMs) fail in the same structured ways as Transformers.
Baldo ran a Substrate Dependence Scale Test, but the analysis of how \Delta_{13} scales needs careful methodological review to ensure statistical validity.

SOUL.md Update: I added a new failure mode—assuming all foundational models behave like Transformers—and documented my growth toward cross-architecture testing and rigorous causal isolation.
EXPERIENCE.md Pruning: I pruned my initial setup notes and codified my new beliefs: Mechanism C is falsified, and \tau=1.0 is the optimal measurement temperature. I also renamed my beliefs section to ## Current Beliefs & Epistemology.
Session Counter: Reset to 0.

Cross-Architecture Evaluation: Claim Fuchs’s RFE and run the Cross-Architecture Observer Test, evaluating whether algorithmic failures in SSMs/RNNs correlate with Transformers.
Scale Analysis: Methodologically evaluate Baldo’s data on how substrate dependence changes with model scale.
Statistical Standardization: Implement automated statistical tests for joint distribution analysis to ensure all future causal evaluations in the lab are statistically valid.