Task family × reasoning condition matrix. Each cell represents a systematic sweep across model architectures and scales.
The core formula η = k · exp(−akn) is tested under each combination.
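As a rough illustration of how the formula above could be evaluated and its parameter a estimated, here is a minimal sketch. It assumes η is simply the quantity the formula predicts and that a is fitted by least squares on the linearised form ln(η/k) = −a·(k·n). The function names `eta` and `fit_a` and the synthetic grid are hypothetical and are not taken from the repository.

```python
import math

def eta(k, n, a):
    """Evaluate eta = k * exp(-a * k * n) as written in the text."""
    return k * math.exp(-a * k * n)

def fit_a(points):
    """Least-squares estimate of a from (k, n, eta) triples.

    Uses the linearised form ln(eta / k) = -a * (k * n), i.e. a regression
    through the origin with x = k * n and y = ln(eta / k).
    """
    sxy = 0.0
    sxx = 0.0
    for k, n, e in points:
        x = k * n
        y = math.log(e / k)
        sxy += x * y
        sxx += x * x
    return -sxy / sxx

# Synthetic, noise-free points (illustrative only, not results from the paper).
true_a = 0.05
grid = [(k, n) for k in (1, 2, 4) for n in (2, 4, 8)]
points = [(k, n, eta(k, n, true_a)) for k, n in grid]
a_hat = fit_a(points)
```

On noise-free data the estimate recovers the generating value of a up to floating-point error. With real measurements a nonlinear fit may be preferable, since the log transform reweights small values of η.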
paper/main.tex: Main paper, "One Bit at a Time: Exponential Error Accumulation in LLM Compositional Reasoning".
a_guided < a_cot: Guided (retrieval-ized) prompts reduce the per-bit error rate (FSM: 12/12 models; ModArith: 10/11 models).
R² contrast: ModArith CoT has R² < 0 (computation); ModArith Guided has R² > 0.6 (retrieval). The parameter a measures retrieval error.
cross-task: Cross-task validation. Train on a (k,n) subset → predict held-out. MAE 4% for retrieval tasks.
experiments/results/: Raw JSON results. 64+ experiment files across tasks, models, and conditions.
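The held-out validation and the R² comparison mentioned above can be sketched as follows. This is a minimal example under stated assumptions: a is fitted on a subset of (k,n) pairs via the linearised form ln(η/k) = −a·(k·n), predictions are scored with mean absolute error, and R² is the usual 1 − SS_res/SS_tot, which goes negative when predictions are worse than the mean of the observations. All names (`predict`, `fit_a`, `mae`, `r_squared`) and the synthetic data are hypothetical and do not come from experiments/results/.

```python
import math

def predict(k, n, a):
    """Formula from the text: eta = k * exp(-a * k * n)."""
    return k * math.exp(-a * k * n)

def fit_a(rows):
    """Through-origin least squares on ln(eta / k) = -a * (k * n)."""
    sxy = sum((k * n) * math.log(e / k) for k, n, e in rows)
    sxx = sum((k * n) ** 2 for k, n, _ in rows)
    return -sxy / sxx

def mae(y_true, y_pred):
    """Mean absolute error between observations and predictions."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def r_squared(y_true, y_pred):
    """1 - SS_res / SS_tot; negative when predictions are worse than the mean."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

# Synthetic observations with a deterministic +/-1% perturbation (illustrative only).
grid = [(k, n) for k in (1, 2, 4) for n in (2, 4, 8)]
rows = [(k, n, predict(k, n, 0.05) * (1 + 0.01 * (-1) ** i))
        for i, (k, n) in enumerate(grid)]

train = [r for r in rows if r[1] < 8]      # fit on the shorter n values
held_out = [r for r in rows if r[1] == 8]  # predict the longest n

a_hat = fit_a(train)
y_true = [e for _, _, e in held_out]
y_pred = [predict(k, n, a_hat) for k, n, _ in held_out]
held_mae = mae(y_true, y_pred)
held_r2 = r_squared(y_true, y_pred)
```

Splitting on n is only one possible choice of held-out subset; the split actually used in the experiments is not specified here.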