thomas-schweich/pawn-small

next-token-prediction

representation-learning

thomas-schweich committed 20 days ago

Commit b025ba4 · verified · 1 parent: f0371b5

Upload README.md with huggingface_hub

Files changed (1)

README.md +3 -4

@@ -88,13 +88,12 @@ This is the **small** variant (~9.5M parameters). PAWN is designed as a frozen b
 ### Accuracy Ratios
-PAWN is trained on uniformly random chess games, so top-1 accuracy has a hard theoretical ceiling. Ratios above 100% on the unconditioned ceiling indicate the model has learned structure beyond simply identifying legal moves. See [Accuracy Ceiling Analysis](https://github.com/thomas-schweich/PAWN/blob/main/docs/ACCURACY_CEILING.md).
+PAWN is trained on uniformly random chess games, so top-1 accuracy has a hard theoretical ceiling. Ratios above 100% on the unconditioned ceiling indicate the model exploits the outcome token to make non-uniform predictions. The MC conditioned ceiling is an estimate reported as a bracket \[corrected, naive\]; see [Accuracy Ceiling Analysis](https://github.com/thomas-schweich/PAWN/blob/main/docs/ACCURACY_CEILING.md) for methodology.
 | Ceiling | Ratio |
 |---------|-------|
-| Unconditioned (E\[1/N_legal\] = 6.43%) | 105% |
-| Naive-conditioned (1-ply filter = 6.44%) | 105% |
-| Bayes-optimal conditioned (MCTS, 32 rollouts = 7.92%) | 85% |
+| Unconditioned (E\[1/N_legal\] = 6.52%) | 103% |
+| Bayes-optimal conditioned (MC, 128 rollouts = \[6.67, 7.34\]%) | 92–101% |
 ## Probe Results
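
For readers checking the updated table, here is a minimal Python sketch of how the ratios relate to the ceilings. It assumes each ratio is the model's top-1 accuracy divided by the ceiling, which is how the table reads but is not spelled out in this diff. The function names are hypothetical, and the model accuracy of 6.72% is not reported anywhere in the diff: it is back-derived from 103% × 6.52% and used only for illustration. The move counts are toy values, not data from PAWN.

```python
from statistics import mean


def unconditioned_ceiling(legal_move_counts):
    """E[1/N_legal]: when the played move is uniform over the legal moves,
    any single legal guess is right with probability 1/N_legal, so the best
    achievable top-1 accuracy is the mean of 1/N_legal over positions."""
    return mean(1.0 / n for n in legal_move_counts)


def accuracy_ratio(model_acc_pct, ceiling_pct):
    """Model top-1 accuracy expressed as a percentage of a ceiling."""
    return 100.0 * model_acc_pct / ceiling_pct


# Toy legal-move counts for four positions (illustrative only).
toy_counts = [20, 20, 25, 40]
toy_ceiling = unconditioned_ceiling(toy_counts)  # (0.05 + 0.05 + 0.04 + 0.025) / 4

# ASSUMPTION: back-derived from the table (1.03 * 6.52), not a reported figure.
ASSUMED_MODEL_ACC_PCT = 6.72

# Ceilings taken from the "+" rows of the table, in percent.
ratios = {
    "unconditioned": round(accuracy_ratio(ASSUMED_MODEL_ACC_PCT, 6.52)),
    "mc_corrected": round(accuracy_ratio(ASSUMED_MODEL_ACC_PCT, 6.67)),
    "mc_naive": round(accuracy_ratio(ASSUMED_MODEL_ACC_PCT, 7.34)),
}
print(ratios)
```

Under that assumed accuracy, the two MC bracket endpoints give the two ends of the reported 92–101% range: the lower (corrected) ceiling yields the higher ratio, and the higher (naive) ceiling yields the lower one.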