Qwen3-32B · GSL-247 · arm F (v3-mixed-r4)

LoRA adapter trained on Lichess human games for the GSL-247 chess SFT ablation (form-vs-strategy axis). All four arms (A/B/C/F) share representation mix v3_mixed_history, N=200,000 records, identical hyperparameters; they vary only in the human-data filter applied upstream.

field value
base model Qwen/Qwen3-32B
PEFT LoRA r=32, alpha=32, dropout=0.0, modules=all-linear
train slice 180,000 records (90/5/5 of 200k)
manifest hash d4164be0b7b470a53e23fd52c9214d6f9b4aa1a6c8afde5e38f91a487947b8f4
manifest path data/v3/armF_4x.manifest.json
representation v3_mixed_history
epochs 1
optimizer lr=0.001 (linear, warmup_ratio=0.03), batch=8
training method sft
training infra Together Fine-Tuning (job ft-4c44216b-f95f)
source commit 5b6e6b8

Arm definitions

  • A: unfiltered human games
  • B: top-decile by player Elo (per-time-control p90 floor)
  • C: bottom-decile by player Elo (per-time-control p10 ceiling)
  • F: arm A minus the top-decile (i.e., everyone except B)

The B-vs-C contrast is the pre-registered falsifier for whether better human data improves SFT on chess. F is an exploratory matched-N rebuild used to disambiguate A ≈ B ≈ C results from the v2 run.

Usage

from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base = AutoModelForCausalLM.from_pretrained("Qwen/Qwen3-32B", device_map="auto")
tok  = AutoTokenizer.from_pretrained("Qwen/Qwen3-32B")
model = PeftModel.from_pretrained(base, "GoodStartLabs/qwen3-32b-gsl247-armF-v3-mixed-r4")

Eval contract

The pre-registered evaluation contract (rung-0 floor gates, rung-1 puzzle Elo ladder, B-vs-C falsifier thresholds) lives at eval/contract.yaml in the source repo. Every Inspect task stamps its contract_sha into the .eval metadata so any reported number is recoverable to the thresholds in force.

Downloads last month
2
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for GoodStartLabs/qwen3-32b-gsl247-armF-v3-mixed-r4

Base model

Qwen/Qwen3-32B
Adapter
(320)
this model