FoVer Collection Process Reward Models (PRMs) trained on step-level error labels automatically annotated by formal verification tools. • 3 items • Updated 6 days ago • 1
ryokamoi/FoVer-FormalLogic-FormalProof-Llama-3.1-8B-LastStepBalanced-40k Viewer • Updated 6 days ago • 40k • 55
ryokamoi/FoVer-FormalLogic-FormalProof-Qwen-2.5-7B-LastStepBalanced-40k Viewer • Updated 6 days ago • 40k • 79
QuantLRM: Quantization of Large Reasoning Models via Fine-Tuning Signals Paper • 2602.02581 • Published Jan 31 • 10
VisOnlyQA Collection Dataset for evaluating the visual perception capabilities of LVLMs. • 12 items • Updated Mar 2 • 4