- dataset:
    id: SWE-bench/SWE-bench_Verified
    task_id: swe_bench_%_resolved
  value: 72.80
  source:
    url: https://www.swebench.com/
    name: SWE-Bench official evaluation
    user: nielsr
  notes: high reasoning, official
- dataset:
    id: SWE-bench/SWE-bench_Verified
    task_id: swe_bench_%_resolved
  value: 77.8
  source:
    url: https://huggingface.co/zai-org/GLM-5/
    name: Model card
    user: nielsr
  notes: Z.ai reported number