YAML Metadata Error: Invalid content in Eval Result file .eval_results/hle_high_with_tools.yaml
Task ID "hle" does not match any task in dataset "cais/hle". Available: none
- dataset:
    id: cais/hle
    task_id: hle
  value: 19.0
  date: '2025-08-05'
  source:
    url: https://arxiv.org/abs/2508.10925
    name: GPT-OSS Model Card
    user: SaylorTwift
  notes: "Reasoning: high, With tools"