YAML Metadata Error:Invalid content in Eval Result file .eval_results/hle_high_with_tools.yaml

Check out the documentation for more information.

Show details

Task ID "hle" does not match any task in dataset "cais/hle". Available: none

gpt-oss-120b / .eval_results /hle_high_with_tools.yaml

Add evaluation results from GPT-OSS paper

fba8d8b verified about 2 months ago

222 Bytes

	- dataset:
	id: cais/hle
	task_id: hle
	value: 19.0
	date: '2025-08-05'
	source:
	url: https://arxiv.org/abs/2508.10925
	name: GPT-OSS Model Card
	user: SaylorTwift
	notes: "Reasoning: high, With tools"