Add MathArena evaluation result for hmmt/hmmt_feb_2026
#17 opened 28 days ago
by
JasperDekoninck
Add MathArena evaluation result for aime/aime_2026
#16 opened 28 days ago
by
JasperDekoninck
RoPE theta 5mln instead of 1mln
#15 opened about 2 months ago
by
Michalea
📋 Documentation Enhancement Suggestion
#14 opened about 2 months ago
by
CroviaTrust
Add GPQA evaluation result
#13 opened 3 months ago
by
burtenshaw
一种用于加速和增强语言模型微调的新颖“负重”方法
#12 opened 3 months ago
by
aifeifei798
Add community evaluation results for MMLU-PRO, GPQA
#10 opened 3 months ago
by
nielsr
Add Artificial Analysis evaluations for qwen3-8b-instruct-reasoning
#9 opened 5 months ago
by
burtenshaw
Input Hallucination
#7 opened 7 months ago
by
zhangziji1021
Only end </think> tag but no start <think> tag.
7
#5 opened 8 months ago
by
zhangziji1021
Sampling parameters & vLLM settings for tau2-bench?
#4 opened 8 months ago
by
lewtun
Request: DOI
1
#3 opened 8 months ago
by
Raybou
Terrible instruction following
👍 1
4
#2 opened 8 months ago
by
denisalpino
32B 32B 32B
👍🤝 10
1
#1 opened 8 months ago
by
imoc