thunder-research-group's Collections

SNU Thunder-LLM Korean Benchmark Suite

updated Mar 2

  • thunder-research-group/SNU_Ko-LAMBADA

    Viewer • Updated Jun 13, 2025 • 2.26k • 120

  • thunder-research-group/SNU_Ko-WinoGrande

    Viewer • Updated Jun 13, 2025 • 1.27k • 50

  • thunder-research-group/SNU_Ko-ARC

    Viewer • Updated Jun 13, 2025 • 3.54k • 18

  • thunder-research-group/SNU_Ko-GSM8K

    Viewer • Updated Oct 16, 2025 • 1.32k • 20 • 1

  • thunder-research-group/SNU_Ko-IFEval

    Viewer • Updated Jun 13, 2025 • 841 • 199

  • thunder-research-group/SNU_Ko-EQ-Bench

    Viewer • Updated Jun 13, 2025 • 171 • 38

  • skt/kobest_v1

    Viewer • Updated Mar 28, 2024 • 23.4k • 3.14k • 54

    Note: We use the test split of the hellaswag subset for evaluation.


  • HAERAE-HUB/KMMLU

    Viewer • Updated Mar 5, 2024 • 244k • 6.88k • 97

  • HYU-NLP/KR-HumanEval

    Viewer • Updated Jun 3, 2025 • 328 • 20

    Note: We use v1 for evaluation.


  • LGCNS/KorQuAD_2.0

    Viewer • Updated Aug 7, 2025 • 93.7k • 129 • 2

  • thunder-research-group/SNU_Ko-MuSR

    Viewer • Updated Nov 24, 2025 • 750 • 10
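
The note on skt/kobest_v1 above names a specific subset and split. As a minimal sketch, assuming the third-party `datasets` library is installed and the Hub is reachable, that selection could be loaded as follows. The constant and function names are hypothetical and chosen only for illustration; the collection itself does not prescribe a loading procedure.

```python
from typing import Any

# Values taken from the collection note on skt/kobest_v1:
# the hellaswag subset, test split.
KOBEST_REPO = "skt/kobest_v1"
KOBEST_CONFIG = "hellaswag"
KOBEST_SPLIT = "test"


def describe_selection() -> str:
    """Return a human-readable summary of the subset and split used."""
    return f"{KOBEST_REPO} [{KOBEST_CONFIG}] ({KOBEST_SPLIT})"


def load_kobest_hellaswag_test() -> Any:
    """Download and return the KoBEST HellaSwag test split.

    Assumption: requires the `datasets` package and network access.
    """
    # Imported lazily so the module can be imported without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(KOBEST_REPO, KOBEST_CONFIG, split=KOBEST_SPLIT)
```

Calling `load_kobest_hellaswag_test()` performs the download; `describe_selection()` only reports which subset and split would be requested.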