-
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations
Paper • 2602.05885 • Published • 28 -
hkust-nlp/drkernel-14b
Text Generation • 15B • Updated • 49 • 6 -
hkust-nlp/drkernel-8b
Text Generation • 8B • Updated • 332 • 4 -
hkust-nlp/drkernel-14b-coldstart
Text Generation • 0.5B • Updated • 607
HKUST NLP Group
university
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
STALE: Can LLM Agents Know When Their Memories Are No Longer Valid?
AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios
-
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations
Paper • 2602.05885 • Published • 28 -
hkust-nlp/drkernel-14b
Text Generation • 15B • Updated • 49 • 6 -
hkust-nlp/drkernel-8b
Text Generation • 8B • Updated • 332 • 4 -
hkust-nlp/drkernel-14b-coldstart
Text Generation • 0.5B • Updated • 607
models 66
hkust-nlp/drkernel-8b-coldstart
Text Generation • 0.3B • Updated • 15 •
hkust-nlp/drkernel-14b-coldstart
Text Generation • 0.5B • Updated • 607
hkust-nlp/drkernel-14b
Text Generation • 15B • Updated • 49 • 6
hkust-nlp/drkernel-8b
Text Generation • 8B • Updated • 332 • 4
hkust-nlp/WebExplorer-8B
Image-Text-to-Text • 8B • Updated • 418 • 14
hkust-nlp/Qwen-2.5-7B-Verifier-general-verifier
Reinforcement Learning • 8B • Updated • 2
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Qwen-1.5B
Reinforcement Learning • 8B • Updated • 3
hkust-nlp/Qwen-2.5-7B-Verifier-HF
Reinforcement Learning • 8B • Updated • 5
hkust-nlp/R1-Distill-Verifier-1.5B
2B • Updated • 4 • 1
hkust-nlp/Qwen-2.5-7B-Verifier-R1-Verifier-1.5B
Reinforcement Learning • 8B • Updated • 1 • 1
datasets 32
hkust-nlp/drkernel-validation-data
Viewer • Updated • 100 • 54 • 1
hkust-nlp/drkernel-rl-data
Viewer • Updated • 72k • 85
hkust-nlp/drkernel-coldstart-8k
Viewer • Updated • 8.92k • 66 • 2
hkust-nlp/Toolathlon-Trajectories
Preview • Updated • 3.62k • 20
hkust-nlp/WebExplorer-QA
Viewer • Updated • 100 • 109 • 7
hkust-nlp/CodeIO-PyEdu-Reasoning-Raw
Updated • 67 • 2
hkust-nlp/CodeIO-PyEdu-Reasoning
Preview • Updated • 148 • 58
hkust-nlp/rl-verifier-pitfalls_hacking_data
Viewer • Updated • 6.12k • 43 • 1
hkust-nlp/deepscaler_simplelr
Viewer • Updated • 40.3k • 37
hkust-nlp/Laser-Deepscaler-Dataset
Viewer • Updated • 40.8k • 98