10 1

li sheng

bambisheng

https://github.com/BambiSheng

AI & ML interests

None yet

Recent Activity

upvoted a paper about 17 hours ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

upvoted a paper 17 days ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

updated a dataset 23 days ago

dynn-datasets/Evaluation

View all activity

Organizations

Collections 1

Papers 2

arxiv:2504.16084

arxiv:2502.04153

models 3

datasets 0

None public yet

li sheng

AI & ML interests

Recent Activity

Organizations

Collections 1

UltraIF: Advancing Instruction Following from the Wild

bambisheng/UltraIF-8B-SFT

bambisheng/UltraIF-8B-UltraComposer

bambisheng/UltraIF-8B-DPO

UltraIF: Advancing Instruction Following from the Wild

bambisheng/UltraIF-8B-SFT

bambisheng/UltraIF-8B-UltraComposer

bambisheng/UltraIF-8B-DPO

Papers 2

models 3

bambisheng/UltraIF-8B-DPO

bambisheng/UltraIF-8B-UltraComposer

bambisheng/UltraIF-8B-SFT

datasets 0

li sheng

AI & ML interests

Recent Activity

Organizations

Collections 1

Papers 2

models 3 Sort: Recently updated

datasets 0

models 3