Open-Sourced model and data for ULTRAIF: Advancing Instruction Following from the Wild.
li sheng
bambisheng
Recent Activity
upvoted a paper about 17 hours ago
Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
upvoted a paper 17 days ago
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models
updated a dataset 23 days ago
dynn-datasets/Evaluation