Collections
Collections including paper arxiv:2410.05193
- Instruction Following without Instruction Tuning
  Paper • 2409.14254 • Published • 29
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models
  Paper • 2409.18943 • Published • 27
- Only-IF:Revealing the Decisive Effect of Instruction Diversity on Generalization
  Paper • 2410.04717 • Published • 18
- RevisEval: Improving LLM-as-a-Judge via Response-Adapted References
  Paper • 2410.05193 • Published • 14

- LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations
  Paper • 2410.02707 • Published • 47
- Differential Transformer
  Paper • 2410.05258 • Published • 182
- RevisEval: Improving LLM-as-a-Judge via Response-Adapted References
  Paper • 2410.05193 • Published • 14
- DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
  Paper • 2410.03864 • Published • 12

- Revisit Large-Scale Image-Caption Data in Pre-training Multimodal Foundation Models
  Paper • 2410.02740 • Published • 53
- From Code to Correctness: Closing the Last Mile of Code Generation with Hierarchical Debugging
  Paper • 2410.01215 • Published • 39
- Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
  Paper • 2409.17146 • Published • 121
- EuroLLM: Multilingual Language Models for Europe
  Paper • 2409.16235 • Published • 29