Collections
Discover the best community collections!
Collections including paper arxiv:2605.01428
-
Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning
Paper • 2510.03259 • Published • 57 -
Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense
Paper • 2510.07242 • Published • 30 -
First Try Matters: Revisiting the Role of Reflection in Reasoning Models
Paper • 2510.08308 • Published • 24 -
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
Paper • 2510.03222 • Published • 76
-
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
Paper • 2402.04252 • Published • 31 -
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
Paper • 2402.03749 • Published • 15 -
ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Paper • 2402.04615 • Published • 45 -
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss
Paper • 2402.05008 • Published • 24
-
Multi-agent cooperation through in-context co-player inference
Paper • 2602.16301 • Published • 24 -
TIDE: Every Layer Knows the Token Beneath the Context
Paper • 2605.06216 • Published • 11 -
Continuous Latent Diffusion Language Model
Paper • 2605.06548 • Published • 81 -
Hallucinations Undermine Trust; Metacognition is a Way Forward
Paper • 2605.01428 • Published • 24
-
I-Con: A Unifying Framework for Representation Learning
Paper • 2504.16929 • Published • 31 -
SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens
Paper • 2508.05305 • Published • 48 -
The Strong Lottery Ticket Hypothesis for Multi-Head Attention Mechanisms
Paper • 2511.04217 • Published • 17 -
Large Language Models as Markov Chains
Paper • 2410.02724 • Published • 33
-
Multi-agent cooperation through in-context co-player inference
Paper • 2602.16301 • Published • 24 -
TIDE: Every Layer Knows the Token Beneath the Context
Paper • 2605.06216 • Published • 11 -
Continuous Latent Diffusion Language Model
Paper • 2605.06548 • Published • 81 -
Hallucinations Undermine Trust; Metacognition is a Way Forward
Paper • 2605.01428 • Published • 24
-
Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning
Paper • 2510.03259 • Published • 57 -
Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense
Paper • 2510.07242 • Published • 30 -
First Try Matters: Revisiting the Role of Reflection in Reasoning Models
Paper • 2510.08308 • Published • 24 -
Low-probability Tokens Sustain Exploration in Reinforcement Learning with Verifiable Reward
Paper • 2510.03222 • Published • 76
-
I-Con: A Unifying Framework for Representation Learning
Paper • 2504.16929 • Published • 31 -
SONAR-LLM: Autoregressive Transformer that Thinks in Sentence Embeddings and Speaks in Tokens
Paper • 2508.05305 • Published • 48 -
The Strong Lottery Ticket Hypothesis for Multi-Head Attention Mechanisms
Paper • 2511.04217 • Published • 17 -
Large Language Models as Markov Chains
Paper • 2410.02724 • Published • 33
-
EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
Paper • 2402.04252 • Published • 31 -
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
Paper • 2402.03749 • Published • 15 -
ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Paper • 2402.04615 • Published • 45 -
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss
Paper • 2402.05008 • Published • 24