CORE: Contrastive Reflection Enables Rapid Improvements in Reasoning Paper • 2605.28742 • Published 15 days ago • 3
Reinforcement Learning from Rich Feedback with Distributional DAgger Paper • 2606.05152 • Published 8 days ago • 3
Entropy as a Structural Prior: How a Log-Barrier on DiT Belief Space Drives Musical Diversity and Development Paper • 2606.07207 • Published 6 days ago • 3
Bayesian-Agent: Posterior-Guided Skill Evolution for LLM Agent Harnesses Paper • 2606.08348 • Published 5 days ago • 12
FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention Paper • 2606.09079 • Published 3 days ago • 52
UnpredictaBench: A Benchmark for Evaluating Distributional Randomness in LLMs Paper • 2606.06622 • Published 7 days ago • 19
LLM Explainability with Counterfactual Chains and Causal Graphs Paper • 2606.05972 • Published 7 days ago • 16
Echo-Memory: A Controlled Study of Memory in Action World Models Paper • 2606.09803 • Published 3 days ago • 30
When Tools Fail: Benchmarking Dynamic Replanning and Anomaly Recovery in LLM Agents Paper • 2606.05806 • Published 7 days ago • 21
Human Psychometric Questionnaires Mischaracterize LLM Behavior Paper • 2509.10078 • Published 13 days ago • 31
Direct 3D-Aware Object Insertion via Decomposed Visual Proxies Paper • 2606.06601 • Published 7 days ago • 24
AnchorWorld: Embodied Egocentric World Simulation with View-based Evolution Customization Paper • 2606.07326 • Published 6 days ago • 27
SoCRATES: Towards Reliable Automated Evaluation of Proactive LLM Mediation across Domains and Socio-cognitive Variations Paper • 2606.05563 • Published 7 days ago • 47
Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings Paper • 2606.07502 • Published 6 days ago • 84
When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges Paper • 2605.26046 • Published 17 days ago • 3