-
Reasoning Under 1 Billion: Memory-Augmented Reinforcement Learning for Large Language Models
Paper • 2504.02273 • Published • 7 -
Multi-Reference Preference Optimization for Large Language Models
Paper • 2405.16388 • Published • 1 -
Automatic Prompt Selection for Large Language Models
Paper • 2404.02717 • Published • 1 -
Hear Both Sides: Efficient Multi-Agent Debate via Diversity-Aware Message Retention
Paper • 2603.20640 • Published • 2
Hung Le
neurocoder
AI & ML interests
None yet
Recent Activity
updated a collection about 20 hours ago
My papers new activity 11 months ago
neurocoder/Qwen2.5-0.5B-Instruct-MemoryR:Improve language tagOrganizations
None yet