lmx
meixiu
AI & ML interests
None yet
Recent Activity
upvoted a paper 4 days ago
SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating
upvoted a paper 12 days ago
SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search
liked a model about 2 months ago
deepseek-ai/DeepSeek-V4-Pro