lmx
meixiu
AI & ML interests
None yet
Recent Activity
upvoted a paper 1 day ago
SlimSearcher: Training Efficiency-Aware Web Agents via Adaptive Reward Gating upvoted a paper 10 days ago
SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search liked a model about 2 months ago
deepseek-ai/DeepSeek-V4-Pro