Shaobai Jiang

shaobaij

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Pioneer Agent: Continual Improvement of Small Language Models in Production

upvoted a paper 1 day ago

Toward Autonomous Long-Horizon Engineering for ML Research

upvoted a paper 1 day ago

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Organizations

None yet

upvoted 7 papers 1 day ago

Pioneer Agent: Continual Improvement of Small Language Models in Production

Paper • 2604.09791 • Published 9 days ago • 2

Toward Autonomous Long-Horizon Engineering for ML Research

Paper • 2604.13018 • Published 5 days ago • 31

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2604.12374 • Published 5 days ago • 29

upvoted 2 papers 2 days ago

SciPredict: Can LLMs Predict the Outcomes of Scientific Experiments in Natural Sciences?

Paper • 2604.10718 • Published 7 days ago • 4

Multi-User Large Language Model Agents

Paper • 2604.08567 • Published about 1 month ago • 26

upvoted 3 papers 3 days ago

p1: Better Prompt Optimization with Fewer Prompts

Paper • 2604.08801 • Published 10 days ago • 8

The Depth Ceiling: On the Limits of Large Language Models in Discovering Latent Planning

Paper • 2604.06427 • Published 12 days ago • 11

ParseBench: A Document Parsing Benchmark for AI Agents

Paper • 2604.08538 • Published 10 days ago • 7

upvoted 2 papers 4 days ago

Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music

Paper • 2604.10905 • Published 6 days ago • 26

The ATOM Report: Measuring the Open Language Model Ecosystem

Paper • 2604.07190 • Published 11 days ago • 5

upvoted 6 papers 5 days ago

MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models

Paper • 2603.25744 • Published 23 days ago • 13

SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning

Paper • 2603.22057 • Published 26 days ago • 46

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109

When Models Judge Themselves: Unsupervised Self-Evolution for Multimodal Reasoning

Paper • 2603.21289 • Published 27 days ago • 35

Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought

Paper • 2603.22847 • Published 25 days ago • 26

Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models

Paper • 2603.24844 • Published 24 days ago • 10
