TianlaiChen 's Collections papers
updated
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through
Two-Stage Rule-Based RL
Paper
• 2503.07536
• Published • 88
Seedream 2.0: A Native Chinese-English Bilingual Image Generation
Foundation Model
Paper
• 2503.07703
• Published • 37
Gemini Embedding: Generalizable Embeddings from Gemini
Paper
• 2503.07891
• Published • 46
Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning
Paper
• 2503.07572
• Published • 48
Implicit Reasoning in Transformers is Reasoning through Shortcuts
Paper
• 2503.07604
• Published • 23
Beyond Decoder-only: Large Language Models Can be Good Encoders for
Machine Translation
Paper
• 2503.06594
• Published • 6
A Survey of Efficient Reasoning for Large Reasoning Models: Language,
Multimodality, and Beyond
Paper
• 2503.21614
• Published • 43
Exploring Data Scaling Trends and Effects in Reinforcement Learning from
Human Feedback
Paper
• 2503.22230
• Published • 45
REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion
Transformers
Paper
• 2504.10483
• Published • 22
Efficient Reasoning Models: A Survey
Paper
• 2504.10903
• Published • 21
DataDecide: How to Predict Best Pretraining Data with Small Experiments
Paper
• 2504.11393
• Published • 19
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation
through Pretraining, SFT, and RL
Paper
• 2504.11455
• Published • 14
InternVL3: Exploring Advanced Training and Test-Time Recipes for
Open-Source Multimodal Models
Paper
• 2504.10479
• Published • 308
Scaling Data-Constrained Language Models
Paper
• 2305.16264
• Published • 16