Lize Pirenne's picture

Lize Pirenne

Inversta

·

Pangasius

AI & ML interests

LLMs, RL

Recent Activity

upvoted a paper about 6 hours ago

FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling

upvoted a paper about 6 hours ago

INSPATIO-WORLD: A Real-Time 4D World Simulator via Spatiotemporal Autoregressive Modeling

upvoted a paper about 6 hours ago

DMax: Aggressive Parallel Decoding for dLLMs

View all activity

Organizations

None yet

upvoted 4 papers about 6 hours ago

FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling

Paper • 2604.06916 • Published 7 days ago • 32

INSPATIO-WORLD: A Real-Time 4D World Simulator via Spatiotemporal Autoregressive Modeling

Paper • 2604.07209 • Published 7 days ago • 35

DMax: Aggressive Parallel Decoding for dLLMs

Paper • 2604.08302 • Published 6 days ago • 48

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Paper • 2604.04921 • Published 9 days ago • 106

upvoted a paper about 7 hours ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published 12 days ago • 352

upvoted 2 papers about 8 hours ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published 13 days ago • 468

FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization

Paper • 2603.19835 • Published 25 days ago • 335

upvoted a paper about 11 hours ago

DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models

Paper • 2603.26164 • Published 18 days ago • 351

upvoted a paper about 12 hours ago

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 17 days ago • 141

upvoted 8 papers 1 day ago

Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale

Paper • 2603.25040 • Published 20 days ago • 130

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published 22 days ago • 135

Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing

Paper • 2603.12254 • Published Mar 12 • 22

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published 28 days ago • 308

Demystifing Video Reasoning

Paper • 2603.16870 • Published 28 days ago • 368

Lost in Backpropagation: The LM Head is a Gradient Bottleneck

Paper • 2603.10145 • Published Mar 10 • 13

Lost in Stories: Consistency Bugs in Long Story Generation by LLMs

Paper • 2603.05890 • Published Mar 6 • 93

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published Mar 10 • 151

upvoted 3 papers about 1 month ago

Spilled Energy in Large Language Models

Paper • 2602.18671 • Published Feb 21 • 12

Helios: Real Real-Time Long Video Generation Model

Paper • 2603.04379 • Published Mar 4 • 186

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published Feb 5 • 352