Jiaqi Tang's picture

In a Training Loop 🔄

Jiaqi Tang PRO

Jiaqi-hkust

·

https://jqt.me/

AI & ML interests

Multimodal Large Language Model

Recent Activity

upvoted a paper about 11 hours ago

Rethinking the Divergence Regularization in LLM RL

upvoted a paper about 11 hours ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

updated a Space about 11 hours ago

Jiaqi-hkust/Robust-R1

View all activity

Organizations

Jiaqi-hkust 's models 3

Jiaqi-hkust/Robust-R1-SFT

4B • Updated Dec 22, 2025 • 247 • 5

Jiaqi-hkust/Robust-R1-RL

4B • Updated Dec 22, 2025 • 19 • 2

Jiaqi-hkust/hawk

Updated Feb 26, 2025 • 3