Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Jiaqi Tang's picture
In a Training Loop ๐Ÿ”„
3 9 13

Jiaqi Tang PRO

Jiaqi-hkust
renzhiyingdanxingcongcong's profile picture Limalimarj's profile picture branikita's profile picture
ยท
https://jqt.me/
  • jqtangust
  • jqtnpu

AI & ML interests

Multimodal Large Language Model

Recent Activity

upvoted a paper about 11 hours ago
Rethinking the Divergence Regularization in LLM RL
upvoted a paper about 11 hours ago
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models
updated a Space about 11 hours ago
Jiaqi-hkust/Robust-R1
View all activity

Organizations

PolyX Research's profile picture Tencent-Hunyuan-Multimodal-RL's profile picture harnessRL's profile picture

Jiaqi-hkust 's models 3

Jiaqi-hkust/Robust-R1-SFT

4B โ€ข Updated Dec 22, 2025 โ€ข 247 โ€ข 5

Jiaqi-hkust/Robust-R1-RL

4B โ€ข Updated Dec 22, 2025 โ€ข 19 โ€ข 2

Jiaqi-hkust/hawk

Updated Feb 26, 2025 โ€ข 3
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs