mz.w's picture

mz.w

iiiiwis

·

AI & ML interests

None yet

Recent Activity

authored a paper about 11 hours ago

From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space

upvoted a paper about 12 hours ago

From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space

upvoted a paper 29 days ago

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

View all activity

Organizations

None yet

iiiiwis 's datasets 2

iiiiwis/AMPO

Preview • Updated May 15, 2025 • 55 • 1

iiiiwis/DEMO

Viewer • Updated Dec 16, 2024 • 7.98k • 17 • 1