mz.w

iiiiwis

AI & ML interests

None yet

Recent Activity

authored a paper about 11 hours ago

From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space

upvoted a paper about 12 hours ago

From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space

upvoted a paper 29 days ago

On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models

Organizations

None yet

iiiiwis's models (1)

iiiiwis/DEMO_Agent

Text Generation • Updated Dec 10, 2024 • 2