arxiv:2604.14142
mz.w
iiiiwis
AI & ML interests
None yet
Recent Activity
authored a paper about 9 hours ago
From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space upvoted a paper about 10 hours ago
From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space upvoted a paper 29 days ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language ModelsOrganizations
None yet