Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
6
20
5
mz.w
iiiiwis
Follow
tnlin's profile picture
RainBowLuo's profile picture
2 followers
·
3 following
AI & ML interests
None yet
Recent Activity
authored
a paper
about 11 hours ago
From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space
upvoted
a
paper
about 12 hours ago
From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
upvoted
a
paper
29 days ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
View all activity
Organizations
None yet
iiiiwis
's datasets
2
Sort: Recently updated
iiiiwis/AMPO
Preview
•
Updated
May 15, 2025
•
55
•
1
iiiiwis/DEMO
Viewer
•
Updated
Dec 16, 2024
•
7.98k
•
17
•
1