Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
6
20
5
mz.w
iiiiwis
Follow
tnlin's profile picture
RainBowLuo's profile picture
2 followers
·
3 following
AI & ML interests
None yet
Recent Activity
authored
a paper
about 11 hours ago
From $P(y|x)$ to $P(y)$: Investigating Reinforcement Learning in Pre-train Space
upvoted
a
paper
about 12 hours ago
From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
upvoted
a
paper
29 days ago
On the Entropy Dynamics in Reinforcement Fine-Tuning of Large Language Models
View all activity
Organizations
None yet
iiiiwis
's models
1
Sort: Recently updated
iiiiwis/DEMO_Agent
Text Generation
•
Updated
Dec 10, 2024
•
2