HaeChan0305/qwen3-0_6b-grpohistbeta-paper-batch128-cliph1_0-clipl1_0-clipc10-nokl-lr1e-6-df0_75
qwen3-0_6b-grpohistbeta-paper-batch128-cliph1_0-clipl1_0-clipc10-nokl-lr1e-6-df0_75/actor (branch: main, 3.02 GB)
1 contributor
History: 1 commit
Latest commit: HaeChan0305, "Upload global_step_135", 9b980c7 (verified), about 1 month ago
huggingface: Upload global_step_135, about 1 month ago
fsdp_config.json: 46 Bytes, Safe, Upload global_step_135, about 1 month ago
model_world_size_1_rank_0.pt: 3.01 GB, pickle, xet, Upload global_step_135, about 1 month ago
    Detected Pickle imports (5): "collections.OrderedDict", "torch.FloatStorage", "torch.Tensor", "torch._utils._rebuild_tensor_v2", "torch._tensor._rebuild_from_type_v2"
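The imports listed for model_world_size_1_rank_0.pt are the globals that its pickle stream references. As a rough illustration of how such a list can be produced without unpickling anything, the sketch below walks the opcodes of a pickle byte string with the standard library's `pickletools`. The helper name `list_pickle_imports` is hypothetical, this is not the Hub's actual scanner, and the `STACK_GLOBAL` handling is a simple heuristic that assumes the module and attribute names are the two most recently pushed strings.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals referenced by a pickle stream.

    Hypothetical helper for illustration: it only reads opcodes and
    never executes the pickle.
    """
    imports = set()
    strings = []  # string arguments seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Older protocols: arg is "module name", separated by one space.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: heuristic, take the two most recent strings.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


if __name__ == "__main__":
    # Demo on an in-memory pickle rather than the 3.01 GB checkpoint.
    demo = pickle.dumps(collections.OrderedDict(a=1))
    print(sorted(list_pickle_imports(demo)))
```

To apply the same idea to a checkpoint file, the pickle bytes would first have to be extracted from it; how that is done depends on how the file was saved and is not shown here.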