willx7890
/

Qwen2-0.5B-GRPO-test

Generated from Trainer

Model card Files Files and versions

Metrics Training metrics Community

Qwen2-0.5B-GRPO-test

18.1 MB

Ctrl+K

Ctrl+K

1 contributor

History: 7 commits

willx7890's picture

Training in progress, step 10

89a0f27 verified 10 months ago

runs
Training in progress, step 10 10 months ago
.gitattributes

1.57 kB
Training in progress, step 10 10 months ago
README.md

2.03 kB
Training in progress, step 10 10 months ago
adapter_config.json

778 Bytes
Training in progress, step 10 10 months ago
adapter_model.safetensors

2.18 MB
xet

Training in progress, step 10 10 months ago
added_tokens.json

80 Bytes
Training in progress, step 10 10 months ago
chat_template.jinja

328 Bytes
Training in progress, step 10 10 months ago
merges.txt

1.67 MB
Training in progress, step 10 10 months ago
special_tokens_map.json

367 Bytes
Training in progress, step 10 10 months ago
tokenizer.json

11.4 MB
xet

Training in progress, step 10 10 months ago
tokenizer_config.json

999 Bytes
Training in progress, step 10 10 months ago
training_args.bin
Detected Pickle imports (10)
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.trainer_utils.SchedulerType",
- "trl.trainer.grpo_config.GRPOConfig",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "torch.device",
- "transformers.trainer_utils.SaveStrategy",
- "transformers.training_args.OptimizerNames",
- "accelerate.state.PartialState",
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.trainer_utils.HubStrategy"
How to fix it?
6.39 kB
xet

Training in progress, step 10 10 months ago
vocab.json

2.78 MB
Training in progress, step 10 10 months ago