Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
zswzswzsw
/
simpo_run
like
0
arxiv:
2310.16944
arxiv:
2203.02155
arxiv:
2307.09288
Model card
Files
Files and versions
xet
Community
main
simpo_run
741 kB
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
zswzswzsw
Upload folder using huggingface_hub
57625ca
verified
about 1 year ago
.github
Upload folder using huggingface_hub
about 1 year ago
assets
Upload folder using huggingface_hub
about 1 year ago
chapters
Upload folder using huggingface_hub
about 1 year ago
recipes
Upload folder using huggingface_hub
about 1 year ago
scripts
Upload folder using huggingface_hub
about 1 year ago
src
Upload folder using huggingface_hub
about 1 year ago
tests
Upload folder using huggingface_hub
about 1 year ago
trl_012_grpo
Upload folder using huggingface_hub
about 1 year ago
.gitattributes
Safe
1.52 kB
initial commit
about 1 year ago
.gitignore
Safe
3.11 kB
Upload folder using huggingface_hub
about 1 year ago
CITATION.cff
Safe
738 Bytes
Upload folder using huggingface_hub
about 1 year ago
LICENSE
Safe
11.4 kB
Upload folder using huggingface_hub
about 1 year ago
Makefile
Safe
1.03 kB
Upload folder using huggingface_hub
about 1 year ago
README.md
Safe
8.28 kB
Upload folder using huggingface_hub
about 1 year ago
config_dpo_run.yaml
Safe
2.05 kB
Upload folder using huggingface_hub
about 1 year ago
config_grpo_offline.yaml
2.17 kB
Upload folder using huggingface_hub
about 1 year ago
config_sft_test_env.yaml
Safe
2.02 kB
Upload folder using huggingface_hub
about 1 year ago
grpo_max_completion.py
9.29 kB
Upload folder using huggingface_hub
about 1 year ago
grpo_offline_run.py
8.5 kB
Upload folder using huggingface_hub
about 1 year ago
run_dpo.py
Safe
10.3 kB
Upload folder using huggingface_hub
about 1 year ago
run_sft_test_env.py
Safe
7.86 kB
Upload folder using huggingface_hub
about 1 year ago
run_simpo.py
14.6 kB
Upload folder using huggingface_hub
about 1 year ago
setup.cfg
Safe
698 Bytes
Upload folder using huggingface_hub
about 1 year ago
setup.py
Safe
4.9 kB
Upload folder using huggingface_hub
about 1 year ago
simpo_ori.yaml
1.03 kB
Upload folder using huggingface_hub
about 1 year ago
test.json
Safe
269 kB
Upload folder using huggingface_hub
about 1 year ago