Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Shubhamw11
/
gemma-3-270m-dpo-negative
like
1
PyTorch
English
gemma3
rlhf
dpo
slm
tinystories
alignment
License:
mit
Model card
Files
Files and versions
xet
Community
main
gemma-3-270m-dpo-negative
/
README.md
Commit History
.
aff1d57
Shubhamw11
commited on
19 days ago
.
9b334a0
Shubhamw11
commited on
19 days ago
.
8e6eafa
Shubhamw11
commited on
19 days ago