dmayboroda/duckhunt_liquidai_3b_grpo
Text Generation
PEFT
Safetensors
Transformers
lora
conversational
arxiv:1910.09700
.gitattributes
Commit History
initial commit (de4d3f9, verified)
dmayboroda committed on Mar 24