Reinforcement Learning Human Feedback 🔥 Collecting human preferences for RL model training.
ahirtonlopes/layoutlmv2-base-uncased_finetuned_docvqa Document Question Answering • 0.2B • Updated 24 days ago • 38
ahirtonlopes/distilbert-base-uncased-finetuned-squad Question Answering • 66.4M • Updated Nov 9, 2023 • 3
ahirtonlopes/swin-tiny-patch4-window7-224-finetuned-cifar10 Image Classification • Updated Oct 5, 2023 • 24