Inference Providers
Active filters: dpo
tsavage68/IE_M2_50steps_1e7rate_03beta_SFT
Text Generation
• 7B • Updated • 2
SongTonyLi/gemma-2b-it-CPT-D1_chosen-then-DPO-D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge
Text Generation
• 3B • Updated • 4
tsavage68/IE_M2_350steps_1e8rate_01beta_cSFTDPO
Text Generation
• 7B • Updated • 2
tsavage68/IE_M2_350steps_1e8rate_03beta_cSFTDPO
Text Generation
• 7B • Updated • 3
tsavage68/IE_L3_1000steps_1e8rate_01beta_cSFTDPO
Text Generation
• 8B • Updated • 4
Text Generation
• 8B • Updated • 6
tsavage68/IE_M2_350steps_1e8rate_05beta_cSFTDPO
Text Generation
• 7B • Updated • 3
tsavage68/IE_L3_1000steps_1e8rate_03beta_cSFTDPO
Text Generation
• 8B • Updated • 4
tsavage68/IE_L3_1000steps_1e8rate_05beta_cSFTDPO
Text Generation
• 8B • Updated • 4
tsavage68/IE_L3_150steps_1e7rate_01beta_cSFTDPO
Text Generation
• 8B • Updated • 2
tsavage68/IE_L3_100steps_1e7rate_03beta_cSFTDPO
Text Generation
• 8B • Updated • 3
tsavage68/IE_L3_100steps_1e7rate_05beta_cSFTDPO
Text Generation
• 8B • Updated • 3
tsavage68/IE_L3_450steps_1e8rate_01beta_cSFTDPO
Text Generation
• 8B • Updated • 2
DUAL-GPO/zephyr-7b-ipo-40k-60k-0.001-i2
tsavage68/IE_L3_350steps_1e8rate_03beta_cSFTDPO
Text Generation
• 8B • Updated • 3
pL-Community/SauerkrautLM-Mixtral-8x7B-Instruct-FP8-Dynamic
Text Generation
• 47B • Updated • 19
• 1
taicheng/zephyr-7b-align-scan
Text Generation
• 7B • Updated • 4
taicheng/zephyr-7b-align-scan-0.0-0.9-linear-2
Text Generation
• 7B • Updated • 4
taicheng/zephyr-7b-align-scan-0.0-0.2-polynomial-3
Text Generation
• 7B • Updated • 4
taicheng/zephyr-7b-align-scan-0.0-0.8-cosine-1
Text Generation
• 7B • Updated • 2
taicheng/zephyr-7b-align-scan-0.0-0.7-cosine-3
Text Generation
• 7B • Updated • 2
taicheng/zephyr-7b-align-scan-0.0-0.3-polynomial-3
Text Generation
• 7B • Updated • 4
taicheng/zephyr-7b-align-scan-0.0-1.0-linear-2
Text Generation
• 7B • Updated • 4
taicheng/zephyr-7b-align-scan-0.0-0.9-polynomial-3
Text Generation
• 7B • Updated • 2
taicheng/zephyr-7b-align-scan-0.0-0.4-polynomial-2
Text Generation
• 7B • Updated • 2
taicheng/zephyr-7b-align-scan-0.0-0.3-linear-3
Text Generation
• 7B • Updated • 5
SongTonyLi/gemma-2b-it-DPO-D1-HuggingFaceH4-ultrafeedback_binarized-Xlarge
Text Generation
• 3B • Updated • 2
taicheng/zephyr-7b-align-scan-0.0-0.3-polynomial-1
Text Generation
• 7B • Updated • 2
yuvraj17/Llama3-8B-SuperNova-Spectrum-Hermes-DPO
Text Generation
• 8B • Updated • 6
SongTonyLi/OpenELM-450M-CPT-D1_chosen-then-DPO-D2a-HuggingFaceH4-ultrafeedback_binarized-Xlarge
Text Generation
• 0.5B • Updated • 2