mradermacher/Reflector-Internalizing-Safety-Llama-3.1-8B-RL-GGUF Reinforcement Learning • 8B • Updated 17 days ago • 883 • 1
mradermacher/Reflector-Internalizing-Safety-Llama-3.1-8B-RL-i1-GGUF Reinforcement Learning • 8B • Updated 17 days ago • 2.46k • 1
ValueFX9507/Tifa-Deepsex-14b-CoT-GGUF-Q4 Reinforcement Learning • 15B • Updated Feb 13, 2025 • 2.07k • 837