SimPO: Simple Preference Optimization with a Reference-Free Reward
Paper • 2405.14734 • Published • 12
The kraken learns to refuse the obvious.
Cipher-SimPO-Merged is the anti-slop checkpoint of the Cipher series. Starting from the SFT base, this model is tuned with SimPO (Simple Preference Optimization, Meng et al. 2024) against preference pairs where the rejected response is a textbook Tailwind / lenis.stop() / copy-paste hero and the chosen response is the Awwwards-grade equivalent.
The result is a model that refuses to reach for the easy cliché.
Auroraventures/cipher-sft-mergedcdn.tailwindcss.com, lenis.stop(), <div class="container mx-auto"> boilerplate, Bootstrap defaultscipher-sft25-real-merged (v3, with real scraped SFT on top)from transformers import AutoTokenizer, AutoModelForCausalLM
import torch
tok = AutoTokenizer.from_pretrained("Auroraventures/cipher-simpo-merged")
m = AutoModelForCausalLM.from_pretrained(
"Auroraventures/cipher-simpo-merged",
torch_dtype=torch.bfloat16, device_map="auto",
)
msgs = [
{"role":"system","content":"You are Cipher, the Code Kraken. Emit complete single-file HTML."},
{"role":"user","content":"Build a minimal dark-mode hero with GSAP stagger entry — no frameworks."},
]
p = tok.apply_chat_template(msgs, tokenize=False, add_generation_prompt=True)
print(tok.decode(m.generate(**tok(p,return_tensors='pt').to(m.device), max_new_tokens=2048)[0]))
GGUF sibling: cipher-simpo-merged-Q4_K_M-GGUF.
| Reject | Prefer |
|---|---|
cdn.tailwindcss.com script tag |
Hand-rolled CSS tokens + @media |
lenis.stop() / lenis.start() misuse |
Native Lenis on page + GSAP ScrollTrigger integration |
container mx-auto / text-center text-white |
CSS custom properties + fluid typography |
| Bootstrap grid | CSS Grid + clamp() |
.fadeIn / .fade-up generic utility classes |
Bespoke keyframes named by intent |
cipher-sft-merged ──► [SimPO anti-slop] ──► cipher-simpo-merged (YOU) ──► cipher-sft25-real-merged
CC-BY-NC-4.0. Base model: Gemma-4 terms apply.
Built with 🦑 by Aurora Ventures.
Base model
Auroraventures/cipher-sft-merged