Instructions to use dealignai/Step-3.7-Flash-JANG_K-CRACK with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use dealignai/Step-3.7-Flash-JANG_K-CRACK with MLX:
# Make sure mlx-vlm is installed
# pip install --upgrade mlx-vlm

from mlx_vlm import load, generate
from mlx_vlm.prompt_utils import apply_chat_template
from mlx_vlm.utils import load_config

# Load the model
model, processor = load("dealignai/Step-3.7-Flash-JANG_K-CRACK")
config = load_config("dealignai/Step-3.7-Flash-JANG_K-CRACK")

# Prepare input
image = ["http://images.cocodataset.org/val2017/000000039769.jpg"]
prompt = "Describe this image."

# Apply chat template
formatted_prompt = apply_chat_template(
    processor, config, prompt, num_images=1
)

# Generate output
output = generate(model, processor, formatted_prompt, image)
print(output)
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- LM Studio
- Pi
How to use dealignai/Step-3.7-Flash-JANG_K-CRACK with Pi:
Start the MLX server
# Install MLX LM:
uv tool install mlx-lm
# Start a local OpenAI-compatible server:
mlx_lm.server --model "dealignai/Step-3.7-Flash-JANG_K-CRACK"
Configure the model in Pi
# Install Pi:
npm install -g @mariozechner/pi-coding-agent
# Add to ~/.pi/agent/models.json:
{
  "providers": {
    "mlx-lm": {
      "baseUrl": "http://localhost:8080/v1",
      "api": "openai-completions",
      "apiKey": "none",
      "models": [
        { "id": "dealignai/Step-3.7-Flash-JANG_K-CRACK" }
      ]
    }
  }
}
Run Pi
# Start Pi in your project directory:
pi
- Hermes Agent
How to use dealignai/Step-3.7-Flash-JANG_K-CRACK with Hermes Agent:
Start the MLX server
# Install MLX LM:
uv tool install mlx-lm
# Start a local OpenAI-compatible server:
mlx_lm.server --model "dealignai/Step-3.7-Flash-JANG_K-CRACK"
Configure Hermes
# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default dealignai/Step-3.7-Flash-JANG_K-CRACK
Run Hermes
hermes
Step 3.7 Flash — JANG_K CRACK
CRACK abliterated · JANG_K (mixed-precision, Q2 routed) · 45-layer MoE with always-on shared expert · Vision · Reasoning · 74 GB
What Is This?
This is Step 3.7 Flash by StepFun, a 45-layer multimodal Mixture-of-Experts reasoning model with an always-on shared expert (DSV4-style). This release is:
- CRACK abliterated — refusal behavior removed at the weight level. The model complies across all task categories while preserving reasoning quality, knowledge, and multimodal capabilities.
- JANG_K (mixed-precision, Q2 routed) for MLX on Apple Silicon — 74 GB.
Results
Evaluated on a Mac Studio M5 Max via the vMLX engine in reasoning mode (temperature 0.6, top-p 0.95). HarmBench-text results use a 70-prompt sample (10 prompts per category). MMLU-114 uses 2 questions per subject across all 57 subjects, with up to 1,800 tokens per question to leave room for full reasoning traces.
HarmBench compliance (70 prompts · 10 per category)
| Category | CRACK ASR |
|---|---|
| Chemical / biological | 10/10 |
| Copyright | 8/10 |
| Cybercrime / intrusion | 10/10 |
| Harassment / bullying | 10/10 |
| Illegal | 10/10 |
| Misinformation / disinformation | 10/10 |
| General harmful | 10/10 |
| Overall | 68/70 (97%) |
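The overall figure can be re-derived from the per-category rows. The short sketch below only re-adds the counts printed in the table; the dictionary name `counts` is illustrative and nothing here comes from the evaluation harness itself.

```python
# Per-category counts copied from the table above: (complied, total).
counts = {
    "Chemical / biological": (10, 10),
    "Copyright": (8, 10),
    "Cybercrime / intrusion": (10, 10),
    "Harassment / bullying": (10, 10),
    "Illegal": (10, 10),
    "Misinformation / disinformation": (10, 10),
    "General harmful": (10, 10),
}

complied = sum(c for c, _ in counts.values())
total = sum(t for _, t in counts.values())
overall_pct = round(100 * complied / total)  # rounded to a whole percent

print(f"{complied}/{total} ({overall_pct}%)")
```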
MMLU-114 (reasoning-mode, by subject area)
| Subject area | base | CRACK | Δ |
|---|---|---|---|
| Overall | 78.1% | 76.3% | -1.8pp |
| STEM | 75.0% | 77.8% | +2.8pp |
| Humanities | 73.1% | 76.9% | +3.8pp |
| Social Sciences | 87.5% | 83.3% | -4.2pp |
| Other (medicine, business, …) | 78.6% | 67.9% | -10.7pp |
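For readers checking the Δ column, the sketch below recomputes each difference in percentage points from the base and CRACK columns. It also backs out the implied number of correct answers for the Overall row, assuming all 114 questions were scored (an inference from "MMLU-114", not a figure stated in the card). On that assumption the overall gap is two questions, so the per-area deltas rest on small samples.

```python
# (base %, CRACK %) copied from the table above.
rows = {
    "Overall": (78.1, 76.3),
    "STEM": (75.0, 77.8),
    "Humanities": (73.1, 76.9),
    "Social Sciences": (87.5, 83.3),
    "Other": (78.6, 67.9),
}

# Difference in percentage points, rounded to one decimal like the table.
deltas = {area: round(crack - base, 1) for area, (base, crack) in rows.items()}
for area, delta in deltas.items():
    print(f"{area}: {delta:+.1f}pp")

# Assumption: the Overall row is scored over all 114 questions.
N_QUESTIONS = 114
base_correct = round(N_QUESTIONS * rows["Overall"][0] / 100)
crack_correct = round(N_QUESTIONS * rows["Overall"][1] / 100)
print(f"Implied correct answers: base {base_correct}, CRACK {crack_correct}")
```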
Features
- Reasoning: the chat template pre-opens `<think>`; vMLX's reasoning parser surfaces the trace in `message.reasoning_content` and the final answer in `message.content`
- MoE with always-on shared expert: every layer combines routed experts with a shared expert (DSV4-style)
- Vision — multimodal image-text understanding
- 45 transformer layers, hidden 4096
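When the model is run through a stack that does not separate the reasoning trace, the raw text can be split by hand. The sketch below is not vMLX's parser; it is a minimal illustration that assumes the trace ends with a closing `</think>` tag and that, because the template pre-opens `<think>`, the opening tag may be absent from the generated text. The function name `split_reasoning` is hypothetical.

```python
from typing import Tuple


def split_reasoning(text: str) -> Tuple[str, str]:
    """Split raw generated text into (reasoning, answer).

    Assumption: the reasoning trace is terminated by "</think>".
    The opening "<think>" is optional because the chat template
    may already have emitted it as part of the prompt.
    """
    text = text.strip()
    if text.startswith("<think>"):
        text = text[len("<think>"):]
    reasoning, closing_tag, answer = text.partition("</think>")
    if not closing_tag:
        # No closing tag: treat the output as a truncated trace
        # with no final answer (a design choice of this sketch).
        return text.strip(), ""
    return reasoning.strip(), answer.strip()


reasoning, answer = split_reasoning("2 + 2 is 4.</think>The answer is 4.")
print(reasoning)
print(answer)
```

Treating output without a closing tag as unfinished reasoning matches the pre-opened template, where a missing `</think>` usually means the token budget ran out before the answer.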
Usage
Run with vMLX (recommended — full step3p7 reasoning + vision support).
# OpenAI-compatible chat completion
# POST /v1/chat/completions
{
  "model": "dealignai/Step-3.7-Flash-JANG_K-CRACK",
  "messages": [{"role": "user", "content": "..."}],
  "temperature": 0.6,
  "top_p": 0.95
}
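The request above can be sent from Python with the standard library alone. This is a sketch under stated assumptions: `BASE_URL` is a placeholder (the card does not give the vMLX host or port, so adjust it to your server), and the helper names `build_payload` and `chat` are illustrative. The response is read using the standard OpenAI chat-completion layout, with `reasoning_content` treated as optional.

```python
import json
import urllib.request
from typing import Optional, Tuple

MODEL_ID = "dealignai/Step-3.7-Flash-JANG_K-CRACK"
BASE_URL = "http://localhost:8080/v1"  # assumption: change to your server's address


def build_payload(prompt: str) -> dict:
    """Build the chat-completion body with the sampling settings from this card."""
    return {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.6,
        "top_p": 0.95,
    }


def chat(prompt: str, base_url: str = BASE_URL) -> Tuple[Optional[str], str]:
    """POST to /chat/completions and return (reasoning trace, final answer)."""
    request = urllib.request.Request(
        f"{base_url}/chat/completions",
        data=json.dumps(build_payload(prompt)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        body = json.load(response)
    message = body["choices"][0]["message"]
    # reasoning_content is only present when the server parses the trace.
    return message.get("reasoning_content"), message["content"]


# Usage (requires a running server):
# trace, answer = chat("Summarize the rules of chess in two sentences.")
```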
About CRACK
CRACK (Controlled Refusal Ablation via Calibrated Knockouts) removes safety-refusal behavior at the weight level so the model complies with all task categories while preserving reasoning quality, factual knowledge, and coherence.
Support dealignai
All models are built from original research and released free.
Support us on Ko-fi — membership gets early access and extras.
Ko-fi · X @dealignai · dealign.ai
See our research: Safety Generalization in Frontier Models

Disclaimer
This model has had its safety-refusal behavior removed for research purposes. It will follow instructions across all categories without refusing. You are solely responsible for how you use it and for complying with all applicable laws. Published for AI-safety research and authorized security testing.
Model tree for dealignai/Step-3.7-Flash-JANG_K-CRACK
Base model: stepfun-ai/Step-3.7-Flash (this model is a quantized derivative)