JanusCoder-8B-Nemotron-Claude-Opus-qx86-hi-mlx

Qwen3-8B-Element-qx86-hi-mlx

This model is a 1.4/0.6 nuslerp merge of:

unsloth/JanusCoder-8B
TeichAI/Nemotron-Cascade-8B-Thinking-Claude-4.5-Opus-High-Reasoning-Distill

The Nemotron-Cascade base is prone to looping, mainly for the lack of social skills: the addition of just Claude thinking traces without a body of evidence made the Element very smart, but unstable, even with Janus help.

Brainwaves

          arc   arc/e boolq hswag obkqa piqa  wino
qx86-hi   0.532,0.746,0.846,0.738,0.456,0.794,0.709

Janus     0.537,0.731,0.862,0.697,0.446,0.782,0.667
Element   0.532,0.746,0.846,0.738,0.456,0.794,0.709

Perplexity
qx86-hi   4.744 ± 0.036
qx64-hi   4.798 ± 0.036
mxfp4     5.012 ± 0.038

-G

Use with mlx

pip install mlx-lm

from mlx_lm import load, generate

model, tokenizer = load("JanusCoder-8B-Nemotron-Claude-Opus-qx86-hi-mlx")

prompt = "hello"

if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_dict=False,
    )

response = generate(model, tokenizer, prompt=prompt, verbose=True)