This model will follow the instruct template, but it won't work with chat completions. Purely experimental; your mileage may vary.
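One way to read the note above: since chat completions are not expected to work, the instruct template can be applied by hand and the formatted text sent to a plain text-completion endpoint. The sketch below assumes a llama.cpp-style server listening at `http://localhost:8080/completion`. The turn markers are placeholders, not the model's real template, and `build_prompt`, `build_request`, and `complete` are hypothetical helper names.

```python
import json
import urllib.request

# Placeholder markers: NOT the model's real template. Replace them with the
# instruct template the model actually expects.
USER_PREFIX = "<USER>\n"
ASSISTANT_PREFIX = "<ASSISTANT>\n"
TURN_END = "\n"


def build_prompt(user_message: str) -> str:
    """Format one user turn by hand, leaving the assistant turn open."""
    return f"{USER_PREFIX}{user_message}{TURN_END}{ASSISTANT_PREFIX}"


def build_request(user_message: str, max_tokens: int = 256) -> dict:
    """Build a JSON payload for a raw (non-chat) completion endpoint."""
    return {"prompt": build_prompt(user_message), "n_predict": max_tokens}


def complete(user_message: str,
             url: str = "http://localhost:8080/completion") -> str:
    """POST the payload to the server (URL is an assumption) and return the text."""
    data = json.dumps(build_request(user_message)).encode("utf-8")
    req = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["content"]
```

Calling `complete("Summarize GGUF in one sentence.")` needs a running server. `build_prompt` and `build_request` can be inspected offline to check the formatted text before sending anything.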

Format: GGUF
Model size: 121B params
Architecture: nemotron_h_moe

Model tree for coughmedicine/NVIDIA-Nemotron-3-Super-120B-A12B-Base-GGUF: this model is one of 2 quantized versions of the base model.