It will follow the instruct template, but it won't work with chat completions. YMMV, purely experimental.
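As a rough illustration of the note above, here is a minimal sketch of formatting the prompt by hand and sending it as a plain text completion instead of a chat `messages` list. The template string and the `build_completion_payload` helper are hypothetical placeholders, not this model's actual template; substitute whatever instruct template you use. The payload uses the OpenAI-style text-completions fields `prompt` and `max_tokens`; whether your server accepts them is an assumption to check.

```python
import json

# Hypothetical placeholder template: replace with the instruct template you actually use.
INSTRUCT_TEMPLATE = "### Instruction:\n{instruction}\n\n### Response:\n"


def build_completion_payload(instruction: str, max_tokens: int = 256) -> dict:
    """Format the prompt manually and wrap it in a text-completion request body."""
    prompt = INSTRUCT_TEMPLATE.format(instruction=instruction)
    # A raw completion sends one "prompt" string; there is no chat "messages" list,
    # so no server-side chat template is applied.
    return {"prompt": prompt, "max_tokens": max_tokens}


payload = build_completion_payload("Summarize what a GGUF file is.")
print(json.dumps(payload, indent=2))
```

The point of the sketch is only that the template is applied client-side; results with a base model remain experimental either way.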

Format: GGUF
Model size: 121B params
Architecture: nemotron_h_moe



Model tree for coughmedicine/NVIDIA-Nemotron-3-Super-120B-A12B-Base-GGUF

Quantized (2): this model