GGUF quantizations of nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16.

Downloads last month
900
GGUF
Model size
4B params
Architecture
nemotron_h
Hardware compatibility
Log In to add your hardware

8-bit

32-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ddh0/NVIDIA-Nemotron-3-Nano-4B-GGUF