GGUF
English
Japanese
conversational
How to use from
Lemonade
Pull the model
# Download Lemonade from https://lemonade-server.ai/
lemonade pull tatsuyaaaaaaa/NVIDIA-Nemotron-Nano-9B-v2-gguf:
Run and chat with the model
lemonade run user.NVIDIA-Nemotron-Nano-9B-v2-gguf-
List all available models
lemonade list
Quick Links

NVIDIAのNVIDIA-Nemotron-Nano-9B-v2をgguf変換したものです。

imatrix量子化時にはTFMC/imatrix-dataset-for-japanese-llmのデータセットを用いています。

Downloads last month
53
GGUF
Model size
9B params
Architecture
nemotron_h
Hardware compatibility
Log In to add your hardware

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for tatsuyaaaaaaa/NVIDIA-Nemotron-Nano-9B-v2-gguf

Dataset used to train tatsuyaaaaaaa/NVIDIA-Nemotron-Nano-9B-v2-gguf

Collection including tatsuyaaaaaaa/NVIDIA-Nemotron-Nano-9B-v2-gguf