6.00 bpw exl3 quants. 8 head bits.

Model's Card:

Mistral v3 Tekken or Metharme.

Can think via <thinking> or <think>

Just like Roci X but better.

(Model card still a WIP)

FP16: https://huggingface.co/TheDrummer/Rocinante-XL-16B-v1 GGUF: https://huggingface.co/TheDrummer/Rocinante-XL-16B-v1-GGUF

Downloads last month
51
Safetensors
Model size
7B params
Tensor type
BF16
·
F16
·
I16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for dr-housemd/TheDrummer-Rocinante-XL-16B-v1-6.00bpw-exl3

Quantized
(10)
this model

Collection including dr-housemd/TheDrummer-Rocinante-XL-16B-v1-6.00bpw-exl3