majentik
/

gemma-4-E2B-RotorQuant-MLX-4bit

Image-Text-to-Text

kv-cache-quantization

4-bit precision

Model card Files Files and versions

gemma-4-E2B-RotorQuant-MLX-4bit

3.61 GB

Ctrl+K

Ctrl+K

1 contributor

History: 3 commits

majentik's picture

chore(card): enrich YAML frontmatter (pipeline_tag, language, library_name, inference)

e90d6f1 verified 8 days ago

.gitattributes

1.57 kB
Add MLX quantized model with KV cache compression 11 days ago
README.md

4.02 kB
chore(card): enrich YAML frontmatter (pipeline_tag, language, library_name, inference) 8 days ago
config.json

5.96 kB
Add MLX quantized model with KV cache compression 11 days ago
generation_config.json

181 Bytes
Add MLX quantized model with KV cache compression 11 days ago
model.safetensors

3.58 GB
xet

Add MLX quantized model with KV cache compression 11 days ago
model.safetensors.index.json

230 kB
Add MLX quantized model with KV cache compression 11 days ago
processor_config.json

902 Bytes
Add MLX quantized model with KV cache compression 11 days ago
tokenizer.json

32.2 MB
xet

Add MLX quantized model with KV cache compression 11 days ago
tokenizer_config.json

1.5 kB
Add MLX quantized model with KV cache compression 11 days ago