Vocaela-500M-GGUF

This repo contains GGUF-format weights for Vocaela-500M:

  • Language model (LLM): Q8_0
  • Vision encoder (mmproj): Q8_0

Note: llama.cpp currently has problems rendering this model's chat template correctly. As a workaround, apply the chat template yourself (e.g., in Python or Node.js) before calling the llama-server endpoint. For usage examples, see the vocaela-500m-demo repo.
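The workaround above can be sketched as follows. This is a minimal illustration, not code from the demo repo: the ChatML-style template here is an assumption (substitute the model's actual chat template), and the endpoint URL assumes a llama-server instance running locally on the default port.

```python
# Sketch: apply the chat template in Python, then send the pre-rendered
# prompt to llama-server's /completion endpoint instead of /v1/chat/completions.
import json
import urllib.request

def apply_chat_template(messages):
    """Render chat messages into a single prompt string.

    Assumes a ChatML-style template; replace with the model's real template.
    """
    parts = []
    for m in messages:
        parts.append(f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n")
    parts.append("<|im_start|>assistant\n")  # generation prompt
    return "".join(parts)

def complete(messages, url="http://localhost:8080/completion"):
    """POST the pre-templated prompt to a running llama-server instance."""
    payload = {"prompt": apply_chat_template(messages), "n_predict": 128}
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["content"]
```

The same idea works from Node.js: render the template string first, then POST it as `prompt` so llama-server never applies its own (broken) template rendering.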

GGUF

  • Model size: 0.4B params
  • Architecture: llama
