caiovicentino1/Qwen3.5-9B-Claude-Opus-HLWQ-Q5
Text Generation
Transformers
Safetensors
4 languages
qwen3_5
image-text-to-text
hlwq
quantized
compressed-tensors
int4
marlin
vllm
conversational
8-bit precision
arxiv:2502.02617
arxiv:2603.29078
License: apache-2.0
Branch: main, 7.67 GB
1 contributor
History: 22 commits
Latest commit: "Remove legacy polar_config.json" by caiovicentino1 (c8867a6, verified), 3 days ago
File | Size | Scan | Storage | Last commit | Updated
.gitattributes | 1.57 kB | Safe | | PolarQuant Q5 unified (PPL 6.54, 7.1GB, 43 tok/s) | 15 days ago
README.md | 4.03 kB | Safe | | HLWQ rebrand: title, tags, notice, self-links | 3 days ago
config.json | 3.73 kB | Safe | | fix: use base model config for vLLM --language-model-only compatibility | 9 days ago
hlwq_config.json | 49.1 kB | | | Add hlwq_config.json (rename from polar_config.json) | 3 days ago
kv_context.png | 45.8 kB | Safe | | Upload kv_context.png with huggingface_hub | 15 days ago
model.safetensors | 7.65 GB | Safe | xet | fix: unpack lm_head/embed_tokens/bad-dim layers to BF16 (Marlin compat) | 9 days ago
ppl_comparison.png | 51.3 kB | Safe | | Upload ppl_comparison.png with huggingface_hub | 15 days ago
processor_config.json | 1.3 kB | Safe | | PolarQuant Q5 unified (PPL 6.54, 7.1GB, 43 tok/s) | 15 days ago
speed_vram.png | 57.6 kB | Safe | | Upload speed_vram.png with huggingface_hub | 15 days ago
tokenizer.json | 20 MB | Safe | xet | PolarQuant Q5 unified (PPL 6.54, 7.1GB, 43 tok/s) | 15 days ago
tokenizer_config.json | 5.4 kB | Safe | | fix: rename weight keys for vLLM Qwen3.5 compatibility (model.X -> model.language_model.X), fix quant_method and tokenizer_class | 10 days ago