Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
kshitijthakkar
/
qwen3.5-moe-4.7B-d4B
like
0
Image-Text-to-Text
Transformers
Safetensors
qwen3_5_moe
qwen3.5
Mixture of Experts
weight-transfer
hybrid-attention
conversational
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
qwen3.5-moe-4.7B-d4B
10.4 GB
Ctrl+K
Ctrl+K
1 contributor
History:
7 commits
kshitijthakkar
set tie_word_embeddings=False for GGUF/ollama compatibility
7e2c048
verified
about 1 month ago
.gitattributes
Safe
1.57 kB
Add Qwen3.5 MoE 4.54B (dense-to-MoE from Qwen/Qwen3.5-4B)
about 1 month ago
LICENSE
Safe
11.5 kB
Add Qwen3.5 MoE 4.54B (dense-to-MoE from Qwen/Qwen3.5-4B)
about 1 month ago
README.md
2.02 kB
fix README: add pipeline_tag and image-text-to-text tag
about 1 month ago
chat_template.jinja
Safe
7.76 kB
Add Qwen3.5 MoE 4.54B (dense-to-MoE from Qwen/Qwen3.5-4B)
about 1 month ago
config.json
2.73 kB
set tie_word_embeddings=False for GGUF/ollama compatibility
about 1 month ago
merges.txt
Safe
3.35 MB
Add Qwen3.5 MoE 4.54B (dense-to-MoE from Qwen/Qwen3.5-4B)
about 1 month ago
model-00001-of-00002.safetensors
5.35 GB
xet
Add Qwen3.5 MoE 4.54B (dense-to-MoE from Qwen/Qwen3.5-4B)
about 1 month ago
model-00002-of-00002.safetensors
5 GB
xet
Add Qwen3.5 MoE 4.54B (dense-to-MoE from Qwen/Qwen3.5-4B)
about 1 month ago
model.safetensors.index.json
80.9 kB
Add Qwen3.5 MoE 4.54B (dense-to-MoE from Qwen/Qwen3.5-4B)
about 1 month ago
preprocessor_config.json
Safe
390 Bytes
Add Qwen3.5 MoE 4.54B (dense-to-MoE from Qwen/Qwen3.5-4B)
about 1 month ago
tokenizer.json
Safe
12.8 MB
xet
Add Qwen3.5 MoE 4.54B (dense-to-MoE from Qwen/Qwen3.5-4B)
about 1 month ago
tokenizer_config.json
Safe
16.7 kB
Add Qwen3.5 MoE 4.54B (dense-to-MoE from Qwen/Qwen3.5-4B)
about 1 month ago
video_preprocessor_config.json
Safe
385 Bytes
Add Qwen3.5 MoE 4.54B (dense-to-MoE from Qwen/Qwen3.5-4B)
about 1 month ago
vocab.json
Safe
6.72 MB
Add Qwen3.5 MoE 4.54B (dense-to-MoE from Qwen/Qwen3.5-4B)
about 1 month ago