rdtand/Qwen3.6-35B-A3B-PrismaQuant-4.75bit-vllm at main

Qwen3.6-35B-A3B-PrismaQuant-4.75bit-vllm

22.9 GB

1 contributor

History: 8 commits

rdtand

README: document scale_sweep polish (closed-form AutoRound analog) + bake-off table

e347d86 verified 24 days ago

File | Size | Last commit | Updated
.gitattributes | 1.64 kB | Upload folder using huggingface_hub | 26 days ago
README.md | 10.9 kB | README: document scale_sweep polish (closed-form AutoRound analog) + bake-off table | 24 days ago
chat_template.jinja | 7.76 kB | Upload folder using huggingface_hub | 26 days ago
config.json | 51.2 kB | PrismaQuant final: visual NVFP4 (108/110 DP-placed), lm_head BF16 (vLLM runtime limit), pos_embed excluded | 24 days ago
configuration.json | 58 Bytes | Upload folder using huggingface_hub | 26 days ago
generation_config.json | 202 Bytes | Upload folder using huggingface_hub | 26 days ago
merges.txt | 3.35 MB | Upload folder using huggingface_hub | 26 days ago
mixed_native_manifest.json | 487 Bytes | PrismaQuant final: visual NVFP4 (108/110 DP-placed), lm_head BF16 (vLLM runtime limit), pos_embed excluded | 24 days ago
model-00001-of-00006.safetensors | 4.57 GB (xet) | PrismaQuant: switch polish from act_round (no-op that undid GPTQ) to closed-form scale_sweep; GPTQ + scale_sweep measured geomean out_mse 0.33 vs prior 0.99 | 24 days ago
model-00002-of-00006.safetensors | 4.58 GB (xet) | PrismaQuant: switch polish from act_round (no-op that undid GPTQ) to closed-form scale_sweep; GPTQ + scale_sweep measured geomean out_mse 0.33 vs prior 0.99 | 24 days ago
model-00003-of-00006.safetensors | 4.58 GB (xet) | PrismaQuant: switch polish from act_round (no-op that undid GPTQ) to closed-form scale_sweep; GPTQ + scale_sweep measured geomean out_mse 0.33 vs prior 0.99 | 24 days ago
model-00004-of-00006.safetensors | 4.58 GB (xet) | PrismaQuant: switch polish from act_round (no-op that undid GPTQ) to closed-form scale_sweep; GPTQ + scale_sweep measured geomean out_mse 0.33 vs prior 0.99 | 24 days ago
model-00005-of-00006.safetensors | 4.58 GB (xet) | PrismaQuant: switch polish from act_round (no-op that undid GPTQ) to closed-form scale_sweep; GPTQ + scale_sweep measured geomean out_mse 0.33 vs prior 0.99 | 24 days ago
model-00006-of-00006.safetensors | 2.11 MB (xet) | Upload folder using huggingface_hub | 26 days ago
model.safetensors.index.json | 14.6 MB (xet) | PrismaQuant final: visual NVFP4 (108/110 DP-placed), lm_head BF16 (vLLM runtime limit), pos_embed excluded | 24 days ago
preprocessor_config.json | 390 Bytes | Upload folder using huggingface_hub | 26 days ago
tokenizer.json | 12.8 MB (xet) | Upload folder using huggingface_hub | 26 days ago
tokenizer_config.json | 16.7 kB | Upload folder using huggingface_hub | 26 days ago
video_preprocessor_config.json | 385 Bytes | Upload folder using huggingface_hub | 26 days ago
vocab.json | 6.72 MB | Upload folder using huggingface_hub | 26 days ago