These are NOT actual AWQ-quantized models.
#1
by cai-cai - opened
Heads up! Despite the "AWQ" tag in the title, the config.json reveals these models are using standard compressed-tensors (W4A16) rather than the AWQ (Activation-aware Weight Quantization) method. Real AWQ requires an activation calibration process and specific scaling factors, which are missing here. This is misleading for users looking for actual AWQ kernels.
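As a quick check (a minimal sketch; the field values shown are illustrative, not copied from this repo), you can read `quantization_config.quant_method` from the model's config.json: genuine AWQ checkpoints report `"awq"` there, while these report `"compressed-tensors"`.

```python
import json

# Illustrative excerpt of a config.json quantization section
# (real configs contain many more fields).
config_text = """
{
  "model_type": "llama",
  "quantization_config": {
    "quant_method": "compressed-tensors",
    "format": "pack-quantized",
    "config_groups": {
      "group_0": {
        "weights": {"num_bits": 4, "type": "int"},
        "input_activations": null
      }
    }
  }
}
"""

config = json.loads(config_text)
method = config.get("quantization_config", {}).get("quant_method", "unknown")

# A real AWQ checkpoint would print "awq" here; these models print
# "compressed-tensors" despite the "AWQ" in the repo name.
print(method)  # -> compressed-tensors
```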
If you need the AWQ quant, you can easily find it with the AWQ tag. Is there any reason to prefer the original AWQ quant?
This guy is posting the same comment on every AWQ quant, lol. So annoying!