# Huihui-Qwen3.5-27B-abliterated-MLX-4bit
This repository contains an MLX-optimized 4-bit quantized conversion of
huihui-ai/Huihui-Qwen3.5-27B-abliterated for Apple Silicon inference.
The model was converted for mlx-lm text generation workflows.
## Model details
- Base model: huihui-ai/Huihui-Qwen3.5-27B-abliterated
- Architecture family: Qwen3.5
- Format: MLX safetensors shards
- Quantization: affine 4-bit, group size 64
- Observed average precision: ~4.501 bits/weight
- Approximate on-disk size: ~12 GB
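The ~4.5 bits/weight figure follows from the group-wise overhead of affine quantization: each group of 64 weights stores a 4-bit code per weight plus one scale and one bias for the group. A back-of-the-envelope check, assuming 16-bit scales and biases (the small remainder above 4.5 likely comes from tensors kept at higher precision):

```python
# Back-of-the-envelope bits/weight for affine 4-bit quantization,
# group size 64, assuming one 16-bit scale and one 16-bit bias per group.
bits_per_weight = 4
group_size = 64
group_overhead_bits = 16 + 16  # scale + bias shared by each group of 64 weights

effective = bits_per_weight + group_overhead_bits / group_size
print(effective)  # 4.5 -- close to the observed ~4.501 bits/weight
```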
## What is included
- model-00001-of-00003.safetensors
- model-00002-of-00003.safetensors
- model-00003-of-00003.safetensors
- model.safetensors.index.json
- config.json
- tokenizer.json
- tokenizer_config.json
- generation_config.json
- chat_template.jinja
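The index file maps every tensor name to the shard that holds it, following the standard safetensors index layout with a `weight_map` object. A minimal sketch for inspecting that mapping, assuming the files are in the current directory:

```python
import json
from collections import Counter

# Tally how many tensors live in each shard, using the standard
# safetensors index layout: {"metadata": {...}, "weight_map": {...}}.
with open("model.safetensors.index.json") as f:
    index = json.load(f)

per_shard = Counter(index["weight_map"].values())
for shard, count in sorted(per_shard.items()):
    print(f"{shard}: {count} tensors")
```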
## Usage with MLX
Install dependencies:
```bash
pip install -U mlx-lm
```
Quick generation:

```bash
mlx_lm.generate \
  --model dotwee/Huihui-Qwen3.5-27B-abliterated-MLX-4bit \
  --prompt "Write one short sentence about MLX." \
  --max-tokens 128
```
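The model can also be driven from Python through the mlx-lm API; a minimal sketch using its `load` and `generate` entry points, routing the prompt through the bundled chat template:

```python
from mlx_lm import load, generate

# Downloads the model on first use (or reuses the local cache).
model, tokenizer = load("dotwee/Huihui-Qwen3.5-27B-abliterated-MLX-4bit")

# Apply the bundled chat template before generating.
messages = [{"role": "user", "content": "Write one short sentence about MLX."}]
prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True)

text = generate(model, tokenizer, prompt=prompt, max_tokens=128)
print(text)
```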
Chat mode:

```bash
mlx_lm.chat --model dotwee/Huihui-Qwen3.5-27B-abliterated-MLX-4bit
```
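For multi-turn conversations from Python, the usual pattern is to keep the full message history and re-apply the chat template on every turn; a sketch built on the same `load`/`generate` calls as above:

```python
from mlx_lm import load, generate

model, tokenizer = load("dotwee/Huihui-Qwen3.5-27B-abliterated-MLX-4bit")

history = []  # accumulated chat turns, oldest first
while True:
    user = input("you> ")
    if not user:
        break
    history.append({"role": "user", "content": user})
    prompt = tokenizer.apply_chat_template(history, add_generation_prompt=True)
    reply = generate(model, tokenizer, prompt=prompt, max_tokens=512)
    history.append({"role": "assistant", "content": reply})
    print(reply)
```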
## Notes and limitations
- This artifact is intended for text generation with mlx-lm.
- The upstream checkpoint is described as uncensored/abliterated and may produce sensitive, controversial, or unsafe outputs.
- Use only in contexts where strict output review and moderation are possible.
- For production or public-facing deployments, add downstream safety controls.
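One way to wire in such a control is to gate every completion through a moderation hook before it reaches the user. A purely illustrative sketch: the `flagged` helper below is hypothetical and stands in for whatever moderation service or classifier you actually deploy:

```python
from mlx_lm import load, generate

def flagged(text: str) -> bool:
    """Hypothetical moderation hook -- swap in a real moderation
    service or classifier before any public-facing use."""
    blocklist = ("example-banned-term",)
    return any(term in text.lower() for term in blocklist)

model, tokenizer = load("dotwee/Huihui-Qwen3.5-27B-abliterated-MLX-4bit")

def reviewed_generate(user_prompt: str) -> str:
    messages = [{"role": "user", "content": user_prompt}]
    prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True)
    reply = generate(model, tokenizer, prompt=prompt, max_tokens=256)
    # Withhold rather than return unreviewed output.
    return "[withheld for review]" if flagged(reply) else reply
```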
## License
This release follows the upstream Qwen3.5 license and terms: Apache-2.0 (see the `license_link` field in the model metadata).
## Provenance
- Upstream model: huihui-ai/Huihui-Qwen3.5-27B-abliterated
- Original base model: Qwen/Qwen3.5-27B
- Converted with: mlx_lm convert
- Conversion settings: --quantize --q-bits 4 --q-group-size 64
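Reproducing the conversion should look roughly like this (the flags are the ones listed above; the output path is illustrative):

```bash
# Convert and quantize the upstream checkpoint for MLX.
# --mlx-path below is an assumed output directory, not part of this card.
mlx_lm.convert \
  --hf-path huihui-ai/Huihui-Qwen3.5-27B-abliterated \
  --mlx-path ./Huihui-Qwen3.5-27B-abliterated-MLX-4bit \
  --quantize --q-bits 4 --q-group-size 64
```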