qwen3-6-distill-chat / requirements.txt
Upload requirements.txt with huggingface_hub
a804b53 verified
torch
transformers>=4.45
accelerate>=0.34
bitsandbytes>=0.45
spaces
sentencepiece
# Note: flash-linear-attention and causal-conv1d would enable the fast path
# for Qwen3.6's GatedDeltaNet layers (~5-10x faster), but the HF Spaces
# build container has no nvcc, so neither package can compile there. The
# pure-torch fallback is fast enough at typical inference sequence lengths,
# and the "fast path not available" log line at startup is benign.
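
The note above says the fast path depends on two optional packages that are left out of this file on purpose. A minimal sketch of how an app could check at startup whether they are present is shown below. It assumes the import names `fla` (for flash-linear-attention) and `causal_conv1d` (for causal-conv1d); the helper names `fast_path_status` and `fast_path_available` are hypothetical and not part of any library.

```python
import importlib.util

# Assumption: these are the import names of the optional packages.
# Adjust the mapping if the installed distributions expose other names.
OPTIONAL_FAST_PATH_MODULES = {
    "flash-linear-attention": "fla",
    "causal-conv1d": "causal_conv1d",
}


def fast_path_status(modules=OPTIONAL_FAST_PATH_MODULES):
    """Map each pip package name to True if its module can be located.

    find_spec only locates a top-level module; it does not import it,
    so no CUDA extension is loaded by this check.
    """
    return {
        pip_name: importlib.util.find_spec(module_name) is not None
        for pip_name, module_name in modules.items()
    }


def fast_path_available(status):
    """The fast path needs every optional package to be present."""
    return all(status.values())


if __name__ == "__main__":
    status = fast_path_status()
    for pip_name, found in status.items():
        print(f"{pip_name}: {'found' in str(found) or found}")
    if not fast_path_available(status):
        # Matches the note above: falling back to torch is expected here.
        print("Optional fast-path packages missing; using the torch fallback.")
```

On a Space built from this requirements file, both entries would be expected to report as missing, which is consistent with the benign startup log line described in the note.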