Spaces:

lordx64
/

qwen3-6-distill-chat

Runtime error

Upload requirements.txt with huggingface_hub

a804b53 verified 18 days ago

400 Bytes

	torch
	transformers>=4.45
	accelerate>=0.34
	bitsandbytes>=0.45
	spaces
	sentencepiece
	# Note: flash-linear-attention + causal-conv1d would make Qwen3.6's
	# GatedDeltaNet layers take a fast path (~5-10x faster), but HF Spaces'
	# build container has no nvcc so they can't compile. The torch fallback
	# is plenty fast at inference seq lengths; the "fast path not available"
	# log line at startup is benign.