rdtand commited on
Commit
b245dda
·
verified ·
1 Parent(s): 1d78f3d

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -54,7 +54,7 @@ The speedup comes from eliminating ~5 GB of BF16 weight loads per token for the
54
  Requires vLLM >= 0.19.1 with PR #38423 (W4A4 SM120/SM121 support) and FlashInfer >= 0.6.7.
55
 
56
  ```bash
57
- vllm serve RobTand/Qwen3.5-27B-NVFP4-Full \
58
  --trust-remote-code \
59
  --kv-cache-dtype fp8 \
60
  --attention-backend flashinfer \
 
54
  Requires vLLM >= 0.19.1 with PR #38423 (W4A4 SM120/SM121 support) and FlashInfer >= 0.6.7.
55
 
56
  ```bash
57
+ vllm serve rdtand/Qwen3.5-27B-NVFP4-DeltaNet-Included \
58
  --trust-remote-code \
59
  --kv-cache-dtype fp8 \
60
  --attention-backend flashinfer \