vLLM 0.19.0 fails on 1*H100

#7
by JeffreySheng - opened

The model card states:
Software Integration:
Supported Runtime Engine(s):
vLLM

Supported Hardware Microarchitecture Compatibility:
NVIDIA Blackwell
...
Preferred Operating System(s):
Linux
Model Version(s):
The model version is v1.0 which NVFP4 quantized with nvidia-modelopt v0.42.0
...
Inference:
Engine: vLLM
Test Hardware: NVIDIA Hopper H100

I'm very confused: the model card lists NVIDIA Blackwell as the supported hardware, yet the inference evaluation was run on NVIDIA Hopper H100 with vLLM. That doesn't make sense.
I also tried running the model with vLLM on 1*H100, and it failed.
Is this a mistake in the official model card?
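
For anyone trying to reproduce this, here is a minimal sketch for checking which architecture family the GPU reports. It assumes PyTorch is installed. The helper name `arch_from_capability` is hypothetical, and the mapping (compute capability 9.x = Hopper, 10.x and 12.x = Blackwell) is my assumption from NVIDIA's public compute capability numbers, not something taken from the model card.

```python
def arch_from_capability(major: int, minor: int) -> str:
    """Map a CUDA compute capability to an architecture family name.

    Assumed mapping (not from the model card):
    9.x = Hopper, 10.x and 12.x = Blackwell.
    """
    if major == 9:
        return "Hopper"
    if major in (10, 12):
        return "Blackwell"
    # Anything else is reported by its sm_XY identifier.
    return f"other (sm_{major}{minor})"


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed; cannot query the GPU.")
    else:
        if torch.cuda.is_available():
            # Returns a (major, minor) tuple for the given device index.
            major, minor = torch.cuda.get_device_capability(0)
            name = torch.cuda.get_device_name(0)
            print(name, "->", arch_from_capability(major, minor))
        else:
            print("No CUDA device visible.")
```

On an H100 this should report the Hopper family, which is the mismatch with the "Supported Hardware Microarchitecture Compatibility" line quoted above.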