Description
Your current environment
The output of python collect_env.py (version line only):
vLLM API server version 0.11.1rc2.dev200+g250fb1b8e.d20251021
🐛 Describe the bug
When loading a NVFP4A16 quantized model we get a spurious warning
(Worker_TP0 pid=201) WARNING 10-24 09:50:14 [marlin_utils_fp4.py:137] Your GPU does not have native support for FP4 computation but FP4 quantization is being used. Weight-only FP4 compression will be used leveraging the Marlin kernel. This may degrade performance for compute-heavy workloads.
This is using 2x RTX Pro 6000, which do support FP4.
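For context, a minimal repro sketch of how the model is loaded; the checkpoint name is a placeholder for any NVFP4A16 (weight-only FP4) model, not the exact one used here:

from vllm import LLM

# Hypothetical NVFP4A16 checkpoint; any weight-only FP4 model triggers the warning.
llm = LLM(
    model="some-org/some-model-NVFP4A16",
    tensor_parallel_size=2,  # 2x RTX Pro 6000, as in this report
)
# The spurious marlin_utils_fp4.py warning is emitted while the weights are loaded.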
That warning comes from vllm/vllm/model_executor/layers/quantization/utils/marlin_utils_fp4.py, lines 136 to 142 (commit 3567816):
def prepare_fp4_layer_for_marlin(layer: torch.nn.Module) -> None:
    logger.warning_once(
        "Your GPU does not have native support for FP4 computation but "
        "FP4 quantization is being used. Weight-only FP4 compression will "
        "be used leveraging the Marlin kernel. This may degrade "
        "performance for compute-heavy workloads."
    )
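For reference, a minimal sketch (not the actual vLLM logic) of gating the warning on the device's compute capability; the SM threshold is an assumption that native FP4 tensor-core support starts with Blackwell-class GPUs:

import torch

def has_native_fp4() -> bool:
    # Assumption: SM 10.x (Blackwell datacenter) and SM 12.x (e.g. RTX Pro 6000)
    # expose native FP4 compute; older architectures do not.
    major, _minor = torch.cuda.get_device_capability()
    return major >= 10

def maybe_warn_fp4_fallback(logger) -> None:
    # Only warn when FP4 actually has to fall back to the Marlin weight-only path.
    if not has_native_fp4():
        logger.warning(
            "GPU lacks native FP4 support; weight-only FP4 compression via "
            "the Marlin kernel will be used."
        )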
I believe that warning was supposed to be removed when the emulation path for NVFP4A16 was removed in #18000. If so, I can submit a simple PR to remove it.
Otherwise, does it mean a new Marlin kernel is planned that avoids software-dequantizing NVFP4?
Before submitting a new issue...
- Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.