
Conversation

@chaunceyjiang
Collaborator

@chaunceyjiang chaunceyjiang commented Aug 5, 2025

Essential Elements of an Effective PR Description Checklist

  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing a test command.
  • The test results, such as pasting a before-and-after comparison or e2e results.
  • (Optional) Any necessary documentation updates, such as updating supported_models.md and examples for a new model.

Fixes #21954

Purpose

Disable the console stats logger when api_server_count is greater than 1, since per-API-server stats would be incomplete. Example run showing the new warning:
vllm serve /home/jovyan/qwen3-8b  --data-parallel-size 2 --data-parallel-rpc-port 25555 --data-parallel-address 127.0.0.1 --api-server-count 2
INFO 09-05 01:29:36 [__init__.py:241] Automatically detected platform cuda.
INFO 09-05 01:29:41 [api_server.py:1894] vLLM API server version 0.10.2.dev403+g14b4326b9
INFO 09-05 01:29:41 [utils.py:328] non-default args: {'model_tag': '/home/jovyan/qwen3-8b', 'api_server_count': 2, 'model': '/home/jovyan/qwen3-8b', 'data_parallel_size': 2, 'data_parallel_address': '127.0.0.1', 'data_parallel_rpc_port': 25555, 'mm_processor_cache_gb': 0}
INFO 09-05 01:29:51 [__init__.py:748] Resolved architecture: Qwen3ForCausalLM
`torch_dtype` is deprecated! Use `dtype` instead!
....
(ApiServer_1 pid=21224) WARNING 09-05 01:30:13 [async_llm.py:108] AsyncLLM created with api_server_count more than 1; disabling stats logging to avoid incomplete stats.
(ApiServer_0 pid=21221) INFO 09-05 01:30:13 [scheduler.py:222] Chunked prefill is enabled with max_num_batched_tokens=8192.
(ApiServer_0 pid=21221) WARNING 09-05 01:30:13 [async_llm.py:108] AsyncLLM created with api_server_count more than 1; disabling stats logging to avoid incomplete stats.

Test Result

(Optional) Documentation Update

@chaunceyjiang chaunceyjiang requested a review from aarnphm as a code owner August 5, 2025 03:07
@mergify mergify bot added the frontend label Aug 5, 2025
@DarkLight1337 DarkLight1337 requested a review from njhill August 5, 2025 03:07
Contributor

@gemini-code-assist gemini-code-assist bot left a comment


Code Review

This pull request introduces a bugfix to disable the statistics logger when api_server_count is greater than 1, as this configuration is not compatible. The change correctly saves the original state of the disable_log_stats argument, forces it to True when multiple API servers are used, and issues a warning to the user if the feature was previously enabled. The implementation is consistent with how other incompatible features are handled in this scenario. The code is correct and effectively addresses the issue.
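The pattern the review describes can be sketched as follows. This is a simplified, hypothetical stand-in for vLLM's actual argument handling (the `Args` container and function name are assumptions, not the real API):

```python
import logging
from dataclasses import dataclass

logger = logging.getLogger("sketch")


@dataclass
class Args:
    """Hypothetical stand-in for the parsed server arguments."""
    api_server_count: int = 1
    disable_log_stats: bool = False


def apply_api_server_scaleout_overrides(args: Args) -> bool:
    """Save the original disable_log_stats value, then force it to True
    when multiple API servers are used, warning if stats logging was
    previously enabled. Returns the original value."""
    orig_disable_log_stats = args.disable_log_stats
    if args.api_server_count > 1 and not args.disable_log_stats:
        logger.warning(
            "AsyncLLM created with api_server_count more than 1; "
            "disabling stats logging to avoid incomplete stats.")
        args.disable_log_stats = True
    return orig_disable_log_stats
```

With a single API server the arguments pass through unchanged; with two or more, stats logging is forced off and the original setting is returned so the caller can report it.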

@chaunceyjiang
Collaborator Author

Hi, @njhill I have a question: when disable_log_stats is set to true, both LoggingStatLogger and PrometheusStatLogger are disabled. Is this the expected behavior?

# Metric Logging.
if self.log_stats:
    if stat_loggers is not None:
        self.stat_loggers = stat_loggers
    else:
        # Lazy import for prometheus multiprocessing.
        # We need to set PROMETHEUS_MULTIPROC_DIR environment variable
        # before prometheus_client is imported.
        # See https://prometheus.github.io/client_python/multiprocess/
        from vllm.engine.metrics import (LoggingStatLogger,
                                         PrometheusStatLogger)
        self.stat_loggers = {
            "logging":
            LoggingStatLogger(
                local_interval=_LOCAL_LOGGING_INTERVAL_SEC,
                vllm_config=vllm_config),
            "prometheus":
            PrometheusStatLogger(
                local_interval=_LOCAL_LOGGING_INTERVAL_SEC,
                labels=dict(
                    model_name=self.model_config.served_model_name),
                vllm_config=vllm_config),
        }
        self.stat_loggers["prometheus"].info("cache_config",
                                             self.cache_config)

@github-actions

github-actions bot commented Aug 5, 2025

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, covering a small and essential subset of tests to catch errors quickly. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

@njhill
Member

njhill commented Aug 5, 2025

Hi, @njhill I have a question: when disable_log_stats is set to true, both LoggingStatLogger and PrometheusStatLogger are disabled. Is this the expected behavior?

@chaunceyjiang yes that's expected. Here we only want to omit LoggingStatLogger.

The API here is kind of crappy in general: if you provide custom stats loggers, they will replace the built-in ones rather than augment them. There's a separate discussion / PR proposal around that, I think.

@chaunceyjiang
Collaborator Author

/cc @njhill PTAL.

@njhill
Member

njhill commented Sep 4, 2025

@chaunceyjiang I wonder if you could rebase this now that #20952 is merged?

The behaviour now is that custom stats loggers will augment the built-in ones unless disable_log_stats=True.
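The post-#20952 behaviour described above can be sketched like this. The function name and the stand-in logger classes are assumptions for illustration; the real factories live in vLLM's metrics code:

```python
from typing import Callable, List, Optional


# Hypothetical stand-ins for vLLM's built-in stat logger factories.
class LoggingStatLogger: ...
class PrometheusStatLogger: ...


def resolve_stat_logger_factories(
        custom_factories: Optional[List[Callable]] = None,
        disable_log_stats: bool = False) -> List[Callable]:
    """Custom loggers are added on top of the defaults rather than
    replacing them, unless stats logging is disabled entirely."""
    factories: List[Callable] = []
    if not disable_log_stats:
        # Built-in loggers are included by default.
        factories.extend([LoggingStatLogger, PrometheusStatLogger])
    # Custom loggers augment (not replace) the built-ins.
    factories.extend(custom_factories or [])
    return factories
```

So with `disable_log_stats=True`, only the custom loggers (if any) remain; otherwise custom loggers run alongside the defaults.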

@chaunceyjiang chaunceyjiang force-pushed the disable_log_stat branch 5 times, most recently from a42c8a7 to 1529366 Compare September 5, 2025 01:20
…han 1

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
@chaunceyjiang
Collaborator Author

@njhill PTAL.

Member

@njhill njhill left a comment


Thanks @chaunceyjiang.

Actually we still want the Prometheus metrics in this case, and I think this change will disable those too.

I think the check should go here instead:

if enable_default_loggers and logger.isEnabledFor(logging.INFO):
    factories.append(LoggingStatLogger)
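The suggestion can be sketched as follows: keep the Prometheus metrics in all cases, but only add the console LoggingStatLogger when a single API server is running. The function name, the stand-in classes, and the way api_server_count is plumbed in are assumptions for illustration:

```python
import logging

logger = logging.getLogger("vllm.sketch")
logger.setLevel(logging.INFO)  # mirror a logger configured at INFO level


# Hypothetical stand-ins for vLLM's built-in stat logger factories.
class LoggingStatLogger: ...
class PrometheusStatLogger: ...


def default_logger_factories(enable_default_loggers: bool,
                             api_server_count: int) -> list:
    """Gate only the console logger on api_server_count; Prometheus
    metrics stay enabled regardless of API server scale-out."""
    factories = []
    if (enable_default_loggers and logger.isEnabledFor(logging.INFO)
            and api_server_count == 1):
        factories.append(LoggingStatLogger)
    factories.append(PrometheusStatLogger)
    return factories
```

Placing the check at factory-selection time means scale-out drops only the incomplete per-server console stats, not the metrics endpoint.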

…han 1

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
…han 1

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
…han 1

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
@chaunceyjiang chaunceyjiang requested a review from njhill September 8, 2025 09:54
Member

@njhill njhill left a comment


@njhill njhill added the ready ONLY add when PR is ready to merge/full CI is needed label Sep 8, 2025
@njhill
Member

njhill commented Sep 8, 2025

Remaining test failure is also occurring on main.

@simon-mo simon-mo merged commit e680723 into vllm-project:main Sep 8, 2025
44 of 46 checks passed
eicherseiji pushed a commit to eicherseiji/vllm that referenced this pull request Sep 9, 2025
…han 1 (vllm-project#22227)

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
Co-authored-by: Nick Hill <nhill@redhat.com>
skyloevil pushed a commit to skyloevil/vllm that referenced this pull request Sep 13, 2025
…han 1 (vllm-project#22227)

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
Co-authored-by: Nick Hill <nhill@redhat.com>
FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025
…han 1 (vllm-project#22227)

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
Co-authored-by: Nick Hill <nhill@redhat.com>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025
…han 1 (vllm-project#22227)

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
Co-authored-by: Nick Hill <nhill@redhat.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025
…han 1 (vllm-project#22227)

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
Co-authored-by: Nick Hill <nhill@redhat.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>

Labels

frontend, ready (ONLY add when PR is ready to merge/full CI is needed), v1


Development

Successfully merging this pull request may close these issues.

[Bug]: Console stats logging is incorrect when using api-server scaleout

3 participants