Skip to content

[Bug]: Console stats logging is incorrect when using api-server scaleout #21954

@njhill

Description

@njhill

StatsLoggers run in the front-end process. When there are multiple of these they will each be processing a subset of requests and thus have incomplete stats.

The prometheus stats logger aggregates the stats via it's shm-based multiprocessing mode, but the periodic console log that shows request queue lengths, throughput, and kv cache usage will happen independently in each front-end process and thus be incomplete/incorrect.

At a minimum we should disable this statslogger for api_server_count > 1.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions