Random traffic lock in 9.0.x #6383

@calavera

Description

I just realized that I never opened an issue about this problem, and we should look into it.

We deployed c4b7d10 to two canary boxes serving full production traffic and ran into an issue where TrafficServer stopped processing any transaction after some time. The time to failure was random: on the first box it first happened after 8 hours, and after a restart it happened again in less than 4 hours. The second box worked normally for about three days before it reached the same deadlock.

We noticed the problem because our graphs showed all TCP connections suddenly blocked:

Screenshot from 2020-01-24 08-52-25

Screenshot from 2020-01-24 09-51-55

@shinrich mentioned that she was investigating something similar.
