Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DataStorm/reliability hang (Ubuntu release CI) #3462

Open
bernardnormier opened this issue Jan 31, 2025 · 2 comments
Open

DataStorm/reliability hang (Ubuntu release CI) #3462

bernardnormier opened this issue Jan 31, 2025 · 2 comments
Assignees
Milestone

Comments

@bernardnormier
Copy link
Member

From:
https://github.com/zeroc-ice/ice/actions/runs/13063024722/job/36450166492?pr=3458

ok
process /home/runner/work/ice/ice/cpp/test/DataStorm/reliability/build/x86_64-linux-gnu/shared/writer --Ice.Default.Host=127.0.0.1 --Test.BasePort=14200 --Ice.Warn.Connections=1 --Ice.Default.Protocol=tcp --Ice.IPv6=0 --Ice.PrintStackTraces=1 --DataStorm.Node.Multicast.Enabled=0 --DataStorm.Node.Server.Enabled=0 --DataStorm.Node.ConnectTo="tcp -p 14211" --Ice.Connection.Server.IdleTimeout=0 --Ice.Connection.Client.IdleTimeout=0 --DataStorm.Node.Name=writer-app --DataStorm.Trace.Topic=1 --DataStorm.Trace.Session=3 --DataStorm.Trace.Data=2 --Ice.Trace.Protocol=1 --Ice.LogFile=/home/runner/work/ice/ice/cpp/test/DataStorm/reliability/client-013125-0005.log pid=80273 is hanging - 01/31/25 00:06:54
process /home/runner/work/ice/ice/cpp/test/DataStorm/reliability/build/x86_64-linux-gnu/shared/writer --Ice.Default.Host=127.0.0.1 --Test.BasePort=14200 --Ice.Warn.Connections=1 --Ice.Default.Protocol=tcp --Ice.IPv6=0 --Ice.PrintStackTraces=1 --DataStorm.Node.Multicast.Enabled=0 --DataStorm.Node.Server.Enabled=0 --DataStorm.Node.ConnectTo="tcp -p 14211" --Ice.Connection.Server.IdleTimeout=0 --Ice.Connection.Client.IdleTimeout=0 --DataStorm.Node.Name=writer-app --DataStorm.Trace.Topic=1 --DataStorm.Trace.Session=3 --DataStorm.Trace.Data=2 --Ice.Trace.Protocol=1 --Ice.LogFile=/home/runner/work/ice/ice/cpp/test/DataStorm/reliability/client-013125-0005.log pid=80273 is hanging - 01/31/25 00:07:24
process /home/runner/work/ice/ice/cpp/test/DataStorm/reliability/build/x86_64-linux-gnu/shared/writer --Ice.Default.Host=127.0.0.1 --Test.BasePort=14200 --Ice.Warn.Connections=1 --Ice.Default.Protocol=tcp --Ice.IPv6=0 --Ice.PrintStackTraces=1 --DataStorm.Node.Multicast.Enabled=0 --DataStorm.Node.Server.Enabled=0 --DataStorm.Node.ConnectTo="tcp -p 14211" --Ice.Connection.Server.IdleTimeout=0 --Ice.Connection.Client.IdleTimeout=0 --DataStorm.Node.Name=writer-app --DataStorm.Trace.Topic=1 --DataStorm.Trace.Session=3 --DataStorm.Trace.Data=2 --Ice.Trace.Protocol=1 --Ice.LogFile=/home/runner/work/ice/ice/cpp/test/DataStorm/reliability/client-013125-0005.log pid=80273 is hanging - 01/31/25 00:07:54
process /home/runner/work/ice/ice/cpp/test/DataStorm/reliability/build/x86_64-linux-gnu/shared/writer --Ice.Default.Host=127.0.0.1 --Test.BasePort=14200 --Ice.Warn.Connections=1 --Ice.Default.Protocol=tcp --Ice.IPv6=0 --Ice.PrintStackTraces=1 --DataStorm.Node.Multicast.Enabled=0 --DataStorm.Node.Server.Enabled=0 --DataStorm.Node.ConnectTo="tcp -p 14211" --Ice.Connection.Server.IdleTimeout=0 --Ice.Connection.Client.IdleTimeout=0 --DataStorm.Node.Name=writer-app --DataStorm.Trace.Topic=1 --DataStorm.Trace.Session=3 --DataStorm.Trace.Data=2 --Ice.Trace.Protocol=1 --Ice.LogFile=/home/runner/work/ice/ice/cpp/test/DataStorm/reliability/client-013125-0005.log pid=80273 is hanging - 01/31/25 00:08:24
process /home/runner/work/ice/ice/cpp/test/DataStorm/reliability/build/x86_64-linux-gnu/shared/writer --Ice.Default.Host=127.0.0.1 --Test.BasePort=14200 --Ice.Warn.Connections=1 --Ice.Default.Protocol=tcp --Ice.IPv6=0 --Ice.PrintStackTraces=1 --DataStorm.Node.Multicast.Enabled=0 --DataStorm.Node.Server.Enabled=0 --DataStorm.Node.ConnectTo="tcp -p 14211" --Ice.Connection.Server.IdleTimeout=0 --Ice.Connection.Client.IdleTimeout=0 --DataStorm.Node.Name=writer-app --DataStorm.Trace.Topic=1 --DataStorm.Trace.Session=3 --DataStorm.Trace.Data=2 --Ice.Trace.Protocol=1 --Ice.LogFile=/home/runner/work/ice/ice/cpp/test/DataStorm/reliability/client-013125-0005.log pid=80273 is hanging - 01/31/25 00:08:54
process /home/runner/work/ice/ice/cpp/test/DataStorm/reliability/build/x86_64-linux-gnu/shared/writer --Ice.Default.Host=127.0.0.1 --Test.BasePort=14200 --Ice.Warn.Connections=1 --Ice.Default.Protocol=tcp --Ice.IPv6=0 --Ice.PrintStackTraces=1 --DataStorm.Node.Multicast.Enabled=0 --DataStorm.Node.Server.Enabled=0 --DataStorm.Node.ConnectTo="tcp -p 14211" --Ice.Connection.Server.IdleTimeout=0 --Ice.Connection.Client.IdleTimeout=0 --DataStorm.Node.Name=writer-app --DataStorm.Trace.Topic=1 --DataStorm.Trace.Session=3 --DataStorm.Trace.Data=2 --Ice.Trace.Protocol=1 --Ice.LogFile=/home/runner/work/ice/ice/cpp/test/DataStorm/reliability/client-013125-0005.log pid=80273 is hanging - 01/31/25 00:09:24
process /home/runner/work/ice/ice/cpp/test/DataStorm/reliability/build/x86_64-linux-gnu/shared/writer --Ice.Default.Host=127.0.0.1 --Test.BasePort=14200 --Ice.Warn.Connections=1 --Ice.Default.Protocol=tcp --Ice.IPv6=0 --Ice.PrintStackTraces=1 --DataStorm.Node.Multicast.Enabled=0 --DataStorm.Node.Server.Enabled=0 --DataStorm.Node.ConnectTo="tcp -p 14211" --Ice.Connection.Server.IdleTimeout=0 --Ice.Connection.Client.IdleTimeout=0 --DataStorm.Node.Name=writer-app --DataStorm.Trace.Topic=1 --DataStorm.Trace.Session=3 --DataStorm.Trace.Data=2 --Ice.Trace.Protocol=1 --Ice.LogFile=/home/runner/work/ice/ice/cpp/test/DataStorm/reliability/client-013125-0005.log pid=80273 is hanging - 01/31/25 00:09:54
process /home/runner/work/ice/ice/cpp/test/DataStorm/reliability/build/x86_64-linux-gnu/shared/writer --Ice.Default.Host=127.0.0.1 --Test.BasePort=14200 --Ice.Warn.Connections=1 --Ice.Default.Protocol=tcp --Ice.IPv6=0 --Ice.PrintStackTraces=1 --DataStorm.Node.Multicast.Enabled=0 --DataStorm.Node.Server.Enabled=0 --DataStorm.Node.ConnectTo="tcp -p 14211" --Ice.Connection.Server.IdleTimeout=0 --Ice.Connection.Client.IdleTimeout=0 --DataStorm.Node.Name=writer-app --DataStorm.Trace.Topic=1 --DataStorm.Trace.Session=3 --DataStorm.Trace.Data=2 --Ice.Trace.Protocol=1 --Ice.LogFile=/home/runner/work/ice/ice/cpp/test/DataStorm/reliability/client-013125-0005.log pid=80273 is hanging - 01/31/25 00:10:24
process /home/runner/work/ice/ice/cpp/test/DataStorm/reliability/build/x86_64-linux-gnu/shared/writer --Ice.Default.Host=127.0.0.1 --Test.BasePort=14200 --Ice.Warn.Connections=1 --Ice.Default.Protocol=tcp --Ice.IPv6=0 --Ice.PrintStackTraces=1 --DataStorm.Node.Multicast.Enabled=0 --DataStorm.Node.Server.Enabled=0 --DataStorm.Node.ConnectTo="tcp -p 14211" --Ice.Connection.Server.IdleTimeout=0 --Ice.Connection.Client.IdleTimeout=0 --DataStorm.Node.Name=writer-app --DataStorm.Trace.Topic=1 --DataStorm.Trace.Session=3 --DataStorm.Trace.Data=2 --Ice.Trace.Protocol=1 --Ice.LogFile=/home/runner/work/ice/ice/cpp/test/DataStorm/reliability/client-013125-0005.log pid=80273 is hanging - 01/31/25 00:10:54
process /home/runner/work/ice/ice/cpp/test/DataStorm/reliability/build/x86_64-linux-gnu/shared/writer --Ice.Default.Host=127.0.0.1 --Test.BasePort=14200 --Ice.Warn.Connections=1 --Ice.Default.Protocol=tcp --Ice.IPv6=0 --Ice.PrintStackTraces=1 --DataStorm.Node.Multicast.Enabled=0 --DataStorm.Node.Server.Enabled=0 --DataStorm.Node.ConnectTo="tcp -p 14211" --Ice.Connection.Server.IdleTimeout=0 --Ice.Connection.Client.IdleTimeout=0 --DataStorm.Node.Name=writer-app --DataStorm.Trace.Topic=1 --DataStorm.Trace.Session=3 --DataStorm.Trace.Data=2 --Ice.Trace.Protocol=1 --Ice.LogFile=/home/runner/work/ice/ice/cpp/test/DataStorm/reliability/client-013125-0005.log pid=80273 is hanging - 01/31/25 00:11:24
...


@bernardnormier bernardnormier added this to the 3.8.0 milestone Jan 31, 2025
@pepone pepone self-assigned this Jan 31, 2025
@pepone
Copy link
Member

pepone commented Feb 4, 2025

From the client log attached to the CI job. The last messages in the log are:

-- 01/31/25 00:05:54.597 /home/runner/work/ice/ice/cpp/test/DataStorm/reliability/build/x86_64-linux-gnu/shared/writer: Session: s/2: session 'pf/1-reader-app' disconnected:
   local address = 127.0.0.1:43304
   remote address = 127.0.0.1:14212
   
-- 01/31/25 00:05:54.597 /home/runner/work/ice/ice/cpp/test/DataStorm/reliability/build/x86_64-linux-gnu/shared/writer: Session: s/2: unsubscribed topic '1' from topic '1:int'
-- 01/31/25 00:05:54.597 /home/runner/work/ice/ice/cpp/test/DataStorm/reliability/build/x86_64-linux-gnu/shared/writer: Session: s/2: trying to reconnect session with 'reader-app -t -e 1.1'

No clear why the reconnect attempt isn't made, wonder if there is a deadlock preventing it.

@pepone
Copy link
Member

pepone commented Feb 4, 2025

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants