[Infra] SampleAnalyzerIntegrationTest randomly timeouting in CI #10036
Comments
Discussed offline: the allowed execution time can be extended to handle this. Please track it for a while to see whether this approach solves the problem. |
Across 9 runs in which this test failed, the failure occurred only in the following test cases. From the output logs attached inline, it looks like the build check analyzer hadn't finished within 30 seconds.
|
@JanKrivanek, can this fix help with the test performance? |
Only slightly - it cuts only about a third of the currently detected analyzer slowdown. |
Another occurrence (after the timeout increase): https://dev.azure.com/dnceng-public/public/_build/results?buildId=681320&view=ms.vss-test-web.build-test-results-tab&runId=16909114&resultId=100003&paneView=debug |
This might be a complete red herring: the cases that time out seem to use TerminalLogger (notice the control sequences):
Sample full log: TIMEOUT-Microsoft.Build.BuildCheck.UnitTests_net8.0_x64.log Whereas I do not see those appear in individual test cases for the success cases - sample log: OK-Microsoft.Build.BuildCheck.UnitTests_net8.0_x64.log Regardless of whether it is a red herring or not - it is strange that TerminalLogger is used in CI. I believe @AR-May was looking into something similar |
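One way to check many logs for the control sequences mentioned above, rather than eyeballing them, is a small scan for ANSI CSI escape sequences. This is only a sketch: the function names are hypothetical, and finding escape sequences in a log suggests, but does not prove, that TerminalLogger produced the output.

```python
import re

# CSI sequence per ECMA-48: ESC '[' + parameter bytes + intermediate bytes + final byte.
CSI_RE = re.compile(r"\x1b\[[0-?]*[ -/]*[@-~]")


def count_control_sequences(log_text: str) -> int:
    """Return how many CSI escape sequences appear in the log text."""
    return len(CSI_RE.findall(log_text))


def has_control_sequences(log_text: str) -> bool:
    """True if the log contains at least one CSI escape sequence."""
    return CSI_RE.search(log_text) is not None
```

Running `has_control_sequences` over the contents of the TIMEOUT and OK logs would show whether the control sequences really appear only in the timed-out runs.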
@JanKrivanek and @maridematte I can reproduce this issue intermittently with the command "dotnet test ./artifacts/bin/Microsoft.Build.BuildCheck.UnitTests/Debug/net8.0/Microsoft.Build.BuildCheck.UnitTests.dll --filter "FullyQualifiedName~SampleAnalyzerIntegrationTest"". I added -v:d to the process command to output more information and found that the build is stuck in the target GetCopyToPublishDirectoryItems. This is build log.txt. |
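Because the failure is intermittent, it can help to rerun the command in a loop until one run exceeds a time limit. A minimal sketch, assuming the hang shows up as the whole `dotnet test` process running too long; the helper name, the time limit, and the run count are hypothetical and not part of the repository's tooling.

```python
import subprocess
import sys


def first_timed_out_run(cmd, timeout_s, max_runs):
    """Run cmd up to max_runs times.

    Returns the 1-based index of the first run that exceeded timeout_s
    seconds, or None if every run finished in time.
    """
    for attempt in range(1, max_runs + 1):
        try:
            subprocess.run(
                cmd,
                timeout=timeout_s,
                stdout=subprocess.DEVNULL,
                stderr=subprocess.DEVNULL,
                check=False,  # a failing test run is not a hang; keep looping
            )
        except subprocess.TimeoutExpired:
            # subprocess.run kills the child before raising this exception.
            return attempt
    return None


# Example invocation with the command from this thread (limits are assumptions):
# first_timed_out_run(
#     ["dotnet", "test",
#      "./artifacts/bin/Microsoft.Build.BuildCheck.UnitTests/Debug/net8.0/"
#      "Microsoft.Build.BuildCheck.UnitTests.dll",
#      "--filter", "FullyQualifiedName~SampleAnalyzerIntegrationTest"],
#     timeout_s=300, max_runs=20)
```

The loop stops at the first hang so that the logs from that run are the most recent ones on disk.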
Perfect! Can you collect 2 more cases to see if there is any pattern? Also, apart from the diag log, it might be helpful to have a binlog as well. Other than that, I unfortunately do not have any idea right now about what could be causing this |
Here is the log with -v:diag: diaglog.txt The binlog is incomplete when the test fails: True.binlog.txt |
Yeah - the binlog being cut abruptly is fine (and expected, as the test is killed after the timeout) - the timeline from the binlog will be interesting - it should show where the time was spent. And it'd be nice to see 2 or 3 cases - to be able to compare whether the excessive time is spent in the same part of the build or whether it is random. Since it is Linux, we unfortunately cannot use ETW :-/ |
I compared the logs of several failed cases; it's always the same part of the build. Unfortunately, no timeline is shown in the binlog True.binlog.txt |
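The comparison across failed runs can be made mechanical by extracting the last target that started in each text log and tallying the results. This is a sketch under an assumption: it expects target-start lines of the form `Target "Name" in file ...` (optionally with a `: (TargetId:N)` suffix inside the quotes), which may not match every verbosity level or MSBuild version. The function names are hypothetical.

```python
import re
from collections import Counter

# Assumed shape of a target-start line in a detailed/diag text log:
#   Target "GetCopyToPublishDirectoryItems: (TargetId:42)" in file "..."
TARGET_START_RE = re.compile(r'Target "([^":]+)[^"]*" in file')


def last_started_target(log_text):
    """Return the name of the last target whose start line appears, or None."""
    names = TARGET_START_RE.findall(log_text)
    return names[-1] if names else None


def tally_last_targets(log_texts):
    """Count how often each target is the last one started across the logs."""
    return Counter(last_started_target(text) for text in log_texts)
```

If every failed log yields the same name, the hang is consistently in one part of the build; a spread of names would point to a random stall instead.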
From offline discussion Observations:
Recommendation on further steps:
As a side note - the issue happens during restore. So it might get resolved by skipping the restore - #9747. But it would still be very valuable to understand why it is happening |
Though this issue might be resolved by the fix above, here is some information from my attempts to debug the test running on Linux remotely from VS on Windows. I tried the following approaches, but they failed with different errors, and I was not able to debug the test running on Linux.
|
@GangWang01, thank you for the analysis. |
Fixed by #10353 |
Context
Sample run: https://dev.azure.com/dnceng-public/public/_build/results?buildId=648080&view=ms.vss-test-web.build-test-results-tab&runId=16013174&resultId=100001&paneView=debug