Failing integration test: SendReceiveAfterProjectReset #710

Closed
myieye opened this issue Apr 9, 2024 · 11 comments · Fixed by #716 or #715
Labels: bug (Something isn't working), testing

Comments

@myieye
Contributor

myieye commented Apr 9, 2024

For the test SendReceiveAfterProjectReset:

All 6 executions (2 protocols x 3 jobs) failed after/during the last deployment to staging.

For the deployment before that the results were 5 failures / 8 test executions.

I think we should try to see why it's failing and stabilize it.

It might be worth noting that when querying changesets we explicitly return an empty list if the repo is less than 40 seconds old. We found that we had to wait longer than 20 seconds, even though the hgweb refreshinterval was set to 20 seconds at the time. See Project.GetChangesets().
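The age guard described above can be illustrated with a minimal sketch. This is not the project's actual Project.GetChangesets() code (which is .NET); it is a Python illustration under stated assumptions: the names `MIN_REPO_AGE`, `changesets_or_empty`, `repo_created_at`, and `fetch_changesets` are all hypothetical, and only the 40-second threshold comes from the comment.

```python
from datetime import datetime, timedelta, timezone

# Threshold taken from the comment above; the constant name is hypothetical.
MIN_REPO_AGE = timedelta(seconds=40)


def changesets_or_empty(repo_created_at, fetch_changesets, now=None):
    """Return an empty list for a repo younger than MIN_REPO_AGE.

    hgweb may still be serving stale data for a freshly created or reset
    repo, so the query is skipped entirely until the repo is old enough.
    `fetch_changesets` is a hypothetical callable that performs the real query.
    """
    if now is None:
        now = datetime.now(timezone.utc)
    if now - repo_created_at < MIN_REPO_AGE:
        return []
    return fetch_changesets()
```

Passing `now` explicitly keeps the guard testable without sleeping; a caller in production would simply omit it.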

myieye added the bug and testing labels Apr 9, 2024
@rmunn
Contributor

rmunn commented Apr 10, 2024

Currently working on #712 (PR for #709) so I have a chance at figuring this out; without Playwright traces, it's a little hard. (BTW, #712 is ready for review; at this point any tweaks I make to it are going to be minor).

@myieye
Contributor Author

myieye commented Apr 10, 2024

@rmunn Are you sure Playwright traces are relevant for this issue? SendReceiveAfterProjectReset is a .NET Send/Receive integration test.

@rmunn
Contributor

rmunn commented Apr 11, 2024

#709 was marked high priority, so I looked into it first and didn't actually read this one carefully. Yes, the Playwright traces are irrelevant to .NET tests. I simply wrote that comment in haste, without realizing that this issue was about a .NET integration test.

Anyway, #716 doesn't actually modify the SendReceiveAfterProjectReset test yet, but it does make it possible to run it correctly with Mercurial 6, which may or may not actually fix this issue; I'll find out once I can finally get an S/R run. (The self-hosted runner has been flaky; sometimes it stops responding during an apt install step, or the setup-pnpm step... It's been a long process of pushing the retry button and hoping that this time I get a run that makes it as far as the test I need to see. Oh, and the develop hgweb server returning 503 errors didn't help much either...)

@rmunn
Contributor

rmunn commented Apr 11, 2024

More 5xx errors, this time 502 Bad Gateway. Still haven't gotten a clean run of the SendReceiveAfterProjectReset test, and I've been trying all morning. I'll keep trying, but at some point I'm going to give up on this and try to go do something actually productive.
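For readers hitting the same transient 502/503 responses, one common mitigation is to retry with exponential backoff instead of re-running the whole workflow by hand. The sketch below is an illustration only and is not something the thread says was implemented; `call_with_retry`, `TRANSIENT_STATUSES`, and the `request` callable are hypothetical names, and the attempt count and delays are arbitrary assumptions.

```python
import time

# Status codes mentioned in the thread as transient server-side failures.
TRANSIENT_STATUSES = {502, 503}


def call_with_retry(request, attempts=5, base_delay=1.0, sleep=time.sleep):
    """Call `request()` until it returns a non-transient status.

    `request` is a hypothetical callable returning an HTTP status code.
    The delay doubles after each transient failure (1s, 2s, 4s, ...).
    If every attempt fails, the last transient status is returned.
    """
    status = None
    for attempt in range(attempts):
        status = request()
        if status not in TRANSIENT_STATUSES:
            return status
        if attempt < attempts - 1:
            sleep(base_delay * 2 ** attempt)
    return status
```

The `sleep` parameter is injectable so the backoff schedule can be checked without actually waiting.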

@rmunn
Contributor

rmunn commented Apr 11, 2024

And the latest test run had "The active test run was aborted. Reason: Test host process crashed." Funny, though, it reported 0 failures and 48 successes when that happened. Weird.

@rmunn
Contributor

rmunn commented Apr 11, 2024

STILL getting "Test host process crashed". We might be running into https://www.github.com/microsoft/vstest/issues/2952 — if so, there's no way I'm going to be able to fix this. It's frustrating, though, because I can't quite trust that the 48 test successes are real.

@rmunn
Contributor

rmunn commented Apr 11, 2024

I thought commit 06c027a would fix the "Test host process crashed" errors. I just noticed the compilation warning about not using async void methods, which I had used in the "project codes may not start with a hyphen" test I wrote a couple weeks ago. Changed that test's return type to async Task and ran the integration test workflow again, but it's still crashing. Still 48 tests succeeding, though, so I'm hopeful that I've at least made SendReceiveAfterProjectReset start passing. But I can't prove it.

@hahn-kev
Collaborator

@rmunn are you running this test from your machine or the self-hosted runner? I'd run it from your machine, as we want to stabilize the test itself. Anything specific to the self-hosted runner we also want to fix, but it would be better to isolate the issues in order to solve them.

@rmunn
Contributor

rmunn commented Apr 12, 2024

Running it on my machine passes every time, so I don't know what else to do to stabilize it. So I'm trying to run it on the self-hosted runner, because that's where it's unstable. But I've had so many issues with the runner that I'm starting to feel like the past couple days have been a complete waste of time.

@myieye
Contributor Author

myieye commented Apr 16, 2024

It's been refactored a bit and hg runner timeouts have been increased. Recent runs have been smooth.

@myieye myieye closed this as completed Apr 16, 2024
This was linked to pull requests Apr 29, 2024
@rmunn
Contributor

rmunn commented Apr 29, 2024

Probably fixed by a combination of #715 and #716, I'm guessing.

3 participants