intermittent 404 Not Found on /pulls github calls #1019
Comments
This is happening to us too. It always happens when opening a PR, but then we comment
We are having this issue as well.
Are you guys implementing a retry/backoff mechanism to handle eventual consistency?
Looks like we need to add a retry.
FWIW we hadn't seen this for a while and started seeing it again yesterday (or the day before?). I'm sure it's a GitHub problem, not an Atlantis problem, but Atlantis probably needs to work around it.
We have been using Atlantis for ~1 week and we just saw this problem for the first time, in a project with ~30 terraform files and on a PR with only 1 changed file. We are using Atlantis v0.14.0.
Yeah totally, and it shouldn't be too hard to throw some retries in there.
We just started seeing messages like this on PRs. I wonder if more retries are needed, or whether exponential backoff should be implemented? Mostly commenting just to see if others who happen to stop by here are having the same issue.
We have been getting this more often as well.
After trying the version with the fix, we stopped seeing this.
We're running with the fix implemented in #1131 and have still seen this issue occur, relatively often within the past week (presumably due to GitHub performance), so it seems like it might be worth implementing a different retry strategy such as exponential backoff, as suggested in that PR.
* Improve github pull request call retries: retry with a fixed 1 second backoff up to 3 retries was added by #1131 to address #1019, but the issue continued to show up (#1453). Increase max attempts to 5 and use exponential backoff for a maximum total retry time of (2^n - n - 1) seconds, which is roughly 30 seconds for the current max attempts n = 5. Also move the sleep to the top of the loop so that we never sleep without sending the request again on the last iteration.
* Fix style with gofmt -s
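The strategy that commit message describes roughly corresponds to the loop sketched below. This is an illustrative Go sketch, not the actual Atlantis source: the go-github import version, package name, and function name are assumptions. Sleeping (2^i - 1) seconds at the top of iteration i gives a worst-case total wait of 2^n - n - 1 seconds and never sleeps after the final attempt.

```go
// Illustrative sketch of the retry strategy described in the commit message
// above: up to 5 attempts, sleeping (2^i - 1) seconds at the top of iteration
// i, so there is never a sleep after the last request. Import version and
// function name are assumptions, not the actual Atlantis source.
package vcs

import (
	"context"
	"math"
	"net/http"
	"time"

	"github.com/google/go-github/v31/github"
)

const maxAttempts = 5

func getPullRequestWithRetry(ctx context.Context, client *github.Client, owner, repo string, num int) (*github.PullRequest, error) {
	var pull *github.PullRequest
	var err error
	for i := 0; i < maxAttempts; i++ {
		// Sleep before every attempt except the first (2^0 - 1 = 0 seconds).
		time.Sleep(time.Duration(math.Pow(2, float64(i))-1) * time.Second)

		pull, _, err = client.PullRequests.Get(ctx, owner, repo, num)
		if err == nil {
			return pull, nil
		}
		// Only the intermittent 404s are worth retrying; anything else fails fast.
		ghErr, ok := err.(*github.ErrorResponse)
		if !ok || ghErr.Response == nil || ghErr.Response.StatusCode != http.StatusNotFound {
			return nil, err
		}
	}
	return nil, err
}
```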
We are still observing this issue fairly consistently on new pull requests with autoplan enabled, in 0.17.5.
This is a follow-on to resolve similar issues to runatlantis#1019. In runatlantis#1131 retries were added to GetPullRequest, and in runatlantis#1810 a backoff was included. However, those only resolve one potential request at the very beginning of a PR creation. The other request that happens early on during auto-plan is one to ListFiles to detect the modified files. This too can sometimes result in a 404 due to async updates on the GitHub side.
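As a rough sketch of that follow-on, the same 404 retry/backoff can wrap the modified-files listing. This continues the illustrative package above (same imports and maxAttempts) and assumes go-github's ListFiles signature; names are illustrative, not the actual fix.

```go
// Continuation of the sketch above: apply the same 404 retry/backoff to the
// call that lists a PR's changed files, which is also issued right after PR
// creation during autoplan and can hit the same eventual-consistency window.
func listFilesWithRetry(ctx context.Context, client *github.Client, owner, repo string, num int) ([]*github.CommitFile, error) {
	var files []*github.CommitFile
	var err error
	for i := 0; i < maxAttempts; i++ {
		time.Sleep(time.Duration(math.Pow(2, float64(i))-1) * time.Second)

		// A single page is requested here for brevity; a real implementation
		// would paginate through all pages of changed files.
		files, _, err = client.PullRequests.ListFiles(ctx, owner, repo, num, &github.ListOptions{PerPage: 100})
		if err == nil {
			return files, nil
		}
		ghErr, ok := err.(*github.ErrorResponse)
		if !ok || ghErr.Response == nil || ghErr.Response.StatusCode != http.StatusNotFound {
			return nil, err
		}
	}
	return nil, err
}
```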
Same on version 0.23.
Hi, sometimes when Atlantis is triggered on a PR in GitHub, Atlantis posts the following error onto the PR:
Looking at GitHub's API docs, that per_page=300 seems okay.
We can replan and it works, i.e., it appears to be intermittent. Looking in the Atlantis logs, I see the following (I've removed the timestamps and redacted IPs/private info):
We have a couple of theories but haven't been able to reproduce. First, it has only happened since we updated to v0.12.0, the current release (we also added --hide-prev-plan-comments and --disable-markdown-folding at this time). Second, it may happen with a largish number of directories, though generally our changed dirs are under 50 and changed files under 100.
The third theory is that it might happen when two unrelated repos are processing at the same time. That can be seen here; I've left the timestamps so you can see the overlap: "myrepo" is the same as above, and "REPO2" is the other repo that is planning.