Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Chronos error on mutation ProcessRecurrence #9664

Closed
rafaelromcar-parabol opened this issue Apr 23, 2024 · 5 comments · Fixed by #9665 or #9670
Closed

Chronos error on mutation ProcessRecurrence #9664

rafaelromcar-parabol opened this issue Apr 23, 2024 · 5 comments · Fixed by #9665 or #9670
Assignees
Labels

Comments

@rafaelromcar-parabol
Copy link
Contributor

Issue - Bug

  • How serious is the bug? P1 / P2?
  • OS: macOS / Linux / Windows / ? our Docker images
  • Browser: Chrome / Safari / Lynx / ? -
  • Node version: node --version v20.11.0
  • NPM version: npm --version 10.2.4
  • RethinkDB version: rethinkdb --version

Chronos will throw the following error non-stop

Trace: SEND TO SENTRY TIMEOUT {"query":"\n        mutation ProcessRecurrence{\n          pro"} undefined                                                                                                           
    at sendToSentry (/home/node/parabol/dist/chronos.js:17369:11)                                                                                                                                                  
    at publishWebhookGQL (/home/node/parabol/dist/chronos.js:17340:28)

(See Datadog for that date. We only have 15 days of history in Datadog, and it has been happening for at least those 15 days.

Acceptance Criteria (optional)

Chronos does not throw the error non-stop. And I suppose recurrence works.

Triage is performed to find the root cause of the bug, timeboxed to ~1 hour.

Estimated effort: 1 hour to triage. More if root cause is already identified.

@Dschoordsch
Copy link
Contributor

If chronos cannot publish to the gql executor, it looks like it lost the connection to redis 🤔

@rafaelromcar-parabol
Copy link
Contributor Author

Ah! Not clear enough on the logs IMHO. I can restart Chronos, even if it has been restarted multiple times in the last two weeks and we are getting that error message 😿

That said, Chronos should retry connecting because Redis is up and running as the other components are using it. And the error message should state that it can't connect to Redis 🤔

@Dschoordsch
Copy link
Contributor

Might be something different then. I will check the logs.

@Dschoordsch Dschoordsch self-assigned this Apr 23, 2024
@Dschoordsch
Copy link
Contributor

I checked, it's not a real issue, just a wrong timeout. The logs 🔒 show that the mutation takes around 15s but the default query timeout is 10s. Nothing bad happens atm., but I think we wanted to use the 10m timeout instead, but lost it at some point.

@rafaelromcar-parabol
Copy link
Contributor Author

Good that it is working then 😸 Still, the error message should be fixed to help catching real errors then.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: Done
2 participants