
[BUG] 429 too many requests #1583

Closed
rawpixel-vincent opened this issue Nov 18, 2021 · 22 comments

@rawpixel-vincent

rawpixel-vincent commented Nov 18, 2021

Describe the bug
We're using the AWS-hosted OpenSearch service. Starting about 10 days ago, we began getting 429 Too Many Requests responses from the Elasticsearch API (from what I can tell, only from the search endpoint). This happened even though we haven't seen any increase in the number of requests; we have been working since then to reduce the number of requests. The search request queue is steady at 0, with occasional peaks around 10 or 20.

Expected behavior
Why did we start getting these 429s, which look like an API rate limit, when the request count didn't increase from our usual workload and all the critical metrics are green (as before)?

Plugins
none

Host/Environment (please complete the following information):
ECS / Fargate / Elasticsearch hosted by AWS / Graviton-powered containers
Latest supported Elasticsearch version, requesting with the latest compatible Elasticsearch Node.js client

Additional context

{"name":"ResponseError","meta":{"body":"429 Too Many Requests /****/_search","statusCode":429,"headers":{"date":"Thu, 18 Nov 2021 18:35:30 GMT","content-type":"text/plain;charset=ISO-8859-1","content-length":"54","connection":"keep-alive","server":"Jetty(8.1.12.v20130726)"},"meta":{"context":null,"request":{"params":{"method":"POST","path":"/***/_search","body":{"type":"Buffer","data":[***]},"querystring":"size=100&from=0&_source=id about 10 fields","headers":{"user-agent":"elasticsearch-js/7.10.0 (linux 4.14.248-189.473.amzn2.x86_64-x64; Node.js v16.13.0)","accept-encoding":"gzip,deflate","content-type":"application/json","content-encoding":"gzip","content-length":"294"},"timeout":30000},"options":{},"id":5379},"name":"elasticsearch-js","connection":{"url":"https://***/","id":"https://***/","headers":{},"deadCount":0,"resurrectTimeout":0,"_openRequests":0,"status":"alive","roles":{"master":true,"data":true,"ingest":true,"ml":false}},"attempts":0,"aborted":false}}}

[Screenshots attached: Screen Shot 2564-11-19 at 01 43 22, 01 43 27, 01 43 33, and 01 43 40 (https://user-images.githubusercontent.com/22284209/142477316-e24a8a44-1e6f-4a08-95f0-65abc4c4a3e1.png)]

rawpixel-vincent added the bug (Something isn't working) and untriaged labels on Nov 18, 2021
@rawpixel-vincent
Author

rawpixel-vincent commented Nov 18, 2021

I'm sorry there are no steps to reproduce, but I'm facing this issue with no obvious cause, so hopefully someone who knows what is going on can give me some insight.

@will3942

We are seeing the same thing, with the same graphs and no change in the number of requests, on a cluster with 1 x m6g.large.search node.

[Screenshot: 2021-11-19 at 08 40 25]

@radove

radove commented Nov 22, 2021

We experienced the same issue. I ended up increasing the hardware specs for now, which reduced the problem. I wish I didn't have to, as we're trying to be budget-friendly. AWS talks about it at this link: https://aws.amazon.com/premiumsupport/knowledge-center/opensearch-resolve-429-error/
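
Independent of the root cause discussed in this thread, a common client-side mitigation for 429 throttling responses is to retry with exponential backoff rather than failing the request immediately. The sketch below is illustrative only: the endpoint, index name, helper name, and backoff values are placeholder assumptions, written against the 7.x elasticsearch-js client mentioned in the report rather than taken from any code in this issue.

    // Illustrative sketch: retry throttled (429) search requests with exponential backoff.
    // The node URL and index name are placeholders, not values from this issue.
    import { Client } from '@elastic/elasticsearch';

    const client = new Client({ node: 'https://example-domain.region.es.amazonaws.com' });

    async function searchWithBackoff(params: any, maxAttempts = 5): Promise<any> {
      for (let attempt = 1; attempt <= maxAttempts; attempt++) {
        try {
          const { body } = await client.search(params);
          return body;
        } catch (err: any) {
          const status = err?.meta?.statusCode ?? err?.statusCode;
          // only retry throttling responses; rethrow anything else, or give up after the last attempt
          if (status !== 429 || attempt === maxAttempts) throw err;
          const delayMs = Math.min(30_000, 500 * 2 ** attempt) + Math.random() * 250;
          await new Promise((resolve) => setTimeout(resolve, delayMs));
        }
      }
    }

    // usage, mirroring the failing request shape from the report:
    // searchWithBackoff({ index: 'my-index', size: 100, from: 0, body: { query: { match_all: {} } } });

Retries only paper over throttling, of course; if the 429s come from sustained JVM memory pressure on the data nodes, the cluster-side fixes discussed below (larger instances, a different instance family, or more heap) are still needed.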

@rawpixel-vincent
Author

Something changed on the OpenSearch side; with the same metrics and instances, we never had this issue before.

@Poojita-Raj
Contributor

Looking into this.

@Poojita-Raj
Contributor

@rawpixel-vincent Hi, could you please state which version of OpenSearch you're using - 1.0, 1.1 or 1.2?

@rawpixel-vincent
Author

Hi, @Poojita-Raj,
thank you for looking into this,
as stated in the description of the issue we use:

Latest supported Elasticsearch version, requesting with the latest compatible Elasticsearch Node.js client

we are stuck with this until opensearch-project/opensearch-js#187 has landed

@Poojita-Raj
Contributor

Hi @rawpixel-vincent,

Since you're using elasticsearch currently, this is an issue with the AWS OpenSearch Service offering. Please open a ticket against the AWS OpenSearch team. AWS support is the right place to get the assistance required to resolve this issue.

Hope this helps!

@rawpixel-vincent
Author

Thank you, I have opened a ticket with AWS Support so they can look into this OpenSearch Service bug.

@kartg
Member

kartg commented Dec 3, 2021

Closing this out

@cameron-hurd

@rawpixel-vincent Curious what the solution here was? We are running into a similar 429 issue.

@cameron-hurd

We were able to resolve the 429 issue by switching to a non-Graviton AWS instance type. The Graviton instance would have memory spikes over 85%, which triggered 429 responses.

@anthonygerrard

Switching from Graviton to non-Graviton instance types fixes this issue.

@dblock
Member

dblock commented Nov 22, 2022

@anthonygerrard @cameron-hurd Do you have tickets open with the Amazon managed service on these Graviton-related issues? If so, would you mind sharing them and/or sending me the ticket numbers (dblock[at]amazon[dot]com works), please? There's a team that has looked at similar issues, but I can't tell from the above whether it's the same problem or not.

@cameron-hurd

@dblock We reached out to AWS Support. They mentioned that GC behavior is different for the domain once G1GC is enabled with the Graviton instance type. AWS Support did not recommend switching to non-Graviton, but based on that GC statement we tried it, and it resolved our 429 issue. We never fully rolled customers out to the Graviton cluster, as it hit the issue with half the normal load. Data node memory pressure with Graviton went above the 429 threshold of 85%:
[image: data node memory pressure graph for the Graviton cluster]
Non-Graviton with more load does not have the memory pressure issue:
[image: data node memory pressure graph for the non-Graviton cluster]

@amitmun

amitmun commented Nov 28, 2022

@cameron-hurd Can you please make it clear which type of GC causes this and which type resolved this?

@cameron-hurd

cameron-hurd commented Nov 28, 2022

@amitmun Switching to a non-Graviton instance type resolved it. We have Auto-Tune enabled and use the managed OpenSearch Service, so we do not have the ability to set any GC settings.

@anthonygerrard

We've only just raised a support case with Amazon. No resolution yet.

@anthonygerrard

anthonygerrard commented Dec 6, 2022

We had a call with AWS support today. The solution offered was for us to raise a support request to increase the JVM utilization threshold from 85% to 95% after we create a cluster using Graviton instance types. We're not going to make use of this because we're operating fine on m5 instance types now and have a fully automated infrastructure as code deployment process.

I sent a message to our account manager requesting a feature to improve OpenSearch support on newer instance types.

@jahidmomin

We are still sometimes getting 429 errors on t3 instances.

@arshashi

arshashi commented Sep 3, 2024

We had a similar issue and resolved it as described below.

By default, OPENSEARCH_JAVA_OPTS comes with 512M of heap. Depending on the data load, the JVM might require additional memory to process the data. We edited the StatefulSet to increase OPENSEARCH_JAVA_OPTS to 2g, which solved the issue.

    # raise the JVM heap for the OpenSearch container (512M by default)
    - name: OPENSEARCH_JAVA_OPTS
      value: "-Xmx2g -Xms2g"

@dblock
Member

dblock commented Sep 3, 2024

Is it time to increase this default for 3.0?
