
Redash OOM with mid-sized queries #4867

Closed
noxdafox opened this issue May 6, 2020 · 3 comments

Comments


noxdafox commented May 6, 2020

Greetings,

We have an instance of Redash 8.0 deployed on an AWS EC2 instance using the Redash-provided AMI. The local PostgreSQL database has been replaced with a dedicated AWS RDS one.

The EC2 instance is of type t3.large, which comes with 8 GB of memory.

We have a single data source which is Athena.

The following query:

SELECT * FROM <table> limit 1000000;

runs in a few seconds on Athena and produces a 444 MB CSV file.

When run through Redash, it takes an indefinite amount of time until it fails with Error running query: Worker exited prematurely: signal 9 (SIGKILL).
Watching the instance's memory, we see Redash's consumption grow until it fills the instance memory, forcing the OOM killer to kick in.

We can reproduce the issue both via the web UI and via the api/query_results API endpoint.
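For reference, our API-side reproduction looks roughly like the sketch below. The base URL, query ID, and API key are placeholders, and the results-CSV endpoint path is an assumption on our part; streaming the response to disk at least keeps the client's own memory flat (the OOM happens server-side regardless):

```python
# Hypothetical sketch of pulling a query result over the Redash API.
# All names/URLs here are placeholders, not confirmed Redash behavior.
import shutil
import urllib.request

def results_csv_url(base_url: str, query_id: int) -> str:
    """Build the results-download URL for a query (endpoint path assumed)."""
    return f"{base_url.rstrip('/')}/api/queries/{query_id}/results.csv"

def download_results(base_url: str, query_id: int, api_key: str, dest: str) -> None:
    """Stream the cached result CSV to disk instead of buffering it in memory."""
    req = urllib.request.Request(
        results_csv_url(base_url, query_id),
        headers={"Authorization": f"Key {api_key}"},
    )
    with urllib.request.urlopen(req) as resp, open(dest, "wb") as out:
        shutil.copyfileobj(resp, out, length=1 << 20)  # copy in 1 MiB chunks
```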


noxdafox commented May 6, 2020

This is most likely a duplicate of #3241.

I have a very similar use case: users want to download CSV and XLSX files to work on locally. We want to use Redash both for showing charts and dashboards and as an access point for Athena when it comes to pre-filtering data.

@susodapop
Contributor

Agreed, this is a duplicate of #3241 and Arik's guidance there still stands. This is currently expected behavior. Redash is meant to display aggregated datasets below ~50k rows / 50 MB. Even if the query runner didn't run out of memory, your browser would crash trying to load 1M rows. If you bypass the front-end, the workers can usually handle results around ~250 MB via the API (or larger if you provision them appropriately).

But this is well outside Redash's scope. You should probably consider a different tool for pulling such large datasets into Excel (perhaps Power Query?).
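If you do stay within Redash, one workaround in the spirit of the advice above is to split the extract into several smaller queries, each under the ~50k-row comfort zone, and download them separately. A minimal sketch, assuming the table has a numeric id column with a known range (both the column and the splitting scheme are hypothetical, not part of Redash):

```python
def chunked_queries(table: str, id_min: int, id_max: int, chunk: int = 50_000):
    """Yield SELECTs over disjoint id ranges, each returning at most `chunk` rows."""
    for lo in range(id_min, id_max + 1, chunk):
        hi = min(lo + chunk - 1, id_max)
        yield f"SELECT * FROM {table} WHERE id BETWEEN {lo} AND {hi}"
```

Each slice can be run and fetched independently, so no single worker ever has to materialize the full 444 MB result.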

@noxdafox
Author

Indeed, my use case is not accessing such large results via the browser but rather letting users interface with Redash programmatically through the API. I will explain my use case in #78.
