Gatsby develop and build hang on source and transform nodes stage on large CSV file with extremely high ram usage #33868
Comments
Running "gatsby build" uses 105 GB of RAM in about 30 seconds on my computer! Crazy! |
My guess is that we should do csv streaming instead of loading the whole file. It's not something we will prioritize anytime soon but feel free to open a PR. |
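To illustrate the streaming idea suggested above, here is a minimal sketch that reads a CSV file row by row using only Node's built-in `fs` and `readline` modules. It is not how gatsby-transformer-csv is implemented; the function names `streamCsvRows` and `countRows` are hypothetical, and the line splitting is deliberately naive (it assumes a header row, comma delimiters, and no quoted fields containing commas or newlines).

```javascript
// Sketch only: stream a CSV file one row at a time instead of loading it whole.
// Assumptions: first line is a header row, fields are comma-separated, and no
// quoted field contains a comma or newline (a real CSV parser handles those).
const fs = require("fs");
const readline = require("readline");

// Hypothetical helper: yields one object per data row, keyed by header name.
async function* streamCsvRows(filePath) {
  const rl = readline.createInterface({
    input: fs.createReadStream(filePath, { encoding: "utf8" }),
    crlfDelay: Infinity, // treat \r\n as a single line break
  });

  let headers = null;
  for await (const line of rl) {
    if (line.trim() === "") continue; // skip blank lines
    const cells = line.split(",");
    if (headers === null) {
      headers = cells; // first non-blank line is the header row
      continue;
    }
    const row = {};
    headers.forEach((name, i) => {
      row[name] = cells[i];
    });
    yield row;
  }
}

// Hypothetical consumer: only one row is held at a time, so memory use
// does not grow with the size of the file.
async function countRows(filePath) {
  let count = 0;
  for await (const _row of streamCsvRows(filePath)) {
    count += 1;
  }
  return count;
}
```

In a transformer, the body of the `for await` loop is where one node per row could be created, so the full file never has to sit in memory at once.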
So @wardpeet this looks like some inefficiency within gatsby-transformer-csv specifically - is that right? |
I'm also experiencing very high usage of RAM on my local machine on my gatsby 4 test branch on the same build step. I don't use the csv source plugin, but maybe the following list of
|
shot in the dark, but it could be the same issue #34081 |
Hiya! This issue has gone quiet. Spooky quiet. 👻 We get a lot of issues, so we currently close issues after 60 days of inactivity. It’s been at least 20 days since the last update here. Thanks for being a part of the Gatsby community! 💪💜 |
I guess I need to fork the entire gatsby project to create the PR for this. @joernroeder |
I've committed the change to my fork. I'm not sure I'm following the right procedure to hand it off but you can find it here: |
@witcradg We'd be happy to receive this as a PR. Feel free to send it in! :) |
Any updates on this? I'm having a similar problem building a Gatsby website with 10,000+ pages from 50 CSV files, one for each state, plus one big CSV file containing all the URL slugs. The site builds with about 1/6 of the data; with any more, it gets stuck on "source and transform nodes" and then stops with a "Killed: 9" message. I tried --max-old-space-size=8192 as well as the fix @witcradg made to the CSV transformer, with no luck. I also tried breaking the URL slug CSV into smaller CSVs, again with no luck. |
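When experimenting with `--max-old-space-size`, it can help to confirm that the flag actually reached the Node process. A small sketch using Node's built-in `v8` module follows; the helper name `heapLimitMiB` and the script name `check-heap.js` are hypothetical. Note that raising the V8 heap limit does not add physical memory, so a process that is killed by the operating system (as a "Killed: 9" message usually suggests) may still die with a higher limit.

```javascript
// Sketch only: report the V8 heap size limit the current process is running with.
const v8 = require("v8");

// Hypothetical helper: heap_size_limit is reported in bytes; convert to MiB.
function heapLimitMiB() {
  return Math.round(v8.getHeapStatistics().heap_size_limit / (1024 * 1024));
}

console.log(`V8 heap limit: ${heapLimitMiB()} MiB`);
```

For example, running `NODE_OPTIONS=--max-old-space-size=8192 node check-heap.js` should report a limit in the neighbourhood of 8192 MiB if the option was picked up.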
I might be able to create a pull request for this over the weekend. Just need the time to look into it. |
I get a trace from
when I run
|
Hey again! It’s been 60 days since anything happened on this issue, so our friendly neighborhood robot (that’s me!) is going to close it. Thanks again for being part of the Gatsby community! 💪💜 |
Co-authored-by: ascott <ascott@DESKTOP-39AL99T.localdomain> Co-authored-by: Lennart <lekoarts@gmail.com> Fixes #33868
Preliminary Checks
Description
When using gatsby build or gatsby develop with the plugin gatsby-transformer-csv and a largish CSV file (~16,000 rows), the build hangs on the 'source and transform nodes' stage, seemingly indefinitely. The Node.js process also uses approximately 50 GB of my 64 GB of memory.
I've tried this with a new Gatsby 4 starter and a newly created CSV file of the same length, to make sure the issue wasn't specific to the one project and that the file wasn't somehow corrupted or invalid.
I know a similar issue was mentioned here: #11839 (comment), but that one doesn't mention high memory usage, so it may have a different cause.
Reproduction Link
https://github.com/DanielPBliss/GatsbySourceAndTransformIssue
Steps to Reproduce
...
Expected Result
The build completes and RAM usage stays within a normal range.
Actual Result
The build hangs on the 'source and transform nodes' stage, seemingly indefinitely, and RAM usage quickly climbs to about 50 GB of my 64 GB.
Environment
Config Flags
No response