vacuum is very slow on Cloudflare R2 #1366
Comments
Hi @djouallah, |
@Blajda it turns out the issue is vacuum! I removed it from the cloud function and the run time went from 10 minutes to 1 minute.
How many files do you have under |
I already vacuumed and optimized today, so I don't have many files left, but I produce 288 new files daily. Anyway, running this with only 68 files is rather fast: it took 3 seconds.
Do you have any insights on memory consumption? In my experience, whenever we have seen such a difference once we get to a certain size, it often relates to memory limitations.
Unfortunately, this is not yet implemented on the object-store side, but there is some previous work in apache/arrow-rs#4060.
# Description
We don't yet have batch deletes in object store (but will soon). In the meantime, we can at least issue multiple requests in parallel. Set the default at 10. I think that should be reasonable for now; later we can optimize to try to find the right rate to avoid rate-limiting.

# Related Issue(s)
* helps with #1366
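To illustrate the idea in the description above (several delete requests in flight at once, capped at 10), here is a minimal Python sketch. It is not the code from the linked change: `delete_in_parallel` and the `delete_one` callable are hypothetical names introduced for this example, and `delete_one` stands in for whatever single-object delete call your storage client provides.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterable, List, Optional, Tuple


def delete_in_parallel(
    paths: Iterable[str],
    delete_one: Callable[[str], None],  # hypothetical: one delete request per call
    max_concurrency: int = 10,          # mirrors the default of 10 mentioned above
) -> List[Tuple[str, Exception]]:
    """Delete every path with at most `max_concurrency` requests in flight.

    Returns (path, error) pairs for the deletes that failed, so a single
    failed request does not abort the rest of the batch.
    """

    def attempt(path: str) -> Optional[Tuple[str, Exception]]:
        try:
            delete_one(path)
            return None
        except Exception as exc:  # collect the error instead of raising
            return (path, exc)

    failures: List[Tuple[str, Exception]] = []
    # The pool size is what bounds the number of concurrent requests.
    with ThreadPoolExecutor(max_workers=max_concurrency) as pool:
        for result in pool.map(attempt, paths):
            if result is not None:
                failures.append(result)
    return failures
```

Because each delete is dominated by network latency, a thread pool is enough here. If the store starts rate-limiting, lowering `max_concurrency` is the simplest knob to turn.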
Environment
Version: 0.9
Binding: Python
Environment: Cloudflare R2
Bug
Edit: the issue is with vacuum; it is very slow for a delete operation.
I am running a cloud function to vacuum and optimize a small Delta table in Cloudflare R2. The table currently has 45 partitions (one per day), and every day I insert 288 new small files.
The function takes nearly 10 minutes to finish, which seems very slow, and I am not sure it will scale as the table grows.
here is the code I use