allow executing COPY INTO in a cluster #6395
Comments
Is it a single file?
One hundred files, obtained by cutting the ontime dataset.
OK, it seems parallel copy only works in single query mode.
Distributed COPY INTO needs #6253 (exchange precommit blocks between cluster nodes).
@RinChanNOWWW you can try this issue, it's ready to do now.
I think we can treat the <internal/external-stage, remote location> as a special storage engine. Then we can get the file list as the table source and distribute the files across the cluster nodes. Also, this will be the basis for:
I would ping @dantengsky, who is working on a similar storage engine (pre-sign). If some code needs refactoring, please let us know :)
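To make the "file list as table source" idea concrete, here is a minimal sketch of splitting a stage's file list across cluster nodes. It is only an illustration under stated assumptions: the helper `assign_files_to_nodes`, the node names, and the file names are hypothetical and are not part of the project's code, and round-robin is just one possible assignment strategy.

```python
from typing import Dict, List


def assign_files_to_nodes(files: List[str], nodes: List[str]) -> Dict[str, List[str]]:
    """Round-robin split of a file list across nodes (hypothetical helper)."""
    if not nodes:
        raise ValueError("at least one node is required")
    plan: Dict[str, List[str]] = {node: [] for node in nodes}
    # Sort first so every planner run produces the same assignment.
    for index, path in enumerate(sorted(files)):
        plan[nodes[index % len(nodes)]].append(path)
    return plan


# Assumed inputs: 100 files cut from the ontime dataset, 8 cluster nodes.
files = [f"ontime_{i:03d}.csv" for i in range(100)]
nodes = [f"node-{i}" for i in range(8)]
plan = assign_files_to_nodes(files, nodes)

for node, assigned in plan.items():
    print(node, len(assigned))
```

Each node would then read and parse only its own files. The resulting blocks still have to be committed together, which is presumably where the precommit block exchange from #6253 comes in.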
Then we can convert
Please take a look at #7502.
Summary
I'm running a COPY INTO in my cluster with 8 replicas, but it seems to use only one replica to execute the COPY INTO statement:
It'd be 8x faster if the COPY INTO statement could utilize the other instances in the cluster.