[Question]Does SCQL support query jobs on bigdata? #318

xyz-scorpio · 2024-07-10T09:28:35Z

Issue Type

Have you searched for existing issues?

Yes

Link to Relevant Documentation

No response

Question Details

What is the upper bound on the dataset scale that SCQL can handle? For example, if I want to run a query job on two datasets from Alice and Bob, each on the order of terabytes, can SCQL handle that?

Also, does SCQL support distributed computing? If I have 4 AWS EC2 instances, can SCQL take advantage of all of those resources, and if so, how?

tongke6 · 2024-07-11T02:45:51Z

Hello @xyz-scorpio, SCQL is a system implementation of MPC SQL. Because it is limited by the network communication, computation, and memory overhead of MPC, I think its upper bound is data analysis on the scale of tens of millions within an acceptable time (e.g. < 6 hours).

For now, SCQL can use only one computing node per party to process a given query job, but different jobs can be scheduled to different computing nodes.
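As a rough illustration of the job-level parallelism described above, the sketch below assigns each independent query job to exactly one node, round-robin. It is not SCQL's scheduler or API: the function `assign_jobs` and the node and job names are hypothetical, chosen only to show that extra nodes help when there are several jobs, not when there is one large job.

```python
from itertools import cycle


def assign_jobs(jobs, nodes):
    """Map each independent query job to exactly one node, round-robin.

    Hypothetical helper for illustration only; not part of SCQL.
    """
    if not nodes:
        raise ValueError("at least one computing node is required")
    node_cycle = cycle(nodes)
    return {job: next(node_cycle) for job in jobs}


# Hypothetical node and job names for one party with 4 machines.
nodes = ["ec2-1", "ec2-2", "ec2-3", "ec2-4"]
jobs = ["query-a", "query-b", "query-c", "query-d", "query-e"]

assignment = assign_jobs(jobs, nodes)
for job, node in assignment.items():
    print(f"{job} -> {node}")
```

With a single job, only one of the four nodes would be used; the other three stay idle until more jobs are submitted.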

tongke6 · 2024-07-11T02:49:19Z

The exception is the private set intersection (PSI) scenario, which can scale well.

tongke6 self-assigned this Jul 11, 2024
