Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data sets SF3000+ are not supported #30

Closed
szarnyasg opened this issue Oct 19, 2024 · 1 comment
Closed

Data sets SF3000+ are not supported #30

szarnyasg opened this issue Oct 19, 2024 · 1 comment

Comments

@szarnyasg
Copy link
Member

As remarked in ldbc/ldbc_snb_interactive_v1_impls#173:

Scaling the Interactive workload SF3000 is not trivial: the Hadoop-based Datagen breaks for SF1000+ data sets (with an NPE) and the old parameter generator has scalability issues (it's a single-threaded Python2 script – for SF1000, it already requires about a day to finish).

It would be worth trying to resolve that NPE. This would allow us to generate the data set and parameters for SF3000.

@szarnyasg
Copy link
Member Author

Added support for this now. See the details in the documentation PR: ldbc/ldbc_snb_docs#251

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant