Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Partitioning in MapReduce #407

Open
qh681248 opened this issue Sep 26, 2024 · 0 comments
Open

Partitioning in MapReduce #407

qh681248 opened this issue Sep 26, 2024 · 0 comments
Labels
new Something yet to be discussed by development team question Further information is requested

Comments

@qh681248
Copy link
Contributor

What's the issue?

Currently MapReduce is implemented such a way that if you partition a dataset into number of partitions that does not divide data then you would get partitions of unequal sizes, the current implementation repeats the last data point to fill the partitions, would it make more sense to use a random datapoint instead?

@qh681248 qh681248 added new Something yet to be discussed by development team question Further information is requested labels Sep 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new Something yet to be discussed by development team question Further information is requested
Projects
None yet
Development

No branches or pull requests

1 participant