copybara-service[bot]

Use the max size of serialized examples to find a safe number of shards

If we know the maximum size of a serialized example, we can account for the worst case, in which a single shard receives only maximum-size examples. This should help prevent users from ending up with shards that are too large.
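The worst-case reasoning can be illustrated with a minimal sketch. This is not the code from this change: the function name `safe_num_shards`, its parameters, and the assumption that examples are spread evenly by count across shards are all hypothetical and chosen only for illustration.

```python
def _ceil_div(a: int, b: int) -> int:
    """Integer ceiling division for non-negative a and positive b."""
    return -(-a // b)


def safe_num_shards(
    total_size: int,
    num_examples: int,
    max_example_size: int,
    max_shard_size: int,
) -> int:
    """Hypothetical helper: pick a shard count that stays under max_shard_size.

    Assumes (for illustration only) that examples are distributed evenly by
    count, so the worst case is a shard holding only max-size examples.
    All sizes are in bytes.
    """
    if max_shard_size <= 0:
        raise ValueError("max_shard_size must be positive")
    if num_examples <= 0:
        return 1

    # Lower bound from the total size alone (the average case).
    by_total = _ceil_div(total_size, max_shard_size)

    if max_example_size <= 0:
        # Max size unknown: fall back to the average-case estimate.
        return max(1, by_total)

    # How many max-size examples fit into one shard (at least one).
    examples_per_shard = max(1, max_shard_size // max_example_size)

    # Shards needed so that no shard exceeds that example count.
    by_worst_case = _ceil_div(num_examples, examples_per_shard)

    return max(1, by_total, by_worst_case)
```

With 1000 examples totalling 1,000,000 bytes, a shard limit of 1,000,000 bytes, and a largest example of 100,000 bytes, the average-case estimate would be a single shard, while the worst-case bound in this sketch asks for 100 shards.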

@copybara-service copybara-service bot force-pushed the test_726377778 branch 2 times, most recently from 30080cb to 21c0be4 Compare March 11, 2025 16:39
PiperOrigin-RevId: 726377778