-
Notifications
You must be signed in to change notification settings - Fork 152
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix buffer overflow for unzip with columns_to_skip #658
Conversation
@ejguan has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
I will add tests tmrw |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM! Just need tests. Can you test that the buffer properly clears when columns_to_skip
is used with unzip
? Thanks!
@ejguan has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
This PR requires pytorch/pytorch#81502 landed and nightly released. |
@ejguan has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
@ejguan has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
1 similar comment
@ejguan has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
When
unzip
hascolumns_to_skip
, we should only createChildDataPipe
corresponding toinstance_ids
. AndUnzipper
should also handle mappinginstance_ids
to 0-based indices.This is one of the root causes for the issue reported internally. See: https://fburl.com/2k0et1gv