Stream large files when computing hash #72 #73

Merged
merged 18 commits into from
Dec 6, 2024

Conversation

pombredanne
Member

This PR fixes #72.
It streams large files in chunks when computing checksums, to avoid exhausting memory.
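
For readers unfamiliar with the technique, here is a minimal sketch of chunked hashing using only the standard library `hashlib`. It is not the code merged in this PR: the function name `streamed_checksum`, its parameters, and the 1 MiB default chunk size are assumptions chosen for illustration.

```python
import hashlib


def streamed_checksum(path, algorithm="sha1", chunk_size=1024 * 1024):
    """
    Return the hex digest of the file at ``path``, reading it in chunks.

    Hypothetical helper for illustration only; the name, signature and
    default chunk size are assumptions, not the commoncode.hash API.
    """
    hasher = hashlib.new(algorithm)
    with open(path, "rb") as fileobj:
        # Read at most chunk_size bytes at a time so that memory use stays
        # bounded regardless of the file size. read() returns b"" at EOF,
        # which ends the iteration.
        for chunk in iter(lambda: fileobj.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()
```

Feeding the hasher chunk by chunk yields the same digest as hashing the whole content at once, so peak memory depends on `chunk_size` rather than on the size of the file.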

OmkarPh and others added 15 commits October 18, 2023 15:42
Signed-off-by: Omkar Phansopkar <omkarphansopkar@gmail.com>
Added docs server script, dark mode & copybutton for docs, tracking chmod in git
Reference: aboutcode-org/skeleton#84

Signed-off-by: John M. Horan <johnmhoran@gmail.com>
Reference: aboutcode-org/skeleton#84

Signed-off-by: John M. Horan <johnmhoran@gmail.com>
Signed-off-by: Ayan Sinha Mahapatra <ayansmahapatra@gmail.com>
Update CSS to widen page and handle mobile #84
Signed-off-by: Ayan Sinha Mahapatra <ayansmahapatra@gmail.com>
Update CI runners and python version
Replace macos-11 runners with macos-14 runners.

Reference: https://github.com/actions/runner-images?tab=readme-ov-file#available-images
Reference: aboutcode-org/skeleton#89
Signed-off-by: Ayan Sinha Mahapatra <ayansmahapatra@gmail.com>
Signed-off-by: Ayan Sinha Mahapatra <ayansmahapatra@gmail.com>
Replace deprecated macos CI runners
Signed-off-by: Chin Yeung Li <tli@nexb.com>
Scanning a large file may exhaust memory and fail because
commoncode.hash loads the whole file in memory. With this commit,
files are read in chunks so we do not run out of memory on very large
files, such as VM images.

Reference: #72
Signed-off-by: Philippe Ombredanne <pombredanne@nexb.com>
Signed-off-by: Philippe Ombredanne <pombredanne@nexb.com>
Signed-off-by: Philippe Ombredanne <pombredanne@nexb.com>
Signed-off-by: Ayan Sinha Mahapatra <ayansmahapatra@gmail.com>
Signed-off-by: Ayan Sinha Mahapatra <ayansmahapatra@gmail.com>
Member

@AyanSinhaMahapatra AyanSinhaMahapatra left a comment

LGTM!
Thanks @pombredanne
Merging with a few docstring and other minor updates.

@AyanSinhaMahapatra AyanSinhaMahapatra force-pushed the 72-large-file-hash branch 3 times, most recently from 299ede9 to b467d2e Compare December 6, 2024 19:05
Signed-off-by: Ayan Sinha Mahapatra <ayansmahapatra@gmail.com>
@AyanSinhaMahapatra AyanSinhaMahapatra merged commit faf4a61 into main Dec 6, 2024
13 checks passed
@pombredanne
Member Author

Thanks for merging!

6 participants