Updater: target hash calculation should not read the whole file in memory #1215

jku · 2020-11-17T18:38:51Z

Description of issue or feature request:

This is not a bug (although it could become one on a memory-limited client device) but a performance improvement:

After PR #1202 there is only one place left where the updater loads the whole target file into memory. We should avoid doing that, as targets can be very large and memory can be limited.

Current behavior:

_check_hashes() does this:

digest_object = securesystemslib.hash.digest(algorithm)
digest_object.update(file_object.read())
computed_hash = digest_object.hexdigest()

Expected behavior:
Something like this (hand-waving):

digest_object = securesystemslib.hash.digest(algorithm)
while True:
    chunk = file_object.read(CHUNK_SIZE)
    if not chunk:
        break
    digest_object.update(chunk)
computed_hash = digest_object.hexdigest()

Or, even more simply, just let securesystemslib handle this with its default chunk size:

digest_object = securesystemslib.hash.digest_fileobject(file_object, algorithm)
computed_hash = digest_object.hexdigest()
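For reference, a minimal standalone sketch of the chunked approach. It assumes the standard library's hashlib as a stand-in for securesystemslib.hash, and the CHUNK_SIZE value and the hash_in_chunks name are placeholders picked for illustration, not anything from the updater. It checks that chunked hashing gives the same digest as hashing the whole buffer at once:

```python
import hashlib
import io

CHUNK_SIZE = 8192  # assumed value; the issue leaves CHUNK_SIZE unspecified


def hash_in_chunks(file_object, algorithm="sha256", chunk_size=CHUNK_SIZE):
    """Return the hex digest of file_object, reading chunk_size bytes at a time."""
    digest_object = hashlib.new(algorithm)
    # iter() with a sentinel keeps calling read() until it returns b"" (EOF),
    # so at most chunk_size bytes of the target are held in memory at once.
    for chunk in iter(lambda: file_object.read(chunk_size), b""):
        digest_object.update(chunk)
    return digest_object.hexdigest()


# The chunked digest matches the digest of the full contents.
data = b"x" * 100_000
assert hash_in_chunks(io.BytesIO(data)) == hashlib.sha256(data).hexdigest()
```

Peak memory is bounded by the chunk size rather than the target size, which is the whole point of the change.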


trishankatdatadog · 2020-11-17T18:58:54Z

Good find, thanks! On top of that, shouldn't securesystemslib already offer a function for checking hashes?

jku · 2020-11-17T21:27:35Z

> shouldn't securesystemslib already offer a function for checking hashes?

Yes, my last example uses an sslib function for calculating a file object hash; I just forgot to include the 'securesystemslib.hash.' prefix there (now fixed).

Or do you mean actually comparing the hashes?

We don't want to read the whole file in memory as it can be huge. Use digest_fileobject() instead: This way Securesystemslib will read the file in chunks. Securesystemslib already takes care of seeking to beginning of file.

Fixes theupdateframework#1215

Signed-off-by: Jussi Kukkonen <jkukkonen@vmware.com>
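To illustrate why seeking to the beginning of the file matters, here is a rough sketch of a hash check over a file object whose position is at end-of-file (as it would be right after a download was written to it). It assumes hashlib in place of securesystemslib; check_hashes and its trusted_hashes mapping of algorithm name to hex digest are hypothetical names for illustration, not the updater's actual code:

```python
import hashlib
import io


def check_hashes(file_object, trusted_hashes, chunk_size=8192):
    """Hypothetical helper: True if every trusted hash matches file_object."""
    for algorithm, trusted_hash in trusted_hashes.items():
        # Rewind first: without this, read() would start at the current
        # position (possibly EOF) and the digest would cover too little data.
        file_object.seek(0)
        digest_object = hashlib.new(algorithm)
        for chunk in iter(lambda: file_object.read(chunk_size), b""):
            digest_object.update(chunk)
        if digest_object.hexdigest() != trusted_hash:
            return False
    return True


target = io.BytesIO()
target.write(b"target contents")  # position is now at end of file
trusted = {"sha256": hashlib.sha256(b"target contents").hexdigest()}
assert check_hashes(target, trusted)
```

The rewind happens once per algorithm, so checking several trusted hashes rereads the file each time but never holds more than one chunk in memory.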

jku mentioned this issue Nov 23, 2020

Avoid reading target in memory #1219

Merged

joshuagl closed this as completed in #1219 Nov 25, 2020

sechkova mentioned this issue Dec 22, 2020

PoC: Let user download targets #1171

Closed

3 tasks
