Simplify updater logic for downloading and verifying target files #1202

joshuagl · 2020-11-09T16:02:49Z

Description of the changes being introduced by the pull request:

The logic for downloading/verifying target files is fairly complex to follow and debug. This PR flattens several internal methods (including a callback) to make the logic simpler to follow.

Please verify and check that the pull request fulfills the following
requirements:

The code follows the Code Style Guidelines
Tests have been added for the bug fix or new feature
Docs have been added for the bug fix or new feature

This internal method isn't used by any code other than tests. Signed-off-by: Joshua Lock <jlock@vmware.com>

The call stack and code for download_target() is more complex than required: * download_target() : builds target destination filepath, gets length and hashes * _get_target_file() : fixes filenames if consistent snapshots enabled, defines verification callback * _get_file() : iterates mirrors, tries to download files, verifies them Remove the verification callback and collapse the call stack by a single level to make the code easier to follow. Signed-off-by: Joshua Lock <jlock@vmware.com>

sechkova · 2020-11-09T16:50:30Z

tuf/client/updater.py

+      try:
+        file_object = tuf.download.safe_download(file_mirror, file_length)
+
+        # Verify 'file_object' against the expected length and hashes.


Do you think it is possible to explain why there is a duplicated check of file length here after the one in safe_download?

strictly speaking the checks that are done in functions that safe_download() calls are for the downloaded byte count, whereas _check_file_length() actually looks at the file object. I don't know how that could be of practical difference but...

Good question, but I think better safe than sorry. I think redundant checks are fine as long as no real impact on performance.

tuf/client/updater.py

Rather than read to the end of the file in order to determin its size, use the whence value of seek() to move the file object's position to the end of the file, then the tell() method of the file object to read the current position in bytes. Co-authored-by: Jussi Kukkonen <jkukkonen@vmware.com> Signed-off-by: Joshua Lock <jlock@vmware.com>

joshuagl · 2020-11-12T12:11:56Z

Trying close/re-open to handle the CI move

tuf/client/updater.py

tests/test_updater.py

Simplify the loop exit logic in _get_target_file() to simply return a verified file_object, once we have it, rather than breaking from the loop and then returning the file_object. This converts a use of a try/except/else to a try/except and is a little easier to read. Signed-off-by: Joshua Lock <jlock@vmware.com>

trishankatdatadog · 2020-11-26T10:10:06Z

Why is it called _get_target_file()? How are metadata downloaded and verified?

joshuagl · 2020-11-26T10:43:13Z

Why is it called _get_target_file()? How are metadata downloaded and verified?

with _get_metadata_file(). It does things slightly differently: doesn't prefix the file with a hash, has different/additional checks for metadata fields (version and spec_version), checks for expiry, verifies signature, etc.

lukpueh

Impeccable work, @joshuagl! 🎉 Thanks for deobfuscating. :)

joshuagl added 2 commits November 9, 2020 15:55

updater: remove unused _soft_check_file_length

b3ada5b

This internal method isn't used by any code other than tests. Signed-off-by: Joshua Lock <jlock@vmware.com>

sechkova reviewed Nov 9, 2020

View reviewed changes

jku reviewed Nov 9, 2020

View reviewed changes

tuf/client/updater.py Show resolved Hide resolved

jku approved these changes Nov 10, 2020

View reviewed changes

joshuagl closed this Nov 12, 2020

joshuagl reopened this Nov 12, 2020

jku mentioned this pull request Nov 17, 2020

Updater: target hash calculation should not read the whole file in memory #1215

Closed

joshuagl requested a review from lukpueh November 25, 2020 16:30

trishankatdatadog reviewed Nov 26, 2020

View reviewed changes

tuf/client/updater.py Outdated Show resolved Hide resolved

tests/test_updater.py Show resolved Hide resolved

lukpueh approved these changes Nov 26, 2020

View reviewed changes

lukpueh merged commit e061bc6 into theupdateframework:develop Nov 26, 2020

joshuagl deleted the joshuagl/updater-simplify branch November 26, 2020 12:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Simplify updater logic for downloading and verifying target files #1202

Simplify updater logic for downloading and verifying target files #1202

Uh oh!

joshuagl commented Nov 9, 2020

Uh oh!

sechkova Nov 9, 2020

Uh oh!

jku Nov 9, 2020

Uh oh!

trishankatdatadog Nov 26, 2020

Uh oh!

Uh oh!

joshuagl commented Nov 12, 2020

Uh oh!

Uh oh!

Uh oh!

trishankatdatadog commented Nov 26, 2020

Uh oh!

joshuagl commented Nov 26, 2020

Uh oh!

lukpueh left a comment

Uh oh!

Uh oh!

Simplify updater logic for downloading and verifying target files #1202

Simplify updater logic for downloading and verifying target files #1202

Uh oh!

Conversation

joshuagl commented Nov 9, 2020

Uh oh!

sechkova Nov 9, 2020

Choose a reason for hiding this comment

Uh oh!

jku Nov 9, 2020

Choose a reason for hiding this comment

Uh oh!

trishankatdatadog Nov 26, 2020

Choose a reason for hiding this comment

Uh oh!

Uh oh!

joshuagl commented Nov 12, 2020

Uh oh!

Uh oh!

Uh oh!

trishankatdatadog commented Nov 26, 2020

Uh oh!

joshuagl commented Nov 26, 2020

Uh oh!

lukpueh left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!