feat: Decouple `asset_handlers` into `c2pa-codecs` crate #533

ok-nick · 2024-07-31T20:40:59Z

Changes in this pull request

Decouples asset_handler/asset_io into its own crate.

Changes

Separate/group encoding and decoding
- Introduces Encode/Decode trait with the following methods:
  - Encode: write_c2pa, remove_c2pa, patch_c2pa, write_xmp, write_xmp_provenance, remove_xmp, remove_xmp_provenance
  - Decode: read_c2pa, read_xmp, read_xmp_provenance
- Asset handlers (codecs) now take a read-only stream on construction
  - When signing, the typical workflow is read->write->read->write, there's a lot we can cache
- Streams are no longer trait objects and instead generics (less overhead)
Decouple hashers to the parsers
- Introduces a Hash trait with 5 methods: hash, data_hash, box_hash, bmff_hash, collection_hash
- Parsers can choose their default hasher (usually defined by spec) with Hasher::hash whilst also implementing any other supported hashes
- Data hash now returns a list of byte spans corresponding to the manifest rather than explicitly defining byte ranges over the entire file
Add file signature inference
- Introduces a Supporter trait with 3 methods: supports_signature, supports_extension, supports_mime.
Add Embed trait (composed manifests)
- Construct an Embeddable with embeddable, read it from a stream with read_embeddable and write it to a stream with write_embeddable
Codec is the new entry-point for asset handlers
- It implements all traits and forwards it to the corresponding codec based on the file type
- It is an enum over all codecs, removing the need for boxing trait objects
- For instance, Codec::from_stream(&mut stream).read_c2pa()
Granular parsing errors
- It's now clear specifically where parsing went wrong, rather than "not found"
Codecs are individually locked behind feature flags
- If you only want png, enable the png feature, same with gif, jpeg, etc.
External codecs
- The Codec struct can be created with an external codec that implements the desired traits (user-defined codecs)
Updated dependencies
- Many parser dependencies have become unmaintained or deprecated

Testing

Test the codecs separately from c2pa-rs
- Use verified test image suites provided online, things like pngsuite, imagetestsuite, etc.
Test all codecs simulatenously
- Construct Codec and test read_xmp for all file types (no longer needs to be implemented for each asset handler)
Fuzz the codecs
- Individually fuzz each method of the codecs using afl or proptest

Related issues

Checklist

This PR represents a single feature, fix, or change.
All applicable changes have been documented.
Any TO DO items (or similar) have been entered as GitHub issues and the link to that issue has been included in a comment.

ok-nick added 2 commits July 29, 2024 17:02

prototype

45b0293

Prototype c2pa-codecs

95fef06

ok-nick added the enhancement New feature or request label Jul 31, 2024

ok-nick added 4 commits August 1, 2024 11:12

Cleanup + integration tests + c2pa codec + external codecs

c1e05d2

Add embeddables, more descriptive errors, svg, etc.

db76654

Add read_embeddable

3869a3c

Add JPEG, add tests, fix bugs found from tests, and more

911403c

scouten-adobe changed the title ~~Decouple asset_handlers into c2pa-codecs crate~~ feat: Decouple asset_handlers into c2pa-codecs crate Oct 7, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Decouple `asset_handlers` into `c2pa-codecs` crate #533

feat: Decouple `asset_handlers` into `c2pa-codecs` crate #533

ok-nick commented Jul 31, 2024 •

edited

Loading

feat: Decouple asset_handlers into c2pa-codecs crate #533

Are you sure you want to change the base?

feat: Decouple asset_handlers into c2pa-codecs crate #533

Conversation

ok-nick commented Jul 31, 2024 • edited Loading

Changes in this pull request

Changes

Testing

Related issues

Checklist

feat: Decouple `asset_handlers` into `c2pa-codecs` crate #533

feat: Decouple `asset_handlers` into `c2pa-codecs` crate #533

ok-nick commented Jul 31, 2024 •

edited

Loading