Create new pipeline to fetch dependency provenance data #284

Open
mjherzog opened this issue Aug 9, 2021 · 1 comment
Labels: design-needed, enhancement (New feature or request)

Comments

@mjherzog
Member

mjherzog commented Aug 9, 2021

ScanCode Toolkit identifies dependencies from package manifest files, but in most cases what you really want is the provenance data that can be retrieved from a package repository. The enhancement request is to build a new SCIO pipeline that includes at least one example of fetching provenance data from a package repository, so that it can serve as a template. In a common use case you may want to fetch package repository provenance data for 2 or 3 package types.
This is related to:

I suspect that this enhancement may require Data Model changes.

@pombredanne
Member

This makes a lot of sense. This is also related to aboutcode-org/scancode-toolkit#272
The use cases could be reformulated this way:

  • I have Packages (e.g. as PURLs) but I may be missing their provenance metadata.
  • Or I have Packages with weak or limited provenance metadata (including pre-built binaries) and I want to either get extra metadata or scan the corresponding source code for completeness.

In the case of dependencies where I do not know the exact versions, I may want to further resolve a dependency's version constraints to get a concrete version (and this can be very simple, such as picking the latest released version).
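To make the "pick the latest released version" idea concrete, here is a minimal sketch. It is not ScanCode.io code: `parse_version` and `resolve_latest` are hypothetical helpers, the constraint format (a list of `(operator, version)` pairs) is an assumption, and only plain dotted numeric versions are treated as released. Real ecosystems have their own version semantics that a production resolver would need to follow.

```python
import operator
import re

# Assumption: constraints are given as (operator, version) pairs.
OPERATORS = {
    ">=": operator.ge,
    "<=": operator.le,
    ">": operator.gt,
    "<": operator.lt,
    "==": operator.eq,
    "!=": operator.ne,
}


def parse_version(version):
    """Return a comparable tuple for a plain dotted version such as "1.10.2".

    Return None for anything else (e.g. "2.0.0rc1"), which this sketch
    treats as "not a released version".
    """
    if not re.fullmatch(r"\d+(\.\d+)*", version):
        return None
    parts = [int(part) for part in version.split(".")]
    # Strip trailing zeros so that "1.0" and "1.0.0" compare as equal.
    while len(parts) > 1 and parts[-1] == 0:
        parts.pop()
    return tuple(parts)


def resolve_latest(available_versions, constraints=()):
    """Return the highest released version satisfying all constraints, or None.

    Constraint bounds are assumed to be plain dotted versions.
    """
    candidates = []
    for version in available_versions:
        key = parse_version(version)
        if key is None:
            continue  # skip pre-releases and unparseable versions
        if all(OPERATORS[op](key, parse_version(bound)) for op, bound in constraints):
            candidates.append((key, version))
    if not candidates:
        return None
    return max(candidates)[1]
```

With no constraints this simply returns the newest released version; passing something like `[("<", "1.10")]` narrows the candidates first.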

And to get the extra data I can either:

  • fetch extra data from package repository API
  • fetch and scan the corresponding source code
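As an illustration of the first option, the sketch below looks up extra metadata for a PyPI PURL through the PyPI JSON API. This is not the pipeline design itself: `parse_pypi_purl`, `fetch_json`, and `get_pypi_provenance` are hypothetical names, the PURL handling covers only the simple `pkg:pypi/name@version` form, and the returned keys are an assumption rather than the ScanCode.io data model. The source distribution URL it returns could feed the second option (fetch and scan the source code).

```python
import json
from urllib.request import urlopen


def parse_pypi_purl(purl):
    """Split a simple "pkg:pypi/name@version" PURL into (name, version)."""
    prefix = "pkg:pypi/"
    if not purl.startswith(prefix):
        raise ValueError(f"Not a PyPI PURL: {purl}")
    name, _, version = purl[len(prefix):].partition("@")
    # Drop any qualifiers ("?...") or subpath ("#...") after the version.
    version = version.split("?")[0].split("#")[0]
    if not name or not version:
        raise ValueError(f"PURL needs a name and a concrete version: {purl}")
    return name, version


def fetch_json(url):
    """Fetch a URL and decode the response body as JSON."""
    with urlopen(url, timeout=10) as response:
        return json.load(response)


def get_pypi_provenance(purl, fetch=fetch_json):
    """Return a small dict of provenance fields for a PyPI PURL.

    `fetch` is injectable so the lookup can be exercised without network access.
    """
    name, version = parse_pypi_purl(purl)
    data = fetch(f"https://pypi.org/pypi/{name}/{version}/json")
    info = data.get("info", {})
    # Keep only source distributions: those are what one would scan.
    sdist_urls = [
        item["url"]
        for item in data.get("urls", [])
        if item.get("packagetype") == "sdist"
    ]
    return {
        "purl": purl,
        "homepage_url": info.get("home_page"),
        "declared_license": info.get("license"),
        "download_url": sdist_urls[0] if sdist_urls else None,
    }
```

Keeping the repository-specific part in one function per package type is one way such an example could act as a template for other package types.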

tdruez added a commit that referenced this issue Oct 27, 2022
Signed-off-by: Thomas Druez <tdruez@nexb.com>
tdruez added a commit that referenced this issue Oct 27, 2022
Signed-off-by: Thomas Druez <tdruez@nexb.com>
tdruez added a commit that referenced this issue Oct 31, 2022
Signed-off-by: Thomas Druez <tdruez@nexb.com>
tdruez added a commit that referenced this issue Oct 31, 2022
Signed-off-by: Thomas Druez <tdruez@nexb.com>
tdruez added a commit that referenced this issue Nov 1, 2022
Signed-off-by: Thomas Druez <tdruez@nexb.com>
tdruez added a commit that referenced this issue Nov 4, 2022
Signed-off-by: Thomas Druez <tdruez@nexb.com>
tdruez added a commit that referenced this issue Nov 4, 2022
Signed-off-by: Thomas Druez <tdruez@nexb.com>
tdruez added a commit that referenced this issue Nov 4, 2022
Signed-off-by: Thomas Druez <tdruez@nexb.com>
tdruez added a commit that referenced this issue Nov 4, 2022
Signed-off-by: Thomas Druez <tdruez@nexb.com>
tdruez added a commit that referenced this issue Nov 18, 2022
Signed-off-by: Thomas Druez <tdruez@nexb.com>
3 participants