Use intake for data fetching #190

@VeckoTheGecko

Description

Intake is a lightweight package for finding, investigating, loading and disseminating data.

Currently we ingest data using copernicusmarine. As we look to expand the abilities of virtualship, we will also need to manage data from other sources (which could get complicated).

Migrating our approach to use an intake catalogue will allow us to:

  • consolidate the datasets used in virtualship in a declarative catalogue, complete with metadata, that is also freely available for users to use and explore
  • ingest datasets from various sources
  • use datasets more lazily from remote storage (in future, data would not necessarily have to be eagerly downloaded; see footnote 1)
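
To make the "declarative" point concrete, here is a minimal sketch of what a catalogue could look like, assuming intake's classic YAML catalogue format. The source name, description, URL and metadata below are placeholders I made up, not real virtualship datasets, and the `csv` driver is only used because it ships with intake; ocean data in NetCDF/Zarr would probably need a plugin such as intake-xarray.

```python
from pathlib import Path

# Hypothetical catalogue: the entry name, description, URL and metadata
# are placeholders for illustration, not real virtualship datasets.
CATALOG_YAML = """\
sources:
  example_currents:
    description: Placeholder entry for an ocean currents dataset
    driver: csv
    args:
      urlpath: "https://example.org/data/currents.csv"
    metadata:
      provider: example
"""


def write_catalog(directory: Path) -> Path:
    """Write the example catalogue to `directory` and return its path."""
    path = directory / "catalog.yml"
    path.write_text(CATALOG_YAML)
    return path


def list_sources(catalog_path: Path) -> list:
    """Return the entry names in the catalogue (requires intake to be installed)."""
    import intake  # third-party dependency, imported lazily

    cat = intake.open_catalog(str(catalog_path))
    # Listing entries only reads the YAML; no data is fetched here.
    return list(cat)
```

With intake installed, `list_sources(write_catalog(Path(".")))` should list the single placeholder entry. Nothing is downloaded until an entry is actually read, which is where the lazy-loading benefit would come in.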

Intake is widely adopted in the Pangeo community.

My understanding of intake at the moment is somewhat limited; I still need to investigate.

Footnotes

  1. Currently parcels v3 expects the data to be on disk, but perhaps in v4 this will no longer have to be the case.

Labels: enhancement (New feature or request)
