0.6.0

@RobbeSneyders RobbeSneyders released this 19 Oct 09:07
f8f4946

Highlights

  • Vertex AI is now supported as a backend for pipeline execution.

    Simply run fondant run vertex <pipeline.py> to submit your pipeline.
    Run fondant run vertex --help to see the possible configuration options.

  • The reusable components are now available on DockerHub under the fndnt organization.

    DockerHub is supported more broadly than the GitHub Container Registry, which we were using before.

  • Previously executed components are now cached when re-executed with the same arguments.

    • This makes it easier to iterate on the development of downstream components
    • This allows you to resume failed pipelines from their failed step
  • Added a fondant build command, which lets you build fondant components easily

    Run fondant build <component_dir>. Check fondant build -h for options.
    The command will also update the image reference in the fondant_component.yaml to the newly built one.

  • We migrated from KfP v1 to KfP v2. This means:

    • We now benefit from the latest KfP developments
    • We compile fondant pipelines to the IR YAML format, which is supported by other execution engines such as Vertex
    • You need a KfP v2 cluster to run fondant pipelines
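
The caching behaviour described in the highlights can be pictured with a small Python sketch. It is an illustration only, not Fondant's actual implementation: the names cache_key and ComponentCache are hypothetical, and it assumes that a cache hit is decided by hashing the component's image reference together with its arguments.

```python
import hashlib
import json


def cache_key(component_image, arguments):
    """Derive a deterministic key from an image reference and its arguments.

    Hypothetical helper: how Fondant really computes its cache key is not
    stated in these release notes.
    """
    # Sorting the keys makes the key independent of argument order.
    payload = json.dumps(
        {"image": component_image, "arguments": arguments},
        sort_keys=True,
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


class ComponentCache:
    """In-memory stand-in for a store of previous component outputs."""

    def __init__(self):
        self._outputs = {}

    def run(self, component_image, arguments, execute):
        """Return (output, was_cached) for one component execution."""
        key = cache_key(component_image, arguments)
        if key in self._outputs:
            # Same image and same arguments: reuse the earlier output.
            return self._outputs[key], True
        output = execute(arguments)
        self._outputs[key] = output
        return output, False
```

In this sketch, rerunning a pipeline after a failure would skip every step whose key is already stored and resume at the first step without a stored output, which is the same effect the release notes describe.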

Fixes

  • Fix data explorer for usage on Windows
  • Fix propagation of client_kwargs argument to configure Dask Client

Components

  • Every reusable component now has a clear README describing its usage
  • Add load_from_parquet component to load Parquet files as input data
  • Add embed_text component to embed documents and other text
  • Add chunk_text component to chunk documents into passages
  • Add index_weaviate component to index data in a Weaviate vector store
  • Fix issue with mixed type ids in LAION retrieval components
  • Improve success rate of download_images component
  • Fix OOM issues for inference components using GPU
  • Limit data read by load_from_hub component to used columns
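
To give a rough idea of what chunking documents into passages means, here is a minimal Python sketch. It assumes fixed-size character windows with an overlap. The function split_into_passages and its parameters are hypothetical and are not the chunk_text component's interface, and the component may use a different splitting strategy.

```python
def split_into_passages(text, chunk_size=200, overlap=20):
    """Split text into character windows that overlap by `overlap` characters.

    Hypothetical illustration; not the API of the chunk_text component.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap
    passages = []
    start = 0
    while start < len(text):
        passages.append(text[start:start + chunk_size])
        # Stop once the current window has reached the end of the text.
        if start + chunk_size >= len(text):
            break
        start += step
    return passages
```

The overlap keeps some shared context between neighbouring passages, which tends to help when the passages are later embedded and indexed separately.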

Full Changelog: 0.5.0...0.6.0