You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Given our vision we want to promote the idea of a dataset.
Now fondant is very pipeline focused where we see a pipeline as a graph of operations with intermediate datasets. We should rethink the our primary interface to enable the dataset first approach:
We should pack more functionality in the Dataset class:
view data preview (html formatted like pandas)
view data schema / metadata
view lineage
...
We should abstract the Pipeline idea. Compiling and starting a pipeline is a cheap operation so it should be less static. If we store the correct information on the dataset class we can create pipelines from a dataset (and its dependencies).
The content you are editing has changed. Please copy your edits and refresh the page.
Given our vision we want to promote the idea of a dataset.
Now fondant is very pipeline focused where we see a pipeline as a graph of operations with intermediate datasets. We should rethink the our primary interface to enable the dataset first approach:
We should pack more functionality in the Dataset class:
We should abstract the Pipeline idea. Compiling and starting a pipeline is a cheap operation so it should be less static. If we store the correct information on the dataset class we can create pipelines from a dataset (and its dependencies).
Tasks
The text was updated successfully, but these errors were encountered: