-
Notifications
You must be signed in to change notification settings - Fork 16.3k
Open
Labels
area:corearea:data-aware-schedulingassets, datasets, AIP-48assets, datasets, AIP-48kind:featureFeature RequestsFeature Requestspriority:mediumBug that should be fixed before next release but would not block a releaseBug that should be fixed before next release but would not block a release
Milestone
Description
Body
Asset.extra was originally designed so users can attach additional useful information to describe an asset. We also designed AssetEvent.extra so when an event is emitted against an asset, it can contain additional data to be used in dags/tasks that are triggered by the event, or read the asset as a data inlet.
However, perhaps due to the very similar name, Asset.extra is constantly being misunderstood as some form of blueprint/template/source to AssetEvent.extra, and key-values defined on the asset should be populated into event extras against the asset.
Combined with the latest attempt to add extra templating (#54885 by @hussein-awala), I feel the confusion is much too significant for this feature. My proposal for the situation is
- Deprecate
Asset.extrato be removed in 4.0. It is not clear whether a key-value store for static additional information is useful. Dags don’t have it, and people don’t seem to be too bothered. We can probably adddoc_mdetc. if needed. - Add
Asset.event_extra_templateto re-implement Support templated extra in outlets assets #54885. Clearer name, no backward compatibility consideration needed.
Committer
- I acknowledge that I am a maintainer/committer of the Apache Airflow project.
Lee-W
Metadata
Metadata
Assignees
Labels
area:corearea:data-aware-schedulingassets, datasets, AIP-48assets, datasets, AIP-48kind:featureFeature RequestsFeature Requestspriority:mediumBug that should be fixed before next release but would not block a releaseBug that should be fixed before next release but would not block a release