Releases: apache/airflow
Apache Airflow 2.8.4
Significant Changes
No significant changes.
Bug Fixes
- Fix incorrect serialization of
FixedTimezone
(#38139) - Fix excessive permission changing for log task handler (#38164)
- Fix task instances list link (#38096)
- Fix a bug where scheduler heartrate parameter was not used (#37992)
- Add padding to prevent grid horizontal scroll overlapping tasks (#37942)
- Fix hash caching in
ObjectStoragePath
(#37769)
Miscellaneous
- Limit importlib_resources as it breaks
pytest_rewrites
(#38095, #38139) - Limit
pandas
to<2.2
(#37748) - Bump
croniter
to fix an issue with 29 Feb cron expressions (#38198)
Doc Only Changes
Apache Airflow Helm Chart 1.13.1
Significant Changes
Default Airflow image is updated to 2.8.3
(#38036)
The default Airflow image that is used with the Chart is now 2.8.3
, previously it was 2.8.2
.
Bug Fixes
- Don't overwrite
.Values.airflowPodAnnotations
(#37917) - Fix cluster-wide RBAC naming clash when using multiple
multiNamespace
releases with the same name (#37197)
Misc
- Chart: Default airflow version to 2.8.3 (#38036)
Apache Airflow 2.8.3
Significant Changes
The smtp provider is now pre-installed when you install Airflow. (#37713)
Bug Fixes
- Add "MENU" permission in auth manager (#37881)
- Fix external_executor_id being overwritten (#37784)
- Make more MappedOperator members modifiable (#37828)
- Set parsing context dag_id in dag test command (#37606)
Miscellaneous
- Remove useless methods from security manager (#37889)
- Improve code coverage for TriggerRuleDep (#37680)
- The SMTP provider is now preinstalled when installing Airflow (#37713)
- Bump min versions of openapi validators (#37691)
- Properly include
airflow_pre_installed_providers.txt
artifact (#37679)
Doc Only Changes
- Clarify lack of sync between workers and scheduler (#37913)
- Simplify some docs around airflow_local_settings (#37835)
- Add section about local settings configuration (#37829)
- Fix docs of
BranchDayOfWeekOperator
(#37813) - Write to secrets store is not supported by design (#37814)
ERD
generating doc improvement (#37808)- Update incorrect config value (#37706)
- Update security model to clarify Connection Editing user's capabilities (#37688)
- Fix ImportError on examples dags (#37571)
Apache Airflow Helm Chart 1.13.0
Significant Changes
Default Airflow image is updated to 2.8.2 (#37704)
The default Airflow image that is used with the Chart is now 2.8.2, previously it was 2.8.1.
New Features
- Support labels specific to the database migration objects and pods (#37490)
Improvements
- Flower K8s Probe config (#37528)
Bug Fixes
- Remove duplicate ports key in webserver service (#37356)
- Add AIRFLOW_HOME env var to log groomer sidecar (#37588)
- Skip . path when preparing reproducible packages (#37402)
Misc
- Default airflow version to 2.8.2 (#37704)
Apache Airflow 2.8.2
Significant Changes
The allowed_deserialization_classes
flag now follows a glob pattern (#36147).
For example if one wants to add the class airflow.tests.custom_class
to the
allowed_deserialization_classes
list, it can be done by writing the full class
name (airflow.tests.custom_class
) or a pattern such as the ones used in glob
search (e.g., airflow.*
, airflow.tests.*
).
If you currently use a custom regexp path make sure to rewrite it as a glob pattern.
Alternatively, if you still wish to match it as a regexp pattern, add it under the new
list allowed_deserialization_classes_regexp
instead.
The audit_logs permissions have been updated for heightened security (#37501).
This was done under the policy that we do not want users like Viewer, Ops,
and other users apart from Admin to have access to audit_logs. The intention behind
this change is to restrict users with less permissions from viewing user details
like First Name, Email etc. from the audit_logs when they are not permitted to.
The impact of this change is that the existing users with non admin rights won't be able
to view or access the audit_logs, both from the Browse tab or from the DAG run.
AirflowTimeoutError
is no longer except
by default through Exception
(#35653).
The AirflowTimeoutError
is now inheriting BaseException
instead of
AirflowException
->Exception
.
See https://docs.python.org/3/library/exceptions.html#exception-hierarchy
This prevents code catching Exception
from accidentally
catching AirflowTimeoutError
and continuing to run.
AirflowTimeoutError
is an explicit intent to cancel the task, and should not
be caught in attempts to handle the error and return some default value.
Catching AirflowTimeoutError
is still possible by explicitly except
ing
AirflowTimeoutError
or BaseException
.
This is discouraged, as it may allow the code to continue running even after
such cancellation requests.
Code that previously depended on performing strict cleanup in every situation
after catching Exception
is advised to use finally
blocks or
context managers. To perform only the cleanup and then automatically
re-raise the exception.
See similar considerations about catching KeyboardInterrupt
in
https://docs.python.org/3/library/exceptions.html#KeyboardInterrupt
Bug Fixes
- Sort dag processing stats by last_runtime (#37302)
- Allow pre-population of trigger form values via URL parameters (#37497)
- Base date for fetching dag grid view must include selected run_id (#34887)
- Check permissions for ImportError (#37468)
- Move
IMPORT_ERROR
from DAG related permissions to view related permissions (#37292) - Change
AirflowTaskTimeout
to inheritBaseException
(#35653) - Revert "Fix future DagRun rarely triggered by race conditions when max_active_runs reached its upper limit. (#31414)" (#37596)
- Change margin to padding so first task can be selected (#37527)
- Fix Airflow serialization for
namedtuple
(#37168) - Fix bug with clicking url-unsafe tags (#37395)
- Set deterministic and new getter for
Treeview
function (#37162) - Fix permissions of parent folders for log file handler (#37310)
- Fix permission check on DAGs when
access_entity
is specified (#37290) - Fix the value of
dateTimeAttrFormat
constant (#37285) - Resolve handler close race condition at triggerer shutdown (#37206)
- Fixing status icon alignment for various views (#36804)
- Remove superfluous
@Sentry.enrich_errors
(#37002) - Use execution_date= param as a backup to base date for grid view (#37018)
- Handle SystemExit raised in the task. (#36986)
- Revoking audit_log permission from all users except admin (#37501)
- Fix broken regex for allowed_deserialization_classes (#36147)
- Fix the bug that affected the DAG end date. (#36144)
- Adjust node width based on task name length (#37254)
- fix: PythonVirtualenvOperator crashes if any python_callable function is defined in the same source as DAG (#37165)
- Fix collapsed grid width, line up selected bar with gantt (#37205)
- Adjust graph node layout (#37207)
- Revert the sequence of initializing configuration defaults (#37155)
- Displaying "actual" try number in TaskInstance view (#34635)
- Bugfix Triggering DAG with parameters is mandatory when show_trigger_form_if_no_params is enabled (#37063)
- Secret masker ignores passwords with special chars (#36692)
- Fix DagRuns with UPSTREAM_FAILED tasks get stuck in the backfill. (#36954)
- Disable
dryrun
auto-fetch (#36941) - Fix copy button on a DAG run's config (#36855)
- Fix bug introduced by replacing spaces by + in run_id (#36877)
- Fix webserver always redirecting to home page if user was not logged in (#36833)
- REST API set description on POST to
/variables
endpoint (#36820) - Sanitize the conn_id to disallow potential script execution (#32867)
- Fix task id copy button copying wrong id (#34904)
- Fix security manager inheritance in fab provider (#36538)
- Avoid
pendulum.from_timestamp
usage (#37160)
Miscellaneous
- Install latest docker
CLI
instead of specific one (#37651) - Bump
undici
from5.26.3
to5.28.3
in/airflow/www
(#37493) - Add Python
3.12
exclusions inproviders/pyproject.toml
(#37404) - Remove
markdown
from core dependencies (#37396) - Remove unused
pageSize
method. (#37319) - Add more-itertools as dependency of common-sql (#37359)
- Replace other
Python 3.11
and3.12
deprecations (#37478) - Include
airflow_pre_installed_providers.txt
intosdist
distribution (#37388) - Turn Pydantic into an optional dependency (#37320)
- Limit
universal-pathlib to < 0.2.0
(#37311) - Allow running airflow against sqlite in-memory DB for tests (#37144)
- Add description to
queue_when
(#36997) - Updated
config.yml
for environment variablesql_alchemy_connect_args
(#36526) - Bump min version of
Alembic to 1.13.1
(#36928) - Limit
flask-session
to<0.6
(#36895)
Doc Only Changes
- Fix upgrade docs to reflect true
CLI
flags available (#37231) - Fix a bug in fundamentals doc (#37440)
- Add redirect for deprecated page (#37384)
- Fix the
otel
config descriptions (#37229) - Update
Objectstore
tutorial withprereqs
section (#36983) - Add more precise description on avoiding generic
package/module
names (#36927) - Add airflow version substitution into Docker Compose Howto (#37177)
- Add clarification about DAG author capabilities to security model (#37141)
- Move docs for cron basics to Authoring and Scheduling section (#37049)
- Link to release notes in the upgrade docs (#36923)
- Prevent templated field logic checks in
__init__
of operators automatically (#33786)
Apache Airflow Helm Chart 1.12.0
Significant Changes
The helm chart is now using a newer version of bitnami/postgresql
dependency (#34817)
The version of bitnami/postgresql
subchart upgraded from 12.10.0
to 13.2.24
.
The version of PostgreSQL
binaries upgraded from 11
to 16.1.0
.
The change requires existing bitnami/postgresql
subchart users to perform manual major version upgrade using pg_dumpall
or pg_upgrade
.
As a reminder, it is recommended to set up an external database <https://airflow.apache.org/docs/helm-chart/stable/production-guide.html#database>
_ in production.
Default Airflow image is updated to 2.8.1
(#36907)
The default Airflow image that is used with the Chart is now 2.8.1
, previously it was 2.7.1
.
Default PgBouncer and PgBouncer Exporter images have been updated (#36898)
The PgBouncer and PgBouncer Exporter images are based on newer software/os.
pgbouncer
: 1.21.0 based on alpine 3.14 (airflow-pgbouncer-2024.01.19-1.21.0
)pgbouncer-exporter
: 0.16.0 based on alpine 3.19 (apache/airflow:airflow-pgbouncer-exporter-2024.01.19-0.16.0
)
Default StatsD image is updated to v0.26.0
(#37187)
The default StatsD image that is used with the Chart is now v0.26.0
, previously it was v0.22.8
.
Default Redis image is updated to 7-bookworm
(#37187)
The default Redis image that is used with the Chart is now 7-bookworm
, previously it was 7-bullseye
.
New Features
- Enable native HPA for Airflow Workers (#36174)
- Add init container + sidecar support for Airflow Kerberos (#35548)
- Support MySQL backend as KEDA trigger (#36167)
Improvements
- Improve PriorityClass to improve debuggability (#36365)
- Add
securityContexts
in dag processors log groomer sidecar (#34499) - Add support for
securityContexts
in dag processors wait-for-migrations container (#35593) - Add templating for PVC
storageClassName
(#35581) - Add
volumeClaimTemplate
for worker (#34986) - Add support for
priorityClassName
on Redis pods (#34879) - Configurable mount path for DAGs volume (#35083)
- Add support for custom
emptyDir
config (#34837) - Added ability to enable/disable scheduler and webserver (#36991)
Bug Fixes
- Fix StatsD host in Airflow config (#35679)
- Set
AIRFLOW_HOME
env var withairflowHome
value (#34839) - Safer worker pod annotations (#35309)
- Set worker
safeToEvict
properly (#35130) - Fix Redis broker URL with
useStandardNaming
(#34825) - Fix metadata DB & port in KEDA connection when
usePgbouncer
is false (#34741) - Fix PgBouncer connection with
useStandardNaming
(#34787)
Doc only changes
- Add docs about extending the Airflow Helm chart (#36331)
- Add comment for Elasticsearch connection scheme (#35588)
- Add notes about Virtualenvs preventing the need for custom images (#35306)
Misc
- Default Airflow version to 2.8.1 (#36907)
- Support git-sync v4 (#34731)
- Upgrade
bitnami/postgresql
subchart to13.2.24
(#36156) - Change git sync container indent to 4 (#35824)
- Remove K8S 1.24 support (#35214)
- Rebuild
pgbouncer
andpgbouncer-exporter
images with newer versions (#36898) - Update
statsd
andredis
chart images (#37187)
Apache Airflow 2.8.1
Significant Changes
Target version for core dependency pendulum
package set to 3 (#36281).
Support for pendulum 2.1.2 will be saved for a while, presumably until the next feature version of Airflow.
It is advised to upgrade user code to use pendulum 3 as soon as possible.
Airflow packaging specification follows modern Python packaging standards (#36537).
We standardized Airflow dependency configuration to follow latest development in Python packaging by
using pyproject.toml
. Airflow is now compliant with those accepted PEPs:
PEP-440 Version Identification and Dependency Specification <https://www.python.org/dev/peps/pep-0440/>
__PEP-517 A build-system independent format for source trees <https://www.python.org/dev/peps/pep-0517/>
__PEP-518 Specifying Minimum Build System Requirements for Python Projects <https://www.python.org/dev/peps/pep-0518/>
__PEP-561 Distributing and Packaging Type Information <https://www.python.org/dev/peps/pep-0561/>
__PEP-621 Storing project metadata in pyproject.toml <https://www.python.org/dev/peps/pep-0621/>
__PEP-660 Editable installs for pyproject.toml based builds (wheel based) <https://www.python.org/dev/peps/pep-0660/>
__PEP-685 Comparison of extra names for optional distribution dependencies <https://www.python.org/dev/peps/pep-0685/>
__
Also we implement multiple license files support coming from Draft, not yet accepted (but supported by hatchling) PEP:
PEP 639 Improving License Clarity with Better Package Metadata <https://peps.python.org/pep-0639/>
__
This has almost no noticeable impact on users if they are using modern Python packaging and development tools, generally
speaking Airflow should behave as it did before when installing it from PyPI and it should be much easier to install
it for development purposes using pip install -e ".[devel]"
.
The differences from the user side are:
- Airflow extras now get extras normalized to
-
(following PEP-685) instead of_
and.
(as it was before in some extras). When you install airflow with such extras (for exampledbt.core
or
all_dbs
) you should use-
instead of_
and.
.
In most modern tools this will work in backwards-compatible way, but in some old version of those tools you might need to
replace _
and .
with -
. You can also get warnings that the extra you are installing does not exist - but usually
this warning is harmless and the extra is installed anyway. It is, however, recommended to change to use -
in extras in your dependency
specifications for all Airflow extras.
-
Released airflow package does not contain
devel
,devel-*
,doc
anddoc-gen
extras.
Those extras are only available when you install Airflow from sources in--editable
mode. This is
because those extras are only used for development and documentation building purposes and are not needed
when you install Airflow for production use. Those dependencies had unspecified and varying behaviour for
released packages anyway and you were not supposed to use them in released packages. -
The
all
andall-*
extras were not always working correctly when installing Airflow using constraints
because they were also considered as development-only dependencies. With this change, those dependencies are
now properly handling constraints and they will install properly with constraints, pulling the right set
of providers and dependencies when constraints are used.
Graphviz dependency is now an optional one, not required one (#36647).
The graphviz
dependency has been problematic as Airflow required dependency - especially for
ARM-based installations. Graphviz packages require binary graphviz libraries - which is already a
limitation, but they also require to install graphviz Python bindings to be build and installed.
This does not work for older Linux installation but - more importantly - when you try to install
Graphviz libraries for Python 3.8, 3.9 for ARM M1 MacBooks, the packages fail to install because
Python bindings compilation for M1 can only work for Python 3.10+.
This is not a breaking change technically - the CLIs to render the DAGs is still there and IF you
already have graphviz installed, it will continue working as it did before. The only problem when it
does not work is where you do not have graphviz installed it will raise an error and inform that you need it.
Graphviz will remain to be installed for most users:
- the Airflow Image will still contain graphviz library, because
it is added there as extra - when previous version of Airflow has been installed already, then
graphviz library is already installed there and Airflow will
continue working as it did
The only change will be a new installation of new version of Airflow from the scratch, where graphviz will
need to be specified as extra or installed separately in order to enable DAG rendering option.
Bug Fixes
- Fix airflow-scheduler exiting with code 0 on exceptions (#36800)
- Fix Callback exception when a removed task is the last one in the
taskinstance
list (#36693) - Allow anonymous user edit/show resource when set
AUTH_ROLE_PUBLIC=admin
(#36750) - Better error message when sqlite URL uses relative path (#36774)
- Explicit string cast required to force integer-type run_ids to be passed as strings instead of integers (#36756)
- Add log lookup exception for empty
op
subtypes (#35536) - Remove unused index on task instance (#36737)
- Fix check on subclass for
typing.Union
in_infer_multiple_outputs
for Python 3.10+ (#36728) - Make sure
multiple_outputs
is inferred correctly even when usingTypedDict
(#36652) - Add back FAB constant in legacy security manager (#36719)
- Fix AttributeError when using
Dagrun.update_state
(#36712) - Do not let
EventsTimetable
schedule past events ifcatchup=False
(#36134) - Support encryption for triggers parameters (#36492)
- Fix the type hint for
tis_query
in_process_executor_events
(#36655) - Redirect to index when user does not have permission to access a page (#36623)
- Avoid using dict as default value in
call_regular_interval
(#36608) - Remove option to set a task instance to running state in UI (#36518)
- Fix details tab not showing when using dynamic task mapping (#36522)
- Raise error when
DagRun
fails while runningdag test
(#36517) - Refactor
_manage_executor_state
by refreshing TIs in batch (#36502) - Add flask config:
MAX_CONTENT_LENGTH
(#36401) - Fix get_leaves calculation for teardown in nested group (#36456)
- Stop serializing timezone-naive datetime to timezone-aware datetime with UTC tz (#36379)
- Make
kubernetes
decorator type annotation consistent with operator (#36405) - Fix Webserver returning 500 for POST requests to
api/dag/*/dagrun
from anonymous user (#36275) - Fix the required access for get_variable endpoint (#36396)
- Fix datetime reference in
DAG.is_fixed_time_schedule
(#36370) - Fix AirflowSkipException message raised by BashOperator (#36354)
- Allow PythonVirtualenvOperator.skip_on_exit_code to be zero (#36361)
- Increase width of execution_date input in trigger.html (#36278)
- Fix logging for pausing DAG (#36182)
- Stop deserializing pickle when enable_xcom_pickling is False (#36255)
- Check DAG read permission before accessing DAG code (#36257)
- Enable mark task as failed/success always (#36254)
- Create latest log dir symlink as relative link (#36019)
- Fix Python-based decorators templating (#36103)
Miscellaneous
- Rename concurrency label to max active tasks (#36691)
- Restore function scoped
httpx
import in file_task_handler for performance (#36753) - Add support of Pendulum 3 (#36281)
- Standardize airflow build process and switch to Hatchling build backend (#36537)
- Get rid of
pyarrow-hotfix
forCVE-2023-47248
(#36697) - Make
graphviz
dependency optional (#36647) - Announce MSSQL support end in Airflow 2.9.0, add migration script hints (#36509)
- Set min
pandas
dependency to 1.2.5 for all providers and airflow (#36698) - Bump follow-redirects from 1.15.3 to 1.15.4 in
/airflow/www
(#36700) - Provide the logger_name param to base hook in order to override the logger name (#36674)
- Fix run type icon alignment with run type text (#36616)
- Follow BaseHook connection fields method signature in FSHook (#36444)
- Remove redundant
docker
decorator type annotations (#36406) - Straighten typing in workday timetable (#36296)
- Use
batch_is_authorized_dag
to check if user has permission to read DAGs (#36279) - Replace deprecated get_accessible_dag_ids and use get_readable_dags in get_dag_warnings (#36256)
Doc Only Changes
- Metrics tagging documentation (#36627)
- In docs use logical_date instead of deprecated execution_date (#36654)
- Add section about live-upgrading Airflow (#36637)
- Replace
numpy
example with practical exercise demonstrating top-level code (#35097) - Improve and add more complete description in the architecture diagrams (#36513)
- Improve the error message displayed when there is a webserver error (#36570)
- Update
dags.rst
with information on DAG pausing (#36540) - Update installation prerequisites after upgrading to Debian Bookworm (#36521)
- Add description on the ways how users should approach DB monitoring (#36483)
- Add branching based on mapped task group example to dynamic-task-mapping.rst (#36480)
- Add further details to replacement documentation (#36485)
- Use cards when describing priority weighting methods (#36411)
- Update
metrics.rst
for paramdagrun.schedule_delay
(#36404) - Update admonitions in Python operator doc to reflect sentiment (#36340)
- Improve audit_logs.rst (#36213)
- Remove Redshift mention from the list of managed Postgres backends (#36217)
Apache Airflow 2.8.0
Significant Changes
-
Raw HTML code in DAG docs and DAG params descriptions is disabled by default
To ensure that no malicious javascript can be injected with DAG descriptions or trigger UI forms by DAG authors
a new parameterwebserver.allow_raw_html_descriptions
was added with default value ofFalse
.
If you trust your DAG authors code and want to allow using raw HTML in DAG descriptions and params, you can restore the previous
behavior by setting the configuration value toTrue
.To ensure Airflow is secure by default, the raw HTML support in trigger UI has been super-seeded by markdown support via
thedescription_md
attribute. If you have been usingdescription_html
please migrate todescription_md
.
Thecustom_html_form
is now deprecated. (#35460)
New Features
- AIP-58: Add Airflow ObjectStore (AFS) (AIP-58)
- Add XCom tab to Grid (#35719)
- Add "literal" wrapper to disable field templating (#35017)
- Add task context logging feature to allow forwarding messages to task logs (#32646, #32693, #35857)
- Add Listener hooks for Datasets (#34418, #36247)
- Allow override of navbar text color (#35505)
- Add lightweight serialization for deltalake tables (#35462)
- Add support for serialization of iceberg tables (#35456)
prev_end_date_success
method access (#34528)- Add task parameter to set custom logger name (#34964)
- Add pyspark decorator (#35247)
- Add trigger as a valid option for the db clean command (#34908)
- Add decorators for external and venv python branching operators (#35043)
- Allow PythonVenvOperator using other index url (#33017)
- Add Python Virtualenv Operator Caching (#33355)
- Introduce a generic export for containerized executor logging (#34903)
- Add ability to clear downstream tis in
List Task Instances
view (#34529) - Attribute
clear_number
to track DAG run being cleared (#34126) - Add BranchPythonVirtualenvOperator (#33356)
- Allow PythonVenvOperator using other index url (#33017)
- Add CLI notification commands to providers (#33116)
- Use dropdown instead of buttons when there are more than 10 retries in log tab (#36025)
Improvements
- Add
multiselect
to run state in grid view (#35403) - Fix warning message in
Connection.get_hook
in case of ImportError (#36005) - Add processor_subdir to import_error table to handle multiple dag processors (#35956)
- Consolidate the call of change_state to fail or success in the core executors (#35901)
- Relax mandatory requirement for start_date when schedule=None (#35356)
- Use ExitStack to manage mutation of secrets_backend_list in dag.test (#34620)
- improved visibility of tasks in ActionModal for
taskinstance
(#35810) - Create directories based on
AIRFLOW_CONFIG
path (#35818) - Implements
JSON-string
connection representation generator (#35723) - Move
BaseOperatorLink
into the separate module (#35032) - Set mark_end_on_close after set_context (#35761)
- Move external logs links to top of react logs page (#35668)
- Change terminal mode to
cbreak
inexecute_interactive
and handleSIGINT
(#35602) - Make raw HTML descriptions configurable (#35460)
- Allow email field to be templated (#35546)
- Hide logical date and run id in trigger UI form (#35284)
- Improved instructions for adding dependencies in TaskFlow (#35406)
- Add optional exit code to list import errors (#35378)
- Limit query result on DB rather than client in
synchronize_log_template
function (#35366) - Allow description to be passed in when using variables CLI (#34791)
- Allow optional defaults in required fields with manual triggered dags (#31301)
- Permitting airflow kerberos to run in different modes (#35146)
- Refactor commands to unify daemon context handling (#34945)
- Add extra fields to plugins endpoint (#34913)
- Add description to pools view (#34862)
- Move cli's Connection export and Variable export command print logic to a separate function (#34647)
- Extract and reuse get_kerberos_principle func from get_kerberos_principle (#34936)
- Change type annotation for
BaseOperatorLink.operators
(#35003) - Optimise and migrate to
SA2-compatible
syntax for TaskReschedule (#33720) - Consolidate the permissions name in SlaMissModelView (#34949)
- Add debug log saying what's being run to
EventScheduler
(#34808) - Increase log reader stream loop sleep duration to 1 second (#34789)
- Resolve pydantic deprecation warnings re
update_forward_refs
(#34657) - Unify mapped task group lookup logic (#34637)
- Allow filtering event logs by attributes (#34417)
- Make connection login and password TEXT (#32815)
- Ban import
Dataset
fromairflow
package in codebase (#34610) - Use
airflow.datasets.Dataset
in examples and tests (#34605) - Enhance task status visibility (#34486)
- Simplify DAG trigger UI (#34567)
- Ban import AirflowException from airflow (#34512)
- Add descriptions for airflow resource config parameters (#34438)
- Simplify trigger name expression (#34356)
- Move definition of Pod*Exceptions to pod_generator (#34346)
- Add deferred tasks to the cluster_activity view Pools Slots (#34275)
- heartbeat failure log message fix (#34160)
- Rename variables for dag runs (#34049)
- Clarify new_state in OpenAPI spec (#34056)
- Remove
version
top-level element from docker compose files (#33831) - Remove generic trigger cancelled error log (#33874)
- Use
NOT EXISTS
subquery instead oftuple_not_in_condition
(#33527) - Allow context key args to not provide a default (#33430)
- Order triggers by - TI priority_weight when assign unassigned triggers (#32318)
- Add metric
triggerer_heartbeat
(#33320) - Allow
airflow variables export
to print to stdout (#33279) - Workaround failing deadlock when running backfill (#32991)
- add dag_run_ids and task_ids filter for the batch task instance API endpoint (#32705)
- Configurable health check threshold for triggerer (#33089)
- Rework provider manager to treat Airflow core hooks like other provider hooks (#33051)
- Ensure DAG-level references are filled on unmap (#33083)
- Affix webserver access_denied warning to be configurable (#33022)
- Add support for arrays of different data types in the Trigger Form UI (#32734)
- Add a mechanism to warn if executors override existing CLI commands (#33423)
Bug Fixes
- Account for change in UTC offset when calculating next schedule (#35887)
- Add read access to pools for viewer role (#35352)
- Fix gantt chart queued duration when queued_dttm is greater than start_date for deferred tasks (#35984)
- Avoid crushing container when directory is not found on rm (#36050)
- Update
reset_user_sessions
to work from either CLI or web (#36056) - Fix UI Grid error when DAG has been removed. (#36028)
- Change Trigger UI to use HTTP POST in web ui (#36026)
- Fix airflow db shell needing an extra key press to exit (#35982)
- Change dag grid
overscroll
behaviour to auto (#35717) - Run triggers inline with dag test (#34642)
- Add
borderWidthRight
to grid for Firefoxscrollbar
(#35346) - Fix for infinite recursion due to secrets_masker (#35048)
- Fix write
processor_subdir
in serialized_dag table (#35661) - Reload configuration for standalone dag file processor (#35725)
- Long custom operator name overflows in graph view (#35382)
- Add try_number to extra links query (#35317)
- Prevent assignment of non JSON serializable values to DagRun.conf dict (#35096)
- Numeric values in DAG details are incorrectly rendered as timestamps (#35538)
- Fix Scheduler and triggerer crashes in daemon mode when statsd metrics are enabled (#35181)
- Infinite UI redirection loop after deactivating an active user (#35486)
- Bug fix fetch_callback of Partial Subset DAG (#35256)
- Fix DagRun data interval for DeltaDataIntervalTimetable (#35391)
- Fix query in
get_dag_by_pickle
util function (#35339) - Fix TriggerDagRunOperator failing to trigger subsequent runs when reset_dag_run=True (#35429)
- Fix weight_rule property type in
mappedoperator
(#35257) - Bugfix/prevent concurrency with cached venv (#35258)
- Fix dag serialization (#34042)
- Fix py/url-redirection by replacing request.referrer by get_redirect() (#34237)
- Fix updating variables during variable imports (#33932)
- Use Literal from airflow.typing_compat in Airflow core (#33821)
- Always use
Literal
fromtyping_extensions
(#33794)
Miscellaneous
- Change default MySQL client to MariaDB (#36243)
- Mark daskexecutor provider as removed (#35965)
- Bump FAB to
4.3.10
(#35991) - Mark daskexecutor provider as removed (#35965)
- Rename
Connection.to_json_dict
toConnection.to_dict
(#35894) - Upgrade to Pydantic v2 (#35551)
- Bump
moto
version to>= 4.2.9
(#35687) - Use
pyarrow-hotfix
to mitigate CVE-2023-47248 (#35650) - Bump
axios
from0.26.0 to 1.6.0
in/airflow/www/
(#35624) - Make docker decorator's type annotation consistent with operator (#35568)
- Add default to
navbar_text_color
andrm
condition in style (#35553) - Avoid initiating session twice in
dag_next_execution
(#35539) - Work around typing issue in examples and providers (#35494)
- Enable
TCH004
andTCH005
rules (#35475) - Humanize log output about retrieved DAG(s) (#35338)
- Switch from Black to Ruff formatter (#35287)
- Upgrade to Flask Application Builder 4.3.9 (#35085)
- D401 Support (#34932, #34933)
- Use requires_access to check read permission on dag instead of checking it explicitly (#34940)
- Deprecate lazy import
AirflowException
from airflow (#34541) - View util refactoring on mapped stuff use cases (#34638)
- Bump
postcss
from8.4.25 to 8.4.31
in/airflow/www
(#34770) - Refactor Sqlalchemy queries to 2.0 s...
Apache Airflow 2.7.3
Significant Changes
No significant changes.
Bug Fixes
- Fix pre-mature evaluation of tasks in mapped task group (#34337)
- Add TriggerRule missing value in rest API (#35194)
- Fix Scheduler crash looping when dagrun creation fails (#35135)
- Fix test connection with
codemirror
and extra (#35122) - Fix usage of cron-descriptor since BC in v1.3.0 (#34836)
- Fix
get_plugin_info
for class based listeners. (#35022) - Some improvements/fixes for dag_run and task_instance endpoints (#34942)
- Fix the dags count filter in webserver home page (#34944)
- Return only the TIs of the readable dags when ~ is provided as a dag_id (#34939)
- Fix triggerer thread crash in daemon mode (#34931)
- Fix wrong plugin schema (#34858)
- Use DAG timezone in TimeSensorAsync (#33406)
- Mark tasks with
all_skipped
trigger rule asskipped
if any task is inupstream_failed
state (#34392) - Add read only validation to read only fields (#33413)
Misc/Internal
- Improve testing harness to separate DB and non-DB tests (#35160, #35333)
- Add pytest db_test markers to our tests (#35264)
- Add pip caching for faster build (#35026)
- Upper bound
pendulum
requirement to<3.0
(#35336) - Limit
sentry_sdk
to1.33.0
(#35298) - Fix subtle bug in mocking processor_agent in our tests (#35221)
- Bump
@babel/traverse
from7.16.0 to 7.23.2
in/airflow/www
(#34988) - Bump
undici
from5.19.1 to 5.26.3
in/airflow/www
(#34971) - Remove unused set from
SchedulerJobRunner
(#34810) - Remove warning about
max_tis per query > parallelism
(#34742) - Improve modules import in Airflow core by moving some of them into a type-checking block (#33755)
- Fix tests to respond to Python 3.12 handling of utcnow in sentry-sdk (#34946)
- Add
connexion<3.0
upper bound (#35218) - Limit Airflow to
< 3.12
(#35123) - update moto version (#34938)
- Limit WTForms to below
3.1.0
(#34943)
Doc Only Changes
- Fix variables substitution in Airflow Documentation (#34462)
- Added example for defaults in
conn.extras
(#35165) - Update datasets.rst issue with running example code (#35035)
- Remove
mysql-connector-python
from recommended MySQL driver (#34287) - Fix syntax error in task dependency
set_downstream
example (#35075) - Update documentation to enable test connection (#34905)
- Update docs errors.rst - Mention sentry "transport" configuration option (#34912)
- Update dags.rst to put SubDag deprecation note right after the SubDag section heading (#34925)
- Add info on getting variables and config in custom secrets backend (#34834)
- Document BaseExecutor interface in more detail to help users in writing custom executors (#34324)
- Fix broken link to
airflow_local_settings.py
template (#34826) - Fixes python_callable function assignment context kwargs example in params.rst (#34759)
- Add missing multiple_outputs=True param in the TaskFlow example (#34812)
- Remove extraneous
'>'
in provider section name (#34813) - Fix imports in extra link documentation (#34547)
Apache Airflow 2.7.2
Significant Changes
No significant changes
Bug Fixes
- Check if the lower of provided values are sensitives in config endpoint (#34712)
- Add support for ZoneInfo and generic UTC to fix datetime serialization (#34683, #34804)
- Fix AttributeError: 'Select' object has no attribute 'count' during the airflow db migrate command (#34348)
- Make dry run optional for patch task instance (#34568)
- Fix non deterministic datetime deserialization (#34492)
- Use iterative loop to look for mapped parent (#34622)
- Fix is_parent_mapped value by checking if any of the parent
taskgroup
is mapped (#34587) - Avoid top-level airflow import to avoid circular dependency (#34586)
- Add more exemptions to lengthy metric list (#34531)
- Fix dag warning endpoint permissions (#34355)
- Fix task instance access issue in the batch endpoint (#34315)
- Correcting wrong time showing in grid view (#34179)
- Fix www
cluster_activity
view not loading due tostandaloneDagProcessor
templating (#34274) - Set
loglevel=DEBUG
in 'Not syncingDAG-level
permissions' (#34268) - Make param validation consistent for DAG validation and triggering (#34248)
- Ensure details panel is shown when any tab is selected (#34136)
- Fix issues related to
access_control={}
(#34114) - Fix not found
ab_user
table in the CLI session (#34120) - Fix FAB-related logging format interpolation (#34139)
- Fix query bug in
next_run_datasets_summary
endpoint (#34143) - Fix for TaskGroup toggles for duplicated labels (#34072)
- Fix the required permissions to clear a TI from the UI (#34123)
- Reuse
_run_task_session
in mappedrender_template_fields
(#33309) - Fix scheduler logic to plan new dag runs by ignoring manual runs (#34027)
- Add missing audit logs for Flask actions add, edit and delete (#34090)
- Hide Irrelevant Dag Processor from Cluster Activity Page (#33611)
- Remove infinite animation for pinwheel, spin for 1.5s (#34020)
- Restore rendering of provider configuration with
version_added
(#34011)
Doc Only Changes
- Clarify audit log permissions (#34815)
- Add explanation for Audit log users (#34814)
- Import
AUTH_REMOTE_USER
from FAB in WSGI middleware example (#34721) - Add information about drop support MsSQL as DB Backend in the future (#34375)
- Document how to use the system's timezone database (#34667)
- Clarify what landing time means in doc (#34608)
- Fix screenshot in dynamic task mapping docs (#34566)
- Fix class reference in Public Interface documentation (#34454)
- Clarify var.value.get and var.json.get usage (#34411)
- Schedule default value description (#34291)
- Docs for triggered_dataset_event (#34410)
- Add DagRun events (#34328)
- Provide tabular overview about trigger form param types (#34285)
- Add link to Amazon Provider Configuration in Core documentation (#34305)
- Add "security infrastructure" paragraph to security model (#34301)
- Change links to SQLAlchemy 1.4 (#34288)
- Add SBOM entry in security documentation (#34261)
- Added more example code for XCom push and pull (#34016)
- Add state utils to Public Airflow Interface (#34059)
- Replace markdown style link with rst style link (#33990)
- Fix broken link to the "UPDATING.md" file (#33583)
Misc/Internal
- Update min-sqlalchemy version to account for latest features used (#34293)
- Fix SesssionExemptMixin spelling (#34696)
- Restrict
astroid
version < 3 (#34658) - Fail dag test if defer without triggerer (#34619)
- Fix connections exported output (#34640)
- Don't run isort when creating new alembic migrations (#34636)
- Deprecate numeric type python version in PythonVirtualEnvOperator (#34359)
- Refactor
os.path.splitext
toPath.*
(#34352, #33669) - Replace = by is for type comparison (#33983)
- Refactor integer division (#34180)
- Refactor: Simplify comparisons (#34181)
- Refactor: Simplify string generation (#34118)
- Replace unnecessary dict comprehension with dict() in core (#33858)
- Change "not all" to "any" for ease of readability (#34259)
- Replace assert by if...raise in code (#34250, #34249)
- Move default timezone to except block (#34245)
- Combine similar if logic in core (#33988)
- Refactor: Consolidate import and usage of random (#34108)
- Consolidate importing of os.path.* (#34060)
- Replace sequence concatenation by unpacking in Airflow core (#33934)
- Refactor unneeded 'continue' jumps around the repo (#33849, #33845, #33846, #33848, #33839, #33844, #33836, #33842)
- Remove [project] section from
pyproject.toml
(#34014) - Move the try outside the loop when this is possible in Airflow core (#33975)
- Replace loop by any when looking for a positive value in core (#33985)
- Do not create lists we don't need (#33519)
- Remove useless string join from core (#33969)
- Add TCH001 and TCH002 rules to pre-commit to detect and move type checking modules (#33865)
- Add cancel_trigger_ids to to_cancel dequeue in batch (#33944)
- Avoid creating unnecessary list when parsing stats datadog tags (#33943)
- Replace dict.items by dict.values when key is not used in core (#33940)
- Replace lambdas with comprehensions (#33745)
- Improve modules import in Airflow core by some of them into a type-checking block (#33755)
- Refactor: remove unused state - SHUTDOWN (#33746, #34063, #33893)
- Refactor: Use in-place .sort() (#33743)
- Use literal dict instead of calling dict() in Airflow core (#33762)
- remove unnecessary map and rewrite it using list in Airflow core (#33764)
- Replace lambda by a def method in Airflow core (#33758)
- Replace type func by
isinstance
in fab_security manager (#33760) - Replace single quotes by double quotes in all Airflow modules (#33766)
- Merge multiple
isinstance
calls for the same object in a single call (#33767) - Use a single statement with multiple contexts instead of nested statements in core (#33769)
- Refactor: Use f-strings (#33734, #33455)
- Refactor: Use random.choices (#33631)
- Use
str.splitlines()
to split lines (#33592) - Refactor: Remove useless str() calls (#33629)
- Refactor: Improve detection of duplicates and list sorting (#33675)
- Simplify conditions on
len()
(#33454)