Is possible to add a loop condition in Airflow? #21726

appassionate · 2022-02-22T07:35:47Z

appassionate
Feb 22, 2022

how about a operator for "clear"

According to the "Directed Acyclic Graph " (DAG) concept, we will not get a explict loop flow graph in our dag or webserverUI.
For a loop condition I am always faced with such a condition:

BAD DAG

[do a task] >> is_result_ok? >> no >> continue [do a task] (which is not DAG!)
is_result_ok >> Yes >> end_loop

GOOD DAG

I believe we CAN NOT add “>> do a task“ in the downstream for it has occured in the upstream.
BUT, it is possible that we can "CLEAR" the previous task just like we do in UI sometimes.

So, solution might be:
[do a task] >> is_result_ok? >> no >> clear_previous_do_a_task (it is DAG, it might be a special operator)
is_result_ok >> Yes >> end_loop

Is this feasible？

it will be tricky:

to have such a "evalute" section
how to prepare next "do a task" parameters for we have to change to reach the good condition.

potiuk · 2022-02-26T23:15:19Z

potiuk
Feb 26, 2022
Collaborator

Look at Branch Operators - for example here: https://www.astronomer.io/guides/airflow-branch-operator/

1 reply

appassionate Feb 28, 2022
Author

Look at Branch Operators - for example here: https://www.astronomer.io/guides/airflow-branch-operator/
Many thanks! I believe I know what branchoperators do by using the example dag. I focus the implementation of "loop" more which i mean doing the similar things within states updating. : )

appassionate · 2022-03-02T08:01:13Z

appassionate
Mar 2, 2022
Author

To handle the "loop" condition, i have a simple example DAG if interested.
But there still be tricky for clearing task operation suggest the disappearance of XCOM parameter in taskinstances.

I imaged if there will be a XCOM object for "DAG level" but "taskinstance level", which will be more "global". I know Variables can manage, but it still be a trouble when you just want to temporally record a special DAG value.
Any suggestions will be welcome! Thanks all.

0 replies

appassionate · 2022-04-07T03:47:42Z

appassionate
Apr 7, 2022
Author

example dag here if interested. : ) @potiuk

I believe it has not break the "acyclic" rule although it still be very trick.
Airflow can do many things in different fields including machine learning. There might be some other tools such as kubeflow but still (in my opinion) cant establish such a DAG "network".

But the loop condition still trouble me.
I image the universal ML workflow like this:
model building >> model analysis>> model evaluation >> {next similar loop for bad condition in evaluation}

The dag below can manage it,but it still be some probs.

xcom for a task update is neccessary (also tricky)
not clean for evaluation task
callback function is tricky
no xcom for earlier iteration because the clear operation

I think airflow will be more universal if having such a feature to manage such a loop workflow or other similar condition in our daily use avoiding TriggerDagRunOpt running another DAG instance.

Many thanks!

import numpy as np

from airflow.models import DAG
from airflow.models import XCom

from airflow.operators.python import BranchPythonOperator
from airflow.operators.python import get_current_context

from airflow.utils.dates import days_ago
from airflow.utils.edgemodifier import Label

from airflow.decorators import task_group, task


DAG_NAME = "clear_loop_example"
default_parameters = {}

def dummy_evaluate(result):

    return False

def do_evaluate(evaluate_func, result, fail_task_id, success_task_id):

    ctx = get_current_context()
    cond = evaluate_func(result)
    ctx["ti"].xcom_push(key="condition" ,value=cond)

    if cond:
        return fail_task_id
    else:
        return success_task_id

@task
def init_loop():

    return 1

@task
def update_loop():

    ctx =get_current_context()
    loop_num = ctx["ti"].xcom_pull(task_ids="init_loop",key="return_value")

    print("loop_num: ", loop_num)

    XCom.set(
            key="return_value",
            value=loop_num+1,
            task_id="init_loop",
            dag_id=DAG_NAME,
            execution_date=ctx["execution_date"],
            )

    pass

@task
def print_loop_num(loop_num):

    print(loop_num)
    return loop_num

@task
def opts_in_loop():

    pass

@task
def report():

    pass

def get_xcom_from_evaluate(evaluate_xcomargs):

    cond = evaluate_xcomargs.output.get("condition")    
    if cond == True:
        
        return True
    else:
        raise ValueError("DUMMY")
@task
def success():
    pass

def clear_upstream_task(context):
    execution_date = context.get("execution_date")
    dag.clear(exclude_task_ids=["init_loop"], start_date=execution_date)
    print("successfully clear task instance")
    return True

with DAG(dag_id=DAG_NAME,
 default_args=default_parameters,
  schedule_interval=None, start_date=days_ago(2),) as dag:


    some_evaluate = BranchPythonOperator(
    task_id='evaluate',
    python_callable=do_evaluate,
    op_args=[dummy_evaluate, "blabla","success","update_loop"],
    )
    
    clean = task(get_xcom_from_evaluate,
    task_id='run_next_loop',
    trigger_rule="none_failed",
    on_failure_callback=clear_upstream_task)(some_evaluate)

    loop_num = init_loop()
    print_loop_num(loop_num) >> opts_in_loop() >> some_evaluate

    some_evaluate >> Label("success") >> success() >> clean
    some_evaluate >> Label("failed") >> update_loop() >> clean

    clean >> report()

5 replies

potiuk Apr 7, 2022
Collaborator

That's certainly very abusive way of using Airlfow. Way better will be to have external script to trigger execution of such loops as actually spearate dag runs. Airflow 2.2+ have dag_run_id as primary key and you can simply launch (via API) multiple DAG RUN executions either parallel or sequential. Using Airflow to clear own tasks and re-run makes very little sense as you have no history.

Simply treat DAG Run as single loop pass and control it externally. You need to do it anyway, because you need to know when to stop/start experiment.

What you've done is completely unmaintainable and is really trying to use hammer to turn the screw. Probably you can do it if you try hard enough, but does it make sense? Not at all. Better use screwdriver.

appassionate Apr 7, 2022
Author

Ok..... Anyway, thanks for your sincere reply. : )

appassionate Apr 7, 2022
Author

Honestly, I think it is an "idea" .
Like AIP-42, I imaged there might be some advanced method added like task_group.run_as_loop() (similar to task.map).
Maybe it can enrich the related airflow ecosystems, which saves some extra trigger operations.

potiuk Apr 7, 2022
Collaborator

Feel free to discuss it in devlist and propose and stand by it if you think it's a good idea. I think we are already going to implement "Retry task group feature. We also have something theat actually might be way better for your case without introducing loops.

https://cwiki.apache.org/confluence/display/AIRFLOW/AIP-42+Dynamic+Task+Mapping

The first version of it is going to be released in Airlfow 2.3 (soon-ish) and rather than "looping a task" it will alow you to run "n parallel incarnations of a task" . This fits much better the DAG approach of Airflow and allows you for example to run X parallel machine learnign experiments - each with different sets of parameters. It's not yet "Dynamic Group Mapping" which is probably what woudl be closest to what you propose, but if anything - this is a better approach IMHO.

appassionate Apr 7, 2022
Author

Feel free to discuss it in devlist and propose and stand by it if you think it's a good idea. I think we are already going to implement "Retry task group feature. We also have something theat actually might be way better for your case without introducing loops.

https://cwiki.apache.org/confluence/display/AIRFLOW/AIP-42+Dynamic+Task+Mapping

The first version of it is going to be released in Airlfow 2.3 (soon-ish) and rather than "looping a task" it will alow you to run "n parallel incarnations of a task" . This fits much better the DAG approach of Airflow and allows you for example to run X parallel machine learnign experiments - each with different sets of parameters. It's not yet "Dynamic Group Mapping" which is probably what woudl be closest to what you propose, but if anything - this is a better approach IMHO.

yeah, looking forward to the release of 2.3.0.

philippefutureboy · 2023-08-03T15:37:45Z

philippefutureboy
Aug 3, 2023

For future references for those that want to implement a looping condition in Airflow, here's a possible implementation:

import abc
from typing import Any, Generic, Mapping, TypeVar, Union
from airflow.utils.operator_helpers import KeywordParameters

T = TypeVar('T')

class AbstractLoop(abc.ABC, Generic[T]):
    """
    Abstract class to execute as a python_callable using a PythonOperator.
    Runs the `run` method until the `condition` method returns false.

    Example:

    PythonOperator(
        task_id="task_looper"
        python_callable=Loop(
            ...args
        ),
        op_kwargs={
            ...op_kwargs
        }
    )
    """

    def __call__(self, **context)
        condition_kwargs = self._determine_kwargs(self.condition, context)
        run_kwargs = self._determine_kwargs(loop.run, context)

        result = None
        while loop.condition(previous=result, **condition_kwargs):
            result = loop.run(previous=result, **run_kwargs)

    @abc.abstractmethod
    def condition(self, previous: Union[T, None], **context) -> bool:
        pass

    @abc.abstractmethod
    def run(self, previous: Union[T, None], **context) -> T:
        pass

    def _determine_kwargs(self, fn, context: Mapping[str, Any]) -> Mapping[str, Any]:
        return KeywordParameters.determine(fn, (), context).unpacking()

Here you can find an example of how this can be used - My specific use case is to backfill a dbt snapshot given a set of daily captures.

from typing import Union
import datetime as dt

from airflow.models.dagrun import DagRun
from airflow.operators.trigger_dagrun import TriggerDagRunOperator
from airflow.utils import timezone
from airflow.utils.types import DagRunType

class DateIntervalTriggerDagRunLoop(AbstractLoop[dt.date]):
    """
    Triggers a DagRun for a given dag_id for each date in a given interval.

    Args:
        trigger_dag_id (str): Identifier of the Dag to trigger.
        min_date_conf_key (str): DagRun configuration key for lower bound of the date interval.
                                 Defaults to "min_date".
        max_date_conf_key (str): DagRun configuration key for upper bound of the date interval.
                                 Defaults to "max_date".
        date_task_id (str, optional): Task id of a task within the Dag to trigger that pushes the date
                                      of the current iteration to XCom.
                                      Optional only if dag_run.conf[min_date_conf_key] is not None.
        date_xcom_key (str, optional): XCom key used to push the date of the current iteration.
                                       Optional only if dag_run.conf[min_date_conf_key] is not None.

    PythonOperator Op Kwargs:
        conf (Dict[str, str], optional): Configuration to pass to the DagRuns to trigger.
    
    """
    def __init__(
        self,
        trigger_dag_id: str,
        min_date_conf_key: Optional[str] = "min_date",
        max_date_conf_key: Optional[str] = "max_date",
        date_task_id: Optional[str] = None,
        date_xcom_key: Optional[str] = None
    ):
        self.trigger_dag_id = trigger_dag_id
        self.min_date_conf_key = min_date_conf_key
        self.max_date_conf_key = max_date_conf_key
        self.date_task_id = date_task_id
        self.date_xcom_key = date_xcom_key


    def condition(
        self,
        previous: Union[dt.date, None],
        dag_run,
    ):
        # If previous is None, this is the first iteration. Keep looping.
        if previous is None:
            return True
        
        # Else, this is a subsequent iteration. Determine what the upper bound is.
        # Defaults to dag_run.execution_date to avoid looping in the future.
        max_date = (
            dag_run.execution_date.date()
            if dag_run.conf.get(self.max_date_conf_key) is None
            else dt.datetime.strptime(dag_run.conf.get(self.max_date_conf_key), '%Y-%m-%d').date()
        )
        # Keep looping if upper bound has not been met
        return previous.date <= max_date


    def run(
        self,
        previous: Union[dt.date, None],
        dag_run: DagRun,
        ti: TaskInstance,
        conf: Dict[str, str],
    ):
        # Get the date from the previous iteration if possible, else from the minimum date as specified
        # in dag_run.conf.
        date = (
            dag_run.conf.get(self.min_date_conf_key, None)
            if previous is None
            else previous + dt.timedelta(days=1)
        )

        dag_run = context['dag_run']
        
        # Prepare the conf for the DagRun to trigger
        trigger_dag_conf = {
            **conf,
            "date": None if date is None else date.strftime('%Y-%m-%d'),
        }

        # Create a trigger_run_id - implementation from airflow/operators/trigger_dagrun.py
        trigger_run_id = DagRun.generate_run_id(DagRunType.MANUAL, timezone.utcnow())

        # Create a unique task_id from trigger_run_id
        task_id = f"{trigger_dag_id}_{hex(hash(trigger_run_id))[2:]}"

        # Trigger the DagRun & wait for completion
        TriggerDagRunOperator(
            task_id=task_id,
            trigger_dag_id=trigger_dag_id,
            trigger_run_id=trigger_run_id,
            conf=trigger_dag_conf,
            failed_states=["failed"],
            poke_interval=30,
            wait_for_completion=True,
        ).execute(context)

        # If the date is None - that is if no previous run AND no minimum date specified, get
        # iteration date from a task of the dagrun that returns the date of the iteration.
        if date is None:
            assert self.date_task_id
            assert self.date_xcom_key
            date = ti.xcom_pull(run_id=trigger_run_id, task_id=self.date_task_id, key=self.date_xcom_key)
        
        return date

PythonOperator(
    task_id="snapshot_backfill_my_table"
    python_callable=DateIntervalTriggerDagRunLoop(
        trigger_dag_id="snapshot-backfill",
        date_task_id="determine_latest_snapshot",
        date_xcom_key="date",
        min_date_conf_key="min_date",
        max_date_conf_key="max_date",
    ),
    op_kwargs={
        "conf": {
            "table_id": "{{dag_run.conf.project}}.{{dag_run.conf.client_id}}.stg_my_table",
            "snapshot_table_id": "{{dag_run.conf.project}}.{{dag_run.conf.client_id}}.snapshot_my_table"
        }
    }
)

For the case of @appassionate, a similar class could be created where the loop condition is a threshold or set of threshold on the model's performance.

1 reply

appassionate Nov 10, 2024
Author

thanks!:) long time to see this discussion.
now, i think tasks in airflow are pre-defined according to DAG concept. it means "trigger" will create extra flow which maybe be hard to trace, thks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is possible to add a loop condition in Airflow? #21726

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 4 comments 7 replies

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

Is possible to add a loop condition in Airflow? #21726

appassionate Feb 22, 2022

how about a operator for "clear"

BAD DAG

GOOD DAG

Is this feasible？

Replies: 4 comments · 7 replies

potiuk Feb 26, 2022 Collaborator

appassionate Feb 28, 2022 Author

appassionate Mar 2, 2022 Author

appassionate Apr 7, 2022 Author

potiuk Apr 7, 2022 Collaborator

appassionate Apr 7, 2022 Author

appassionate Apr 7, 2022 Author

potiuk Apr 7, 2022 Collaborator

appassionate Apr 7, 2022 Author

philippefutureboy Aug 3, 2023

appassionate Nov 10, 2024 Author

appassionate
Feb 22, 2022

Replies: 4 comments 7 replies

potiuk
Feb 26, 2022
Collaborator

appassionate Feb 28, 2022
Author

appassionate
Mar 2, 2022
Author

appassionate
Apr 7, 2022
Author

potiuk Apr 7, 2022
Collaborator

appassionate Apr 7, 2022
Author

appassionate Apr 7, 2022
Author

potiuk Apr 7, 2022
Collaborator

appassionate Apr 7, 2022
Author

philippefutureboy
Aug 3, 2023

appassionate Nov 10, 2024
Author