Prevent breaking scheduler on task_instance.priority_weight column has overflow value #22784

kosteev · 2022-04-06T14:40:42Z

…s overflow value

eladkal · 2022-04-06T15:07:49Z

The DAG shared on the issue runs just fine with SQLite.

I'm not sure this is the right fix. Why should the model be concerned with issues originating from a specific DB?
I suspect the same problem could also be reported for retries, pool_slots, etc.
I think the fix should be at the scheduler level (at least by not crashing), or perhaps by raising a broken DAG error?

uranusjr · 2022-04-06T17:48:06Z

The fix is specific to Postgres I believe. Every database has different integer ranges so this will need to contain some if-else branches to cover each of them. Although honestly maybe it’s even better to just document this is a database limitation and you just can’t go over the limit.
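To make the point about differing integer ranges concrete, here is a minimal sketch. The bounds follow the documented limits of each database's default integer type; the dictionary and the helper name `fits_integer_column` are illustrative and not part of Airflow.

```python
# Signed bounds of the default integer column type per backend.
# The names below are hypothetical; only the numeric limits come from
# the databases' documentation.
INTEGER_BOUNDS = {
    "postgresql": (-2**31, 2**31 - 1),  # 4-byte INTEGER
    "mysql": (-2**31, 2**31 - 1),       # 4-byte INT
    "sqlite": (-2**63, 2**63 - 1),      # INTEGER, stored in up to 8 bytes
}


def fits_integer_column(value: int, dialect: str) -> bool:
    """Return True if `value` can be stored in the dialect's integer column."""
    low, high = INTEGER_BOUNDS[dialect]
    return low <= value <= high


weight = 2**31  # one past the 32-bit signed maximum
print(fits_integer_column(weight, "postgresql"))  # False
print(fits_integer_column(weight, "sqlite"))      # True
```

This is consistent with the observation above that the same DAG runs fine on SQLite while overflowing on Postgres.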

eladkal · 2022-04-06T18:01:23Z

> The fix is specific to Postgres I believe. Every database has different integer ranges so this will need to contain some if-else branches to cover each of them. Although honestly maybe it’s even better to just document this is a database limitation and you just can’t go over the limit.

I agree, but the point of concern is that this causes the scheduler to crash, which is not good.
It means that one DAG can cause cluster-wide problems. I think we need to find a way to localize the effect so that only the problematic DAG is affected.
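A rough sketch of what localizing the effect could look like, assuming a per-DAG processing loop. This is not how the Airflow scheduler is actually structured; `persist_all` and the `persist` callback are hypothetical names, and with a real database session the failed transaction would also need to be rolled back.

```python
import logging
from typing import Callable, Dict, Iterable

log = logging.getLogger(__name__)


def persist_all(dag_ids: Iterable[str], persist: Callable[[str], None]) -> Dict[str, str]:
    """Persist each DAG independently and collect failures instead of crashing.

    Returns a mapping of dag_id to error message for the DAGs that failed.
    """
    failures: Dict[str, str] = {}
    for dag_id in dag_ids:
        try:
            persist(dag_id)
        except Exception as exc:  # broad on purpose: one DAG must not stop the loop
            log.error("Skipping DAG %s: %s", dag_id, exc)
            failures[dag_id] = str(exc)
    return failures


def fake_persist(dag_id: str) -> None:
    # Stand-in for a database write; fails for one DAG only.
    if dag_id == "bad_dag":
        raise OverflowError("priority_weight out of range for integer column")


failures = persist_all(["dag_a", "bad_dag", "dag_b"], fake_persist)
print(sorted(failures))  # ['bad_dag']
```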

uranusjr · 2022-04-06T18:03:16Z

Yeah the scheduler should not crash, but perhaps we could catch the exception or something like that. I seem to recall there’s a similar situation for something else a while ago (datetime out of range?)

eladkal · 2022-04-06T18:17:37Z

I recall only #17003, which also caused a scheduler crash (divide by zero).

ashb · 2022-04-06T18:20:39Z

Is silently clamping it right, or should this be a parse time DAG import error?

uranusjr · 2022-04-06T18:23:16Z

Ah found it, 2213635

So I guess the better fix here is to do something similar: check for Postgres, and prevent an out-of-bounds value from being assigned for Postgres specifically.

+1 to an import error.
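A minimal sketch of such an import-time check, assuming the value is validated against the Postgres 4-byte `integer` range. The exception class, the function name, and the `dialect` argument are all hypothetical; this is not the code from this PR or from Airflow.

```python
# Range of a 4-byte signed Postgres `integer` column.
POSTGRES_INT_MIN = -2**31
POSTGRES_INT_MAX = 2**31 - 1


class PriorityWeightOutOfRange(ValueError):
    """Hypothetical error surfaced as a DAG import error."""


def validate_priority_weight(task_id: str, priority_weight: int, dialect: str) -> None:
    """Raise if `priority_weight` cannot be stored on Postgres.

    Other dialects are not checked in this sketch.
    """
    if dialect != "postgresql":
        return
    if not POSTGRES_INT_MIN <= priority_weight <= POSTGRES_INT_MAX:
        raise PriorityWeightOutOfRange(
            f"Task {task_id!r}: priority_weight {priority_weight} is outside "
            f"[{POSTGRES_INT_MIN}, {POSTGRES_INT_MAX}]"
        )


validate_priority_weight("ok_task", 10, "postgresql")  # passes silently
try:
    validate_priority_weight("big_task", 2**31, "postgresql")
except PriorityWeightOutOfRange as err:
    print(err)
```

Raising at parse time keeps the failure attached to the offending DAG rather than surfacing later as a database error in the scheduler.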

kosteev · 2022-04-07T09:38:55Z

All right, validating it during DAG import and throwing an error makes sense.
What about different databases? Should we handle all of them, taking into account the different constraints of each?

uranusjr · 2022-04-07T13:48:42Z

Let’s start with adding just a check for Postgres now, we can add more later.

Prevent breaking scheduler on task_instance.priority_weight column ha…

a707042

…s overflow value

kosteev requested review from kaxil, XD-DENG and ashb as code owners April 6, 2022 14:40

kosteev closed this Apr 19, 2022

tiranux mentioned this pull request Sep 7, 2023

Validate DAG task_instance.priority_weight column overflow value during import #34168

Closed

This was referenced Sep 18, 2024

Change weight priority type from Integer to Float VladaZakharova/airflow#110

Closed

Change weight priority type from Integer to Float #42410

Closed

jscheffl mentioned this pull request Nov 2, 2024

Ensure priority weight is capped at 32-bit integer to prevent roll-over #43611

Merged
