Refactor _executable_task_instances_to_queued to make logic more readable by ashb · Pull Request #66878 · apache/airflow

ashb · 2026-05-13T16:06:10Z

I was looking at the core scheduler logic, and _executable_task_instances_to_queued was rather large and hard even for me to understand, and past me wrote a good chunk of it!

This splits it into two (plus a helper) focused methods to aid readability:

- _acquire_pool_capacity: takes the advisory lock (held for the lifetime of the session, so longer than just this function) and reads pool utilisation via SELECT FOR UPDATE. Returns (pools, max_tis, starved_pools) so the caller can short-circuit when all pools are full before doing any TI selection work.
- _select_task_instances_to_queue: given pre-computed pool capacity, selects eligible SCHEDULED TIs and moves them to QUEUED. Accepts the pools dict and starved_pools set as parameters, making it directly testable without needing a real lock or DB pool read. This uses the new _build_schedulable_tis_query helper function to build the complex query.

_critical_section_enqueue_task_instances now calls these two methods in sequence, making the two-phase structure (acquire capacity, then select and queue) more explicit.
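The two-phase shape described above can be sketched in plain Python. This is only an illustration, not the code from the PR: `PoolStats` here is a simplified stand-in, the function names drop the leading underscore, candidates are hypothetical `(task_id, pool_name)` tuples, and the lock and database reads are replaced by data passed in directly.

```python
from dataclasses import dataclass


@dataclass
class PoolStats:
    """Hypothetical, simplified stand-in for the real pool statistics type."""

    total: int
    open: int


def acquire_pool_capacity(pool_rows: dict[str, PoolStats], max_tis: int):
    """Phase 1 (illustrative): work out how much capacity is left.

    The real method also takes a lock and reads from the database; here the
    pool data is simply passed in.
    """
    starved_pools = {name for name, stats in pool_rows.items() if stats.open <= 0}
    total_open = sum(max(stats.open, 0) for stats in pool_rows.values())
    # Assumption: the task limit is capped by the number of open slots.
    return pool_rows, min(max_tis, total_open), starved_pools


def select_task_instances_to_queue(candidates, max_tis, pools, starved_pools):
    """Phase 2 (illustrative): pick candidates that fit into the open slots."""
    remaining = {name: stats.open for name, stats in pools.items()}
    selected = []
    for task_id, pool_name in candidates:
        if len(selected) >= max_tis:
            break
        if pool_name in starved_pools or remaining.get(pool_name, 0) <= 0:
            continue
        remaining[pool_name] -= 1
        selected.append(task_id)
    return selected


def enqueue(candidates, pool_rows, max_tis):
    """Orchestrator: phase 1, short-circuit if nothing is free, then phase 2."""
    pools, max_tis, starved_pools = acquire_pool_capacity(pool_rows, max_tis)
    if max_tis <= 0:
        return []  # every pool is full, so skip the selection work entirely
    return select_task_instances_to_queue(candidates, max_tis, pools, starved_pools)
```

Because phase 2 only takes plain values, a test can call `select_task_instances_to_queue` with hand-built pool data and never touch phase 1.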

All test call sites are updated to call _select_task_instances_to_queue directly with a make_pool_stats() helper, removing the dependency on pool row locking in unit tests.

No behaviour changes, just refactoring.

jedcunningham · 2026-05-13T16:56:36Z


-    def _executable_task_instances_to_queued(self, max_tis: int, session: Session) -> list[TI]:
+    def _acquire_pool_capacity(
+        self, max_tis: int, session: Session


Suggested change

-        self, max_tis: int, session: Session
+        self, max_tis: int, *, session: Session

Or, convert it all to be kwarg only. Same with the others.

I think we had the rule of passing session as a kwarg mainly for the @provide_session decorator, which is not used here, so the caller needs to provide a session anyway. So it is rather a nit.
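For readers less familiar with the bare `*` in a signature, here is a minimal sketch of what the suggestion changes at the call site. The `Session` class and both function names are placeholders invented for this example, not the SQLAlchemy class or the scheduler methods.

```python
class Session:
    """Placeholder for a database session (not SQLAlchemy's class)."""


def positional_ok(max_tis: int, session: Session) -> int:
    # session may be passed positionally or by name
    return max_tis


def keyword_only(max_tis: int, *, session: Session) -> int:
    # everything after the bare * must be passed by name
    return max_tis


session = Session()
positional_ok(5, session)          # accepted
keyword_only(5, session=session)   # accepted

rejected = False
try:
    keyword_only(5, session)       # session passed positionally
except TypeError:
    rejected = True                # Python refuses the positional call
```

The keyword-only form makes call sites self-describing, at the cost of touching every existing positional caller.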

…stances

Split the monolithic `_executable_task_instances_to_queued` into two focused methods:

- `_acquire_pool_capacity`: takes the advisory lock and reads pool utilisation via SELECT FOR UPDATE. Returns (pools, max_tis, starved_pools) so callers can short-circuit when all pools are full before doing any TI selection work.
- `_select_task_instances_to_queue`: given pre-computed pool capacity, selects eligible SCHEDULED TIs and moves them to QUEUED. Accepts the pools dict and starved_pools set as parameters, making it directly testable without needing a real lock or DB pool read.

`_critical_section_enqueue_task_instances` now calls these two methods in sequence, making the two-phase structure (acquire capacity, then select and queue) visible at the orchestration level.

All test call sites updated to call `_select_task_instances_to_queue` directly with a `make_pool_stats()` helper, removing the dependency on pool row locking in unit tests.

jscheffl · 2026-06-14T20:04:44Z

    clear_db_triggers()


+def make_pool_stats(


It would be nicer to make this a real @fixture.
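One way to read this suggestion is pytest's "factory as fixture" pattern, where the fixture returns a builder function so that each test can still pass its own arguments. The sketch below is a guess at the shape only: `build_pool_stats`, its signature, and the simplified stats dict are assumptions made for illustration, not the helper added in this PR.

```python
import pytest


def build_pool_stats(open_slots: dict[str, int]) -> dict[str, dict[str, int]]:
    """Hypothetical builder: map pool name -> simplified stats dict."""
    return {
        name: {"total": max(slots, 0), "open": slots}
        for name, slots in open_slots.items()
    }


@pytest.fixture
def make_pool_stats():
    """Factory fixture: tests receive the builder function itself."""
    return build_pool_stats


def test_starved_pool_is_reported(make_pool_stats):
    # The fixture is requested by name; no import of the helper is needed.
    pools = make_pool_stats({"default_pool": 4, "etl": 0})
    starved = {name for name, stats in pools.items() if stats["open"] <= 0}
    assert starved == {"etl"}
```

A fixture mainly saves the import in each test module. The plain function stays easier to call from non-test code, so which is nicer is a matter of taste.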

jscheffl

Thanks for breaking up the code and logic. That makes total sense, and the good pydoc makes it much more readable.

Two nits from my side, then LGTM. I hope we get this into 3.3.0? It would be cool as a clean start on a new minor.

ashb · 2026-06-23T13:31:14Z

+        max_tis: int,
+        pools: dict[str, PoolStats],
+        starved_pools: set[str],
+        session: Session,


Suggested change

-        session: Session,
+        *,
+        session: Session,

ashb · 2026-06-23T13:31:29Z

+            .limit(max_tis)
+        )
+
+    def _mark_task_instances_queued(self, executable_tis: list[TI], session: Session) -> list[TI]:


Suggested change

-    def _mark_task_instances_queued(self, executable_tis: list[TI], session: Session) -> list[TI]:
+    def _mark_task_instances_queued(self, executable_tis: list[TI], *, session: Session) -> list[TI]:

ashb requested a review from XD-DENG as a code owner May 13, 2026 16:06

boring-cyborg Bot added the area:Scheduler including HA (high availability) scheduler label May 13, 2026

ashb changed the title ~~Extract _acquire_pool_capacity from _critical_section_enqueue_task_instances~~ Refactor _executable_task_instances_to_queued to make logic more readable May 13, 2026

ashb added the changelog:skip Changes that should be skipped from the changelog (CI, tests, etc..) label May 13, 2026

ashb added this to the Airflow 3.3.0 milestone May 13, 2026

jedcunningham approved these changes May 13, 2026

ashb force-pushed the scheduler-enqueue-readability branch from 007fd8b to 0e16ac2 Compare May 14, 2026 11:21

jscheffl reviewed Jun 14, 2026

jscheffl approved these changes Jun 14, 2026

ashb commented Jun 23, 2026

ashb wants to merge 1 commit into apache:main from astronomer:scheduler-enqueue-readability

3 participants
