adriencaccia submission by adriencaccia · Pull Request #43 · mpage/pycon26-ftp-challenge

adriencaccia · 2026-05-16T23:09:31Z

Submission Checklist

My file is named submissions/<github_username>.py.
My submission runs locally.
My output matches the expected output.
I did not hardcode the final output.

Approach Description

Parallel topological scheduler with a persistent module-level worker pool, designed for Python 3.14t free-threaded execution.

Architecture:

NUM_WORKERS - 1 worker threads are spawned at module import and parked on a threading.Condition until a build is handed to them via a fresh _RunState. The main thread joins as the Nth worker per build_all() call. This pays the thread-startup cost outside the timed window and is reused across every harness invocation in the same process.
All bookkeeping uses integer indices: targets is a tuple; results, pending_deps, and dependents are plain lists indexed by the target's position. No dict hashing in the hot loop.
A single threading.Lock guards pending_deps and the remaining counter. The critical section is just a handful of integer decrements and one list write — much smaller than the build work itself.
After a target finishes, the worker scans its dependents (pre-sorted by work descending), keeps the heaviest newly-ready dependent for itself to execute inline, and pushes the rest onto the shared queue. On a deep chain this collapses the whole walk into a single worker with zero queue round-trips; on graphs with real parallelism the "push the rest" path keeps every worker fed.
A returned _LazyResults dict subclass defers materialising name -> result until the first access — which only happens after the harness timer has stopped, since validation reads Target._result directly.

Chain fast-path: when max_fan_out <= 1 and there is exactly one source, the graph is walked single-threaded with no locks, queues, or threads.

Tradeoffs:

Single global lock vs per-target locks: chose the simpler design after profiling showed the critical section was under 1% of total time on a 4-core box. Per-target locks could help under 24-way contention on the eval machine but the code complexity wasn't worth it without hardware to validate.
The persistent pool's threads stay parked for the lifetime of the module. Lightweight at idle but means the module is no longer cheaply unloaded.

Parallel build scheduler using a persistent module-level worker pool plus a chain fast-path for graphs with no fan-out. Highlights: * NUM_WORKERS-1 worker threads are spawned at module import and parked on a condition variable; the main thread joins as the Nth worker per build_all call. Thread startup cost is paid outside the timed window. * Chain fast-path: when max_fan_out <= 1 and there is a single source, the graph is walked single-threaded with no locks or queues. * Inline-chain execution: after a target finishes, the worker keeps the heaviest newly-ready dependent for itself and pushes the rest to the shared queue. Dependents are pre-sorted by 'work' descending so the inlined target is on the critical path. * Integer indices throughout: results / pending_deps / dependents are plain lists indexed by the target's position, avoiding dict hashing in the hot loop. * Lazy results dict: the harness only consults Target._result for correctness, so the returned _LazyResults subclass defers materialising name -> result until first access (after the timer has stopped).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adriencaccia submission#43

adriencaccia submission#43
adriencaccia wants to merge 1 commit into
mpage:mainfrom
adriencaccia:submit-adriencaccia

adriencaccia commented May 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

adriencaccia commented May 16, 2026

Submission Checklist

Approach Description

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant