Run submodel inference in parallel using CUDA streams by kmaziarz · Pull Request #18 · microsoft/retrochimera

kmaziarz · 2026-06-09T15:09:24Z

So far, we've been running RetroChimera's submodels sequentially. Since they are independent, we can run them in parallel; this has the potential to speed things up when the submodels do not fully utilize the GPU, which is the case for the ones currently used in RetroChimera. This PR adds an option to run the submodels in parallel using CUDA streams. The behaviour is controlled by a flag, which is True by default. The main reason to turn it off would be operating under very tight memory constraints, as parallelization slightly increases memory needs.
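
For readers unfamiliar with the technique, here is a minimal sketch of running independent submodels on separate CUDA streams in PyTorch. It is not the code from this PR: the function name `run_submodels` and the `parallel` argument are hypothetical, and the sketch assumes each submodel is an `nn.Module` that returns a single tensor. When the input is not on a CUDA device it simply falls back to a sequential loop.

```python
import torch
from torch import nn


def run_submodels(models, inputs, parallel=True):
    """Run independent submodels on the same input (hypothetical helper).

    If `parallel` is set and `inputs` lives on a CUDA device, each submodel
    is launched on its own CUDA stream; otherwise they run one by one.
    """
    if not (parallel and inputs.is_cuda):
        with torch.no_grad():
            return [model(inputs) for model in models]

    main_stream = torch.cuda.current_stream(inputs.device)
    streams = [torch.cuda.Stream(device=inputs.device) for _ in models]
    outputs = []

    with torch.no_grad():
        for model, stream in zip(models, streams):
            # The side stream must not start before `inputs` is ready.
            stream.wait_stream(main_stream)
            with torch.cuda.stream(stream):
                out = model(inputs)
            # Tell the caching allocator that `out` will be used on the
            # main stream, so its memory is not reused too early.
            out.record_stream(main_stream)
            outputs.append(out)

    # The main stream waits for all side streams before using the outputs.
    for stream in streams:
        main_stream.wait_stream(stream)
    return outputs


device = "cuda" if torch.cuda.is_available() else "cpu"
submodels = [nn.Linear(8, 4).to(device) for _ in range(3)]
batch = torch.randn(16, 8, device=device)
results = run_submodels(submodels, batch, parallel=True)
print([tuple(r.shape) for r in results])
```

Whether the kernels actually overlap depends on how much of the GPU each submodel occupies, which is why the gain is expected only when the submodels leave the device underutilized. Keeping several submodels' intermediate tensors alive at once is also the likely source of the extra memory use mentioned above.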

The speedup is small but measurable (1.02x at batch_size=1 and 1.13x at batch_size=16), and model output is unchanged. For context, I also include a comparison with the state of main before #15, i.e. before we started optimizing model speed at all.

| Setting | Before #15 | Before #18 | After #18 | Speedup #18 | Speedup combined |
| --- | --- | --- | --- | --- | --- |
| batch_size=1 | 3052.616 | 2335.829 | 2285.757 | 1.02x | 1.34x |
| batch_size=16 | 1197.197 | 658.432 | 584.615 | 1.13x | 2.05x |
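
The speedup columns can be reproduced from the raw measurements. The short check below assumes the three measurement columns are timings where lower is better (the table does not state a unit), so each speedup is the earlier value divided by the later one. The helper name `speedup` is only for illustration.

```python
def speedup(before: float, after: float) -> float:
    """Ratio of the earlier timing to the later one (assumes lower is better)."""
    return before / after


# (before #15, before #18, after #18), copied from the table above.
timings = {
    "batch_size=1": (3052.616, 2335.829, 2285.757),
    "batch_size=16": (1197.197, 658.432, 584.615),
}

rows = {}
for setting, (before_15, before_18, after_18) in timings.items():
    rows[setting] = (
        f"{speedup(before_18, after_18):.2f}x",  # effect of #18 alone
        f"{speedup(before_15, after_18):.2f}x",  # combined effect since before #15
    )
    print(setting, *rows[setting])
```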

jla-gardner

Nice 🚀 LGTM

kmaziarz added 3 commits June 9, 2026 13:36

feat(inference): Run ensembled models in parallel

1175844

doc(CHANGELOG): Add an entry for #18

29af9e0

feat(inference): Add flag to turn parallelization off

6542eb1

kmaziarz requested review from fiberleif, jla-gardner and mrwnmsr June 9, 2026 15:09

jla-gardner approved these changes Jun 9, 2026

mrwnmsr approved these changes Jun 10, 2026

kmaziarz merged commit a4ae7c7 into main Jun 10, 2026
8 checks passed

kmaziarz deleted the kmaziarz/parallelize-ensemble-model-inference branch June 10, 2026 11:15
