Add a persistent Parakeet helper for low-latency host integrations#18861
seyeong-han wants to merge 1 commit into main
Conversation
Factor the Parakeet transcription logic out of the one-shot runner so host apps can keep the model warm across requests. Build the new helper alongside the runner and document the helper workflow for app integrations.

Made-with: Cursor
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18861
Note: Links to docs will display an error until the docs builds have been completed.
❗ 1 Active SEV: there is 1 currently active SEV. If your PR is affected, please view it below.
❌ 2 New Failures, 1 Cancelled Job, 1 Unrelated Failure (as of commit b54a81c with merge base 411ede2):
- NEW FAILURES: the following jobs have failed.
- CANCELLED JOB: the following job was cancelled. Please retry.
- BROKEN TRUNK: the following job failed but was also failing on the merge base. 👉 Rebase onto the `viable/strict` branch to avoid these failures.

This comment was automatically generated by Dr. CI and updates every 15 minutes.
Summary
- Refactor `parakeet_runner` into a shared `ParakeetTranscriber` class
- Add a `parakeet_helper` binary plus a stdin/stdout helper protocol for long-lived host integrations

Why a helper?
The Voxtral Realtime macOS app (`executorch-examples/voxtral_realtime/macos`) didn't need any changes to the executorch repo because `voxtral_realtime_runner` was already designed as a streaming, long-running process: the app just launches it and feeds audio. `parakeet_runner` is different: it's a one-shot batch CLI tool that loads the model, transcribes one WAV file, prints the result, and exits. There's no way to send it a second request without restarting the process and paying the ~1.4 s model-load cost again.

The ExecuWhisper macOS app (meta-pytorch/executorch-examples#232) runs repeated record-then-transcribe requests via system dictation, so a fresh process per recording is too slow.
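The spawn-once, request-many pattern this motivates can be sketched from the host side. The snippet below is a minimal illustration, not the helper's actual protocol: it uses a toy stand-in subprocess in place of the real `parakeet_helper` binary, and assumes a hypothetical one-line-request, one-line-response framing over stdin/stdout.

```python
import subprocess
import sys

# Toy stand-in for the real parakeet_helper binary: reads one request line
# from stdin, writes one response line to stdout, until EOF. A real helper
# would load the model once at startup and transcribe each WAV path it
# receives, so the ~1.4 s load cost is paid a single time.
TOY_HELPER = r'''
import sys
for line in sys.stdin:
    path = line.strip()
    # (real helper: transcript = transcriber.transcribe(path))
    print("transcript-for:" + path, flush=True)
'''

class HelperClient:
    """Host-side wrapper: spawn the helper once, reuse it across requests."""

    def __init__(self):
        # One long-lived process for the whole session.
        self.proc = subprocess.Popen(
            [sys.executable, "-c", TOY_HELPER],
            stdin=subprocess.PIPE,
            stdout=subprocess.PIPE,
            text=True,
        )

    def transcribe(self, wav_path: str) -> str:
        # One request line out, one response line back.
        self.proc.stdin.write(wav_path + "\n")
        self.proc.stdin.flush()
        return self.proc.stdout.readline().strip()

    def close(self):
        # Closing stdin signals EOF; the helper loop exits cleanly.
        self.proc.stdin.close()
        self.proc.wait()

if __name__ == "__main__":
    client = HelperClient()
    print(client.transcribe("a.wav"))
    print(client.transcribe("b.wav"))  # no process restart between requests
    client.close()
```

The second `transcribe` call reuses the warm process, which is exactly what a one-shot CLI like `parakeet_runner` cannot do.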
`parakeet_helper` fills that gap. It's the Parakeet equivalent of what the Voxtral Realtime runner already does natively: stay alive, keep the model warm, and accept multiple requests over stdin/stdout.

Test plan
- `cmake --preset llm-metal-stats -DEXECUTORCH_BUILD_MLX=OFF`
- `cmake --build --preset llm-metal-stats-install`
- `cd examples/models/parakeet && cmake --build --preset parakeet-metal`: both `parakeet_runner` and `parakeet_helper` link successfully

Made-with: Cursor