feat: text-to-speech x LLM integration by IgorSwat · Pull Request #936 · software-mansion/react-native-executorch

IgorSwat · 2026-03-04T20:07:04Z

Description

This pull request introduces a few changes to the Text-to-Speech module:

Improved streaming mode by allowing an incrementally expanded text input. This change focuses on integrating T2S with text generation models (e.g. Llama 3.2).
Added simple test cases for T2S module.

Introduces a breaking change?

Yes
No

Type of change

Bug fix (change which fixes an issue)
New feature (change which adds functionality)
Documentation update (improves or adds clarity to existing documentation)
Other (chores, tests, code style improvements etc.)

Tested on

iOS
Android

Testing instructions

To test the Text-to-Speech module, run the set of tests for this module.
To test the new streaming mode and it's integration with text generation models, one can use 'text-to-speech-llm' demo app.

Screenshots

Related issues

#773
#897

Checklist

I have performed a self-review of my code
I have commented my code, particularly in hard-to-understand areas
I have updated the documentation accordingly
My changes generate no new warnings

Additional notes

…on/react-native-executorch into @is/llm-to-speech

chmjkb · 2026-03-06T11:02:21Z

apps/speech/screens/TextToSpeechLLMScreen.tsx

+        await tts.stream({
+          text: '',
+          speed: 0.9,
+          stopAutomatically: false,
+          onNext,
+        });


I'm not sure why we need this, can you clarify?

What exactly do you mean? The 'text' field?

chmjkb · 2026-03-06T11:10:47Z

docs/docs/04-typescript-api/01-natural-language-processing/TextToSpeechModule.md

 :::

-2.  [**`stream({ text, speed })`**](../../06-api-reference/classes/TextToSpeechModule.md#stream): An async generator that yields chunks of audio as they are computed. This is ideal for reducing the "time to first audio" for long sentences.
+2. [**`stream({ speed, stopAutomatically })`**](../../06-api-reference/classes/TextToSpeechModule.md#stream): An async generator that yields chunks of audio as they are computed. This is ideal for reducing the "time to first audio" for long sentences. In contrast to `forward`, it enables inserting text chunks dynamically into processing buffer with [**`streamInsert(text)`**](../../06-api-reference/classes/TextToSpeechModule.md#streaminsert) and allows stopping generation early with [**`streamStop(instant)`**](../../06-api-reference/classes/TextToSpeechModule.md#streamstop).


Any reason why we're ditching the speed param?

chmjkb · 2026-03-06T11:11:06Z

packages/react-native-executorch/common/rnexecutorch/models/text_to_speech/kokoro/Kokoro.cpp

 #include <rnexecutorch/data_processing/Sequential.h>
+#include <thread>
+
+#include <rnexecutorch/Log.h>


redundant include

chmjkb · 2026-03-06T13:35:32Z

packages/react-native-executorch/src/modules/natural_language_processing/TextToSpeechModule.ts

just a note: it should be a factory from now, see #937

chmjkb · 2026-03-06T13:45:05Z

packages/react-native-executorch/src/modules/natural_language_processing/TextToSpeechModule.ts

+        await this.nativeModule.stream(
+          speed,
+          stopAutomatically,
+          (audio: number[]) => {


I think we should return JSTensorViewOut from the native side, so there's less copying going on

chmjkb · 2026-03-06T13:50:16Z

packages/react-native-executorch/src/modules/natural_language_processing/TextToSpeechModule.ts

+    while (!this.streamFinished) {
      if (queue.length > 0) {
        yield queue.shift()!;
-        if (finished && queue.length === 0) {
+        if (this.streamFinished && queue.length === 0) {
          return;
        }
        continue;
      }
      if (error) throw error;
-      if (finished) return;
+      if (this.streamFinished) return;


why are we checking this.streamFinished twice?

chmjkb · 2026-03-06T13:55:44Z

packages/react-native-executorch/src/hooks/natural_language_processing/useTextToSpeech.ts

+        if (input.text) {
+          moduleInstance.streamInsert(input.text);
+        }


Is this thread safe? This appends to a std::string on a JS thread while a bg thread reads from it, right?

chmjkb · 2026-03-06T14:06:43Z

packages/react-native-executorch/common/rnexecutorch/tests/CMakeLists.txt

+# Phonemis
+set(LIBS_DIR "${PACKAGE_ROOT}/third-party/android/libs")
+set(PHONEMIS_LIBS
+  "${LIBS_DIR}/phonemis/${ANDROID_ABI}/libphonemis.a"
+)
+


lets follow the convention

add_library(phonemis STATIC IMPORTED) set_target_properties(phonemis PROPERTIES IMPORTED_LOCATION "${ANDROID_THIRD_PARTY}/phonemis/${ANDROID_ABI}/libphonemis.a" )

IgorSwat added 4 commits March 4, 2026 14:02

Implement TTS incremental input streaming

27b5de3

Merge branch '@is/llm-to-speech' of https://github.com/software-mansi…

ee0b968

…on/react-native-executorch into @is/llm-to-speech

Fix stopping the streaming

994efcc

Add simple T2S test cases

cf6fb84

IgorSwat changed the title ~~@is/llm to speech~~ feat: text-to-speech x LLM integration & text-to-speech tests Mar 4, 2026

IgorSwat changed the title ~~feat: text-to-speech x LLM integration & text-to-speech tests~~ feat: text-to-speech x LLM integration Mar 4, 2026

Update T2S docs

ff288d8

IgorSwat added test Issue and PR related to tests or testing infrastructure feature PRs that implement a new feature labels Mar 5, 2026

chmjkb requested changes Mar 6, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: text-to-speech x LLM integration#936

feat: text-to-speech x LLM integration#936
IgorSwat wants to merge 5 commits intomainfrom
@is/llm-to-speech

IgorSwat commented Mar 4, 2026 •

edited

Loading

Uh oh!

chmjkb Mar 6, 2026

Uh oh!

IgorSwat Mar 6, 2026

Uh oh!

chmjkb Mar 6, 2026

Uh oh!

chmjkb Mar 6, 2026

Uh oh!

chmjkb Mar 6, 2026

Uh oh!

chmjkb Mar 6, 2026

Uh oh!

chmjkb Mar 6, 2026

Uh oh!

chmjkb Mar 6, 2026

Uh oh!

chmjkb Mar 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

IgorSwat commented Mar 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Introduces a breaking change?

Type of change

Tested on

Testing instructions

Screenshots

Related issues

Checklist

Additional notes

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

IgorSwat commented Mar 4, 2026 •

edited

Loading