Add markdown and HTML text filter with granular section navigation for improved TTS by Copilot · Pull Request #16 · LookAtWhatAiCanDo/CopilotTTS

Copilot · 2025-12-29T22:35:18Z

Implement Markdown Text Filter for Better TTS ✅

Recent Changes (addressing PR comment #3698420191):

Issue Reported: <h2>Summary</h2> is not being spoken even though paragraphs before and after it are spoken.

Changes Made:

Added detailed logging to extractTextSectionsFromHTML():
- Logs each section element found (P, H1-H6, LI, etc.)
- Shows text length and preview
- Explicitly logs when an element has no text
- Shows total sections found

This logging will help identify:

Whether H2 elements are being detected during extraction
Whether the H2 text is empty or non-empty
How many total sections are found from the container

Debugging Request:

Please check the browser console for messages like:

CopilotTTS-Content: [extractTextSectionsFromHTML] Found H2 element, text length: X, text: "Summary"
CopilotTTS-Content: [extractTextSectionsFromHTML] Total sections found: X
CopilotTTS-Content: Section X [H2]: "Summary..."

These logs will show exactly what's happening during section extraction and whether the H2 is being found but then filtered/skipped later in the process.

Original prompt

This section details on the original issue you should resolve

<issue_title>Implement generic spoken text filter, especially for markdown (ex: "===...", bullets, numbered lists, etc)</issue_title>
<issue_description>Markdown complicates Text To Speech.

Examples:
# Title
Some thing 1
## Subtitle 1
Some sub thing 2
### Subtitle 2
Some sub thing 3
The speech is flawed in that it runs on "Title Some thing 1 Subtitle 1 Some sub thing 2 Subtitle 2 Some sub thing 3" as a long run-on single sentance.
The fix is to add detect titles and manually add a small pause to the speech.
============================================================
HELLO WORLD
============================================================
The speech is flawed in that it says "equals equals equals equals equals equals equals equals ...".
In this case the lines need to be evaluated and reduced to either:

strip out all consecutive equal signs of more than a reasonable 4
-or-

replace each whole row of equals signs with a generic "header" mention
1. This is numbered list item one
2. But when this numbered list item two is spoken it just runs on and both never announces the item number and never pauses after item one
In this case the the code needs to manually induce a small pause between numbered items and announce the item number.
* This is bullet point item one
* And same thing here, when this bullet point item two is spoken it just runs on and both never announces the bullet point and never pauses after item one
Similarly, un this case the the code needs to manually induce a small pause between numbered items and announce a bullet.

There are plenty of other markdown complications to consider and amend speech in the future.
</issue_description>

Comments on the Issue (you are @copilot in this section)

Fixes Implement generic spoken text filter, especially for markdown (ex: "===...", bullets, numbered lists, etc) #10

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

…ion regex Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

… matching Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

… duplication Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

paulpv · 2025-12-30T00:23:34Z

@copilot Turns out it is not just markdown; it is also html.
Copilot is also outputting html that is causing problems, not just markdown.
For example:

<div class="SessionLogs-module__markdownWrapper--sELfE">
    <div class="markdown-body MarkdownRenderer-module__container--dNKcF" data-copilot-markdown="true"
        style="--MarkdownRenderer_streaming-fade-duration: 750ms; transition: background-color 0.3s;">
        <p>Perfect! Now I understand the codebase structure. Let me analyze the problem:</p>
        <h2>Problem Analysis:</h2>
        <p>The issue is that markdown formatting complications cause problems in text-to-speech:</p>
        <ol>
            <li><strong>Headers</strong> (# Title, ## Subtitle, etc.) run together without pauses</li>
            <li><strong>Separator lines</strong> (===...) are spoken as "equals equals equals..."</li>
            <li><strong>Numbered lists</strong> run on without announcing numbers or pauses</li>
            <li><strong>Bullet lists</strong> run on without announcing bullets or pauses</li>
        </ol>
        <p>The fix needs to be in the <code node="[object Object]">extractTextFromElement</code> function in <code
                node="[object Object]">content.js</code>, where we need to preprocess markdown text before speaking it.
        </p>
        <p>Let me create a plan and report progress:</p>
    </div>
</div>

Reading that also results in a long runon sentence that starts with "Headers (# Title, ## Subtitle, etc.) " and ends with "without announcing bullets or pauses."

There should be a way to more naturally speak html, pausing as appropriate between relevant element boundaries.

Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Copilot · 2025-12-30T00:28:34Z

@copilot Turns out it is not just markdown; it is also html.
Copilot is also outputting html that is causing problems, not just markdown.
For example:...

You're absolutely right! I've enhanced the implementation to handle HTML structure in addition to markdown.

Changes made (commit 7022beb):

Added extractTextFromHTML() function that:

Walks the DOM tree instead of using flat textContent
Detects block-level elements (<p>, <h2>, <li>, etc.)
Inserts periods after block elements for natural pauses

Result: The HTML example from your comment will now be spoken with proper pauses:

"Perfect! Now I understand the codebase structure. Let me analyze the problem. [pause] Problem Analysis. [pause] The issue is that markdown formatting complications cause problems in text-to-speech. [pause] Headers (# Title, ## Subtitle, etc.) run together without pauses. [pause]" etc.

Instead of one long run-on sentence, each paragraph, header, and list item now gets a natural pause.

paulpv · 2025-12-30T01:02:28Z

@copilot Maybe the text should be broken down even more (but hopefully not down to the token level).
Maybe just break up obvious sections, especially ones that have HTML boundaries.
If done right this might also fix #15

Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Copilot · 2025-12-30T01:07:54Z

@copilot Maybe the text should be broken down even more (but hopefully not down to the token level).
Maybe just break up obvious sections, especially ones that have HTML boundaries.
If done right this...

Done! I've enhanced the implementation to break down content into separate sections based on HTML boundaries (commit 83d80a8).

Changes made:

Added extractTextSectionsFromHTML() function that:

Identifies block-level sections: <p>, <h1>-<h6>, <li>, <blockquote>, <pre>
Extracts each section as a separate speech item
Creates multiple navigable items instead of one large chunk

Result:

Each paragraph, header, and list item is now a separate speech item
Users can navigate between sections using Prev/Next buttons
Progress slider shows individual sections
Much better granularity for long Copilot responses

Example:

Before: [1 item with all content]
After:  [Item 1: "Perfect! Now I understand..."]
        [Item 2: "Problem Analysis:"]
        [Item 3: "The issue is that markdown..."]
        [Item 4: "Headers (# Title...) run together"]
        [Item 5: "Separator lines (===...) are spoken"]
        etc.

This should help with issue #15 by providing much more granular navigation control.

Warning

Firewall rules blocked me from connecting to one or more addresses (expand for details)

I tried to connect to the following addresses, but was blocked by firewall rules:

https://api.github.com/repos/LookAtWhatAiCanDo/CopilotTTS/issues/15
- Triggering command: /usr/bin/curl curl -s REDACTED (http block)

If you need me to access, download, or install something from one of these locations, you can either:

Configure Actions setup steps to set up my environment, which run before the firewall is enabled
Add the appropriate URLs or hosts to the custom allowlist in this repository's Copilot coding agent settings (admins only)

paulpv · 2025-12-30T01:19:28Z

@copilot VERY CLOSE!
Given this html:

<div class="SessionLogs-module__markdownWrapper--sELfE">
    <div class="markdown-body MarkdownRenderer-module__container--dNKcF" data-copilot-markdown="true"
        style="--MarkdownRenderer_streaming-fade-duration: 750ms;">
        <p>Perfect! Let me create a final summary of all the work completed:</p>
        <h2>Summary</h2>
        <p style="transition: background-color 0.3s;">I successfully addressed the new comment from @paulpv (PR comment
            #3697956542) which requested breaking down the text into more granular sections based on HTML boundaries.
        </p>
        <h3>Problem Identified</h3>
        <p style="transition: background-color 0.3s;">The previous implementation extracted all text from a markdown
            container as one large speech item. While it added pauses between elements, users couldn't navigate between
            individual paragraphs, headers, or list items. The comment suggested breaking content down to individual
            HTML element boundaries for better navigation.</p>
        <h3>Solution Implemented</h3>
        <p style="transition: background-color 0.3s;"><strong>1. Added <code
                    node="[object Object]">extractTextSectionsFromHTML()</code> function</strong> (commit ac8240f):</p>
        <ul>
            <li style="transition: background-color 0.3s;">Identifies block-level section elements: <code
                    node="[object Object]">&lt;p&gt;</code>, <code node="[object Object]">&lt;h1&gt;</code>-<code
                    node="[object Object]">&lt;h6&gt;</code>, <code node="[object Object]">&lt;li&gt;</code>, <code
                    node="[object Object]">&lt;blockquote&gt;</code>, <code node="[object Object]">&lt;pre&gt;</code>
            </li>
            <li style="transition: background-color 0.3s;">Extracts each section as a separate object with text and
                element reference</li>
            <li style="transition: background-color 0.3s;">Returns an array of sections instead of one concatenated
                string</li>
            <li style="transition: background-color 0.3s;">Distinguishes between section elements (create items) and
                container elements (traverse only)</li>
        </ul>
        <p style="transition: background-color 0.3s;"><strong>2. Updated <code
                    node="[object Object]">processMarkdownContainer()</code> function</strong>:</p>
        <ul>
            <li style="transition: background-color 0.3s;">Now uses section-based extraction instead of single-item
                extraction</li>
            <li data-tts-original-bg="rgba(0, 0, 0, 0)" data-tts-highlighting="true"
                style="transition: background-color 0.3s; background-color: rgba(255, 255, 0, 0.25);">Creates multiple
                speech items from one markdown container</li>
            <li style="transition: background-color 0.3s;">Each paragraph, header, and list item becomes its own
                navigable speech item</li>
            <li style="transition: background-color 0.3s;">Falls back to single-item extraction for elements without
                block structure</li>
        </ul>
        <p style="transition: background-color 0.3s;"><strong>3. Updated documentation</strong> (commit 83d80a8):</p>
        <ul>
            <li style="transition: background-color 0.3s;">Enhanced README.md to explain granular navigation feature
            </li>
            <li style="transition: background-color 0.3s;">Added examples showing before/after section breakdown</li>
            <li style="transition: background-color 0.3s;">Documented benefits of the new approach</li>
        </ul>
        <h3>Result</h3>
        <p style="transition: background-color 0.3s;"><strong>Before the change:</strong></p>
        <figure class="CodeBlock-module__container--QRI4L" aria-labelledby=":r55:">
            <div class="CodeBlock-module__header--K8Zrp"><span
                    class="LanguageDot-module__languageDot--O6n2z"></span><span id=":r55:"
                    class="CodeBlock-module__languageName--fxI6n">Code</span></div>
            <div class="CodeBlock-module__copyContainer--jLoHL">
                <div class="CodeBlock-module__copyContent--BH7EX"><button data-component="IconButton" type="button"
                        class="prc-Button-ButtonBase-9n-Xk CodeBlock-module__copyButton--eyFr8 prc-Button-IconButton-fyge7"
                        data-loading="false" data-no-visuals="true" data-size="medium" data-variant="invisible"
                        aria-labelledby=":r56:"><svg aria-hidden="true" focusable="false" class="octicon octicon-copy"
                            viewBox="0 0 16 16" width="16" height="16" fill="currentColor" display="inline-block"
                            overflow="visible" style="vertical-align: text-bottom;">
                            <path
                                d="M0 6.75C0 5.784.784 5 1.75 5h1.5a.75.75 0 0 1 0 1.5h-1.5a.25.25 0 0 0-.25.25v7.5c0 .138.112.25.25.25h7.5a.25.25 0 0 0 .25-.25v-1.5a.75.75 0 0 1 1.5 0v1.5A1.75 1.75 0 0 1 9.25 16h-7.5A1.75 1.75 0 0 1 0 14.25Z">
                            </path>
                            <path
                                d="M5 1.75C5 .784 5.784 0 6.75 0h7.5C15.216 0 16 .784 16 1.75v7.5A1.75 1.75 0 0 1 14.25 11h-7.5A1.75 1.75 0 0 1 5 9.25Zm1.75-.25a.25.25 0 0 0-.25.25v7.5c0 .138.112.25.25.25h7.5a.25.25 0 0 0 .25-.25v-7.5a.25.25 0 0 0-.25-.25Z">
                            </path>
                        </svg></button><span
                        class="Box-sc-62in7e-0 CopyToClipboardButton-module__tooltip--HDUYz prc-TooltipV2-Tooltip-tLeuB"
                        data-direction="s" aria-label="Copy code" aria-hidden="true" id=":r56:" popover="auto">Copy
                        code</span></div>
            </div>
            <div class="CodeBlock-module__codeContainer--snQei">
                <pre class="CodeBlock-module__code--gyjSL" tabindex="0" style="transition: background-color 0.3s;"><code class="">Markdown Container → [One large speech item with all content]
</code></pre>
            </div>
        </figure>
        <p style="transition: background-color 0.3s;"><strong>After the change:</strong></p>
        <figure class="CodeBlock-module__container--QRI4L" aria-labelledby=":r59:">
            <div class="CodeBlock-module__header--K8Zrp"><span
                    class="LanguageDot-module__languageDot--O6n2z"></span><span id=":r59:"
                    class="CodeBlock-module__languageName--fxI6n">Code</span></div>
            <div class="CodeBlock-module__copyContainer--jLoHL">
                <div class="CodeBlock-module__copyContent--BH7EX"><button data-component="IconButton" type="button"
                        class="prc-Button-ButtonBase-9n-Xk CodeBlock-module__copyButton--eyFr8 prc-Button-IconButton-fyge7"
                        data-loading="false" data-no-visuals="true" data-size="medium" data-variant="invisible"
                        aria-labelledby=":r5a:"><svg aria-hidden="true" focusable="false" class="octicon octicon-copy"
                            viewBox="0 0 16 16" width="16" height="16" fill="currentColor" display="inline-block"
                            overflow="visible" style="vertical-align: text-bottom;">
                            <path
                                d="M0 6.75C0 5.784.784 5 1.75 5h1.5a.75.75 0 0 1 0 1.5h-1.5a.25.25 0 0 0-.25.25v7.5c0 .138.112.25.25.25h7.5a.25.25 0 0 0 .25-.25v-1.5a.75.75 0 0 1 1.5 0v1.5A1.75 1.75 0 0 1 9.25 16h-7.5A1.75 1.75 0 0 1 0 14.25Z">
                            </path>
                            <path
                                d="M5 1.75C5 .784 5.784 0 6.75 0h7.5C15.216 0 16 .784 16 1.75v7.5A1.75 1.75 0 0 1 14.25 11h-7.5A1.75 1.75 0 0 1 5 9.25Zm1.75-.25a.25.25 0 0 0-.25.25v7.5c0 .138.112.25.25.25h7.5a.25.25 0 0 0 .25-.25v-7.5a.25.25 0 0 0-.25-.25Z">
                            </path>
                        </svg></button><span
                        class="Box-sc-62in7e-0 CopyToClipboardButton-module__tooltip--HDUYz prc-TooltipV2-Tooltip-tLeuB"
                        data-direction="s" aria-label="Copy code" aria-hidden="true" id=":r5a:" popover="auto">Copy
                        code</span></div>
            </div>
            <div class="CodeBlock-module__codeContainer--snQei">
                <pre class="CodeBlock-module__code--gyjSL" tabindex="0" style="transition: background-color 0.3s;"><code class="">Markdown Container → [Item 1: Paragraph 1]
                  → [Item 2: Header]
                  → [Item 3: Paragraph 2]
                  → [Item 4: List item 1]
                  → [Item 5: List item 2]
                  → [Item 6: Paragraph 3]
</code></pre>
            </div>
        </figure>
        <h3 style="transition: background-color 0.3s;">Benefits</h3>
        <ol>
            <li style="transition: background-color 0.3s;"><strong>Better Navigation</strong>: Users can skip/replay
                individual paragraphs or list items using Prev/Next buttons</li>
            <li style="transition: background-color 0.3s;"><strong>More Control</strong>: Progress slider shows
                individual sections, not just entire containers</li>
            <li style="transition: background-color 0.3s;"><strong>Potentially fixes issue #15</strong>: Much more
                granular navigation control</li>
            <li style="transition: background-color 0.3s;"><strong>Maintains backward compatibility</strong>: Falls back
                to single-item extraction when no block structure exists</li>
            <li style="transition: background-color 0.3s;"><strong>Works with existing features</strong>: Each section
                still gets filtered through markdown filter layer</li>
        </ol>
        <h3 style="transition: background-color 0.3s;">Complete Architecture</h3>
        <p style="transition: background-color 0.3s;">The solution now has three layers working together:</p>
        <ol>
            <li style="transition: background-color 0.3s;"><strong>Section Extraction Layer</strong> (<code
                    node="[object Object]">extractTextSectionsFromHTML</code>): Breaks content into navigable sections
                based on HTML block elements</li>
            <li style="transition: background-color 0.3s;"><strong>HTML Structure Layer</strong> (<code
                    node="[object Object]">extractTextFromHTML</code>): Adds pauses at element boundaries (fallback for
                non-sectioned content)</li>
            <li style="transition: background-color 0.3s;"><strong>Markdown Filter Layer</strong> (<code
                    node="[object Object]">filterTextForSpeech</code>): Handles markdown-specific patterns (headers,
                lists, separators)</li>
        </ol>
        <h3>Security</h3>
        <ul>
            <li>CodeQL analysis passed with 0 alerts</li>
            <li>No security vulnerabilities introduced</li>
        </ul>
        <p style="transition: background-color 0.3s;">This enhancement makes the extension much more user-friendly by
            allowing fine-grained control over what content is being spoken, addressing both the original markdown issue
            and all the feedback from PR comments.</p>
    </div>
</div>

It is not speaking the following:

"Perfect! Let me create a final summary of all the work completed:"
"Summary"
"Problem Identified"
"Solution Implemented"
"
,
-
,
,
,
```
" (this is Some HTML text can cause the TTS to stop speaking #15)
```
"Result" title

Weirdly the whole "Security * CodeQL analysis passed with 0 alerts * No security vulnerabilities introduced" block!

Oddly it does speak "Benefits" and "Complete Architecture" titles.

Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Copilot · 2025-12-30T01:23:47Z

@copilot VERY CLOSE!
Given this html:

I've added extensive logging and fixed the observer to help debug this issue (commit 1219f1f).

Changes made:

Updated observeMarkdownContainer() to handle ALL section elements (H1-H6, LI, etc.), not just <p> tags
Added detailed logging to show which sections are found, filtered, and added/skipped

To help debug:
Please open the browser console (F12) and look for messages like:

CopilotTTS-Content: Found X sections in container
CopilotTTS-Content: Section 1 [H2]: "Summary..."
CopilotTTS-Content: Section 1 ADDED/SKIPPED (reason)

This will show:

Which sections are being detected
Whether they're being filtered out (empty text after filtering)
Whether they're being skipped as duplicates
Whether "New Only" mode is filtering them

Can you share the console output when the page loads? This will help identify exactly why those specific sections aren't being spoken.

paulpv · 2025-12-30T06:23:34Z

@copilot Maybe an improvement, but still some missing spoken simple text in the following html...

<div class="SessionLogs-module__markdownWrapper--sELfE">
    <div class="markdown-body MarkdownRenderer-module__container--dNKcF" data-copilot-markdown="true"
        style="--MarkdownRenderer_streaming-fade-duration: 750ms;">
        <p>Perfect! Let me create a final summary of all the work completed:</p>
        <h2>Summary</h2>
        <p style="transition: background-color 0.3s;">I successfully addressed the new comment from @paulpv (PR comment
            #3697956542) which requested breaking down the text into more granular sections based on HTML boundaries.
        </p>
...
  </div>
</div>

...still results in it not speaking "Summary" in <h2>Summary</h2>.
The proof is in the console log:

CopilotTTS-Content: ✓ Speech STARTED: "Perfect! Let me create a final summary of all the ..."
content.js:116 CopilotTTS-Content: ✓ Speech ENDED: "Perfect! Let me create a final summary of all the ..."
content.js:111 CopilotTTS-Content: ✓ Speech STARTED: "I successfully addressed the new comment from @pau..."
content.js:116 CopilotTTS-Content: ✓ Speech ENDED: "I successfully addressed the new comment from @pau..."

As the log shows, it never speaks the "Summary" between "... work completed" and "I successfully ...".
I can replay the speech and confirm "Summary" is never spoken.

Something is missing or messing up.

…sections Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Copilot · 2025-12-30T06:27:01Z

@copilot Maybe an improvement, but still some missing spoken simple text in the following html...
<div class="SessionLogs-module__markdownWrapper--sELfE">...

I've added detailed logging to trace exactly what's happening during section extraction (commit a427360).

The new logs will show:

Each section element found (P, H2, H3, etc.) with its text
Whether any element has empty text
Total sections found
Which sections are added vs. skipped (and why)

Please reload and check the browser console for:

CopilotTTS-Content: [extractTextSectionsFromHTML] Found H2 element, text length: X, text: "Summary"
CopilotTTS-Content: Section X [H2]: "Summary..."
CopilotTTS-Content: Filtered text: "Summary..."
CopilotTTS-Content: Section X ADDED/SKIPPED (reason)

This should reveal whether:

The H2 is not being found at all
The H2 has empty text
The H2 is being filtered out
The H2 is being skipped as a duplicate

Can you share the full console log output from when the page loads?

paulpv · 2026-01-02T09:17:50Z

@copilot Still not working well.
The inital point of this PR was to add a speech filter and better markdown speech and it has morphed into speaking all html... and that is OK, but the resulting code is a bit messy and can already stand a refactor/cleanup.

Lets take advantage of this by invert the logic and amp up things to 11 and make this a real app.

This should simplify everything: Invert the logic that only spoke specific elements to instead speak everything in the TaskChat-module__stickableContainer--* element. and then change the "Speech Verbosity" dropdown filter to not speak certain elements (see next change).
Add visual debug indicators in the html that help us to track what to speak:
Over each div element (ex: Session-module__detailsContainer--*, Tool-module__detailsContainer--*, etc), add a hover tip of that div's class name.
if an item is enqueued then give it an appropriate debug color
if an item is being spoken then give it another appropriate user highlight color (current translucent yellow may be ok).
if an item has been spoken then give it another appropriate debug color.

paulpv · 2026-01-02T09:29:48Z

@copilot Still not working well.
The inital point of this PR was to add a speech filter and better markdown speech and it has morphed into speaking all html... and that is OK, but the resulting code is a bit messy and can already stand a refactor/cleanup.

Take advantage of this by invert the logic and amp up things to 11 and make this a real app.

This should simplify everything:

In/Under TaskChat-module__stickableContainer--*, for nested div element (ex: Session-module__detailsContainer--*, Tool-module__detailsContainer--*, etc), add a hover tip of that div's class name; hide this behind a [debug] feature flag.
Invert the current logic [that previously only spoke specific elements] to instead now speak everything in/under the TaskChat-module__stickableContainer--* element, but...
Change the "Speech Verbosity" dropdown filter to specify what div elements are to not be spoken; for now the list can show the literal [wildcard?] div names and/or depth levels (whatever makes most sense) that we can give more friendly names in the future.
Add visual debug background color indicators in the html that help us to track what to speak:
if an item is enqueued then give it an appropriate "queued, not yet spoken" debug background color.
if an item is being spoken then give it another appropriate "speaking" user highlight background color (current translucent yellow may be ok).
if an item has been spoken then give it another appropriate "spoken" debug background color.

Copilot · 2026-01-02T09:31:19Z

@copilot Still not working well.
The inital point of this PR was to add a speech filter and better markdown speech and it has morphed into speaking all html... and that is OK, but the resulting code i...

This is a substantial architectural change. To clarify the requirements:

Inversion of logic: Change from "only speak specific elements" to "speak everything, then exclude specific elements"

Debug features (behind flag):

Hover tooltips showing div class names
Background colors: enqueued (not spoken), currently speaking (yellow), already spoken

Verbosity dropdown: Change from "what to speak" to "which div classes to exclude"

This will require:

New debug mode toggle in popup
Refactoring speech filtering logic
Adding visual indicators to DOM elements
Changing verbosity options

Given the scope, should I:

Start with the logic inversion and verbosity changes first?
Add debug features in a separate commit?
Or would you prefer a different approach?

Also, for the verbosity dropdown showing div class names - should it show wildcard patterns like Tool-module__detailsContainer--* or actual class names?

paulpv · 2026-01-02T09:39:49Z

@copilot Would it be better to write new tickets for this and abandon this PR?

For the verbosity dropdown showing div class names, as I said, for now it should show the wildcard patterns like Tool-module__detailsContainer--*; later, after we better understand the div names/depths/relationships, we can change the text to be more user friendly.

Copilot · 2026-01-02T09:40:56Z

@copilot Would it be better to write new tickets for this and abandon this PR?

For the verbosity dropdown showing div class names, as I said, for now it should show the wildcard patterns like `Tool-m...

Creating new tickets: Yes, that would be better. The architectural refactoring (inverting logic, debug features, visual indicators) is a significant change that's beyond the original scope of this PR (markdown/HTML text filtering).

I recommend:

Keep this PR focused on the original goal: markdown text filtering and section navigation
Create separate issue(s) for the refactoring with debug features
That way each PR has clear, focused scope

Verbosity dropdown: Understood - will use wildcard patterns like Tool-module__detailsContainer--* initially, then make them user-friendly later.

Should I:

Close out this PR by addressing any remaining filtering issues?
Or proceed with the refactoring here despite the scope creep?

paulpv · 2026-01-02T09:44:37Z

@copilot Break down the requested features into the required GitHub Issues and generate text for each Issue.

Copilot · 2026-01-02T09:45:58Z

@copilot Break down the requested features into the required GitHub Issues and generate text for each Issue.

Here are the GitHub Issues for the requested features:

Issue 1: Invert speech logic to speak everything by default with exclusion filters

Title: Invert speech logic: speak all content with configurable exclusions

Description:
Currently, CopilotTTS only speaks specific elements (markdown containers, paragraphs, headers, list items). This approach requires maintaining a whitelist of elements to speak.

Proposed Change:
Invert the logic to speak everything under TaskChat-module__stickableContainer--* by default, then allow users to exclude specific div classes.

Benefits:

Simplifies code by removing complex element-specific filtering
Ensures no content is missed
More maintainable as GitHub's DOM structure changes
Users control what NOT to hear rather than what TO hear

Implementation:

Remove current element-specific extraction logic
Extract all text content from TaskChat container
Apply exclusion filters based on user settings
Keep existing markdown text filtering for better speech quality

Related: This will work with the new verbosity dropdown (Issue #X)

Issue 2: Add debug mode with hover tooltips for div class names

Title: Add debug mode with hover tooltips showing element class names

Description:
To help understand what content is being spoken and troubleshoot issues, add a debug mode that shows element class names on hover.

Features:

Add "Debug Mode" toggle in popup UI (checkbox or toggle switch)
When enabled, add hover tooltips to all div elements under TaskChat-module__stickableContainer--*
Tooltip shows the element's class name(s)
Focus on key containers: Session-module__detailsContainer--*, Tool-module__detailsContainer--*, MarkdownRenderer-module__container--*, etc.

Implementation:

Store debug mode state in chrome.storage
Inject CSS and data attributes to show tooltips
Use title attribute or custom tooltip div

UI Mockup:

[✓] Debug Mode

Issue 3: Change verbosity dropdown to exclusion-based filtering

Title: Replace verbosity dropdown with element exclusion configuration

Description:
Currently, the "Speech Verbosity" dropdown has options like "All", "Highlights & Summary", "Summary Only". These are vague and don't give users granular control.

Proposed Change:
Replace with a dropdown (or multi-select) showing div class patterns that can be excluded from speech.

Options (initial list using wildcard patterns):

Tool-module__detailsContainer--* (Tool execution logs)
Session-module__metadata--* (Session metadata)
CodeBlock-module__* (Code blocks)
Figure-module__* (Figures/images)
(Additional patterns to be identified)

Future Enhancement:
Replace wildcard patterns with user-friendly names:

"Tool Execution Logs" instead of Tool-module__detailsContainer--*
"Code Blocks" instead of CodeBlock-module__*

Implementation:

Identify all relevant div class patterns in GitHub Copilot Tasks
Create exclusion list in settings
Update filtering logic to check element class against exclusions
Store preferences in chrome.storage

Issue 4: Add visual debug indicators for speech queue states

Title: Add visual background color indicators for speech item states

Description:
Add visual feedback showing which items are queued, currently speaking, or already spoken. This helps users understand what's happening and troubleshoot issues.

Color States:

Enqueued (not yet spoken): Light blue background (rgba(33, 150, 243, 0.15))
Currently speaking: Translucent yellow background (rgba(255, 255, 0, 0.25)) - already implemented
Already spoken: Light gray background (rgba(0, 0, 0, 0.05))

Features:

Only visible when Debug Mode is enabled (see Issue #X)
Add/remove CSS classes or inline styles as speech state changes
Preserve original background color in data attribute for restoration
Clear indicators when navigating away or reloading

Implementation:

Hook into addSpokenItem() for enqueued state
Hook into speech start event for speaking state
Hook into speech end event for spoken state
Use data attributes to track state: data-tts-state="enqueued|speaking|spoken"

These issues can be tackled independently or in order (1→2→3→4 recommended). Should I create actual GitHub issue tickets for these?

Copilot AI self-assigned this Dec 29, 2025

Initial plan

f702119

Copilot AI assigned paulpv Dec 29, 2025

Copilot started work on behalf of paulpv December 29, 2025 22:35 View session

Copilot AI and others added 6 commits December 29, 2025 22:38

Implement markdown text filter for better TTS output

f81ec95

Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Update README with markdown filtering documentation

d399088

Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Address code review feedback: remove unused variable and fix punctuat…

5d578da

…ion regex Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Fix regex issues: escape hyphens, handle empty lists, prevent newline…

92d77cf

… matching Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Escape asterisks in regex and refactor bullet list handling to reduce…

5d61a3a

… duplication Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Move formatBulletContent helper outside and simplify comments

78a0d03

Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Add generic spoken text filter for markdown~~ Add markdown text filter for improved TTS output Dec 29, 2025

Copilot AI requested a review from paulpv December 29, 2025 22:51

Copilot finished work on behalf of paulpv December 29, 2025 22:51

Copilot started work on behalf of paulpv December 30, 2025 00:24 View session

Copilot AI and others added 2 commits December 30, 2025 00:26

Add HTML structure-aware text extraction for natural pauses

fa7feb2

Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Update README to document HTML structure-aware text extraction

7022beb

Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Copilot AI changed the title ~~Add markdown text filter for improved TTS output~~ Add markdown and HTML text filter for improved TTS output Dec 30, 2025

Copilot finished work on behalf of paulpv December 30, 2025 00:29

Copilot started work on behalf of paulpv December 30, 2025 01:02 View session

Copilot AI and others added 2 commits December 30, 2025 01:05

Break down content into separate sections for better navigation

ac8240f

Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Update README to document section-based content breakdown

83d80a8

Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Copilot AI changed the title ~~Add markdown and HTML text filter for improved TTS output~~ Add markdown and HTML text filter with granular section navigation for improved TTS Dec 30, 2025

Copilot finished work on behalf of paulpv December 30, 2025 01:08

Adding task seed that results in speaking run-on sentences

ec8ba20

Copilot started work on behalf of paulpv December 30, 2025 01:19 View session

Add extensive logging and fix observer to handle all section elements

1219f1f

Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Copilot finished work on behalf of paulpv December 30, 2025 01:24

Copilot started work on behalf of paulpv December 30, 2025 06:24 View session

Add detailed logging to extractTextSectionsFromHTML to debug missing …

a427360

…sections Co-authored-by: paulpv <1393897+paulpv@users.noreply.github.com>

Copilot finished work on behalf of paulpv December 30, 2025 06:27

Copilot stopped work on behalf of paulpv due to an error January 2, 2026 09:18
Copilot has encountered an error. See logs for additional details.

Copilot started work on behalf of paulpv January 2, 2026 09:30 View session

Copilot finished work on behalf of paulpv January 2, 2026 09:32

Copilot started work on behalf of paulpv January 2, 2026 09:40 View session

Copilot finished work on behalf of paulpv January 2, 2026 09:41

Copilot started work on behalf of paulpv January 2, 2026 09:45 View session

Copilot finished work on behalf of paulpv January 2, 2026 09:46

Conversation

Copilot AI commented Dec 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Implement Markdown Text Filter for Better TTS ✅

Recent Changes (addressing PR comment #3698420191):

Debugging Request:

Comments on the Issue (you are @copilot in this section)

Uh oh!

paulpv commented Dec 30, 2025

Uh oh!

Copilot AI commented Dec 30, 2025

Uh oh!

paulpv commented Dec 30, 2025

Uh oh!

Copilot AI commented Dec 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

paulpv commented Dec 30, 2025

-

, , , " (this is Some HTML text can cause the TTS to stop speaking #15) "Result" title Weirdly the whole "Security * CodeQL analysis passed with 0 alerts * No security vulnerabilities introduced" block!

Uh oh!

Copilot AI commented Dec 30, 2025

Uh oh!

paulpv commented Dec 30, 2025

Uh oh!

Copilot AI commented Dec 30, 2025

Uh oh!

paulpv commented Jan 2, 2026

Uh oh!

paulpv commented Jan 2, 2026

Uh oh!

Copilot AI commented Jan 2, 2026

Uh oh!

paulpv commented Jan 2, 2026

Uh oh!

Copilot AI commented Jan 2, 2026

Uh oh!

paulpv commented Jan 2, 2026

Uh oh!

Copilot AI commented Jan 2, 2026

Issue 1: Invert speech logic to speak everything by default with exclusion filters

Issue 2: Add debug mode with hover tooltips for div class names

Issue 3: Change verbosity dropdown to exclusion-based filtering

Issue 4: Add visual debug indicators for speech queue states

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Dec 29, 2025 •

edited

Loading

Copilot AI commented Dec 30, 2025 •

edited

Loading

,
,
,
" (this is Some HTML text can cause the TTS to stop speaking #15)

"Result" title

Weirdly the whole "Security * CodeQL analysis passed with 0 alerts * No security vulnerabilities introduced" block!