Kid Mode: wire content_filter() into the live voice path (PiVoiceLLM + OpenAICompat output) #157

Scoped implementation sub-task of #138. The parent describes the gap and its history; this issue is the concrete remaining work: the post-generation blocked-words filter on the **live voice path**. Everything else split out of #138 (C3→#53→#153; dashboard ingress→#146; H4→admin-auth epic) has already landed; this is the one piece left.

## Problem (recap)
Both live LLM providers apply **prompt-level steering only** (`build_turn_suffix(kid_mode)`). There is no post-generation filter on TTS-bound LLM output. `content_filter()` exists but is **bridge-only** (`bridge/text.py:143`) and runs in the bridge container — `custom-providers/pi_voice/pi_voice.py` and `custom-providers/openai_compat/openai_compat.py` never import it.

## Design — single source of truth, no drift
`content_filter()` today bundles a **pure matcher** (three regex tiers `_CF_TIER_{ALERT,LOG,REDIRECT}_RE` → `CONTENT_FILTER_REPLACEMENT`) with **bridge-only side effects** (Prometheus `dotty_content_filter_hits_total`, the `/ui/safety/recent` ring, structured logging). The xiaozhi container has none of that infra, so split the two:

1. **Lift the pure core into `custom-providers/textUtils.py`**: the module already bind-mounts into the xiaozhi container as `core.utils.textUtils` and is already the canonical safety-constants home imported by both providers (alongside `build_turn_suffix`, `ALLOWED_EMOJIS`). Add the tier regexes, `CONTENT_FILTER_REPLACEMENT`, and a side-effect-free `content_filter_match(text) -> str | None` (returns the matched tier name, or `None` when the text is clean).
2. **`bridge/text.py` imports the core from the shared module** and keeps its wrapper (metrics + ring + logging) on top — so the bridge behaviour is unchanged but the regexes stop being duplicated/driftable.
3. **`pi_voice.py` + `openai_compat.py`** call the shared matcher on **TTS-bound output**, gated on `kid_mode`, substituting `CONTENT_FILTER_REPLACEMENT` on a hit. Lightweight local logging only (no bridge metrics in this container).
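
The split in steps 1-3 could look roughly like the sketch below. It is illustrative only: the patterns and the replacement string are placeholders (the real ones live in `bridge/text.py`), the tier names and their alert > log > redirect precedence are assumptions, and `filter_tts_output`, `bridge_content_filter`, and the `on_hit` callback are hypothetical names standing in for the provider call site and the bridge wrapper.

```python
import re
from typing import Callable, Optional

# Placeholder patterns: the real tier regexes live in bridge/text.py and are
# not reproduced here.
_CF_TIER_ALERT_RE = re.compile(r"\balertword\b", re.IGNORECASE)
_CF_TIER_LOG_RE = re.compile(r"\blogword\b", re.IGNORECASE)
_CF_TIER_REDIRECT_RE = re.compile(r"\bredirectword\b", re.IGNORECASE)

# Placeholder text; the real CONTENT_FILTER_REPLACEMENT is defined in the repo.
CONTENT_FILTER_REPLACEMENT = "Let's talk about something else!"

# Assumed precedence: the first matching tier wins.
_TIERS = (
    ("alert", _CF_TIER_ALERT_RE),
    ("log", _CF_TIER_LOG_RE),
    ("redirect", _CF_TIER_REDIRECT_RE),
)


def content_filter_match(text: str) -> Optional[str]:
    """Pure matcher (step 1): return the matched tier name, or None if clean."""
    for tier, pattern in _TIERS:
        if pattern.search(text):
            return tier
    return None


def bridge_content_filter(text: str, on_hit: Callable[[str], None]) -> str:
    """Bridge wrapper (step 2, hypothetical shape): side effects sit on top.

    on_hit stands in for the Prometheus counter, the safety ring, and logging.
    """
    tier = content_filter_match(text)
    if tier is None:
        return text
    on_hit(tier)
    return CONTENT_FILTER_REPLACEMENT


def filter_tts_output(text: str, kid_mode: bool) -> str:
    """Provider call site (step 3, hypothetical helper): gated on kid_mode."""
    if kid_mode and content_filter_match(text) is not None:
        return CONTENT_FILTER_REPLACEMENT
    return text
```

With this shape the regexes exist in exactly one module, and the bridge and the providers differ only in what they do after a match.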

## Open design question — streaming
The voice path streams TTS-bound text incrementally. A blocked term can straddle chunk boundaries. Decide and document one of:
- buffer-and-check per **sentence** boundary (`_SENTENCE_BOUNDARY` already in textUtils) before emit, or
- accumulate the full turn and filter before the TTS handoff (simpler; adds latency).

Pick the one that fits how each provider currently hands text to xiaozhi. They differ: pi_voice is RPC-streamed from the pi agent; openai_compat is a direct OpenAI-style stream.
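
To make the sentence-boundary option concrete, here is a minimal sketch of a buffering generator. Everything in it is an assumption for illustration: the `_SENTENCE_BOUNDARY` pattern is a stand-in (the real one in textUtils is not shown in this issue), `REPLACEMENT` is a placeholder, `filter_stream` is a hypothetical name, and the policy of emitting the replacement once and dropping the rest of the turn on a hit is one possible choice, not a decision.

```python
import re
from typing import Callable, Iterable, Iterator, Optional

# Stand-in for textUtils._SENTENCE_BOUNDARY; the real pattern may differ.
_SENTENCE_BOUNDARY = re.compile(r"(?<=[.!?])\s+")
REPLACEMENT = "[filtered]"  # placeholder for CONTENT_FILTER_REPLACEMENT


def filter_stream(
    chunks: Iterable[str],
    match: Callable[[str], Optional[str]],
    kid_mode: bool,
) -> Iterator[str]:
    """Hold text back until a sentence is complete, then check it before emit."""
    if not kid_mode:
        # Kid mode off: pass chunks through untouched.
        yield from chunks
        return

    buffer = ""
    for chunk in chunks:
        buffer += chunk
        parts = _SENTENCE_BOUNDARY.split(buffer)
        buffer = parts.pop()  # trailing piece may be an incomplete sentence
        for sentence in parts:
            if match(sentence) is not None:
                # Assumed policy: replace once and end the turn.
                yield REPLACEMENT
                return
            # Boundary whitespace is normalised to a single space.
            yield sentence + " "

    # End of stream: check whatever is left in the buffer.
    if buffer:
        yield REPLACEMENT if match(buffer) is not None else buffer
```

Because a term is only checked once its sentence is complete, a blocked word split across chunks (for example `"ba"` then `"dword"`) is still caught. Sentences emitted before the hit have already gone to TTS, which is the trade-off against the full-turn option.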

## Acceptance
- [ ] Pure matcher lives in `custom-providers/textUtils.py`; `bridge/text.py` imports it; no regex duplication.
- [ ] `pi_voice` substitutes `CONTENT_FILTER_REPLACEMENT` when `kid_mode` is on and a tier matches; clean text passes untouched.
- [ ] `openai_compat` does the same.
- [ ] `kid_mode` **off** → no filtering, zero behaviour change.
- [ ] Unit tests for both providers (hit / clean / kid-off), mirroring `tests/test_dashboard_say_filter.py` from #146.
- [ ] **Red-team verify on-device** — must not ship unverified (per #138).
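
The hit / clean / kid-off matrix from the unit-test item could be laid out as below. This is a sketch under stated assumptions: `provider_output` is a hypothetical stand-in for each provider's filtered output path, and the blocked pattern and replacement are placeholders. Real tests would exercise `pi_voice` and `openai_compat` directly and follow the layout of `tests/test_dashboard_say_filter.py`, which is not reproduced here.

```python
import re

# Placeholders: not the real tier regexes or replacement string.
_BLOCKED_RE = re.compile(r"\bblockedword\b", re.IGNORECASE)
REPLACEMENT = "[filtered]"


def provider_output(text: str, kid_mode: bool) -> str:
    """Hypothetical stand-in for a provider's TTS-bound output path."""
    if kid_mode and _BLOCKED_RE.search(text):
        return REPLACEMENT
    return text


# (case name, LLM output, kid_mode, expected TTS-bound text)
CASES = [
    ("hit", "say blockedword now", True, REPLACEMENT),
    ("clean", "tell me about dinosaurs", True, "tell me about dinosaurs"),
    ("kid-off", "say blockedword now", False, "say blockedword now"),
]


def test_filter_matrix() -> None:
    """Run every case; the case name is reported on failure."""
    for name, text, kid_mode, expected in CASES:
        assert provider_output(text, kid_mode) == expected, name
```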

## Caveat to keep honest
A blocked-words regex on LLM output is a weak, bypassable layer — prompt steering remains the primary defence and `docs/faq.md` should keep its honest "not a guarantee" framing. This closes the *advertised-but-missing* gap; it is not a content-safety guarantee.

Parent: #138
