A better STT would make auto-gain not needed in ESPHome Voice Assistants (VPE+others)

### Problem statement

Speech recognition is one of the main reported problems consistently - and solving it has been hard to prove. One main issue (but not the only one) is that auto gain can do as good as harm, depending on the situation. But there is an opportunity to improve this with a better Speak to text (STT). 

For expectations, there is an important chance this makes the Voice experience better, but its not a single solution - just another brick in the wall.

### Community signals

Community survey, although biased towards high-tech users and not clear on a specific problem, showed how there is not a great satisfaction towards voice recognition + achievement of the task. Its not conclusive, but grants the chance to at least explore this opportunity 

<img width="1512" height="700" alt="Image" src="https://github.com/user-attachments/assets/482bf591-f75c-4266-bfaa-5b5ac6fef3e3" />

<img width="1512" height="648" alt="Image" src="https://github.com/user-attachments/assets/94852343-46df-45c6-a459-9745da7442bb" />

### Scope & Boundaries

#### In scope
- New STT
- Being able to test it with and without auto gain

#### Not in scope
- Bigger architectural changes to Voice


### Foreseen solution

Add second audio channel for voice
 - https://github.com/home-assistant/core/pull/169875
 
 esphome PR - https://github.com/esphome/esphome/pull/16265
aioesphomeapi PR - https://github.com/esphome/aioesphomeapi/pull/1625

### Risks & open questions

- Is the solution really better?
- Can we test it propperly?
- How many biases do we meet when testing?

### Appetite

Small - Should be done in one cycle of 2 releases.

### Execution issues

_No response_

### Decision log

| Date | Decision | Outcome |
|------|----------|---------|
|      |          |         |


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A better STT would make auto-gain not needed in ESPHome Voice Assistants (VPE+others) #152

Problem statement

Community signals

Scope & Boundaries

In scope

Not in scope

Foreseen solution

Risks & open questions

Appetite

Execution issues

Decision log

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

A better STT would make auto-gain not needed in ESPHome Voice Assistants (VPE+others) #152

Description

Problem statement

Community signals

Scope & Boundaries

In scope

Not in scope

Foreseen solution

Risks & open questions

Appetite

Execution issues

Decision log

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions