-
Notifications
You must be signed in to change notification settings - Fork 65
Voice Activity Detection #21
Description
I would like to use Concentus in an app that does Speech-To-Text conversion. I need to be able to detect the end of sentences by monitoring voice activity and identifying segments of speech terminated by periods of silence. I know Opus has Voice Activity Detection, but looking through the Concentus source code, VAD seems to only be used in internal classes for DTX, with no exposed public classes/methods. Ideally I'd be able to poll the encoder and get a count of recent consecutive silence frames, then capture the sentence after a the silence frame threshold has been reached, and then submit that sentence to the STT engine.
Is there any way to get access to the built-in VAD status on the encoder? Or any other way to achieve what I want to achieve?
And thankyou for this library!!!! :)