Is there a means to insert an event handler as the speech audio is outputted that would enable reporting in real-time the "Visemes or Phonemes" so speech could be synchronized with animated mouth movements? Example is like the c# Microsoft "SpeechSynthesizer" class where it fires a "VisemeReached" event.
Is there a means to insert an event handler as the speech audio is outputted that would enable reporting in real-time the "Visemes or Phonemes" so speech could be synchronized with animated mouth movements? Example is like the c# Microsoft "SpeechSynthesizer" class where it fires a "VisemeReached" event.