I have tested the --stream option and found that, regardless of the input, it stops transcribing after the first [EOU] is encountered. It continues to process the rest of the WAV file, but no further text is returned. This happens on both CUDA and Vulkan.
As a concrete example, the "diffusion2023-07-03" sample from whisper.cpp exhibits this behavior:
https://github.com/ggml-org/whisper.cpp/blob/574fc0da69bcf2da3262e40d1b4009341df3d53f/Makefile#L20