Thank you very much for your reply, sir. I still have some questions about dataset preparation:
1. Do ori_wav and codec_wav correspond to the high-quality audio and the low-quality (codec-compressed) audio, respectively? Is my understanding correct?
2. Why does the configuration file refer to HDF5 while the training code reads WAV audio? What is the difference between the two?
3. What is the final format of the audio data fed into the model: HDF5 or WAV?
4. Is there any requirement for the sampling rate?
5. Is it necessary to use the Apollo-data-preprocess code to convert the data to HDF5 format before training the model?