Failed to process mp3 file #309

lemuriandezapada opened this issue Sep 18, 2024 · 1 comment
Labels
bug Something isn't working

Comments

@lemuriandezapada

Trying to run demo_part3. Simply replaced the mp3 with a different one containing some speech and the se_extractor just fails. Ideas as to why?

`NotImplementedError Traceback (most recent call last)
Cell In[6], line 3
1 ''
2 reference_speaker = 'resources/20-1 [vocals].mp3' # This is the voice you want to clone
----> 3 target_se, audio_name = se_extractor.get_se(reference_speaker, tone_color_converter, vad=False)

File ~/repos/OpenVoice/openvoice/se_extractor.py:150, in get_se(audio_path, vc_model, target_dir, vad)
148 audio_segs = glob(f'{wavs_folder}/*.wav')
149 if len(audio_segs) == 0:
--> 150 raise NotImplementedError('No audio segments found!')
152 return vc_model.extract_se(audio_segs, se_save_path=se_path), audio_name

NotImplementedError: No audio segments found!`

dosubot bot commented Sep 18, 2024

Hello, @lemuriandezapada! I'm here to help you with any issues you're facing. Let's work together to solve this problem!

se_extractor.get_se raises NotImplementedError because the segmentation step wrote no .wav files into wavs_folder, so the glob on line 148 of se_extractor.py returns an empty list. This can happen when none of the segments cut from the mp3 file pass the segmentation filter. Ensure that the mp3 file:

  1. Contains clear speech.
  2. Has segments longer than 1.5 seconds and shorter than 20 seconds.
  3. Produces text segments between 2 and 200 characters when processed.
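
As a rough illustration of the three criteria above, here is a minimal sketch of such a filter. The function name `segment_passes_filter` and the exact boundary handling (strict on duration, inclusive on text length) are assumptions for illustration, not code taken from se_extractor.py:

```python
def segment_passes_filter(duration_s: float, text: str) -> bool:
    """Hypothetical check mirroring the criteria listed above.

    duration_s: length of the audio segment in seconds.
    text: the text produced for that segment.
    """
    # Criterion 2: longer than 1.5 s and shorter than 20 s.
    duration_ok = 1.5 < duration_s < 20.0
    # Criterion 3: between 2 and 200 characters (inclusive bounds assumed).
    text_ok = 2 <= len(text) <= 200
    return duration_ok and text_ok
```

If every segment of a file fails a check like this, nothing is saved and the "No audio segments found!" error follows.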

Additionally, the failing call passes vad=False; try setting the vad parameter to True instead:

target_se, audio_name = se_extractor.get_se(reference_speaker, tone_color_converter, vad=True)

This might help in successfully processing the mp3 file [1][2].
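
If you want to try both settings without editing the notebook cell each time, a small wrapper along these lines may help. This is only a sketch: `get_se_with_fallback` is a hypothetical helper, and it assumes the extractor is called as `get_se(audio_path, vc_model, vad=...)` and raises NotImplementedError when no segments are found, as in the traceback above:

```python
def get_se_with_fallback(get_se, reference_speaker, converter):
    """Hypothetical helper: try vad=False first, then retry with vad=True."""
    last_error = None
    for use_vad in (False, True):
        try:
            # Returns whatever get_se returns, e.g. (target_se, audio_name).
            return get_se(reference_speaker, converter, vad=use_vad)
        except NotImplementedError as err:
            # No segments were found with this setting; try the next one.
            last_error = err
    # Both settings failed, so surface the last error to the caller.
    raise last_error
```

In the demo notebook this would be called as `target_se, audio_name = get_se_with_fallback(se_extractor.get_se, reference_speaker, tone_color_converter)`. If both settings fail, the problem is more likely in the audio file itself than in the vad flag.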

@dosubot dosubot bot added the bug Something isn't working label Sep 18, 2024