-
Notifications
You must be signed in to change notification settings - Fork 2.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Converter doesn't work for Chinese? #287
Comments
@Strive-for-excellence 你好! 我是Dosu,我是一个可以帮助您解决问题,回答您的问题,并成为贡献者的机器人。我会在等待人类维护者的过程中帮助您。 The inconsistency in the generated audio compared to the reference audio could be due to several factors:
To address these issues, you can:
Additionally, ensure that you are using the correct base speaker model for Chinese ( If the webpage for OpenVoiceV2 on Hugging Face is unresponsive, you might need to wait until it is accessible again to further assess the model's performance or check for updates. You can also refer to the QnA and multi-lingual & cross-lingual examples for additional guidance [1][2]. |
Hi - The result you get is normal. The generated voice only clones the tone color, but not any other features such as innotations and accent |
When I use https://huggingface.co/spaces/myshell-ai/OpenVoiceV2 for inference, I find that the performance is inconsistent with the reference audio.
Text: 该努力我也努力过了,该尝试的我都尝试过了!
Reference audio: https://github.com/Strive-for-excellence/show/blob/main/ref.mp3
Base speaker generated audio: https://github.com/Strive-for-excellence/show/blob/main/output_v2_3_ori.wav
Generated audio: https://github.com/Strive-for-excellence/show/blob/main/output.wav
The base speaker generated audio is normal, but the audio after conversion is not normal.
The text was updated successfully, but these errors were encountered: