v2 does not work well with cosyvoice TTS #311

xipingL · 2024-09-19T03:08:11Z

use CosyVoice Chinese woman to generate audio (first video), then use OpenVoice ToneColorConverter to generate audio(third video) according target_se(second video) that has serious electrical tone
use CosyVoice Chinese man to generate audio (fourth video), then use OpenVoice ToneColorConverter to generate audio(sixth video) according target_se(fifth video) that not corresponding timbre was converted

woman_source.mp4

man_target.mp4

man_gen.mp4

man_source.mp4

woman_target.mp4

woman_gen.mp4

Thanks.

dosubot · 2024-09-19T03:09:14Z

@xipingL Hello there! I'm a bot designed to help you with bugs, questions, and contributions. I'm here to assist you while waiting for a human maintainer.

I found a similar open issue: Converter doesn't work for Chinese?, which was last updated on August 03, 2024. The suggested solution includes:

Accent and Emotion: OpenVoice only clones the tone color of the reference speaker and does not clone the accent or emotion. The base speaker TTS model controls these aspects.
Quality of Reference Audio: Ensure the reference audio is clean, without background noise, and does not contain speech from more than one person.
Base Speaker Model: For multi-lingual and cross-lingual usage, ensure you have a base speaker model in the desired language. If not, consider using the OpenAI TTS model as the base speaker.

Additionally, ensure you are using the correct base speaker model for Chinese (zh_base_speaker_tts) and that the style parameter is set to 'default' for Chinese text inputs ^[1].

The issue with v2 not working well with CosyVoice TTS, specifically when using CosyVoice Chinese woman and man to generate audio and then using OpenVoice ToneColorConverter, is likely due to OpenVoice only cloning the tone color and not the accent or emotion. The base speaker models from MeloTTS, which OpenVoice relies on, may not perfectly align with CosyVoice TTS outputs ^[2]^[3]^[4].

To continue talking to Dosu, mention @dosu.

dosubot bot added the bug Something isn't working label Sep 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v2 does not work well with cosyvoice TTS #311

v2 does not work well with cosyvoice TTS #311

xipingL commented Sep 19, 2024

dosubot bot commented Sep 19, 2024

v2 does not work well with cosyvoice TTS #311

v2 does not work well with cosyvoice TTS #311

Comments

xipingL commented Sep 19, 2024

dosubot bot commented Sep 19, 2024