How to upsamle phoneme embedding (i.e., duration prediction) for semantics tokens? #23

Jiaxin-Ye · 2024-09-26T04:35:31Z

Hi! Thank you for your awesome work! I am a freshman on TTS, and I don't see any text-speech alignment method on this project. I wonder whether the T5 model can automatically upsample the semantics token?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to upsamle phoneme embedding (i.e., duration prediction) for semantics tokens? #23

How to upsamle phoneme embedding (i.e., duration prediction) for semantics tokens? #23

Jiaxin-Ye commented Sep 26, 2024

How to upsamle phoneme embedding (i.e., duration prediction) for semantics tokens? #23

How to upsamle phoneme embedding (i.e., duration prediction) for semantics tokens? #23

Comments

Jiaxin-Ye commented Sep 26, 2024