Integration with Merlin #52

Mark-Leisten-ajalaco · 2019-04-12T11:00:16Z

Has anyone looked at connecting the WaveNet vocoder to Merlin (https://github.com/CSTR-Edinburgh/merlin/)? Merlin is an open-source TTS system for acoustic and duration modelling, which uses Ossian or Festival as a front-end. By default it uses the WORLD vocoder and therefore extracts WORLD vocoder features, so an integration with Merlin seems like it should be possible. I'm interested to hear whether anyone has tried this and can offer some guidance.

tuanad121 · 2019-06-10T00:27:00Z

That's interesting. I think you can replace the WORLD synthesis step with WaveNet-based waveform generation.
In their synthesis script (https://github.com/CSTR-Edinburgh/merlin/blob/master/misc/scripts/vocoder/world/synthesis.py), the synthesis part runs from line 120 to the end. The three input files are *.f0, *.sp and *.bapd, all stored as doubles. The *.bapd file holds band aperiodicity (also called coarse aperiodicity). I'm not sure whether our WaveNet-based synthesis uses coarse aperiodicity or full-band aperiodicity (full-band aperiodicity has fft_size / 2 + 1 dimensions).
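To check which kind of aperiodicity a given set of files contains, one option is to load the three files and look at their shapes. The following is only a rough sketch under a few assumptions that are not confirmed in this thread: that the files are headerless raw float64 arrays, that *.sp has fft_size / 2 + 1 values per frame, and that fft_size is 1024. The function name `load_world_features` is made up for illustration and is not part of Merlin or this repository.

```python
import numpy as np


def load_world_features(f0_path, sp_path, bap_path, fft_size=1024):
    """Load WORLD features stored as raw doubles (assumed layout).

    fft_size=1024 is an assumption; use the value from your own
    Merlin configuration.
    """
    # One f0 value per frame, so its length gives the frame count.
    f0 = np.fromfile(f0_path, dtype=np.float64)
    n_frames = f0.shape[0]
    if n_frames == 0:
        raise ValueError("f0 file is empty")

    # Spectral envelope: assumed fft_size / 2 + 1 values per frame.
    sp_dim = fft_size // 2 + 1
    sp = np.fromfile(sp_path, dtype=np.float64)
    if sp.size != n_frames * sp_dim:
        raise ValueError("sp size does not match n_frames * (fft_size/2 + 1)")
    sp = sp.reshape(n_frames, sp_dim)

    # Aperiodicity: infer the per-frame dimension from the file size
    # instead of assuming coarse or full-band.
    ap = np.fromfile(bap_path, dtype=np.float64)
    if ap.size % n_frames != 0:
        raise ValueError("aperiodicity size is not a multiple of n_frames")
    ap = ap.reshape(n_frames, -1)

    return f0, sp, ap
```

If the second dimension of the returned aperiodicity array equals fft_size / 2 + 1, the file holds full-band aperiodicity; a much smaller number suggests band (coarse) aperiodicity.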
