👉https://arxiv.org/pdf/2104.00355.pdf
MSD, F0 RMSE, F0 corr, GPE, FPE
http://www1.se.cuhk.edu.hk/~hccl/publications/pub/2016_paper_297.pdf section 4.2
log dB 4.4
RMSE 22.386
PER, WER
https://github.com/resemble-ai/Resemblyzer tsne
in paper Building Bilingual and Code-Switched Voice Conversion with Limited Training Data Using Embedding Consistency Loss