-
Notifications
You must be signed in to change notification settings - Fork 116
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
mel_fmax does not cover all frequency #7
Comments
Running another experiment by setting |
@seungwonpark Actually this is how vocoder works efficiently, we always consider frequency between 0 to 8000 from wavenet to wavernn all vocoder models in between this frequency range, this helps model to consider vocal frequency ( |
@rishikksh20 Thanks for sharing your insight! |
Looks like waveglow's default configuration doesn't allow mel-spectrogram to represent all range of frequency (0~11025Hz): https://github.com/NVIDIA/waveglow/blob/master/config.json
This is a plot of
librosa.filters.mel(22050, 1024, 80, fmin=0.0, fmax=8000.0)
.I think was the reason why waveglow and our implementation of melgan doesn't look to generate high-frequency audio.
The text was updated successfully, but these errors were encountered: