The link to the training dataset is no longer working #35
shiningsword
started this conversation in
General
Replies: 2 comments
-
The dataset link is the older model's dataset. The newer one's dataset is a derivative of People Common Speech because of which it is huge, will try to upload it up this weekend |
Beta Was this translation helpful? Give feedback.
0 replies
-
You can consider using this repo in huggingface datasets https://huggingface.co/datasets/MLCommons/ml_spoken_words/tree/main/data/wav/en |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
This is really great stuff. We were looking at using a tranformer model to further improve the result. Is there any way to get access to your dataset? The link in the readme to the Google drive is no longer working.
Beta Was this translation helpful? Give feedback.
All reactions