-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Adding a dataset #5
Comments
Hi! My data folder is divided like this:
The
I hope it will help! Tell me if it is not clear enough. |
Thanks! I have a lot of mixed content, I should try to gather also clean speech and music... |
Mixed content is technically the best since it represents the goal you want to achieve. It is also the most expensive kind of data. Test, validation and training must be strictly different, otherwise you will overfit the data. I can try to give a more detailed example, if you have a custom dataset with 3 audio of mixed content:
With
With
With
For the
Is it clearer? |
Sure, thank you! |
Hi, I successfully installed everything and added my own datasets, but when issuing
Of course dataset.json is at the project's root, and contains:
What am I missing? |
Hi, I would like to add my own dataset, which consists in several mixed speech/music files.
In the README you mention the need for creating two different folders, but I cannot fully understand the "repartition of the data between each set for each type of label" part...
Could you please give us an example dir tree to use as reference?
Thanks
The text was updated successfully, but these errors were encountered: