Speaker-Identification-One-Shot-Learning

This IPython notebook contains a Siamese neural network implementation that calculates a similarity score for a pair of voice recordings. The score is high when both voices belong to the same speaker and low when they belong to different speakers.
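
As a rough illustration of the Siamese idea (not the notebook's actual network), the sketch below passes both inputs through the same embedding function and turns the distance between the embeddings into a score between 0 and 1. The names `embed` and `similarity_score`, the random untrained weights, and the input size of 16 are all assumptions made for this example.

```python
import numpy as np

def embed(x, weights):
    """Shared 'tower': the same weights are applied to every input."""
    return np.tanh(x @ weights)

def similarity_score(a, b, weights, out_w, out_b=0.0):
    """Score in (0, 1): sigmoid of a weighted sum of |embed(a) - embed(b)|."""
    diff = np.abs(embed(a, weights) - embed(b, weights))
    return 1.0 / (1.0 + np.exp(-(diff @ out_w + out_b)))

rng = np.random.default_rng(0)
weights = rng.normal(size=(16, 8))  # hypothetical, untrained shared weights
out_w = -np.ones(8)                 # negative weights: larger distance, lower score
voice_a = rng.normal(size=16)       # stand-ins for two voice feature vectors
voice_b = rng.normal(size=16)

same = similarity_score(voice_a, voice_a, weights, out_w, out_b=2.0)
different = similarity_score(voice_a, voice_b, weights, out_w, out_b=2.0)
print(same > different)
```

Because both inputs go through the same weights, the score is symmetric in its two arguments, and identical inputs always receive the highest score the output layer can produce.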

The model was trained on the LibriSpeech dataset. It first converts the audio to spectrogram images and then compares them using a convolutional neural network. The notebook was created in Google Colab, so it contains Linux shell commands and will not run unmodified in a local Anaconda IPython notebook.
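
The audio-to-spectrogram step can be sketched with NumPy alone. This is a minimal example under assumed settings, not the notebook's own preprocessing: the function name `spectrogram`, the frame length of 256, the hop of 128, and the synthetic 1 kHz test tone are all choices made for illustration. The 16 kHz sample rate matches LibriSpeech recordings.

```python
import numpy as np

def spectrogram(signal, frame_len=256, hop=128):
    """Log-magnitude spectrogram via a short-time Fourier transform."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack([signal[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    return np.log1p(magnitude).T  # shape: (frequency bins, time frames)

sample_rate = 16000                  # Hz
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 1000 * t)  # one second of a synthetic 1 kHz tone
spec = spectrogram(tone)
print(spec.shape)
```

The resulting 2-D array can be treated as a single-channel image, which is the kind of input a convolutional network expects.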

Repository contents:
.ipynb_checkpoints
Generated_images
Test_audio_files
LICENSE.md
README.md
SpeakerID_CNN.ipynb
SpeakerID_best.hdf5
Test.ipynb

VikasOjha666/Speaker-Identification-One-Shot-Learning