Speaker Recognition

arvind0422 edited this page Jul 5, 2017 · 4 revisions

The system classified 10 speakers from the ELSDSR dataset with 100% accuracy, using 90 seconds of test speech per speaker.
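As a rough illustration of how such a classifier can be assembled from the libraries listed under Installations, the sketch below trains one HMM per speaker on MFCC features and picks the speaker whose model gives the highest log-likelihood. This is an assumption about the approach, not the project's actual code: the function names, the sample rate, the number of MFCCs, and the number of HMM states are all hypothetical choices.

```python
import numpy as np


def extract_mfcc(wav_path, sr=16000, n_mfcc=13):
    """Load one recording and return MFCC frames, shape (n_frames, n_mfcc)."""
    import librosa  # heavy dependency, imported only when audio is read

    signal, sr = librosa.load(wav_path, sr=sr)
    mfcc = librosa.feature.mfcc(y=signal, sr=sr, n_mfcc=n_mfcc)
    # librosa returns (n_mfcc, n_frames); hmmlearn expects frames as rows.
    return mfcc.T


def train_speaker_model(feature_list, n_states=5):
    """Fit one Gaussian HMM on all training utterances of a single speaker."""
    from hmmlearn import hmm

    X = np.vstack(feature_list)                # all frames stacked
    lengths = [len(f) for f in feature_list]   # frames per utterance
    model = hmm.GaussianHMM(n_components=n_states,
                            covariance_type="diag", n_iter=20)
    model.fit(X, lengths)
    return model


def identify_speaker(models, features):
    """Score the test features against every speaker model.

    models maps speaker name -> fitted model with a .score(X) method.
    Returns (best_speaker, scores), where scores maps name -> log-likelihood.
    """
    scores = {name: model.score(features) for name, model in models.items()}
    best = max(scores, key=scores.get)
    return best, scores
```

With one model per enrolled speaker stored in a dict, `identify_speaker(models, extract_mfcc("test.wav"))` would return the most likely speaker; the file name here is a placeholder.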

Workflow

Steps

Future Extensions

Confusion estimate.
Add a threshold to detect whether the speaker is new (not among the trained speakers).
Build a user interface for training and testing with Tkinter.
Test with real-life, noisy data.
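The new-speaker threshold mentioned above could be sketched as follows. It assumes each enrolled speaker's model yields a total log-likelihood for the test utterance. The score is divided by the number of frames so the threshold does not depend on utterance length. The function name and the threshold value are hypothetical; a real threshold would have to be tuned on held-out data.

```python
def accept_or_reject(scores, n_frames, threshold):
    """Return the best-matching speaker, or None if the match is too weak.

    scores    : dict mapping speaker name -> total log-likelihood
    n_frames  : number of feature frames in the test utterance
    threshold : minimum acceptable per-frame log-likelihood (assumed value)
    """
    best = max(scores, key=scores.get)
    per_frame = scores[best] / n_frames
    return best if per_frame >= threshold else None


# Best score is -300 over 10 frames, i.e. -30 per frame, above -40: accepted.
print(accept_or_reject({"a": -500.0, "b": -300.0}, 10, -40.0))  # b
# Best score is -450 over 10 frames, i.e. -45 per frame, below -40: rejected.
print(accept_or_reject({"a": -500.0, "b": -450.0}, 10, -40.0))  # None
```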

Applications

Attendance in classrooms.
Voice-based security in addition to other biometrics.

Installations

NumPy: numpy
Hidden Markov Models: hmmlearn
Speech Signal Analysis: librosa
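A quick way to confirm that the three packages above are available is to check whether Python can locate them. This is only a convenience sketch; the helper name `missing_packages` is made up for illustration.

```python
import importlib.util


def missing_packages(names):
    """Return the subset of top-level package names that cannot be found."""
    return [n for n in names if importlib.util.find_spec(n) is None]


if __name__ == "__main__":
    missing = missing_packages(["numpy", "hmmlearn", "librosa"])
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are installed.")
```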
