- Two speech emotion recognition models are trained and act as independent classifiers.
- Their predictions are combined with an ensemble method, which outputs the final prediction.
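For illustration, below is a minimal sketch of two possible ensemble functions over the two models' probability outputs; the functions actually implemented in Ensemble.ipynb may differ.

```python
import numpy as np

def average_ensemble(probs_a, probs_b):
    """Average the two models' class probabilities (one possible ensemble function)."""
    return (probs_a + probs_b) / 2.0

def max_confidence_ensemble(probs_a, probs_b):
    """For each file, keep the prediction of whichever model is more confident."""
    stacked = np.stack([probs_a, probs_b])               # shape: (2, n_files, n_emotions)
    more_confident = stacked.max(axis=2).argmax(axis=0)  # index of the more confident model per file
    return stacked[more_confident, np.arange(probs_a.shape[0])]

# probs_a / probs_b: (n_files, n_emotions) arrays loaded from each model's prediction CSV.
# Final label per file: np.argmax(average_ensemble(probs_a, probs_b), axis=1)
```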
- Ensemble.ipynb:
- Code for running the ensemble step.
- Reads the prediction CSV files produced by each model (each row contains the per-emotion probabilities for one audio file).
- Combines the predictions using one of several selectable ensemble functions.
- Writes a CSV file for submission to EvalAI (the column names fileID and Emotion must be added manually afterwards; see the sketch below).
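One way to add the missing column names before submission; the file names here are placeholders, not the notebook's actual output names.

```python
import pandas as pd

# Load the headerless ensemble output and add the column names EvalAI expects.
submission = pd.read_csv("ensemble_output.csv", header=None)
submission.columns = ["fileID", "Emotion"]
submission.to_csv("submission.csv", index=False)
```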
- speech_emotion_recognition_XJHe:
- This model is implemented based on the paper at https://ieeexplore.ieee.org/document/8421023.
- To run it, execute train.ipynb and then test.ipynb (check the paths to the training and validation data in extract_mel.py; a feature-extraction sketch follows the dependency list).
- Dependencies
- tensorflow == 1.5.0
- sklearn
- matplotlib
- python_speech_features
- wave
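The exact features computed in extract_mel.py are not reproduced here, but a minimal sketch of log-mel filterbank extraction using the listed dependencies (wave, python_speech_features) might look like the following; the number of filters and the wav path are placeholders to adapt to your data layout.

```python
import wave
import numpy as np
from python_speech_features import logfbank

def extract_mel(wav_path, nfilt=40):
    """Compute a log mel filterbank matrix (frames x nfilt) from a 16-bit PCM wav file."""
    wf = wave.open(wav_path, "rb")
    sample_rate = wf.getframerate()
    signal = np.frombuffer(wf.readframes(wf.getnframes()), dtype=np.int16)
    wf.close()
    return logfbank(signal, samplerate=sample_rate, nfilt=nfilt)

# mel = extract_mel("data/train/example.wav")  # placeholder path; match it to your train/val folders
```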