Speech-to-Text for the Ukrainian language based on Silero

This repository contains demonstration code that uses the Silero model for Ukrainian speech-to-text recognition.

Quality:

Measured on the Common Voice 7 test set (4300+ samples):

WER: 0.2318 (that is, word accuracy, 1 - WER, is 76.82%)
CER: 0.0624
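
For readers unfamiliar with these metrics, the following is a minimal sketch of how WER and CER are commonly defined: the edit distance between the reference and the recognized text, divided by the reference length, counted in words or characters. It is an illustration only, not the script that produced the numbers above. The function names are hypothetical, and corpus-level scores are usually computed by summing edits and reference lengths over all samples rather than averaging per-sample values.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences
    (substitutions, insertions and deletions each cost 1)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance / number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edit distance / reference length.
    Spaces are counted as characters in this sketch."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


if __name__ == "__main__":
    # One substituted word out of three reference words.
    print(wer("привіт як справи", "привіт як справа"))
    # One substituted character out of three reference characters.
    print(cer("кіт", "кит"))
```

Under this definition, a WER of 0.2318 means roughly 23 word-level edits per 100 reference words.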

Install the dependencies and enter the Python environment:

pipenv install
pipenv shell

Run the demos:

python test_official_demo.py
python own_recodings_demo.py

Alternatively, install PyTorch and the other libraries with conda:

conda install pytorch torchvision torchaudio cpuonly -c pytorch
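
Before running the demos, it can help to confirm that the libraries are visible to the active interpreter. The snippet below is a small sketch for that purpose, not part of this repository. The helper name `missing_packages` is hypothetical, and the package list assumes the demos import `torch` and `torchaudio`.

```python
from importlib.util import find_spec


def missing_packages(names):
    """Return the top-level package names that cannot be found
    in the current Python environment."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    # Assumed package list; adjust it to what the demos actually import.
    missing = missing_packages(["torch", "torchaudio"])
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages were found.")
```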

Run the demos:

cd C:\path\to\project

python .\test_official_demo.py
python .\own_recodings_demo.py

This setup was tested on Windows x64 only.
