GitHub - aHishamm/STT-System-using-OpenAI-s-Whisper: A Speech To Text Website using OpenAI's whisper as the core of the STT transcription. The transcriptions are then stored as records in a MongoDB database

Whisper API Test

The Python script uses the Whisper API as the backend and the Gradio web app framework as the frontend.
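The backend/frontend split described above can be sketched roughly as follows. This is a minimal illustration and not the contents of whisperAPITest.py: the function names `get_model`, `transcribe`, `save_record`, and `build_app`, the `"base"` model size, and the record fields are all assumptions. `save_record` is a hypothetical helper that reflects the repository description (transcriptions stored as MongoDB records) and expects a pymongo collection to be passed in.

```python
from datetime import datetime, timezone
from functools import lru_cache


@lru_cache(maxsize=1)
def get_model(name="base"):
    """Load a Whisper model once and reuse it for every request."""
    import whisper  # the package installed from github.com/openai/whisper

    return whisper.load_model(name)


def transcribe(audio_path, model=None):
    """Return the transcript of the audio file at audio_path."""
    if not audio_path:  # Gradio passes None when no audio was supplied
        return ""
    if model is None:
        model = get_model()
    result = model.transcribe(audio_path)  # dict with a "text" key
    return result["text"].strip()


def save_record(collection, audio_path, text):
    """Hypothetical helper: store one transcript as a MongoDB document."""
    record = {
        "audio_path": audio_path,
        "text": text,
        "created_at": datetime.now(timezone.utc),
    }
    collection.insert_one(record)  # pymongo Collection.insert_one
    return record


def build_app():
    """Wrap transcribe() in a minimal Gradio interface."""
    import gradio as gr

    return gr.Interface(
        fn=transcribe,
        inputs=gr.Audio(type="filepath"),
        outputs="text",
        title="Whisper API Test",
    )
```

Calling `build_app().launch()` would start the local Gradio server. Whisper and Gradio are imported lazily inside the functions, so `transcribe` and `save_record` can be exercised with stand-in objects before the heavier packages are installed.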

Prerequisite Packages

pip install gradio 
pip install git+https://github.com/openai/whisper.git 
pip install -U pip setuptools wheel
pip install -U spacy
python -m spacy download en_core_web_sm
python -m spacy download xx_ent_wiki_sm
pip install spacy-fastlang

ffmpeg must also be installed, because Whisper uses it to decode audio files:

# on Ubuntu or Debian
sudo apt update && sudo apt install ffmpeg

# on Arch Linux
sudo pacman -S ffmpeg

# on macOS using Homebrew (https://brew.sh/)
brew install ffmpeg

# on Windows using Chocolatey (https://chocolatey.org/)
choco install ffmpeg

# on Windows using Scoop (https://scoop.sh/)
scoop install ffmpeg
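After installing, it can help to confirm that ffmpeg is actually reachable before starting the app. The sketch below is one way to do that from Python using only the standard library; the function name `ffmpeg_available` is an assumption and is not part of the repository.

```python
import shutil


def ffmpeg_available(executable="ffmpeg"):
    """Return True if the given executable can be found on PATH."""
    return shutil.which(executable) is not None


if not ffmpeg_available():
    print("ffmpeg was not found on PATH; install it with one of the commands above.")
```

A check like this only looks for the binary on PATH. It does not verify the ffmpeg version or that a particular audio format can be decoded.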

Repository files:

pymongo_get_database.py
readme.md
whisperAPITest.py
