Skip to content

A Speech To Text Website using OpenAI's whisper as the core of the STT transcription. The transcriptions are then stored as records in a MongoDB database

Notifications You must be signed in to change notification settings

aHishamm/STT-System-using-OpenAI-s-Whisper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

Whisper API Test

The python script simply uses the Whisper API as the backend and Gradio web app framework as the frontend

Prerequisite Packages

pip install gradio 
pip install git+https://github.com/openai/whisper.git 
pip install -U pip setuptools wheel
pip install -U spacy
python -m spacy download en_core_web_sm
python -m spacy download xx_ent_wiki_sm
pip install spacy-fastlang

ffmpeg should also be installed

# on Ubuntu or Debian
sudo apt update && sudo apt install ffmpeg

# on Arch Linux
sudo pacman -S ffmpeg

# on MacOS using Homebrew (https://brew.sh/)
brew install ffmpeg

# on Windows using Chocolatey (https://chocolatey.org/)
choco install ffmpeg

# on Windows using Scoop (https://scoop.sh/)
scoop install ffmpeg

About

A Speech To Text Website using OpenAI's whisper as the core of the STT transcription. The transcriptions are then stored as records in a MongoDB database

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages