Conversation transcriptor

This is a tool that let's you extract a conversation in an audio file into a transcription text.

Transcription model

The tool consists of a logic that combines the output of two awesome AI models:

whisper
pyannote

Usage

In order to run the tool, several components need to be set up correctly:

the tool requires external libraries like ffmpeg to manipulate audio files
all Python package requirements
pyannote can only be used with an appropriate authentication token from HuggingFace
to reduce the duration of the inference one ideally has access to a GPU

The easiest way to access the tool is by:

creating a pyannote token
using the Google Colab notebook

Installation

In order to use the click command app, the package needs to be installed locally with:

pip install --editable .

Application with Click

To see a list of available click commands, use:

python click_app.py --help

To see more details about a given click command, use e.g.:

python click_app.py click-wav-to-transcript --help

Alternatively, commands are also listed with shortcuts in setup.py. E.g.:

from_wav --help

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.github/workflows		.github/workflows
convscript		convscript
test		test
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
Readme.md		Readme.md
click_app.py		click_app.py
pytest.ini		pytest.ini
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Conversation transcriptor

Transcription model

Usage

Installation

Application with Click

About

Releases

Packages

Languages

License

cgroll/conversation-transcriptor

Folders and files

Latest commit

History

Repository files navigation

Conversation transcriptor

Transcription model

Usage

Installation

Application with Click

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages