lexicaps

www.lexicaps.com
Transcription and diarization based on OpenAI's Whisper.
It augments Whisper's transcription by adding "speaker tags", so you know who says what. It currently works for two speakers and has been tested on English.
I trained a classifier on top of features from the Whisper model (medium.en) that distinguishes between any two speakers. No third-party package is used for diarization.
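To illustrate the idea of separating two speakers from per-segment features, here is a minimal sketch. It is not the classifier used in this project: the README does not describe that model, so the sketch swaps in plain 2-means clustering. The function name `assign_two_speakers` and the input format (one feature vector per transcript segment, for example a pooled Whisper encoder output) are assumptions made for illustration.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def _dist(a: Vector, b: Vector) -> float:
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def _mean(vectors: List[Vector]) -> List[float]:
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def assign_two_speakers(features: List[Vector], iterations: int = 20) -> List[int]:
    """Label each feature vector 0 or 1 with 2-means clustering.

    Hypothetical stand-in for a trained speaker classifier: it only
    groups similar vectors and knows nothing about speaker identity.
    """
    if len(features) < 2:
        return [0] * len(features)

    # Seed the centroids with the first vector and the vector farthest from it.
    c0 = list(features[0])
    c1 = list(max(features, key=lambda f: _dist(f, c0)))

    labels: List[int] = []
    for _ in range(iterations):
        new_labels = [0 if _dist(f, c0) <= _dist(f, c1) else 1 for f in features]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        group0 = [f for f, lab in zip(features, labels) if lab == 0]
        group1 = [f for f, lab in zip(features, labels) if lab == 1]
        if group0:
            c0 = _mean(group0)
        if group1:
            c1 = _mean(group1)
    return labels
```

A trained classifier would replace the clustering step, but the output has the same shape: one speaker label per segment.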
Integrated with Whisper, it provides a full Transcription-Diarization service.
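As a rough sketch of how transcription and diarization results might be combined, the function below merges transcript segments with per-segment speaker labels into a tagged transcript. The segment dictionaries follow the `start`/`end`/`text` layout of the `segments` list returned by Whisper's `transcribe`; the function name `tag_transcript` and the `SPEAKER 1:` tag format are assumptions, not this project's actual output format.

```python
from typing import Dict, List


def tag_transcript(segments: List[Dict], labels: List[int]) -> str:
    """Prefix transcript text with speaker tags.

    `segments` are dicts with a "text" key; `labels` holds one speaker
    index (0 or 1) per segment. Consecutive segments from the same
    speaker are merged into a single line.
    """
    if len(segments) != len(labels):
        raise ValueError("need exactly one speaker label per segment")

    lines: List[str] = []
    previous = None
    for segment, label in zip(segments, labels):
        text = segment["text"].strip()
        if not text:
            continue  # skip empty segments
        if label == previous:
            lines[-1] += " " + text
        else:
            lines.append(f"SPEAKER {label + 1}: {text}")
            previous = label
    return "\n".join(lines)


if __name__ == "__main__":
    demo_segments = [
        {"start": 0.0, "end": 1.5, "text": " Hello there."},
        {"start": 1.5, "end": 3.0, "text": " How are you?"},
        {"start": 3.0, "end": 4.0, "text": " Fine, thanks."},
    ]
    print(tag_transcript(demo_segments, [0, 0, 1]))
```

Merging consecutive segments keeps the transcript readable when one speaker talks across several short segments.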
Give it a try or view a sample.
Thanks @karpathy for the fun project.
Thanks @sidhantls for the inspiring repo.

ToDo

-- Add large-v2 Whisper model
