lexicaps

www.lexicaps.com
Transcription and Diarization based on OpenAI's Whisper. It augments Whisper's transcription by adding "speaker tags", so you know who says what. Currently it works for 2 speakers and is tested for English.
I trained a classifier on top of Whisper model features (medium.en) that identifies any two speakers. No third-party package is used for Diarization.
Integrated with Whisper, it provides a full Transcription-Diarization service.
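
To illustrate how speaker tags can be combined with a transcript, here is a minimal sketch. It is not this repo's actual code. It assumes the segments look like the `segments` list returned by Whisper's `transcribe` (dicts with `start`, `end`, `text`) and that a two-speaker classifier has already produced one id (0 or 1) per segment. The names `Turn`, `tag_segments`, `format_transcript`, and the sample data are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    speaker: str
    start: float
    end: float
    text: str


def tag_segments(segments, speaker_ids, names=("Speaker 1", "Speaker 2")):
    """Attach a speaker tag to each segment and merge consecutive
    segments spoken by the same speaker into a single turn."""
    if len(segments) != len(speaker_ids):
        raise ValueError("need exactly one speaker id per segment")
    turns = []
    for seg, sid in zip(segments, speaker_ids):
        label = names[sid]
        text = seg["text"].strip()
        if turns and turns[-1].speaker == label:
            # Same speaker keeps talking: extend the current turn.
            turns[-1].end = seg["end"]
            turns[-1].text += " " + text
        else:
            turns.append(Turn(label, seg["start"], seg["end"], text))
    return turns


def format_transcript(turns):
    """Render turns as one '[start-end] speaker: text' line each."""
    return "\n".join(
        f"[{t.start:.1f}-{t.end:.1f}] {t.speaker}: {t.text}" for t in turns
    )


# Hypothetical sample data: transcription segments plus classifier output.
segments = [
    {"start": 0.0, "end": 2.5, "text": " Hi, thanks for joining."},
    {"start": 2.5, "end": 4.0, "text": " Happy to be here."},
    {"start": 4.0, "end": 6.2, "text": " It's been a while."},
]
speaker_ids = [0, 1, 1]

turns = tag_segments(segments, speaker_ids)
print(format_transcript(turns))
```

Merging consecutive same-speaker segments keeps the output readable as a dialogue rather than a list of short fragments.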
Give it a try or view a Sample.
Thanks @karpathy for the fun project
Thanks @sidhantls for the inspiring repo

ToDo

- Add large-v2 Whisper model