Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dictation note taking with summary #1

Open
antoni-devlin opened this issue Jan 3, 2024 · 0 comments
Open

Dictation note taking with summary #1

antoni-devlin opened this issue Jan 3, 2024 · 0 comments

Comments

@antoni-devlin
Copy link
Owner

What is it?

A tool that accepts an audio file of dictated notes, transcribes the file into text, and uses an LLM to create a summary.

How would it work?

  1. User uploads an audio file
  2. Chunking function cuts the file up into 30 seconds chunks (as this is the only length Whisper ASR can work with) and saves them to the filesystem
  3. Transcription function processes the chunks one at a time, passes them over to Whisper ASR, and writes the transcript to a text file.
  4. The finished transcript is passed to the summarisation function, which runs it through an LLM prompted with something like "Summarise these dictated notes in markdown format."
  5. The finished transcript and summary are saved to the file system.

Tech stack

  • Python (and Flask?)
  • Whisper ASR (model run locally)
  • LLM for text summarisation (chatGPT? I'd prefer to do this for free...)

Issues

  • Using Python as I couldn't find any evidence that it's possible to run Whisper ASR locally using node, but there is a Python package for this
  • I don't know if there's an LLM I can use for free to do the summarisation.

Enhancments

  • Generate tags from the summarised notes, and save the tagged summary to an obsidian vault for future reference

Proof of concept

There's a proof of concept of the file chunking and transcription parts of the programme in this Gist.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant