Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

full-duplex streaming for realtime audio transcribe #346

Open
aviadr1 opened this issue Sep 8, 2024 · 1 comment
Open

full-duplex streaming for realtime audio transcribe #346

aviadr1 opened this issue Sep 8, 2024 · 1 comment

Comments

@aviadr1
Copy link

aviadr1 commented Sep 8, 2024

I'm looking to perform online realtime transcription (i.e. with something like whisper-streaming)
for this we need FULL DUPLEX capabilities i.e. the client needs to be able to continually stream data to the server.

I see the https://github.com/replicate/replicate-python?tab=readme-ov-file#run-a-model-and-stream-its-output example which shows the server can stream results, but the input has to be sent initially and I dont see how the client could send more and more input data.

is full duplex streaming supported in replicate or can you add support for it?

@mattt
Copy link
Contributor

mattt commented Sep 9, 2024

Hi @aviadr1. There's nothing about Replicate's platform or client libraries that preclude full duplex streaming. I'm not aware of any public models doing this currently, but you could accomplish this by building a whisper model with Cog that takes a URL input to a stream of audio and outputs cog.ConcatenateIteraor[str] (yield transcript chunks).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants