Replies: 4 comments 3 replies
-
Hi @zh-plus, I was wondering: does the code above disable the threshold parameters, or does it set them to their defaults? Thanks for sharing your insights.
-
Hi @zh-plus, I get an error when I try to set `log_prob_threshold` to None. Is there anything I am doing wrong?
Error message is:
Thanks.
-
Thanks for looking into it, @zh-plus. I'm afraid I didn't save the entire error message. However, I turned to GPT-4, and "his" recommendation was to change to
-
Does it actually work?
-
Recently I found that the large-v2 model often drops some transcribed segments for Chinese and Japanese because of `compression_ratio_threshold`, `log_prob_threshold`, and `no_speech_threshold`, even though the dropped transcription is actually correct.

Is it good practice to disable these three parameters entirely by:

when setting `vad_filter=True`? Perhaps it would be more appropriate for a standalone VAD model to determine which voice segments should be disregarded.

The reliability of `log_prob_threshold` is often questioned, as mentioned in this discussion on openai/whisper#29 (comment).
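
To make the question concrete, here is a minimal sketch of how the three thresholds are commonly described as interacting. It is a simplified paraphrase of the checks in the reference Whisper transcription loop, not faster-whisper's actual code. `DecodeStats`, `needs_fallback`, and `should_skip` are hypothetical names used only for illustration, and the defaults (2.4, -1.0, 0.6) are assumed to match the library defaults. The point it illustrates is that passing `None` skips a check entirely rather than resetting it to a default.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class DecodeStats:
    """Per-segment decoding statistics (hypothetical container for this sketch)."""
    compression_ratio: float  # high values suggest repetitive text
    avg_logprob: float        # average token log probability
    no_speech_prob: float     # probability that the window contains no speech


def needs_fallback(
    stats: DecodeStats,
    compression_ratio_threshold: Optional[float] = 2.4,
    log_prob_threshold: Optional[float] = -1.0,
    no_speech_threshold: Optional[float] = 0.6,
) -> bool:
    """Return True if the segment would be re-decoded at a higher temperature."""
    fallback = False
    if (compression_ratio_threshold is not None
            and stats.compression_ratio > compression_ratio_threshold):
        fallback = True  # text looks too repetitive
    if log_prob_threshold is not None and stats.avg_logprob < log_prob_threshold:
        fallback = True  # average log probability is too low
    if no_speech_threshold is not None and stats.no_speech_prob > no_speech_threshold:
        fallback = False  # treated as silence, so re-decoding is pointless
    return fallback


def should_skip(
    stats: DecodeStats,
    log_prob_threshold: Optional[float] = -1.0,
    no_speech_threshold: Optional[float] = 0.6,
) -> bool:
    """Return True if the segment would be dropped as silence."""
    if no_speech_threshold is None:
        return False  # check disabled: nothing is dropped as silence
    skip = stats.no_speech_prob > no_speech_threshold
    if log_prob_threshold is not None and stats.avg_logprob > log_prob_threshold:
        skip = False  # confident text rescues the segment
    return skip


# A segment flagged as likely silence with a low average log probability.
quiet = DecodeStats(compression_ratio=2.6, avg_logprob=-1.2, no_speech_prob=0.7)
# A repetitive but otherwise confident segment.
repetitive = DecodeStats(compression_ratio=2.6, avg_logprob=-0.5, no_speech_prob=0.1)
# A confident segment that still has a high no-speech probability.
confident = DecodeStats(compression_ratio=1.0, avg_logprob=-0.2, no_speech_prob=0.7)

dropped_by_default = should_skip(quiet)
dropped_when_disabled = should_skip(quiet, log_prob_threshold=None, no_speech_threshold=None)
```

Under these assumptions, `quiet` is dropped with the default thresholds and kept once both thresholds are `None`. One side effect worth noting in this sketch: setting only `log_prob_threshold=None` while leaving `no_speech_threshold` active removes the "rescue" branch, so a segment like `confident` would then be dropped rather than kept.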