Quantized Model Performance #859
Replies: 4 comments 3 replies
-
CONTRIBUTEI'm hosting an API with all models @ http://spch.her.st/ ! Use this Script to POST a file to the API Use this Script to measure the accuracy please keep the file duration below 5min, the API is not very fast, especially with the large models it takes quite a while. also the API only processes one file at a time |
Beta Was this translation helpful? Give feedback.
-
No, the model and the samplers are deterministic What is |
Beta Was this translation helpful? Give feedback.
-
I'm still seeing quite considerable usage on my API and the eval script. Please post your results here for others to see. Don't be greedy. Otherwise I'll have to shut it down and you have to clone and deploy the repo yourself. |
Beta Was this translation helpful? Give feedback.
-
Hey @Alumniminium I'm using it rn; Trying out parallel requests to see if it can get done faster. Hopefully it isn't frying your server! Also, will large-v3 end up available? Thank you so much for making the api! Results forthcoming. |
Beta Was this translation helpful? Give feedback.
-
The files were scored after removing all punctuation, [tokens], numbers from 0 to 9 rewritten to zero to nine, and everything lowercased.
Using the following python script WER + Levenshtein on the following (rap music) file The TV Screen.mp3 comparing to the manual transcript
The lowest ranking models just output [music] or music and nothing else. Is there a random seed? There's no parameter to pass to whisper to set a seed like there is with llama as far as i can tell.
George_W_Bush_Columbia_FINAL.ogg
Beta Was this translation helpful? Give feedback.
All reactions