You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I've created a "real-time" application with chunks of 3 sec using my own small fine-tuned model. It reads the complete audio call recording and generate 3s chunks, but in some cases I get repetition of the same word/s:
The text was updated successfully, but these errors were encountered:
asr-lord
changed the title
[Hallucinations] Repetition of words or chunks with own fine-tuned models
[Hallucinations] Repetition of words or chunks with own fine-tuned model
Sep 3, 2024
I've created a "real-time" application with chunks of 3 sec using my own small fine-tuned model. It reads the complete audio call recording and generate 3s chunks, but in some cases I get repetition of the same word/s:
I've converted the model by the following code:
And run the following code to get transcription:
Output text transcription:
['no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo', 'soy', 'emigante', 'y', 'hasta', 'hoy', 'no', 'porque', 'yo']
Notice that the factor to transcribe chunks is lower in fine-tuned model than original OpenAI model:
*I'm using 4/16 T4-GPU AWS instance
The text was updated successfully, but these errors were encountered: