Short Spanish words not understood correctly. Nova-2 multi-language. #1027
Replies: 5 comments 2 replies
-
Thanks for asking your question. Please be sure to reply with as much detail as possible so the community can assist you efficiently.
-
Hey there! It looks like you haven't connected your GitHub account to your Deepgram account. You can do this at https://community.deepgram.com. Being verified through this process will allow our team to help you in a much more streamlined fashion.
-
It looks like we're missing some important information to help debug your issue. Would you mind providing us with the following details in a reply?
-
Hello, I noticed you are using the Do you have both English and Spanish audio to transcribe in the same source? If you just have Spanish audio you'll likely get better results just using the Spanish model for nova-2 so instead of:
-
Understood. This is one of our newer models, and it is still being improved in terms of language detection, since it detects multiple languages. The quality of the audio, background noise, and the type of audio you are passing us (mp3, flac, aac, etc.) can all play a part in the transcription process as well. You could try boosting words that are being missed with our Keywords feature: https://developers.deepgram.com/docs/keywords Smart Formatting can also help in some cases: But as for the model's performance on these common Spanish words, I can share this feedback with our Model Research Team.
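The suggestions in this thread (a Spanish-only language setting, keyword boosting, and Smart Formatting) all come down to query parameters on the transcription request. The snippet below is only a minimal sketch of how such a request URL might be assembled, not an official example. The helper name `build_listen_url` is hypothetical, the boost value of 2 is an illustrative guess rather than a recommended setting, and the parameter spellings (`model`, `language`, `keywords=WORD:BOOST`, `smart_format`) are assumed from the Deepgram docs linked above, so check them against the current documentation before relying on them.

```python
from urllib.parse import parse_qs, urlencode, urlparse

# Assumed pre-recorded transcription endpoint; verify against the Deepgram docs.
BASE_URL = "https://api.deepgram.com/v1/listen"


def build_listen_url(language="es", keywords=None, smart_format=True):
    """Hypothetical helper: assemble a nova-2 request URL.

    language: "es" for Spanish-only audio; "multi" is assumed to be the
              value for mixed English/Spanish audio.
    keywords: dict mapping a word to an illustrative boost value.
    """
    params = [("model", "nova-2"), ("language", language)]
    if smart_format:
        params.append(("smart_format", "true"))
    # Each boosted word becomes its own repeated keywords=WORD:BOOST parameter.
    for word, boost in (keywords or {}).items():
        params.append(("keywords", f"{word}:{boost}"))
    return f"{BASE_URL}?{urlencode(params)}"


# Boost the short words reported as missed in this thread.
url = build_listen_url(keywords={"si": 2, "bien": 2, "alo": 2})
print(url)

# Decode the query string again to confirm what would be sent.
print(parse_qs(urlparse(url).query))
```

The URL would then be used with an `Authorization: Token <your API key>` header and the audio as the request body. Whether boosting actually recovers very short words such as "si" is something to test on your own audio.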
-
It is very easy to reproduce this error, where short Spanish words like "Alo", "Si" and "Bien" are either not heard at all by the engine or understood as English words.
The word "Si" is often not heard at all. Even if I pair it with more words, it does not register. For example, "si senor" will be understood as "senor".
"Bien" and "Alo" are often interpreted as "Yeah".
"uuid": "62769322-d9dc-4ad8-b83b-084051b9d7ab"
"uuid": "7c67b1e3-be51-4337-bca8-ce18de095635"