We currently have the following scenario:
When a call enters the flow, the agent asks the customer which city they would like to reach or schedule an appointment in. After the customer responds, the prompt identifies the city mentioned and stores a normalized result in a variable.
At the moment, we use the following normalization logic:
If the identified city is one of the cities served by our network (for example: Porto Alegre, Canoas, Blumenau, Lajeado, Campo Bom, Canela, Caxias do Sul, Erechim, Esteio, Gaspar, Gramado, Gravataí, Jaraguá do Sul, Novo Hamburgo, Rio Grande, or São Leopoldo), the variable is set to cidade_credenciada.
Otherwise, the variable is set to outra.
After that, the flow uses this variable to route the call to the corresponding trunk within our PBX.
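To make the current behavior concrete, here is a rough Python sketch of the rule described above. It is for illustration only: in our flow the classification is done by the prompt, not by code, and the function name `normalize_city` and the exact-match comparison are assumptions made for this sketch.

```python
# Cities served by the network, as listed above.
SERVED_CITIES = {
    "Porto Alegre", "Canoas", "Blumenau", "Lajeado", "Campo Bom", "Canela",
    "Caxias do Sul", "Erechim", "Esteio", "Gaspar", "Gramado", "Gravataí",
    "Jaraguá do Sul", "Novo Hamburgo", "Rio Grande", "São Leopoldo",
}

# Case-insensitive lookup table built once at import time.
_SERVED_FOLDED = {city.casefold() for city in SERVED_CITIES}


def normalize_city(identified_city: str) -> str:
    """Return 'cidade_credenciada' for a served city, otherwise 'outra'.

    Hypothetical helper: it only accepts an exact (case-insensitive) match,
    which mirrors how the variable behaves today.
    """
    if identified_city.strip().casefold() in _SERVED_FOLDED:
        return "cidade_credenciada"
    return "outra"
```

With exact matching like this, a transcription such as "Canoa" does not match "Canoas" and falls through to `outra`.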
The problem we are facing is that on phone calls, speech recognition sometimes returns variations of the city name because of pronunciation or transcription differences. For example:
- Canoas may be transcribed as Canoa
- Canela may be transcribed as Camela
- Esteio may be transcribed as Isteio
- Gravataí may appear with similar phonetic variations in the transcription
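One direction we have considered for variations like those listed above is fuzzy matching against the served-city list, sketched below with Python's standard `difflib` and `unicodedata` modules. This is a minimal sketch, not something we run today: the helper names `match_city` and `normalize_city` and the 0.8 similarity cutoff are assumptions. It also assumes the flow allows a post-processing step on the extracted value, and we do not know whether that is the recommended approach in Retell.

```python
import difflib
import unicodedata

# Cities served by the network, as listed earlier in this post.
SERVED_CITIES = [
    "Porto Alegre", "Canoas", "Blumenau", "Lajeado", "Campo Bom", "Canela",
    "Caxias do Sul", "Erechim", "Esteio", "Gaspar", "Gramado", "Gravataí",
    "Jaraguá do Sul", "Novo Hamburgo", "Rio Grande", "São Leopoldo",
]


def _fold(text: str) -> str:
    """Lowercase and strip accents so 'Gravataí' and 'gravatai' compare equal."""
    decomposed = unicodedata.normalize("NFKD", text.strip().casefold())
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


# Map each folded name back to its canonical spelling.
_FOLDED_TO_CANONICAL = {_fold(city): city for city in SERVED_CITIES}


def match_city(transcript: str, cutoff: float = 0.8):
    """Return the served city closest to the transcript, or None.

    The cutoff is an assumed value; it would need tuning on real call data.
    """
    hits = difflib.get_close_matches(
        _fold(transcript), list(_FOLDED_TO_CANONICAL), n=1, cutoff=cutoff
    )
    return _FOLDED_TO_CANONICAL[hits[0]] if hits else None


def normalize_city(transcript: str) -> str:
    """Same two-value output as today, but tolerant of near-miss spellings."""
    return "cidade_credenciada" if match_city(transcript) else "outra"
```

In this sketch, `match_city("Camela")` resolves to "Canela" and `match_city("Isteio")` to "Esteio", while an unrelated city such as "Curitiba" returns `None` and is still routed as `outra`. A lower cutoff would accept more variations at the cost of more false matches.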
We would like to know if there is any recommended prompt structure or best practices in Retell to handle speech variations or transcription differences for the same city, while keeping the recognition fast and reliable.
Any example of a prompt or architecture that works well for this type of scenario (city identification in phone calls) would be greatly appreciated.



