Context
We run voice agents for hotels with a large number of international guests, so callers may speak many different languages. We currently use the language: “multi” setting so a single agent can handle whatever language the caller uses.
What we’d like help with
- Multi-language support going forward. We’d like to keep using multi, or have you suggest a better approach for letting one agent handle as wide a range of caller languages as possible. Restricting to a small fixed language set isn’t a good fit for our use case.
- Reported increase in wrong-language transcription. Over the last week, people testing our agents in Japanese have noticed an increase in speech being transcribed into the wrong language. We haven’t yet been able to isolate a clean-audio reproduction, so we’re raising this as an observed trend rather than a confirmed bug —but we’d like to know whether anything changed in the last week or so on the STT/ASR or language-detection side for multi agents.
- Recommended architecture & settings. What agent architecture and settings do you recommend to support a wide range of caller languages while keeping language handling stable well into a call (i.e. it shouldn’t lock onto an incorrect language mid-conversation and fail to recover)? For example: transcription mode, ASR provider choice, multi-agent/language-routing patterns, or any configuration that improves robustness without restricting the supported languages.
Our setup
- language: “multi” on all agents
- Transcription mode / ASR provider: using Retell defaults (not explicitly configured