Azure
Microsoft Azure Speech Services provides enterprise-grade speech recognition for more than 100 languages and variants. Its strengths are customizable speech models, keyword recognition, and compliance-ready deployments. It is a strong choice if you are already in the Azure ecosystem or need broad multilingual coverage with enterprise SLAs.
Usage
To use Azure as the STT engine, pass the following JSON in the STTConfig field of the StartAIConversation API:
{
  "Language": "en",
  "VadSilenceTime": 1000,
  "CustomParam": "{\"STTType\":\"azure\",\"SubscriptionKey\":\"<your_azure_subscription_key>\",\"Region\":\"eastus\"}"
}
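Because CustomParam is a JSON string nested inside the STTConfig JSON, hand-escaping the inner quotes is easy to get wrong. The following is a minimal sketch in Python that builds the same payload by serializing the inner object separately. The helper name `build_azure_stt_config` is hypothetical, and the sketch only constructs the STTConfig value; it does not call StartAIConversation.

```python
import json


def build_azure_stt_config(subscription_key, region, language="en", vad_silence_time=1000):
    """Build an STTConfig dict for Azure (hypothetical helper, not part of any SDK)."""
    # CustomParam must be a JSON *string*, so the inner object is serialized on its own.
    custom_param = json.dumps(
        {
            "STTType": "azure",  # fixed value for Azure
            "SubscriptionKey": subscription_key,
            "Region": region,
        },
        separators=(",", ":"),
    )
    return {
        "Language": language,
        "VadSilenceTime": vad_silence_time,
        "CustomParam": custom_param,
    }


if __name__ == "__main__":
    stt_config = build_azure_stt_config("<your_azure_subscription_key>", "eastus")
    # Serializing the outer dict escapes the inner quotes automatically.
    print(json.dumps(stt_config, indent=2))
```

Serializing in two steps keeps the escaping correct even if the key or region contains characters that would need escaping in JSON.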
Parameter reference
STTConfig fields
Field | Type | Required | Description |
Language | String | No | |
VadSilenceTime | Integer | No | |
CustomParam fields
CustomParam is not part of the standard STTConfig fields. It is required only when using a third-party STT engine, and carries the service provider's authentication parameters. Its value is a JSON-encoded string, as in the usage example above.
Field | Type | Required | Description |
STTType | String | Yes | Fixed value: "azure". |
SubscriptionKey | String | Yes | |
Region | String | Yes | |
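Since all three CustomParam fields are required, it can help to check the string before sending a request. The following is a minimal sketch in Python under the assumptions stated in the table above; the function name `check_azure_custom_param` and its messages are hypothetical, and it checks only presence and the fixed STTType value, not whether the key or region is actually valid for Azure.

```python
import json
from typing import List

# Required fields, as listed in the CustomParam table.
REQUIRED_CUSTOM_PARAM_FIELDS = ("STTType", "SubscriptionKey", "Region")


def check_azure_custom_param(custom_param: str) -> List[str]:
    """Return a list of problems; an empty list means the string looks well-formed."""
    try:
        params = json.loads(custom_param)
    except json.JSONDecodeError as exc:
        return [f"CustomParam is not valid JSON: {exc.msg}"]
    if not isinstance(params, dict):
        return ["CustomParam must encode a JSON object"]

    problems = []
    for name in REQUIRED_CUSTOM_PARAM_FIELDS:
        value = params.get(name)
        # Each required field must be a non-empty string.
        if not isinstance(value, str) or not value:
            problems.append(f"missing or empty required field: {name}")

    stt_type = params.get("STTType")
    if isinstance(stt_type, str) and stt_type and stt_type != "azure":
        problems.append('STTType must be "azure"')
    return problems
```

For example, passing a string that omits Region returns a single "missing or empty required field: Region" entry, while a complete Azure CustomParam returns an empty list.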