Speech-To-Text Configuration
this document mainly introduces how to configure third-party
STT
in the StartAIConversation API. Currently, we support access to Azure and Deepgram. If you have other needs, please feel free to contact us.Request Example
STTConfig
supports a parameter CustomParam
. See the following example to configure third-party STT
:"STTConfig": {"Language": "zh","VadSilenceTime": 1200,"CustomParam": "{\"STTType\": \"azure\",\"SubscriptionKey\": \"xxxxx\",\"Region\": \"chinanorth3\"}",},}
Supported Third-Party STT Configurations
Azure STT
{"STTType": "azure", // required: String STT type"SubscriptionKey": "xxxxx", // required: String Subscription key"Region": "chinanorth3" // required: String region}
Example:
"CustomParam": "{\"STTType\": \"azure\", \"SubscriptionKey\": \"xxxxx\", \"Region\": \"chinanorth3\"}"
For more details, please refer to: https://learn.microsoft.com/en-us/azure/ai-services/speech-service/speech-to-text.
Deepgram STT
{"STTType": "deepgram", // required: String STT type"ApiKey": "XXXXXX", // required: String Authentication"Model": "nova-2" // required: String Select STT Model}
Example:
"CustomParam": "{\"STTType\": \"deepgram\", \"AppKey\": \"xxxxx\", \"Model\": \"nova-2\"}"