このページは現在英語版のみで提供されており、日本語版も近日中に提供される予定です。ご利用いただきありがとうございます。

Speech-To-Text Configuration

this document mainly introduces how to configure third-party STT in the StartAIConversation API. Currently, we support access to Azure and Deepgram. If you have other needs, please feel free to contact us.

Request Example

STTConfig supports a parameter CustomParam. See the following example to configure third-party STT:
"STTConfig": {
"Language": "zh",
"VadSilenceTime": 1200,
"CustomParam": "{\"STTType\": \"azure\",
\"SubscriptionKey\": \"xxxxx\",
\"Region\": \"chinanorth3\"}",
},
}

Supported Third-Party STT Configurations

Azure STT

{
"STTType": "azure", // required: String STT type
"SubscriptionKey": "xxxxx", // required: String Subscription key
"Region": "chinanorth3" // required: String region
}
Example:
"CustomParam": "{\"STTType\": \"azure\", \"SubscriptionKey\": \"xxxxx\", \"Region\": \"chinanorth3\"}"

Deepgram STT

{
"STTType": "deepgram", // required: String STT type
"ApiKey": "XXXXXX", // required: String Authentication
"Model": "nova-2" // required: String Select STT Model
}
Example:
"CustomParam": "{\"STTType\": \"deepgram\", \"AppKey\": \"xxxxx\", \"Model\": \"nova-2\"}"