
Speech-To-Text Configuration

This document describes how to configure a third-party STT (speech-to-text) service in the StartAIConversation API. Azure and Deepgram are currently supported. If you need another provider, please contact us.

Request Example

STTConfig accepts a CustomParam parameter, whose value is a JSON object serialized as a string. The following example configures a third-party STT:
{
  "STTConfig": {
    "Language": "zh",
    "VadSilenceTime": 1200,
    "CustomParam": "{\"STTType\": \"azure\", \"SubscriptionKey\": \"xxxxx\", \"Region\": \"chinanorth3\"}"
  }
}
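
Because CustomParam is a JSON object serialized into a string, escaping the inner quotes by hand is easy to get wrong. The following is a minimal Python sketch, assuming the request body is assembled as a dictionary before it is sent. The helper name build_stt_config is hypothetical and not part of the API, and the key value is a placeholder.

```python
import json


def build_stt_config(language, vad_silence_time, custom_param):
    """Return an STTConfig dict whose CustomParam is a JSON-encoded string.

    Hypothetical helper for illustration; not part of the API.
    """
    return {
        "Language": language,
        "VadSilenceTime": vad_silence_time,
        # CustomParam must be a string, so the inner object is serialized first.
        "CustomParam": json.dumps(custom_param),
    }


azure_param = {
    "STTType": "azure",
    "SubscriptionKey": "xxxxx",  # placeholder, replace with your own key
    "Region": "chinanorth3",
}

stt_config = build_stt_config("zh", 1200, azure_param)

# Serializing the outer body escapes the inner quotes automatically.
print(json.dumps({"STTConfig": stt_config}, indent=2))
```

Building the inner object as a dictionary and serializing it with json.dumps avoids unbalanced quotes and trailing commas in the escaped string.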

Supported Third-Party STT Configurations

Azure STT

{
  "STTType": "azure",          // Required. String. STT type.
  "SubscriptionKey": "xxxxx",  // Required. String. Subscription key.
  "Region": "chinanorth3"      // Required. String. Region.
}
Example:
"CustomParam": "{\"STTType\": \"azure\", \"SubscriptionKey\": \"xxxxx\", \"Region\": \"chinanorth3\"}"

Deepgram STT

{
  "STTType": "deepgram",  // Required. String. STT type.
  "ApiKey": "XXXXXX",     // Required. String. API key used for authentication.
  "Model": "nova-2"       // Required. String. STT model to use.
}
Example:
"CustomParam": "{\"STTType\": \"deepgram\", \"ApiKey\": \"xxxxx\", \"Model\": \"nova-2\"}"