By default, audio is sent all at once as a one-shot delivery. Transfer-EncodingĪn optional value of chunked that causes the audio to be streamed to the service. By default (false), timestamps are not returned. timestampsĪn optional boolean that indicates whether the service produces timestamps for the words of the transcript. By default (false), speaker labels are not returned. ![]() speaker_labelsĪn optional boolean that indicates whether the service identifies which individuals spoke which words in a multi-participant exchange. By default (false), smart formatting is not performed. smart_formattingĪn optional boolean that indicates whether the service converts dates, times, numbers, currency, and similar values into more conventional representations in the final transcript. By default (true), profanity is filtered from the transcript. profanity_filterĪn optional boolean that indicates whether the service censors profanity from a transcript. By default, the en-US_BroadbandModel model is used. modelĪn optional model that specifies the language in which the audio is spoken and the rate at which it was sampled, broadband or narrowband. By default, the service returns a single final hypothesis. max_alternativesĪn optional integer that specifies the maximum number of alternative hypotheses that the service returns. By default, keyword spotting is not performed. keywords_thresholdĪn optional double between 0.0 and 1.0 that indicates the minimum threshold for a positive keyword match. keywordsĪn optional array of keyword strings that the service spots in the input audio. By default (false), interim results are not returned. interim_resultsĪn optional boolean that directs the service to return intermediate hypotheses that are likely to change before the final transcript. inactivity_timeoutĪn optional integer that specifies the number of seconds for the service's inactivity timeout use -1 to indicate infinity. The default is 0.3 unless a different weight was specified when the custom language model was trained. customization_weightĪn optional double between 0.0 and 1.0 that indicates the relative weight that the service gives to words from a custom language model compared to those from the base vocabulary. customization_idĪn optional customization ID for a custom language model that includes terminology from your domain. Speech to Text Parameters acoustic_customization_idĪn optional customization ID for a custom acoustic model that is adapted for the acoustic characteristics of your environment and speakers. For more information about application development with the service, see Overview for developers. SDKs are also available that simplify using the service's interfaces in various programming languages. ![]() A customization interface that lets you expand the vocabulary of a base model with domain-specific terminology or adapt a base model for the acoustic characteristics of your audio. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |