Transcript
VoiceBase employs various methods to affect and improve the text transcript returned by the speech engine:
- Stereo - separates the transcript by speaker turn, which enables analytics on a per-speaker basis. (Note: the recording must be multi- or dual-channel; otherwise the API rejects the request.)
- Custom Vocabulary - biases the VoiceBase speech engine toward transcribing the unique words in a vocabulary list. The most common use cases are proper nouns, company and product names, and first and last names.
- Formatting and Punctuation - improves the readability of the text transcript by formatting numerical digits and inserting punctuation.
- Priority - affects the turnaround time for processing the submitted recording.
- Swear Word Filter - filters and replaces swear words in the text transcript.
- Transcoding - converts the submitted audio file to a different audio codec. This is mainly used to convert an audio file to a codec that the VoiceBase Player can play.
- Voicemail - applies a short-form speech model to improve transcript accuracy for calls under 30 seconds.
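As a rough illustration of how these options might be combined for a single request, the sketch below assembles a settings dictionary and enforces the two constraints stated above: stereo requires a multi-channel recording, and the voicemail model is meant for calls under 30 seconds. The function name `build_transcript_settings` and every key in the dictionary are hypothetical placeholders chosen for this example, not the field names of the VoiceBase API; consult the API reference for the real configuration schema.

```python
import json


def build_transcript_settings(channels, duration_seconds, stereo=False,
                              terms=(), filter_swear_words=False,
                              priority="normal"):
    """Assemble a settings dict for the features listed above.

    All keys are hypothetical placeholders, not real VoiceBase field names.
    """
    settings = {"priority": priority}

    if stereo:
        # Mirror the documented rule: a mono recording cannot be split
        # by speaker, so fail early instead of letting the API reject it.
        if channels < 2:
            raise ValueError("stereo requires a multi/dual-channel recording")
        settings["stereo"] = True

    if terms:
        # Custom vocabulary: proper nouns, company/product names, etc.
        settings["vocabulary"] = [{"term": t} for t in terms]

    if filter_swear_words:
        settings["swearWordFilter"] = True

    # The short-form model is intended for calls under 30 seconds.
    if duration_seconds < 30:
        settings["voicemail"] = True

    return settings


if __name__ == "__main__":
    example = build_transcript_settings(
        channels=2, duration_seconds=12.5, stereo=True, terms=["VoiceBase"]
    )
    print(json.dumps(example, indent=2))
```

Validating the channel count on the client side is a design choice for this sketch: it surfaces the stereo restriction before any upload happens rather than after the request is rejected.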