Speech To Text

Quality Assessment and Analysis (Part 2)

Download Whitepaper

Whitepaper: Speech to Text Quality Assessment and Analysis - Part 2

Macrosoft has completed a two-part research project comparing some of the leading speech to text tools available in the marketplace. This paper is the second of the series on speech to text quality assessment and analysis. Similar to part one, we fed conversational data from a contact center to the same three leading platform

  • CallMiner
  • GCP (Google Cloud Platform)
  • AWS (Amazon Web Services)

In part one, the focus was the speech to text quality assessment for stereo high fidelity audio recordings. This time we choose to work with mono low fidelity audio. The source audio in this case is stereo mp3 format with 8000 Hz sampling frequency at 32 kbps bitrates, which is at the low end of contact center recording quality. The evaluation metric we use is the BLEU (Bilingual Evaluation Understudy) score, which is the same metric we used in the prior assessment.

Download the Whitepaper to learn more on Speech to Text Quality Assessment and Analysis research on the above top three providers.