Pronunciation Assessment
Updated February 13, 2025
Overview
Pronunciation assessment evaluates speech pronunciation and gives speakers feedback on the accuracy and fluency of spoken audio. The ui is as follow.

Simple workflow
Assessment metrics
- AccuracyScore : Pronunciation accuracy of the speech. Accuracy indicates how closely the phonemes match a native speaker's pronunciation. Syllable, word, and full text accuracy scores are aggregated from the phoneme-level accuracy score, and refined with assessment objectives.
- FluencyScore : Fluency of the given speech. Fluency indicates how closely the speech matches a native speaker's use of silent breaks between words.
- CompletenessScore : Completeness of the speech, calculated by the ratio of pronounced words to the input reference text.
- ProsodyScore : Prosody of the given speech. Prosody indicates how natural the given speech is, including stress, intonation, speaking speed, and rhythm.