AGREEMENT AMONG MULTIPLE LVCSR MODELS — CORRELATION BETWEEN PAIR OF ACOUSTIC MODELS AND CONFIDENCE —Confidence Measure Study • August 26th, 2021
Contract Type FiledAugust 26th, 2021For many practical applications of speech recognition systems, it is quite desirable to have an estimate of confidence for each hypoth- esized word. Unlike previous works on confidence measures, this paper studies features for confidence measures that are extracted from outputs of more than one LVCSR models. More specifically, this paper experimentally evaluates the agreement among the out- puts of multiple Japanese LVCSR models, with respect to whether it is effective as an estimate of confidence for each hypothesized word. The results of experimental evaluation show that the agree- ment between the outputs with two LVCSR models with differ- ent decoders and acoustic models can achieve quite reliable con- fidence. Furthermore, among various features of acoustic models based on Gaussian mixture HMMs, it is concluded that ones such as whether or not to have short pause models, as well as different units in HMMs (e.g., triphone model or syllable model) are the most effective in achieving