AGREEMENT AMONG MULTIPLE LVCSR MODELS — CORRELATION BETWEEN PAIR OF ACOUSTIC MODELS AND CONFIDENCE —Conference Paper • January 9th, 2022
Contract Type FiledJanuary 9th, 2022For many practical applications of speech recognition systems, it is quite desirable to have an estimate of confidence for each hypoth- esized word. Unlike previous works on confidence measures, this paper studies features for confidence measures that are extracted from outputs of more than one LVCSR models. More specifically, this paper experimentally evaluates the agreement among the out- puts of multiple Japanese LVCSR models, with respect to whether it is effective as an estimate of confidence for each hypothesized word. The results of experimental evaluation show that the agree- ment between the outputs with two LVCSR models with differ- ent decoders and acoustic models can achieve quite reliable con- fidence. Furthermore, among various features of acoustic models based on Gaussian mixture HMMs, it is concluded that ones such as whether or not to have short pause models, as well as different units in HMMs (e.g., triphone model or syllable model) are the most effective in achieving