The focus of the paper is the eval- uation of inter-labeler reliability on broad phonetic transcriptions when la- belers do not necessarily know the lan- guage they are labeling. We pro- vide an analysis of label disagreements, presenting results from...Multi-Language Speech Database Creation and Phonetic Labeling Agreement • November 7th, 2007
Contract Type FiledNovember 7th, 2007This paper describes research on a large multi-language speech database being collected at the Oregon Graduate Institute (OGI). The Center for Spo- ken Language Understanding (CSLU) at OGI has been developing multi- language telephone speech corpora for the last 5 years. An earlier corpus [1] contained data from 11 languages with