Aspeech corpus (orspoken corpus) is a database of speech audio files and text transcriptions. In speech technology, speech corpora are used, among other things, to create acoustic models (which can then be used with a speech recognitionorspeaker identification engine).[1]Inlinguistics, spoken corpora are used to do research into phonetic, conversation analysis, dialectology and other fields.[2][3]
A corpus is one such database. Corpora is the plural of corpus (i.e. it is many such databases).
There are two types of speech corpora:
A special kind of speech corpora are non-native speech databases that contain speech with a foreign accent.
![]() | This article about a digital library is a stub. You can help Wikipedia by expanding it. |