Computer-aided transcription tool

Akman, Çağdaş Kayra.

Archives and Documentation Center Digital Archives Home
→
Boğaziçi Üniversitesi Tezleri
→
Fen Bilimleri Enstitüsü
→
Elektrik- Elektronik Mühendisliği
→
M.S. Theses
→
View Item

dc.contributor	Graduate Program in Electrical and Electronic Engineering.
dc.contributor.advisor	Saraçlar, Murat.
dc.contributor.author	Akman, Çağdaş Kayra.
dc.date.accessioned	2023-03-16T10:16:53Z
dc.date.available	2023-03-16T10:16:53Z
dc.date.issued	2007.
dc.identifier.other	EE 2007 A36
dc.identifier.uri	http://digitalarchive.boun.edu.tr/handle/123456789/12683
dc.description.abstract	State-of-the-art speech recognition and language processing systems widely use data-driven methods. These methods require large transcribed speech and annotated text corpora. The success of these systems greatly depends on the amount of the training data. Need for transcribed speech makes transcription an important component of every system employing statistical methods. Manual transcription is an expensive and slow task. Computers may do the same task much faster but with more errors. Computer Aided Transcription is a combination between these two methods. The output lattices of an ASR engine, which contain hypotheses about the utterances to be transcribed, are transformed into letter-based, deterministic, weighted finite-state acceptors. These transformed lattices are combined with a letter-based N-gram language model trained on a text corpus similar in content to the speech data. The combined model is used as the language model of the open source graphical text entry application Dasher, developed at the University of Cambridge. Lattice expansion methods are used to increase the performance of the combined model. It is shown that combining the models at letter level performs better than a letter-based N-gram model used as the only language model and the model built by combining the transformed lattices and letter-based N-gram model at sentence level.
dc.format.extent	30cm.
dc.publisher	Thesis (M.S.)-Bogazici University. Institute for Graduate Studies in Science and Engineering, 2007.
dc.relation	Includes appendices.
dc.relation	Includes appendices.
dc.subject.lcsh	Computer-aided transcription systems.
dc.subject.lcsh	Speech perception.
dc.title	Computer-aided transcription tool
dc.format.pages	xiii, 75 leaves;