Duration analysis and modelling for Turkish text to speech synthesis

Şayli, Ömer.

Archives and Documentation Center Digital Archives Home
→
Boğaziçi Üniversitesi Tezleri
→
Fen Bilimleri Enstitüsü
→
Elektrik- Elektronik Mühendisliği
→
M.S. Theses
→
View Item

dc.contributor	Graduate Program in Electrical and Electronic Engineering.
dc.contributor.advisor	Arslan, Levent M.
dc.contributor.author	Şayli, Ömer.
dc.date.accessioned	2023-03-16T10:16:44Z
dc.date.available	2023-03-16T10:16:44Z
dc.date.issued	2002.
dc.identifier.other	EE 2002 S28
dc.identifier.uri	http://digitalarchive.boun.edu.tr/handle/123456789/12645
dc.description.abstract	Naturalness in TTS systems plays a big role in the acceptability of the TTS synthesis outputs. Rhythm, intonation, stress pattern, pitch and duration (timing) are the most important parameters which effect naturalness of the TTS system output. The task of the timing component in a TTS system is to compute duration information for sub-elements which are to be used in synthesis output. Duration modelling is a very challenging part of a TTS system since very little is known about the underlying process responsible for speech timing of humans.To analyze and model duration for Turkish TTS systems, spoken utterances of 1-words and sentences of an adult male are used which are recorded at high digital quality. Firstly, coverage of the Turkish by this spoken text corpus is investigated, which is found to be well enough. Afterwards, analysis of the durations of Turkish phonemes is done. Effects of factors that can be computed from text on the durations are found to determine which of them should be included in the duration models.To model duration, four models have been implemented. First two models use mean durations of the phonemes and mean durations of the triphones. Third model uses mean durations of the nodes of trees for triphones for duration prediction. The last model is an additive model where the effects of factors are found by regression analysis..
dc.format.extent	30 cm. +
dc.publisher	Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2002.
dc.relation	Includes appendices.
dc.relation	Includes appendices.
dc.subject.lcsh	Speech processing systems.
dc.subject.lcsh	Speech synthesis.
dc.title	Duration analysis and modelling for Turkish text to speech synthesis
dc.format.pages	xxi, 122 leaves ;