dc.contributor |
Graduate Program in Electrical and Electronic Engineering. |
|
dc.contributor.advisor |
Arslan, Levent M. |
|
dc.contributor.author |
Şayli, Ömer. |
|
dc.date.accessioned |
2023-03-16T10:16:44Z |
|
dc.date.available |
2023-03-16T10:16:44Z |
|
dc.date.issued |
2002. |
|
dc.identifier.other |
EE 2002 S28 |
|
dc.identifier.uri |
http://digitalarchive.boun.edu.tr/handle/123456789/12645 |
|
dc.description.abstract |
Naturalness in TTS systems plays a big role in the acceptability of the TTS synthesis outputs. Rhythm, intonation, stress pattern, pitch and duration (timing) are the most important parameters which effect naturalness of the TTS system output. The task of the timing component in a TTS system is to compute duration information for sub-elements which are to be used in synthesis output. Duration modelling is a very challenging part of a TTS system since very little is known about the underlying process responsible for speech timing of humans.To analyze and model duration for Turkish TTS systems, spoken utterances of 1-words and sentences of an adult male are used which are recorded at high digital quality. Firstly, coverage of the Turkish by this spoken text corpus is investigated, which is found to be well enough. Afterwards, analysis of the durations of Turkish phonemes is done. Effects of factors that can be computed from text on the durations are found to determine which of them should be included in the duration models.To model duration, four models have been implemented. First two models use mean durations of the phonemes and mean durations of the triphones. Third model uses mean durations of the nodes of trees for triphones for duration prediction. The last model is an additive model where the effects of factors are found by regression analysis.. |
|
dc.format.extent |
30 cm. + |
|
dc.publisher |
Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2002. |
|
dc.relation |
Includes appendices. |
|
dc.relation |
Includes appendices. |
|
dc.subject.lcsh |
Speech processing systems. |
|
dc.subject.lcsh |
Speech synthesis. |
|
dc.title |
Duration analysis and modelling for Turkish text to speech synthesis |
|
dc.format.pages |
xxi, 122 leaves ; |
|