Archives and Documentation Center
Digital Archives

Duration analysis and modelling for Turkish text to speech synthesis

Show simple item record

dc.contributor Graduate Program in Electrical and Electronic Engineering.
dc.contributor.advisor Arslan, Levent M.
dc.contributor.author Şayli, Ömer.
dc.date.accessioned 2023-03-16T10:16:44Z
dc.date.available 2023-03-16T10:16:44Z
dc.date.issued 2002.
dc.identifier.other EE 2002 S28
dc.identifier.uri http://digitalarchive.boun.edu.tr/handle/123456789/12645
dc.description.abstract Naturalness in TTS systems plays a big role in the acceptability of the TTS synthesis outputs. Rhythm, intonation, stress pattern, pitch and duration (timing) are the most important parameters which effect naturalness of the TTS system output. The task of the timing component in a TTS system is to compute duration information for sub-elements which are to be used in synthesis output. Duration modelling is a very challenging part of a TTS system since very little is known about the underlying process responsible for speech timing of humans.To analyze and model duration for Turkish TTS systems, spoken utterances of 1-words and sentences of an adult male are used which are recorded at high digital quality. Firstly, coverage of the Turkish by this spoken text corpus is investigated, which is found to be well enough. Afterwards, analysis of the durations of Turkish phonemes is done. Effects of factors that can be computed from text on the durations are found to determine which of them should be included in the duration models.To model duration, four models have been implemented. First two models use mean durations of the phonemes and mean durations of the triphones. Third model uses mean durations of the nodes of trees for triphones for duration prediction. The last model is an additive model where the effects of factors are found by regression analysis..
dc.format.extent 30 cm. +
dc.publisher Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2002.
dc.relation Includes appendices.
dc.relation Includes appendices.
dc.subject.lcsh Speech processing systems.
dc.subject.lcsh Speech synthesis.
dc.title Duration analysis and modelling for Turkish text to speech synthesis
dc.format.pages xxi, 122 leaves ;


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Digital Archive


Browse

My Account