Voice driven keyword spotter

Çengelci, Onur.

Archives and Documentation Center Digital Archives Home
→
Boğaziçi Üniversitesi Tezleri
→
Fen Bilimleri Enstitüsü
→
Elektrik- Elektronik Mühendisliği
→
M.S. Theses
→
View Item

dc.contributor	Graduate Program in Electrical and Electronic Engineering.
dc.contributor.advisor	Arslan, Levent M.
dc.contributor.advisor	Saraçlar, Murat.
dc.contributor.author	Çengelci, Onur.
dc.date.accessioned	2023-03-16T10:16:48Z
dc.date.available	2023-03-16T10:16:48Z
dc.date.issued	2006.
dc.identifier.other	EE 2006 C46
dc.identifier.uri	http://digitalarchive.boun.edu.tr/handle/123456789/12664
dc.description.abstract	We designed a voice driven keyword spotter. To improve the success of the system, we made use of synthetically generated voice inputs in addition to natural voice inputs and used approximate string matching instead of exact string matching. Classical keyword spotters are mostly text driven. However, we have taken the input in the form of voice. Different people may pronounce the same keyword in different ways because effects such as gender, age, nationality, intonation, accent, emotional mood, environment, noise etc. play an important role on pronunciation. Even the samples of a keyword taken from the same person at different times may be different. Therefore, driving the keyword spotter with voice instead of text provides us with a source of variety. This variety increases the probability of spotting the keyword. Classical keyword spotters are mostly language dependent. In our spotter, many phoneme recognizers trained with different languages may be used in co-operation. We believe that, this ability of our spotter is highly likely to make it language independent. Even if a phoneme recognizer of only one language is used, it will make similar errors for both the input side and the search database side and the system may still have the chance of being language independent to some extent. As we take the input in voice format, we have the chance of collecting many samples of the keyword and producing their appropriate transformations. This ability of our spotter alleviates speaker dependency.
dc.format.extent	30cm.
dc.publisher	Thesis (M.S.)-Bogazici University. Institute for Graduate Studies in Science and Engineering, 2006.
dc.relation	Includes appendices.
dc.relation	Includes appendices.
dc.subject.lcsh	Speech processing systems.
dc.title	Voice driven keyword spotter
dc.format.pages	xv, 80 leaves;