Arşiv ve Dokümantasyon Merkezi
Dijital Arşivi

Effects of data duration, model size and session variability on speaker verification performance

Basit öğe kaydını göster

dc.contributor Graduate Program in Electrical and Electronic Engineering.
dc.contributor.advisor Saraçlar, Murat.
dc.contributor.author Dikici, Erinç.
dc.date.accessioned 2023-03-16T10:17:12Z
dc.date.available 2023-03-16T10:17:12Z
dc.date.issued 2009.
dc.identifier.other EE 2009 D55
dc.identifier.uri http://digitalarchive.boun.edu.tr/handle/123456789/12723
dc.description.abstract Speaker verification is one of the most challenging branches of biometric authentication. Covering a wide spectrum from security services to law enforcement, speaker veri cation systems are employed in phone banking, forensic audio analysis and access control applications. An important observation is that verification accuracies depend vastly on the amount of data and get easily affected by acoustic variations. This study investigates the effects of data duration, model size and session variability on text-independent speaker verification performance. We implement GMM/UBM and SVM supervector classiffiers to represent speaker characteristics and compare their results for various training and testing durations as well as model complexities. The in uence of speaker adaptation methods and kernel function selection over the verification accuracy is examined. A minority oversampling scheme is utilized in order to avoid the issue of class imbalance in SVMs. We also explore how session variability acts on error rates and resort to Nuisance Attribute Projection method for reducing acoustic mismatches between the training and test samples. Working on the CSLU Speaker Recognition Dataset, we present a comparative evaluation of speaker verification systems with limited and extensive data conditions.
dc.format.extent 30cm.
dc.publisher Thesis (M.S.)-Bogazici University. Institute for Graduate Studies in Science and Engineering, 2009.
dc.subject.lcsh Biometric identification.
dc.subject.lcsh Automatic speech recognition.
dc.subject.lcsh Gaussian processes.
dc.title Effects of data duration, model size and session variability on speaker verification performance
dc.format.pages xvi, 71 leaves;


Bu öğenin dosyaları

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster

Dijital Arşivde Ara


Göz at

Hesabım