Archives and Documentation Center
Digital Archives

Mutual information based feature selection for acoustic autism diagnosis

Show simple item record

dc.contributor Graduate Program in Computer Engineering.
dc.contributor.advisor Gürgen, S. Fikret.
dc.contributor.author Yüzsever, Şefika.
dc.date.accessioned 2023-03-16T10:02:06Z
dc.date.available 2023-03-16T10:02:06Z
dc.date.issued 2015.
dc.identifier.other CMPE 2015 Y88
dc.identifier.uri http://digitalarchive.boun.edu.tr/handle/123456789/12294
dc.description.abstract Pervasive Developmental Disorders (PDD) are known to a ect children's social interactions and mental development. Prosodic and linguistic cues can be used to diagnose the disorders at early ages. Computational paralinguistics can be applied for tele-monitoring and/or educating the children with PDD. For better understanding the disorders, a small subset of highly informative features is needed. From machine learning perspective, feature selection (FS) is an important step for generalization ability of the learner and drawing inferences about the underlying problems. Since, the high dimensional data are vulnerable to comprise redundant and irrelevant features. The most popular FS methods depend on Mutual Information (MI), that resort to discretization of features. Though the e ect of di erent discretization schemes are studied in literature, to the best of our knowledge the e ect of di erent number of bins for equal width z-score discretization is not studied for MI based FS. Since MI computation depends on the number of discrete categories, we hypothesize that the feature ranking and therefore performance trajectory also changes. We carry out extensive experiments using eight MI based FS methods on the INTERSPEECH 2013 Autism sub-challenge corpus. The comparative results verify our hypothesis and lead to interesting remarks for future studies. Also in this thesis, adjustment for chance factor is proposed for normalizing MI measures, therefore obtaining a new MI based FS criterion. Finally, we choose the candidate ranked features by considering the e ect of discretization, and achieve 70.68% Unweighted Average Recall (UAR) performance on the test set using only 2% of the feature set. This result advances state-of-the-art performance on the test set adhering to the challenge protocol.
dc.format.extent 30 cm.
dc.publisher Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2015.
dc.subject.lcsh Autism spectrum disorders.
dc.title Mutual information based feature selection for acoustic autism diagnosis
dc.format.pages xii, 52 leaves ;


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Digital Archive


Browse

My Account