Archives and Documentation Center
Digital Archives

Affect analysis of communication texts in Turkish language

Show simple item record

dc.contributor Graduate Program in Computer Engineering.
dc.contributor.advisor Salah, Albert Ali.
dc.contributor.author Oktay, Eda Aydın.
dc.date.accessioned 2023-03-16T10:02:16Z
dc.date.available 2023-03-16T10:02:16Z
dc.date.issued 2016.
dc.identifier.other CMPE 2016 O57
dc.identifier.uri http://digitalarchive.boun.edu.tr/handle/123456789/12309
dc.description.abstract As digital technologies expanded our range of online communication, textual a ect analysis become the focus of considerable interest in several disciplines. While many studies have been conducted in English language, there are only a few tools speci c for the Turkish language. One reason for that is the lack of lexical resources for a ect analysis and annotated corpora for an accurate evaluation. In this thesis, we develop an approach for continuous and dimensional a ect analysis of Turkish communications by combining several tools. We conduct experiments on various text corpora in Turkish including online multi-party chat records, psychotherapy records, Twitter data, movie reviews, and teachers' comments on high school students. Analyzing such texts brings challenges like non-standard word usage, grammatical irregularities, abbreviation usage, and spelling mistakes. We propose several pre-processing steps to deal with these. Then, we adapt an a ective word dictionary from English to Turkish, and by expanding it with synsets, obtain 15,200 words with annotations for valence, arousal, and dominance. We also employ a list of frequently used abbreviations, emoticons, interjections, modi ers (intensi ers and diminishers), and other linguistic indicators to capture the overall a ective state at the sentence level. We recruit and train annotators to obtain a ective ground truth speci cally for multi-party chat records. Our results show that the proposed system is useful, yet there is much room for improvement at di erent stages.
dc.format.extent 30 cm.
dc.publisher Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2016.
dc.subject.lcsh Natural language processing (Computer science)
dc.subject.lcsh Turkish language -- Texts.
dc.title Affect analysis of communication texts in Turkish language
dc.format.pages xiv, 91 leaves ;


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Digital Archive


Browse

My Account