Archives and Documentation Center
Digital Archives

Neural named entity recognition for morphologically rich languages

Show simple item record

dc.contributor Ph.D. Program in Computer Engineering.
dc.contributor.advisor Güngör, Tunga.
dc.contributor.advisor Üsküdarlı, Suzan.
dc.contributor.author Güngör, Onur.
dc.date.accessioned 2023-03-16T10:13:29Z
dc.date.available 2023-03-16T10:13:29Z
dc.date.issued 2021.
dc.identifier.other CMPE 2021 G86 PhD
dc.identifier.uri http://digitalarchive.boun.edu.tr/handle/123456789/12540
dc.description.abstract Named entity recognition (NER) is an important task in natural language pro cessing (NLP). Until the revival of neural network based models for NLP, NER taggers employed traditional machine learning approaches or finite-state transducers to detect the entities in a given sentence. Neural models improved the state-of-the-art perfor mance with sequence-based models and word embeddings. These approaches neglect the morphological information embedded in the surface forms of the words. In this thesis, we introduce two NER taggers that utilize such information, which we show to be significant for morphologically rich languages. Using these taggers, we improve the state-of-the-art performance levels for Turkish, Czech, Hungarian, Finnish, and Spanish. The ablation studies show that these improvements result from the inclusion of morphological information. We also show that it is possible for the neural network to also learn how to disambiguate morphological analyses, thereby, eliminating the de pendence on external morphological disambiguators that are not always available. In the second part of this thesis, we propose a model agnostic approach for explaining any sequence-based NLP task by extending a well-known feature-attribution method. We assess the plausibility of the explanations for our NER tagger for Turkish and Finnish through several novel experiments.
dc.format.extent 30 cm.
dc.publisher Thesis (Ph.D.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2021.
dc.subject.lcsh Natural language processing (Computer science)
dc.title Neural named entity recognition for morphologically rich languages
dc.format.pages xvii, 125 leaves ;


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Digital Archive


Browse

My Account