Arşiv ve Dokümantasyon Merkezi
Dijital Arşivi

Enhancing relation classification by using shortest dependency paths between entities with pre-trained language models

Basit öğe kaydını göster

dc.contributor Graduate Program in Computer Engineering.
dc.contributor.advisor Güngör, Tunga.
dc.contributor.author Karaevli, Haluk Alper.
dc.date.accessioned 2023-10-15T06:43:05Z
dc.date.available 2023-10-15T06:43:05Z
dc.date.issued 2022
dc.identifier.other CMPE 2022 K38
dc.identifier.uri http://digitalarchive.boun.edu.tr/handle/123456789/19699
dc.description.abstract Relation Extraction (RE) is the task of finding the relation between entities from a plain text. As the length of the text increases, finding the relation becomes more challenging. The shortest dependency path (SDP) between two entities, obtained by traversing the terms in the text’s dependency tree, provides a view focused on the entities by pruning noisy words. In RE’s supervised form Relation Classification, the state-of-the-art methods generally integrate a pre-trained language model (PLM) into their approaches. However, none of them incorporates the shortest dependency paths into their calculations to our knowledge. In this thesis, we investigate the effects of using shortest dependency paths with pre-trained language models by taking the R-BERT relation classification model as our baseline and building upon it. Our novel approach enhances the baseline model by adding the sequence representation of the shortest dependency path between entities, collected from PLMs, as an additional embedding. In experiments, we have evaluated the proposed model’s performance for each combination of SDPs generated from Stan ford, HPSG, LAL dependency parsers, and baseline with BERT and XLNet PLMs in two datasets, SemEval-2010 Task 8 and TACRED. We improve the baseline model by absolute 1.41% and 3.6% scores, increasing the rankings of the model from 8th to 7th and 18th to 7th in SemEval-2010 Task 8 and TACRED, respectively.
dc.publisher Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2022.
dc.subject.lcsh Neural networks (Computer science)
dc.title Enhancing relation classification by using shortest dependency paths between entities with pre-trained language models
dc.format.pages xiii, 67 leaves


Bu öğenin dosyaları

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster

Dijital Arşivde Ara


Göz at

Hesabım