Archives and Documentation Center
Digital Archives

A knowledge-graph based graph neural network model to identify topics in short texts

Show simple item record

dc.contributor Graduate Program in Computer Engineering.
dc.contributor.advisor Üsküdarlı, Suzan.
dc.contributor.author Güney, Abdullah Atakan.
dc.date.accessioned 2023-10-15T06:43:04Z
dc.date.available 2023-10-15T06:43:04Z
dc.date.issued 2022
dc.identifier.other CMPE 2022 G86
dc.identifier.uri http://digitalarchive.boun.edu.tr/handle/123456789/19695
dc.description.abstract Topic models are probabilistic generative models used to analyze a collection of documents. People have leveraged topic models for many years to extract hidden structures from documents. However, classical topic models such as Latent Dirichlet Allocation (LDA) have issues with short texts typical in user- generated social media content. Due to the limited context of short texts, they fail to learn interpretable topics from extensive vocabularies with the bag of word representations that do not repre sent them well. This thesis proposes a topic model based on Graph Neural Networks (GNN) where documents are represented as graphs with entity-specific relations using Wikidata as a knowledge graph. A graph attention network learns the embeddings of these documents whose outputs are passed to the probabilistic generative topic model Entity Embedded Topic Modeling (EETM) as probability distribution parameters to yield the topics. We evaluate our model with various short text collections fetched from Twitter related to politics, sports, pandemics, and trending news events. We provide a detailed discussion regarding our observations related to the learned embeddings and qualities of topics resulting from our model.
dc.publisher Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2022.
dc.subject.lcsh Latent structure analysis.
dc.subject.lcsh Latent Dirichlet Allocation.
dc.title A knowledge-graph based graph neural network model to identify topics in short texts
dc.format.pages xiv, 91 leaves


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Digital Archive


Browse

My Account