Probabilistic tensor factorization for link prediction

Ermiş, Beyza.

Archives and Documentation Center Digital Archives Home
→
Boğaziçi Üniversitesi Tezleri
→
Fen Bilimleri Enstitüsü
→
Bilgisayar Mühendisliği
→
M.S. Theses
→
View Item

dc.contributor	Graduate Program in Computer Engineering.
dc.contributor.advisor	Cemgil, Ali Taylan.
dc.contributor.author	Ermiş, Beyza.
dc.date.accessioned	2023-03-16T10:01:12Z
dc.date.available	2023-03-16T10:01:12Z
dc.date.issued	2012.
dc.identifier.other	CMPE 2012 E78
dc.identifier.uri	http://digitalarchive.boun.edu.tr/handle/123456789/12229
dc.description.abstract	Link prediction is the problem of inferring the presence, absence or strength of a link between two entities, based on properties of the other observed links. In the literature, two related types of link prediction problems are considered: (i) missing and (ii) temporal. In both cases, latent variable models have been studied for link prediction tasks that consider link prediction as a noisy matrix and tensor completion problem. By using a low-rank structure of a dataset, it is possible to recover missing entries for matrices and higher-order tensors. In this thesis, we use several approaches based on probabilistic interpretation of tensor factorizations: Probabilistic Latent Tensor Factorization that can realize any arbitrary tensor factorization structure on datasets in the form of single tensor and Generalised Coupled Tensor factorization that can simultaneously fit to higher-order tensors/matrices with common latent factors. We present full Bayesian inference via variational Bayes, then we derive variational inference algorithm for Bayesian coupled tensor factorization to improve the reconstruction over Bayesian factorization of single data tensor and form update equations for these models that handle simultaneous tensor factorizations where multiple observations tensors are available. Previous studies on factorization of heterogeneous data focus on either a single loss function or a speci c tensor model of interest. However, one of the main challenges in analyzing heterogeneous data is to nd the right tensor model and loss function. So, we consider di erent tensor models and loss functions for the link prediction. Numerical experiments on synthetic and real datasets demonstrate that joint analysis of data from multiple sources via coupled factorization and variational Bayes approach improves the link prediction performance and the selection of the right loss function and tensor model is crucial for accurate prediction of unobserved links.
dc.format.extent	30 cm.
dc.publisher	Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2012.
dc.relation	Includes appendices.
dc.relation	Includes appendices.
dc.subject.lcsh	Data mining.
dc.subject.lcsh	Tensor algebra.
dc.title	Probabilistic tensor factorization for link prediction
dc.format.pages	xv, 93 leaves ;