Basit öğe kaydını göster

dc.contributor Graduate Program in Computer Engineering.
dc.contributor.advisor Say, Ahmet Celal Cem.
dc.contributor.author Jafarov, Jafar.
dc.date.accessioned 2023-03-16T10:02:21Z
dc.date.available 2023-03-16T10:02:21Z
dc.date.issued 2016.
dc.identifier.other CMPE 2016 J34
dc.identifier.uri http://digitalarchive.boun.edu.tr/handle/123456789/12315
dc.description.abstract We investigate the query complexity of additively estimating entropy of a discrete probability distribution in two settings. Let p be an unknown probability distribution on [n] := f1; 2; : : : ng, and de ne two kinds of queries: A SAMP query takes no input and returns x 2 [n] with probability p[x]; a PMF query takes as input x 2 [n] and returns the value p[x]. In the SAMP model of query complexity, the only allowed interaction with p is via SAMP queries. In the SAMP+PMF model, both SAMP and PMF queries are utilized to interact with p. In particular, we consider the task of estimating the entropy of p to within (with high probability). For the usual Shannon entropy H(p), we review the matching upper and lower bounds established by Valiant and Valiant in the SAMP model, and describe the algorithm constructed by Canonne and Rubinfeld in the SAMP+PMF model. For the R enyi entropy H (p), we analyze three di erent matching upper and lower bound pairs introduced by Acharya et al. in the SAMP model. We show that (log2 n= 2) queries are necessary to estimate the Shannon entropy H(p) in the SAMP+PMF model, matching a recent upper bound of Canonne and Rubinfeld. In addition, we prove that {u100000} n1{u100000}1= queries are necessary and su cient to estimate the R enyi entropy H (p) in the SAMP+PMF model, where > 1. This complements recent work of Acharya et al. in the SAMP model that showed O(n1{u100000}1= ) queries su ce when is an integer, but roughly n queries are necessary when is a noninteger. All of our lower bounds extend to the SAMP+CDF model, where SAMP and CDF queries (given x, return P y x p[y]) are allowed. We give a matching lower bound on estimating the support size (the number of domain elements with nonzero probability) of an unknown distribution p in the SAMP+CDF model. Lastly, we present an upper bound on additively estimating Tsallis entropy in the SAMP+PMF model.
dc.format.extent 30 cm.
dc.publisher Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2016.
dc.subject.lcsh Entropy.
dc.subject.lcsh Querying (Computer science)
dc.title The query complexity of estimating entropy
dc.format.pages x, 61 leaves ;


Bu öğenin dosyaları

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster

Dijital Arşivde Ara


Göz at

Hesabım