The Communication Complexity of Distributed Estimation
AuthorsParikshit Gopalan, Raghu Meka†, Prasad Raghavendra‡, Mihir Singhal‡, Avi Wigderson§
The Communication Complexity of Distributed Estimation
AuthorsParikshit Gopalan, Raghu Meka†, Prasad Raghavendra‡, Mihir Singhal‡, Avi Wigderson§
We study an extension of the standard two-party communication model in which Alice and Bob hold probability distributions and over domains and , respectively. Their goal is to estimate
to within additive error for a bounded function , known to both parties. We refer to this as the distributed estimation problem. Special cases of this problem arise in a variety of areas including sketching, databases and learning. Our goal is to understand how the required communication scales with the communication complexity of and the error parameter .
The random sampling approach — estimating the mean by averaging over random samples — requires total communication, where is the randomized communication complexity of . We design a new debiasing protocol which improves the dependence on to be linear instead of quadratic. Additionally we show better upper bounds for several special classes of functions, including the Equality and Greater-than functions. We introduce lower bound techniques based on spectral methods and discrepancy, and show the optimality of many of our protocols: the debiasing protocol is tight for general functions, and that our protocols for the equality and greater-than functions are also optimal. Furthermore, we show that among full-rank Boolean functions, Equality is essentially the easiest.
Private Vector Mean Estimation in the Shuffle Model: Optimal Rates Require Many Messages
July 8, 2024research area Methods and Algorithms, research area Privacyconference ICML
We study the problem of private vector mean estimation in the shuffle model of privacy where users each have a unit vector in dimensions. We propose a new multi-message protocol that achieves the optimal error using messages per user. Moreover, we show that any (unbiased) protocol that achieves optimal error requires each user to send …
Lower Bounds for Locally Private Estimation via Communication Complexity
May 5, 2019research area Privacyconference COLT
We develop lower bounds for estimation under local privacy constraints—including differential privacy and its relaxations to approximate or Rényi differential privacy—by showing an equivalence between private estimation and communication-restricted estimation problems. Our results apply to arbitrarily interactive privacy mechanisms, and they also give sharp lower bounds for all levels of differential privacy protections, that is, privacy…