What Can We Learn Privately?

Kasiviswanathan, Shiva Prasad; Lee, Homin K.; Nissim, Kobbi; Raskhodnikova, Sofya; Smith, Adam

arXiv:0803.0924 (cs)

[Submitted on 6 Mar 2008 (v1), last revised 19 Feb 2010 (this version, v3)]

Abstract: Learning problems form an important category of computational tasks that generalizes many of the computations researchers apply to large real-life data sets. We ask: what concept classes can be learned privately, namely, by an algorithm whose output does not depend too heavily on any one input or specific training example? More precisely, we investigate learning algorithms that satisfy differential privacy, a notion that provides strong confidentiality guarantees in contexts where aggregate information is released about a database containing sensitive information about individuals. We demonstrate that, ignoring computational constraints, it is possible to privately agnostically learn any concept class using a sample size approximately logarithmic in the cardinality of the concept class. Therefore, almost anything learnable is learnable privately: specifically, if a concept class is learnable by a (non-private) algorithm with polynomial sample complexity and output size, then it can be learned privately using a polynomial number of samples. We also present a computationally efficient private PAC learner for the class of parity functions. Local (or randomized response) algorithms are a practical class of private algorithms that have received extensive investigation. We provide a precise characterization of local private learning algorithms. We show that a concept class is learnable by a local algorithm if and only if it is learnable in the statistical query (SQ) model. Finally, we present a separation between the power of interactive and noninteractive local learning algorithms.
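
Illustration (not from the paper): the abstract relates local, randomized-response algorithms to the statistical query (SQ) model. The sketch below shows one common way a single statistical query could be answered in the local model, assuming each individual holds one labeled example (x, y) and the query is a 0/1 predicate on that example. The function names, the parameter epsilon, and the debiasing step are assumptions chosen for this example; they follow textbook randomized response and are not the construction used in the paper.

```python
import math
import random


def randomized_response(bit, epsilon, rng):
    """Report `bit` truthfully with probability e^eps / (1 + e^eps), else flip it.

    Each individual runs this locally, so the aggregator never sees the true bit.
    """
    p_truth = math.exp(epsilon) / (1.0 + math.exp(epsilon))
    return bit if rng.random() < p_truth else 1 - bit


def estimate_statistical_query(examples, predicate, epsilon, rng):
    """Estimate the fraction of examples satisfying `predicate` from noisy reports.

    `examples` is a list of (x, y) pairs; `predicate(x, y)` returns True or False.
    These names are hypothetical and used only for this sketch.
    """
    p_truth = math.exp(epsilon) / (1.0 + math.exp(epsilon))
    reports = [
        randomized_response(int(predicate(x, y)), epsilon, rng)
        for x, y in examples
    ]
    noisy_mean = sum(reports) / len(reports)
    # E[report] = (1 - p_truth) + q * (2 * p_truth - 1), where q is the true
    # fraction; invert that relation to get an unbiased estimate of q.
    return (noisy_mean - (1.0 - p_truth)) / (2.0 * p_truth - 1.0)


if __name__ == "__main__":
    rng = random.Random(0)
    # Toy data: the label is the parity of x, so half of the labels equal 1.
    data = [(x, x % 2) for x in range(20000)]
    estimate = estimate_statistical_query(data, lambda x, y: y == 1, 1.0, rng)
    print(f"estimated fraction of positive labels: {estimate:.3f}")
```

Each report is a single randomized bit, so smaller values of epsilon give stronger privacy but a noisier estimate: the debiasing step divides by 2 * p_truth - 1, which shrinks toward 0 as epsilon decreases.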

Comments:	35 pages, 2 figures
Subjects:	Machine Learning (cs.LG); Computational Complexity (cs.CC); Cryptography and Security (cs.CR); Databases (cs.DB)
Cite as:	arXiv:0803.0924 [cs.LG]
	(or arXiv:0803.0924v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.0803.0924
Journal reference:	SIAM Journal on Computing 40(3) (2011) 793-826

Submission history

From: Shiva Kasiviswanathan
[v1] Thu, 6 Mar 2008 17:50:07 UTC (466 KB)
[v2] Mon, 14 Apr 2008 16:18:44 UTC (490 KB)
[v3] Fri, 19 Feb 2010 01:47:02 UTC (170 KB)
