Differentially Private Contextual Linear Bandits

Shariff, Roshan; Sheffet, Or

Computer Science > Machine Learning

arXiv:1810.00068 (cs)

[Submitted on 28 Sep 2018]

Title:Differentially Private Contextual Linear Bandits

Authors:Roshan Shariff, Or Sheffet

View PDF

Abstract:We study the contextual linear bandit problem, a version of the standard stochastic multi-armed bandit (MAB) problem where a learner sequentially selects actions to maximize a reward which depends also on a user provided per-round context. Though the context is chosen arbitrarily or adversarially, the reward is assumed to be a stochastic function of a feature vector that encodes the context and selected action. Our goal is to devise private learners for the contextual linear bandit problem.
We first show that using the standard definition of differential privacy results in linear regret. So instead, we adopt the notion of joint differential privacy, where we assume that the action chosen on day $t$ is only revealed to user $t$ and thus needn't be kept private that day, only on following days. We give a general scheme converting the classic linear-UCB algorithm into a joint differentially private algorithm using the tree-based algorithm. We then apply either Gaussian noise or Wishart noise to achieve joint-differentially private algorithms and bound the resulting algorithms' regrets. In addition, we give the first lower bound on the additional regret any private algorithms for the MAB problem must incur.

Comments:	21 pages, 5 figures; to appear in NIPS 2018
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1810.00068 [cs.LG]
	(or arXiv:1810.00068v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.00068

Submission history

From: Roshan Shariff [view email]
[v1] Fri, 28 Sep 2018 20:04:25 UTC (5,152 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2018-10

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Roshan Shariff
Or Sheffet

export BibTeX citation

Computer Science > Machine Learning

Title:Differentially Private Contextual Linear Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Differentially Private Contextual Linear Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators