Learning from Conditional Distributions via Dual Embeddings

Dai, Bo; He, Niao; Pan, Yunpeng; Boots, Byron; Song, Le

Computer Science > Machine Learning

arXiv:1607.04579 (cs)

[Submitted on 15 Jul 2016 (v1), last revised 31 Dec 2016 (this version, v2)]

Title:Learning from Conditional Distributions via Dual Embeddings

Authors:Bo Dai, Niao He, Yunpeng Pan, Byron Boots, Le Song

View PDF

Abstract:Many machine learning tasks, such as learning with invariance and policy evaluation in reinforcement learning, can be characterized as problems of learning from conditional distributions. In such problems, each sample $x$ itself is associated with a conditional distribution $p(z|x)$ represented by samples $\{z_i\}_{i=1}^M$, and the goal is to learn a function $f$ that links these conditional distributions to target values $y$. These learning problems become very challenging when we only have limited samples or in the extreme case only one sample from each conditional distribution. Commonly used approaches either assume that $z$ is independent of $x$, or require an overwhelmingly large samples from each conditional distribution.
To address these challenges, we propose a novel approach which employs a new min-max reformulation of the learning from conditional distribution problem. With such new reformulation, we only need to deal with the joint distribution $p(z,x)$. We also design an efficient learning algorithm, Embedding-SGD, and establish theoretical sample complexity for such problems. Finally, our numerical experiments on both synthetic and real-world datasets show that the proposed approach can significantly improve over the existing algorithms.

Comments:	24 pages, 11 figures
Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:1607.04579 [cs.LG]
	(or arXiv:1607.04579v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1607.04579

Submission history

From: Bo Dai [view email]
[v1] Fri, 15 Jul 2016 16:56:22 UTC (888 KB)
[v2] Sat, 31 Dec 2016 06:54:37 UTC (258 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2016-07

Change to browse by:

cs
math
math.OC
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Bo Dai
Niao He
Yunpeng Pan
Byron Boots
Le Song

export BibTeX citation

Computer Science > Machine Learning

Title:Learning from Conditional Distributions via Dual Embeddings

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning from Conditional Distributions via Dual Embeddings

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators