Dynamics-aware Embeddings

Whitney, William; Agarwal, Rajat; Cho, Kyunghyun; Gupta, Abhinav

Computer Science > Machine Learning

arXiv:1908.09357 (cs)

[Submitted on 25 Aug 2019 (v1), last revised 14 Jan 2020 (this version, v3)]

Title:Dynamics-aware Embeddings

Authors:William Whitney, Rajat Agarwal, Kyunghyun Cho, Abhinav Gupta

View PDF

Abstract:In this paper we consider self-supervised representation learning to improve sample efficiency in reinforcement learning (RL). We propose a forward prediction objective for simultaneously learning embeddings of states and action sequences. These embeddings capture the structure of the environment's dynamics, enabling efficient policy learning. We demonstrate that our action embeddings alone improve the sample efficiency and peak performance of model-free RL on control from low-dimensional states. By combining state and action embeddings, we achieve efficient learning of high-quality policies on goal-conditioned continuous control from pixel observations in only 1-2 million environment steps.

Comments:	Published at ICLR 2020
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1908.09357 [cs.LG]
	(or arXiv:1908.09357v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1908.09357

Submission history

From: William Whitney [view email]
[v1] Sun, 25 Aug 2019 16:22:04 UTC (8,050 KB)
[v2] Sun, 1 Sep 2019 13:08:05 UTC (8,050 KB)
[v3] Tue, 14 Jan 2020 14:50:13 UTC (9,402 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-08

Change to browse by:

cs
cs.AI
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

William F. Whitney
Kyunghyun Cho
Abhinav Gupta

export BibTeX citation

Computer Science > Machine Learning

Title:Dynamics-aware Embeddings

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Dynamics-aware Embeddings

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators