DAC: The Double Actor-Critic Architecture for Learning Options

Zhang, Shangtong; Whiteson, Shimon

arXiv:1904.12691 (cs)

[Submitted on 29 Apr 2019 (v1), last revised 11 Sep 2019 (this version, v7)]

Abstract: We reformulate the option framework as two parallel augmented MDPs. Under this novel formulation, all policy optimization algorithms can be used off the shelf to learn intra-option policies, option termination conditions, and a master policy over options. We apply an actor-critic algorithm on each augmented MDP, yielding the Double Actor-Critic (DAC) architecture. Furthermore, we show that, when state-value functions are used as critics, one critic can be expressed in terms of the other, and hence only one critic is necessary. We conduct an empirical study on challenging robot simulation tasks. In a transfer learning setting, DAC outperforms both its hierarchy-free counterpart and previous gradient-based option learning algorithms.
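
The claim that one critic can be expressed in terms of the other can be illustrated with a small sketch. The abstract does not give the formula, so the code below assumes the standard call-and-return option model: the previous option terminates with probability beta, in which case the master policy picks the next option, and otherwise the previous option continues. Under that assumption, the value of the option-selection decision is the expectation of the per-option values under the resulting option distribution. The function names `high_policy` and `high_value` and all parameters are hypothetical and do not come from the paper.

```python
def high_policy(beta, master_probs, prev_option):
    """Distribution over the next option at one state (assumed call-and-return model).

    beta         -- termination probability of the previous option at this state
    master_probs -- master policy's distribution over options at this state
    prev_option  -- index of the option that was executing
    """
    # With probability beta the previous option terminates and the master policy chooses.
    probs = [beta * p for p in master_probs]
    # With probability 1 - beta the previous option simply continues.
    probs[prev_option] += 1.0 - beta
    return probs


def high_value(beta, master_probs, prev_option, low_values):
    """Option-selection value written as an expectation of per-option values.

    low_values[o] is the value of being at this state while committed to option o,
    so no second, separately learned critic is needed under this assumption.
    """
    probs = high_policy(beta, master_probs, prev_option)
    return sum(p * v for p, v in zip(probs, low_values))


if __name__ == "__main__":
    # Two options, previous option 0 terminates with probability 0.25.
    print(high_policy(0.25, [0.5, 0.5], 0))
    print(high_value(0.25, [0.5, 0.5], 0, [1.0, 3.0]))
```

In this sketch only `low_values` would need to be learned; the option-selection value is derived from it, which is one way to read the abstract's statement that a single critic suffices.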

Comments:	NeurIPS 2019
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1904.12691 [cs.LG]
	(or arXiv:1904.12691v7 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1904.12691

Submission history

From: Shangtong Zhang
[v1] Mon, 29 Apr 2019 14:57:47 UTC (1,283 KB)
[v2] Tue, 30 Apr 2019 08:38:00 UTC (1,283 KB)
[v3] Mon, 6 May 2019 15:23:29 UTC (1,283 KB)
[v4] Thu, 16 May 2019 14:54:53 UTC (1,380 KB)
[v5] Mon, 9 Sep 2019 15:37:19 UTC (1,381 KB)
[v6] Tue, 10 Sep 2019 08:29:53 UTC (1,381 KB)
[v7] Wed, 11 Sep 2019 08:34:22 UTC (1,381 KB)
