A Variational Perturbative Approach to Planning in Graph-based Markov Decision Processes

Linzner, Dominik; Koeppl, Heinz

Computer Science > Machine Learning

arXiv:1912.01849 (cs)

[Submitted on 4 Dec 2019 (v1), last revised 30 Jun 2020 (this version, v2)]

Title:A Variational Perturbative Approach to Planning in Graph-based Markov Decision Processes

Authors:Dominik Linzner, Heinz Koeppl

View PDF

Abstract:Coordinating multiple interacting agents to achieve a common goal is a difficult task with huge applicability. This problem remains hard to solve, even when limiting interactions to be mediated via a static interaction-graph. We present a novel approximate solution method for multi-agent Markov decision problems on graphs, based on variational perturbation theory. We adopt the strategy of planning via inference, which has been explored in various prior works. We employ a non-trivial extension of a novel high-order variational method that allows for approximate inference in large networks and has been shown to surpass the accuracy of existing variational methods. To compare our method to two state-of-the-art methods for multi-agent planning on graphs, we apply the method different standard GMDP problems. We show that in cases, where the goal is encoded as a non-local cost function, our method performs well, while state-of-the-art methods approach the performance of random guess. In a final experiment, we demonstrate that our method brings significant improvement for synchronization tasks.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1912.01849 [cs.LG]
	(or arXiv:1912.01849v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1912.01849

Submission history

From: Dominik Linzner [view email]
[v1] Wed, 4 Dec 2019 08:36:58 UTC (119 KB)
[v2] Tue, 30 Jun 2020 14:43:46 UTC (218 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-12

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Dominik Linzner
Heinz Koeppl

export BibTeX citation

Computer Science > Machine Learning

Title:A Variational Perturbative Approach to Planning in Graph-based Markov Decision Processes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Variational Perturbative Approach to Planning in Graph-based Markov Decision Processes

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators