Reinforcement learning

Applied Filters

People

Publications

Conferences

Publication Date

20 Results for: Book/Issue: AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,765,092 records)|Limit your search to The ACM Full-Text Collection (758,133 records)

Showing 1 - 20of20 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
May 2024
A Survey of Multi-Agent Deep Reinforcement Learning with Communication
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2845–2847

Communication is an effective mechanism for coordinating the behaviors of multiple agents, broadening their views of the environment, and to support their collaborations. In the field of multi-agent deep reinforcement learning (MADRL), agents can improve ...
0
32
Metrics
Total Citations0
Total Downloads32
Last 12 Months32
Last 6 weeks8
Get Access
research-article
May 2024
pgeon applied to Overcooked-AI to explain agents' behaviour
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2821–2823

Policy Graphs (PGs) are a method for representing the behaviour of opaque agents by observing them in the environment and producing graphs where the state and action spaces are discretised into predicates. We present pgeon, a Python library that ...
0
15
Metrics
Total Citations0
Total Downloads15
Last 12 Months15
Last 6 weeks1
Get Access
research-article
May 2024
Toward Explainable Agent Behaviour
- Victor Gimenez-Abalos
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2740–2742

Agents are a special kind of AI-based software in that they interact in complex environments and have increased potential for emergent behaviour, even in isolation. Explaining such behaviour is key to deploying trustworthy AI, but the increasing ...
0
20
Metrics
Total Citations0
Total Downloads20
Last 12 Months20
Last 6 weeks2
Get Access
extended-abstract
May 2024
Decision Market Based Learning for Multi-agent Contextual Bandit Problems
- Wenlong Wang,
- Thomas Pfeiffer
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2549–2551

Information is often stored in a distributed and proprietary form, and agents who own this information are often self-interested and require incentives to reveal it. Suitable mechanisms are required to elicit and aggregate such distributed information ...
0
6
Metrics
Total Citations0
Total Downloads6
Last 12 Months6
Last 6 weeks1
Get Access
extended-abstract
May 2024
Unifying Regret and State-Action Space Coverage for Effective Unsupervised Environment Design
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2507–2509

Unsupervised Environment Design (UED) employs interactive training between a teacher agent and a student agent to train generally-capable student agents. Existing UED methods primarily rely on regret to progressively introduce curriculum complexity for ...
0
14
Metrics
Total Citations0
Total Downloads14
Last 12 Months14
Last 6 weeks0
Get Access
extended-abstract
May 2024
Neurological Based Timing Mechanism for Reinforcement Learning
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2504–2506

The inherently time-dependent dynamics which underly the neuronal spiking communication, are ubquitous throughout brain, and yet are not fully understood. Likewise time-based mechanisms are underdeveloped in the field of Machine and Reinforcement ...
0
6
Metrics
Total Citations0
Total Downloads6
Last 12 Months6
Last 6 weeks2
Get Access
extended-abstract
May 2024
GOV-REK: Governed Reward Engineering Kernels for Designing Robust Multi-Agent Reinforcement Learning Systems
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2429–2431

For multi-agent reinforcement learning (MARL) systems, the problem task often involves massive problem-specific reward engineering effort. This effort is usually not directly transferable to other problems; worse, this problem is further exacerbated for ...
0
10
Metrics
Total Citations0
Total Downloads10
Last 12 Months10
Last 6 weeks0
Get Access
extended-abstract
May 2024
Emergent Dominance Hierarchies in Reinforcement Learning Agents
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2426–2428

Modern Reinforcement Learning (RL) algorithms are able to outperform humans in a wide variety of tasks. Multi-agent reinforcement learning (MARL) settings present additional challenges around cooperation in mixed-motive groups. Social conventions and ...
0
9
Metrics
Total Citations0
Total Downloads9
Last 12 Months9
Last 6 weeks3
Get Access
extended-abstract
May 2024
Time-Constrained Restless Multi-Armed Bandits with Applications to City Service Scheduling
- Yi Mao,
- Andrew Perrault
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2375–2377

Municipalities maintain critical infrastructure through inspections, both proactive and in response to complaints. For example, the Chicago Department of Public Health (CDPH) periodically inspects 7000 food establishments to maintain the safety of food ...
0
11
Metrics
Total Citations0
Total Downloads11
Last 12 Months11
Last 6 weeks2
Get Access
extended-abstract
May 2024
ELA: Exploited Level Augmentation for Offline Learning in Zero-Sum Games
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2357–2359

Offline learning derives effective policies from expert demonstrators' datasets without direct interaction. While recent research consider dataset characteristics like expertise level or multiple demonstrators, a distinct approach is necessary in zero-...
0
9
Metrics
Total Citations0
Total Downloads9
Last 12 Months9
Last 6 weeks4
Get Access
extended-abstract
May 2024
Addressing Permutation Challenges in Multi-Agent Reinforcement Learning
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 2303–2305

In Reinforcement Learning, deep neural networks play a crucial role, especially in Multi-Agent Systems. Owing to information from multiple sources, the challenge lies in handling input permutations efficiently, causing sample inefficiency and delayed ...
0
21
Metrics
Total Citations0
Total Downloads21
Last 12 Months21
Last 6 weeks2
Get Access
research-article
May 2024
Emergent Cooperation under Uncertain Incentive Alignment
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 1521–1530

Understanding the emergence of cooperation in systems of computational agents is crucial for the development of effective cooperative AI. Interaction among individuals in real-world settings are often sparse and occur within a broad spectrum of ...
0
12
Metrics
Total Citations0
Total Downloads12
Last 12 Months12
Last 6 weeks1
Get Access
research-article
May 2024
Grasper: A Generalist Pursuer for Pursuit-Evasion Problems
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 1147–1155

Pursuit-evasion games (PEGs) model interactions between a team of pursuers and an evader in graph-based environments such as urban street networks. Recent advancements have demonstrated the effectiveness of the pre-training and fine-tuning paradigm in ...
0
9
Metrics
Total Citations0
Total Downloads9
Last 12 Months9
Last 6 weeks3
Get Access
research-article
May 2024
Higher Order Reasoning under Intent Uncertainty Reinforces the Hobbesian Trap
- Otto Kuusela,
- Debraj Roy
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 1066–1074

Civilisations in the universe face the difficulty of communicating and trying to understand others' intentions. Moreover, advanced civilisations could develop weapons to pre-emptively eliminate any civilisations that present a future threat - this is ...
0
9
Metrics
Total Citations0
Total Downloads9
Last 12 Months9
Last 6 weeks0
Get Access
research-article
May 2024
Analysing the Sample Complexity of Opponent Shaping
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 623–631

Learning in general-sum games often yields collectively sub-optimal results. Addressing this, opponent shaping (OS) methods actively guide the learning processes of other agents, empirically leading to improved individual and group performances. Early OS ...
0
8
Metrics
Total Citations0
Total Downloads8
Last 12 Months8
Last 6 weeks0
Get Access
research-article
May 2024
Potential-Based Reward Shaping for Intrinsic Motivation
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 589–597

Recently there has been a proliferation of intrinsic motivation (IM) reward-shaping methods to learn in complex and sparse-reward environments. These methods can often inadvertently change the set of optimal policies in an environment, leading to ...
0
22
Metrics
Total Citations0
Total Downloads22
Last 12 Months22
Last 6 weeks3
Get Access
research-article
May 2024
Learning and Calibrating Heterogeneous Bounded Rational Market Behaviour with Multi-agent Reinforcement Learning
- Benjamin Patrick Evans,
- Sumitra Ganesh
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 534–543

Agent-based models (ABMs) have shown promise for modelling various real world phenomena incompatible with traditional equilibrium analysis. However, a critical concern is the manual definition of behavioural rules in ABMs. Recent developments in multi-...
0
22
Metrics
Total Citations0
Total Downloads22
Last 12 Months22
Last 6 weeks9
Get Access
research-article
May 2024
Reinforcement Learning in the Wild with Maximum Likelihood-based Model Transfer
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 516–524

In this paper, we study the problem of transferring the available Markov Decision Process (MDP) models to learn and plan efficiently in an unknown but similar MDP. We refer to it as Model Transfer Reinforcement Learning (MTRL) problem. First, we ...
0
10
Metrics
Total Citations0
Total Downloads10
Last 12 Months10
Last 6 weeks1
Get Access
research-article
May 2024
Boosting Continuous Control with Consistency Policy
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 335–344

Due to its training stability and strong expression, the diffusion model has attracted considerable attention in offline reinforcement learning. However, several challenges have also come with it: 1) The demand for a large number of diffusion steps makes ...
0
8
Metrics
Total Citations0
Total Downloads8
Last 12 Months8
Last 6 weeks0
Get Access
research-article
May 2024
Deep Anomaly Detection via Active Anomaly Search
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 308–316

Anomaly detection (AD) holds substantial practical value, and considering the limited labeled data, the semi-supervised anomaly detection technique has garnered increasing attention. We find that previous methods suffer from insufficient exploitation of ...
0
33
Metrics
Total Citations0
Total Downloads33
Last 12 Months33
Last 6 weeks9
Get Access

Applied Filters

People

Names

Institutions

Authors

Publications

Proceedings/Book Names

All Publications

Content Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

A Survey of Multi-Agent Deep Reinforcement Learning with Communication

pgeon applied to Overcooked-AI to explain agents' behaviour

Toward Explainable Agent Behaviour

Decision Market Based Learning for Multi-agent Contextual Bandit Problems

Unifying Regret and State-Action Space Coverage for Effective Unsupervised Environment Design

Neurological Based Timing Mechanism for Reinforcement Learning

GOV-REK: Governed Reward Engineering Kernels for Designing Robust Multi-Agent Reinforcement Learning Systems

Emergent Dominance Hierarchies in Reinforcement Learning Agents

Time-Constrained Restless Multi-Armed Bandits with Applications to City Service Scheduling

ELA: Exploited Level Augmentation for Offline Learning in Zero-Sum Games

Addressing Permutation Challenges in Multi-Agent Reinforcement Learning

Emergent Cooperation under Uncertain Incentive Alignment

Grasper: A Generalist Pursuer for Pursuit-Evasion Problems

Higher Order Reasoning under Intent Uncertainty Reinforces the Hobbesian Trap

Analysing the Sample Complexity of Opponent Shaping

Potential-Based Reward Shaping for Intrinsic Motivation

Learning and Calibrating Heterogeneous Bounded Rational Market Behaviour with Multi-agent Reinforcement Learning

Reinforcement Learning in the Wild with Maximum Likelihood-based Model Transfer

Boosting Continuous Control with Consistency Policy

Deep Anomaly Detection via Active Anomaly Search