Reinforcement Learning for Multi-Agent Stochastic Resource Collection

MCML Authors

Niklas Strauß

Dr.

→ Group Matthias Schubert
Spatial Artificial Intelligence

David Winkel

Dr.

* Former Member

→ Group Thomas Seidl
Database Systems, Data Mining and AI

Max Berrendorf

Dr.

* Former Member

→ Group Volker Tresp
Database Systems, Data Mining and AI

Matthias Schubert

Prof. Dr.

Principal Investigator

Spatial Artificial Intelligence

Abstract

Stochastic Resource Collection (SRC) describes tasks where an agent tries to collect a maximal amount of dynamic resources while navigating through a road network. An instance of SRC is the traveling officer problem (TOP), where a parking officer tries to maximize the number of fined parking violations. In contrast to vehicular routing problems, in SRC tasks, resources might appear and disappear by an unknown stochastic process, and thus, the task is inherently more dynamic. In most applications of SRC, such as TOP, covering realistic scenarios requires more than one agent. However, directly applying multi-agent approaches to SRC yields challenges considering temporal abstractions and inter-agent coordination. In this paper, we propose a novel multi-agent reinforcement learning method for the task of Multi-Agent Stochastic Resource Collection (MASRC). To this end, we formalize MASRC as a Semi-Markov Game which allows the use of temporal abstraction and asynchronous actions by various agents. In addition, we propose a novel architecture trained with independent learning, which integrates the information about collaborating agents and allows us to take advantage of temporal abstractions. Our agents are evaluated on the multiple traveling officer problem, an instance of MASRC where multiple officers try to maximize the number of fined parking violations. Our simulation environment is based on real-world sensor data. Results demonstrate that our proposed agent can beat various state-of-the-art approaches.
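
The semi-Markov setting mentioned in the abstract can be illustrated with a small sketch. It is not the authors' implementation. It only shows the two ideas the abstract names: actions that last a variable amount of time (temporal abstraction), and agents that are asked for a new action at different moments (asynchronous actions). The function names, the convention that a reward arrives when an action completes, and the discount value are assumptions made for illustration.

```python
import heapq


def discounted_return(transitions, gamma=0.9):
    """Discounted return for temporally extended actions.

    transitions: list of (duration, reward) pairs. Assumption: the reward
    is received when the action completes, so it is discounted by
    gamma raised to the total time elapsed so far.
    """
    total, elapsed = 0.0, 0
    for duration, reward in transitions:
        elapsed += duration
        total += gamma ** elapsed * reward
    return total


def decision_order(action_durations):
    """List the (time, agent) points at which each agent picks an action.

    action_durations: dict mapping a (hypothetical) agent id to the
    durations of its successive actions. Agents act asynchronously:
    whoever finishes first decides next.
    """
    # Every agent makes its first decision at time 0.
    queue = [(0, agent) for agent in sorted(action_durations)]
    heapq.heapify(queue)
    next_index = {agent: 0 for agent in action_durations}
    order = []
    while queue:
        time, agent = heapq.heappop(queue)
        i = next_index[agent]
        if i >= len(action_durations[agent]):
            continue  # this agent has no further actions to take
        order.append((time, agent))
        next_index[agent] = i + 1
        # The agent decides again once its current action has finished.
        heapq.heappush(queue, (time + action_durations[agent][i], agent))
    return order
```

For example, `decision_order({"a": [2, 2], "b": [3]})` has agent "a" deciding a second time at time 2 while agent "b" is still busy with its first action. This interleaving is what a synchronous, fixed-step formulation cannot express directly.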

inproceedings SWB+22

ECML-PKDD 2022

European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases. Grenoble, France, Sep 19-23, 2022.

Authors

N. Strauß • D. Winkel • M. Berrendorf • M. Schubert

Research Area

A3 | Computational Models

BibTeXKey: SWB+22