Specification Aware Multi-Agent Reinforcement Learning

www.lmu.de | UB | Blättern | Hilfe

Zur erweiterten Suche

English

Zur erweiterten Suche

Ritz, Fabian ORCID: https://orcid.org/0000-0001-7707-1358; Phan, Thomy; Müller, Robert ORCID: https://orcid.org/0000-0003-3108-713X; Gabor, Thomas; Sedlmeier, Andreas; Zeller, Marc; Wieghardt, Jan; Schmid, Reiner; Sauer, Horst; Klein, Cornel und Linnhoff-Popien, Claudia (2022): Specification Aware Multi-Agent Reinforcement Learning. 13th International Conference, ICAART 2021, Virtual Event, February 4–6, 2021. Rocha, Ana Paula; Steels, Luc und Herik, Jaap van den (Hrsg.): In: Agents and Artificial Intelligence. 13th International Conference, ICAART 2021, Virtual Event, February 4–6, 2021, Revised Selected Papers, Lecture Notes in Computer Science Bd. 13251 Cham: Springer. S. 3-21

Volltext auf 'Open Access LMU' nicht verfügbar.

DOI: 10.1007/978-3-031-10161-8_1

Abstract

Engineering intelligent industrial systems is challenging due to high complexity and uncertainty with respect to domain dynamics and multiple agents. If industrial systems act autonomously, their choices and results must be within specified bounds to satisfy these requirements. Reinforcement learning (RL) is promising to find solutions that outperform known or handcrafted heuristics. However in industrial scenarios, it also is crucial to prevent RL from inducing potentially undesired or even dangerous behavior. This paper considers specification alignment in industrial scenarios with multi-agent reinforcement learning (MARL). We propose to embed functional and non-functional requirements into the reward function, enabling the agents to learn to align with the specification. We evaluate our approach in a smart factory simulation representing an industrial lot-size-one production facility, where we train up to eight agents using DQN, VDN, and QMIX. Our results show that the proposed approach enables agents to satisfy a given set of requirements.

Dokumententyp:	Konferenzbeitrag (Paper)
Fakultät:	Mathematik, Informatik und Statistik > Informatik
Themengebiete:	000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik
ISSN:	0302-9743
Ort:	Cham
Sprache:	Englisch
Dokumenten ID:	110248
Datum der Veröffentlichung auf Open Access LMU:	28. Mrz. 2024 13:38
Letzte Änderungen:	28. Mrz. 2024 13:38

Dokument bearbeiten