Memory Bounded Open-Loop Planning in Large POMDPs Using Thompson Sampling

www.lmu.de | UB | Blättern | Hilfe

Zur erweiterten Suche

English

Zur erweiterten Suche

Phan, Thomy; Belzner, Lenz; Kiermeier, Marie; Friedrich, Markus; Schmid, Kyrill und Linnhoff-Popien, Claudia (2019): Memory Bounded Open-Loop Planning in Large POMDPs Using Thompson Sampling. In: Thirty-Third Aaai Conference on Artificial Intelligence / Thirty-First Innovative Applications of Artificial Intelligence Conference / Ninth Aaai Symposium on Educational Advances in Artificial Intelligence: S. 7941-7948

Volltext auf 'Open Access LMU' nicht verfügbar.

Abstract

State-of-the-art approaches to partially observable planning like POMCP are based on stochastic tree search. While these approaches are computationally efficient, they may still construct search trees of considerable size, which could limit the performance due to restricted memory resources. In this paper, we propose Partially Observable Stacked Thompson Sampling (POSTS), a memory bounded approach to open-loop planning in large POMDPs, which optimizes a fixed size stack of Thompson Sampling bandits. We empirically evaluate POSTS in four large benchmark problems and compare its performance with different tree-based approaches. We show that POSTS achieves competitive performance compared to tree-based open-loop planning and offers a performance-memory tradeoff, making it suitable for partially observable planning with highly restricted computational and memory resources.

Dokumententyp:	Zeitschriftenartikel
Fakultät:	Mathematik, Informatik und Statistik > Informatik
Themengebiete:	000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik
Sprache:	Englisch
Dokumenten ID:	82318
Datum der Veröffentlichung auf Open Access LMU:	15. Dez. 2021 15:01
Letzte Änderungen:	15. Dez. 2021 15:01

Dokument bearbeiten