Quantum Advantage Actor-Critic for Reinforcement Learning

www.lmu.de | UB | Blättern | Hilfe

Zur erweiterten Suche

English

Zur erweiterten Suche

Kölle, Michael; Hgog, Mohamad; Ritz, Fabian ORCID: https://orcid.org/0000-0001-7707-1358; Altmann, Philipp ORCID: https://orcid.org/0000-0003-1134-176X; Zorn, Maximilian ORCID: https://orcid.org/0009-0006-2750-7495; Stein, Jonas ORCID: https://orcid.org/0000-0001-5727-9151 und Linnhoff-Popien, Claudia ORCID: https://orcid.org/0000-0001-6284-9286 (2024): Quantum Advantage Actor-Critic for Reinforcement Learning. ICAART 2024: International Conference on Agents and Artificial Intelligence, Rome, Italy, 24. - 26. Februar 2024. Rocha, Ana Paula; Steels, Luc und Herik, Jaap van den (Hrsg.): In: Proceedings of the 16th International Conference on Agents and Artificial Intelligence, Bd. 1 Setúbal: SciTePress. S. 297-304

Volltext auf 'Open Access LMU' nicht verfügbar.

DOI: 10.5220/0012383900003636

Externer Volltext: https://www.scitepress.org/Papers/2024/123839/

Abstract

Quantum computing offers efficient encapsulation of high-dimensional states. In this work, we propose a novel quantum reinforcement learning approach that combines the Advantage Actor-Critic algorithm with variational quantum circuits by substituting parts of the classical components. This approach addresses reinforcement learning’s scalability concerns while maintaining high performance. We empirically test multiple quantum Advantage Actor-Critic configurations with the well known Cart Pole environment to evaluate our approach in control tasks with continuous state spaces. Our results indicate that the hybrid strategy of using either a quantum actor or quantum critic with classical post-processing yields a substantial performance increase compared to pure classical and pure quantum variants with similar parameter counts. They further reveal the limits of current quantum approaches due to the hardware constraints of noisy intermediate-scale quantum computers, suggesting further research to scale hybrid approaches for larger and more complex control tasks.

Dokumententyp:	Konferenzbeitrag (Paper)
Fakultät:	Mathematik, Informatik und Statistik > Informatik > Künstliche Intelligenz und Maschinelles Lernen
Themengebiete:	000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik
ISBN:	978-989-758-680-4
Ort:	Setúbal
Sprache:	Englisch
Dokumenten ID:	128856
Datum der Veröffentlichung auf Open Access LMU:	10. Nov. 2025 14:53
Letzte Änderungen:	10. Nov. 2025 15:48

Dokument bearbeiten