Optimizing Variational Quantum Circuits Using Metaheuristic Strategies in Reinforcement Learning

www.lmu.de | UB | Blättern | Hilfe

Zur erweiterten Suche

English

Zur erweiterten Suche

Kölle, Michael; Seidl, Daniel; Zorn, Maximilian ORCID: https://orcid.org/0009-0006-2750-7495; Altmann, Philipp ORCID: https://orcid.org/0000-0003-1134-176X; Stein, Jonas ORCID: https://orcid.org/0000-0001-5727-9151 und Gabor, Thomas ORCID: https://orcid.org/0000-0003-2048-8667 (2024): Optimizing Variational Quantum Circuits Using Metaheuristic Strategies in Reinforcement Learning. QCE 2024: IEEE International Conference on Quantum Computing and Engineering, Montréal, Canada, 15.- 20. September 2024. Culhane, Candace; Byrd, Greg; Muller, Hausi; Alexev, Yuri und Sheldon, Sarah (Hrsg.): In: Proceedings Volume II of III IEEE Quantum Week 2024, Los Alamitos: IEEE Computer Society. S. 323-328

Volltext auf 'Open Access LMU' nicht verfügbar.

DOI: http://dx.doi.org/10.1109/QCE60285.2024.10300

Externer Volltext: https://www.computer.org/csdl/proceedings-article/qce/2024/413702a323/23oqmUz6JjO

Abstract

Quantum Reinforcement Learning (QRL) offers potential advantages over classical Reinforcement Learning, such as compact state space representation and faster convergence in certain scenarios. However, practical benefits require further validation. QRL faces challenges like flat solution landscapes, where traditional gradient-based methods are inefficient, necessitating the use of gradient-free algorithms. This work explores the integration of metaheuristic algorithms — Particle Swarm Optimization, Ant Colony Optimization, Tabu Search, Genetic Algorithm, Simulated Annealing, and Harmony Search — into QRL. These algorithms provide flexibility and efficiency in parameter optimization. Evaluations in 5× 5 MiniGrid Reinforcement Learning environments show that, all algorithms yield nearoptimal results, with Simulated Annealing and Particle Swarm Optimization performing best. In the Cart Pole environment, Simulated Annealing, Genetic Algorithms, and Particle Swarm Optimization achieve optimal results, while the others perform slightly better than random action selection. These findings demonstrate the potential of Particle Swarm Optimization and Simulated Annealing for efficient QRL learning, emphasizing the need for careful algorithm selection and adaptation.

Dokumententyp:	Konferenzbeitrag (Paper)
Keywords:	Metaheuristics ; Stability criteria ; Reinforcement learning ; Simulated annealing ; Circuit stability ; Particle swarm optimization ; Quantum circuit ; Robots ; Genetic algorithms ; Testing
Themengebiete:	000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik
ISBN:	979-8-3315-4137-8
Ort:	Los Alamitos
Sprache:	Englisch
Dokumenten ID:	128861
Datum der Veröffentlichung auf Open Access LMU:	05. Nov. 2025 14:29
Letzte Änderungen:	05. Nov. 2025 14:29

Dokument bearbeiten