SEGym: Optimizing Large Language Model Assisted Software Engineering Agents with Reinforcement Learning

www.lmu.de | UB | Blättern | Hilfe

Zur erweiterten Suche

English

Zur erweiterten Suche

Stenzel, Gerhard; Schmid, Kyrill; Kölle, Michael; Altmann, Philipp ORCID: https://orcid.org/0000-0003-1134-176X; Lingsch-Rosenfeld, Marian ORCID: https://orcid.org/0000-0002-8172-3184; Zorn, Maximilian ORCID: https://orcid.org/0009-0006-2750-7495; Bücher, Tim; Gabor, Thomas ORCID: https://orcid.org/0000-0003-2048-8667; Wirsing, Martin und Belzner, Lenz (2025): SEGym: Optimizing Large Language Model Assisted Software Engineering Agents with Reinforcement Learning. AISoLA 2024: Second International Conference, Chersonissos, Griechenland, 30. Oktober - 03. November 2024. Bernhard, Steffen (Hrsg.): In: Bridging the Gap Between AI and Reality : Second International Conference, AISoLA 2024, Crete, Greece, October 30 – November 3, 2024, Proceedings, Cham: Springer. S. 107-124

Volltext auf 'Open Access LMU' nicht verfügbar.

DOI: 10.1007/978-3-031-75434-0_8

Abstract

Current software development agents based on large language models (LLMs) are often defined using heuristic methods, which can limit their flexibility and effectiveness. Moreover, the entry barriers for new researchers in this field are high, largely due to the complex infrastructure required to develop and optimize these agents. This paper proposes a new approach: modeling software development agents over LLMs as a partially observable Markov decision process (POMDP) to enable data-driven optimization. To support this approach, we introduce SEGym, a framework based on the Gym interface for reinforcement learning agents. SEGym simplifies the setup of optimization experiments for software development agents within the POMDP framework, making it more accessible for researchers to engage in this field.

Dokumententyp:	Konferenzbeitrag (Paper)
Fakultät:	Mathematik, Informatik und Statistik > Informatik
Themengebiete:	000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik
ISBN:	978-3-031-75434-0
Ort:	Cham
Sprache:	Englisch
Dokumenten ID:	127156
Datum der Veröffentlichung auf Open Access LMU:	31. Jul. 2025 13:30
Letzte Änderungen:	20. Nov. 2025 15:55

Dokument bearbeiten