Fürnkranz, Johannes; Hüllermeier, Eyke ORCID: https://orcid.org/0000-0002-9944-4108; Cheng, Weiwei and Park, Sang-Hyeun
(2012):
Preference-based reinforcement learning: a formal framework and a policy iteration algorithm.
In: Machine Learning, Vol. 89, No. 1-2: pp. 123-156
Item Type: | Journal article |
---|---|
Faculties: | Mathematics, Computer Science and Statistics > Computer Science > Artificial Intelligence and Machine Learning |
Subjects: | 000 Computer science, information and general works > 004 Data processing computer science |
ISSN: | 0885-6125 |
Language: | English |
Item ID: | 91494 |
Date Deposited: | 24. Mar 2022, 06:35 |
Last Modified: | 24. Mar 2022, 06:35 |