Fürnkranz, Johannes; Hüllermeier, Eyke ORCID: 0000-0002-9944-4108; Cheng, Weiwei; Park, Sang-Hyeun
(2012):
Preference-based reinforcement learning: a formal framework and a policy iteration algorithm.
In: Machine Learning, Vol. 89, No. 1-2: pp. 123-156
Cheng, Weiwei; Fürnkranz, Johannes; Hüllermeier, Eyke ORCID: 0000-0002-9944-4108; Park, Sang-Hyeun
(2011):
Preference-Based Policy Iteration: Leveraging Preference Learning for Reinforcement Learning.
In: Gunopulos, Dimitrios; Hofmann, Thomas; Malerba, Donato; Vazirgiannis, Michalis (eds.):
Machine Learning and Knowledge Discovery in Databases. European Conference, ECML PKDD 2011, Athens, Greece, September 5-9, 2011. Proceedings, Part I. Lecture Notes in Computer Science, Vol. 6911. Berlin, Heidelberg: Springer. pp. 312-327