Fürnkranz, Johannes; Hüllermeier, Eyke
ORCID: https://orcid.org/0000-0002-9944-4108; Cheng, Weiwei und Park, Sang-Hyeun
(2012):
Preference-based reinforcement learning: a formal framework and a policy iteration algorithm.
In: Machine Learning, Vol. 89, No. 1-2: pp. 123-156
| Item Type: | Journal article |
|---|---|
| Faculties: | Mathematics, Computer Science and Statistics > Computer Science > Artificial Intelligence and Machine Learning |
| Subjects: | 000 Computer science, information and general works > 004 Data processing computer science |
| ISSN: | 0885-6125 |
| Language: | English |
| Item ID: | 91494 |
| Date Deposited: | 24. Mar 2022 06:35 |
| Last Modified: | 24. Mar 2022 06:35 |
