Kaufmann, Timo
ORCID: https://orcid.org/0000-0001-5193-8574; Weng, Paul; Bengs, Viktor
ORCID: https://orcid.org/0000-0001-6988-6186 und Hüllermeier, Eyke
ORCID: https://orcid.org/0000-0002-9944-4108
(30. April 2024):
A Survey of Reinforcement Learning from Human Feedback.
[PDF, 1MB]
Preview

External fulltext: https://arxiv.org/abs/2312.14925
Item Type: | Other |
---|---|
Faculties: | Mathematics, Computer Science and Statistics > Computer Science > Artificial Intelligence and Machine Learning |
Subjects: | 000 Computer science, information and general works > 004 Data processing computer science |
URN: | urn:nbn:de:bvb:19-epub-125328-1 |
Language: | English |
Item ID: | 125328 |
Date Deposited: | 09. Apr 2025 15:53 |
Last Modified: | 09. Apr 2025 15:53 |