Kaufmann, Timo
ORCID: https://orcid.org/0000-0001-5193-8574; Weng, Paul; Bengs, Viktor
ORCID: https://orcid.org/0000-0001-6988-6186 und Hüllermeier, Eyke
ORCID: https://orcid.org/0000-0002-9944-4108
(30. April 2024):
A Survey of Reinforcement Learning from Human Feedback.
[PDF, 1MB]
Preview
External fulltext: https://arxiv.org/abs/2312.14925
| Item Type: | Other |
|---|---|
| Faculties: | Mathematics, Computer Science and Statistics > Computer Science > Artificial Intelligence and Machine Learning |
| Subjects: | 000 Computer science, information and general works > 004 Data processing computer science |
| URN: | urn:nbn:de:bvb:19-epub-125328-1 |
| Language: | English |
| Item ID: | 125328 |
| Date Deposited: | 09. Apr 2025 15:53 |
| Last Modified: | 09. Apr 2025 15:53 |
