Number of publications: 7
Journal articles
Conference papers
Feng, Xuening; Jiang, Zhaohui; Kaufmann, Timo (ORCID: https://orcid.org/0000-0001-5193-8574); Xu, Puchen; Hüllermeier, Eyke (ORCID: https://orcid.org/0000-0002-9944-4108); Weng, Paul and Zhu, Yifei
(April 2025):
DUO: Diverse, Uncertain, On-Policy Query Generation and Selection for Reinforcement Learning from Human Feedback.
The 39th Annual AAAI Conference on Artificial Intelligence, Philadelphia, Pennsylvania, USA, February 25 - March 4, 2025.
Proceedings of the AAAI Conference on Artificial Intelligence.
Vol. 39, No. 16
pp. 16604-16612
Szörényi, Balázs; Busa-Fekete, Róbert; Weng, Paul and Hüllermeier, Eyke (ORCID: https://orcid.org/0000-0002-9944-4108)
(July 2015):
Qualitative Multi-Armed Bandits: A Quantile-Based Approach.
32nd International Conference on Machine Learning, Lille, France, July 6 - 11, 2015.
In: Proceedings of the 32nd International Conference on Machine Learning,
Vol. 37
pp. 1660-1668
[PDF, 449kB]
Busa-Fekete, Róbert; Szörényi, Balázs; Cheng, Weiwei; Weng, Paul and Hüllermeier, Eyke (ORCID: https://orcid.org/0000-0002-9944-4108)
(2013):
Top-k Selection based on Adaptive Sampling of Noisy Preferences.
ICML'13: 30th International Conference on Machine Learning, Atlanta, GA, USA, June 16 - 21, 2013.
Dasgupta, Sanjoy and McAllester, David (eds.):
In: Proceedings of the 30th International Conference on Machine Learning,
Vol. 28, No. 3
pp. 1094-1102
[PDF, 434kB]
Other
This list was generated on Sat May 31 23:36:28 2025 CEST.