Logo Logo
Exportieren als [RSS feed] RSS 1.0 [RSS2 feed] RSS 2.0
Gruppiert nach: Dokumententyp | Veröffentlichungsdatum
Anzahl der Publikationen: 5

Zeitschriftenartikel

Busa-Fekete, Róbert; Szörényi, Balázs; Weng, Paul; Cheng, Weiwei und Hüllermeier, Eyke ORCID logoORCID: https://orcid.org/0000-0002-9944-4108 (2014): Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm. In: Machine Learning, Bd. 97, Nr. 3: S. 327-351

Konferenzbeitrag

Szörényi, Balázs; Busa-Fekete, Róbert; Weng, Paul und Hüllermeier, Eyke ORCID logoORCID: https://orcid.org/0000-0002-9944-4108 (Juli 2015): Qualitative Multi-Armed Bandits: A Quantile-Based Approach. 32nd International Conference on Machine Learning, Lille, France, July 6 - 11, 2015. In: Proceedings of the 32nd International Conference on Machine Learning, Bd. 37 S. 1660-1668 [PDF, 449kB]

Busa-Fekete, Róbert; Szörényi, Balázs; Cheng, Weiwei; Weng, Paul und Hüllermeier, Eyke ORCID logoORCID: https://orcid.org/0000-0002-9944-4108 (2013): Top-k Selection based on Adaptive Sampling of Noisy Preferences. ICML'13: 30th International Conference on International Conference on Machine Learning, Atlanta GA USA, June 16 - 21, 2013. Dasgupta, Sanjoy und McAllester, David (Hrsg.): In: Proceedings of the 30th International Conference on International Conference on Machine Learning, Bd. 28, Nr. 3 S. 1094-1102 [PDF, 434kB]

Weng, Paul; Busa-Fekete, Róbert und Hüllermeier, Eyke ORCID logoORCID: https://orcid.org/0000-0002-9944-4108 (2013): Interactive Q-Learning with Ordinal Rewards and Unreliable Tutor. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD 2013). Reinforcement Learning with Generalized Feedback, Prague, 23rd September 2013. S. 1-13

Weng, Paul; Busa-Fekete, Róbert und Hüllermeier, Eyke ORCID logoORCID: https://orcid.org/0000-0002-9944-4108 (2013): Preference-based Evolutionary Direct Policy Search. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD 2013). Reinforcement Learning with Generalized Feedback, Prague, 23rd September 2013. S. 1-8

Diese Liste wurde am Sat Dec 21 20:26:51 2024 CET erstellt.