Number of publications: 5

Journal article

Busa-Fekete, Róbert; Szörényi, Balázs; Weng, Paul; Cheng, Weiwei and Hüllermeier, Eyke (ORCID: https://orcid.org/0000-0002-9944-4108) (2014): Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm. In: Machine Learning, Vol. 97, No. 3, pp. 327-351

Conference paper

Szörényi, Balázs; Busa-Fekete, Róbert; Weng, Paul and Hüllermeier, Eyke (ORCID: https://orcid.org/0000-0002-9944-4108) (2015): Qualitative Multi-Armed Bandits: A Quantile-Based Approach. 32nd International Conference on Machine Learning, Lille, France, July 6 - 11, 2015. In: Proceedings of the 32nd International Conference on Machine Learning, Vol. 37, pp. 1660-1668

Busa-Fekete, Róbert; Szörényi, Balázs; Cheng, Weiwei; Weng, Paul and Hüllermeier, Eyke (ORCID: https://orcid.org/0000-0002-9944-4108) (2013): Top-k Selection based on Adaptive Sampling of Noisy Preferences. ICML'13: 30th International Conference on Machine Learning, Atlanta GA USA, June 16 - 21, 2013. In: Dasgupta, Sanjoy and McAllester, David (eds.): Proceedings of the 30th International Conference on Machine Learning, Vol. 28, No. 3, pp. 1094-1102

Weng, Paul; Busa-Fekete, Róbert and Hüllermeier, Eyke (ORCID: https://orcid.org/0000-0002-9944-4108) (2013): Interactive Q-Learning with Ordinal Rewards and Unreliable Tutor. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD 2013). Reinforcement Learning with Generalized Feedback, Prague, 23rd September 2013. pp. 1-13

Weng, Paul; Busa-Fekete, Róbert and Hüllermeier, Eyke (ORCID: https://orcid.org/0000-0002-9944-4108) (2013): Preference-based Evolutionary Direct Policy Search. European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD 2013). Reinforcement Learning with Generalized Feedback, Prague, 23rd September 2013. pp. 1-8

This list was generated on Sun Apr 14 00:52:01 2024 CEST.