Gradient-Based Label Binning in Multi-label Classification

Rapp, Michael ORCID: https://orcid.org/0000-0001-8570-8240; Mencía, Eneldo Loza; Fürnkranz, Johannes and Hüllermeier, Eyke ORCID: https://orcid.org/0000-0002-9944-4108 (September 2021): Gradient-Based Label Binning in Multi-label Classification. Machine Learning and Knowledge Discovery in Databases, Bilbao, Spain, September 13–17, 2021. In: Machine Learning and Knowledge Discovery in Databases. Research Track, Vol. 12977. Cham: Springer. pp. 462-477

Full text not available on 'Open Access LMU'.

DOI: 10.1007/978-3-030-86523-8_28

Abstract

In multi-label classification, where a single example may be associated with several class labels at the same time, the ability to model dependencies between labels is considered crucial to effectively optimize non-decomposable evaluation measures, such as the Subset 0/1 loss. The gradient boosting framework provides a well-studied foundation for learning models that are specifically tailored to such a loss function, and recent research attests to its ability to achieve high predictive accuracy in the multi-label setting. The use of second-order derivatives, as employed by many recent boosting approaches, helps to guide the minimization of non-decomposable losses because it incorporates information about pairs of labels into the optimization process. On the downside, this comes with high computational costs, even if the number of labels is small. In this work, we address the computational bottleneck of such approaches (the need to solve a system of linear equations) by integrating a novel approximation technique into the boosting procedure. Based on the derivatives computed during training, we dynamically group the labels into a predefined number of bins to impose an upper bound on the dimensionality of the linear system. Our experiments, using an existing rule-based algorithm, suggest that this may boost the speed of training without any significant loss in predictive performance.
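
The abstract does not spell out the binning rule, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that labels are grouped by equal-width binning on a per-label criterion (the negative gradient divided by the regularized diagonal Hessian entry) and that all labels in a bin share one predicted score. The function name `binned_newton_step`, the criterion, and the L2 weight `l2` are illustrative assumptions. Under these assumptions, the linear system shrinks from one unknown per label to at most `num_bins` unknowns.

```python
import numpy as np


def binned_newton_step(gradient, hessian, num_bins, l2=1.0):
    """Approximately solve (H + l2 * I) p = -g using at most num_bins unknowns.

    Hypothetical helper for illustration only. Returns the per-label scores
    and the bin index assigned to each label.
    """
    gradient = np.asarray(gradient, dtype=float)
    hessian = np.asarray(hessian, dtype=float)
    num_labels = gradient.shape[0]

    # Assumed grouping criterion: the Newton step each label would take
    # if dependencies between labels were ignored.
    criterion = -gradient / (np.diag(hessian) + l2)

    # Equal-width binning of the criterion values (an assumption).
    lo, hi = criterion.min(), criterion.max()
    if hi == lo:
        bin_index = np.zeros(num_labels, dtype=int)
    else:
        width = (hi - lo) / num_bins
        bin_index = ((criterion - lo) / width).astype(int)
        bin_index = np.minimum(bin_index, num_bins - 1)

    # Drop empty bins and renumber the remaining ones as 0..k-1.
    _, bin_index = np.unique(bin_index, return_inverse=True)
    k = int(bin_index.max()) + 1

    # Membership matrix M: M[i, b] = 1 if label i belongs to bin b.
    membership = np.zeros((num_labels, k))
    membership[np.arange(num_labels), bin_index] = 1.0

    # Restricting p = M q turns the n x n system into a k x k system:
    # (M^T (H + l2 * I) M) q = -M^T g.
    regularized = hessian + l2 * np.eye(num_labels)
    reduced_hessian = membership.T @ regularized @ membership
    reduced_gradient = membership.T @ gradient
    bin_scores = np.linalg.solve(reduced_hessian, -reduced_gradient)

    # Every label receives the score of its bin.
    return bin_scores[bin_index], bin_index


# Usage: six labels, but a linear system with at most three unknowns.
rng = np.random.default_rng(0)
a = rng.normal(size=(6, 6))
example_hessian = a @ a.T  # symmetric positive semi-definite
example_gradient = rng.normal(size=6)
scores, bins = binned_newton_step(example_gradient, example_hessian, num_bins=3)
```

In this sketch the reduced system is the exact Newton system restricted to score vectors that are constant within each bin, so it reproduces the full solution whenever that solution already has this form. How closely it tracks the full solution otherwise depends on the chosen criterion and the number of bins.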

Document type:	Conference contribution (Paper)
Form of publication:	Publisher's Version
Faculty:	Mathematics, Computer Science and Statistics > Computer Science > Artificial Intelligence and Machine Learning
Subjects:	000 Computer science, information and general works > 000 Computer science, knowledge, and systems
ISSN:	0302-9743
Place of publication:	Cham
Language:	English
Document ID:	92512
Date deposited on Open Access LMU:	18 Jul 2022, 12:42
Last modified:	03 Mar 2023, 13:27
