Multi-Objective Optimization of Performance and Interpretability of Tabular Supervised Machine Learning Models

Schneider, Lennart ORCID: https://orcid.org/0000-0003-4152-5308; Bischl, Bernd ORCID: https://orcid.org/0000-0001-6002-6980 and Thomas, Janek ORCID: https://orcid.org/0000-0003-4511-6245 (2023): Multi-Objective Optimization of Performance and Interpretability of Tabular Supervised Machine Learning Models. GECCO '23: Genetic and Evolutionary Computation Conference, Lisbon, Portugal, July 15 - 19, 2023. In: Silva, Sara and Paquete, Luís (eds.): GECCO '23: Proceedings of the Genetic and Evolutionary Computation Conference, New York, NY, United States: Association for Computing Machinery. pp. 538-547 [PDF, 781kB]

Creative Commons: Attribution 4.0 (CC-BY)

Published version

DOI: 10.1145/3583131.3590380

Abstract

We present a model-agnostic framework for jointly optimizing the predictive performance and interpretability of supervised machine learning models for tabular data. Interpretability is quantified via three measures: feature sparsity, interaction sparsity of features, and sparsity of non-monotone feature effects. By treating hyperparameter optimization of a machine learning algorithm as a multi-objective optimization problem, our framework allows for generating diverse models that trade off high performance and ease of interpretability in a single optimization run. Efficient optimization is achieved via augmentation of the search space of the learning algorithm by incorporating feature selection, interaction and monotonicity constraints into the hyperparameter search space. We demonstrate that the optimization problem effectively translates to finding the Pareto optimal set of groups of selected features that are allowed to interact in a model, along with finding their optimal monotonicity constraints and optimal hyperparameters of the learning algorithm itself. We then introduce a novel evolutionary algorithm that can operate efficiently on this augmented search space. In benchmark experiments, we show that our framework is capable of finding diverse models that are highly competitive or outperform state-of-the-art XGBoost or Explainable Boosting Machine models, both with respect to performance and interpretability.
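
To make the multi-objective formulation in the abstract concrete, the following is a minimal sketch of how candidate models could be compared across predictive error and the three sparsity measures, and how the non-dominated (Pareto optimal) set is extracted. It is not the authors' evolutionary algorithm. The `Candidate` class, the feature names, the error values, and the choice to count interactions as feature pairs within a group are all illustrative assumptions.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Candidate:
    """Hypothetical encoding of one model configuration (not from the paper)."""
    name: str
    error: float          # validation error, lower is better (made-up values below)
    groups: tuple         # groups of selected features that are allowed to interact
    monotone: frozenset   # selected features constrained to a monotone effect

    def objectives(self):
        """Return (error, #features, #interacting pairs, #non-monotone features)."""
        selected = {f for group in self.groups for f in group}
        # Assumption: interactions are counted as distinct feature pairs within a group.
        pairs = {frozenset(p) for group in self.groups
                 for p in combinations(sorted(group), 2)}
        non_monotone = selected - self.monotone
        return (self.error, len(selected), len(pairs), len(non_monotone))


def dominates(a, b):
    """True if objective vector a is no worse than b everywhere and better somewhere."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def pareto_set(candidates):
    """Keep every candidate that no other candidate dominates (all objectives minimized)."""
    objs = [c.objectives() for c in candidates]
    return [c for c, o in zip(candidates, objs)
            if not any(dominates(other, o) for other in objs)]


# Illustrative candidates with invented feature names and errors.
candidates = [
    Candidate("A", 0.10, (("age", "income", "debt"), ("tenure",)), frozenset({"tenure"})),
    Candidate("B", 0.12, (("age", "income"),), frozenset({"age", "income"})),
    Candidate("C", 0.15, (("age",),), frozenset({"age"})),
    Candidate("D", 0.13, (("age", "income"), ("debt",)), frozenset({"age"})),
]

front = pareto_set(candidates)
print([c.name for c in front])
```

In this toy example, candidate D is dropped because B has lower error while using fewer features and fewer non-monotone effects. A, B, and C remain, since each trades off error against sparsity differently.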

Document type:	Conference contribution (Paper)
Faculty:	Mathematics, Computer Science and Statistics > Statistics
Subjects:	000 Computer science, information science, general works > 004 Computer science; 300 Social sciences > 310 Statistics
URN:	urn:nbn:de:bvb:19-epub-121930-7
ISBN:	979-8-4007-0119-1
Place of publication:	New York, NY, United States
Language:	English
Document ID:	121930
Date published on Open Access LMU:	29 Oct 2024 15:13
Last modified:	29 Oct 2024 15:13