Marginal effects for non-linear prediction functions

www.lmu.de | UB | Blättern | Hilfe

Zur erweiterten Suche

English

Zur erweiterten Suche

Scholbeck, Christian A. ORCID: https://orcid.org/0000-0001-6607-4895; Casalicchio, Giuseppe ORCID: https://orcid.org/0000-0001-5324-5966; Molnar, Christoph; Bischl, Bernd und Heumann, Christian (2024): Marginal effects for non-linear prediction functions. In: Data Mining and Knowledge Discovery, Bd. 38, Nr. 5: S. 2997-3042 [PDF, 10MB]

Vorschau

Creative Commons: Namensnennung 4.0 (CC-BY)

Veröffentlichte Version

DOI: 10.1007/s10618-023-00993-x

Abstract

Beta coefficients for linear regression models represent the ideal form of an interpretable feature effect. However, for non-linear models such as generalized linear models, the estimated coefficients cannot be interpreted as a direct feature effect on the predicted outcome. Hence, marginal effects are typically used as approximations for feature effects, either as derivatives of the prediction function or forward differences in prediction due to changes in feature values. While marginal effects are commonly used in many scientific fields, they have not yet been adopted as a general model-agnostic interpretation method for machine learning models. This may stem from the ambiguity surrounding marginal effects and their inability to deal with the non-linearities found in black box models. We introduce a unified definition of forward marginal effects (FMEs) that includes univariate and multivariate, as well as continuous, categorical, and mixed-type features. To account for the non-linearity of prediction functions, we introduce a non-linearity measure for FMEs. Furthermore, we argue against summarizing feature effects of a non-linear prediction function in a single metric such as the average marginal effect. Instead, we propose to average homogeneous FMEs within population subgroups, which serve as conditional feature effect estimates.

Dokumententyp:	Zeitschriftenartikel
Fakultät:	Mathematik, Informatik und Statistik > Statistik
Themengebiete:	000 Informatik, Informationswissenschaft, allgemeine Werke > 004 Informatik 500 Naturwissenschaften und Mathematik > 510 Mathematik
URN:	urn:nbn:de:bvb:19-epub-122518-0
ISSN:	1384-5810
Sprache:	Englisch
Dokumenten ID:	122518
Datum der Veröffentlichung auf Open Access LMU:	18. Nov. 2024 09:35
Letzte Änderungen:	18. Nov. 2024 09:35

Dokument bearbeiten