Abstract
In the last two decades, regularization techniques, in particular penalty-based methods, have become very popular in statistical modelling. Driven by technological developments, most approaches have been designed for high-dimensional problems with metric variables, whereas categorical data has largely been neglected. In recent years, however, it has become clear that regularization is also very promising when modelling categorical data. A specific trait of categorical data is that many parameters are typically needed to model the underlying structure. This results in complex estimation problems that call for structured penalties which are tailored to the categorical nature of the data. This article gives a systematic overview of penalty-based methods for categorical data developed so far and highlights some issues where further research is needed. We deal with categorical predictors as well as models for categorical response variables. The primary interest of this article is to give insight into basic properties of and differences between methods that are important with respect to statistical modelling in practice, without going into technical details or extensive discussion of asymptotic properties.
| Item Type: | Journal article |
|---|---|
| Form of publication: | Publisher's Version |
| Faculties: | Mathematics, Computer Science and Statistics > Statistics > Chairs/Working Groups > Seminar for Applied Stochastic |
| Subjects: | 500 Science > 510 Mathematics |
| URN: | urn:nbn:de:bvb:19-epub-43139-0 |
| ISSN: | 1477-0342 |
| Alliance/National Licence: | This publication is with permission of the rights owner freely accessible due to an Alliance licence and a national licence (funded by the DFG, German Research Foundation) respectively. |
| Language: | English |
| Item ID: | 43139 |
| Date Deposited: | 12. Apr 2018 14:27 |
| Last Modified: | 04. Nov 2020 13:18 |

