Please use this identifier to cite or link to this item: http://hdl.handle.net/2445/174612
Title: Penalized logistic regression to improve predictive capacity of rare events in surveys
Author: Pesantez-Narvaez, Jessica
Guillén, Montserrat
Keywords: Anàlisi de regressió
Control predictiu
Processament de dades
Enquestes
Equador
Regression analysis
Predictive control
Data processing
Surveys
Ecuador
Issue Date: 2020
Publisher: IOS Press
Abstract: Logistic regression as a modelling technique of rare binary dependent variables with much fewer events (ones) than non-events (zeros) tends to underestimate their probability of occurrence. The vast literature devoted to the prediction of rare binary data identifies several ways to improve predictive performance by making modifications to the likelihood estimation. We propose two weighting mechanisms for incorporation in a pseudo-likelihood estimation that improve the predictive capacity of rare binary responses in data collected in complex surveys. We multiply sampling weights by specific correctors that lead to lower root mean square errors for event observations in almost all deciles. A case study is discussed where this method is implemented to predict the probability of suffering a workplace accident in a logistic regression model that is estimated with data from a survey conducted in Ecuador.
Note: Versió postprint del document publicat a: https://doi.org/10.3233/JIFS-179641
It is part of: Journal of Intelligent and Fuzzy Systems, 2020, vol. 38, num. 5, p. 5497-5507
URI: http://hdl.handle.net/2445/174612
Related resource: https://doi.org/10.3233/JIFS-179641
ISSN: 1064-1246
Appears in Collections:Articles publicats en revistes (Econometria, Estadística i Economia Aplicada)

Files in This Item:
File Description SizeFormat 
699712.pdf324.13 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.