Heterogeneous Variable Selection in Nonlinear Panel Data Models: A Semi-Parametric Bayesian Approach

Tinbergen Institute Discussion Paper 2020-061/III

45 Pages Posted: 12 Nov 2020

See all articles by Anoek Castelein

Anoek Castelein

Erasmus University Rotterdam (EUR)

D. Fok

Erasmus Research Institute of Management (ERIM); Econometric Institute - Erasmus University Rotterdam; Tinbergen Institute Rotterdam

Richard Paap

Erasmus University Rotterdam (EUR) - Department of Econometrics; Tinbergen Institute; Erasmus Research Institute of Management (ERIM)

Date Written: September 22, 2020

Abstract

In this paper, we develop a general method for heterogeneous variable selection in Bayesian nonlinear panel data models. Heterogeneous variable selection refers to the possibility that subsets of units are unaffected by certain variables. It may be present in applications as diverse as health treatments, consumer choice-making, macroeconomics, and operations research. Our method additionally allows for other forms of cross-sectional heterogeneity. We consider a two-group approach for the model's unit-specific parameters: each unit-specific parameter is either equal to zero (heterogeneous variable selection) or comes from a Dirichlet process (DP) mixture of multivariate normals (other cross-sectional heterogeneity). We develop our approach for general nonlinear panel data models, encompassing multinomial logit and probit models, poisson and negative binomial count models, exponential models, among many others. For inference, we develop an efficient Bayesian MCMC sampler. In a Monte Carlo study, we find that our approach is able to capture heterogeneous variable selection whereas a "standard'' DP mixture is not. In an empirical application, we find that accounting for heterogeneous variable selection and non-normality of the continuous heterogeneity leads to an improved in-sample and out-of-sample performance and interesting insights. These findings illustrate the usefulness of our approach.

Keywords: Individualized Variable Selection, Dirichlet Process, Stochastic Search, Heterogeneity, Attribute Non-Attendance, Feature Selection, Bayesian

JEL Classification: C23, C11

Suggested Citation

Castelein, Anoek and Fok, Dennis and Fok, Dennis and Paap, Richard, Heterogeneous Variable Selection in Nonlinear Panel Data Models: A Semi-Parametric Bayesian Approach (September 22, 2020). Tinbergen Institute Discussion Paper 2020-061/III, Available at SSRN: https://ssrn.com/abstract=3697480 or http://dx.doi.org/10.2139/ssrn.3697480

Anoek Castelein (Contact Author)

Erasmus University Rotterdam (EUR) ( email )

Burgemeester Oudlaan 50
3000 DR Rotterdam, Zuid-Holland 3062PA
Netherlands

Dennis Fok

Econometric Institute - Erasmus University Rotterdam ( email )

P.O. Box 1738
3000 DR Rotterdam
Netherlands

Erasmus Research Institute of Management (ERIM) ( email )

P.O. Box 1738
3000 DR Rotterdam
Netherlands
+31 10 408 1333 (Phone)
+31 10 408 9162 (Fax)

Tinbergen Institute Rotterdam ( email )

P.O. Box 1738
3000 DR Rotterdam
Netherlands

Richard Paap

Erasmus University Rotterdam (EUR) - Department of Econometrics ( email )

P.O. Box 1738
3000 DR Rotterdam
Netherlands

Tinbergen Institute ( email )

P.O. Box 1738
3000 DR Rotterdam
Netherlands

Erasmus Research Institute of Management (ERIM) ( email )

P.O. Box 1738
3000 DR Rotterdam
Netherlands

Do you have negative results from your research you’d like to share?

Paper statistics

Downloads
66
Abstract Views
452
Rank
617,725
PlumX Metrics