Data Dispersion: Now You See it... Now You Don't
14 Pages Posted: 21 May 2010 Last revised: 25 Jul 2011
Date Written: May 21, 2010
Abstract
The most popular method for modeling count data is Poisson regression. When data display over-dispersion, thereby deeming Poisson regression inadequate, typically negative-binomial regression is instead used. We show that count data that appear to be equi-dispersed or over-dispersed may actually stem from a mixture of populations with different dispersion levels. To detect and model such a mixture, we introduce a generalization of the Conway-Maxwell-Poisson (COM-Poisson) regression that allows for group-level dispersion. We illustrate mixed dispersion effects and the proposed methodology via semi-authentic data.
Keywords: Conway-Maxwell-Poisson (COM-Poisson) regression, mixture model, negative binomial regression, over dispersion, under-dispersion
Suggested Citation: Suggested Citation