Estimating an SKU-Level Brand Choice Model Combining Household Panel Data and Store Data

University of Chicago Graduate School of Business Working Paper

28 Pages. Posted: 8 Sep 2003

Pradeep K. Chintagunta

University of Chicago

Jean-Pierre Dubé

University of Chicago - Booth School of Business; National Bureau of Economic Research (NBER); Marketing Science Institute (MSI)

Date Written: July 18, 2003

Abstract

The extant literature using household scanner data to estimate consumer choice models has identified two key sources of bias in estimated mean responses to marketing variables. First, omitted heterogeneity may bias mean responses towards zero. Second, omitted time-varying characteristics of alternatives that influence consumer choices may also bias mean responses towards zero if these characteristics are correlated with observed factors such as price; this is the endogeneity bias. Both issues are well recognized, and methods have been proposed to address them using household scanner panel data.

However, when estimating a choice model with these data at the SKU or UPC level, one may not observe choices for each item in each of the time periods under consideration. Without such information, one cannot control for item- and time-period-specific unmeasured characteristics, because the data carry no information on alternatives during periods in which no panelist purchases them. In general, when a product category has many alternatives, each with a fairly small share, the household sample may not contain sufficient choices for each alternative, which limits the ability to control for endogeneity with household data. In contrast, because aggregate store-level data are the true aggregation of purchases by all households visiting the store, they contain the time-period-specific, item-level information required to account for endogeneity, as long as each item has some sales in each time period. Given the relative merits of household data for estimating the distribution of heterogeneity and of store-level data for addressing the endogeneity problem, we propose an integrated estimation procedure that uses the information in both sources. Our approach provides consistent estimates of the mean responses to marketing variables and of the heterogeneity distribution, and it also controls for potential endogeneity due to correlation between unmeasured item-level characteristics and prices.
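
The data-coverage argument in the abstract can be illustrated with a small simulation. The sketch below is not the authors' estimation procedure. It only counts how many item-week cells have no observed purchase in the full store data versus in a household panel that is a subset of the store's shoppers. The function name `simulate_coverage`, the item count, the panel size, and the choice weights are all hypothetical values chosen for illustration.

```python
import random


def simulate_coverage(n_items=50, n_weeks=20, store_trips=2000,
                      panel_size=30, seed=0):
    """Count item-week cells with zero purchases in store data vs. panel data.

    All parameter values are illustrative assumptions, not figures from the paper.
    """
    rng = random.Random(seed)
    # Hypothetical choice weights: many items, each with a fairly small share.
    weights = [rng.uniform(0.5, 1.5) for _ in range(n_items)]

    empty_store = 0
    empty_panel = 0
    for _ in range(n_weeks):
        # Every shopper visiting the store buys exactly one item this week.
        store_choices = rng.choices(range(n_items), weights=weights,
                                    k=store_trips)
        # The household panel is a subset of those shoppers.
        panel_choices = store_choices[:panel_size]

        # An item-week cell is "empty" if nobody in the sample bought the item.
        empty_store += n_items - len(set(store_choices))
        empty_panel += n_items - len(set(panel_choices))
    return empty_store, empty_panel


if __name__ == "__main__":
    empty_store, empty_panel = simulate_coverage()
    print("empty item-week cells in store data:", empty_store)
    print("empty item-week cells in panel data:", empty_panel)
```

Because the panel is drawn from the store's shoppers, it can never cover more item-week cells than the store data. With fewer panelists than items, some cells are necessarily empty in every week, which is the situation in which item- and time-period-specific controls cannot be recovered from household data alone.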

Keywords: Household scanner data, store-level scanner data, price endogeneity, heterogeneity

JEL Classification: M3, L0, C5

Suggested Citation

Chintagunta, Pradeep K. and Dubé, Jean-Pierre H., Estimating an SKU-Level Brand Choice Model Combining Household Panel Data and Store Data (July 18, 2003). University of Chicago Graduate School of Business Working Paper. Available at SSRN: https://ssrn.com/abstract=432661 or http://dx.doi.org/10.2139/ssrn.432661

Pradeep K. Chintagunta

University of Chicago

5807 S. Woodlawn Avenue
Chicago, IL 60637
United States
773-702-8015 (Phone)
773-702-0458 (Fax)

Jean-Pierre H. Dubé (Contact Author)

University of Chicago - Booth School of Business

5807 South Woodlawn Avenue
Chicago, IL 60637
United States

HOME PAGE: http://gsb.uchicago.edu/fac/jean-pierre.dube

National Bureau of Economic Research (NBER)

1050 Massachusetts Avenue
Cambridge, MA 02138
United States

Marketing Science Institute (MSI)

1000 Massachusetts Ave.
Cambridge, MA 02138-5396
United States