Cheating with (Recursive) Models
41 Pages Posted: 15 Nov 2019 Last revised: 2 Dec 2019
Date Written: November 2019
Abstract
To what extent can misspecified models generate false estimated correlations? We focus on models that take the form of a recursive system of linear regression equations. Each equation is fitted to minimize the sum of squared errors against an arbitrarily large sample. We characterize the maximal pairwise correlation that this procedure can predict given a generic objective covariance matrix, subject to the constraint that the estimated model does not distort the mean and variance of individual variables. We show that as the number of variables in the model grows, the false pairwise correlation can become arbitrarily close to one, regardless of the true correlation.
Suggested Citation: Suggested Citation