Factorial Designs, Model Selection, and (Incorrect) Inference in Randomized Experiments

74 Pages Posted: 16 Dec 2019 Last revised: 29 Jan 2023

See all articles by Karthik Muralidharan

Karthik Muralidharan

University of California, San Diego (UCSD)

Mauricio Romero

ITAM, Centro de Investigación Económica

Kaspar Wuthrich

University of California, San Diego (UCSD)

Multiple version iconThere are 2 versions of this paper

Date Written: December 2019

Abstract

Factorial designs are widely used for studying multiple treatments in one experiment. While t-tests based on the “long” model (including main and interaction effects) provide valid inferences against “business-as-usual” counterfactuals, “short” model t-tests (that ignore interactions) yield higher power if the interactions are zero, but incorrect inferences otherwise. Out of 27 factorial experiments published in top-5 journals in 2007–2017, 19 use the short model. We reanalyze these experiments, and show that over half of their published results lose significance when interactions are included. We show that testing the interactions using the long model and presenting the short model if the interactions are not significantly different from zero leads to incorrect inference due to the implied data-dependent model selection. Based on recent econometric advances, we show that local power improvements over the long model are possible. However, if the main effects are of primary interest, leaving the interaction cells empty yields valid inferences and global power improvements. In addition, the sample size needed to detect interactions is substantially larger than that required to detect main effects, resulting in most experiments being under-powered to detect interactions. Thus, using factorial designs to explore whether interactions are meaningful can be problematic because interaction estimates are likely to considerably overestimate the magnitude of the true effect conditional on being significant.

Suggested Citation

Muralidharan, Karthik and Romero, Mauricio and Wuthrich, Kaspar, Factorial Designs, Model Selection, and (Incorrect) Inference in Randomized Experiments (December 2019). NBER Working Paper No. w26562, Available at SSRN: https://ssrn.com/abstract=3504447

Karthik Muralidharan (Contact Author)

University of California, San Diego (UCSD) ( email )

9500 Gilman Drive
Mail Code 0502
La Jolla, CA 92093-0112
United States

Mauricio Romero

ITAM, Centro de Investigación Económica ( email )

Camino a Santa Teresa No. 930
Col. Héroes de Padierna
Ciudad de México
Mexico

Kaspar Wuthrich

University of California, San Diego (UCSD) ( email )

9500 Gilman Drive
La Jolla, CA 92093
United States

Do you have negative results from your research you’d like to share?

Paper statistics

Downloads
7
Abstract Views
261
PlumX Metrics