If you need an accessible version of this item please contact JSTOR User Support

Diagnostic Plots for Missing Data in Least Squares Regression

Gary A. Simon and Jeffrey S. Simonoff
Journal of the American Statistical Association
Vol. 81, No. 394 (Jun., 1986), pp. 501-509
DOI: 10.2307/2289241
Stable URL: http://www.jstor.org/stable/2289241
Page Count: 9
  • Download PDF
  • Cite this Item

You are not currently logged in.

Access your personal account or get JSTOR access through your library or other institution:

login

Log in to your personal account or through your institution.

If you need an accessible version of this item please contact JSTOR User Support
Diagnostic Plots for Missing Data in Least Squares Regression
Preview not available

Abstract

The usual approach to handling missing data in a regression is to assume that the points are missing at random (MAR) and use either a fill-in method to replace the missing points or a method using maximally available pairs in the sample covariance matrix. We derive limits for the values of the least squares estimates of the coefficients (and their associated t statistics) when there are missing observations in one carrier. These limits are derived subject to a constraint on the relationship of the missing data to the present data. Calculating these limits while varying this constrained value results in a series of diagnostic plots that can be used to study the potential effect of the missing points on the regression (without assuming that the points are MAR). Simulations are performed to illustrate the use of the plots, and two real data sets are analyzed. The more general case of missing data in more than one carrier is also discussed.

Page Thumbnails

  • Thumbnail: Page 
501
    501
  • Thumbnail: Page 
502
    502
  • Thumbnail: Page 
503
    503
  • Thumbnail: Page 
504
    504
  • Thumbnail: Page 
505
    505
  • Thumbnail: Page 
506
    506
  • Thumbnail: Page 
507
    507
  • Thumbnail: Page 
508
    508
  • Thumbnail: Page 
509
    509