The E-MS Algorithm: Model Selection With Incomplete Data

Jiming Jiang, Thuan Nguyen, J. Sunil Rao

Research output: Contribution to journalArticlepeer-review

17 Scopus citations


We propose a procedure associated with the idea of the E-M algorithm for model selection in the presence of missing data. The idea extends the concept of parameters to include both the model and the parameters under the model, and thus allows the model to be part of the E-M iterations. We develop the procedure, known as the E-MS algorithm, under the assumption that the class of candidate models is finite. Some special cases of the procedure are considered, including E-MS with the generalized information criteria (GIC), and E-MS with the adaptive fence (AF; Jiang et al.). We prove numerical convergence of the E-MS algorithm as well as consistency in model selection of the limiting model of the E-MS convergence, for E-MS with GIC and E-MS with AF. We study the impact on model selection of different missing data mechanisms. Furthermore, we carry out extensive simulation studies on the finite-sample performance of the E-MS with comparisons to other procedures. The methodology is also illustrated on a real data analysis involving QTL mapping for an agricultural study on barley grains. Supplementary materials for this article are available online.

Original languageEnglish (US)
Pages (from-to)1136-1147
Number of pages12
JournalJournal of the American Statistical Association
Issue number511
StatePublished - Jul 3 2015


  • Backcross experiments
  • Conditional sampling
  • Consistency
  • Convergence
  • Missing data mechanism
  • Model selection
  • Regression

ASJC Scopus subject areas

  • Statistics and Probability
  • Statistics, Probability and Uncertainty


Dive into the research topics of 'The E-MS Algorithm: Model Selection With Incomplete Data'. Together they form a unique fingerprint.

Cite this