Bayesian variable selection for understanding mixtures in environmental exposures

Abstract

Social and environmental stressors are crucial factors in child development. However, there exists a multitude of measurable social and environmental factors—the effects of which may be cumulative, interactive, or null. Using a comprehensive cohort of children in North Carolina, we study the impact of social and environmental variables on 4th end‐of‐grade exam scores in reading and mathematics. To identify the essential factors that predict these educational outcomes, we design new tools for Bayesian linear variable selection using decision analysis. We extract a predictive optimal subset of explanatory variables by coupling a loss function with a novel model‐based penalization scheme, which leads to coherent Bayesian decision analysis and empirically improves variable selection, estimation, and prediction on simulated data. The Bayesian linear model propagates uncertainty quantification to all predictive evaluations, which is important for interpretable and robust model comparisons. These predictive comparisons are conducted out‐of‐sample with a customized approximation algorithm that avoids computationally intensive model refitting. We apply our variable selection techniques to identify the joint collection of social and environmental stressors—and their interactions—that offer clear and quantifiable improvements in prediction of reading and mathematics exam scores.

Document Details

Document Type
Pub Defense Publication
Publication Date
Jun 15, 2021
Source ID
10.1002/sim.9099

Entities

People

  • Alexander A. T. Bui
  • Daniel R Kowal
  • Henry Leong
  • Kathy Ensor
  • Marie Lynn Miranda
  • Mercedes Bravo
  • Robert. J. Griffin

Organizations

  • Army Research Office
  • National Institute of Environmental Health Sciences
  • National Institute on Minority Health and Health Disparities
  • RTI International
  • Rice University
  • University of Notre Dame

Tags

Readers

  • Computational Modeling and Simulation
  • Distributed Systems and Data Platform Development
  • Statistical inference.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference