Markov Chain Monte Carlo and Exact Conditional Tests with Three-Way Contingency Tables

Abstract

We propose an algorithm modifying a popular exact conditional test involving the goodness-of-fit of contingency tables. This study focuses on improving the efficiency of Markov chain Monte Carlo (MCMC) when sampling three-way contingency tables--defined as log-linear models with three discrete random categorical variables consisting of finite levels--under the no-three-way interaction model.Standard to MCMC, we approximate the null distribution by sampling tables from the conditional distribution. However, our proposal involves expanding the conditional state space to include tables with cell count values of -1. We apply the proposed methodology, described in full detail, to randomly generated sparse and non-sparse data sets. Our results show that traditional asymptotic methods on sparse contingency tables yield inaccurate results. We also prove mathematically that a Markov chain with our proposed method is connected (i.e., ergodic) on the conditional state space for 3x3xK, with K >= 3. The output from applying the proposed methodology provides conclusive evidence that the distribution of the test statistics for sparse data sets does not resemble the asymptotic distribution.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jun 01, 2018
Accession Number
AD1059978

Entities

People

  • Seungchan Lee

Organizations

  • Naval Postgraduate School

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Algorithms
  • Cell Count
  • Data Analysis
  • Data Mining
  • Data Science
  • Data Set
  • Databases
  • Digital Data
  • Distribution Theory
  • Goodness Of Fit Tests
  • Information Science
  • Machine Learning
  • Markov Chains
  • Monte Carlo Method
  • Probability
  • Random Variables
  • Sampling
  • Statistical Algorithms
  • Statistical Analysis
  • Statistical Sampling
  • Statistics
  • Stochastic Processes

Fields of Study

  • Mathematics

Readers

  • Regression Analysis.
  • Statistical inference.

Technology Areas

  • Space