Percentile Analysis for Goodness-of-Fit Comparisons of Models to Data

Abstract

In cognitive modeling, it is routine to report a goodness-of-fit index (e.g., R2 or RMSE) between a putative model's predictions and an observed dataset. However, there exist no standard index values for what counts as "good" or "bad", and most indices do not take into account the number of data points in an observed dataset. These limitations impair the interpretability of goodness-of-fit indices. We propose a generalized methodology, percentile analysis, which contextualizes goodness-of-fit measures in terms of performance that can be achieved by chance alone. A series of Monte Carlo simulations showed that the indices of randomized models systematically decrease as the number of data points to be fit increases, and that the relationship is nonlinear. We discuss the results of the simulation and how computational cognitive modelers can use them to place commonly used fit indices in context.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jul 01, 2014
Accession Number
ADA619081

Entities

People

  • J. Gregory Trafton
  • Sangeet Khemlani

Organizations

  • United States Naval Research Laboratory

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Artificial Intelligence
  • Cognitive Science
  • Computational Modeling
  • Data Science
  • Data Sets
  • Errors
  • Information Operations
  • Information Science
  • Interdisciplinary Science
  • Mathematical Analysis
  • Military Research
  • Monte Carlo Method
  • Probability
  • Probability Distributions
  • Simulations
  • Standards
  • Test And Evaluation

Readers

  • Educational Psychology
  • Regression Analysis.