Correcting Evaluation Bias of Relational Classifiers with Network Cross Validation

Abstract

Recently, a number of modeling techniques have been developed for data mining and machine learning in relational and network domains where the instances are not independent and identically distributed (i.i.d.). These methods specifically exploit the statistical dependencies among instances to improve classification accuracy. However, there has been little focus on how these same dependencies affect our ability to draw accurate conclusions about the performance of the models. More specifically, the complex link structure and attribute dependencies in relational data violate the assumptions of many conventional statistical tests and make it difficult to use these tests to assess the models in an unbiased manner. In this work, we examine the task of within-network classification and the question of whether two algorithms will learn models that will result in significantly different levels of performance. We show that the commonly used form of evaluation (paired t-test on overlapping network samples) can result in an unacceptable level of Type I error. Furthermore, we show that Type I errors increase as (1) the correlation among instances increases, and (2) the size of the evaluation set increases (i.e., the proportion of labeled nodes in the network decreases). We propose a method for network cross-validation that combined with paired t-tests produces more acceptable levels of Type I error while still providing reasonable levels of statistical power (i.e., 1-Type II error).

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2010
Accession Number
ADA554438

Entities

People

  • Brian Gallagher
  • Jennifer Neville
  • Tao Wang
  • Tina Eliassi-rad

Organizations

  • Purdue University

Tags

DTIC Thesaurus Topics

  • Accuracy
  • Algorithms
  • Classification
  • Computational Science
  • Computer Science
  • Data Mining
  • Data Science
  • Data Sets
  • Databases
  • Electronic Mail
  • Information Science
  • Machine Learning
  • Network Science
  • Probability
  • Statistical Tests
  • Surveys
  • Test Sets

Fields of Study

  • Computer science

Readers

  • Distributed Systems and Data Platform Development
  • Regression Analysis.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Neural Networks