Labels or Attributes? Rethinking the Neighbors for Collective Classification in Sparsely-Labeled Networks

Abstract

Many classification tasks involve linked nodes, such as people connected by friendship links. For such networks, accuracy might be increased by including, for each node, the (a) labels or (b) attributes of neighboring nodes as model features. Recent work has focused on option (a), because early work showed it was more accurate and because option (b) fit poorly with discriminative classifiers. We show, however, that when the network is sparsely labeled, \relational classification" based on neighbor attributes often has higher accuracy than\collective classification"based on neighbor labels. Moreover, we introduce an efficient method that enables discriminative classifiers to be used with neighbor attributes, yielding further accuracy gains. We show that these effects are consistent across a range of datasets, learning choices, and inference algorithms, and that using both neighbor attributes and labels often produces the best accuracy.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Nov 01, 2013
Accession Number
ADA602549

Entities

People

  • David W. Aha
  • Luke K. Mcdowell

Tags

Communities of Interest

  • Autonomy
  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Accuracy
  • Algorithms
  • Artificial Intelligence
  • Classification
  • Computer Science
  • Data Mining
  • Data Sets
  • Databases
  • Information Science
  • Knowledge Management
  • Learning
  • Machine Learning
  • Network Science
  • Semi-Supervised Learning
  • Social Networks
  • Supervised Machine Learning
  • Training

Fields of Study

  • Computer science

Readers

  • Computer Networking
  • Distributed Systems and Data Platform Development
  • Psychometric Testing or Psychological Assessment.

Technology Areas

  • AI & ML
  • AI & ML - Neural Networks