Group and Topic Discovery from Relations and Their Attributes

Abstract

The authors present a probabilistic generative model of entity relationships and their attributes that simultaneously discovers groups among the entities and topics among the corresponding textual attributes. Block-models of relationship data have been studied in social network analysis for some time. Here, the authors simultaneously cluster in several modalities at once, incorporating the attributes (here, words) associated with certain relationships. Significantly, joint inference allows the discovery of topics to be guided by the emerging groups, and vice-versa. They present experimental results on two large data sets: 16 years of bills put before the U.S. Senate, including their corresponding text and voting records, and 13 years of similar data from the United Nations. The authors show that in comparison with traditional, separate, latent-variable models for words, or Block-structures for votes, the Group-Topic model's joint inference discovers more cohesive groups and improved topics.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2006
Accession Number
ADA449625

Entities

People

  • Andrew McCallum
  • Natasha Mohanty
  • Xuerui Wang

Organizations

  • University of Massachusetts Amherst

Tags

Communities of Interest

  • Biomedical
  • Energy and Power Technologies
  • Weapons Technologies

DTIC Thesaurus Topics

  • Artificial Intelligence
  • Cohesion
  • Commerce
  • Computer Science
  • Congress
  • Domestic
  • Education
  • Elections
  • Environmental Protection
  • Human Rights
  • Language
  • Law
  • Mathematical Models
  • Models
  • Nuclear Weapons
  • Political Science
  • Security

Fields of Study

  • Computer science

Readers

  • Neural Network Machine Learning.
  • Public Financial Management and Budgeting

Technology Areas

  • AI & ML
  • AI & ML - Information Retrieval