Conversation Thread Extraction and Topic Detection in Text-Based Chat

Abstract

Text-based chat systems are widely used within the Department of Defense, but the standard systems available do not provide robust capabilities for search, information retrieval, or information assurance. The objective of this research is to explore methods for the extraction of conversation threads from text-based chat systems in order to enable such tasks. As part of the research, we manually annotated over 20,000 Internet Relay Chat posts with conversation thread information and constructed a probabilistic model for automatically classifying posts according to conversation thread. We also provide an algorithm for extracting these conversation threads from the chat session in order to form discrete documents that may be used in a vector space model information retrieval system. We elaborate how this technique can be used to support search and data mining systems, as well as auditing tasks and guard functions in a security system. Using the developed probabilistic models, we have achieved classification results on par with those of human annotators.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Sep 01, 2008
Accession Number
ADA490001

Entities

People

  • Paige H. Adams

Organizations

  • Naval Postgraduate School

Tags

Communities of Interest

  • Autonomy
  • C4I
  • Cyber

DTIC Thesaurus Topics

  • Bayesian Networks
  • Cognitive Science
  • Computational Science
  • Computer Languages
  • Computer Programming
  • Computers
  • Data Mining
  • Information Science
  • Machine Learning
  • Natural Language Processing
  • Network Science
  • Neural Networks
  • Ontologies
  • Operating Systems
  • Probabilistic Models
  • Probability
  • Supervised Machine Learning

Fields of Study

  • Computer science

Readers

  • Artificial Intelligence
  • Enterprise Information Systems Architecture and Joint Command Capability Interoperability Support.
  • Neural Network Machine Learning.

Technology Areas

  • AI & ML
  • AI & ML - Information Retrieval
  • AI & ML - Machine Translation
  • Space