An Automatic Method of Finding Topic Boundaries

Abstract

This article outlines a new method of locating discourse boundaries based on lexical cohesion and a graphical technique called dotplotting. The application of dotplotting to discourse segmentation can be performed either manually, by examining a graph, or automatically using an optimization algorithm. The results of two experiments involving automatically locating boundaries between a series of concatenated documents are presented. Areas of application and future directions for this work are also outlined.
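
As a rough illustration of the idea, the Python sketch below builds a dotplot for a token sequence (a dot wherever the same word occurs at two positions) and picks the single split point with the fewest dots linking the two sides. It is a minimal sketch under stated assumptions, not the report's algorithm: the function names, the whitespace tokenization, and the cross-region density criterion are all illustrative choices that do not come from the record.

```python
from collections import Counter


def dotplot(tokens):
    """Return the set of (i, j) pairs, i < j, where tokens[i] == tokens[j]."""
    positions = {}
    for i, tok in enumerate(tokens):
        positions.setdefault(tok, []).append(i)
    dots = set()
    for occ in positions.values():
        for a in range(len(occ)):
            for b in range(a + 1, len(occ)):
                dots.add((occ[a], occ[b]))
    return dots


def cross_density(tokens, b):
    """Fraction of (left, right) position pairs that share a word,
    where left = tokens[:b] and right = tokens[b:].
    Assumed criterion, chosen for illustration only."""
    left = Counter(tokens[:b])
    right = Counter(tokens[b:])
    shared = sum(left[w] * right[w] for w in left)
    return shared / (b * (len(tokens) - b))


def best_boundary(tokens):
    """Return the split index with the lowest cross-region dot density."""
    return min(range(1, len(tokens)), key=lambda b: cross_density(tokens, b))


if __name__ == "__main__":
    # Two toy "documents" concatenated; they share no vocabulary.
    text = "cat dog cat dog sea boat sea boat"
    tokens = text.split()
    print(sorted(dotplot(tokens)))
    print(best_boundary(tokens))
```

A low density of dots between the two sides means the sides share little vocabulary, which is the lexical-cohesion signal a segmenter looks for. Finding several boundaries would require repeating or generalizing the search, which this sketch does not attempt.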

Document Details

Document Type
Technical Report
Publication Date
Jun 01, 1994
Accession Number
ADA579870

Entities

People

  • Jeffrey C. Reynar

Organizations

  • University of Pennsylvania

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Algorithms
  • Automatic
  • Boundaries
  • Cohesion
  • Computational Linguistics
  • Computer Vision
  • Data Sets
  • Information Operations
  • Information Processing
  • Information Retrieval
  • Language
  • Linguistics
  • New York
  • Precision
  • Recognition
  • Technical Writing

Readers

  • Computational Linguistics
  • Computer Vision
  • Systems Analysis and Design