Multivalent Documents: Anytime, Anywhere, Any Type, Every Way User-Improvable Digital Documents and Systems

Abstract

Digital documents are important. Whatever else computer workers do, they expend a considerable time working with digital documents, whether as e-mail, word processing files, presentation slides, web pages, discussion groups, or help systems, among many other ways. This dissertation shows how to improve the online manipulation capabilities of potentially all formats, media types, and genres of existing and future digital documents. The Multivalent Document Model extensively opens to enhancement all aspects of a digital document system. Document content is constructed from layers of often heterogeneous type, each with specialized purpose, all semantically aligned. All user-visible document functionality is constructed from stylized program components called behaviors. Document system operations, such as drawing a representation of the document on the screen and saving an edited version, derive from the fundamental operation found to some degree in every digital document system, newly codified as extensible programmatic protocols. This diverse open content, open functionality, and open operation are woven together by numerous mechanisms to produce a final composition that appears built from the ground up as a unified whole. A prototype of the Model, called the Multivalent Document System, has been realized in Java, deployed, and built upon by several third party developers. The System has been tested by and has significantly contributed to the development of three sample applications. The first application allows paper scanned into a computer as images to be manipulated as a live, semantic object, with text copy and paste, text search, and a "lens" operation that displays the corresponding ASCII translation of the region. In the second application, HTML, the lingua franca of the web and a very different document type than scanned page images, has been extended with new functionality including outline displays, a speed reading window, and tables sorted on demand.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Dec 01, 1998
Accession Number
ADA603917

Entities

People

  • Thomas A. Phelps

Organizations

  • University of California, Berkeley

Tags

Communities of Interest

  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Computer Program Documentation
  • Computer Program Reliability
  • Computer Programming
  • Computer Programs
  • Computer Science
  • Computers
  • Debugging
  • Electronic Mail
  • Grammars
  • Graphical User Interface
  • Information Systems
  • Linguistics
  • Lisp Programming Language
  • Operating Systems
  • Software Development
  • Web Browsers
  • Word Processors

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Computer Science/Computer Engineering/Data Science/Digital Signal Processing.
  • Software Engineering.