A Framework and Toolkit for the Construction of Multimodal Learning Interfaces

Abstract

This dissertation makes contributions in three main areas: (1) the theory of multimodal interaction, (2) a software architecture and reusable application framework, and (3) rapid application prototyping through domain-specific instantiation of a common underlying architecture. The foundation of the application framework and the rapid prototyping tools is a model of multimodal interpretation based on semantic integration of information streams. This model supports most conceivable human communication modalities across a broad class of applications, specifically those that support state manipulation via parameterized actions. The multimodal semantic model is also the basis for a flexible, domain-independent, incrementally trainable multimodal interpretation algorithm based on a connectionist network. The second major contribution is an application framework consisting of reusable components and a modular, distributed system architecture. Multimodal application developers can assemble the framework's components into a new application, accepting default options where appropriate and providing application-specific customizations where needed. The third major contribution is a design process, backed by a workbench of tools, that permits rapid prototyping of a multimodal application. This design process systematically constructs the customizations needed to interpret multimodal inputs in a given domain, allowing an application structure created in the proposed framework to be instantiated for that domain. The application framework and design process have been successfully applied to the construction of three multimodal systems in three different domains.
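
The idea of integrating information streams into a parameterized action can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the `Fragment` class, the `integrate` function, the slot names, and the first-value-wins merge rule are hypothetical and are not taken from the dissertation, whose interpretation algorithm is a trainable connectionist network rather than the fixed rule shown here.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List, Optional, Tuple


@dataclass
class Fragment:
    """Partial interpretation from one input stream (hypothetical structure)."""
    modality: str
    action: Optional[str] = None
    params: Dict[str, Any] = field(default_factory=dict)


def integrate(fragments: List[Fragment]) -> Tuple[Optional[str], Dict[str, Any]]:
    """Merge fragments into one parameterized action.

    Assumed rule: the first fragment that names an action decides it, and the
    first value seen for each parameter slot is kept.
    """
    action: Optional[str] = None
    params: Dict[str, Any] = {}
    for frag in fragments:
        if action is None and frag.action is not None:
            action = frag.action
        for name, value in frag.params.items():
            params.setdefault(name, value)
    return action, params


# Speech supplies the action and its object; a pen gesture supplies the location.
speech = Fragment("speech", action="move", params={"object": "unit-3"})
gesture = Fragment("pen", params={"destination": (120, 45)})
result = integrate([speech, gesture])
```

Here neither stream alone specifies a complete action; only the merged result carries the action name together with both of its parameters.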

Document Details

Document Type: Technical Report
Publication Date: Apr 29, 1998
Accession Number: ADA352310

Entities

People

  • Minh Tue Vo

Organizations

  • Carnegie Mellon University

Tags

Communities of Interest

  • C4I
  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Artificial Intelligence
  • Automated Speech Recognition
  • Cognitive Science
  • Cognitive Systems Engineering
  • Computational Science
  • Computer Languages
  • Computer Programming
  • Computer Programs
  • Computer Science
  • Computers
  • Grammars
  • Human Systems Integration
  • Human-Machine Interaction
  • Information Processing
  • Information Systems
  • Linguistics
  • Natural Language Processing

Fields of Study

  • Computer science
  • Engineering

Readers

  • Artificial Intelligence
  • Distributed Systems and Data Platform Development
  • Software Engineering