Multimodal Meeting Capture and Understanding with the CALO Meeting Assistant

Abstract

The CALO Meeting Assistant is a multimodal meeting assistant technology that integrates speech, gestures, and multimodal data collected from multiparty interactions during meetings. Using machine learning and robust discourse processing, it provides a rich, browsable record of a meeting. For nearly as long as people have been holding meetings, they have been thinking up ways to make them more bearable. Recent advances include technologies that integrate the processing of speech, natural language understanding, vision, and multimodal interaction to produce tools that perceive what happens at a meeting, extract salient events, and produce a reliable record. The ICSI Meeting Project sought to produce automated and segmented transcripts from natural, multi-party speech during meetings, while the ISL Smart Meeting Room Task and the M4 and AMI projects instrumented meeting rooms to collect data on behaviors so the interactions of meeting participants could be analyzed to produce flexible records of their activities, while providing a supportive environment for collaboration. Akin to the latter, the CALO Meeting Assistant collects data about the behaviors of people in meetings, assimilating speech, movement, and note-taking data to create a rich representation of a meeting that can be analyzed and reviewed at many levels. In addition, the CALO Meeting Assistant integrates its observations with those of a larger system of agents, which assesses the meeting data in the context of the ongoing projects and workflow for each of the meeting participants. Thus, our meeting assistant aims to reach beyond an intelligent room that understands only the activities of people in meetings, and attempts to understand their overarching concerns and interpret their behaviors from the perspective of what their meetings mean to them.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2007
Accession Number
ADA506397

Entities

People

  • Patrick Ehlen

Organizations

  • Stanford University

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Computer Languages
  • Dimensionality Reduction
  • Environment
  • Information Operations
  • Language
  • Learning
  • Machine Learning
  • Natural Language Understanding
  • Natural Languages
  • Semi-Supervised Learning
  • Standards
  • Supervised Machine Learning
  • Supervision
  • Universities
  • Unsupervised Machine Learning

Readers

  • Agent-Based Social Robotics and Mobile-Assisted Learning in Virtual Environments.
  • Defense Technology Research and Development.

Technology Areas

  • AI & ML