KI-LEARN: Knowledge-Intensive Learning Methods for Knowledge-Rich/Data-Poor Domains

Abstract

Knowledge Representation and Reasoning (KRR) has developed a wide range of methods for representing knowledge and reasoning from it to produce expert-level performance. Despite these accomplishments, there is one major problem preventing the wide-spread application of KRR technology: the inability to support learning. This makes KRR systems brittle and difficult to maintain. On the other hand, Machine Learning (ML) has developed a wide range of methods for learning from examples. However, there are two major problems preventing the wide-spread application of machine learning technology: the need for large amounts of training data and the high cost of manually designing the hypothesis space of the learning system. Our goal in this research effort was to develop a new methodology, called KI-LEARN (Knowledge Intensive LEARNing), that combines domain knowledge and sparse training data to construct high-performance systems. This report provides an overview of the major results we obtained on specific tasks as outlined in our proposal. More specifically, to address issues in knowledge representation and efficient learning we designed a language called First-Order Conditional Influence (FOCI) Language for expressing attributes relevant to learning. Our language extends probabilistic relational models (PRMs) which are themselves probabilistic representations most similar to first-order representation languages employed in KRR systems. A distinct feature of our language is its support for explicit expression of qualitative constraints such as monotonicity, saturation, and synergies. More importantly, we have demonstrated via mathematical proofs and experimental results how these qualitative constraints can be used and exploited when learning with sparse training data. We specifically show how qualitative constraints can be incorporated into learning algorithms. In addition, this report describes the models we constructed for our testbed domains.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Aug 31, 2006
Accession Number
ADA454050

Entities

People

  • Alan Fern
  • Angelo Restificar
  • Bruce D'ambrosio
  • Eric Altendorf
  • Jianqiang Shen
  • Jon Herlocker
  • Prasad Tadepalli
  • Sriraam Natarajan
  • Thomas G. Dietterich
  • Xinlong Bao

Organizations

  • Oregon State University

Tags

Communities of Interest

  • Autonomy
  • C4I

DTIC Thesaurus Topics

  • Artificial Intelligence
  • Artificial Intelligence Software
  • Bayesian Networks
  • Computer Programming
  • Computers
  • Data Mining
  • Databases
  • Information Science
  • Machine Learning
  • Neural Networks
  • Operating Systems
  • Probabilistic Models
  • Probability
  • Probability Distributions
  • Random Variables
  • User Interface
  • Web Browsers

Fields of Study

  • Computer science

Readers

  • Artificial Intelligence
  • Distributed Systems and Data Platform Development

Technology Areas

  • AI & ML
  • AI & ML - Neural Networks
  • Space