Natural Language Semantics using Probabilistic Logic

Abstract

With better natural language semantic representations, computers can do more applications more efficiently as a result of better understanding of natural text. However, no single semantic representation at this time fulfills all requirements needed for a satisfactory representation. Logic-based representations like first-order logic capture many of the linguistic phenomena using logical constructs, and they come with standardized inference mechanisms, but standard first-order logic fails to capture the" graded" aspect of meaning in languages. Distributional models use contextual similarity to predict the "graded" semantic similarity of words and phrases but they do not adequately capture logical structure. In addition there are a few recent attempts to combine both representations either on the logic side (still, not a graded representation), or in the distribution side (not full logic). We propose using probabilistic logic to represent natural language semantics combining the expressivity and the automated inference of logic, and the gradedness of distributional representations. We evaluate this semantic representation on two tasks, Recognizing Textual Entailment (RTE) and Semantic Textual Similarity (STS). Doing RTE and STS better is an indication of a better semantic understanding. Our system has three main components, 1. Parsing and Task Representation, 2. Knowledge Base Construction, and 3. Inference. The input natural sentences of the RTE/STS task are mapped to logical form using Boxer which is a rule based system built on top of a CCG parser, then they are used to formulate the RTE/STS problem in probabilistic logic. Then, a knowledge base is represented as weighted inference rules collected from different sources like WordNet and on-the-fly lexical rules from distributional semantics.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Oct 01, 2014
Accession Number
ADA618212

Entities

People

  • Islam Beltagy

Organizations

  • University of Texas at Austin

Tags

Communities of Interest

  • C4I

DTIC Thesaurus Topics

  • Artificial Intelligence
  • Artificial Intelligence Computing
  • Artificial Intelligence Software
  • Automata Theory
  • Cognitive Science
  • Computational Linguistics
  • Computational Science
  • Computer Languages
  • Computers
  • Language
  • Linear Programming
  • Linguistics
  • Machine Learning
  • Natural Language Processing
  • Natural Languages
  • Ontologies
  • Supervised Machine Learning

Fields of Study

  • Computer science

Readers

  • Artificial Intelligence
  • Computational Linguistics

Technology Areas

  • AI & ML
  • AI & ML - Information Retrieval
  • AI & ML - Machine Translation