D-Cube: Deliteful Deepdive for Domain-Specific Indexing and Search for the Web

Abstract

This report describes the three main components of our team's work on the MEMEX program. The first component, Delite, focuses on creating higher-level domain-specific languages (DSLs) that automate many of the tedious portions of acquiring, extracting, and analyzing search data. The second component is the DeepDive Knowledge Base Construction Engine, which has enabled users to efficiently achieve enhanced domain-specific extraction performance. The third component, Snorkel, simplifies the DeepDive pipeline so that data analysts can focus on the task at hand. Delite, DeepDive, and Snorkel are widely deployed by both industry and academia to support critical applications and research.

Document Details

Document Type: Technical Report
Publication Date: Sep 01, 2018
Accession Number: AD1060871

Entities

People

  • Oyekunle Olukotun

Organizations

  • Stanford University

Tags

Communities of Interest

  • Autonomy
  • Biomedical
  • Human Systems

DTIC Thesaurus Topics

  • Air Force
  • Air Force Research Laboratories
  • Algorithms
  • Computational Science
  • Computer Programming
  • Computers
  • Construction
  • Data Analysis
  • Data Visualization
  • Deep Learning
  • Information Processing
  • Information Science
  • Information Systems
  • Language
  • Machine Learning
  • Natural Languages
  • Training

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Distributed Systems and Data Platform Development