High-Performance Analysis of Filtered Semantic Graphs

Abstract

High performance is a crucial consideration when executing a complex analytic query on a massive semantic graph. In a semantic graph, vertices and edges carry \attributes" of var- ious types. Analytic queries on semantic graphs typically depend on the values of these attributes; thus, the com- putation must either view the graph through a lter that passes only those individual vertices and edges of interest or else must rst materialize a subgraph or subgraphs con- sisting of only the vertices and edges of interest. The ltered approach is superior due to its generality, ease of use, and memory e ciency, but may carry a performance cost. In the Knowledge Discovery Toolbox (KDT), a Python library for parallel graph computations, the user writes l- ters in a high-level language, but those lters result in rel- atively low performance due to the bottleneck of having to call into the Python interpreter for each edge. In this work we use the Selective Embedded JIT Specialization (SEJITS) approach to automatically translate lters de ned by pro- grammers into a lower-level e ciency language, bypassing the upcall into Python. We evaluate our approach by com- paring it with the high-performance C++ /MPI Combinato- rial BLAS engine, and show that the productivity gained by using a high-level ltering language comes without sacri c- ing performance. We also present a new roo ine model for graph traversals, and show that our high-performance im- plementations do not signi cantly deviate from the roo ine.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
May 06, 2012
Accession Number
ADA561689

Entities

People

  • Adam Lugowski
  • Armando Fox
  • Aydin Buluc
  • John Gilbert
  • Leonid Oliker
  • Samuel C Williams
  • Shoaib A. Kamil

Organizations

  • University of California, Berkeley

Tags

DTIC Thesaurus Topics

  • Algorithms
  • Computations
  • Computer Programming
  • Computer Programs
  • Computer Science
  • Computers
  • Data Sets
  • Engineering
  • High Level Languages
  • High Performance Computing
  • Language
  • Parallel Computing
  • Parallel Processing
  • Semantic Models
  • Social Media
  • Sparse Matrix
  • Specialization

Fields of Study

  • Computer science

Readers

  • Distributed Systems and Data Platform Development
  • Graph Algorithms and Convex Optimization.
  • Snow Cover Descriptors for Reptiles and Their Illustrations.