Large-Scale Linear Optimization through Machine Learning: From Theory to Practical System Design and Implementation

Abstract

The linear programming (LP) is one of the most popular necessary optimization tool used for data analytics as well as in various scientific fields. However, the current state-of-art algorithms suffer from scalability issues when processing Big Data. For example, the commercial optimization software IBM CPLEX cannot handle an LP with more than hundreds of thousands variables or constraints. Existing algorithms are fundamentally hard to scale because they are inevitably too complex to parallelize. To address the issue, we study the possibility of using the Belief Propagation (BP) algorithm as an LP solver. BP has shown remarkable performances on various machine learning tasks and it naturally lends itself to fast parallel implementations. Despite this, very little work has been done in this area. In particular, while it is generally believed that BP implicitly solves an optimization problem, it is not well understood under what conditions the solution to a BP converges to that of a corresponding LP formulation. Our efforts consist of two main parts. First, we perform a theoretic study and establish the conditions in which BP can solve LP [1,2]. Although there has been several works studying the relation between BP and LP for certain instances, our work provides a generic condition unifying all prior works for generic LP. Second, utilizing our theoretical results, we develop a practical BP-based parallel algorithms for solving generic LPs, and it shows 71x speed up while sacrificing only 0.1 accuracy compared to the state-of-art exact algorithm. As a result of the study, the PIs have published two conference papers and two follow-up journal papers are under submission. We refer the readers to our published work for details.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Aug 10, 2016
Accession Number
AD1018304

Entities

People

  • Jinwoo Shin

Organizations

  • KAIST

Tags

Communities of Interest

  • Autonomy
  • C4I
  • Energy and Power Technologies
  • Human Systems

DTIC Thesaurus Topics

  • Air Force Research Laboratories
  • Algorithms
  • Artificial Intelligence
  • Computations
  • Computer Programming
  • Computer Science
  • Data Analysis
  • Electrical Engineering
  • Information Theory
  • Linear Programming
  • Machine Learning
  • Operations Research
  • Optimization
  • Probability
  • Probability Distributions
  • Random Variables
  • Simplex Method

Fields of Study

  • Computer science

Readers

  • Distributed Systems and Data Platform Development
  • Marine Ecological Systems Migration
  • Operations Research

Technology Areas

  • AI & ML
  • AI & ML - Machine Learning Algorithms