Designing Feature and Data Parallel Stochastic Coordinate Descent Method for Matrix and Tensor Factorization

Abstract

Given a high-order large-scale tensor, how can we decompose it into latent factors? Can we process it on commodity computers with limited memory? These questions are closely related to recommender systems, which have modeled rating data not as a matrix but as a tensor to utilize contextual information such as time and location. This increase in the order requires tensor factorization methods scalable with both the order and size of a tensor. We proposed two distributed tensor factorization methods, CDTF and SALS. Both methods are scalable with all aspects of data and show a trade-off between convergence speed and memory requirements. CDTF, based on coordinate descent, updates one parameter at a time, while SALS generalizes on the number of parameters updated at a time. In our experiments, only our methods factorized a 5-order tensor with 1 billion observable elements,10M mode length, and 1K rank, while all other state-of-the-art methods failed. Moreover, our methods required several orders of magnitude less memory than our competitors. We implemented our methods on MAPREDUCE with two widely-applicable optimization techniques: local disk caching and greedy row assignment. They speeded up our methods up to 98.2x and also the competitors up to 5.9x.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
May 11, 2016
Accession Number
AD1033036

Entities

People

  • U. Kang

Tags

DTIC Thesaurus Topics

  • Accuracy
  • Air Force
  • Air Force Research Laboratories
  • Algorithms
  • Classification
  • Computer Science
  • Computers
  • Convergence
  • Data Mining
  • Department Of Defense
  • Economic Forecasting
  • Education
  • Electronic Mail
  • Engineering
  • Errors
  • Military Research
  • Ratings
  • Scalability
  • Scientific Research
  • Standards
  • Test Sets
  • Training

Fields of Study

  • Computer science

Readers

  • Distributed Systems and Data Platform Development
  • Linear Algebra