Building Multiclass Classifiers for Remote Homology Detection and Fold Recognition

Abstract

Protein remote homology prediction and fold recognition are central problems in computational biology. Supervised learning algorithms based on support vector machines are currently one of the most effective methods for solving these problem. These methods are primarily used to solve binary classification problems and they have not been extensively used to solve the more general multiclass remote homology prediction and fold recognition problems. We developed a number of methods for building SVM based multiclass classification schemes in the context of SCOP protein classification. These methods includes schemes that directly build an SVM-based multiclass model, schemes that employ a second level learning approach to combine the predictions generated by a set of binary SVM-based classifiers, and schemes that build and combine binary classifiers for various levels of the SCOP hierarchy beyond those defining the target classes. Results: We performed a comprehensive study analyzing the different approaches using four different datasets. Our results show that most of the proposed multiclass SVM-based classification approaches are quite effective in solving the remote homology prediction and fold recognition problems and that the schemes that use predictions from binary models constructed for ancestral categories within the SCOP hierarchy tend to qualitatively improve the prediction results.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Apr 05, 2006
Accession Number
ADA446086

Entities

People

  • George Karypis
  • Huzefa Rangwala

Organizations

  • University of Minnesota

Tags

Communities of Interest

  • Air Platforms
  • Autonomy

DTIC Thesaurus Topics

  • Algorithms
  • Applied Computer Science
  • Artificial Intelligence
  • Classification
  • Computer Science
  • Computers
  • Data Science
  • Detection
  • Distance Learning
  • Errors
  • Hierarchies
  • Information Science
  • Kernel Functions
  • Machine Learning
  • Recognition
  • Supervised Machine Learning
  • Test Sets

Fields of Study

  • Computer science

Readers

  • Molecular Genetics
  • Neural Network Machine Learning.

Technology Areas

  • AI & ML
  • AI & ML - Neural Networks