Identification of Moonlighting Proteins in Genomes Using Text Mining Techniques

Abstract

Moonlighting proteins is an emerging concept for considering protein functions, which indicate proteins with two or more independent and distinct functions. An increasing number of moonlighting proteins have been reported in the past years; however, a systematic study of the topic has been hindered because the secondary functions of proteins are usually found serendipitously by experiments. Toward systematic identification and study of moonlighting proteins, computational methods for identifying moonlighting proteins from several different information sources, database entries, literature, and large‐scale omics data have been developed. In this study, an overview for finding moonlighting proteins is discussed. Then, the literature‐mining method, DextMP, is applied to find new moonlighting proteins in three genomes, Arabidopsis thaliana, Caenorhabditis elegans, and Drosophila melanogaster. Potential moonlighting proteins identified by DextMP are further examined by a two‐step manual literature checking procedure, which finally yielded 13 new moonlighting proteins. Identified moonlighting proteins are categorized into two classes based on the clarity of the distinctness of two functions of the proteins. A few cases of the identified moonlighting proteins are described in detail. Further direction for improving the DextMP algorithm is also discussed.

Document Details

Document Type
Pub Defense Publication
Publication Date
Oct 10, 2018
Source ID
10.1002/pmic.201800083

Entities

People

  • Aashish Jain
  • Daisuke Kihara
  • Hareesh Gali

Organizations

  • Intelligence Advanced Research Projects Activity
  • National Institute of General Medical Sciences
  • National Institutes of Health
  • National Science Foundation
  • Purdue University
  • University of Cincinnati

Tags

Fields of Study

  • Biology

Readers

  • Molecular Genetics
  • Occupational Health and Safety.
  • Theoretical Analysis.