Signal and Noise in Gene Length Distributions of Decoded Genomes

Abstract

Development of objective measures for comparing genome data has become an important goal in computational genomics. Gene length distribution histograms derived from the predicted protein-coding gene populations of decoded genomes were considered as tools for genomic comparison. The first and second moments of the histograms map the decoded genomes as isolated points onto an Euclidean plane. The clustering of the genomic points on the plane preserves the separation of the three domains of life. The mapping of genomes onto a plane allows for the use of vector analysis in the study of genomic evolution.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Oct 25, 2001
Accession Number
ADA411438

Entities

People

  • Aydin Toezeren
  • Neil D. Weston
  • Zihang Ou

Organizations

  • The Catholic University of America

Tags

Communities of Interest

  • Biomedical

DTIC Thesaurus Topics

  • Amino Acids
  • Aromatic Amino Acids
  • Bacteria
  • Biology
  • Biomedical Engineering
  • Cells
  • Computer Programming
  • Decomposition
  • Engineering
  • Escherichia Coli
  • Eukaryotes
  • Histograms
  • Probability Distributions
  • Prokaryotes
  • Proteins
  • Random Variables
  • Sequence Analysis

Fields of Study

  • Biology

Readers

  • Computer Vision.
  • Molecular Genetics
  • Radio communications and signal processing.