The Metadata Coverage Index (MCI): A Standardized Metric for Quantifying Database Annotation Richness

Abstract

Variability in how thoroughly data held in public repositories are described (their metadata) forces users to assess the quality of records individually, which rapidly becomes impractical. Automatically scoring records on the richness of their description enables sorting by quality. Here, we introduce an objective measure for metadata, the Metadata Coverage Index (MCI): the percentage of available fields actually filled in a record or description. MCI scores can be calculated for a whole database, for individual records, or for their component parts (variables or subsets of the data).
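The definition in the abstract (filled fields as a percentage of available fields) can be illustrated with a minimal sketch. It is not the report's implementation. The field names, the sample records, and the rule for what counts as "filled" (not None, not blank, not an empty container) are assumptions made for illustration, and the pooled database score is only one plausible reading of "a whole database".

```python
from typing import Any, Mapping, Sequence


def is_filled(value: Any) -> bool:
    """Assumed rule: None, blank strings, and empty containers are unfilled."""
    if value is None:
        return False
    if isinstance(value, str):
        return value.strip() != ""
    if isinstance(value, (list, tuple, set, dict)):
        return len(value) > 0
    return True


def record_mci(record: Mapping[str, Any], fields: Sequence[str]) -> float:
    """Percentage of the available fields that are filled in one record."""
    if not fields:
        raise ValueError("at least one available field is required")
    filled = sum(1 for name in fields if is_filled(record.get(name)))
    return 100.0 * filled / len(fields)


def database_mci(records: Sequence[Mapping[str, Any]], fields: Sequence[str]) -> float:
    """Pooled score: filled cells over all record-by-field cells (an assumption)."""
    if not records or not fields:
        raise ValueError("records and fields must both be non-empty")
    filled = sum(
        1 for rec in records for name in fields if is_filled(rec.get(name))
    )
    return 100.0 * filled / (len(records) * len(fields))


def field_mci(records: Sequence[Mapping[str, Any]], field: str) -> float:
    """Percentage of records in which a single field (variable) is filled."""
    if not records:
        raise ValueError("at least one record is required")
    filled = sum(1 for rec in records if is_filled(rec.get(field)))
    return 100.0 * filled / len(records)


# Hypothetical schema and records, for illustration only.
fields = ["organism", "habitat", "isolation_site", "sequencing_method"]
records = [
    {"organism": "E. coli", "habitat": "gut",
     "isolation_site": "", "sequencing_method": "Illumina"},
    {"organism": "B. subtilis", "habitat": None},
]

scores = [record_mci(rec, fields) for rec in records]   # [75.0, 25.0]
overall = database_mci(records, fields)                 # 50.0
habitat_score = field_mci(records, "habitat")           # 50.0

# Sorting by quality: richest descriptions first.
ranked = sorted(records, key=lambda rec: record_mci(rec, fields), reverse=True)
```

Scoring against a fixed list of available fields, rather than against the keys each record happens to contain, keeps scores comparable across records, which is what makes sorting by MCI meaningful.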

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2013
Accession Number
AD1108545

Entities

People

  • Bahador Nosrat
  • Chris Taylor
  • Dawn Field
  • Ioanna Pagani
  • Konstantinos Liolios
  • Lynette Hirschman
  • Lynn Schriml
  • Nikos Kyrpides
  • Philippe Rocca-Serra
  • Susanna-Assunta Sansone

Organizations

  • United States Department of Energy

Tags

Communities of Interest

  • Biomedical

DTIC Thesaurus Topics

  • Bacteria
  • Calculators
  • Cell Shape
  • Chemical Compounds
  • Data Analysis
  • Data Sets
  • Databases
  • Eukaryotes
  • Genome
  • Genomics
  • Metadata
  • Microbial Genome
  • Molecular Biology
  • Nucleic Acids
  • Public Health
  • Standards
  • Validation

Fields of Study

  • Computer science

Readers

  • Computational Modeling and Simulation
  • Database Systems and Applications
  • Regression Analysis