Improving Small File Performance in Object-Based Storage

Abstract

This paper proposes architectural refinements, server-driven metadata prefetching, and namespace flattening for improving the efficiency of small file workloads in object-based storage systems. Server-driven metadata prefetching consists of having the metadata server provide information and capabilities for multiple objects, rather than just one, in response to each lookup. Doing so allows clients to access the contents of many small files for each metadata server interaction, reducing access latency and metadata server load. Namespace flattening encodes the directory hierarchy into object IDs such that namespace locality translates to object ID similarity. Doing so exposes namespace relationships among objects (e.g., as hints to storage devices), improves locality in metadata indices, and enables use of ranges for exploiting them. Trace-driven simulations and experiments with a prototype implementation show significant performance benefits for small file workloads.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
May 01, 2006
Accession Number
ADA490224

Entities

People

  • Gregory R. Ganger
  • James Hendricks
  • Raja R. Sambasivan
  • Shafeeq Sinnamohideen

Organizations

  • Carnegie Mellon University

Tags

DTIC Thesaurus Topics

  • Algorithms
  • Bandwidth
  • Classification
  • Computer Science
  • Directories
  • Efficiency
  • Electronic Mail
  • Engineering
  • Hierarchies
  • Models
  • Prototypes
  • Simulations
  • Simulators
  • Software Development
  • Storage
  • Trees (Data Structures)
  • Workload

Fields of Study

  • Computer science

Readers

  • Artificial Intelligence
  • Geospatial Intelligence and Artificial Intelligence Analytics
  • Parallel and Distributed Computing.