Eliminating Cross-Server Operations in Scalable File Systems

Abstract

Distributed file systems that scale by partitioning files and directories among a collection of servers inevitably encounter cross-server operations. A common example is a RENAME that moves a file from a directory managed by one server to a directory managed by another. Systems that provide the same semantics for cross-server operations as for those that do not span servers traditionally implement dedicated protocols for these rare operations. This paper suggests an alternate approach that exploits the existence of dynamic redistribution functionality to avoid cross-server operations (e.g., for load balancing, incorporation of new servers, etc.). When a client request would involve files on multiple servers, the system can redistribute those files onto one server and have it service the request. Although such redistribution is more expensive than a dedicated cross-server protocol, the rareness of such operations makes the overall performance impact minimal. Analysis of NFS traces indicates that cross-server operations make up fewer than 0.001% of client requests, and experiments with a prototype implementation show that the performance impact is negligible when such operations make up as much as 0.01% of operations. Thus, when dynamic redistribution functionality exists in the system, cross-server operations can be handled with little additional implementation complexity.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
May 01, 2006
Accession Number
ADA490161

Entities

People

  • Gregory R. Ganger
  • James Hendricks
  • Raja R. Sambasivan
  • Shafeeq Sinnamohideen

Organizations

  • Carnegie Mellon University

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Computer Programming
  • Computer Science
  • Computers
  • Consistency
  • Data Storage Systems
  • Directories
  • Electrical Engineering
  • Electronic Mail
  • Engineering
  • Environment
  • Fault Tolerance
  • Generators
  • Models
  • Prototypes
  • Semantics
  • Servers (Computer Hardware)
  • Trees (Data Structures)

Fields of Study

  • Computer science

Readers

  • Parallel and Distributed Computing.