End-to-End Fault Tolerance Using Transport Layer Multihoming

Abstract

This dissertation investigates the use of transport layer multihoming for providing end-to-end network fault tolerance and improved application performance. Transport layer multihoming is a feature that binds a single transport layer association to multiple network addresses at each endpoint, thus allowing the two end hosts to communicate over multiple network paths. Such path redundancy is useful for fault tolerance in that traffic of existing connections can be redirected (i.e., failover) to a peer's alternate network address without the need for applications (or users) to abort and re-establish connections. Considering the prevalence of path outages in the Internet today, multihoming support at the transport layer can improve resilience of established connections. Using the Stream Control Transmission Protocol (SCTP), we investigate possible design decisions of a multihomed transport protocol, and provide insight for future transport protocols that support multihoming. In particular, we investigate retransmission policies and failover mechanisms in two contexts: proactive (for fixed infrastructure networks), and reactive routing (for mobile ad-hoc networks) protocols. Retransmission policies control the behavior when a transport sender fails to receive acks for sent data. Failover mechanisms determine under which conditions a path is presumed failed, when a sender migrates to a new path, and if/when a sender resumes new data transmission on the original path. We provide a decision tree to suggest a retransmission policy and failover mechanism based on expected network conditions. For topologies with proactive routing and roughly symmetric path delays, our results show that the best retransmission policy is a hybrid retransmission policy introduced by this author: (a) send fast retransmissions to the primary destination address, and (b) send timeout retransmissions to an alternate destination address.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2005
Accession Number
ADA494816

Entities

People

  • Armando L. Caro Jr.

Organizations

  • University of Delaware

Tags

Communities of Interest

  • Cyber
  • Engineered Resilient Systems

DTIC Thesaurus Topics

  • Computer Communications
  • Computer Networks
  • Computers
  • Damage Detection
  • Data Transmission
  • Denial Of Service Attack
  • Detection
  • Fault Tolerance
  • Mobile Ad Hoc Networks
  • Multiple Access
  • Network Protocols
  • Network Topology
  • Operating Systems
  • Routing Protocols
  • Simulators
  • Transport Protocols
  • Wireless Communications

Fields of Study

  • Computer science

Readers

  • Computer Networking