Minimizing the hidden cost of RDMA
Philip W. Frey, Gustavo Alonso
ICDCS 2009
We present and evaluate a low-overhead approach for achieving high-availability in distributed event-processing middleware systems consisting of networks of stateful software components that communicate by either one-way (send) or twoway (call) messages. The approach is based on transparently augmenting each component to produce a deterministic component whose state can be recovered by checkpoint and replay. Determinism is achieved by augmenting messages with virtual times, and by scheduling message handling in virtual time order. Scheduling delays are reduced by computing virtual times with estimators: deterministic functions that approximate the expected real times of arrival. We describe our algorithms, show how Java components can be transparently augmented with checkpointing code and with good estimators, discuss how our deterministic runtime can be tuned to reduce overhead, and provide experimental results to measure the overhead of determinism relative to non-determinism. © 2009 IEEE.
Philip W. Frey, Gustavo Alonso
ICDCS 2009
Chitra Dorai, Martin Kienzle
IEEE Multimedia
Ba Tu Truong, Svetha Venkatesh, et al.
ICPR 2002
Ying Li, Chitra Dorai
ICME 2005