Thomas R. Puzak, A. Hartstein, et al.
CF 2007
In response to the strong desire of customers to be provided with advance notice of unplanned outages, techniques were developed that detect the occurrence of software aging due to resource exhaustion, estimate the time remaining until the exhaustion reaches a critical level, and automatically perform proactive software rejuvination of an application, process group, or entire operating system. The resulting techniques are very general and can capture a multitude of cluster system characteristics, failure behavior, and performability measures.
Thomas R. Puzak, A. Hartstein, et al.
CF 2007
Raymond Wu, Jie Lu
ITA Conference 2007
Alessandro Morari, Roberto Gioiosa, et al.
IPDPS 2011
Thomas M. Cheng
IT Professional