Today, a successful Internet service is absolutely critical to be up 100 percent of the time. Server clustering is the most promising approach to meet this requirement. However, the existing Web server-clustering solutions can merely provide high availability derived from its redundant nature, but offer no guarantee about fault resilience for the service. In this paper, we address this problem by implementing an innovative mechanism that enables a Web request to be smoothly migrated and recovered on another working node in the presence of server failure. We will show that the request migration and recovery could be efficiently achieved in the manner of user transparency. The achieved capability of fault resilience is important and essential for a variety of critical services (e.g. E-commerce), which are increasingly widespread in use. Our approach takes an important step towards providing a highly reliable Web service.
All Science Journal Classification (ASJC) codes