The Community for Technology Leaders
Proceedings of IEEE 24th International Symposium on Fault- Tolerant Computing (1994)
Austin, TX, USA
June 15, 1994 to June 17, 1994
ISBN: 0-8186-5520-8
pp: 404-413
A. Azagury , IBM Israel Sci. & Technol. Center, Haifa, Israel
D. Dolev , IBM Israel Sci. & Technol. Center, Haifa, Israel
G. Goft , IBM Israel Sci. & Technol. Center, Haifa, Israel
J. Marberg , IBM Israel Sci. & Technol. Center, Haifa, Israel
J. Satran , IBM Israel Sci. & Technol. Center, Haifa, Israel
ABSTRACT
The methodology and design of a system that provides highly available data in a cluster is presented. A highly available cluster consists of multiple machines interconnected by a common bus. Data is replicated at a primary and one or more backup machines. Data is accessed at the primary, using a location independent mechanism that ensures data integrity. If the primary copy of the data fails, access is recovered by switching to a backup copy. Switchover is transparent to the application, hence called seamless switchover. The fault model is fail-stop. The entire cluster is resilient to at least single failures. Designating data as highly available is selective in scope, and the overhead of replication and recovery is incurred only by applications that access highly available data. An experimental prototype was implemented using IBM AS/400 machines and a high-speed bus with fiber-optic links.<>
INDEX TERMS
system buses, data integrity, system recovery, local area networks, distributed databases, fault tolerant computing
CITATION

A. Azagury, D. Dolev, G. Goft, J. Marberg and J. Satran, "Highly available cluster: a case study," Proceedings of IEEE 24th International Symposium on Fault- Tolerant Computing(FTCS), Austin, TX, USA, 1994, pp. 404-413.
doi:10.1109/FTCS.1994.315620
92 ms
(Ver 3.3 (11022016))