Global Patent Index - EP 0974903 A2

EP 0974903 A2 2000-01-26 - Method and apparatus for providing failure detection and recovery with predetermined replication style for distributed applications in a network

Title (en)

Method and apparatus for providing failure detection and recovery with predetermined replication style for distributed applications in a network

Title (de)

Verfahren und Vorrichtung zur Fehlererkennung und Wiederherstellung mit vorbestimmtem Replikationsgrad für verteilte Anwendungen in einem Netzwerk

Title (fr)

Procédé et appareil pour détection de défaillance et recouvrement avec degré prédéterminé de réplication pour des applications distribuées dans un réseau

Publication

EP 0974903 A2 (EN)

Application

EP 99305515 A

Priority

US 11913998 A

Abstract (en)

An application module (A) running on a host computer in a computer network is failure-protected with one or more backup copies that are operative on other host computers in the network. To effect fault protection, the application module registers itself with a ReplicaManager daemon process (112) by sending a registration message. In addition to identifying the registering application module and the host computer on which it is running, the message specifies the particular replication strategy (cold backup, warm backup, or hot backup) and the degree of replication associated with that application module. The backup copies are then maintained in a fail-over state according to the registered replication strategy. A WatchDog daemon (113), running on the same host computer as the registered application, periodically monitors the registered application to detect failures. When a failure, such as a crash or hang of the application module, is detected, the failure is reported to the ReplicaManager, which effects the requested fail-over actions. An additional backup copy is then made operative in accordance with the registered replication style and the registered degree of replication. A SuperWatchDog daemon process (115-1), running on the same host computer as the ReplicaManager, monitors each host computer in the computer network. When a host failure is detected, each application module running on that host computer is individually failure-protected in accordance with its registered replication style and degree of replication.
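
The registration and fail-over flow described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the patent: the class names Style and Registration, the method names, and the host labels are hypothetical, and the sketch assumes that the degree of replication counts backup copies, excluding the active copy. The replication style is only recorded here; how a cold, warm, or hot copy is actually kept ready is outside the sketch.

```python
from dataclasses import dataclass, field
from enum import Enum


class Style(Enum):
    """Replication strategies named in the abstract."""
    COLD = "cold"
    WARM = "warm"
    HOT = "hot"


@dataclass
class Registration:
    """Hypothetical registration message sent by an application module."""
    module: str   # identifies the registering application module
    host: str     # host computer on which the active copy runs
    style: Style  # recorded only; not acted on in this sketch
    degree: int   # ASSUMPTION: number of backup copies to maintain


@dataclass
class ReplicaManager:
    """Toy stand-in for the ReplicaManager daemon (hypothetical API)."""
    hosts: set
    registry: dict = field(default_factory=dict)
    backups: dict = field(default_factory=dict)

    def register(self, reg):
        self.registry[reg.module] = reg
        self.backups[reg.module] = []
        self._replenish(reg.module)

    def _replenish(self, module):
        # Add backup hosts until the registered degree is reached.
        reg = self.registry[module]
        used = {reg.host, *self.backups[module]}
        for host in sorted(self.hosts - used):
            if len(self.backups[module]) >= reg.degree:
                break
            self.backups[module].append(host)

    def report_module_failure(self, module):
        # What a WatchDog report would trigger: promote the first backup,
        # then make an additional backup operative.
        reg = self.registry[module]
        if not self.backups[module]:
            return None
        reg.host = self.backups[module].pop(0)
        self._replenish(module)
        return reg.host

    def report_host_failure(self, host):
        # What a SuperWatchDog report would trigger: handle every module
        # that had an active or backup copy on the failed host.
        self.hosts.discard(host)
        for module, reg in self.registry.items():
            if host in self.backups[module]:
                self.backups[module].remove(host)
                self._replenish(module)
            if reg.host == host:
                self.report_module_failure(module)
```

For example, registering module "A" on host "H1" with a hot style and a degree of 2 in a four-host network reserves two backup hosts. Reporting a failure of "A" then promotes the first backup and restores the backup count from the remaining hosts.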

IPC 1-7

G06F 11/14

IPC 8 full level

G06F 15/177 (2006.01); G06F 11/00 (2006.01); G06F 11/14 (2006.01); G06F 11/30 (2006.01)

CPC

G06F 11/2097 (2013.01); G06F 11/1438 (2013.01); G06F 11/2023 (2013.01); G06F 11/0757 (2013.01); G06F 11/2038 (2013.01)

DCS

DE ES FR GB IT NL

DOCDB simple family

EP 0974903 A2 20000126; EP 0974903 A3 20010613; EP 0974903 B1 20030514; AU 4020299 A 20000210; AU 752844 B2 20021003; CA 2273523 A1 20000120; CA 2273523 C 20030318; DE 69907818 D1 20030618; DE 69907818 T2 20040304; JP 2000105754 A 20000411; KR 20000011835 A 20000225; US 6266781 B1 20010724