[go: up one dir, main page]

WO2002001347A3 - Method and system for automatic re-assignment of software components of a failed host - Google Patents

Method and system for automatic re-assignment of software components of a failed host Download PDF

Info

Publication number
WO2002001347A3
WO2002001347A3 PCT/SE2001/001448 SE0101448W WO0201347A3 WO 2002001347 A3 WO2002001347 A3 WO 2002001347A3 SE 0101448 W SE0101448 W SE 0101448W WO 0201347 A3 WO0201347 A3 WO 0201347A3
Authority
WO
WIPO (PCT)
Prior art keywords
hosts
monitoring
components
host
software components
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/SE2001/001448
Other languages
French (fr)
Other versions
WO2002001347A2 (en
Inventor
Edwin Tse
Nicolas Gosselin
Fergus Kelledy
David O'flanagan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Telefonaktiebolaget LM Ericsson AB
Original Assignee
Telefonaktiebolaget LM Ericsson AB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telefonaktiebolaget LM Ericsson AB filed Critical Telefonaktiebolaget LM Ericsson AB
Priority to AU2001266503A priority Critical patent/AU2001266503A1/en
Publication of WO2002001347A2 publication Critical patent/WO2002001347A2/en
Publication of WO2002001347A3 publication Critical patent/WO2002001347A3/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2035Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant without idle spare hardware
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2023Failover techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Hardware Redundancy (AREA)

Abstract

In a network of co-operating hosts (80, 82, 84, 86, 88), a method and system for automatic re-assignment of software components (110, 112) of a failed host to co-operating monitoring (82, 86) or back-up hosts. In a preferred embodiment, a Central Information Repository (CIR), such as an LDAP server, keeps track of software components (110, 112) running on the network hosts (80, 82, 84, 86, 88) and a Monitoring Partnership Program (MPP), in which some hosts (80, 82, 84, 86, 88) monitor the activity of other hosts (80, 82, 84, 86, 88), is provided. Upon failure of a monitored host (84), a monitoring host (82, 86) detects the failure, and informs the other monitoring hosts (82, 86) or the other back-up hosts, if any, of the failure of the monitored host (84). The monitoring hosts (82, 86), and/or the back-up hosts query the CIR for obtaining the identity of the software components (110, 112) running on the failed host (84) before the failure, and select which such components (110, 112) each will start. The monitoring hosts (82, 86) and/or the back-up hosts then take over and start the failed components (110, 112). Upon recovery, the monitored host (84) queries the CIR and obtains the list of its software components, informs the CIR and the monitoring or back-up hosts (82, 86) that it will take over, and starts its components (110, 112), while the monitoring and/or the back-up hosts (82, 86) shut down the components (110, 112) they temporarily run.
PCT/SE2001/001448 2000-06-30 2001-06-21 Method and system for automatic re-assignment of software components of a failed host Ceased WO2002001347A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2001266503A AU2001266503A1 (en) 2000-06-30 2001-06-21 Method and system for automatic re-assignment of software components of a failedhost

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US60911100A 2000-06-30 2000-06-30
US09/609,111 2000-06-30

Publications (2)

Publication Number Publication Date
WO2002001347A2 WO2002001347A2 (en) 2002-01-03
WO2002001347A3 true WO2002001347A3 (en) 2002-06-20

Family

ID=24439380

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/SE2001/001448 Ceased WO2002001347A2 (en) 2000-06-30 2001-06-21 Method and system for automatic re-assignment of software components of a failed host

Country Status (2)

Country Link
AU (1) AU2001266503A1 (en)
WO (1) WO2002001347A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7200781B2 (en) 2003-05-14 2007-04-03 Hewlett-Packard Development Company, L.P. Detecting and diagnosing a malfunctioning host coupled to a communications bus
US7676621B2 (en) 2003-09-12 2010-03-09 Hewlett-Packard Development Company, L.P. Communications bus transceiver

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6938256B2 (en) 2000-01-18 2005-08-30 Galactic Computing Corporation System for balance distribution of requests across multiple servers using dynamic metrics
US6816905B1 (en) 2000-11-10 2004-11-09 Galactic Computing Corporation Bvi/Bc Method and system for providing dynamic hosted service management across disparate accounts/sites
US8538843B2 (en) 2000-07-17 2013-09-17 Galactic Computing Corporation Bvi/Bc Method and system for operating an E-commerce service provider
US7055052B2 (en) 2002-11-21 2006-05-30 International Business Machines Corporation Self healing grid architecture for decentralized component-based systems
US8489741B2 (en) 2002-11-21 2013-07-16 International Business Machines Corporation Policy enabled grid architecture
US8140677B2 (en) 2002-11-21 2012-03-20 International Business Machines Corporation Autonomic web services hosting service
CA2435655A1 (en) 2003-07-21 2005-01-21 Symbium Corporation Embedded system administration
DE102004050350B4 (en) 2004-10-15 2006-11-23 Siemens Ag Method and device for redundancy control of electrical devices
CA2504333A1 (en) 2005-04-15 2006-10-15 Symbium Corporation Programming and development infrastructure for an autonomic element
US8856585B2 (en) * 2011-08-01 2014-10-07 Alcatel Lucent Hardware failure mitigation
US12271867B1 (en) 2020-02-10 2025-04-08 State Farm Mutual Automobile Insurance Company Predicting resource lifecycles and managing resources in enterprise networks
CN114020512B (en) * 2021-11-03 2025-02-11 中国工商银行股份有限公司 Background task processing method, device, computer equipment and storage medium

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000010822A (en) * 1998-06-25 2000-01-14 Yokogawa Electric Corp Distributed object down detector
EP0981089A2 (en) * 1998-07-20 2000-02-23 Lucent Technologies Inc. Method and apparatus for providing failure detection and recovery with predetermined degree of replication for distributed applications in a network
EP0990986A2 (en) * 1998-09-30 2000-04-05 Ncr International Inc. Failure recovery of partitioned computer systems including a database schema

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000010822A (en) * 1998-06-25 2000-01-14 Yokogawa Electric Corp Distributed object down detector
EP0981089A2 (en) * 1998-07-20 2000-02-23 Lucent Technologies Inc. Method and apparatus for providing failure detection and recovery with predetermined degree of replication for distributed applications in a network
EP0990986A2 (en) * 1998-09-30 2000-04-05 Ncr International Inc. Failure recovery of partitioned computer systems including a database schema

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
PATENT ABSTRACTS OF JAPAN vol. 2000, no. 04 31 August 2000 (2000-08-31) *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7200781B2 (en) 2003-05-14 2007-04-03 Hewlett-Packard Development Company, L.P. Detecting and diagnosing a malfunctioning host coupled to a communications bus
US7676621B2 (en) 2003-09-12 2010-03-09 Hewlett-Packard Development Company, L.P. Communications bus transceiver

Also Published As

Publication number Publication date
WO2002001347A2 (en) 2002-01-03
AU2001266503A1 (en) 2002-01-08

Similar Documents

Publication Publication Date Title
WO2002001347A3 (en) Method and system for automatic re-assignment of software components of a failed host
CA2246603A1 (en) System and method for failure detection and recovery
CN112506702B (en) Disaster recovery method, device, equipment and storage medium for data center
KR100442884B1 (en) Method for updating firmware
US6748550B2 (en) Apparatus and method for building metadata using a heartbeat of a clustered system
DE3379353D1 (en) Method and apparatus for restarting a computing system
WO2000069142A3 (en) Method and apparatus for finding mirrored hosts
WO1999057632A3 (en) Initializing and restarting operating systems
CN112486718B (en) Database fault automatic switching method, device and computer storage medium
WO2001010073A3 (en) Method, system and computer readable storage medium for automatic device driver configuration
JPH11120012A5 (en)
WO2001084313A3 (en) Method and system for achieving high availability in a networked computer system
EP0981089A3 (en) Method and apparatus for providing failure detection and recovery with predetermined degree of replication for distributed applications in a network
EP0974903A3 (en) Method and apparatus for providing failure detection and recovery with predetermined replication style for distributed applications in a network
WO2001029661A3 (en) Method and apparatus for maintaining a computer system
WO2000017755A3 (en) Protocol for replicated servers
WO2002012987A3 (en) Systems and methods for authenticating a user to a web server
DK0954779T3 (en) Procedure for reconstructing a calculation mode
WO2003027848A3 (en) Backup-restoration system and right management server
WO2000054149A3 (en) Methods and systems for reduced configuration dependency in thin client applications
WO2001040944A3 (en) Method and system for recovery infrastructure for computer systems
US20030167287A1 (en) Information protection system
JPH11120012A (en) Client-server type database management system and recording medium recording the program
CN109274761A (en) NAS cluster node, system and data access method
JP2003524255A (en) Internet based remote data and file recovery system and method

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP