FR2984054B1 - METHOD AND COMPUTER PROGRAM FOR EXTERNALIZED AND CENTRALIZED FAULT MANAGEMENT IN A CLUSTER COMPRISING HIGH AVAILABILITY EQUIPMENT - Google Patents

Info

Publication number
FR2984054B1
Authority
FR
France
Prior art keywords
externalized
cluster
computer program
high availability
fault management
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
FR1161584A
Other languages
French (fr)
Other versions
FR2984054A1 (en)
Inventor
Jean-Olivier Gerphagnon
Alain Moulle
Philippe Couvee
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Bull SAS
Commissariat à l'Energie Atomique et aux Energies Alternatives (CEA)
Original Assignee
Bull SAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Bull SAS
Priority to FR1161584A
Priority to PCT/FR2012/052732 (WO2013088020A1)
Publication of FR2984054A1
Application granted
Publication of FR2984054B1
Current legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0751 Error or fault detection not based on redundancy
    • G06F11/0754 Error or fault detection not based on redundancy by exceeding limits
    • G06F11/0757 Error or fault detection not based on redundancy by exceeding limits by exceeding a time limit, i.e. time-out, e.g. watchdogs
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0709 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in a distributed system consisting of a plurality of standalone computer nodes, e.g. clusters, client-server systems
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0748 Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in a remote unit communicating with a single-box computer node experiencing an error/fault
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16 Error detection or correction of the data by redundancy in hardware
    • G06F11/20 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2023 Failover techniques
    • G06F11/2025 Failover techniques using centralised failover control functionality
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16 Error detection or correction of the data by redundancy in hardware
    • G06F11/20 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2035 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant without idle spare hardware

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Hardware Redundancy (AREA)
FR1161584A 2011-12-13 2011-12-13 METHOD AND COMPUTER PROGRAM FOR EXTERNALIZED AND CENTRALIZED FAULT MANAGEMENT IN A CLUSTER COMPRISING HIGH AVAILABILITY EQUIPMENT Active FR2984054B1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
FR1161584A FR2984054B1 (en) 2011-12-13 2011-12-13 METHOD AND COMPUTER PROGRAM FOR EXTERNALIZED AND CENTRALIZED FAULT MANAGEMENT IN A CLUSTER COMPRISING HIGH AVAILABILITY EQUIPMENT
PCT/FR2012/052732 WO2013088020A1 (en) 2011-12-13 2012-11-27 Method and computer program for the externalized and centralized management of breakdowns in a computer infrastructure including high-availability devices

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
FR1161584A FR2984054B1 (en) 2011-12-13 2011-12-13 METHOD AND COMPUTER PROGRAM FOR EXTERNALIZED AND CENTRALIZED FAULT MANAGEMENT IN A CLUSTER COMPRISING HIGH AVAILABILITY EQUIPMENT

Publications (2)

Publication Number Publication Date
FR2984054A1 (en) 2013-06-14
FR2984054B1 (en) 2015-10-02

Family

ID=47436079

Family Applications (1)

Application Number Title Priority Date Filing Date
FR1161584A Active FR2984054B1 (en) 2011-12-13 2011-12-13 METHOD AND COMPUTER PROGRAM FOR EXTERNALIZED AND CENTRALIZED FAULT MANAGEMENT IN A CLUSTER COMPRISING HIGH AVAILABILITY EQUIPMENT

Country Status (2)

Country Link
FR (1) FR2984054B1 (en)
WO (1) WO2013088020A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106603722B (en) * 2017-01-22 2020-06-09 杭州迪普科技股份有限公司 Management equipment determining method and device

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7130899B1 (en) * 2002-06-14 2006-10-31 Emc Corporation Robust indication processing
US8943191B2 (en) * 2008-04-02 2015-01-27 International Business Machines Corporation Detection of an unresponsive application in a high availability system
US8055933B2 (en) * 2009-07-21 2011-11-08 International Business Machines Corporation Dynamic updating of failover policies for increased application availability

Also Published As

Publication number Publication date
FR2984054A1 (en) 2013-06-14
WO2013088020A1 (en) 2013-06-20

Similar Documents

Publication Publication Date Title
EP2737404A4 (en) METHOD FOR DETECTING ABNORMAL ACTIONS IN A COMPUTER NETWORK
EP2685417A4 (en) SYSTEM, METHOD AND COMPUTER PROGRAM FOR MANAGING ENERGY CONSUMPTION
FR2982386B1 (en) METHOD, COMPUTER PROGRAM, AND CLUSTER COMPUTER RESOURCE ALLOCATION DEVICE FOR EXECUTING A WORK SUBMITTED TO AUDIT CLUSTER
EP2788724A4 (en) System and method for identifying related events in a resource network monitoring system
EP2795804A4 (en) METHOD AND APPARATUS FOR MANAGING A MESSAGE
PL2628080T3 (en) A computer cluster arrangement for processing a computation task and method for operation thereof
BR112013001738A2 (en) "production apparatus for producing images in a computer program, method and program"
EP2735971A4 (en) MANAGEMENT DEVICE AND MANAGEMENT METHOD FOR A STORAGE DEVICE
FR2992081B1 (en) ELECTRONIC SYSTEM COMPRISING A PLURALITY OF ELECTRONIC EQUIPMENT, ELECTRONIC EQUIPMENT OF SUCH A SYSTEM AND METHOD OF MANAGING AND MAINTAINING SUCH A SYSTEM
GB201303104D0 (en) Computer system, method for managing same and program
EP2737680A4 (en) Mediation server, control method therefor, subscription information managing apparatus, control method therefor, subscription management server, and control method therefor
GB201406704D0 (en) Monitoring system for monitoring unauthorized access points, monitoring server, method and program
FR3013866B1 (en) METHOD, COMPUTER PROGRAM AND DEVICE FOR CONFIGURING OR MAINTAINING A COMPUTER SYSTEM IN A CLUSTER
FR2987587B1 (en) METHOD FOR MANAGING A TRAINING FACILITY
EP2733605A4 (en) METHOD AND MANAGEMENT DEVICE FOR WEB PAGE APPLICATION PROGRAM
FR2971908B1 (en) METHOD AND DEVICE FOR NEAR FIELD COMMUNICATION AND CORRESPONDING COMPUTER PROGRAM.
FR2960369B1 (en) METHOD FOR OPTIMIZING ROUTING IN A CLUSTER COMPRISING STATIC COMMUNICATION LINKS AND COMPUTER PROGRAM USING SAID METHOD
EP2827621A4 (en) METHOD, TERMINAL AND SERVER FOR APPLICATION PROGRAM DISTRIBUTION
SG10201602497QA (en) Transmission system, participation fee management method, computer program product, and maintenance system
EP2901757A4 (en) METHOD AND APPARATUS FOR MANAGING INFORMATION IN A NETWORK
FR2990782B1 (en) METHOD FOR MANAGING TASK EXECUTION IN A COMPUTER SYSTEM
EP2482245A4 (en) Method for managing advertisement and advertisement management server
FR2984054B1 (en) METHOD AND COMPUTER PROGRAM FOR EXTERNALIZED AND CENTRALIZED FAULT MANAGEMENT IN A CLUSTER COMPRISING HIGH AVAILABILITY EQUIPMENT
FR2973903B1 (en) METHOD AND DEVICE FOR MANAGING WIRING IN A CLUSTER
FR2960732B1 (en) METHOD FOR PSEUDO-DYNAMIC ROUTING IN A CLUSTER COMPRISING STATIC COMMUNICATION LINKS AND COMPUTER PROGRAM USING THE SAME

Legal Events

Date Code Title Description
PLFP Fee payment Year of fee payment: 5
PLFP Fee payment Year of fee payment: 6
PLFP Fee payment Year of fee payment: 7
PLFP Fee payment Year of fee payment: 9
PLFP Fee payment Year of fee payment: 10
PLFP Fee payment Year of fee payment: 11
PLFP Fee payment Year of fee payment: 12
TQ Partial transmission of property Owner name: LE COMMISSARIAT A L'ENERGIE ATOMIQUE ET AUX EN, FR; Effective date: 20221031
TQ Partial transmission of property Owner name: BULL SAS, FR; Effective date: 20221031
PLFP Fee payment Year of fee payment: 13
PLFP Fee payment Year of fee payment: 14
PLFP Fee payment Year of fee payment: 15