[go: up one dir, main page]

CN104657166B - server system and node replacement method - Google Patents

server system and node replacement method Download PDF

Info

Publication number
CN104657166B
CN104657166B CN201310597425.0A CN201310597425A CN104657166B CN 104657166 B CN104657166 B CN 104657166B CN 201310597425 A CN201310597425 A CN 201310597425A CN 104657166 B CN104657166 B CN 104657166B
Authority
CN
China
Prior art keywords
node
server system
hardware
preset time
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201310597425.0A
Other languages
Chinese (zh)
Other versions
CN104657166A (en
Inventor
卢盈志
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inventec Pudong Technology Corp
Inventec Corp
Original Assignee
Inventec Pudong Technology Corp
Inventec Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inventec Pudong Technology Corp, Inventec Corp filed Critical Inventec Pudong Technology Corp
Priority to CN201310597425.0A priority Critical patent/CN104657166B/en
Publication of CN104657166A publication Critical patent/CN104657166A/en
Application granted granted Critical
Publication of CN104657166B publication Critical patent/CN104657166B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)
  • Computer And Data Communications (AREA)

Abstract

一种节点替换方法,适用于服务器系统。该节点替换方法的步骤流程如下:检测节点是否插入服务器系统,并于检测到节点插入服务器系统时产生提示信号,提示信号用以指示节点不能拔离服务器系统。检测节点的第一识别码与节点中的硬件的第一硬件配置信息。依据第一识别码与硬件配置信息以及节点插入服务器系统之前的第二识别码与硬件配置信息,判断节点或硬件是否已被替换。若判断为否,则关闭节点的电源,以执行节点替换程序。若判断为是,则对节点安装操作系统、软件包数据与固件包数据至少其中之一。

A node replacement method, suitable for server systems. The steps of the node replacement method are as follows: detect whether the node is inserted into the server system, and generate a prompt signal when detecting that the node is inserted into the server system, and the prompt signal is used to indicate that the node cannot be removed from the server system. The first identification code of the node and the first hardware configuration information of the hardware in the node are detected. Whether the node or the hardware has been replaced is determined according to the first identification code and hardware configuration information and the second identification code and hardware configuration information before the node is inserted into the server system. If the judgment is negative, then turn off the power of the node to execute the node replacement program. If the determination is yes, at least one of the operating system, software package data and firmware package data is installed on the node.

Description

服务器系统与节点替换方法Server system and node replacement method

技术领域technical field

本发明涉及一种服务器系统(例如机柜式的数据中心)与节点替换方法,特别是一种可快速进行节点替换程序的服务器系统与节点替换方法。The present invention relates to a server system (such as a cabinet-type data center) and a node replacement method, in particular to a server system and a node replacement method that can quickly perform a node replacement program.

背景技术Background technique

随着科技的发展,通过互联网能够使得世界各地的电脑进行连结。一台电脑通过网络连线便能够与另一台电脑进行数据的交换、存取等动作。在客户端与服务器系统架构上,客户端与服务器便是通过网络来进行沟通。With the development of technology, computers all over the world can be connected through the Internet. A computer can exchange and access data with another computer through a network connection. In the client and server system architecture, the client and the server communicate through the network.

一般来说,服务器系统可配置有多个节点,且每一个节点同时运行多个虚拟机器(virtual machine,VM),藉以提供给每一使用者独立的运作环境。并且,每个节点可视为各自独立的计算机,亦即各节点具有存储器、储存空间、运算能力与网络连结功能。因此,各节点可以运行独自的操作系统,且各节点之间也可以通过网络设备进行沟通与数据传输。Generally speaking, the server system can be configured with multiple nodes, and each node runs multiple virtual machines (virtual machines, VMs) at the same time, so as to provide each user with an independent operating environment. Moreover, each node can be regarded as an independent computer, that is, each node has a memory, a storage space, a computing capability and a network connection function. Therefore, each node can run its own operating system, and each node can also communicate and transmit data through network equipment.

在服务器系统架构完毕后,必须对其内的节点完成部署,亦即安装各节点所需的操作系统、软件包数据与固件包数据,以使服务器系统运作并提供服务给使用者使用。然而,当服务器系统中的任意一组节点的硬件产生损坏时,将会导致对应于此硬件的节点无法正常运作,此时检测人员由于无法精确得知是哪一节点中的特定硬件发生损坏,而仅能依序将节点拔离与插入服务器系统来进行检测,造成检测时间上的浪费。After the server system structure is completed, the deployment of the nodes must be completed, that is, the operating system, software package data and firmware package data required by each node must be installed, so that the server system can operate and provide services to users. However, when the hardware of any group of nodes in the server system is damaged, the nodes corresponding to this hardware will not be able to operate normally. At this time, since the inspectors cannot accurately know which specific hardware in the node is damaged, However, the detection can only be performed by removing and inserting the nodes into the server system in sequence, resulting in a waste of detection time.

发明内容Contents of the invention

本发明所要解决的技术问题是提供一种服务器系统与节点替换方法,其可自动地依据节点的识别码与节点中的硬件的硬件配置信息来判断特定节点或此节点中的特定硬件是否需要被替换,以让检测人员可快速方便地对节点进行替换。The technical problem to be solved by the present invention is to provide a server system and a node replacement method, which can automatically determine whether a specific node or specific hardware in this node needs to be replaced according to the identification code of the node and the hardware configuration information of the hardware in the node. Replacement, allowing inspectors to quickly and easily replace nodes.

为了实现上述目的,本发明提供了一种节点替换方法,此节点替换方法适用于服务器系统。此节点替换方法的步骤流程如下所述。检测节点是否插入服务器系统,并于检测到节点插入服务器系统时产生第一提示信号,其中此第一提示信号用以指示节点不能拔离服务器系统。检测节点的第一识别码与节点中的硬件的第一硬件配置信息。依据第一识别码与第一硬件配置信息以及节点插入服务器系统之前的第二识别码与第二硬件配置信息,判断节点或节点中的硬件是否已被替换。若判断出节点与节点中的硬件皆未被替换,则关闭节点的电源,以执行节点替换程序。若判断出节点或节点中的硬件被替换,则对节点安装操作系统、软件包数据与固件包数据至少其中之一。In order to achieve the above object, the present invention provides a node replacement method, which is suitable for a server system. The flow of steps for this node replacement method is described below. Detect whether the node is plugged into the server system, and generate a first prompt signal when detecting that the node is plugged into the server system, wherein the first prompt signal is used to indicate that the node cannot be removed from the server system. The first identification code of the node and the first hardware configuration information of the hardware in the node are detected. According to the first identification code and first hardware configuration information and the second identification code and second hardware configuration information before the node is inserted into the server system, it is judged whether the node or the hardware in the node has been replaced. If it is determined that neither the node nor the hardware in the node has been replaced, the power of the node is turned off to execute the node replacement procedure. If it is determined that the node or the hardware in the node is replaced, at least one of the operating system, software package data and firmware package data is installed on the node.

在一实施例中,于对节点安装操作系统、软件包数据与固件包数据至少其中之一的步骤之后,还包括以下所述的步骤流程。持续检测节点中的硬件的状况,以判断硬件是否发生错误。若判断出硬件产生不可修复的错误,则关闭此节点的电源,以执行节点替换程序。若判断出硬件产生可修复的错误次数达到预设的门槛值,则对此节点进行正常关机程序,并据以执行节点替换程序。In one embodiment, after the step of installing at least one of the operating system, software package data, and firmware package data on the node, the following steps are further included. Continuously detect the status of the hardware in the node to determine whether there is an error in the hardware. If it is judged that the hardware has an irreparable error, then turn off the power of this node to execute the node replacement program. If it is judged that the number of recoverable errors generated by the hardware reaches the preset threshold value, a normal shutdown procedure is performed on the node, and a node replacement procedure is executed accordingly.

在一实施例中,节点替换程序包括以下所述的步骤流程。将节点设定为初始模式。产生第二提示信号,其中此第二提示信号用以指示节点能拔离服务器系统。检测节点是否拔离服务器系统。检测节点或另一节点是否插入服务器系统。若检测到节点或另一节点插入服务器系统,则产生第一提示信号,并且接续检测节点的第一识别码与节点中的硬件的第一硬件配置信息之后的步骤。In one embodiment, the node replacement procedure includes the steps described below. Sets the node to initial mode. A second prompt signal is generated, wherein the second prompt signal is used to indicate that the node can be disconnected from the server system. Detect whether the node is unplugged from the server system. Detect if a node or another node is plugged into the server system. If it is detected that the node or another node is plugged into the server system, a first prompt signal is generated, and the steps after detecting the first identification code of the node and the first hardware configuration information of the hardware in the node are continued.

承接上述的实施例,于检测节点是否拔离服务器系统的步骤中,还包括以下所述的步骤流程。设定第一预设时间并开始计时。判断节点是否已拔离服务器系统。若判断出节点尚未拔离服务器系统,则重置第一预设时间并重新计时。若判断出节点拔离服务器系统且已经过第一预设时间,则执行检测节点或另一节点是否插入服务器系统之后的步骤。Following the above-mentioned embodiment, the step of detecting whether the node is disconnected from the server system further includes the steps described below. Set the first preset time and start timing. Determine whether the node has been disconnected from the server system. If it is determined that the node has not been disconnected from the server system, the first preset time is reset and timed again. If it is determined that the node is disconnected from the server system and the first preset time has elapsed, the steps after detecting whether the node or another node is plugged into the server system are performed.

承接上述的实施例,于检测节点或另一节点是否插入服务器系统的步骤中,还包括以下所述的步骤流程。设定第一预设时间与第二预设时间并开始计时,其中第二预设时间接续于第一预设时间后。判断节点是否插入服务器系统。若判断出节点尚未插入服务器系统,则重置第一预设时间并重新计时。若判断出节点插入服务器系统且已经过第一预设时间,则接着判断节点于第二预设时间中是否仍持续插入服务器系统。若判断出节点于第二预设时间中拔离服务器系统,则接续将节点设定为初始模式之后的步骤。若判断出节点于经过第二预设时间后仍未拔离服务器系统,则产生第一提示信号,并且接续检测节点的第一识别码与节点中的硬件的第一硬件配置信息之后的步骤。Following the above-mentioned embodiments, the step of detecting whether the node or another node is plugged into the server system further includes the steps described below. Setting the first preset time and the second preset time and starting timing, wherein the second preset time follows the first preset time. Determine whether the node is plugged into the server system. If it is determined that the node has not been inserted into the server system, the first preset time is reset and the timing is restarted. If it is determined that the node is plugged into the server system and the first preset time has elapsed, then it is judged whether the node is still plugged into the server system during the second preset time. If it is determined that the node is disconnected from the server system within the second preset time, then continue with the steps after setting the node to the initial mode. If it is judged that the node has not been disconnected from the server system after the second preset time, a first prompt signal is generated, and the steps after detecting the first identification code of the node and the first hardware configuration information of the hardware in the node are continued.

为了更好地实现上述目的,本发明还提供了一种服务器系统,此服务器系统包括节点、检测模块、提示模块以及处理模块。节点具有硬件。检测模块通讯连接节点,此检测模块用以检测节点是否插入或拔离服务器系统,以及用以检测节点的第一识别码与硬件的第一硬件配置信息。提示模块通讯连接检测模块,此提示模块用以于检测模块检测到节点插入服务器系统时产生第一提示信号。其中,此第一提示信号用以指示节点不能拔离服务器系统。处理模块通讯连接于检测模块与节点之间,此处理模块用以依据第一识别码与第一硬件配置信息以及节点插入服务器系统之前的第二识别码与第二硬件配置信息来判断节点或硬件是否已被替换。其中,若处理模块判断出节点与硬件皆未被替换,则关闭节点的电源,以执行节点替换程序,若处理模块判断出节点或硬件被替换,则对节点安装操作系统、软件包数据与固件包数据至少其中之一。In order to better achieve the above object, the present invention also provides a server system, which includes a node, a detection module, a prompt module and a processing module. Nodes have hardware. The detection module is connected to the node through communication. The detection module is used to detect whether the node is plugged into or removed from the server system, and is used to detect the first identification code of the node and the first hardware configuration information of the hardware. The prompt module communicates with the detection module, and the prompt module is used to generate a first prompt signal when the detection module detects that the node is inserted into the server system. Wherein, the first prompt signal is used to indicate that the node cannot be disconnected from the server system. The processing module is communicatively connected between the detection module and the node, and the processing module is used to judge the node or the hardware according to the first identification code and the first hardware configuration information and the second identification code and the second hardware configuration information before the node is inserted into the server system has been replaced. Wherein, if the processing module determines that neither the node nor the hardware has been replaced, then turn off the power of the node to execute the node replacement program; if the processing module determines that the node or the hardware has been replaced, then install the operating system, software package data and firmware on the node Packet data at least one of them.

在一实施例中,于处理模块对节点安装操作系统、软件包数据与固件包数据至少其中之一之后,处理模块还持续判断硬件是否发生错误。若处理模块判断出硬件产生不可修复的错误,则关闭节点的电源,以执行节点替换程序。若处理模块判断出硬件产生可修复的错误次数达到预设的门槛值,则对节点进行正常关机程序,并据以执行节点替换程序。In one embodiment, after the processing module installs at least one of the operating system, software package data, and firmware package data on the node, the processing module continues to determine whether a hardware error occurs. If the processing module determines that an unrecoverable error occurs in the hardware, the power of the node is turned off to execute the node replacement program. If the processing module judges that the number of recoverable errors generated by the hardware reaches the preset threshold value, it performs a normal shutdown procedure on the node, and executes the node replacement procedure accordingly.

在一实施例中,于服务器系统执行节点替换程序时,处理模块将节点设定为初始模式。接着,提示模块产生第二提示信号,此第二提示信号用以指示节点能拔离服务器系统。另外,检测模块检测节点是否拔离服务器系统,并于检测到节点拔离服务器系统后接续检测节点或另一节点是否插入服务器系统。若检测模块检测到节点或另一节点插入服务器系统,则提示模块产生第一提示信号,并且检测模块接续检测节点的第一识别码与硬件的第一硬件配置信息之后的处理程序。In one embodiment, when the server system executes the node replacement procedure, the processing module sets the node to an initial mode. Then, the prompting module generates a second prompting signal, and the second prompting signal is used to indicate that the node can be disconnected from the server system. In addition, the detection module detects whether the node is disconnected from the server system, and then detects whether the node or another node is plugged into the server system after detecting that the node is disconnected from the server system. If the detection module detects that the node or another node is inserted into the server system, the prompt module generates a first prompt signal, and the detection module continues the processing procedure after detecting the first identification code of the node and the first hardware configuration information of the hardware.

承接上述的实施例,服务器系统还包括计时模块,此计时模块通讯连接检测模块。于检测模块检测节点是否拔离服务器系统时,计时模块会设定第一预设时间并开始计时。若检测模块检测出节点于第一预设时间内仍未拔离服务器系统,则计时模块将重置第一预设时间并重新计时。若检测模块检测出节点拔离服务器系统且已经过第一预设时间,则接续检测节点或另一节点是否插入服务器系统之后的处理程序。Following the above-mentioned embodiment, the server system further includes a timing module, and the timing module is a communication connection detection module. When the detection module detects whether the node is disconnected from the server system, the timing module will set the first preset time and start timing. If the detection module detects that the node has not been disconnected from the server system within the first preset time, the timing module will reset the first preset time and start counting again. If the detection module detects that the node is disconnected from the server system and the first preset time has elapsed, the processing procedure after detecting whether the node or another node is inserted into the server system is continued.

承接上述的实施例,服务器系统还包括计时模块,此计时模块通讯连接检测模块。于检测模块检测节点或另一节点是否插入服务器系统时,计时模块会设定第一预设时间与第二预设时间并开始计时,其中第二预设时间接续于第一预设时间后。若检测模块检测出节点于第一预设时间内尚未插入服务器系统,则计时模块重置第一预设时间并重新计时。若检测模块检测出节点插入服务器系统且已经过第一预设时间,则接续检测节点于第二预设时间中是否仍持续插入服务器系统。若检测模块检测出节点于第二预设时间中拔离服务器系统,则接续处理模块将节点设定为初始模式的处理程序。若检测模块检测出节点于经过第二预设时间后仍未拔离服务器系统,则提示模块产生第一提示信号,并且检测模块接续检测节点的第一识别码与硬件的第一硬件配置信息之后的处理程序。Following the above-mentioned embodiment, the server system further includes a timing module, and the timing module is a communication connection detection module. When the detection module detects whether the node or another node is plugged into the server system, the timing module sets the first preset time and the second preset time and starts timing, wherein the second preset time follows the first preset time. If the detection module detects that the node has not been inserted into the server system within the first preset time, the timing module resets the first preset time and restarts timing. If the detection module detects that the node is plugged into the server system and the first preset time has elapsed, then the connection checks whether the node is still plugged into the server system during the second preset time. If the detection module detects that the node is disconnected from the server system within the second preset time, the connection processing module sets the node as the processing procedure of the initial mode. If the detection module detects that the node has not been removed from the server system after the second preset time, the prompt module generates a first prompt signal, and the detection module continues to detect the first identification code of the node and the first hardware configuration information of the hardware handler.

本发明的技术效果在于:Technical effect of the present invention is:

本发明的服务器系统与节点替换方法,其通过检测节点的识别码与节点中的硬件的硬件配置信息,来判断节点或节点中的硬件的硬件是否已被替换,进而选择性地执行节点替换程序或对此节点安装操作系统、软件包数据或固件包数据。此外,本发明的服务器系统与节点替换方法还可于节点安装操作系统、软件包数据或固件包数据后,持续地检测节点中的硬件状况,并于节点中的硬件产生错误时通过提示信号来让检测人员得知可以进行节点替换程序。The server system and node replacement method of the present invention determine whether the node or the hardware of the node has been replaced by detecting the identification code of the node and the hardware configuration information of the hardware in the node, and then selectively execute the node replacement program Or install an operating system, package data, or firmware package data for this node. In addition, the server system and node replacement method of the present invention can also continuously detect the hardware status of the node after the node installs the operating system, software package data or firmware package data, and when the hardware in the node generates an error, it can be prompted by a signal Inform inspectors that node replacement procedures are available.

以下结合附图和具体实施例对本发明进行详细描述,但不作为对本发明的限定。The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments, but not as a limitation of the present invention.

附图说明Description of drawings

图1为根据本发明一实施例的服务器系统的功能框图;1 is a functional block diagram of a server system according to an embodiment of the present invention;

图2A为根据本发明一实施例的服务器系统的节点替换方法的步骤流程图;FIG. 2A is a flowchart of steps of a node replacement method of a server system according to an embodiment of the present invention;

图2B为根据本发明另一实施例的服务器系统的节点替换方法的步骤流程图;FIG. 2B is a flowchart of steps of a node replacement method of a server system according to another embodiment of the present invention;

图3为根据本发明一实施例的节点替换程序的步骤流程图;FIG. 3 is a flowchart of steps of a node replacement program according to an embodiment of the present invention;

图4为根据图3中的步骤S304的详细步骤流程图;Fig. 4 is a detailed step flow chart according to step S304 in Fig. 3;

图5为根据图3中的步骤S306的详细步骤流程图。FIG. 5 is a detailed flow chart of steps according to step S306 in FIG. 3 .

其中,附图标记Among them, reference signs

1 服务器系统1 server system

10 节点10 nodes

12 检测模块12 detection module

14 提示模块14 prompt module

16 处理模块16 processing modules

18 计时模块18 timing module

S200~S214、S300~S308、S400~S404、S500~S506 步骤S200~S214, S300~S308, S400~S404, S500~S506 steps

具体实施方式Detailed ways

下面结合附图对本发明的结构原理和工作原理作具体的描述:Below in conjunction with accompanying drawing, structural principle and working principle of the present invention are specifically described:

请参照图1,图1为根据本发明一实施例的服务器系统的功能框图。如图1所示,服务器系统1包括节点10、检测模块12、提示模块14、处理模块16与计时模块18,其中检测模块12通讯连接于节点10提示模块14、处理模块16与计时模块18之间,且节点10又与处理模块16通讯连接。本发明所述的通讯连接可以是以实体线路连接而实现,或是以无线通讯连接而实现,本发明在此不加以限制。于实务上,服务器系统1可以为一种机柜式的数据中心(container data center),但不以此为限。以下将分别就服务器系统1中的各部功能模块作详细的说明。Please refer to FIG. 1 , which is a functional block diagram of a server system according to an embodiment of the present invention. As shown in Figure 1, the server system 1 includes a node 10, a detection module 12, a prompt module 14, a processing module 16 and a timing module 18, wherein the detection module 12 is connected to the node 10 by communicating with the prompt module 14, the processing module 16 and the timing module 18. , and the node 10 is in communicative connection with the processing module 16 . The communication connection described in the present invention may be realized by a physical line connection or a wireless communication connection, which is not limited in the present invention. In practice, the server system 1 can be a cabinet-type data center (container data center), but not limited thereto. Each functional module of the server system 1 will be described in detail below.

节点10具有至少一硬件,所述的硬件可以包括有基板管理控制器(baseboardmanagement controller,BMC)、网络界面控制器(network interface controller,NIC,亦称网络卡)、硬盘(hard disk drive,HDD)、DIMM(Dual In-line Memory Module)与中央处理器(CPU)等,但不以此为限。此外,虽然图1仅绘示有一组节点,然而本发明的服务器系统在此不加以限制节点的个数。The node 10 has at least one piece of hardware, and the hardware may include a baseboard management controller (baseboard management controller, BMC), a network interface controller (network interface controller, NIC, also known as a network card), a hard disk drive (hard disk drive, HDD) , DIMM (Dual In-line Memory Module) and central processing unit (CPU), etc., but not limited thereto. In addition, although FIG. 1 only shows a group of nodes, the server system of the present invention does not limit the number of nodes here.

检测模块12用以检测节点10是否有插入或拔离服务器系统1,以及用以检测节点10的第一识别码与节点10中硬件的第一硬件配置信息(hardware configurationinformation)。于实务上,节点10的识别码可以为一种通用唯一识别码(universallyunique identifier,UUID),但不以此为限。一般来说,此种通用唯一识别码是由一串16位元组(亦称128位元)的16进位数字所构成,用以让每一个节点10都能有唯一的辨识信息,再者,此UUID能由SMBIOS(System Management BIOS)Type1数据结构的UUID栏位取得;节点10的硬件配置信息能被计算而得其唯一4位元组的硬件签字(Hardware Signature),BIOS(Basic Input Output System)将于其POST(Power On Self Test)时取得硬件配置信息并进得将硬件签字存于ACPI(Advanced Configuration Power Management Interface)FACS(Firmware ACPI Control Structure)表的Hardware Signature栏位,此HardwareSignature栏位可用来快速决定是否硬件配置信息不同;再者,可通过ping节点10上BMC的NIC来检测节点10是否插入或拔除。The detection module 12 is used to detect whether the node 10 is plugged into or removed from the server system 1 , and is used to detect the first identification code of the node 10 and the first hardware configuration information of the hardware in the node 10 . In practice, the identification code of the node 10 may be a universal unique identifier (UUID), but not limited thereto. Generally speaking, this kind of UUID is composed of a string of 16-bit (also known as 128-bit) hexadecimal numbers, so that each node 10 can have unique identification information. Furthermore, This UUID can be obtained from the UUID column of the SMBIOS (System Management BIOS) Type1 data structure; the hardware configuration information of the node 10 can be calculated to obtain its unique 4-byte hardware signature (Hardware Signature), BIOS (Basic Input Output System ) will obtain the hardware configuration information during its POST (Power On Self Test) and store the hardware signature in the Hardware Signature column of the ACPI (Advanced Configuration Power Management Interface) FACS (Firmware ACPI Control Structure) table. This HardwareSignature column is available to quickly determine whether the hardware configuration information is different; moreover, it is possible to detect whether the node 10 is inserted or removed by pinging the NIC of the BMC on the node 10.

提示模块14用以于检测模块12检测到节点10插入服务器系统1时产生一组第一提示信号,此第一提示信号用以指示节点10不能拔离服务器系统1。此外,在某些情况中,提示模块14产生一组第二提示信号,此第二提示信号用以指示节点10能拔离服务器系统1。于实务上,提示模块14可以为一种显示模块(例如发光二极体、显示面板、七段显示器等电子显示元件)或是发声模块(例如喇叭、蜂鸣器等电子发声元件),本发明在此不加以限制。若提示模块14为显示模块的话,则提示信号以影像或是光线的形式呈现给使用者;若提示模块14为发声模块的话,则提示信号以声音的型式呈现给使用者。The prompt module 14 is used for generating a set of first prompt signals when the detection module 12 detects that the node 10 is plugged into the server system 1 , and the first prompt signals are used to indicate that the node 10 cannot be removed from the server system 1 . In addition, in some cases, the prompt module 14 generates a set of second prompt signals, and the second prompt signals are used to indicate that the node 10 can be disconnected from the server system 1 . In practice, the prompt module 14 can be a display module (such as electronic display components such as light-emitting diodes, display panels, and seven-segment displays) or a sound module (such as electronic sound components such as speakers and buzzers). No limitation is imposed here. If the prompt module 14 is a display module, the prompt signal is presented to the user in the form of image or light; if the prompt module 14 is a sound module, the prompt signal is presented to the user in the form of sound.

处理模块16用以依据节点10的识别码与节点10中硬件的硬件配置信息以及节点10插入服务器系统1之前的第二识别码与第二硬件配置信息来判断节点10或节点10中的硬件是否已被替换;需知,“节点10插入服务器系统1之前的第二识别码与第二硬件配置信息”是“节点10之前次插入服务器系统1时的其识别码与硬件配置信息”,再者,要是新的节点10插入服务器系统,则其第二识别码与第二硬件配置信息皆是空的。计时模块18用以设定至少一组预设时间,并且开始计时。于某些情况中,计时模块18会在计时的过程中将计时归零,以重新开始计时。The processing module 16 is used to determine whether the node 10 or the hardware in the node 10 is It has been replaced; it should be known that "the second identification code and second hardware configuration information before the node 10 is inserted into the server system 1" is "the identification code and hardware configuration information of the node 10 when it was inserted into the server system 1 last time", and , if a new node 10 is inserted into the server system, its second identification code and second hardware configuration information are both empty. The timing module 18 is used to set at least one set of preset times and start timing. In some cases, the timing module 18 resets the timing to zero during the timing, so as to restart the timing.

为了更加清楚说明本发明的服务器系统1与节点替换方法的实际运作方式,请一并参照图1与图2A,图2A为根据本发明一实施例的服务器系统的节点替换方法的步骤流程图。如图2A所示,在步骤S200中,检测模块12会检测节点10是否插入服务器系统1,并于检测到节点10插入服务器系统时,提示模块14会产生一组第一提示信号,并进入步骤S202。若检测模块12未检测到节点10插入服务器系统1,则继续重复执行步骤S200,直至检测模块12检测到节点10插入服务器系统1为止。In order to more clearly illustrate the actual operation of the server system 1 and the node replacement method of the present invention, please refer to FIG. 1 and FIG. 2A together. FIG. 2A is a flow chart of the steps of the node replacement method of the server system according to an embodiment of the present invention. As shown in Figure 2A, in step S200, the detection module 12 will detect whether the node 10 is inserted into the server system 1, and when detecting that the node 10 is inserted into the server system, the prompt module 14 will generate a set of first prompt signals, and enter the step S202. If the detection module 12 does not detect that the node 10 is inserted into the server system 1 , step S200 is repeated until the detection module 12 detects that the node 10 is inserted into the server system 1 .

在步骤S202中,检测模块12会接着检测节点10的第一识别码与此节点10中的其中一硬件的第一硬件配置信息。在步骤S204中,处理模块16会依据第一识别码与第一硬件配置信息以及节点10插入服务器系统1之前的第二识别码与第二硬件配置信息,判断节点10或节点10中的硬件是否已被替换。若处理模块16判断出节点10或节点10中的硬件已被替换,则执行步骤S206;若处理模块16判断出节点10与节点10中的硬件皆未被替换,则执行步骤S208。需知,在节点10或其硬件没被替换下,也可强制重新对此节点10安装操作系统、软件包数据或固件包数据(图未示),其可应用于实际的情境如下﹕当节点10只是因其上的硬件因接触不良而产生硬件错误,此时可将节点10拔除,然后使其硬件接触良好,然后再将节点10重新插回服务器系统1。In step S202 , the detection module 12 then detects the first identification code of the node 10 and the first hardware configuration information of one of the hardware in the node 10 . In step S204, the processing module 16 will determine whether the node 10 or the hardware in the node 10 is has been replaced. If the processing module 16 determines that the node 10 or the hardware in the node 10 has been replaced, execute step S206; if the processing module 16 determines that neither the node 10 nor the hardware in the node 10 has been replaced, execute step S208. It should be known that if the node 10 or its hardware has not been replaced, the operating system, software package data or firmware package data (not shown) can also be forced to be re-installed on the node 10, which can be applied to the actual situation as follows: when the node 10 just because the hardware on it produces hardware error because of bad contact, node 10 can be unplugged at this moment, make its hardware contact well then, and then node 10 is inserted back into server system 1 again.

在步骤S206中,处理模块16会对节点10安装操作系统(operating system,OS)、软件包数据(software package data)与固件包数据(firmware package data)至少其中之一。在步骤S208中,处理模块16会关闭(power off)节点10的电源,以执行节点替换程序。In step S206 , the processing module 16 installs at least one of an operating system (operating system, OS), software package data (software package data) and firmware package data (firmware package data) on the node 10 . In step S208, the processing module 16 will power off the power of the node 10 to execute the node replacement procedure.

请参照图2B,图2B为根据本发明另一实施例的服务器系统的节点替换方法的步骤流程图。如图2B所示,在对节点安装操作系统、软件包数据与固件包数据至少其中之一的步骤(即步骤S206)之后,检测模块12或是另一组监控模块(图未示)可以持续地检测节点中的硬件的状况,以使处理模块16判断硬件是否发生错误(即步骤S210)。若检测模块12或是另一组监控模块检测出节点10当中的硬件产生不可修复的错误(un-correct error),则执行步骤S212;若检测模块12或是另一组监控模块检测出节点10当中的硬件产生可修复的错误(correct error)次数达到预设的门槛值(default threshold value),则执行步骤S214。Please refer to FIG. 2B . FIG. 2B is a flowchart of steps of a node replacement method of a server system according to another embodiment of the present invention. As shown in Figure 2B, after the step of installing at least one of the operating system, software package data and firmware package data on the node (ie step S206), the detection module 12 or another group of monitoring modules (not shown) can continue The condition of the hardware in the node is accurately detected, so that the processing module 16 determines whether an error occurs in the hardware (that is, step S210). If the detection module 12 or another group of monitoring modules detects that the hardware in the node 10 produces an uncorrectable error (un-correct error), then perform step S212; if the detection module 12 or another group of monitoring modules detects that the node 10 When the number of correct errors generated by the hardware reaches a preset threshold value (default threshold value), step S214 is executed.

在步骤S212中,由于节点10当中的硬件产生不可修复的错误,亦即此时节点10已经损坏而无法正常运作,则处理模块16会关闭节点10的电源,以执行节点替换程序。在步骤S214中,由于节点10当中的硬件产生可修复的错误次数达到预设的门槛值(例如,一小时内产生10次以上的可修复的错误次数),亦即此时节点10快要损坏而即将无法正常运作,则处理模块16会对节点10进行正常关机(shutdown)程序,并据以执行节点替换程序。In step S212, since the hardware in the node 10 has an irreparable error, that is, the node 10 is damaged and cannot operate normally, the processing module 16 will turn off the power of the node 10 to execute the node replacement procedure. In step S214, since the number of repairable errors generated by the hardware in the node 10 reaches the preset threshold value (for example, more than 10 repairable errors are generated within one hour), that is, the node 10 is about to be damaged at this time. If it is about to fail to operate normally, the processing module 16 will perform a normal shutdown (shutdown) procedure on the node 10, and execute a node replacement procedure accordingly.

请参照图3,图3为根据本发明一实施例的节点替换程序的步骤流程图。如图3所示,在步骤S300中,处理模块16会将节点10设定为初始模式。于本实施例中,初始模式为动态主机配置协议(dynamic host configuration protocol,DHCP)模式。在实际的操作中,当处理模块16判断出节点10能拔离服务器系统1时,会自动地将节点10的基板管理控制器设定回DHCP模式,以取得新的一组基板管理控制器的互联网协议位址(internet protocoladdress,IP address)。Please refer to FIG. 3 . FIG. 3 is a flowchart of steps of a node replacement program according to an embodiment of the present invention. As shown in FIG. 3 , in step S300 , the processing module 16 sets the node 10 to an initial mode. In this embodiment, the initial mode is a dynamic host configuration protocol (DHCP) mode. In actual operation, when the processing module 16 determines that the node 10 can be removed from the server system 1, it will automatically set the baseboard management controller of the node 10 back to the DHCP mode to obtain a new set of baseboard management controllers. Internet protocol address (internet protocol address, IP address).

在步骤S302中,提示模块14会产生一组第二提示信号,此第二提示信号用以指示节点10能拔离服务器系统1。在步骤S304中,检测模块12会检测节点10是否拔离服务器系统1。若检测模块12检测出节点10仍未拔离服务器系统1,则继续执行步骤S304;若检测模块12检测出节点10拔离服务器系统1,则执行步骤S306。在步骤S306中,检测模块12会接续检测节点10或另一组节点是否插入服务器系统1。若检测模块12检测出有一组节点(节点10或另一组节点)插入服务器系统1,则执行步骤S308;若检测模块12检测出仍未有节点插入服务器系统1,则继续执行步骤S306。在步骤S308中,提示模块14会产生第一提示信号,并接续执行步骤S202。In step S302 , the prompt module 14 generates a set of second prompt signals, and the second prompt signals are used to indicate that the node 10 can be disconnected from the server system 1 . In step S304 , the detection module 12 detects whether the node 10 is disconnected from the server system 1 . If the detection module 12 detects that the node 10 has not been disconnected from the server system 1 , continue to execute step S304 ; if the detection module 12 detects that the node 10 is disconnected from the server system 1 , then execute step S306 . In step S306 , the detection module 12 will continue to detect whether the node 10 or another group of nodes is plugged into the server system 1 . If the detection module 12 detects that a group of nodes (node 10 or another group of nodes) is inserted into the server system 1, then execute step S308; if the detection module 12 detects that no node has been inserted into the server system 1, then continue to execute step S306. In step S308, the prompting module 14 generates a first prompting signal, and continues to execute step S202.

请参照图4,图4为根据图3中的步骤S304的详细步骤流程图。如图4所示,在提示模块14产生第二提示信号(即步骤S302)之后,计时模块18会设定一组第一预设时间(例如一分钟)并且开始计时。在步骤S402中,会判断节点10是否已拔离服务器系统1。若判断出节点10于第一预设时间内仍未拔离服务器系统1,则执行步骤S404;若判断出节点10拔离服务器系统1且已经过第一预设时间,则执行步骤S306。在步骤S404中,计时模块18会重置第一预设时间并重新计时,并接续步骤S402的判断程序。Please refer to FIG. 4 . FIG. 4 is a detailed flow chart of steps according to step S304 in FIG. 3 . As shown in FIG. 4 , after the prompting module 14 generates the second prompting signal (ie, step S302 ), the timing module 18 sets a set of first preset times (for example, one minute) and starts timing. In step S402 , it is determined whether the node 10 has been disconnected from the server system 1 . If it is determined that the node 10 has not been disconnected from the server system 1 within the first preset time, then step S404 is performed; if it is determined that the node 10 has been disconnected from the server system 1 and the first preset time has passed, then step S306 is performed. In step S404, the timing module 18 resets the first preset time and restarts the timing, and continues the determination procedure of step S402.

此外,步骤S402中所执行的判断节点10是否已拔离服务器系统1的步骤可以通过检测模块12、处理模块16或节点10的网络界面控制器来达成,本发明在此不加以限制,例如可通过ping节点10上BMC的NIC来检测节点10是否拔离。藉此,通过图4的判断机制可以避免因网络不稳或接触不良所造成的误判节点10已拔离服务器系统1的情况,换句话说,图4的判断机制为一种de-bounce机制。In addition, the step of judging whether the node 10 has been disconnected from the server system 1 performed in step S402 can be achieved by the detection module 12, the processing module 16 or the network interface controller of the node 10, the present invention is not limited here, for example, Check whether the node 10 is disconnected by pinging the NIC of the BMC on the node 10. In this way, the judging mechanism in FIG. 4 can avoid misjudgment that the node 10 has been disconnected from the server system 1 due to network instability or poor contact. In other words, the judging mechanism in FIG. 4 is a de-bounce mechanism .

请参照图5,图5为根据图3中的步骤S306的详细步骤流程图。如图5所示,于检测模块12检测节点10是否拔离服务器系统1的步骤(即步骤S304)之后,计时模块18会设定一组第一预设时间与一组第二预设时间并开始计时,其中第二预设时间接续于第一预设时间之后。举例来说,第一预设时间为计时模块18开始计时的第一分钟(即第0~60秒),而第二预设时间则为计时模块18开始计时的第二分钟(即第61~120秒),需知,第一预设时间与第二预设时间是可不同的。Please refer to FIG. 5 , which is a detailed flow chart of steps according to step S306 in FIG. 3 . As shown in FIG. 5, after the detection module 12 detects whether the node 10 is disconnected from the server system 1 (i.e. step S304), the timing module 18 will set a set of first preset times and a set of second preset times and Start counting, wherein the second preset time follows the first preset time. For example, the first preset time is the first minute (that is, the 0th to 60th seconds) when the timing module 18 starts timing, and the second preset time is the second minute (that is, the 61st to 60th seconds) when the timing module 18 starts timing. 120 seconds), it should be noted that the first preset time and the second preset time can be different.

在步骤S502中,会判断节点10是否插入服务器系统1。若判断出节点10仍未插入服务器系统1,则执行步骤S504;若判断出节点10插入服务器系统1,则执行步骤S506。在步骤S504中,计时模块18会重置第一预设时间并重新计时,并接续步骤S502的判断程序,所以步骤S502及步骤S504的判断机制为一种de-bounce机制。此外,步骤S502所执行的判断程序可以系通过检测模块12、处理模块16或节点10的网络界面控制器来达成,本发明在此不加以限制,例如可通过ping节点10上BMC的NIC来检测节点10是否插入。In step S502, it is determined whether the node 10 is plugged into the server system 1 or not. If it is determined that the node 10 has not been inserted into the server system 1, then execute step S504; if it is determined that the node 10 is inserted into the server system 1, then execute step S506. In step S504, the timing module 18 resets the first preset time and counts again, and continues the judging process of step S502, so the judging mechanism of steps S502 and S504 is a de-bounce mechanism. In addition, the judgment procedure executed in step S502 can be achieved by the detection module 12, the processing module 16 or the network interface controller of the node 10, the present invention is not limited here, for example, it can be detected by pinging the NIC of the BMC on the node 10 Whether node 10 is inserted.

在步骤S506中,若判断出节点10插入服务器系统1且已经过第一预设时间,则接着判断节点10于第二预设时间中是否仍持续插入服务器系统1。若判断节点10于第二预设时间中仍持续插入服务器系统1,表示节点10与所插入服务器系统1中的位置皆正确无误,则执行步骤S308之后的步骤;若判断节点10于第二预设时间中拔离服务器系统1,表示节点10与所插入服务器系统1中的位置可能有错误或插错节点10而被拔离,则执行步骤S300之后的步骤,以将正确的节点10插入至服务器系统1中的正确位置,所以步骤S506的判断机制为一种人为防呆(fool-proofing)机制。In step S506, if it is determined that the node 10 is plugged into the server system 1 and the first preset time has elapsed, then it is judged whether the node 10 is still plugged into the server system 1 during the second preset time. If it is judged that the node 10 is still inserted into the server system 1 continuously during the second preset time, it means that the positions of the node 10 and the inserted server system 1 are correct, then the steps after step S308 are executed; Assuming that the server system 1 is pulled out during the time, it means that the position of the node 10 and the inserted server system 1 may be wrong or the wrong node 10 is pulled out, then the steps after step S300 are executed to insert the correct node 10 into the The correct location in the server system 1, so the judging mechanism in step S506 is a fool-proofing mechanism.

此外,步骤S506中所执行的判断程序可以通过检测模块12、处理模块16或节点10的网络界面控制器来达成,本发明在此不加以限制,例如可通过ping节点10上BMC的NIC来检测节点10是否持续插入。藉此,通过图5的判断机制除了可以避免因网络不稳或接触不良所造成的误判节点10已插入服务器系统1的情况之外,还能让使用者有机会当误差节点时,有机会将此节点拔除并插入正确的节点,换句话说,图5的判断机制为一种de-bounce机制与防呆(fool-proofing)机制的组合。In addition, the judgment procedure executed in step S506 can be achieved by the detection module 12, the processing module 16 or the network interface controller of the node 10, the present invention is not limited here, for example, it can be detected by pinging the NIC of the BMC on the node 10 Whether node 10 is continuously inserted. In this way, through the judging mechanism in Figure 5, in addition to avoiding the misjudgment that the node 10 has been inserted into the server system 1 due to network instability or poor contact, it also allows users to have the opportunity to be an error node and have the opportunity to Pull out this node and insert it into a correct node. In other words, the judging mechanism in FIG. 5 is a combination of a de-bounce mechanism and a fool-proofing mechanism.

综合以上所述,本发明实施例提供的服务器系统与节点替换方法,其通过检测节点的识别码与节点中的硬件的硬件配置信息,来判断节点或节点中的硬件是否已被替换或新的节点是否被加入,进而选择性地执行节点替换程序或对此节点安装操作系统、软件包数据或固件包数据,甚至在节点或硬件没被替换下,也可强制重新对此节点安装操作系统、软件包数据或固件包数据。此外,本发明的服务器系统与节点替换方法还可于节点安装操作系统、软件包数据或固件包数据后,持续地检测节点中的硬件状况,并于节点中的硬件产生错误时通过提示信号来让使用者得知可以进行节点替换程序。藉此,本发明的服务器系统与节点替换方法可以自动地执行节点是否需要换置的处理程序,使用者仅需依据提示信号来将节点插入或拔离服务器系统,而不需要去做其他的检测程序,十分具有实用性。Based on the above, the server system and node replacement method provided by the embodiments of the present invention judge whether the node or the hardware in the node has been replaced or new by detecting the identification code of the node and the hardware configuration information of the hardware in the node. Whether the node is added, and then selectively execute the node replacement program or install the operating system, software package data or firmware package data on this node, even if the node or hardware is not replaced, it can also force the re-installation of the operating system, Package data or firmware package data. In addition, the server system and node replacement method of the present invention can also continuously detect the hardware status of the node after the node installs the operating system, software package data or firmware package data, and when the hardware in the node generates an error, it can be prompted by a signal Let the user know that the node replacement procedure is available. In this way, the server system and node replacement method of the present invention can automatically execute the processing procedure of whether the node needs to be replaced, and the user only needs to insert or unplug the node from the server system according to the prompt signal, and does not need to do other tests The program is very practical.

当然,本发明还可有其它多种实施例,在不背离本发明精神及其实质的情况下,熟悉本领域的技术人员当可根据本发明作出各种相应的改变和变形,但这些相应的改变和变形都应属于本发明所附的权利要求的保护范围。Certainly, the present invention also can have other multiple embodiments, without departing from the spirit and essence of the present invention, those skilled in the art can make various corresponding changes and deformations according to the present invention, but these corresponding Changes and deformations should belong to the scope of protection of the appended claims of the present invention.

Claims (10)

1. a kind of node replacement method, suitable for a server system, it is characterised in that the node replacement method includes:
Detect a node and whether insert the server system, and produce when detecting that the node inserts the server system one the One cue, first cue is indicating that the node can not pull away the server system;
Detect one first identification code and one first hardware configuration information of the hardware in the node of the node;
Inserted according to first identification code and first hardware configuration information and the node before the server system the Two identification codes and one second hardware configuration information, judge whether the hardware in the node or the node has been replaced;
If judging, the node is not all replaced with the hardware in the node, closes the power supply of the node, to perform a section Point replaces program;And
If judging, the hardware in the node or the node is replaced, to the operating system of node installation one, a software kit Data and a firmware bag data at least one.
2. node replacement method as claimed in claim 1, it is characterised in that in the node installation operating system, this is soft After part bag data and at least one of step of firmware bag data, in addition to:
The situation of the hardware in the node is persistently detected, to judge whether the hardware makes a mistake;
If judging, the hardware produces the mistake of unrepairable, closes the power supply of the node, and program is replaced to perform the node; And
If judging, the hardware produces recoverable errors number and reaches default threshold value, and normal shutdown is carried out to the node Program, and perform the node according to this and replace program.
3. node replacement method as claimed in claim 1, it is characterised in that the node, which replaces program, to be included:
It is an initial pattern by the node sets;
One second cue is produced, second cue is indicating that the node can pull away the server system;
Detect whether the node pulls away the server system;
If detecting, the node pulls away the server system, detects the node or whether another node inserts the server system System;And
If detecting, the node or another node insert the server system, produce first cue, and the inspection that continues Survey the step of first identification code of the node is with after first hardware configuration information of the hardware in the node.
4. node replacement method as claimed in claim 3, it is characterised in that whether the node pulls away the server system in detection In the step of system, in addition to:
Set one first preset time and start timing;
Judge whether the node has pulled away the server system;
If judging, the node not yet pulls away the server system, resets first preset time and reclocking;And
If judging, the node pulls away the server system and has been subjected to first preset time, perform detection node or another The step of whether one node is inserted after the server system.
5. node replacement method as claimed in claim 3, it is characterised in that whether inserted in the detection node or another node In the step of server system, in addition to:
Set one first preset time and one second preset time and start timing, wherein second preset time be connected in this After one preset time;
Judge whether the node inserts the server system;
If judging, the node not yet inserts the server system, resets first preset time and reclocking;
If judging, the node inserts the server system and has been subjected to first preset time, then judges the node in this Whether the server system is still persistently inserted in second preset time;
If judging, the node pulls away the server system in second preset time, continues the node sets are first for this The step of after beginning pattern;And
If judging the node in not pulling away the server system yet after second preset time, first prompting is produced Signal, and first hardware configuration information of the hardware in continue first identification code and the node for detecting the node it Step afterwards.
A kind of 6. server system, it is characterised in that including:
One node, there is a hardware;
One detection module, communication connects the node, to detect whether the node inserts or pull away the server system, Yi Jiyong To detect one first hardware configuration information of one first identification code of the node and the hardware;
One reminding module, communication connects the detection module, to detect that the node inserts the server system in the detection module One first cue is produced during system, first cue is indicating that the node can not pull away the server system;And
One processing module, communication be connected between the detection module and the node, to according to first identification code and this first One second identification code that hardware configuration information and the node are inserted before the server system and one second hardware configuration information To judge whether the node or the hardware have been replaced;
Wherein, if the processing module judges that the node is not all replaced with the hardware, the power supply of the node is closed, to perform One node replaces program, if the processing module judges that the node or the hardware are replaced, system is operated to the node installation one System, a package data and a firmware bag data at least one.
7. server system as claimed in claim 6, it is characterised in that in the processing module to the node installation operation system After at least one, whether the processing module also persistently judges the hardware for system, the package data and the firmware bag data Make a mistake, if judging, the hardware produces the mistake of unrepairable, closes the power supply of the node, to perform node replacement Program, if judging, the hardware produces recoverable errors number and reaches default threshold value, and the node is normally closed Machine program, and perform the node according to this and replace program.
8. server system as claimed in claim 6, it is characterised in that perform the node in the server system and replace program When, the node sets are an initial pattern by the processing module, and then the reminding module produces one second cue, and this second Cue is indicating that the node can pull away the server system, then carrys out the detection module and detect whether the node pulls away the clothes It is engaged in device system, and continues after detecting that the node pulls away the server system and to detect the node or whether another node inserts this Server system, if the detection module detects that the node or another node insert the server system, reminding module production Raw first cue, and the detection module continue detect first identification code of the node and the hardware this is first hard Processing routine after part configuration information.
9. server system as claimed in claim 8, it is characterised in that the server system also includes a timing module, should Timing module communication connects the detection module, when the detection module detects the node and whether pulls away the server system, the meter When module set and one first preset time and start timing, if the detection module detects the node in first preset time The server system is not pulled away yet, then the timing module resets first preset time and reclocking, if the detection module is examined Measure the node to pull away the server system and have been subjected to first preset time, then continue and detect the node or another node and be The no processing routine inserted after the server system.
10. server system as claimed in claim 8, it is characterised in that the server system also includes:
One timing module, timing module communication connect the detection module, the node or another node are detected in the detection module When whether inserting the server system, the timing module sets one first preset time and one second preset time and starts to count When, after wherein second preset time is connected in first preset time, if the detection module detect the node in this first The server system is not yet inserted in preset time, then the timing module resets first preset time and reclocking, if should Detection module detects that the node inserts the server system and has been subjected to first preset time, then continue detect the node in Whether the server system is still persistently inserted in second preset time, if the detection module detects that the node is second pre- in this If pulling away the server system in the time, then processing routine of the processing module that continues by the node sets for the initial pattern, If the detection module detects the node in not pulling away the server system yet after second preset time, the prompting mould Block produces first cue, and the detection module continue detect first identification code of the node and the hardware this Processing routine after one hardware configuration information.
CN201310597425.0A 2013-11-22 2013-11-22 server system and node replacement method Expired - Fee Related CN104657166B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310597425.0A CN104657166B (en) 2013-11-22 2013-11-22 server system and node replacement method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310597425.0A CN104657166B (en) 2013-11-22 2013-11-22 server system and node replacement method

Publications (2)

Publication Number Publication Date
CN104657166A CN104657166A (en) 2015-05-27
CN104657166B true CN104657166B (en) 2018-03-20

Family

ID=53248348

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310597425.0A Expired - Fee Related CN104657166B (en) 2013-11-22 2013-11-22 server system and node replacement method

Country Status (1)

Country Link
CN (1) CN104657166B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110457074A (en) * 2019-07-26 2019-11-15 新华三技术有限公司成都分公司 Configuration method, device, electronic equipment and the storage medium of calculate node

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001082678A2 (en) * 2000-05-02 2001-11-08 Sun Microsystems, Inc. Cluster membership monitor
CN102135932A (en) * 2011-03-08 2011-07-27 浪潮(北京)电子信息产业有限公司 Monitoring system and monitoring method thereof
CN102769673A (en) * 2012-07-25 2012-11-07 楚云汉智武汉网络存储系统有限公司 Failure detection method suitable to large-scale storage cluster
CN103186403A (en) * 2011-12-28 2013-07-03 英业达股份有限公司 Node replacement processing method and server system using the method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001082678A2 (en) * 2000-05-02 2001-11-08 Sun Microsystems, Inc. Cluster membership monitor
CN102135932A (en) * 2011-03-08 2011-07-27 浪潮(北京)电子信息产业有限公司 Monitoring system and monitoring method thereof
CN103186403A (en) * 2011-12-28 2013-07-03 英业达股份有限公司 Node replacement processing method and server system using the method
CN102769673A (en) * 2012-07-25 2012-11-07 楚云汉智武汉网络存储系统有限公司 Failure detection method suitable to large-scale storage cluster

Also Published As

Publication number Publication date
CN104657166A (en) 2015-05-27

Similar Documents

Publication Publication Date Title
CN106681751A (en) Unified firmware management system and management method and computer readable medium
CN105589776A (en) Fault location method and server
CN104572226A (en) Method and device for detecting mainboard starting abnormity
EP2189906A1 (en) Method and apparatus for abnormality recovering of data card, and data card
CN114995860B (en) Graphics processor firmware upgrade method and graphics processor
CN115629825A (en) Server and asset information acquisition method, asset information providing method and asset information providing device
US9703937B2 (en) Method to prevent operating system digital product key activation failures
CN102902613B (en) Computer system and diagnosis method thereof
CN104809044A (en) Method and system for detecting starting state of baseplate management controller
TWI611289B (en) Server and error detecting method thereof
CN104731617A (en) Server starting device determining method
TW201314574A (en) Method and device for updating BIOS program for computer system
US8495353B2 (en) Method and circuit for resetting register
CN105912414A (en) Method and system for server management
CN115934446A (en) A self-test method, server, device and storage medium
CN104657166B (en) server system and node replacement method
TWI501090B (en) Server system and node detection method
JP6217086B2 (en) Information processing apparatus, error detection function diagnosis method, and computer program
TW201705015A (en) Method and system for remote system configuration management and non-transitory computer-readable storage medium
CN110020565B (en) Probe information reading fault prompting method, device, server and storage medium
CN107168837A (en) A kind of automatic switchover OS method of testing and test system
CN119046051A (en) Fault processing method and product of computer system
CN119271250A (en) BMC chip firmware upgrade method, BMC chip, device and storage medium
CN115292113B (en) Fault detection method, device and electronic equipment for memory of server
TWI518519B (en) Server system and node replacement method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20180320