CN103684918A - Method and device for detecting link failure - Google Patents
Method and device for detecting link failure Download PDFInfo
- Publication number
- CN103684918A CN103684918A CN201310690351.5A CN201310690351A CN103684918A CN 103684918 A CN103684918 A CN 103684918A CN 201310690351 A CN201310690351 A CN 201310690351A CN 103684918 A CN103684918 A CN 103684918A
- Authority
- CN
- China
- Prior art keywords
- port
- controller
- wwpn
- address
- switch
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Landscapes
- Data Exchanges In Wide-Area Networks (AREA)
Abstract
Description
技术领域technical field
本发明实施例涉及通信技术,尤其涉及一种链路故障检测方法和装置。Embodiments of the present invention relate to communication technologies, and in particular to a link fault detection method and device.
背景技术Background technique
光纤通道(Fiber Channel,简称:FC)协议主要应用于存储领域,是服务器扩展存储的一种主要方式。FC网络具体可区分为存储区域网络(Storage Area Network,简称:SAN)和直接附加存储(Direct AttachedStorage,简称:DAS)。其中,SAN是一种通过光纤集线器、光纤路由器、光纤交换机等连接设备将磁盘阵列、磁带等存储设备与相关服务器连接起来的高速专用子网。在DAS组网方式中,存储设备通过电缆直接到服务器,I/O(输入/输出)请求直接发送到存储设备。无论哪种组网,FC的整个链路都包含多个网络节点,例如主机总线适配器(Host Bus Adapter,简称:HBA)、交换机(Switch)、磁盘阵列等。链路上的任何一个节点故障,都会导致整个FC链路故障。The Fiber Channel (FC for short) protocol is mainly used in the storage field and is a major way for servers to expand storage. The FC network can be specifically divided into storage area network (Storage Area Network, referred to as: SAN) and direct attached storage (Direct Attached Storage, referred to as: DAS). Among them, SAN is a high-speed dedicated subnet that connects disk arrays, tapes and other storage devices with related servers through fiber optic hubs, fiber optic routers, fiber optic switches and other connecting devices. In the DAS networking mode, the storage device is directly connected to the server through the cable, and the I/O (input/output) request is directly sent to the storage device. No matter what kind of networking, the entire link of FC includes multiple network nodes, such as host bus adapter (Host Bus Adapter, HBA for short), switch (Switch), disk array, etc. Any node failure on the link will cause the failure of the entire FC link.
基于可靠性的考虑,FC存储通常采用冗余组网,在主机和逻辑存储设备之间有多条路径可以选择。当某条路径故障时,通过上层多路径软件自动检测故障并进行链路切换,避免业务受损。Based on reliability considerations, FC storage usually adopts redundant networking, and multiple paths can be selected between the host and logical storage devices. When a path fails, the upper-layer multipath software automatically detects the failure and performs link switching to avoid service damage.
但是,现有技术一般是基于某个节点的单点检测:例如在主机侧检测HBA卡端口的状态,link状态、link速率、光模块状态等;在交换机检测系统温度、电压、光模块状态、端口变化等;在存储阵列侧检测端口状态、光模块等。即,现有的技术是在系统中的各个节点单独进行的,每个节点只能进行最基本的检测,对于一些更加复杂的错误,比如光纤没有连接/光纤连接错误、对端设备不在位或者异常等故障无法进行检测,因此FC网络的可靠性仍然较低。However, the existing technology is generally based on a single point detection of a node: for example, the status of the HBA card port, link status, link rate, optical module status, etc. are detected on the host side; the system temperature, voltage, optical module status, etc. are detected on the switch. Port changes, etc.; detect port status, optical modules, etc. on the storage array side. That is, the existing technology is performed on each node in the system independently, and each node can only perform the most basic detection. For some more complex errors, such as no fiber connection/fiber connection error, the peer device is not in place or Faults such as abnormalities cannot be detected, so the reliability of the FC network is still low.
发明内容Contents of the invention
本发明实施例提供一种链路故障检测方法和装置,以克服现有技术的FC链路检测不全面,导致FC网络可靠性低的问题。Embodiments of the present invention provide a link fault detection method and device to overcome the problem of low reliability of the FC network due to incomplete FC link detection in the prior art.
第一方面,本发明实施例提供一种链路故障检测方法,包括:In a first aspect, an embodiment of the present invention provides a link fault detection method, including:
检测装置收集光纤信道网络中至少两个节点的互相对应的端口的状态信息;The detection device collects state information of ports corresponding to each other of at least two nodes in the fiber channel network;
所述检测装置将至少两个所述节点的互相对应的端口的状态信息进行比较,根据比较结果确定是否存在故障。The detection device compares the state information of the ports corresponding to each other at least two nodes, and determines whether there is a fault according to the comparison result.
在第一方面的第一种可能的实现方式中,所述检测装置收集光纤信道网络中至少两个节点的互相对应的状态信息,包括:In a first possible implementation manner of the first aspect, the detection device collects state information corresponding to each other of at least two nodes in the fiber channel network, including:
所述检测装置通过同一接口收集光纤信道网络中所有节点的所有端口的状态信息。The detection device collects state information of all ports of all nodes in the fiber channel network through the same interface.
在第一方面的第二种可能的实现方式中,所述端口的状态信息包括以下至少一个:端口类型、连接状态、工作模式、自协商结果和全球端口名称wwpn地址。In a second possible implementation manner of the first aspect, the state information of the port includes at least one of the following: port type, connection state, working mode, auto-negotiation result, and worldwide port name wwpn address.
根据第一方面的第二种可能的实现方式,在第三种可能的实现方式中,所述节点包括:主机总线适配器HBA、交换机Switch和存储设备;According to the second possible implementation manner of the first aspect, in the third possible implementation manner, the node includes: a host bus adapter HBA, a switch Switch, and a storage device;
所述检测装置将至少两个所述节点的互相对应的端口的状态信息进行比较,根据比较结果确定是否存在故障,包括:The detection device compares the status information of the ports corresponding to each other at least two nodes, and determines whether there is a fault according to the comparison result, including:
所述检测装置将所述HBA的端口的wwpn地址与所述Switch上对应端口已连接设备的地址进行比较,若所述HBA的端口的wwpn地址与所述Switch上对应端口已连接设备的地址不一致,则确定所述HBA卡与所述Switch之间存在故障;或者The detection device compares the wwpn address of the port of the HBA with the address of the device connected to the corresponding port on the Switch, if the wwpn address of the port of the HBA is inconsistent with the address of the device connected to the corresponding port on the Switch , it is determined that there is a fault between the HBA card and the Switch; or
所述检测装置将所述存储设备的控制器的端口的wwpn地址与Switch上对应端口已连接设备的地址进行比较,若所述存储设备的控制器的端口的wwpn地址与所述Switch上对应端口已连接设备的地址不一致,则确定所述存储设备的控制器与所述Switch之间存在故障;或者The detection device compares the wwpn address of the port of the controller of the storage device with the address of the device connected to the corresponding port on the Switch, if the wwpn address of the port of the controller of the storage device is the same as that of the corresponding port on the Switch If the addresses of the connected devices are inconsistent, it is determined that there is a fault between the controller of the storage device and the Switch; or
所述检测装置判断所述Switch上连接不同设备的端口的端口类型是否相同,若相同,则确定所述Switch的连接或配置存在故障。The detection device judges whether the port types of the ports connected to different devices on the Switch are the same, and if they are the same, it is determined that there is a fault in the connection or configuration of the Switch.
根据第一方面的第三种可能的实现方式,在第四种可能的实现方式中,在所述检测装置确定所述HBA与所述Switch之间存在故障之后,还包括:According to the third possible implementation manner of the first aspect, in the fourth possible implementation manner, after the detection device determines that there is a fault between the HBA and the Switch, it further includes:
当所述检测装置确定所述HBA的端口的wwpn地址不为空,而所述Switch上对应端口的wwpn地址为空时,所述检测装置确定所述故障的类型为链路异常;或者When the detection device determines that the wwpn address of the port of the HBA is not empty, and the wwpn address of the corresponding port on the Switch is empty, the detection device determines that the type of the fault is a link abnormality; or
当所述检测装置确定所述HBA的端口的wwpn地址与所述Switch上对应端口的wwpn均不为空,且所述两个wwpn地址不一致时,所述检测装置确定所述故障的类型为所述故障的类型为误连接。When the detection device determines that the wwpn address of the port of the HBA and the wwpn of the corresponding port on the Switch are not empty, and the two wwpn addresses are inconsistent, the detection device determines that the type of the fault is the The above fault type is wrong connection.
根据第一方面的第三种可能的实现方式,在第五种可能的实现方式中,在所述检测装置确定所述存储设备的控制器与所述Switch之间存在故障之后,还包括:According to the third possible implementation manner of the first aspect, in the fifth possible implementation manner, after the detection device determines that there is a fault between the controller of the storage device and the Switch, it further includes:
当所述检测装置确定所述存储设备的控制器的端口的wwpn地址不为空,而所述Switch上对应端口的wwpn地址为空时,所述检测装置确定所述存储设备与所述Switch之间的故障的类型为链路异常;或者When the detection device determines that the wwpn address of the port of the controller of the storage device is not empty, and the wwpn address of the corresponding port on the Switch is empty, the detection device determines that the connection between the storage device and the Switch The type of fault between is link abnormal; or
当所述检测装置确定所述存储设备的控制器的端口的wwpn地址与所述Switch上对应端口已连接设备的地址均不为空,且所述两个wwpn地址不一致时,则所述检测装置确定所述存储设备的控制器与所述Switch之间的故障的类型为所述存储设备的控制器与所述Switch之间未连接光纤。When the detection device determines that the wwpn address of the port of the controller of the storage device and the address of the device connected to the corresponding port on the Switch are not empty, and the two wwpn addresses are inconsistent, the detection device It is determined that the type of the fault between the controller of the storage device and the Switch is that no optical fiber is connected between the controller of the storage device and the Switch.
根据第一方面的第二种可能的实现方式,在第六种可能的实现方式中,所述节点包括:主机总线适配器HBA和存储设备;According to the second possible implementation manner of the first aspect, in a sixth possible implementation manner, the node includes: a host bus adapter HBA and a storage device;
所述检测装置将至少两个所述节点的互相对应的端口的状态信息进行比较,根据比较结果确定是否存在故障,包括:The detection device compares the status information of the ports corresponding to each other at least two nodes, and determines whether there is a fault according to the comparison result, including:
所述检测装置将HBA的端口的wwpn地址与存储设备的控制器的wwpn地址进行比较,若所述HBA的端口的wwpn地址与所述存储设备的控制器的wwpn地址不一致,则确定所述HBA与所述存储设备的控制器之间存在故障。The detection device compares the wwpn address of the port of the HBA with the wwpn address of the controller of the storage device, and if the wwpn address of the port of the HBA is inconsistent with the wwpn address of the controller of the storage device, it is determined that the HBA There is a failure with the controller of the storage device.
根据第一方面的第六种可能的实现方式,在第七种可能的实现方式中,在所述检测装置确定所述HBA与所述存储设备的控制器之间存在故障之后,还包括:According to the sixth possible implementation manner of the first aspect, in the seventh possible implementation manner, after the detection device determines that there is a fault between the HBA and the controller of the storage device, it further includes:
当所述检测装置确定所述HBA的端口的wwpn地址不为空,而所述Switch上对应端口的wwpn地址为空时,则检测装置确定所述HBA与所述存储设备的控制器之间的故障的类型为链路异常;或者When the detecting device determines that the wwpn address of the port of the HBA is not empty, and the wwpn address of the corresponding port on the Switch is empty, the detecting device determines the connection between the HBA and the controller of the storage device The type of fault is a link anomaly; or
当所述检测装置确定所述HBA的端口的wwpn地址与所述存储设备的控制器的wwpn地址均不为空且所述两个wwpn地址不一致时,所述检测装置确定所述HBA与所述存储设备的控制器之间的故障的类型为误连接。When the detecting means determines that neither the wwpn address of the port of the HBA nor the wwpn address of the controller of the storage device is empty and the two wwpn addresses are inconsistent, the detecting means determines that the HBA and the The type of failure between the controllers of the storage device is misconnection.
根据第一方面的第二种可能的实现方式,在第八种可能的实现方式中,所述节点包括:存储设备的第一控制器和第二控制器,所述第一控制器和第二控制器分别与主机总线适配器HBA连接,构成第一链路和第二链路,或分别与交换机Switch连接,构成第一链路和第二链路,所述第一控制器和第二控制器的链路互为备份;According to the second possible implementation manner of the first aspect, in an eighth possible implementation manner, the node includes: a first controller and a second controller of the storage device, and the first controller and the second The controllers are respectively connected to the host bus adapter HBA to form the first link and the second link, or respectively connected to the switch Switch to form the first link and the second link, the first controller and the second controller The links of each other are backups;
所述检测装置将至少两个所述节点的互相对应的端口的状态信息进行比较,根据比较结果确定是否存在故障,包括:The detection device compares the status information of the ports corresponding to each other at least two nodes, and determines whether there is a fault according to the comparison result, including:
所述检测装置将所述第一链路对应的所述第一控制器的端口的wwpn地址与所述第二链路对应的所述第二控制器的端口的wwpn地址进行比较,若所述两个wwpn地址不一致,则确定所述存储设备的连接存在故障。The detection device compares the wwpn address of the port of the first controller corresponding to the first link with the wwpn address of the port of the second controller corresponding to the second link, if the If the two wwpn addresses are inconsistent, it is determined that there is a fault in the connection of the storage device.
第二方面,本发明实施例提供一种链路故障检测装置,包括:In a second aspect, an embodiment of the present invention provides a link fault detection device, including:
收集模块,用于收集光纤信道网络中至少两个节点的互相对应的端口的状态信息;A collecting module, configured to collect state information of ports corresponding to each other of at least two nodes in the fiber channel network;
确定模块,用于将至少两个所述节点的互相对应的端口的状态信息进行比较,根据比较结果确定是否存在故障。The determination module is configured to compare the status information of the ports corresponding to each other at least two nodes, and determine whether there is a fault according to the comparison result.
在第二方面的第一种可能的实现方式中,所述收集模块具体用于:In a first possible implementation manner of the second aspect, the collection module is specifically configured to:
通过同一接口收集光纤信道网络中所有节点的所有端口的状态信息。Collect status information for all ports of all nodes in the Fiber Channel network through the same interface.
在第二方面的第二种可能的实现方式中,所述端口的状态信息包括以下至少一个:端口类型、连接状态、工作模式、自协商结果和全球端口名称wwpn地址。In a second possible implementation manner of the second aspect, the state information of the port includes at least one of the following: port type, connection state, working mode, auto-negotiation result, and worldwide port name wwpn address.
根据第二方面的第二种可能的实现方式,在第三种可能的实现方式中,所述节点包括:主机总线适配器HBA、交换机Switch和存储设备;According to a second possible implementation of the second aspect, in a third possible implementation, the node includes: a host bus adapter HBA, a switch Switch, and a storage device;
所述确定模块具体用于:The determination module is specifically used for:
将所述HBA的端口的wwpn地址与所述Switch上对应端口已连接设备的地址进行比较,若所述HBA的端口的wwpn地址与所述Switch上对应端口已连接设备的地址不一致,则确定所述HBA卡与所述Switch之间存在故障;或者Compare the wwpn address of the port of the HBA with the address of the device connected to the corresponding port on the Switch, if the wwpn address of the port of the HBA is inconsistent with the address of the device connected to the corresponding port on the Switch, then determine the There is a fault between the HBA card and the Switch; or
将所述存储设备的控制器的端口的wwpn地址与Switch上对应端口已连接设备的地址进行比较,若所述存储设备的控制器的端口的wwpn地址与所述Switch上对应端口已连接设备的地址不一致,则确定所述存储设备的控制器与所述Switch之间存在故障;或者Comparing the wwpn address of the port of the controller of the storage device with the address of the device connected to the corresponding port on the Switch, if the wwpn address of the port of the controller of the storage device is the same as the address of the device connected to the corresponding port on the Switch If the addresses are inconsistent, it is determined that there is a fault between the controller of the storage device and the Switch; or
判断所述Switch上连接不同设备的端口的端口类型是否相同,若相同,则确定所述Switch的连接或配置存在故障。It is judged whether the port types of the ports connected to different devices on the Switch are the same, and if they are the same, it is determined that there is a fault in the connection or configuration of the Switch.
根据第二方面的第三种可能的实现方式,在第四种可能的实现方式中,所述确定模块还用于:According to a third possible implementation of the second aspect, in a fourth possible implementation, the determining module is further configured to:
当确定所述HBA的端口的wwpn地址不为空,而所述Switch上对应端口的wwpn地址为空时,确定所述故障的类型为链路异常;或者When it is determined that the wwpn address of the port of the HBA is not empty, and the wwpn address of the corresponding port on the Switch is empty, it is determined that the type of the fault is a link exception; or
当确定所述HBA的端口的wwpn地址与所述Switch上对应端口的wwpn均不为空,且所述两个wwpn地址不一致时,确定所述故障的类型为所述故障的类型为误连接。When it is determined that the wwpn address of the port of the HBA and the wwpn of the corresponding port on the Switch are not empty, and the two wwpn addresses are inconsistent, it is determined that the type of the fault is that the type of the fault is a misconnection.
根据第二方面的第三种可能的实现方式,在第五种可能的实现方式中,所述确定模块还用于:According to a third possible implementation of the second aspect, in a fifth possible implementation, the determining module is further configured to:
当确定所述存储设备的控制器的端口的wwpn地址不为空,而所述Switch上对应端口的wwpn地址为空时,确定所述存储设备与所述Switch之间的故障的类型为链路异常;或者When it is determined that the wwpn address of the port of the controller of the storage device is not empty, and the wwpn address of the corresponding port on the Switch is empty, determine that the type of failure between the storage device and the Switch is a link abnormal; or
当确定所述存储设备的控制器的端口的wwpn地址与所述Switch上对应端口已连接设备的地址均不为空,且所述两个wwpn地址不一致时,则确定所述存储设备的控制器与所述Switch之间的故障的类型为所述存储设备的控制器与所述Switch之间未连接光纤。When it is determined that the wwpn address of the port of the controller of the storage device and the address of the device connected to the corresponding port on the Switch are not empty, and the two wwpn addresses are inconsistent, then determine the controller of the storage device The type of the fault with the Switch is that the optical fiber is not connected between the controller of the storage device and the Switch.
根据第二方面的第二种可能的实现方式,在第六种可能的实现方式中,所述节点包括:主机总线适配器HBA和存储设备;According to a second possible implementation manner of the second aspect, in a sixth possible implementation manner, the node includes: a host bus adapter HBA and a storage device;
所述确定模块具体用于:The determination module is specifically used for:
将HBA的端口的wwpn地址与存储设备的控制器的wwpn地址进行比较,若所述HBA的端口的wwpn地址与所述存储设备的控制器的wwpn地址不一致,则确定所述HBA与所述存储设备的控制器之间存在故障。Comparing the wwpn address of the port of the HBA with the wwpn address of the controller of the storage device, if the wwpn address of the port of the HBA is inconsistent with the wwpn address of the controller of the storage device, it is determined that the HBA and the storage device There is a fault between the controllers of the device.
根据第二方面的第六种可能的实现方式,在第七种可能的实现方式中,所述确定模块还用于:According to the sixth possible implementation manner of the second aspect, in a seventh possible implementation manner, the determining module is further configured to:
当确定所述HBA的端口的wwpn地址不为空,而所述Switch上对应端口的wwpn地址为空时,则检测装置确定所述HBA与所述存储设备的控制器之间的故障的类型为链路异常;或者When it is determined that the wwpn address of the port of the HBA is not empty, and the wwpn address of the corresponding port on the Switch is empty, then the detection device determines that the type of fault between the HBA and the controller of the storage device is link abnormality; or
当确定所述HBA的端口的wwpn地址与所述存储设备的控制器的wwpn地址均不为空且所述两个wwpn地址不一致时,确定所述HBA与所述存储设备的控制器之间的故障的类型为误连接。When it is determined that the wwpn address of the port of the HBA and the wwpn address of the controller of the storage device are not empty and the two wwpn addresses are inconsistent, determine the connection between the HBA and the controller of the storage device The type of failure is misconnection.
根据第二方面的第二种可能的实现方式,在第八种可能的实现方式中,所述节点包括:存储设备的第一控制器和第二控制器,所述第一控制器和第二控制器分别与主机总线适配器HBA连接,构成第一链路和第二链路,或分别与交换机Switch连接,构成第一链路和第二链路,所述第一控制器和第二控制器的链路互为备份;According to the second possible implementation manner of the second aspect, in an eighth possible implementation manner, the node includes: a first controller and a second controller of the storage device, and the first controller and the second The controllers are respectively connected to the host bus adapter HBA to form the first link and the second link, or respectively connected to the switch Switch to form the first link and the second link, the first controller and the second controller The links of each other are backups;
所述确定模块具体用于:The determination module is specifically used for:
将所述第一链路对应的所述第一控制器的端口的wwpn地址与所述第二链路对应的所述第二控制器的端口的wwpn地址进行比较,若所述两个wwpn地址不一致,则确定所述存储设备的连接存在故障。Comparing the wwpn address of the port of the first controller corresponding to the first link with the wwpn address of the port of the second controller corresponding to the second link, if the two wwpn addresses If they are inconsistent, it is determined that there is a fault in the connection of the storage device.
本发明实施例提供的链路故障检测方法和装置,通过统一收集整个链路中各端口的状态信息,并通过将至少两个所述节点的互相对应的端口的状态信息进行比较,检查所述比较结果是否符合预期,从而判断是否存在故障,相比与现有技术的检测方法,能够检测更为复杂和隐蔽的故障,从而提高FC网络的可靠性。The link fault detection method and device provided by the embodiments of the present invention collect the state information of each port in the entire link uniformly, and compare the state information of the ports corresponding to each other at least two nodes to check the Whether the comparison result meets expectations, thereby judging whether there is a fault, compared with the detection method in the prior art, can detect more complex and hidden faults, thereby improving the reliability of the FC network.
附图说明Description of drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作一简单地介绍,显而易见地,下面描述中的附图是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present invention or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description These are some embodiments of the present invention. For those skilled in the art, other drawings can also be obtained according to these drawings without any creative effort.
图1为一种典型的SAN网络的结构示意图;Fig. 1 is a schematic structural diagram of a typical SAN network;
图2为一种典型的DAS网络的结构示意图;Fig. 2 is a schematic structural diagram of a typical DAS network;
图3为本发明链路故障检测方法实施例的流程图;Fig. 3 is the flow chart of the embodiment of link fault detection method of the present invention;
图4为本发明链路故障检测装置实施例的结构示意图。FIG. 4 is a schematic structural diagram of an embodiment of a link fault detection device according to the present invention.
具体实施方式Detailed ways
为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.
图1为一种典型的SAN网络的结构示意图,如图1所示,SAN网络中包括HBA、交换机和磁盘阵列,其中磁盘阵列包括控制器Controller和存储器。图1所示的SAN组网中,主用链路是第一交换机(例如Slot7FC Switch)一端连接磁盘阵列的第一控制器(Controller A),另一端连接HBA的零号端口(port0端口),备份链路是第二交换机(例如Slot8FC Switch)的一端连接磁盘阵列的第二控制器(Controller B),另一端连接HBA的一号端口(port1端口)。如果Slot8FC Switch与Controller B之间的光纤线没有连接或者损坏。一旦主用链路发生故障,备份链路也是不可用的,此时上层的业务将中断。FIG. 1 is a schematic structural diagram of a typical SAN network. As shown in FIG. 1 , the SAN network includes HBAs, switches, and disk arrays, wherein the disk array includes a controller and storage. In the SAN network shown in Figure 1, the primary link is the first switch (for example, Slot7FC Switch) with one end connected to the first controller (Controller A) of the disk array, and the other end connected to the zero port (port0 port) of the HBA. The backup link is that one end of the second switch (for example, Slot8FC Switch) is connected to the second controller (Controller B) of the disk array, and the other end is connected to port 1 (port1) of the HBA. If the optical fiber cable between the Slot8FC Switch and Controller B is not connected or damaged. Once the active link fails, the backup link is also unavailable, and the upper-layer services will be interrupted at this time.
图2为一种典型的DAS网络的结构示意图,如图2所示,FC网络中包括HBA和磁盘阵列,一种常见的连接方式为:所有HBA的零号端口(port0)连接磁盘阵列的第一控制器Controller A;所有HBA卡一号端口(port1)端口连接磁盘阵列的第二控制器Controller B,当某个HBA的port1出现故障,端口不能连接,上层的业务将中断。Figure 2 is a schematic diagram of a typical DAS network structure. As shown in Figure 2, the FC network includes HBAs and disk arrays. A common connection method is: all HBA port zero (port0) connects to the first port of the disk array. One controller, Controller A; the first port (port1) of all HBA cards is connected to the second controller Controller B of the disk array. When port1 of an HBA fails, the port cannot be connected, and the upper-layer business will be interrupted.
图3为本发明链路故障检测方法实施例的流程图,本实施例的方法可以应用于如图1和图2所示的FC网络,也可以应用于其他FC网络。本实施例中,检测装置可以为网管软件进程。如图3所示,本实施例的方法可以包括:FIG. 3 is a flow chart of an embodiment of a link fault detection method according to the present invention. The method of this embodiment can be applied to the FC network shown in FIG. 1 and FIG. 2 , and can also be applied to other FC networks. In this embodiment, the detection device may be a network management software process. As shown in Figure 3, the method of this embodiment may include:
步骤301、检测装置收集光纤信道网络中至少两个节点的互相对应的端口的状态信息。
其中,所述端口的状态信息包括以下至少一个:端口类型、连接状态、工作模式、自协商结果和全球端口名称(World Wide Port Name,简称:wwpn)地址。状态信息也可以包括其他参数,例如链路两端的端口速率是否一致,是否能够从一个端口正确获取对端的WWPN号等,本发明实施例对此不做限定。Wherein, the status information of the port includes at least one of the following: port type, connection status, working mode, auto-negotiation result and World Wide Port Name (WWPN) address. The status information may also include other parameters, such as whether the speeds of the ports at both ends of the link are the same, whether the WWPN number of the opposite end can be correctly obtained from one port, etc., which is not limited in this embodiment of the present invention.
进一步地,在具体实现时,可以设置为,检测装置通过同一接口收集光纤信道网络中所有节点的所有端口的状态信息。Further, during specific implementation, it may be set that the detection device collects status information of all ports of all nodes in the fiber channel network through the same interface.
相比于现有技术的检测方法,本实施例中,统一收集端口的状态信息:而不是各个节点单独进行。Compared with the detection method in the prior art, in this embodiment, the status information of the ports is collected in a unified manner: instead of being performed by each node separately.
步骤302、所述检测装置将至少两个所述节点的互相对应的端口的状态信息进行比较,根据比较结果确定是否存在故障。Step 302, the detecting device compares the state information of the ports corresponding to each other at least two nodes, and determines whether there is a fault according to the comparison result.
具体地,检测装置可以校验FC组网中所有端口的状态信息是否符合预期,如果检测到实际结果不符合预期则可以确定存在故障,可以上报告警。Specifically, the detection device can verify whether the state information of all ports in the FC network meets expectations, and if the detected actual results do not meet expectations, it can be determined that there is a fault and an alarm can be reported.
具体实现时,可以将HBA卡的wwpn地址与Switch上对应端口已连接设备的地址进行比较,如果对应端口没有查询到wwpn地址说明HBA卡与Switch之间链路异常,如果wwpn地址错误说明存在误连接;In actual implementation, you can compare the wwpn address of the HBA card with the address of the device connected to the corresponding port on the Switch. If the corresponding port does not find the wwpn address, it means that the link between the HBA card and the Switch is abnormal. If the wwpn address is wrong, it means that there is an error. connect;
也可以将存储设备控制器的wwpn地址与Switch上对应端口已连接设备的地址进行比较,如果对应端口没有查询到wwpn地址说明存储设备与Switch之间链路异常,如果wwpn地址错误说明存储设备与Switch之间没有连接光纤。You can also compare the wwpn address of the storage device controller with the address of the device connected to the corresponding port on the Switch. If the wwpn address is not found on the corresponding port, it means that the link between the storage device and the Switch is abnormal. No fiber is connected between the switches.
并且,交换机上连接不同设备的端口的端口类型应该不同,因此,通过判断端口类型也可以确定是否存在配置错误或者连接错误的情况。In addition, the port types of ports connected to different devices on the switch should be different. Therefore, by judging the port type, it can also be determined whether there is a configuration error or a connection error.
本实施例,通过统一收集整个链路中各端口的状态信息,并通过将至少两个所述节点的互相对应的端口的状态信息进行比较,检查所述比较结果是否符合预期,从而判断是否存在故障,相比与现有技术的检测方法,能够检测更为复杂和隐蔽的故障,从而提高FC网络的可靠性。In this embodiment, by uniformly collecting the state information of each port in the entire link, and comparing the state information of at least two corresponding ports of the nodes, checking whether the comparison result meets expectations, thereby judging whether there is The fault, compared with the detection method in the prior art, can detect more complex and hidden faults, thereby improving the reliability of the FC network.
可选地,在一种场景中,当上述实施例的方法应用于SAN网络时,所述节点包括:主机总线适配器HBA、交换机Switch和存储设备,步骤302具体为:Optionally, in a scenario, when the method of the foregoing embodiment is applied to a SAN network, the nodes include: a host bus adapter HBA, a switch Switch, and a storage device, and step 302 is specifically:
所述检测装置将所述HBA的端口的wwpn地址与所述Switch上对应端口已连接设备的地址进行比较,若所述HBA的端口的wwpn地址与所述Switch上对应端口已连接设备的地址不一致,则确定所述HBA卡与所述Switch之间存在故障;或者The detection device compares the wwpn address of the port of the HBA with the address of the device connected to the corresponding port on the Switch, if the wwpn address of the port of the HBA is inconsistent with the address of the device connected to the corresponding port on the Switch , it is determined that there is a fault between the HBA card and the Switch; or
所述检测装置将所述存储设备的控制器的端口的wwpn地址与Switch上对应端口已连接设备的地址进行比较,若所述存储设备的控制器的端口的wwpn地址与所述Switch上对应端口已连接设备的地址不一致,则确定所述存储设备的控制器与所述Switch之间存在故障;或者The detection device compares the wwpn address of the port of the controller of the storage device with the address of the device connected to the corresponding port on the Switch, if the wwpn address of the port of the controller of the storage device is the same as that of the corresponding port on the Switch If the addresses of the connected devices are inconsistent, it is determined that there is a fault between the controller of the storage device and the Switch; or
所述检测装置判断所述Switch上连接不同设备的端口的端口类型是否相同,若相同,则确定所述Switch的连接或配置存在故障。The detection device judges whether the port types of the ports connected to different devices on the Switch are the same, and if they are the same, it is determined that there is a fault in the connection or configuration of the Switch.
进一步可选地,若所述检测装置确定所述HBA与所述Switch之间存在故障,则步骤302之后,还可以包括:Further optionally, if the detection device determines that there is a fault between the HBA and the Switch, after step 302, it may further include:
当所述检测装置确定所述HBA的端口的wwpn地址不为空,而所述Switch上对应端口的wwpn地址为空时,所述检测装置确定所述故障的类型为链路异常;或者When the detection device determines that the wwpn address of the port of the HBA is not empty, and the wwpn address of the corresponding port on the Switch is empty, the detection device determines that the type of the fault is a link abnormality; or
当所述检测装置确定所述HBA的端口的wwpn地址与所述Switch上对应端口的wwpn均不为空,且所述两个wwpn地址不一致时,所述检测装置确定所述故障的类型为所述故障的类型为误连接。When the detection device determines that the wwpn address of the port of the HBA and the wwpn of the corresponding port on the Switch are not empty, and the two wwpn addresses are inconsistent, the detection device determines that the type of the fault is the The above fault type is wrong connection.
进一步可选地,若所述检测装置确定所述存储设备的控制器与所述Switch之间存在故障,则步骤302之后,还可以包括:Further optionally, if the detecting device determines that there is a fault between the controller of the storage device and the Switch, after step 302, it may further include:
当所述检测装置确定所述存储设备的控制器的端口的wwpn地址不为空,而所述Switch上对应端口的wwpn地址为空时,所述检测装置确定所述存储设备与所述Switch之间的故障的类型为链路异常;或者When the detection device determines that the wwpn address of the port of the controller of the storage device is not empty, and the wwpn address of the corresponding port on the Switch is empty, the detection device determines that the connection between the storage device and the Switch The type of fault between is link abnormal; or
当所述检测装置确定所述存储设备的控制器的端口的wwpn地址与所述Switch上对应端口已连接设备的地址均不为空,且所述两个wwpn地址不一致时,则所述检测装置确定所述存储设备的控制器与所述Switch之间的故障的类型为所述存储设备的控制器与所述Switch之间未连接光纤。When the detection device determines that the wwpn address of the port of the controller of the storage device and the address of the device connected to the corresponding port on the Switch are not empty, and the two wwpn addresses are inconsistent, the detection device It is determined that the type of the fault between the controller of the storage device and the Switch is that no optical fiber is connected between the controller of the storage device and the Switch.
以图1所示SAN网络的主链路为例,步骤302具体可以为:所述检测装置将所述HBA的port0端口的wwpn地址与Slot7FC Switch上对应端口已连接设备的地址进行比较,若所述HBA的port0端口的wwpn地址与Slot7FCSwitch上对应端口已连接设备的地址不一致,则确定所述HBA与Slot7FCSwitch之间存在故障;和\或,所述检测装置将所述磁盘阵列的Controller A的端口的wwpn地址与Slot7FC Switch上对应端口已连接设备的地址进行比较,若所述磁盘阵列的Controller A的端口的wwpn地址与Slot7FC Switch上对应端口已连接设备的地址不一致,则确定所述磁盘阵列的控制器与Controller A之间存在故障;和\或,所述检测装置判断Controller A上连接不同设备的端口的端口类型是否相同,若相同,则确定所述Switch的连接或配置存在故障。Taking the main link of the SAN network shown in Figure 1 as an example, step 302 can specifically be: the detection device compares the wwpn address of the port0 port of the HBA with the address of the connected device on the corresponding port on the Slot7FC Switch, if the If the wwpn address of the port0 port of the HBA is inconsistent with the address of the connected device on the corresponding port on the Slot7FCSwitch, then it is determined that there is a fault between the HBA and the Slot7FCSwitch; The wwpn address of the corresponding port on the Slot7FC Switch is compared with the address of the device connected to the corresponding port. If the wwpn address of the port of Controller A of the disk array is inconsistent with the address of the device connected to the corresponding port on the Slot7FC Switch, determine the address of the device connected to the corresponding port of the disk array. There is a fault between the controller and Controller A; and/or, the detection device judges whether the port types of ports connected to different devices on Controller A are the same, and if they are the same, it is determined that there is a fault in the connection or configuration of the Switch.
对于备链路的检测与上述对主链路的检测类似,只要将相应的设备及端口替换为备链路中的设备及端口即可。若对备链路检测结果为:可以在Slot8FC Switch上查询到所有连接的HAB的port1端口的wwpn地址,但在连接磁阵的端口却查询不到磁阵控制器Controller B的wwpn地址,则可以确定备份链路存在误连接或连接异常,此时可以产生告警,提示用户排查备份链路的状态。The detection of the standby link is similar to the above detection of the primary link, as long as the corresponding equipment and ports are replaced with the equipment and ports in the standby link. If the detection result of the standby link is: the wwpn address of port1 port of all connected HABs can be queried on the Slot8FC Switch, but the wwpn address of the disk array controller Controller B cannot be queried on the port connected to the disk array, then you can If it is determined that there is a wrong connection or an abnormal connection in the backup link, an alarm can be generated at this time to prompt the user to check the status of the backup link.
可选地,在另一种场景中,当上述实施例的方法应用于DSA网络时,所述节点包括:主机总线适配器HBA和存储设备;Optionally, in another scenario, when the method in the foregoing embodiment is applied to a DSA network, the nodes include: a host bus adapter HBA and a storage device;
步骤302具体为:Step 302 is specifically:
所述检测装置将HBA的端口的wwpn地址与存储设备的控制器的wwpn地址进行比较,若所述HBA的端口的wwpn地址与所述存储设备的控制器的wwpn地址不一致,则确定所述HBA与所述存储设备的控制器之间存在故障。The detection device compares the wwpn address of the port of the HBA with the wwpn address of the controller of the storage device, and if the wwpn address of the port of the HBA is inconsistent with the wwpn address of the controller of the storage device, it is determined that the HBA There is a failure with the controller of the storage device.
进一步地,若所述检测装置确定所述HBA与所述存储设备的控制器之间存在故障,则步骤302之后,还可以包括:Further, if the detecting device determines that there is a fault between the HBA and the controller of the storage device, after step 302, it may further include:
当所述检测装置确定所述HBA的端口的wwpn地址不为空,而所述Switch上对应端口的wwpn地址为空时,则检测装置确定所述HBA与所述存储设备的控制器之间的故障的类型为链路异常;或者When the detecting device determines that the wwpn address of the port of the HBA is not empty, and the wwpn address of the corresponding port on the Switch is empty, the detecting device determines the connection between the HBA and the controller of the storage device The type of failure is a link anomaly; or
当所述检测装置确定所述HBA的端口的wwpn地址与所述存储设备的控制器的wwpn地址均不为空且所述两个wwpn地址不一致时,所述检测装置确定所述HBA与所述存储设备的控制器之间的故障的类型为误连接。When the detecting means determines that neither the wwpn address of the port of the HBA nor the wwpn address of the controller of the storage device is empty and the two wwpn addresses are inconsistent, the detecting means determines that the HBA and the The type of failure between the controllers of the storage device is misconnection.
以图2所示的DAS网络的主链路为例,步骤302具体可以为:Taking the main link of the DAS network shown in FIG. 2 as an example, step 302 may specifically be:
所述检测装置将HBA的port0端口的wwpn地址与磁盘阵列的ControllerA的wwpn地址进行比较,若所述HBA的port0端口的wwpn地址与所述磁盘阵列的Controller A的wwpn地址不一致,则可以确定所述HBA与所述存储设备的控制器之间存在故障。The detection device compares the wwpn address of the port0 port of the HBA with the wwpn address of the ControllerA of the disk array, if the wwpn address of the port0 port of the HBA is inconsistent with the wwpn address of the Controller A of the disk array, it can be determined that the There is a fault between the HBA and the controller of the storage device.
在本发明链路故障检测方法的另一种应用场景为,所述节点包括:存储设备的第一控制器和第二控制器,所述第一控制器和第二控制器分别与主机总线适配器HBA连接,构成第一链路和第二链路,或分别与交换机Switch连接,构成第一链路和第二链路,所述第一控制器和第二控制器的链路互为备份;In another application scenario of the link fault detection method of the present invention, the node includes: a first controller and a second controller of the storage device, and the first controller and the second controller are respectively connected to the host bus adapter The HBA is connected to form a first link and a second link, or is respectively connected to a switch Switch to form a first link and a second link, and the links of the first controller and the second controller are mutually backed up;
步骤302可以包括:Step 302 may include:
所述检测装置将所述第一链路对应的所述第一控制器的端口的wwpn地址与所述第二链路对应的所述第二控制器的端口的wwpn地址进行比较,若所述两个wwpn地址不一致,则确定所述存储设备的连接存在故障。The detection device compares the wwpn address of the port of the first controller corresponding to the first link with the wwpn address of the port of the second controller corresponding to the second link, if the If the two wwpn addresses are inconsistent, it is determined that there is a fault in the connection of the storage device.
例如针对图1或图2的FC网络,都可以比较Controller A和Controller B的对应端口的wwpn地址,如果两个对应端口的wwpn地址不一致,则可以确定所述存储设备的连接存在故障。例如,若检测到Controller A在某个端口有连接设备的wwpn地址,而在Controller B上对应的端口没有发现该连接设备的wwpn地址,则可以确定存储设备的连接存在故障,此时可以产生告警,提示用户排查故障。For example, for the FC network in Figure 1 or Figure 2, the wwpn addresses of the corresponding ports of Controller A and Controller B can be compared, if the wwpn addresses of the two corresponding ports are inconsistent, it can be determined that the connection of the storage device is faulty. For example, if it is detected that Controller A has the wwpn address of the connected device on a certain port, but the corresponding port on Controller B does not find the wwpn address of the connected device, it can be determined that there is a failure in the connection of the storage device, and an alarm can be generated at this time , prompting the user to troubleshoot.
图4为本发明链路故障检测装置实施例的结构示意图,如图4所示,本实施例的装置400可以包括:收集模块11和确定模块12,其中,FIG. 4 is a schematic structural diagram of an embodiment of a link fault detection device according to the present invention. As shown in FIG. 4 , the
收集模块11,可以用于收集光纤信道网络中至少两个节点的互相对应的端口的状态信息;The
确定模块12,可以用于将至少两个所述节点的互相对应的端口的状态信息进行比较,根据比较结果确定是否存在故障。The
本实施例的装置,可以用于执行图3所示方法实施例的技术方案,其实现原理类似,此处不再赘述。The device of this embodiment can be used to implement the technical solution of the method embodiment shown in FIG. 3 , and its implementation principles are similar, so details are not repeated here.
本实施例的链路故障检测装置实施例,通过统一收集整个链路中各端口的状态信息,并通过将至少两个所述节点的互相对应的端口的状态信息进行比较,检查所述比较结果是否符合预期,从而判断是否存在故障,相比与现有技术的检测方法,能够检测更为复杂和隐蔽的故障,从而提高FC网络的可靠性。In the embodiment of the link fault detection device in this embodiment, the state information of each port in the entire link is uniformly collected, and the state information of at least two corresponding ports of the nodes is compared to check the comparison result Whether it meets expectations, so as to determine whether there is a fault. Compared with the detection method in the prior art, it can detect more complex and hidden faults, thereby improving the reliability of the FC network.
进一步可选地,所述收集模块11具体可以用于:Further optionally, the
通过同一接口收集光纤信道网络中所有节点的所有端口的状态信息。Collect status information for all ports of all nodes in the Fiber Channel network through the same interface.
进一步可选地,所述端口的状态信息包括以下至少一个:端口类型、连接状态、工作模式、自协商结果和全球端口名称wwpn地址。Further optionally, the state information of the port includes at least one of the following: port type, connection state, working mode, auto-negotiation result and worldwide port name wwpn address.
进一步可选地,所述节点包括:主机总线适配器HBA、交换机Switch和存储设备;Further optionally, the node includes: a host bus adapter HBA, a switch Switch, and a storage device;
所述确定模块12具体可以用于:The
将所述HBA的端口的wwpn地址与所述Switch上对应端口已连接设备的地址进行比较,若所述HBA的端口的wwpn地址与所述Switch上对应端口已连接设备的地址不一致,则确定所述HBA卡与所述Switch之间存在故障;或者Compare the wwpn address of the port of the HBA with the address of the device connected to the corresponding port on the Switch, if the wwpn address of the port of the HBA is inconsistent with the address of the device connected to the corresponding port on the Switch, then determine the There is a fault between the HBA card and the Switch; or
将所述存储设备的控制器的端口的wwpn地址与Switch上对应端口已连接设备的地址进行比较,若所述存储设备的控制器的端口的wwpn地址与所述Switch上对应端口已连接设备的地址不一致,则确定所述存储设备的控制器与所述Switch之间存在故障;或者Comparing the wwpn address of the port of the controller of the storage device with the address of the device connected to the corresponding port on the Switch, if the wwpn address of the port of the controller of the storage device is the same as the address of the device connected to the corresponding port on the Switch If the addresses are inconsistent, it is determined that there is a fault between the controller of the storage device and the Switch; or
判断所述Switch上连接不同设备的端口的端口类型是否相同,若相同,则确定所述Switch的连接或配置存在故障。It is judged whether the port types of ports connected to different devices on the Switch are the same, and if they are the same, it is determined that there is a fault in the connection or configuration of the Switch.
进一步可选地,所述确定模块12还可以用于:Further optionally, the determining
当确定所述HBA的端口的wwpn地址不为空,而所述Switch上对应端口的wwpn地址为空时,确定所述故障的类型为链路异常;或者When it is determined that the wwpn address of the port of the HBA is not empty, and the wwpn address of the corresponding port on the Switch is empty, it is determined that the type of the fault is a link exception; or
当确定所述HBA的端口的wwpn地址与所述Switch上对应端口的wwpn均不为空,且所述两个wwpn地址不一致时,确定所述故障的类型为所述故障的类型为误连接。When it is determined that the wwpn address of the port of the HBA and the wwpn of the corresponding port on the Switch are not empty, and the two wwpn addresses are inconsistent, it is determined that the type of the fault is that the type of the fault is a misconnection.
进一步可选地,所述确定模块12还可以用于:Further optionally, the determining
当确定所述存储设备的控制器的端口的wwpn地址不为空,而所述Switch上对应端口的wwpn地址为空时,确定所述存储设备与所述Switch之间的故障的类型为链路异常;或者When it is determined that the wwpn address of the port of the controller of the storage device is not empty, and the wwpn address of the corresponding port on the Switch is empty, determine that the type of failure between the storage device and the Switch is a link abnormal; or
当确定所述存储设备的控制器的端口的wwpn地址与所述Switch上对应端口已连接设备的地址均不为空,且所述两个wwpn地址不一致时,则确定所述存储设备的控制器与所述Switch之间的故障的类型为所述存储设备的控制器与所述Switch之间未连接光纤。When it is determined that the wwpn address of the port of the controller of the storage device and the address of the device connected to the corresponding port on the Switch are not empty, and the two wwpn addresses are inconsistent, then determine the controller of the storage device The type of the fault with the Switch is that the optical fiber is not connected between the controller of the storage device and the Switch.
进一步可选地,所述节点可以包括:主机总线适配器HBA和存储设备;Further optionally, the node may include: a host bus adapter HBA and a storage device;
所述确定模块12具体可以用于:The
将HBA的端口的wwpn地址与存储设备的控制器的wwpn地址进行比较,若所述HBA的端口的wwpn地址与所述存储设备的控制器的wwpn地址不一致,则确定所述HBA与所述存储设备的控制器之间存在故障。Comparing the wwpn address of the port of the HBA with the wwpn address of the controller of the storage device, if the wwpn address of the port of the HBA is inconsistent with the wwpn address of the controller of the storage device, it is determined that the HBA and the storage device There is a fault between the controllers of the device.
进一步可选地,所述确定模块12还可以用于:Further optionally, the determining
当确定所述HBA的端口的wwpn地址不为空,而所述Switch上对应端口的wwpn地址为空时,则检测装置确定所述HBA与所述存储设备的控制器之间的故障的类型为链路异常;或者When it is determined that the wwpn address of the port of the HBA is not empty, and the wwpn address of the corresponding port on the Switch is empty, then the detection device determines that the type of fault between the HBA and the controller of the storage device is link abnormality; or
当确定所述HBA的端口的wwpn地址与所述存储设备的控制器的wwpn地址均不为空且所述两个wwpn地址不一致时,确定所述HBA与所述存储设备的控制器之间的故障的类型为误连接。When it is determined that the wwpn address of the port of the HBA and the wwpn address of the controller of the storage device are not empty and the two wwpn addresses are inconsistent, determine the connection between the HBA and the controller of the storage device The type of failure is misconnection.
进一步可选地,所述节点可以包括:存储设备的第一控制器和第二控制器,所述第一控制器和第二控制器分别与主机总线适配器HBA连接,构成第一链路和第二链路,或分别与交换机Switch连接,构成第一链路和第二链路,所述第一控制器和第二控制器的链路互为备份;Further optionally, the node may include: a first controller and a second controller of the storage device, the first controller and the second controller are respectively connected to the host bus adapter HBA to form the first link and the second link. Two links, or respectively connected to the switch Switch to form a first link and a second link, and the links of the first controller and the second controller are mutually backup;
所述确定模块12具体可以用于:The
将所述第一链路对应的所述第一控制器的端口的wwpn地址与所述第二链路对应的所述第二控制器的端口的wwpn地址进行比较,若所述两个wwpn地址不一致,则确定所述存储设备的连接存在故障。Comparing the wwpn address of the port of the first controller corresponding to the first link with the wwpn address of the port of the second controller corresponding to the second link, if the two wwpn addresses If they are inconsistent, it is determined that there is a fault in the connection of the storage device.
本领域普通技术人员可以理解:实现上述各方法实施例的全部或部分步骤可以通过程序指令相关的硬件来完成。前述的程序可以存储于一计算机可读取存储介质中。该程序在执行时,执行包括上述各方法实施例的步骤;而前述的存储介质包括:ROM、RAM、磁碟或者光盘等各种可以存储程序代码的介质。Those of ordinary skill in the art can understand that all or part of the steps for implementing the above method embodiments can be completed by program instructions and related hardware. The aforementioned program can be stored in a computer-readable storage medium. When the program is executed, it executes the steps including the above-mentioned method embodiments; and the aforementioned storage medium includes: ROM, RAM, magnetic disk or optical disk and other various media that can store program codes.
最后应说明的是:以上各实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述各实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分或者全部技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的范围。Finally, it should be noted that: the above embodiments are only used to illustrate the technical solutions of the present invention, rather than limiting them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: It is still possible to modify the technical solutions described in the foregoing embodiments, or perform equivalent replacements for some or all of the technical features; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the technical solutions of the various embodiments of the present invention. scope.
Claims (18)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201310690351.5A CN103684918A (en) | 2013-12-16 | 2013-12-16 | Method and device for detecting link failure |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201310690351.5A CN103684918A (en) | 2013-12-16 | 2013-12-16 | Method and device for detecting link failure |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN103684918A true CN103684918A (en) | 2014-03-26 |
Family
ID=50321297
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201310690351.5A Pending CN103684918A (en) | 2013-12-16 | 2013-12-16 | Method and device for detecting link failure |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN103684918A (en) |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105915385A (en) * | 2016-05-31 | 2016-08-31 | 深圳市双赢伟业科技股份有限公司 | Switch port state prompting method and device |
| CN106452880A (en) * | 2016-10-19 | 2017-02-22 | 深圳市深信服电子科技有限公司 | Network wiring detection method and apparatus thereof |
| WO2017193966A1 (en) * | 2016-05-13 | 2017-11-16 | 华为技术有限公司 | Link switching method for service in storage system, and storage device |
| CN107547240A (en) * | 2017-05-11 | 2018-01-05 | 新华三云计算技术有限公司 | A kind of chain circuit detecting method and device |
| CN109428744A (en) * | 2017-08-23 | 2019-03-05 | 华为技术有限公司 | Web inspection method and device, storage medium |
| CN110138446A (en) * | 2019-05-09 | 2019-08-16 | 中国航空工业集团公司西安航空计算技术研究所 | A kind of detection method of dual-redundancy optical fiber channel network remaining degradation failure |
| CN112463487A (en) * | 2020-11-25 | 2021-03-09 | 苏州浪潮智能科技有限公司 | Connection error detection method and system of full flash memory array and related components |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1902578A (en) * | 2004-01-21 | 2007-01-24 | 国际商业机器公司 | Method and apparatus for controlling access to logical units |
| CN101577585A (en) * | 2009-06-05 | 2009-11-11 | 中兴通讯股份有限公司 | Method and device for determining failure in automatic switched optical network |
| CN101729408A (en) * | 2009-11-23 | 2010-06-09 | 福建星网锐捷网络有限公司 | Method for determining interface network type and router |
| CN101753207A (en) * | 2008-12-16 | 2010-06-23 | 华为技术有限公司 | Fiber link fault recognition method, device and system |
| CN102457403A (en) * | 2010-10-15 | 2012-05-16 | 无锡江南计算技术研究所 | Network connection fault detection method and device |
| CN102742184A (en) * | 2011-12-07 | 2012-10-17 | 华为技术有限公司 | Optical fiber link detection method, optical line terminal and passive optical network system |
-
2013
- 2013-12-16 CN CN201310690351.5A patent/CN103684918A/en active Pending
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1902578A (en) * | 2004-01-21 | 2007-01-24 | 国际商业机器公司 | Method and apparatus for controlling access to logical units |
| CN101753207A (en) * | 2008-12-16 | 2010-06-23 | 华为技术有限公司 | Fiber link fault recognition method, device and system |
| CN101577585A (en) * | 2009-06-05 | 2009-11-11 | 中兴通讯股份有限公司 | Method and device for determining failure in automatic switched optical network |
| CN101729408A (en) * | 2009-11-23 | 2010-06-09 | 福建星网锐捷网络有限公司 | Method for determining interface network type and router |
| CN102457403A (en) * | 2010-10-15 | 2012-05-16 | 无锡江南计算技术研究所 | Network connection fault detection method and device |
| CN102742184A (en) * | 2011-12-07 | 2012-10-17 | 华为技术有限公司 | Optical fiber link detection method, optical line terminal and passive optical network system |
Cited By (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2017193966A1 (en) * | 2016-05-13 | 2017-11-16 | 华为技术有限公司 | Link switching method for service in storage system, and storage device |
| US10764119B2 (en) | 2016-05-13 | 2020-09-01 | Huawei Technologies Co., Ltd. | Link handover method for service in storage system, and storage device |
| CN105915385A (en) * | 2016-05-31 | 2016-08-31 | 深圳市双赢伟业科技股份有限公司 | Switch port state prompting method and device |
| CN106452880A (en) * | 2016-10-19 | 2017-02-22 | 深圳市深信服电子科技有限公司 | Network wiring detection method and apparatus thereof |
| CN106452880B (en) * | 2016-10-19 | 2020-04-14 | 深信服科技股份有限公司 | Network wiring detection method and device |
| CN107547240A (en) * | 2017-05-11 | 2018-01-05 | 新华三云计算技术有限公司 | A kind of chain circuit detecting method and device |
| CN109428744A (en) * | 2017-08-23 | 2019-03-05 | 华为技术有限公司 | Web inspection method and device, storage medium |
| CN110138446A (en) * | 2019-05-09 | 2019-08-16 | 中国航空工业集团公司西安航空计算技术研究所 | A kind of detection method of dual-redundancy optical fiber channel network remaining degradation failure |
| CN110138446B (en) * | 2019-05-09 | 2022-02-11 | 中国航空工业集团公司西安航空计算技术研究所 | Detection method for redundancy degradation fault of dual-redundancy fiber channel network |
| CN112463487A (en) * | 2020-11-25 | 2021-03-09 | 苏州浪潮智能科技有限公司 | Connection error detection method and system of full flash memory array and related components |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN103684918A (en) | Method and device for detecting link failure | |
| US9998215B2 (en) | Diagnostic port for inter-switch link testing in electrical, optical and remote loopback modes | |
| US10103851B2 (en) | Network link monitoring and testing | |
| US7864704B2 (en) | Intelligent automatic reconfiguration method and apparatus for network system | |
| CN101291243B (en) | Split brain preventing method for highly available cluster system | |
| CN103944750B (en) | One kind binding net card failure processing method and processing device | |
| CN101404568A (en) | Double-network card hot backup redundancy method | |
| CN105095001A (en) | Virtual machine exception recovery method under distributed environment | |
| CN105827419A (en) | Forwarding equipment fault processing method, equipment and controller | |
| CN109120522B (en) | Multipath state monitoring method and device | |
| EP2922247A2 (en) | Method and apparatus for rapidly restoring spanning tree backup port | |
| CN103490914A (en) | Switching system and method for multi-machine hot backup of network application equipment | |
| CN102484603B (en) | Method and device for creating redundant logical connection and storage automation system device | |
| CN118550752A (en) | Cloud platform fault detection and operation and maintenance system, method, equipment and storage medium | |
| JP6429188B2 (en) | Relay device | |
| CN118802965A (en) | FC-SAN storage link setting method, device and electronic equipment | |
| CN107342902A (en) | A kind of link reconfiguration method and system of four controls server | |
| CN116436839A (en) | Link adaptive fault-tolerant method, device and server for storage multi-controller cluster | |
| CN112637010B (en) | Equipment checking method and device | |
| CN116248581B (en) | Cloud scene gateway cluster master-slave switching method and system based on SDN | |
| CN106648949A (en) | Storage system and data transmission method | |
| WO2024239569A1 (en) | Cluster service processing method, server, and system | |
| CN102946321B (en) | A kind of fault handling method based on IRF network and equipment | |
| US11366828B1 (en) | Replication validation | |
| CN102684914B (en) | Method and system for achieving bridge interface linkage |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20140326 |
|
| RJ01 | Rejection of invention patent application after publication |