JPH05100801A

JPH05100801A - Highly reliable disk subsystem

Info

Publication number: JPH05100801A
Application number: JP3261820A
Authority: JP
Inventors: Tomohiro Murata; 智洋村田; Kenzo Kurihara; 謙三栗原; Masaharu Akatsu; 雅晴赤津; Yoshiro Shirayanagi; 芳朗白柳
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 1991-10-09
Filing date: 1991-10-09
Publication date: 1993-04-23

Abstract

(57)【要約】【目的】本発明はディスク制御装置に関し、特に、多数
のドライブを接続する大型ディスクサブシステムにおい
て、ドライブのアクセス障害に対する耐障害性を向上さ
せる方法に関する。【構成】現用ドライブ（トラック）に関する訂正可能な
アクセスエラー発生に関する統計情報を収集し、それを
もとに、現用ドライブ（トラック）障害の前兆があると
判断した場合に、ディスク制御装置が現用ドライブ（ト
ラック）から予備ドライブ（トラック）へのデータをコ
ピー後、現用ドライブ（トラック）から予備ドライブ
（トラック）への切り替えを行なう事により達成され
る。【効果】ディスク制御装置が、障害の前兆があると判断
された現用ドライブ（トラック）のデータを、予備ドラ
イブ（トラック）に退避した後、予備ドライブ（トラッ
ク）を新たな現用ドライブ（トラック）として使用する
ため、現用ドライブの障害を予防的に回避することがで
き、ドライブ障害の確率が減少し、ディスクサブシステ
ムの信頼性が向上する。 (57) [Abstract] [Object] The present invention relates to a disk controller, and more particularly to a method for improving fault tolerance against an access failure of a drive in a large-sized disk subsystem in which a large number of drives are connected. [Structure] Collects statistical information on correctable access errors related to the current drive (track), and if it determines that there is a sign of failure of the current drive (track) based on the collected statistical information, the disk control device determines that the current drive This is achieved by copying the data from the (track) to the spare drive (track) and then switching from the active drive (track) to the spare drive (track). [Effect] The disk control device saves the data of the working drive (track) determined to have a sign of failure to the spare drive (track), and then uses the spare drive (track) as a new working drive (track). Since it is used, it is possible to prevent the failure of the working drive proactively, reduce the probability of the drive failure, and improve the reliability of the disk subsystem.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明はディスク制御装置に関
し、特に、多数のドライブを接続する大型ディスクサブ
システムにおいて、ドライブのアクセス障害に対する耐
障害性を向上させる方法に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a disk controller and, more particularly, to a method for improving the fault tolerance against an access failure of a drive in a large-sized disk subsystem connecting a large number of drives.

【０００２】[0002]

【従来の技術】ホストコンピュータからのディスクドラ
イブに対する一回のデータ書き込みで、ディスク制御装
置が、正，副ペアドライブに同一データを書き込み、
正，副いずれかのドライブでアクセス障害が発生して
も、正常な方のドライブでアクセスを継続することによ
り、ドライブのアクセス障害に対する耐障害性を向上さ
せる方法（ディスク制御装置による二重書き）がアクセ
ス３月／４月号（１９８８年）第９頁から第１０頁
（ＡＣＣＥＳＳＭＡＲ／ＡＰＲ１９８８ｐｐ９−
１０）に記載されている。2. Description of the Related Art When a host computer writes data to a disk drive once, the disk controller writes the same data to a primary and secondary pair drive,
Even if an access failure occurs in either the primary or secondary drive, the normal drive continues the access to improve the fault tolerance against the access failure of the drive (double writing by the disk controller) Access March / April issue (1988) page 9 to page 10 (ACCESS MAR / APR 1988 pp9-
10).

【０００３】[0003]

【発明が解決しようとする課題】正，副ペアドライブに
よる二重書きは、耐障害性が高いがコストも大きい。こ
のため、オンライン処理系のドライブに適用が限定され
る場合が多い。すなわち、アクセス頻度はそう多くない
が、一旦アクセス障害が発生するとその回復処理のため
の影響が大きいオフライン処理系のドライブについて
も、耐障害性向上の要求はあるものの、一般にその数が
多いため、二重書きの適用はコスト面で難しい。The dual writing by the primary and secondary pair drives has high fault tolerance but also high cost. Therefore, the application is often limited to the online processing system drive. In other words, although the access frequency is not so high, even if there is a demand for improving the fault tolerance of an offline processing system drive that has a great influence on the recovery processing once the access failure occurs, there are generally many, so there are many Applying double writing is difficult in terms of cost.

【０００４】本発明の目的は、上記オフライン処理系の
ドライブを対象に、予防保守を行なうことにより、その
障害の確率を減らす方法を提供する事である。An object of the present invention is to provide a method of reducing the probability of failure by performing preventive maintenance on the drive of the above-mentioned offline processing system.

【０００５】[0005]

【課題を解決するための手段】上記の目的は、現用ドラ
イブ（トラック）に関する訂正可能なアクセスエラー発
生に関する統計情報を収集しておき、それをもとに、現
用ドライブ(トラック)障害の前兆があると判断した場合
には、ディスク制御装置が現用ドライブ（トラック）か
ら予備ドライブ（トラック）へのデータコピー後、現用
ドライブ（トラック）から予備ドライブ（トラック）へ
の切り替えを行なう事により達成される。[Means for Solving the Problems] The above-mentioned object is to collect statistical information on correctable access error occurrences relating to the current drive (track), and based on this, a sign of failure of the current drive (track) can be obtained. If it is determined that there is, the disk controller copies the data from the current drive (track) to the spare drive (track), and then switches from the current drive (track) to the spare drive (track). ..

【０００６】[0006]

【作用】障害の前兆があると判断された現用ドライブ
（トラック）のデータを、ディスク制御装置が予備ドラ
イブ（トラック）にコピーした後、予備ドライブ（トラ
ック）を新たな現用ドライブ（トラック）として使用す
ることで、現用ドライブの障害を予防的に回避する。[Operation] After the data of the working drive (track) determined to have a sign of failure is copied to the spare drive (track) by the disk controller, the spare drive (track) is used as a new working drive (track). By doing so, the failure of the working drive can be avoided proactively.

【０００７】[0007]

【実施例】図１に、本発明をディスク制御サブシステム
に適用した場合の一実施例を示す。図１において、ディ
スク制御装置１００は、上位側でホストコンピュータ10
9と接続され、また下位側で記憶媒体であるドライブ１
１１，１１２，１１３，１１４と接続される。ディスク
制御装置１００は、これらのドライブ上でホストコンピ
ュータ１０９の要求に応じてデータのリード，ライトを
行なう。以下、図１におけるディスク制御装置１００の
構成を参照しながら、本発明の主内容であるディスク制
御装置１００の動作を説明する。FIG. 1 shows an embodiment in which the present invention is applied to a disk control subsystem. In FIG. 1, the disk control device 100 includes a host computer 10 on the upper side.
Drive 1 that is connected to 9 and is a storage medium on the lower side
11, 112, 113, 114 are connected. The disk control device 100 reads and writes data on these drives in response to a request from the host computer 109. The operation of the disk control device 100, which is the main content of the present invention, will be described below with reference to the configuration of the disk control device 100 in FIG.

【０００８】各ドライブとホストコンピュータ１０９と
の間のデータ転送を実際に行なうのはディスク制御装置
１００に内蔵されたプロセッサ１０２，１０３である。
これらのプロセッサは、チャネルインタフェース１０
１，ドライブインタフェース１０７，１０８を介し、ホ
ストコンピュータ１０９、及び各ドライブとそれぞれ接
続する。制御メモリ１０４はすべてのプロセッサからア
クセス可能な共通メモリであり、ディスク制御装置１０
０がドライブをアクセスするための共通制御情報が格納
される。共通制御情報の内容については、以下で必要に
応じて説明する。The processors 102 and 103 incorporated in the disk controller 100 actually perform the data transfer between each drive and the host computer 109.
These processors have a channel interface 10
1, via a drive interface 107, 108, respectively connected to the host computer 109 and each drive. The control memory 104 is a common memory that can be accessed by all the processors, and is a disk controller 10.
Common control information for 0 to access the drive is stored. The contents of the common control information will be described below as needed.

【０００９】図２は共通制御情報であるドライブ制御ブ
ロック２００(Device ControlBlock ；略してＤＣＢ）
を示す。ＤＣＢには、各デバイスをディスク制御装置１
００が識別するためのドライブ番号２０１(Device Conn
ection Address；略してＤＣＡ）、当該ドライブの型式
の種類を示すドライブタイプ各ドライブを２０２が格納
される。また、ホストコンピュータの要求に応じてディ
スク制御装置１００が実際にデータの入出力を行なう現
用ドライブと、現用ドライブのバックアップ用である予
備ドライブの２種類に各ドライブを区別し、当該ドライ
ブがそのどちらであるかを用途区別（現用／予備）２０
３に格納する。その他のＤＣＢ制御情報の内容について
は後述する。FIG. 2 shows a drive control block 200 (Device Control Block; DCB for short) which is common control information.
Indicates. Each device is a disk controller 1 in the DCB.
Drive number 201 (Device Conn
ection Address; DCA for short), and 202 for each drive type drive indicating the type of model of the drive. Further, each drive is distinguished into two types, a current drive in which the disk control device 100 actually inputs / outputs data in response to a request from the host computer, and a spare drive for backup of the current drive. Whether it is used or not (current / spare) 20
Store in 3. The contents of other DCB control information will be described later.

【００１０】図３は共通制御情報である交替トラック管
理情報３００を示す。各ドライブごとに、データを格納
する現用トラックのバックアップとして使用可能な未使
用予備トラックと、すでに現用トラックのバックアップ
として使用中である予備トラックを区分して管理する。
具体的には、予備トラックのトラック番号を格納した交
替トラック制御ブロックを、未使用と使用中に分けてキ
ューで管理する。FIG. 3 shows replacement track management information 300 which is common control information. For each drive, an unused spare track that can be used as a backup for the working track that stores data and a spare track that is already being used as a backup for the working track are separately managed.
Specifically, the replacement track control block that stores the track number of the spare track is managed in a queue by dividing it into an unused track and a used track.

【００１１】図４は共通制御情報であるトラックアクセ
ス・テンポラリエラー統計情報800である。ディスク制
御装置１００内の各プロセッサは、各ドライブのトラッ
ク上のデータリード処理において、訂正可能データ転送
エラーを検出する度に、トラックアクセス・テンポラリ
エラー統計情報８００の該当ドライブ８０１の該当する
トラック欄８０２に格納されているテンポラリ・アクセ
スエラー合計回数803をインクリメントする。これらの
情報は、トラック又はドライブの障害発生の兆候を検出
するために用いられるが、詳細については後述する。FIG. 4 shows track access / temporary error statistical information 800 which is common control information. Each time the processor in the disk control device 100 detects a correctable data transfer error in the data read process on the track of each drive, the corresponding track column 802 of the corresponding drive 801 of the track access / temporary error statistical information 800. The total number of temporary access errors 803 stored in is incremented. These pieces of information are used to detect the onset of a failure of the truck or the drive, and the details will be described later.

【００１２】続いて、本発明におけるディスク制御装置
１００内の各プロセッサの動作を説明する。図５はドラ
イブヘルスチェック処理のフローを示す。本処理はディ
スク制御装置１００内の各プロセッサにより、ある周期
で繰返し実行される。最初に共通制御情報であるチェッ
クドライブ番号格納テーブル８００（図８）にドライブ
番号として０を設定しておく。まず、チェックドライブ
番号格納テーブル800より、チェックすべきドライブの
番号を読みだし、自プロセッサのローカルメモリ（１５
２，１５３）に格納する（５０１）。Next, the operation of each processor in the disk controller 100 according to the present invention will be described. FIG. 5 shows a flow of drive health check processing. This process is repeatedly executed by each processor in the disk control device 100 at a certain cycle. First, 0 is set as the drive number in the check drive number storage table 800 (FIG. 8) which is the common control information. First, the number of the drive to be checked is read from the check drive number storage table 800, and the local memory (15
No. 2, 153) (501).

【００１３】次に、当該ドライブ番号を有するドライブ
のスワップ要求テーブル９００（図９）のスワップ要求
９０１が‘off ’であることをチェックする（５０
２）。ここで、スワップ要求テーブル９００において、
スワップ要求９０１が‘off ’であるとは、当該ドライ
ブに関して予備ドライブ又は、予備トラックへの切り替
え要求がないことを示す。スワップ要求９０１が‘off
’である場合は当該ドライブ番号が最大ドライブ番号
に等しいか否かをチェックし、等しい場合はチェックド
ライブ番号格納テーブル８００に−１を設定しておく。Next, it is checked whether the swap request 901 in the swap request table 900 (FIG. 9) of the drive having the drive number is'off '(50).
2). Here, in the swap request table 900,
When the swap request 901 is “off”, it means that there is no request to switch to the spare drive or the spare track for the drive. Swap request 901 is'off
If it is', it is checked whether or not the drive number is equal to the maximum drive number. If they are equal, -1 is set in the check drive number storage table 800.

【００１４】続いて、ローカル変数であるトラックサー
チポインタをゼロクリアし(５０５)、先に自プロセッサ
のローカルメモリに格納しておいたチェックドライブ番
号を持つドライブについて、上記トラックサーチポイン
タと等しいトラック番号を有するトラックのテンポラリ
アクセスエラー回数が上限値（スレショルド）を越えて
いるか否かを判定し、越えている場合は当該ドライブの
テンポラリエラー多発トラック数合計８０４を＋１し
（５１４）、当該ドライブのテンポラリエラー多発トラ
ック数合計８０４が上限値（スレショルド）を越えてい
るか否かを判定し、越えている場合は当該ドライブの処
理済みマーク８０１をセットした後、当該ドライブのス
ワップ要求を登録し（５１１）、チェックドライブ番号
を＋１して処理を終わる。もし、上限値を越えていなけ
れば上記トラックの処理済みマーク８０５をセットした
後、当該トラックのスワップ要求を登録し（５０９）、
トラックサーチポインタを＋１して次のトラックのヘル
スチェックに行く。Then, the track search pointer, which is a local variable, is cleared to zero (505), and for the drive having the check drive number previously stored in the local memory of its own processor, the track number equal to the track search pointer is set. It is determined whether or not the number of temporary access errors of the own track exceeds the upper limit (threshold). If the number of temporary access errors exceeds the upper limit (threshold), the total number of temporary error frequently occurring tracks 804 is incremented by +1 (514), and the temporary error of the drive is increased. It is determined whether or not the total number of frequently-occurring tracks 804 exceeds the upper limit (threshold), and if it exceeds, the processed mark 801 of the drive is set, and then the swap request of the drive is registered (511). The check drive number is incremented by 1 and the process ends. That. If it does not exceed the upper limit, after setting the processed mark 805 of the track, the swap request of the track is registered (509),
The track search pointer is incremented by 1 and the health check of the next track is performed.

【００１５】図６は図５におけるトラックスワップ要求
登録処理５０９の処理内容を示すフローである。まず、
スワップ処理の対象となるトラックを持つドライブのロ
ックを取得し（６０１）、未使用の交替トラックキュー
３１０（図３）からコピー先となる交替トラックを一本
選ぶ（６０４）。もし、未使用の交替トラックがなけれ
ばトラックスワップ不可を表示し（６０６）、取得した
ロックを解放して処理を終わる。未使用の交替トラック
があれば、スワップ対象ドライブのＤＣＢに交替トラッ
ク番号２０４を、またコピー先ドライブ番号２０５とし
て自ドライブ番号を設定する。さらにコピー要求２０６
をトラックコピーに設定し、コピー開始トラック２１２
にスワップ対象トラック番号を設定する。そしてスワッ
プ要求２０８をonし（７０８）、取得していたロックを
解放する。FIG. 6 is a flow chart showing the processing contents of the track swap request registration processing 509 in FIG. First,
The lock of the drive having the track to be swapped is acquired (601), and one replacement track to be the copy destination is selected from the unused replacement track queue 310 (FIG. 3) (604). If there is no unused replacement track, it is displayed that the track swap is impossible (606), the acquired lock is released, and the processing is ended. If there is an unused replacement track, the replacement track number 204 is set in the DCB of the swap target drive, and the own drive number is set as the copy destination drive number 205. Further copy request 206
Set to track copy, and copy start track 212
Set the swap target track number to. Then, the swap request 208 is turned on (708), and the acquired lock is released.

【００１６】図７は図５におけるドライブスワップ要求
登録処理５１１の処理内容を示すフローである。まず、
スワップ処理の対象となるドライブのロックを取得し
（701)、ディスク制御装置１００に接続するドライブの
ＤＣＢを調べ、用途区別２０３が予備ドライブであり、
かつスワップ対象ドライブとドライブタイプが一致する
ドライブを一つ選ぶ（７０４）。予備ドライブがなけれ
ばスワップ不可を表示し（７０６）、上記ロックを解除
して処理を終わる。予備ドライブがあれば、スワップ対
象ドライブのＤＣＢのコピー先ドライブ番号２０５に上
記予備ドライブ番号を設定し、さらにコピー要求２０６
をドライブコピーに設定し、コピー開始トラック２１
２、及びコピー終了トラック２１３に上記ドライブタイ
プに対応したコピー開始トラック番号とコピー終了トラ
ック番号をそれぞれ設定する(７０７)。そしてスワップ
要求２０８をonし（７０８）、取得していたロックを解
放する。FIG. 7 is a flow chart showing the processing contents of the drive swap request registration processing 511 in FIG. First,
The lock of the drive to be swapped is acquired (701), the DCB of the drive connected to the disk control device 100 is checked, and the application distinction 203 is the spare drive,
Moreover, one drive having the same drive type as the swap target drive is selected (704). If there is no spare drive, swap not possible is displayed (706), the above lock is released, and the process ends. If there is a spare drive, the spare drive number is set in the copy destination drive number 205 of the DCB of the swap target drive, and the copy request 206 is made.
Is set to drive copy, and copy start track 21
The copy start track number and the copy end track number corresponding to the drive type are set in 2 and the copy end track 213, respectively (707). Then, the swap request 208 is turned on (708), and the acquired lock is released.

【００１７】図１０はトラックスワップ処理の処理内容
を示すフローである。本処理も任意のプロセッサで、あ
る周期ごとに実行される。まず、スワップ処理ドライブ
番号格納テーブル９００よりスワップ処理の候補となる
ドライブ番号を読みだし、プロセッサのローカルメモリ
に格納する（１００１）。当該ドライブ番号が最大ドラ
イブ番号に等しければ当該テーブル９００をゼロクリア
し（１００３）、等しくなければ当該テーブル９００を
＋１する（１０１５）。続いて、当該ドライブのＤＣＢ
内のスワップ要求２０８が‘on’か否かをチェックし、
‘on’でなければ処理を終わる。‘on’であれば、当該
ドライブのコピー要求がトラックコピーかドライブコピ
ーかを判定し、ドライブコピーであればドライブスワッ
プ処理を行なって処理を終わる。トラックコピーであれ
ば、スワップ要求２０８を処理中にし（１００６）、当
該ドライブのロックを取得し（１００７）、コピー開始
トラック番号を有するトラックからトラックデータをキ
ャッシュメモリ１０５にロードし（１００８）、交替ト
ラック番号を有するトラックに対し、上記ロードデータ
をストアした後(１０１０）、スワップ処理要求を‘of
f’に設定し(1010)取得していた上記ロックを解放して
処理を終わる。FIG. 10 is a flow chart showing the processing contents of the track swap processing. This processing is also executed by an arbitrary processor in a certain cycle. First, a swap process candidate drive number is read from the swap process drive number storage table 900 and stored in the local memory of the processor (1001). If the drive number is equal to the maximum drive number, the table 900 is cleared to zero (1003), and if not equal, the table 900 is incremented by 1 (1015). Next, the DCB of the drive
Check if the swap request 208 inside is'on ',
If not'on ', the process ends. If it is'on ', it is determined whether the copy request of the drive is a track copy or a drive copy. If it is a drive copy, the drive swap processing is performed and the processing ends. If it is a track copy, the swap request 208 is processed (1006), the lock of the drive is acquired (1007), the track data from the track having the copy start track number is loaded into the cache memory 105 (1008), and the replacement is performed. After storing the above load data for the track having the track number (1010), a swap processing request is issued.
Set it to f '(1010) and release the above lock, and finish the process.

【００１８】図１１はドライブスワップ処理の処理内容
を示すフローである。本処理も任意のプロセッサで、あ
る周期ごとに実行される。まず、スワップ要求２０８を
処理中にし、コピー済トラック番号２１４を（コピー開
始番号−１）の値に初期化する（１１０１）。コピー済
トラック番号２１４がコピー終了トラック番号に等しく
なるまで以下を繰り返す。まず、コピー元ドライブ(当
該ドライブ)、およびコピー先ドライブのロックをとも
に取得し（１１０３）、未コピートラック（コピー済ト
ラック＋１の番号を有するトラック）からトラックデー
タをキャッシュメモリ１０５にロードし（１１０５）、
コピー先ドライブのトラックに上記ロードデータをスト
アした後（１０１０）、取得したロックを解放しコピー
済トラックの番号を＋１する（１１０７）。そして、ホ
ストコンピュータからの、スワップ処理中の当該ドライ
ブに対するデータ入出力を受け付け可能とするために、
一定時間スワップ処理を中断した後（１１０８）、次の
トラックのコピー処理に移る。FIG. 11 is a flow chart showing the processing contents of the drive swap processing. This processing is also executed by an arbitrary processor in a certain cycle. First, the swap request 208 is set in process, and the copied track number 214 is initialized to the value of (copy start number-1) (1101). The following is repeated until the copied track number 214 becomes equal to the copy end track number. First, the locks of the copy source drive (the drive concerned) and the copy destination drive are both acquired (1103), and track data is loaded from the uncopied track (copied track + 1 track) into the cache memory 105 (1105). ),
After storing the load data in the track of the copy destination drive (1010), the acquired lock is released and the number of the copied track is incremented by 1 (1107). Then, in order to be able to receive the data input / output from the host computer to the drive under the swap processing,
After the swap processing is interrupted for a certain period of time (1108), the next track copy processing is started.

【００１９】コピー済トラック番号２１４がコピー終了
トラック番号に等しくなった時点で、コピー元、および
コピー先ドライブのロックを共に取得し、コピー先ドラ
イブへの差分データのデステージ処理（１１１１）、及
び、アクセスパススワップ処理（１１１２）を行ない、
最後に、取得していたロックを解放し（１１１４）処理
を終わる。At the time when the copied track number 214 becomes equal to the copy end track number, the locks of the copy source and copy destination drives are both acquired, and destaging processing of the differential data to the copy destination drive (1111), and , Perform access path swap processing (1112),
Finally, the acquired lock is released (1114), and the process ends.

【００２０】以下、差分データのデステージ処理（１１
１１）の内容を説明するが、その前に、ドライブスワッ
プ処理中の差分データの管理について述べておく。ホス
トコンピュータから、コピー元ドライブのあるトラック
に書き込み要求が来た場合、当該トラックの番号がコピ
ー済トラック番号より大きいか等しい場合は、データ書
き込みの対象となるトラックは未コピーのトラックであ
り、当該トラックに単にデータの書き込みを行なえばよ
く、コピー終了時点で、コピー元とコピー先のドライブ
でデータが異なることはない。一方、当該トラックの番
号がコピー済トラック番号より小さい場合は、データ書
き込みの対象となるトラックはコピー済のトラックであ
るため、当該トラックにデータの書き込みを行なうと、
コピー終了時点でコピー元とコピー先のドライブでデー
タが異なる。In the following, destage processing of the difference data (11
The content of 11) will be described, but before that, the management of the differential data during the drive swap processing will be described. When a write request is sent from a host computer to a track on the copy source drive, if the track number is greater than or equal to the copied track number, the data write target track is an uncopied track. It suffices to simply write the data on the track, and at the end of the copy, the data does not differ between the copy source drive and the copy destination drive. On the other hand, when the track number is smaller than the copied track number, the track to which data is to be written is the copied track, and therefore when data is written to the track,
At the end of copying, the data differs between the copy source and copy destination drives.

【００２１】後者の場合に対処するために、図１４に示
すように、コピー元ドライブ上のコピー済のトラックに
データの書き込みを行なった場合には、当該トラック
（このようなトラックを差分トラックと呼ぶ）に対する
書き込みデータをキャッシュ１０５内のキャッシュ本体
４０２にも保持する。例えば、トラック２（ＴＲ２）に
対する書き込みがこのケースとする。この場合、まず、
キャッシュ本体４０２内の空いているデータ格納場所の
アドレスを保持しているスロット制御ブロック（ＳＣ
Ｂ）のキュー４３に有ったＳＣＢ（４０６）を一つ選
び、そのＳＣＢが指す格納場所（スロット２）に上記書
き込みデータを格納し、キャッシュ１０５内のディレク
トリ４０１に設けたトラック表４０３の、ＴＲ２に対応
するエントリに、上記ＳＣＢへのポインタを格納する。
このようにすることで、差分トラックがどれで、その書
き込みデータが何であるかが、キャッシュを見れば分か
る。差分トラックのデータは、ドライブコピーが一通り
終了した段階、すなわち、コピー済トラック番号２１４
がコピー終了トラック番号に等しくなった時点で、再度
コピー先ドライブに書き込みを行なう必要がある。To deal with the latter case, as shown in FIG. 14, when data is written to a copied track on the copy source drive, the track (such a track is referred to as a differential track) is written. The write data for the call) is also held in the cache body 402 in the cache 105. For example, this is the case when writing to track 2 (TR2). In this case, first
A slot control block (SC that holds the address of an empty data storage location in the cache body 402)
B) Select one SCB (406) in the queue 43, store the write data in the storage location (slot 2) pointed to by the SCB, and store the write data in the track table 403 provided in the directory 401 in the cache 105. The pointer to the SCB is stored in the entry corresponding to TR2.
By doing this, it is possible to know which difference track is and what write data is by looking at the cache. The data of the differential track is at the stage when the drive copying is completed, that is, the copied track number 214
When becomes equal to the copy end track number, it is necessary to write again to the copy destination drive.

【００２２】さて、図１２は、差分データのデステージ
処理（１１１１）の内容を示すフローである。まず、上
記ディレクトリ４０１のトラック表４０３に、上記差分
トラックがリンクしているか否かチェックし（１２０
１）、もし、一本もリンクしていなければ処理を終わ
る。リンクしている場合は、差分トラックのＳＣＢが指
すキャッシュ上の書き込みデータをすべて、コピー先ド
ライブの該当するトラックに書き込む（１２０４）。Now, FIG. 12 is a flow showing the contents of the destage processing (1111) of the difference data. First, it is checked whether or not the differential track is linked to the track table 403 of the directory 401 (120
1) If there is no link, the process ends. If they are linked, all the write data on the cache pointed to by the SCB of the differential track is written to the corresponding track of the copy destination drive (1204).

【００２３】次に、アクセスパスマッピングテーブル１
４００（図１５）について説明する。アクセスパスマッ
ピングテーブル１４００（図１５）は、ホストコンピュ
ータ１０９から見たドライブの識別番号ＣＣＡ（Channe
l Connection Address）と、ディスク制御装置１００で
内部的に管理するドライブの識別番号ＤＣＡ(DeviceCon
nection Address）の対応表を与えるものである。ホス
トコンピュータ１０９の入出力要求で指示されるドライ
ブ番号（ＣＣＡ）は、ディスク制御装置１００によりＤ
ＣＡに変換され、ＤＣＡに対応するドライブが実際にア
クセスされる。具体的には図１５で、ＣＣＡの値をｋと
すると、アクセスパスマッピングテーブル１４００のＣ
ＣＡマッピング情報１４０１のＢｋの欄の値ｉを読みだ
してDCAの値とし、ドライブ番号ｉのドライブが実際に
アクセスされる。このマッピング機能により、上記のド
ライブスワップ処理で現用ドライブが予備ドライブに切
り替えられても、ＣＣＡとＤＣＡの対応関係を変えるこ
とにより、ホストコンッピュータ側でドライブ番号を変
えずにすむ。Next, access path mapping table 1
400 (FIG. 15) will be described. The access path mapping table 1400 (FIG. 15) shows the drive identification number CCA (Channe) seen from the host computer 109.
l Connection Address), and the identification number DCA (DeviceCon
nection Address) correspondence table. The drive number (CCA) designated by the input / output request of the host computer 109 is D by the disk controller 100.
Converted to CA, the drive corresponding to DCA is actually accessed. Specifically, assuming that the value of CCA is k in FIG. 15, C of the access path mapping table 1400 is set.
The value i in the Bk column of the CA mapping information 1401 is read and set as the value of DCA, and the drive of drive number i is actually accessed. With this mapping function, even if the current drive is switched to the spare drive in the above drive swap processing, the correspondence between CCA and DCA is changed so that the drive number does not need to be changed on the host computer side.

【００２４】最後に、上記のアクセスパスマッピングテ
ーブル１４００を用いたアクセスパススワップ処理につ
いて、図１３のフローに従い説明する。まず、コピー元
ドライブのＤＣＡ（ｉとする）とコピー先ドライブのＤ
ＣＡ（ｊとする）を記憶する（１３０１）。次に、ＤＣ
Ａマッピング情報１４０２のＢｉの欄の値(ｋとする)
と、ＤＣＡマッピング情報１４０２のＢｊの欄の値（ｌ
とする）をリードする（１３０２）。続いて、ＤＣＡマ
ッピング情報１４０２のＢｉの欄に値ｌを書き込み、Ｃ
ＣＡマッピング情報１４０１のＢｌの欄に値ｉを書き込
む(１３０３)。最後に、ＤＣＡマッピング情報１４０２
のＢｊの欄に値ｋを書き込み、ＣＣＡマッピング情報１
４０１のＢｋの欄に値ｊを書き込む（１３０３）。Finally, the access path swap processing using the above access path mapping table 1400 will be described according to the flow of FIG. First, the copy source drive DCA (denoted as i) and the copy destination drive D
The CA (j) is stored (1301). Then DC
Value in the Bi column of the A mapping information 1402 (k)
And the value (l in the Bj column of the DCA mapping information 1402
() Is read (1302). Then, the value 1 is written in the Bi field of the DCA mapping information 1402, and C
The value i is written in the Bl column of the CA mapping information 1401 (1303). Finally, DCA mapping information 1402
Write the value k in the Bj column of CCA mapping information 1
The value j is written in the Bk column of 401 (1303).

【００２５】これらの処理により、障害の前兆があると
判断された現用ドライブ(トラック)のデータを、ディス
ク制御装置が予備ドライブ（トラック）にコピーした
後、予備ドライブ（トラック）を新たな現用ドライブ
（トラック）として使用することが自動的に実現でき、
ディスクサブシステムの信頼性が向上する。By these processes, the disk controller copies the data of the working drive (track) determined to have a sign of failure to the spare drive (track), and then the spare drive (track) is replaced with a new working drive. It can be automatically used as a (track),
Improves disk subsystem reliability.

【００２６】[0026]

【発明の効果】障害の前兆があると判断された現用ドラ
イブ（トラック）のデータを、ディスク制御装置が予備
ドライブ（トラック）にコピーした後、予備ドライブ
（トラック）を新たな現用ドライブ（トラック）として
使用するため、現用ドライブの障害を予防的に回避する
ことができ、ドライブ障害の発生確率が減少し、ディス
クサブシステムの信頼性が向上する。As described above, the disk controller copies the data of the current drive (track), which is judged to have a sign of failure, to the spare drive (track), and then the spare drive (track) is replaced with a new current drive (track). Therefore, it is possible to prevent the failure of the current drive proactively, reduce the probability of occurrence of the drive failure, and improve the reliability of the disk subsystem.

[Brief description of drawings]

【図１】本発明の一実施例を示す制御構成図である。FIG. 1 is a control configuration diagram showing an embodiment of the present invention.

【図２】ドライブ制御ブロックの構成図である。FIG. 2 is a configuration diagram of a drive control block.

【図３】交替トラックの管理情報である。FIG. 3 is management information of a replacement truck.

【図４】トラックアクセス・テンポラリエラー統計情報
を示す図である。FIG. 4 is a diagram showing track access / temporary error statistical information.

【図５】ドライブヘルスチェック処理のフローである。FIG. 5 is a flow chart of drive health check processing.

【図６】トラックスワップ要求登録処理のフローであ
る。FIG. 6 is a flow chart of a track swap request registration process.

【図７】ドライブスワップ要求登録処理のフローであ
る。FIG. 7 is a flow chart of drive swap request registration processing.

【図８】チェックドライブ番号格納テーブル。FIG. 8 is a check drive number storage table.

【図９】スワップ処理ドライブ番号格納テーブル。FIG. 9 is a swap processing drive number storage table.

【図１０】トラックスワップ処理のフローである。FIG. 10 is a flow of track swap processing.

【図１１】ドライッブスワップ処理のフローである。FIG. 11 is a flow chart of a dry swap process.

【図１２】差分トラックデステージ処理のフローであ
る。FIG. 12 is a flowchart of a differential track destage process.

【図１３】アクセスパススワップ処理のフローである。FIG. 13 is a flow chart of access path swap processing.

【図１４】キャッシュメモリの構成図である。FIG. 14 is a configuration diagram of a cache memory.

【図１５】アクセスパスマッピングテーブルの構成図で
ある。FIG. 15 is a configuration diagram of an access path mapping table.

[Explanation of symbols]

１００…ディスク制御装置、１０２，１０３…プロセッ
サ、１０４…制御メモリ、１０９…ホストコンピュー
タ、１１１，１１２，１１３，１１４…ドライブ。100 ... Disk control device, 102, 103 ... Processor, 104 ... Control memory, 109 ... Host computer, 111, 112, 113, 114 ... Drive.

───────────────────────────────────────────────────── フロントページの続き (72)発明者白柳芳朗神奈川県小田原市国府津2880番地株式会社日立製作所小田原工場内 ─────────────────────────────────────────────────── ─── Continuation of the front page (72) Inventor Yoshiro Shirayanagi 2880 Kozu, Odawara City, Kanagawa Stock Company Hitachi Ltd. Odawara Factory

Claims

[Claims]

1. A disk subsystem comprising a disk controller and a drive which is a magnetic storage medium, wherein the drive connected to the disk controller is divided into one or more working drives and one or more spare drives for disk control. Registered in the device in advance, the number of correctable data transfer system failures that occurred during input / output from the host computer to the active drive is counted for each drive by the disk controller, and the number of occurrences is a predetermined value. A high reliability system for a disk subsystem, characterized in that the contents of the active drive are exceeded by the disk controller to an appropriate spare drive, and the spare drive is used as a substitute for the active drive.

2. The disk control device according to claim 1, wherein the tracks of the drive are divided into an active track and an alternate track and registered in advance in the disk control device, and when the input / output from the host computer to the active track is executed. The number of correctable data transfer system failures that have occurred is counted for each track by the disk control device, and for the current track in which the number of occurrences exceeds a predetermined value, the disk control device selects an appropriate number in the same drive. A high reliability system for a disk subsystem, characterized in that, after copying the contents to a spare track, the replacement track is used as a working track instead of the working track.