WO2018037509A1

WO2018037509A1 - Storage system and storage control method

Info

Publication number: WO2018037509A1
Application number: PCT/JP2016/074691
Authority: WO
Inventors: 恭男渡邊; 智大川口
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2016-08-24
Filing date: 2016-08-24
Publication date: 2018-03-01
Anticipated expiration: 2019-02-24

Abstract

When a first chunk, which is at least one of one or more chunks, is configured using two or more zones provided by two or more zone-providing storage devices, this storage system sets, for each of the two or more zones, an area aligned with two or more chunk pages based on the first chunk as a parcel, which is a component of the chunk. The storage system directly or indirectly allocates, to a logical volume, at least one chunk page among the two or more chunk pages based on the first chunk.

Description

Storage system and storage control method

The present invention generally relates to storage control, for example, storage control of a storage system having SMR (Shingled Magnetic Recording) HDDs (Hard Disk Drives).

As storage systems, for example, the storage systems disclosed in Patent Documents 1 and 2 are known.

US 7,904,749; US 8,775,733

As the amount of data held or used by users increases, the PDEVs (physical storage devices) installed in storage systems are required to have larger capacity. A PDEV is typically a non-volatile storage device, and one example of a PDEV is an HDD (Hard Disk Drive). An HDD to which SMR (Shingled Magnetic Recording) technology is applied, that is, an SMR-HDD, is known as an HDD with a larger storage capacity than a normal HDD. HDDs are expected to remain relatively inexpensive in the future, which makes a storage system equipped with SMR-HDDs as PDEVs conceivable.

One feature of an SMR-HDD is that contiguous logical areas called zones are defined in it in advance. A conventional zone, in which random writes are possible, can be defined as a zone, but for writes according to SMR it is necessary to define a sequential zone, in which random writes are not possible (that is, only sequential writes are possible). Specifically, for example, a conventional zone can be supported by conventional commands (typically SCSI Block Commands). A sequential zone, however, cannot be supported by conventional commands; it is supported by SMR-optimized commands (Zoned Block Commands), which are an example of extended commands.

If SMR-HDDs are adopted as PDEVs, a logical area based on an SMR-HDD can be directly or indirectly allocated to a logical volume provided to the host system (one or more host computers) of the storage system. However, no technique is known that employs a sequential zone as the storage area underlying a logical area allocated to a logical volume.

Zone-providing storage devices (PDEVs) are not limited to SMR-HDDs; other types of zone-providing storage devices in which sequential zones are defined are also possible. The problem described above can therefore also arise when such another type of zone-providing storage device is adopted as a PDEV installed in a storage system.

When a storage system configures a first chunk, which is at least one of one or more chunks, using two or more zones provided by two or more zone-providing storage devices, the storage system sets, for each of the two or more zones, an area aligned with two or more chunk pages based on the first chunk as a parcel, which is a component of the chunk. The storage system directly or indirectly allocates at least one of the two or more chunk pages based on the first chunk to a logical volume. “Indirectly allocating” means that a chunk page is allocated to the logical volume via an allocation destination area of the chunk page. One example of an “allocation destination area” is the virtual page described later. “Directly allocating” means that a chunk page is allocated to the logical volume without passing through any allocation destination area.

A logical area based on a zone provided by a zone-providing storage device can be appropriately allocated to a logical volume.

Schematic overview of Embodiment 1.
Configuration of the entire system according to Embodiment 1.
Example of a storage hierarchy according to Embodiment 1.
Programs, tables, and areas in the memory.
Correspondence between chunks and chunk pages according to RAID 5.
Correspondence between chunks and chunk pages according to RAID 4.
Example of address mapping.
Configuration of the logical volume management table.
Configuration of the SMR-HDD management table.
Configuration of the cluster mapping table.
Configuration of the cluster reverse mapping table.
Configuration of the cache management table.
Configuration of the write-destination virtual page management table.
Configuration of the virtual page write pointer management table.
Flow of cache memory write processing.
Flow of read processing.
Schematic outline of parity generation method (1).
Flow of parity generation processing according to parity generation method (1).
Schematic outline of parity generation method (2).
Schematic outline of parity generation method (3).
Configuration of the temporary storage area management table.
Flow of parity generation processing according to parity generation method (3).
Flow of usable capacity report processing.
Example of a storage hierarchy according to Embodiment 2.
Example of address mapping according to Embodiment 3.
Configuration of the zone management table.
Flow of destage processing.

Several embodiments are described below.

In the following description, an “interface unit” includes one or more interfaces. The one or more interfaces may be one or more interface devices of the same type (for example, one or more NICs (Network Interface Cards)) or two or more interface devices of different types (for example, a NIC and an HBA (Host Bus Adapter)).

In the following description, a “storage unit” includes one or more memories. At least one memory may be a volatile memory or a non-volatile memory. The storage unit is mainly used during processing by the processor unit.

In the following description, a “processor unit” includes one or more processors. At least one processor is typically a CPU (Central Processing Unit). A processor may include a hardware circuit that performs part or all of the processing.

In the following description, information may be described using an expression such as “xxx table”, but the information may be expressed in any data structure. That is, to show that the information does not depend on the data structure, an “xxx table” can be referred to as “xxx information”. In the following description, the configuration of each table is an example; one table may be divided into two or more tables, and all or part of two or more tables may be combined into a single table.

In the following description, processing may be described with a “program” as the subject. A program is executed by a processor (for example, a CPU (Central Processing Unit)) and performs its defined processing while using a storage unit (for example, a memory) and/or an interface device (for example, a communication port) as appropriate, so the subject of the processing may also be regarded as the processor (or an apparatus or system having that processor). The processor may include a hardware circuit that performs part or all of the processing. A program may be installed from a program source onto an apparatus such as a computer. The program source may be, for example, a program distribution server or a computer-readable (for example, non-transitory) recording medium. In the following description, two or more programs may be implemented as one program, and one program may be implemented as two or more programs.

In the following description, a “host system” may be one or more physical host computers (for example, a cluster of host computers) and may include at least one virtual host computer (for example, a VM (Virtual Machine)). Hereinafter, the host system is simply referred to as the “host”.

In the following description, a “storage system” may be one or more physical storage apparatuses and may include at least one virtual storage apparatus (for example, an LPAR (Logical Partition) or SDS (Software Defined Storage)).

In the following description, “PDEV” means a physical storage device, typically a non-volatile storage device (for example, an auxiliary storage device). In the following embodiments, a PDEV is typically an SMR-HDD. However, the present invention is applicable not only to SMR-HDDs but also to other types of PDEVs, in particular other types of PDEVs in which sequential zones are defined in advance.

In the following description, “RAID” is an abbreviation of Redundant Array of Independent (or Inexpensive) Disks.

In the following description, a “RAID group” may be a group that consists of a plurality of SMR-HDDs and stores data according to the associated RAID level (RAID configuration), or a group that consists of a plurality of parcels (described later) and stores data according to the associated RAID level (RAID configuration). As described later, the latter group may be called a “chunk” in the embodiments.

In the following description, when elements of the same type are described without being distinguished from one another, a reference sign (or the common part of the reference signs) is used, and when elements of the same type are distinguished, the ID of the element (or the reference sign of the element) may be used. For example, “SMR-HDD 51” is written when SMR-HDDs are not distinguished, and a form such as “SMR-HDD 1-1” is written when individual SMR-HDDs are distinguished.

In the following description, numbers are mainly used as element identifiers, but other types of codes may be used instead of or in addition to numbers.

The definitions of the types of storage areas used in the following embodiments are as follows.
A “zone” is a storage area defined in advance in an SMR-HDD. A zone is a contiguous logical storage area that follows contiguous logical addresses (typically LBAs (Logical Block Addresses)). A zone can be supported (operated on) by commands to the SMR-HDD. For example, a certain control command returns a response describing the capacity of a zone. A plurality of zones exist in one SMR-HDD. All zones in an SMR-HDD have the same size.
A “parcel” is all or part of a zone.
A “stripe” is part of a parcel. A plurality of stripes in one SMR-HDD constitute one parcel in that SMR-HDD. A stripe stores either user data that is a portion of the data unit corresponding to the stripe column (described later) containing the stripe, or parity corresponding to that data unit. A stripe that stores user data can be called a “data stripe”, and a stripe that stores parity can be called a “parity stripe”. A “data unit” means the data of one stripe column.
A “chunk” is a storage area composed of a plurality of parcels existing in a plurality of SMR-HDDs. A chunk therefore includes a plurality of stripes, among which there are both data stripes and parity stripes.
A “chunk page” is a storage area composed of a plurality of data stripes existing in a plurality of SMR-HDDs. In other words, a chunk page is composed of a plurality of stripes, all of which are data stripes; it contains no parity stripe.
A “stripe column” is a storage area composed of a plurality of stripes at the same address, one in each of the plurality of SMR-HDD areas constituting one RAID group.
A “cache stripe column” is a storage area in the cache memory area that corresponds to a stripe column. Data in a cache stripe column is stored (destaged) to the stripe column corresponding to that cache stripe column.
A “logical volume” is a logical storage area recognized by the host.
A “logical page” is part of a logical volume. A logical volume is composed of a plurality of logical pages.
A “logical cluster” is part of a logical page. A logical page is composed of a plurality of logical clusters.
A “virtual volume” is a so-called capacity expansion volume, typically a logical storage area that follows Thin Provisioning. A virtual volume is an internal volume (a volume not provided to the host) associated with a logical volume.
A “virtual page” is part of a virtual volume. A virtual volume is composed of a plurality of virtual pages.
A “virtual cluster” is part of a virtual page. A virtual page is composed of a plurality of virtual clusters.

FIG. 1 is a schematic overview of Embodiment 1.

There are a plurality of SMR-HDDs 51. Each SMR-HDD 51 is an example of a zone-providing storage device. Each SMR-HDD 51 provides a plurality of zones 50. The zones 50 include at least sequential zones 50S. There may also be conventional zones 50C. A conventional zone 50C is a zone in which random writes are possible. A sequential zone 50S, on the other hand, is a zone in which random writes are not possible, in other words, a zone in which only sequential writes are possible. In the present embodiment, a conventional zone 50C can be supported by SCSI Block Commands, which are an example of conventional commands. A sequential zone 50S cannot be supported by conventional commands and is supported by SMR-optimized commands (Zoned Block Commands), which are an example of extended commands.
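The difference between the two zone types can be illustrated with a toy model. The following is a minimal sketch, not the drive's actual command interface: the class names, the `write` method, and the use of a per-zone write pointer to enforce sequential writes are assumptions introduced for illustration only.

```python
class ConventionalZone:
    """Toy model of a conventional zone 50C: any in-range write is accepted."""

    def __init__(self, start_lba: int, size_blocks: int):
        self.start_lba = start_lba
        self.size_blocks = size_blocks

    def write(self, lba: int, num_blocks: int) -> None:
        end = self.start_lba + self.size_blocks
        if lba < self.start_lba or lba + num_blocks > end:
            raise ValueError("write outside the zone")


class SequentialZone(ConventionalZone):
    """Toy model of a sequential zone 50S: writes must start at the write pointer."""

    def __init__(self, start_lba: int, size_blocks: int):
        super().__init__(start_lba, size_blocks)
        self.write_pointer = start_lba  # next LBA that may be written

    def write(self, lba: int, num_blocks: int) -> None:
        if lba != self.write_pointer:
            raise ValueError("random write rejected: only sequential writes allowed")
        super().write(lba, num_blocks)
        self.write_pointer += num_blocks


seq = SequentialZone(start_lba=0, size_blocks=8)
seq.write(0, 4)       # accepted: starts at the write pointer
seq.write(4, 2)       # accepted: continues sequentially
conv = ConventionalZone(start_lba=8, size_blocks=8)
conv.write(13, 2)     # accepted: random writes are allowed here
```

In this sketch, an attempt such as `seq.write(0, 1)` after the two writes above raises `ValueError`, which mirrors the restriction that a sequential zone accepts only sequential writes.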

Each zone 50 includes a parcel 41. A parcel 41 is composed of a plurality of stripes 52 provided by the SMR-HDD 51 that has the parcel 41. The size of a stripe 52 is, for example, 512 KB.

A plurality of chunk pages 60 are provided based on the plurality of SMR-HDDs 51. Each chunk page 60 is composed of a plurality of stripes 52, each provided by one of the plurality of SMR-HDDs 51 that provide the chunk page 60.

According to the present embodiment, an unused area 53 can be provided in at least one sequential zone 50S. That is, at least one sequential zone 50S can be composed of a parcel 41 and an unused area 53. In other words, instead of using up the entire sequential zone 50S (providing all of it as stripes 52), part of the sequential zone 50S can be defined as an unused area 53. An “unused area” is a storage area that is not used as an effective storage area. Put differently, no part of the unused area is provided (used) as any part of a stripe 52. Accordingly, neither user data nor parity is stored in the unused area 53.

According to the present embodiment, the usable capacity of the storage system is defined as follows. The “usable capacity” is the capacity of the areas that can store user data and does not include the capacity of the areas that can store parity. Typically, the size of a zone 50 (for example, the size of a sequential zone 50S) is larger than the page size. In the following definition, the RAID level is mD+nP (m is a natural number and n is an integer of 0 or more); that is, one stripe column consists of m data stripes and n parity stripes.
Usable capacity = total data stripe capacity = parcel size × m ÷ (m + n) × number of parcels ... (Equation 1)

The “parcel size” is defined as follows. As in (Equation 1), the RAID level is mD+nP (m is a natural number and n is an integer of 0 or more); that is, one stripe column consists of m data stripes and n parity stripes.
Parcel size = int{zone size ÷ (page size ÷ m)} × (page size ÷ m) ... (Equation 2)
* “int” means truncating the fractional part.
* “m” is the “m” in “mD+nP”.

According to the above definition, when the zone size is 256 MB, the page size is 42 MB, and m = 3, the parcel size is 252 MB (page size ÷ m = 14 MB, and 18 multiples of 14 MB fit in 256 MB). The size of the unused area 53 is therefore 4 MB (= 256 MB (zone size) - 252 MB (parcel size)). The “page size” may be the size of either a virtual page or a logical page (typically, virtual pages and logical pages have the same size). Under the above definition, the parcel size can also equal the zone size, in which case the size of the unused area 53 is 0, that is, no unused area 53 is needed.
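The parcel size calculation can be checked with a few lines of code. The following is a minimal sketch of (Equation 2); the function name `parcel_size` and the requirement that the page size be divisible by m are assumptions made for this illustration.

```python
MB = 1024 * 1024


def parcel_size(zone_size: int, page_size: int, m: int) -> int:
    """Equation 2: the largest multiple of (page_size / m) that fits in one zone."""
    if page_size % m != 0:
        raise ValueError("this sketch assumes the page size is divisible by m")
    per_device = page_size // m               # portion of one page held by one device
    return (zone_size // per_device) * per_device  # int{...} is integer division


zone = 256 * MB
parcel = parcel_size(zone, page_size=42 * MB, m=3)
unused = zone - parcel                        # size of the unused area 53
print(parcel // MB, unused // MB)             # prints "252 4"
```

With a 64 MB page and m = 4, the same function returns 256 MB for a 256 MB zone, which is the case in which no unused area is needed.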

In the specific example above the page size is 42 MB, and a page size of 42 MB is in fact preferable. One reason is as follows.

The stripe size is generally a power of 2 and is typically 512 KB. When the stripe size is 512 KB, the total size of the data stripes 52 in each stripe column is as follows.
RAID level case 1: RAID 1 (1D+1P)
Total size of data stripes 52 = 512 KB × 1 = 512 KB
RAID level case 2: RAID 1 (2D+2P)
Total size of data stripes 52 = 512 KB × 2 = 1024 KB
RAID level case 3: RAID 5 (3D+1P)
Total size of data stripes 52 = 512 KB × 3 = 1536 KB
RAID level case 4: RAID 5 (7D+1P)
Total size of data stripes 52 = 512 KB × 7 = 3584 KB
RAID level case 5: RAID 6 (6D+2P)
Total size of data stripes 52 = 512 KB × 6 = 3072 KB
RAID level case 6: RAID 6 (14D+2P)
Total size of data stripes 52 = 512 KB × 14 = 7168 KB

The least common multiple of the total data stripe sizes in RAID level cases 1 to 6 is 21 MB (= 512 KB × 2 × 3 × 7). If the page size is a natural-number multiple of this least common multiple (21 MB), for example 42 MB (= 21 MB × 2), the page size is divisible by each of 512 KB × 1, 512 KB × 2, 512 KB × 3, 512 KB × 7, 512 KB × 6, and 512 KB × 14. Consequently, whichever of the above RAID levels is used, N whole stripe columns can be associated with a page when data stripes 52 are associated with the page (N is a natural number). In short, the page size is preferably a natural-number multiple of the least common multiple of the total data stripe sizes (stripe size × number of data stripes) corresponding to the different RAID levels. A page size that does not follow this consideration, for example 64 MB, does not give this result.
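The divisibility argument can be verified numerically. The following is a minimal sketch under the values given in the text (512 KB stripes and the six mD+nP cases); the helper names `lcm` and `fits_all` are introduced only for this illustration.

```python
from functools import reduce
from math import gcd

KB = 1024
MB = 1024 * KB
STRIPE_SIZE = 512 * KB
DATA_STRIPE_COUNTS = [1, 2, 3, 7, 6, 14]   # m for RAID level cases 1 to 6


def lcm(a: int, b: int) -> int:
    """Least common multiple of two positive integers."""
    return a // gcd(a, b) * b


# Total data stripe size per stripe column for each RAID level case.
totals = [STRIPE_SIZE * m for m in DATA_STRIPE_COUNTS]
base = reduce(lcm, totals)                 # 21 MB


def fits_all(page_size: int) -> bool:
    """True if a page holds a whole number of stripe columns in every case."""
    return all(page_size % total == 0 for total in totals)


print(base // MB)            # prints 21
print(fits_all(42 * MB))     # prints True
print(fits_all(64 * MB))     # prints False (64 MB is not a multiple of 1536 KB)
```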

The page size can be determined according to this consideration. It is also desirable, however, that the page size be appropriate in relation to the zone size defined in advance in the SMR-HDD 51. This raises a technical problem: determining an appropriate page size that also takes the zone size into account is difficult, and the area management burden is large.

As described above, this technical problem can be solved by a simple technical measure: for at least one sequential zone 50S, an unused area 53 is provided instead of making the entire zone a usable area. That is, rather than adjusting the page size, the zone-side size (parcel size) is adjusted by providing an unused area 53 within the zone, so that the size relationship between storage areas in different storage tiers can be set appropriately. If the technical problem remained, it would be difficult to adopt PDEVs such as the SMR-HDD 51 as the PDEVs constituting a RAID group of the storage system. Because the above measure solves the problem, it is possible to provide a storage system equipped with SMR-HDDs 51, that is, a storage system with a larger capacity than one equipped with normal HDDs.

The present embodiment is described in detail below. Note that an unused area 53 may also be provided in a conventional zone 50C, as shown in FIG. 1.

FIG. 2 shows the configuration of the entire system according to Embodiment 1.

The storage system 100 is connected to one or more hosts 300 via a communication network such as a SAN (Storage Area Network).

The storage system 100 has a plurality of PDEVs, including a plurality of SMR-HDDs 51, and a storage controller 110 connected to the plurality of PDEVs. RAID groups based on SMR-HDDs 51 and RAID groups based on other types of PDEVs (for example, SSDs or normal HDDs) may coexist in the storage system 100.

The storage controller 110 performs the adjustment of the zone-side size (parcel size) described with reference to FIG. 1. Specifically, for example, the storage controller 110 sends the SMR-HDD 51 a command that queries the zone size (for example, the command “REPORT ZONE” supported by Zoned Block Commands) and receives from the SMR-HDD 51 a response to that command (a response describing the zone size). The storage controller 110 calculates the parcel size for each RAID configuration by evaluating (Equation 2) above using the RAID configuration (which parcels 41 of which SMR-HDDs 51 constitute the RAID), the zone size of the SMR-HDD 51, and the number of data stripes (m) in the RAID configuration. The storage controller 110 also calculates the usable capacity of the storage system 100 by evaluating (Equation 1) above using the parcel size, the number of parcels in the storage system 100, the number of data stripes (m) in the RAID configuration, and the number of parity stripes (n) in the RAID configuration.
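The two calculations performed by the storage controller can be sketched as follows. This is an illustrative sketch only: the zone size is passed in as a plain number standing in for the value reported by the drive (no device command is issued), and the names `RaidConfig`, `parcel_size`, and `usable_capacity`, as well as the example of 8 parcels, are assumptions rather than anything defined in the text.

```python
from dataclasses import dataclass

MB = 1024 * 1024


@dataclass(frozen=True)
class RaidConfig:
    m: int  # data stripes per stripe column
    n: int  # parity stripes per stripe column


def parcel_size(zone_size: int, page_size: int, cfg: RaidConfig) -> int:
    """Equation 2: largest multiple of (page_size / m) that fits in one zone."""
    per_device = page_size // cfg.m
    return (zone_size // per_device) * per_device


def usable_capacity(parcel_sz: int, parcel_count: int, cfg: RaidConfig) -> int:
    """Equation 1: parcel size x m / (m + n) x number of parcels.

    The multiplication is done before the division so that the result stays
    exact in integer arithmetic when parcel_count is a multiple of (m + n).
    """
    return parcel_sz * parcel_count * cfg.m // (cfg.m + cfg.n)


reported_zone_size = 256 * MB       # stand-in for the drive's reported zone size
cfg = RaidConfig(m=3, n=1)          # 3D+1P
psize = parcel_size(reported_zone_size, page_size=42 * MB, cfg=cfg)
usable = usable_capacity(psize, parcel_count=8, cfg=cfg)
print(psize // MB, usable // MB)    # prints "252 1512"
```

For 3D+1P with 8 parcels of 252 MB each, three quarters of the 2016 MB of parcel space holds user data, which gives 1512 MB.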

The storage controller 110 has an interface unit, a storage unit, and a processor unit. Examples of the interface unit are a plurality of FC (Fibre Channel) I/Fs (interfaces) 113, which are an example of a front-end interface unit connected to the host, and a plurality of SAS (Serial Attached SCSI) I/Fs 115, which are an example of a back-end interface unit connected to the PDEVs. An example of the storage unit is the memory 112. An example of the processor unit is the CPU 111.

The FC I/F 113 is an interface for communicating with the host. The SAS I/F 115 is an interface for communicating with the SMR-HDDs 51. The memory 112 stores programs and tables. The memory 112 may include a cache memory area, in which data input to and output from PDEVs such as the SMR-HDDs 51 is temporarily stored. The CPU 111 reads programs from the memory 112, executes them, and uses the tables in the memory 112.

The storage controller 110 further has an ASIC (Application Specific Integrated Circuit) 116, which is an example of a hardware circuit. The ASIC 116 may be an example of an element included in the processor unit. Instead of the ASIC 116, another type of hardware circuit (for example, an FPGA (Field-Programmable Gate Array)) may be employed. The ASIC 116 creates parity. In this embodiment, an ASIC 116 and a SAS I/F 115 are provided for each PDEV group 121. The PDEV group 121 may or may not be a RAID group.

Although not shown, the storage system 100 may have a battery that guards against power failure in order to protect dirty data (data that is stored in the cache memory area but not yet stored in a PDEV). The memory 112 may be duplicated. When the memory 112 is duplicated, the duplicated memories 112 may have separate power supply systems.

　ストレージコントローラ１１０は、後述のLog-Structuredライトを実行してよい。ストレージコントローラ１１０は、Log-Structuredライトに関し、データ量削減のための重複排除機能及び圧縮機能のうちの少なくとも１つを有していてもよい。 The storage controller 110 may execute a Log-Structured write described later. The storage controller 110 may have at least one of a deduplication function and a compression function for reducing the amount of data regarding the Log-Structured write.

　図３は、実施例１に係る記憶階層の一例を示す。 FIG. 3 shows an example of a storage hierarchy according to the first embodiment.

In this example, distributed RAID is adopted. In ordinary RAID, a RAID group is configured with an HDD as the basic unit, whereas in distributed RAID, a RAID group is configured with a parcel as the basic unit. A RAID group composed of a plurality of parcels 41 is called a chunk 303. For example, suppose that the plurality of SMR-HDD groups are a first SMR-HDD group consisting of SMR-HDDs 1-1 to 1-3, a second SMR-HDD group consisting of SMR-HDDs 2-1 to 2-3, a third SMR-HDD group consisting of SMR-HDDs 3-1 to 3-3, and a fourth SMR-HDD group consisting of SMR-HDDs 4-1 to 4-3. As one example of realizing distributed RAID, a chunk 303 is composed of a parcel 41 in the SMR-HDD 1-1, a parcel 41 in the SMR-HDD 2-2, a parcel 41 in the SMR-HDD 3-1, and a parcel 41 in the SMR-HDD 4-3. There are multiple possible combinations of SMR-HDDs 51 that store the parcels 41 constituting a chunk 303. In distributed RAID, when a certain SMR-HDD 51 fails, the CPU 111 can select SMR-HDDs 51 so that (almost) all of the SMR-HDDs 51 in the storage system 100 take part in the RAID rebuild (correction copy) processing.

　チャンク３０３が有する複数のデータストライプが、複数のチャンクページ６０を構成する。チャンクページ６０が、仮想ページ３０２に割り当てられる記憶領域である。チャンク３０３は、複数の仮想ボリューム１４１に共通であってもよいし、仮想ボリューム１４１毎にチャンク３０３が設けられてもよい。１以上のチャンク３０３が、例えば、Thin Provisioningに従う容量プール（チャンクページ６０のプール）に相当してよい。 A plurality of data stripes included in the chunk 303 constitute a plurality of chunk pages 60. The chunk page 60 is a storage area allocated to the virtual page 302. The chunk 303 may be common to a plurality of virtual volumes 141, or a chunk 303 may be provided for each virtual volume 141. One or more chunks 303 may correspond to, for example, a capacity pool (a pool of chunk pages 60) according to Thin Provisioning.

There are a plurality of virtual volumes 141 and a plurality of logical volumes 151. The virtual volumes 141 and the logical volumes 151 may correspond one-to-one, but in this embodiment they correspond many-to-many. That is, an area (for example, a logical page 301) in a logical volume 151 corresponds to an area (for example, a virtual page 302) in a virtual volume 141, and another area in the same logical volume 151 corresponds to an area in a different virtual volume 141. The logical volumes 151 and the virtual volumes 141 are associated by the cluster mapping table described later. The logical volumes 151 are provided to the host 300.

　図４は、メモリ１１２内のプログラム、テーブル及び領域を示す。 FIG. 4 shows programs, tables, and areas in the memory 112.

The memory 112 stores an I/O control program 401 and an area management program 402. The memory 112 also stores a cluster mapping table 411, a cluster reverse mapping table 412, a page mapping table 413, a chunk mapping table 414, a parcel mapping table 415, a RAID management table 416, a write destination virtual page management table 417, a virtual page write pointer management table 418, a logical volume management table 419, an SMR-HDD management table 420, a cache management table 421, a temporary storage area management table 422, and a zone management table 423. The memory 112 also has a cache memory area 430. At least one of the programs and tables in the memory 112 may be copied (backed up) to at least one SMR-HDD 51.

The I/O control program 401 controls data I/O. The area management program 402 performs tasks such as calculating the capacity of areas.

The cluster mapping table 411 holds the cluster correspondence in the direction from logical clusters to virtual clusters (the mapping from logical clusters to virtual clusters). The cluster reverse mapping table 412 holds the cluster correspondence in the direction from virtual clusters to logical clusters (the mapping from virtual clusters to logical clusters). The page mapping table 413 holds the correspondence between virtual pages 302 and chunk pages 60. The chunk mapping table 414 holds the correspondence between chunks 303 and chunk pages 60 and the correspondence between chunk pages 60 and stripes 52. The parcel mapping table 415 holds the correspondence between parcels 41 and stripes 52 and the correspondence between parcels 41 and zones 50. The correspondence between parcels 41 and zones 50 may include the type of the zone 50 that contains the parcel 41 (conventional zone or sequential zone). By referring to at least one of these mapping tables, the storage controller 110 can identify how areas correspond between levels of the storage hierarchy. Specifically, for example, it can follow the areas (addresses) through a plurality of levels and identify the final write destination or read source area (for example, a stripe 52 or a parcel 41).

　ＲＡＩＤ管理テーブル４１６は、ＲＡＩＤグループ毎のＲＡＩＤ構成（ＲＡＩＤレベル）に関する情報（例えば、ＳＭＲ－ＨＤＤ５１の番号、パーセル４１の番号、ＲＡＩＤレベル等）を保持する。ライト先仮想ページ管理テーブル４１７は、論理ボリューム１５１毎にライト先仮想ページ（現在の書き込み対象の仮想ページ３０２）の情報を保持するテーブルである。仮想ページライトポインタ管理テーブル４１８は、仮想ページ３０２毎に仮想ページライトポインタ（仮想ページ３０２内の相対位置アドレス）の情報を保持するテーブルである。論理ボリューム管理テーブル４１９は、論理ボリューム１５１毎の容量の情報を保持するテーブルである。ＳＭＲ－ＨＤＤ管理テーブル４２０は、ＳＭＲ－ＨＤＤ５１毎にＨＤＤ番号、ゾーンサイズ、パーセルサイズ及び不使用領域サイズの情報を保持するテーブルである。キャッシュ管理テーブル４２１は、キャッシュスロット（後述）とストライプ５２との対応関係を保持するテーブルである。一時格納領域管理テーブル４２２は、一時格納領域として利用されているストライプ５２に関する情報を保持するテーブルである。ゾーン管理テーブル４２３は、ゾーン５０に関する情報を保持するテーブルである。 The RAID management table 416 holds information on the RAID configuration (RAID level) for each RAID group (for example, the SMR-HDD 51 number, parcel 41 number, RAID level, etc.). The write destination virtual page management table 417 is a table that holds information on the write destination virtual page (current write target virtual page 302) for each logical volume 151. The virtual page write pointer management table 418 is a table that holds information on a virtual page write pointer (relative position address in the virtual page 302) for each virtual page 302. The logical volume management table 419 is a table that holds capacity information for each logical volume 151. The SMR-HDD management table 420 is a table that holds information on the HDD number, zone size, parcel size, and unused area size for each SMR-HDD 51. The cache management table 421 is a table that holds the correspondence between cache slots (described later) and stripes 52. The temporary storage area management table 422 is a table that holds information regarding the stripe 52 used as a temporary storage area. The zone management table 423 is a table that holds information regarding the zone 50.

　キャッシュメモリ領域４３０は、複数のキャッシュスロットで構成される。つまり、キャッシュスロットとは、キャッシュメモリ領域４３０の一部である。 The cache memory area 430 is composed of a plurality of cache slots. That is, the cache slot is a part of the cache memory area 430.

FIG. 5 shows the correspondence between a chunk 303 and chunk pages 60 in accordance with RAID 5.

　チャンク３０３は、複数のパーセル４１で構成される。その複数のパーセル４１は、典型的には、異なる複数のＳＭＲ－ＨＤＤ５１にそれぞれ存在する。 The chunk 303 is composed of a plurality of parcels 41. The plurality of parcels 41 typically exist in a plurality of different SMR-HDDs 51, respectively.

The chunk 303 can be regarded as a RAID group composed of a plurality of parcels 41. Because of RAID 5, data stripes 52D and parity stripes 52P are mixed in each parcel 41. In FIG. 5, FIG. 6, and the subsequent figures, a square marked with a number such as “1”, “2”, “3”, ... denotes a data stripe 52D, and a square marked with “P” denotes a parity stripe 52P.

　１つのチャンク３０３が、複数（又は１つ）のチャンクページ６０を含む。各チャンクページ６０は、複数のストライプ５２で構成されている。それら複数のストライプ５２は、いずれも、データストライプ５２Ｄである。各チャンクページ６０は、パリティストライプ５２Ｐを含まない。 One chunk 303 includes a plurality (or one) of chunk pages 60. Each chunk page 60 is composed of a plurality of stripes 52. The plurality of stripes 52 are all data stripes 52D. Each chunk page 60 does not include the parity stripe 52P.
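The relationship between a RAID 5 chunk and its chunk pages can be illustrated with a small sketch. It is not the layout of FIG. 5: the parity rotation rule, the function names `raid5_layout` and `chunk_pages`, and the page length are assumptions chosen only to show that parity stripes are spread over every parcel while chunk pages collect data stripes only.

```python
def raid5_layout(num_parcels, rows):
    """Build layout[row][parcel]: 'P' for a parity stripe, else a data stripe number.

    The rotation rule below is an assumption for illustration only.
    """
    layout = []
    next_data = 1
    for row in range(rows):
        parity_parcel = (num_parcels - 1 - row) % num_parcels
        line = []
        for parcel in range(num_parcels):
            if parcel == parity_parcel:
                line.append("P")
            else:
                line.append(next_data)
                next_data += 1
        layout.append(line)
    return layout


def chunk_pages(layout, stripes_per_page):
    """Group the data stripes (never parity stripes) into chunk pages."""
    data = [s for row in layout for s in row if s != "P"]
    return [data[i:i + stripes_per_page]
            for i in range(0, len(data), stripes_per_page)]


layout = raid5_layout(num_parcels=4, rows=4)
pages = chunk_pages(layout, stripes_per_page=6)
```

With four parcels and four rows, every parcel holds exactly one parity stripe, and the twelve data stripes form two chunk pages of six stripes each.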

FIG. 6 shows the correspondence between a chunk 303 and chunk pages 60 in accordance with RAID 4.

　ＲＡＩＤ４に従う対応関係のため、チャンク３０３における或るパーセル４１を構成する複数のストライプ５２は、いずれもパリティストライプ５２Ｐである。 Because of the correspondence according to RAID 4, all of the plurality of stripes 52 constituting a certain parcel 41 in the chunk 303 are parity stripes 52P.

　ＲＡＩＤ４は、例えば、後述のパリティ生成方式（２）で採用されるＲＡＩＤレベル（ＲＡＩＤ構成）である。ＲＡＩＤ４では、或るパーセル４１内のストライプ５２はいずれもパリティストライプ５２Ｐである。或るパーセル４１を含んだゾーン５０は、コンベンショナルゾーン５０Ｃでよい。 RAID 4 is, for example, a RAID level (RAID configuration) employed in a parity generation method (2) described later. In RAID4, all the stripes 52 in a certain parcel 41 are parity stripes 52P. A zone 50 including a certain parcel 41 may be a conventional zone 50C.

　図７は、アドレスマッピングの一例を示す。 FIG. 7 shows an example of address mapping.

　論理ボリューム１５１は、ホスト３００に提供される。論理ボリューム１５１は、複数の論理ページ３０１で構成される。論理ページ３０１のサイズは、固定サイズ（例えば４２ＭＢ）である。各論理ページ３０１は、複数の論理クラスタ７０１で構成される。論理クラスタ７０１のサイズも、固定サイズ（例えば８ＫＢ）である。 The logical volume 151 is provided to the host 300. The logical volume 151 is composed of a plurality of logical pages 301. The size of the logical page 301 is a fixed size (for example, 42 MB). Each logical page 301 is composed of a plurality of logical clusters 701. The size of the logical cluster 701 is also a fixed size (for example, 8 KB).

　仮想ボリューム１４１は、ホスト３００に直接提供されないいわゆる内部ボリュームである。仮想ボリューム１４１は、複数の仮想ページ３０２で構成される。仮想ページ３０２のサイズも、固定サイズ（例えば４２ＭＢ）である。各仮想ページ３０２は、複数の仮想クラスタ７０２で構成される。仮想クラスタ７０２のサイズは、固定サイズでもよいが、本実施例では可変サイズである。例えば、仮想クラスタ７０２のサイズは、５１２Ｂ（バイト）×Ｎ（Ｎは自然数）である。ストレージコントローラ１１０の圧縮機能が利用されている場合には、仮想クラスタのサイズは、論理クラスタのサイズより小さいことがありうる。 The virtual volume 141 is a so-called internal volume that is not directly provided to the host 300. The virtual volume 141 is composed of a plurality of virtual pages 302. The size of the virtual page 302 is also a fixed size (for example, 42 MB). Each virtual page 302 is composed of a plurality of virtual clusters 702. The size of the virtual cluster 702 may be a fixed size, but is a variable size in this embodiment. For example, the size of the virtual cluster 702 is 512 B (bytes) × N (N is a natural number). When the compression function of the storage controller 110 is used, the size of the virtual cluster may be smaller than the size of the logical cluster.
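As a rough illustration of why a virtual cluster can be smaller than a logical cluster, the following sketch compresses one logical cluster and rounds the result up to a multiple of 512 bytes. The use of `zlib` and the fallback to the uncompressed size are assumptions made for the example; the text does not name a specific compression library.

```python
import zlib

UNIT = 512  # virtual cluster sizes are 512 B x N (N is a natural number)


def virtual_cluster_size(logical_data: bytes) -> int:
    """Return the size in bytes of the virtual cluster for one logical cluster."""
    compressed = zlib.compress(logical_data)
    # Assumed policy: keep the uncompressed payload if compression does not help.
    payload_len = min(len(compressed), len(logical_data))
    n = max(1, -(-payload_len // UNIT))  # ceiling division, N >= 1
    return n * UNIT


size_of_zero_filled_cluster = virtual_cluster_size(bytes(8 * 1024))
```

An 8 KB logical cluster that is highly compressible (such as the zero-filled one above) fits in a single 512-byte unit, whereas incompressible data stays at the logical cluster size.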

　論理クラスタ７０１と仮想クラスタ７０２は、クラスタマッピングテーブル４１１及びクラスタ逆マッピングテーブル４１２によりマッピングされている。論理クラスタ７０１と仮想クラスタ７０２は、Ｘ：１の関係になっている（Ｘは自然数）。論理クラスタ７０１が１つの仮想クラスタ７０２に重複排除されている場合、Ｘは１より大きい。論理クラスタ７０１と仮想クラスタ７０２間のマッピングを、「クラスタマッピング」と呼ぶことができる。 The logical cluster 701 and the virtual cluster 702 are mapped by the cluster mapping table 411 and the cluster reverse mapping table 412. The logical cluster 701 and the virtual cluster 702 have a relationship of X: 1 (X is a natural number). X is greater than 1 if the logical cluster 701 is deduplicated into one virtual cluster 702. The mapping between the logical cluster 701 and the virtual cluster 702 can be called “cluster mapping”.
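The X:1 relationship can be sketched with a forward map and a reverse map kept in step with each other. The class `ClusterMaps` and its method names are hypothetical, and the identifiers are simplified to tuples; the actual tables 411 and 412 hold the fields described with FIG. 10 and FIG. 11.

```python
from collections import defaultdict


class ClusterMaps:
    """Toy model of the cluster mapping and cluster reverse mapping tables."""

    def __init__(self):
        self.forward = {}                # logical cluster -> virtual cluster
        self.reverse = defaultdict(set)  # virtual cluster -> logical clusters

    def map(self, logical, virtual):
        """Map a logical cluster to a virtual cluster, replacing any old mapping."""
        old = self.forward.get(logical)
        if old is not None:
            self.reverse[old].discard(logical)
            if not self.reverse[old]:
                del self.reverse[old]
        self.forward[logical] = virtual
        self.reverse[virtual].add(logical)

    def sharers(self, virtual):
        """Return X, the number of logical clusters mapped to this virtual cluster."""
        return len(self.reverse.get(virtual, ()))


maps = ClusterMaps()
maps.map(("LV0", 0), ("VV0", 0))
maps.map(("LV0", 16), ("VV0", 0))  # deduplicated onto the same virtual cluster
```

Here two logical clusters share one virtual cluster, so X is 2 for that virtual cluster.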

One chunk page 60 is mapped to one virtual page 302. For each logical page 301 (or for each virtual page 302), statistical information such as the frequency of I/O from the host 300 can be stored in at least one table in the memory 112. The mapping between virtual pages 302 and chunk pages 60 can be called “page mapping”.

By referring to at least one of the plurality of mapping tables described above, the storage controller 110 can identify the final read source area (for example, a stripe 52 or a parcel 41). Specifically, for example, a read request that the storage controller 110 receives from the host 300 specifies a read source. Assume that the information representing the read source includes, for example, a logical volume number and a logical address. The logical volume number is, for example, a LUN (Logical Unit Number). The logical address is, for example, an LBA (Logical Block Address). From the information representing the read source, the final read source area can be identified by the following flow.
(S1) The I/O control program 401 identifies the logical cluster 701 from the logical volume number and the logical address given as the read source. It also identifies the offset of the logical address specified as the read source from the head address of the logical page 301 that contains the identified logical cluster 701.
(S2) The I/O control program 401 refers to the cluster mapping table 411 to identify the virtual cluster 702 corresponding to the identified logical cluster 701.
(S3) The I/O control program 401 refers to the page mapping table 413 to identify the chunk page 60 allocated to the virtual page 302 that contains the identified virtual cluster 702.
(S4) The I/O control program 401 identifies the data stripe 52D included in the identified chunk page 60 on the basis of the chunk mapping table 414 and the offset identified above.
(S5) The I/O control program 401 refers to the parcel mapping table 415 to identify the parcel 41 that contains the identified data stripe 52D and the SMR-HDD 51 that provides that parcel 41.
(S6) The I/O control program 401 sends the identified SMR-HDD 51 a read command specifying the address of the identified data stripe 52D.
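The flow S1 to S6 can be sketched as a chain of table lookups. Everything below is a simplified model under stated assumptions: the tables are plain dictionaries, addresses are byte offsets rather than LBAs, the stripe size is an arbitrary illustrative value, and the stripe is chosen from the virtual cluster's position within its virtual page, which is only one possible reading of the offset used in S4.

```python
LOGICAL_CLUSTER = 8 * 1024     # example logical cluster size from the text
PAGE_SIZE = 42 * 1024 * 1024   # example page size from the text
STRIPE_SIZE = 512 * 1024       # assumed stripe size, for illustration only


def resolve_read(lun, logical_addr, tables):
    """Follow the mapping tables from a (LUN, address) pair to a drive location."""
    # S1: identify the logical cluster that contains the address.
    cluster_index = logical_addr // LOGICAL_CLUSTER
    # S2: cluster mapping table -> virtual cluster (volume, byte address).
    vvol, vaddr = tables["cluster"][(lun, cluster_index)]
    # S3: page mapping table -> chunk page allocated to the virtual page.
    chunk_page = tables["page"][(vvol, vaddr // PAGE_SIZE)]
    # S4: chunk mapping table + offset in the page -> data stripe.
    stripe = tables["chunk"][chunk_page][(vaddr % PAGE_SIZE) // STRIPE_SIZE]
    # S5: parcel mapping table -> parcel and the drive that provides it.
    hdd, parcel, stripe_addr = tables["parcel"][stripe]
    # S6: the read command would be sent to this drive and address.
    return {"hdd": hdd, "parcel": parcel, "address": stripe_addr}


tables = {
    "cluster": {(0, 3): ("VV0", PAGE_SIZE + STRIPE_SIZE + 4096)},
    "page": {("VV0", 1): "CP7"},
    "chunk": {"CP7": ["S0", "S1", "S2"]},
    "parcel": {"S1": ("HDD2-2", 5, 123456)},
}
target = resolve_read(lun=0, logical_addr=3 * LOGICAL_CLUSTER + 100, tables=tables)
```

In this toy data set the read resolves to stripe `S1` in parcel 5 of the drive labeled `HDD2-2`.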

Incidentally, Log-Structured write can be adopted. For example, Log-Structured write can be adopted for writes to the virtual page 302. Specifically, for example, when the logical volume 151 is updated, under the Log-Structured write method the old data (pre-update data) in the virtual page 302 is not overwritten with the new data (post-update data). Instead, a free virtual cluster 702 (an example of a free area) in the virtual page 302 is secured, and the new data is written to the secured free virtual cluster 702. The free virtual cluster 702 can be identified by the write pointer for the virtual page 302. In this case, the logical cluster 701 to which the old virtual cluster 702 (the virtual cluster 702 storing the old data) was mapped is remapped to the new virtual cluster 702 (the virtual cluster 702 to which the new data was written) in place of the old virtual cluster 702.
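A minimal sketch of this append-and-remap behavior follows. The class `VirtualPage`, the function `log_structured_write`, and the use of a plain dictionary as the cluster mapping are hypothetical simplifications; real virtual clusters would also carry the data and the fields of table 411.

```python
class VirtualPage:
    """Toy virtual page with a write pointer (relative position in the page)."""

    def __init__(self, size):
        self.size = size
        self.write_pointer = 0


def log_structured_write(page, cluster_map, logical_cluster, length):
    """Append new data at the write pointer and remap the logical cluster.

    Returns (old_address, new_address); old_address is None on a first write.
    The old virtual cluster is left in place and becomes garbage.
    """
    if page.write_pointer + length > page.size:
        raise ValueError("page full: a new write destination page is needed")
    new_address = page.write_pointer
    page.write_pointer += length
    old_address = cluster_map.get(logical_cluster)
    cluster_map[logical_cluster] = new_address
    return old_address, new_address


page = VirtualPage(size=4096)
cluster_map = {}
first = log_structured_write(page, cluster_map, ("LV0", 0), 512)
second = log_structured_write(page, cluster_map, ("LV0", 0), 512)  # an update
```

The update does not touch offset 0; it lands at offset 512, and only the mapping changes.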

Note that an old virtual cluster 702, that is, a virtual cluster 702 that is not mapped to any logical cluster 701, is invalid data or an unused area. Invalid data can also be called garbage data. Garbage data may be reclaimed by so-called garbage collection processing. This is because, if even one old virtual cluster 702 is contained in a virtual page 302, a chunk page 60 remains mapped to that virtual page 302, and allocatable chunk pages 60 may run short. Garbage collection processing is performed in units of chunks 303. Garbage collection processing is, for example, as follows. The storage controller 110 copies only the valid data (new data) from the copy source chunk 303 to the copy destination chunk 303 (a free (unused) chunk 303). The storage controller 110 discards the invalid data from the copy source chunk 303 and manages the copy source chunk 303 as a free chunk 303. In addition, for each of the plurality of parcels 41 included in the free chunk 303, the storage controller 110 resets the drive write pointer of the sequential zone 50S that contains the parcel 41 to the head of that zone by means of the Zoned Block Command “RESET WRITE POINTER” (an example of a drive write pointer reset command). As a result, the drive write pointer 1504 described later is updated. Thus, in the SMR-HDD 51, the unit of resetting the write pointer (in other words, the unit in which a used area in the SMR-HDD 51 is changed to a free (unused) area) is the zone 50. Moreover, because one chunk 303 is associated with a plurality of zones 50, garbage collection processing necessarily has to be executed in units of chunks 303 (for example, garbage collection processing in units of chunk pages 60 cannot be executed).
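The chunk-level garbage collection described above can be sketched as follows. This is a simplified model, not the controller's procedure: chunks are dictionaries, valid data is identified by a set of still-referenced cluster IDs, and `reset_write_pointer` is a hypothetical callback standing in for issuing the reset command to a drive. Remapping of the copied clusters is omitted.

```python
def garbage_collect(src_chunk, free_chunk, referenced, reset_write_pointer):
    """Copy valid clusters to a free chunk, then free the whole source chunk."""
    # Copy only valid data (clusters that some logical cluster still maps to).
    for cluster_id, data in src_chunk["clusters"].items():
        if cluster_id in referenced:
            free_chunk["clusters"][cluster_id] = data
    # Discard everything left in the source chunk; it becomes a free chunk.
    src_chunk["clusters"] = {}
    # Reset the drive write pointer of every sequential zone behind the chunk.
    for zone in src_chunk["sequential_zones"]:
        reset_write_pointer(zone)
    return free_chunk


resets = []
src = {
    "clusters": {"a": b"new", "b": b"old", "c": b"new"},
    "sequential_zones": [("HDD1-1", 3), ("HDD2-2", 7)],
}
dst = {"clusters": {}, "sequential_zones": [("HDD3-1", 1), ("HDD4-3", 9)]}
garbage_collect(src, dst, referenced={"a", "c"}, reset_write_pointer=resets.append)
```

Because the reset is issued per zone and a chunk spans several zones, the sketch frees the source chunk as a whole rather than page by page.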

　図８は、論理ボリューム管理テーブル４１９の構成を示す。 FIG. 8 shows the configuration of the logical volume management table 419.

　論理ボリューム管理テーブル４１９は、論理ボリューム１５１毎に、論理ボリューム番号８０１及び容量８０２を保持する。論理ボリューム番号８０１は、論理ボリューム１５１の番号を表す。容量８０２は、論理ボリューム１５１の容量（使用可能容量）を表す。 The logical volume management table 419 holds a logical volume number 801 and a capacity 802 for each logical volume 151. The logical volume number 801 represents the number of the logical volume 151. The capacity 802 represents the capacity (usable capacity) of the logical volume 151.

　図９は、ＳＭＲ－ＨＤＤ管理テーブル４２０の構成を示す。 FIG. 9 shows the configuration of the SMR-HDD management table 420.

The SMR-HDD management table 420 holds, for each SMR-HDD 51, an HDD number 901, a zone size 902, a zone type 903, a parcel size 904, and an unused area size 905. The HDD number 901 represents the number of the SMR-HDD 51 that has the zones 50. The zone size 902 represents the size of the zone 50. The zone type 903 represents the type of the zone 50 (conventional zone or sequential zone). The parcel size 904 represents the size of the parcel 41 included in the zone 50. The unused area size 905 represents the size of the unused area 53 included in the zone 50.

　なお、このテーブル４２０に記載のパーセルサイズ及び不使用領域サイズは、例えば、領域管理プログラム４０２により算出されたサイズである。それらのサイズは、例えば、上述のＲＡＩＤ管理テーブル４１６を基に上述の（式２）から得られる。 The parcel size and the unused area size described in this table 420 are, for example, the sizes calculated by the area management program 402. These sizes are obtained from (Expression 2) described above based on the RAID management table 416 described above, for example.

The start address (LBA) and the end address (LBA) of a parcel 41 are defined as follows.
- Start address of parcel 41 = parcel number × (zone size ÷ sector size)
- End address of parcel 41 = parcel number × (zone size ÷ sector size) + parcel size ÷ sector size − 1
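The two definitions can be checked with a short calculation. The function name `parcel_lba_range` and the sample zone and parcel sizes are assumptions for illustration; only the formulas come from the text.

```python
def parcel_lba_range(parcel_number, zone_size, parcel_size, sector_size=512):
    """Return (start LBA, end LBA) of a parcel; sizes are given in bytes."""
    sectors_per_zone = zone_size // sector_size
    start = parcel_number * sectors_per_zone
    end = start + parcel_size // sector_size - 1
    return start, end


MIB = 1024 * 1024
# Hypothetical sizes: 256 MiB zones, each carrying one 252 MiB parcel.
start_lba, end_lba = parcel_lba_range(parcel_number=2,
                                      zone_size=256 * MIB,
                                      parcel_size=252 * MIB)
```

With these sample sizes, parcel 2 starts at LBA 1048576 and ends at LBA 1564671. The sectors between the end of the parcel and the start of the next zone correspond to the unused area.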

　なお、「セクタ」とは、ＨＤＤの論理記憶領域の最小単位である。セクタサイズは、典型的なＨＤＤにおいては５１２バイトである。 Note that the “sector” is the smallest unit of the logical storage area of the HDD. The sector size is 512 bytes in a typical HDD.

　図１０は、クラスタマッピングテーブル４１１の構成を示す。 FIG. 10 shows the configuration of the cluster mapping table 411.

　クラスタマッピングテーブル４１１は、論理クラスタ７０１毎に、論理クラスタ識別情報１００１及び仮想クラスタ識別情報１００２を保持する。論理クラスタ識別情報１００１は、論理クラスタ７０１の識別情報であり、例えば、論理ボリューム番号、及び、論理ボリューム１５１についてのＬＢＡを表す。仮想クラスタ識別情報１００２は、論理クラスタ７０１からマッピングされた仮想クラスタ７０２の識別情報であり、例えば、仮想ボリューム１４１番号、仮想ボリューム１４１についてのＬＢＡ、及び、仮想クラスタ７０２のサイズを表す。 The cluster mapping table 411 holds logical cluster identification information 1001 and virtual cluster identification information 1002 for each logical cluster 701. The logical cluster identification information 1001 is identification information of the logical cluster 701 and represents, for example, a logical volume number and an LBA for the logical volume 151. The virtual cluster identification information 1002 is identification information of the virtual cluster 702 mapped from the logical cluster 701, and represents the virtual volume 141 number, the LBA for the virtual volume 141, and the size of the virtual cluster 702, for example.

　図１１は、クラスタ逆マッピングテーブル４１２の構成を示す。 FIG. 11 shows the configuration of the cluster reverse mapping table 412.

The cluster reverse mapping table 412 holds virtual cluster identification information 1101 and logical cluster identification information 1102 for each virtual cluster 702. The configurations of the virtual cluster identification information 1101 and the logical cluster identification information 1102 are as described with reference to FIG. 10.

　図１２は、キャッシュ管理テーブル４２１の構成を示す。 FIG. 12 shows the configuration of the cache management table 421.

　キャッシュ管理テーブル４２１は、複数のキャッシュスロット１２０１に対応した複数のエントリ１２０２を有する。具体的には、キャッシュ管理テーブル４２１には、キャッシュステータス毎にキューがあり、キューに、そのキューに対応したキャッシュステータスのスロット１２０１に対応したエントリ１２０２が接続されている。キャッシュステータスとしては、空き、クリーン及びダーティがある。空きスロットは、データが格納されていないスロット１２０１である。クリーンスロットは、クリーンデータ（ＳＭＲ－ＨＤＤ５１に格納済のデータ）のみが格納されているスロット１２０１である。ダーティスロットは、ダーティデータ（ＳＭＲ－ＨＤＤ５１に未格納のデータ）が格納されているスロット１２０１である。クリーンデータ及びダーティデータのようにキャッシュメモリ領域４３０に格納されているデータを「キャッシュデータ」と呼ぶことができる。 The cache management table 421 has a plurality of entries 1202 corresponding to a plurality of cache slots 1201. Specifically, the cache management table 421 has a queue for each cache status, and an entry 1202 corresponding to the cache status slot 1201 corresponding to the queue is connected to the queue. The cache status includes empty, clean, and dirty. An empty slot is a slot 1201 in which no data is stored. The clean slot is a slot 1201 in which only clean data (data already stored in the SMR-HDD 51) is stored. The dirty slot is a slot 1201 in which dirty data (data not stored in the SMR-HDD 51) is stored. Data stored in the cache memory area 430 such as clean data and dirty data can be called “cache data”.
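One way to picture the per-status queues is the sketch below. The class `CacheTable`, its methods, and the choice of taking the first free slot are assumptions; the text only states that entries are attached to a queue for each cache status.

```python
from collections import deque
from enum import Enum


class Status(Enum):
    FREE = "free"
    CLEAN = "clean"
    DIRTY = "dirty"


class CacheTable:
    """Toy cache management table: one queue of slot entries per cache status."""

    def __init__(self, num_slots):
        self.queues = {status: deque() for status in Status}
        self.status = {}
        for slot in range(num_slots):
            self.queues[Status.FREE].append(slot)
            self.status[slot] = Status.FREE

    def move(self, slot, new_status):
        """Detach the entry from its current queue and attach it to another."""
        self.queues[self.status[slot]].remove(slot)
        self.queues[new_status].append(slot)
        self.status[slot] = new_status

    def allocate_for_write(self):
        """Take a free slot for new write data; it becomes a dirty slot."""
        slot = self.queues[Status.FREE][0]
        self.move(slot, Status.DIRTY)
        return slot


table = CacheTable(num_slots=2)
slot = table.allocate_for_write()   # host write data arrives
table.move(slot, Status.CLEAN)      # after the data is stored on the drive
```

A slot thus moves from the free queue to the dirty queue when write data arrives, and to the clean queue once the data has been stored on the SMR-HDD.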

　キャッシュデータは、上位キャッシュデータと下位キャッシュデータのいずれかに分類される。言い換えれば、ダーティスロット及びクリーンスロットを、上位キャッシュスロットと下位キャッシュスロットとに分類することができる。 Cache data is classified as either upper cache data or lower cache data. In other words, dirty slots and clean slots can be classified into upper cache slots and lower cache slots.

　上位キャッシュスロットは、論理ボリューム１５１内の領域に対応したスロットである。上位キャッシュスロットのサイズは、ストライプサイズ以下である。 The upper cache slot is a slot corresponding to an area in the logical volume 151. The size of the upper cache slot is equal to or smaller than the stripe size.

　下位キャッシュスロットは、仮想ボリューム１４１内の領域に対応したスロットである。下位キャッシュスロットのサイズは、ストライプ５２のサイズ以下である。下位キャッシュスロットは１つのストライプ５２に対応してよい。 The lower cache slot is a slot corresponding to an area in the virtual volume 141. The size of the lower cache slot is equal to or smaller than the size of the stripe 52. A lower cache slot may correspond to one stripe 52.

　クリーンスロット及びダーティスロットは、上述したように、上位キャッシュスロット及び下位キャッシュスロットのいずれかに分類される。一方、空きスロットは、上位キャッシュスロット及び下位キャッシュスロットのいずれにも分類されない。 Clean slots and dirty slots are classified as either upper cache slots or lower cache slots as described above. On the other hand, empty slots are not classified as either upper cache slots or lower cache slots.

　上位キャッシュスロット内のデータが、上位キャッシュデータである。上位キャッシュデータとしてのクリーンデータである上位クリーンデータは、ホスト３００からのライト要求に従うデータの少なくとも一部のデータであって既に仮想ボリューム１４１に書き込まれたデータ、又は、ホスト３００からのリード要求に従うデータの少なくとも一部のデータであって仮想ボリューム１４１から読み出されたデータである。上位キャッシュデータとしてのダーティデータである上位ダーティデータは、ホスト３００からのライト要求に従うデータの少なくとも一部のデータであって未だ仮想ボリューム１４１に書き込まれていないデータである。 The data in the upper cache slot is the upper cache data. The upper clean data, which is the clean data as the upper cache data, is at least a part of the data in accordance with the write request from the host 300 and is already written in the virtual volume 141 or in accordance with the read request from the host 300. The data is at least a part of the data and is read from the virtual volume 141. The upper dirty data, which is dirty data as the upper cache data, is data that is at least a part of data in accordance with a write request from the host 300 and has not yet been written to the virtual volume 141.

　下位キャッシュスロット内のデータが、下位キャッシュデータである。下位キャッシュデータとしてのクリーンデータである下位クリーンデータは、論理ボリューム１５１から書き込まれたデータであって既にＰＤＥＶ（典型的にはＳＭＲ－ＨＤＤ５１）に書き込まれたデータ、又は、ＰＤＥＶから読み出されたデータである。下位キャッシュデータとしてのダーティデータである下位ダーティデータは、論理ボリューム１５１から書き込まれたデータであって未だＰＤＥＶに書き込まれていないデータである。 The data in the lower cache slot is the lower cache data. The lower clean data, which is clean data as the lower cache data, is data written from the logical volume 151 and already written to the PDEV (typically SMR-HDD 51) or read from the PDEV. It is data. The lower dirty data, which is dirty data as lower cache data, is data written from the logical volume 151 and not yet written to the PDEV.

As described above, in this embodiment, the logical volume 151 is provided above the virtual volume 141 in order to realize Log-Structured write for the virtual pages 302, and accordingly cache management is divided into upper-level management and lower-level management.

　なお、ストレージコントローラ１１０の圧縮機能を利用する場合、下位キャッシュスロットには、圧縮済みのデータが格納され得る。圧縮方式としては、ハフマン符号に基づいたアルゴリズムなどの様々な可逆圧縮方式を用いることができる。 Note that, when the compression function of the storage controller 110 is used, compressed data can be stored in the lower cache slot. As the compression method, various lossless compression methods such as an algorithm based on a Huffman code can be used.

　図１３は、ライト先仮想ページ管理テーブル４１７の構成を示す。 FIG. 13 shows the configuration of the write destination virtual page management table 417.

The write destination virtual page management table 417 has a logical volume number 1301 and a write destination virtual page number 1302 for each logical volume 151. The write destination virtual page number 1302 represents the page number of the write destination virtual page (the virtual page 302 currently being written) for the corresponding logical volume 151. The logical volumes 151 and the write destination virtual pages may correspond 1:1, many:1, or many:many.

　図１４は、仮想ページライトポインタ管理テーブル４１８の構成を示す。 FIG. 14 shows the configuration of the virtual page write pointer management table 418.

　仮想ページライトポインタ管理テーブル４１８は、仮想ページ３０２毎に、仮想ページ番号１４０１、仮想ページライトポインタ１４０２及び先頭フラグ１４０３を保持する。仮想ページ番号１４０１は、仮想ページ３０２の番号を表す。仮想ページライトポインタ１４０２は、仮想ページ３０２の仮想ページライトポインタ（仮想ページ３０２内の相対位置情報）を表す。先頭フラグ１４０３は、仮想ページライトポインタが仮想ページ３０２の先頭を指しているか否かを示すフラグである。なお、先頭フラグは無くてもよく、その場合、仮想ページライトポインタの値に基づいて先頭を指しているか否かが判断されてもよい。 The virtual page write pointer management table 418 holds a virtual page number 1401, a virtual page write pointer 1402, and a head flag 1403 for each virtual page 302. The virtual page number 1401 represents the number of the virtual page 302. A virtual page write pointer 1402 represents a virtual page write pointer of the virtual page 302 (relative position information in the virtual page 302). The head flag 1403 is a flag indicating whether or not the virtual page write pointer points to the head of the virtual page 302. Note that the head flag may be omitted, and in that case, it may be determined whether the head is pointed based on the value of the virtual page write pointer.

FIG. 26 shows the configuration of the zone management table 423.

The zone management table 423 holds an HDD number 1501, a zone number 1502, a zone type 1503, and a drive write pointer 1504 for each zone 50. The HDD number 1501 represents the number of the SMR-HDD 51 that contains the zone 50. The zone number 1502 represents the number of the zone 50. The zone type 1503 represents the type of the zone 50.

The drive write pointer 1504 is address information indicating the write pointer in a sequential zone 50S. An LBA of the SMR-HDD 51 is used as the drive write pointer 1504. The drive write pointer 1504 is managed so that it matches the write pointer that the SMR-HDD 51 returns in response to a REPORT ZONE command.

The drive write pointer 1504 is updated when dirty data of a data stripe or a parity stripe is destaged to the SMR-HDD 51, or in response to a RESET WRITE POINTER command issued in the garbage collection process.

Because the drive write pointer can be acquired from the SMR-HDD 51 as the response to a REPORT ZONE command, the drive write pointer 1504 need not be held in the memory 112 for every sequential zone 50S. The drive write pointer may be acquired from the SMR-HDD 51 and registered in the memory 112 as necessary.
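As an illustration only, the on-demand handling of the drive write pointer can be sketched as follows. The class `ZoneTable`, its method names, and the callable `report_zone` are hypothetical and do not appear in the embodiment. `report_zone` stands in for issuing a REPORT ZONE command, and returning the pointer to the first LBA of the zone on reset is an assumption about the effect of RESET WRITE POINTER.

```python
class ZoneTable:
    """Hypothetical in-memory cache of drive write pointers, keyed by (HDD, zone)."""

    def __init__(self, report_zone):
        # report_zone(hdd, zone) -> LBA; assumed stand-in for a REPORT ZONE command.
        self._report_zone = report_zone
        self._write_pointer = {}

    def write_pointer(self, hdd, zone):
        """Return the cached pointer, querying the drive only on first use."""
        key = (hdd, zone)
        if key not in self._write_pointer:
            self._write_pointer[key] = self._report_zone(hdd, zone)
        return self._write_pointer[key]

    def on_destage(self, hdd, zone, blocks):
        """Advance the pointer after `blocks` blocks were destaged to the zone."""
        self._write_pointer[(hdd, zone)] = self.write_pointer(hdd, zone) + blocks

    def on_reset(self, hdd, zone, zone_start_lba):
        """Record the pointer after a RESET WRITE POINTER (assumed: zone start)."""
        self._write_pointer[(hdd, zone)] = zone_start_lba
```

In this sketch the drive is queried at most once per zone, and later destages and resets keep the cached value in step with the drive.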

FIG. 15 shows the flow of the cache memory write process. The cache memory write process writes data to the cache memory area. It is started by the I/O control program 401 when the storage controller 110 receives a write request from the host 300. S1501 and S1502 are performed in response to the write request, that is, they are synchronous processing. S1503 to S1507, on the other hand, are performed separately from the response to the write request, that is, they are asynchronous processing. The asynchronous processing is repeated as appropriate.

The I/O control program 401 executes cache control to secure an upper cache slot as the storage destination of the write data (the data accompanying the write request) (S1501). The I/O control program 401 writes the write data to the upper cache slot and sends a write completion report to the host 300 as the response to the write request (S1502).

Thereafter, the I/O control program 401 determines whether a new write destination virtual page 302 is needed (S1503). For example, if the free area of every in-use virtual page 302 is smaller than the write data, the determination result of S1503 is true. An “in-use virtual page 302” means a virtual page 302 specified by the write destination virtual page number 1302 that corresponds to the write data in the write destination virtual page management table 417. The size of the free area of an in-use virtual page 302 can be determined, based on the virtual page write pointer management table 418, from the virtual page size and the virtual page write pointer 1402 of that in-use virtual page 302.

If the determination result of S1503 is true (S1503: Y), the I/O control program 401 selects, as the write destination virtual page 302, a virtual page 302 whose write pointer is at the head of the virtual page (for example, a virtual page 302 whose head flag is “True”), and updates the write destination virtual page management table 417 (S1504). The I/O control program 401 secures a lower cache slot associated with the virtual cluster 702 indicated by the virtual page write pointer 1402 of the virtual page 302 (S1505). The I/O control program 401 copies the write data from the upper cache slot to which it was written in S1502 to the lower cache slot secured in S1505 (S1506). The I/O control program 401 generates parity using the data in the cache stripe column that includes the lower cache slot (S1507). Parity generation follows one of the parity generation methods (1) to (3) described later.

If the determination result of S1503 is false (S1503: N), the I/O control program 401 skips S1504 and executes S1505 to S1507. In that S1505, the write destination is any in-use virtual page 302 whose free area is at least as large as the write data.
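The determination of S1503 and the selection of S1504 can be sketched as follows, purely as an illustration. `PAGE_SIZE`, `VirtualPage`, and `choose_write_page` are hypothetical names, the page size is an arbitrary value, and the head flag is derived from the write pointer (the variant in which the head flag 1403 is omitted).

```python
from dataclasses import dataclass

PAGE_SIZE = 1024  # assumed virtual page size in bytes (illustrative value)


@dataclass
class VirtualPage:
    number: int
    write_pointer: int = 0  # relative position inside the page

    @property
    def at_head(self) -> bool:
        # Head flag derived from the pointer value instead of being stored.
        return self.write_pointer == 0

    @property
    def free(self) -> int:
        # Free area = virtual page size minus the virtual page write pointer.
        return PAGE_SIZE - self.write_pointer


def choose_write_page(in_use, pool, size):
    """Return (page, newly_selected) for write data of `size` bytes."""
    for page in in_use:
        if page.free >= size:      # S1503: N, an in-use page still has room
            return page, False
    for page in pool:              # S1503: Y, so S1504 picks a page at its head
        if page.at_head and page not in in_use:
            in_use.append(page)
            return page, True
    raise RuntimeError("no unused virtual page available")
```

For example, with one in-use page holding 24 free bytes, a 16-byte write stays on that page, while a 100-byte write causes a fresh page to be selected.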

When a data reduction function, such as at least one of a deduplication function and a compression function, is employed in the storage controller 110, the deduplication process or the compression process may be executed in S1505 and S1506. In the deduplication process, if the data in the logical cluster 701 is duplicate data, the cluster mapping is remapped. If the data in the logical cluster 701 is unique (non-duplicate) data, the data is written to the lower cache slot. In the compression process, the data of the logical cluster 701 is compressed when it is written to the lower cache slot.

In the cache memory write process, the write destination logical cluster 701 (the logical cluster 701 corresponding to the LBA specified in the write request) is either newly mapped to a virtual cluster 702, or its mapping destination changes from the old virtual cluster 702 to a new virtual cluster 702. In addition, the virtual page write pointer of the virtual page 302 that contains the mapping-destination virtual cluster 702 advances by the size of the write data. Accordingly, the I/O control program 401 updates the cluster mapping table 411 and the cluster reverse mapping table 412, and also updates the virtual page write pointer management table 418.
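A minimal sketch of these table updates is given below under simplifying assumptions: the cluster mapping table 411, the cluster reverse mapping table 412, and the virtual page write pointers are modeled as plain dictionaries, and a virtual cluster is identified by a `(page, offset)` tuple. The function `remap` and this representation are hypothetical.

```python
def remap(cluster_map, reverse_map, write_pointers, logical, page, size):
    """Map `logical` to the cluster at the page's write pointer, then advance it.

    cluster_map:    logical cluster -> (page, offset)   (models table 411)
    reverse_map:    (page, offset)  -> logical cluster  (models table 412)
    write_pointers: page -> relative write pointer      (models table 418)
    Returns (old_virtual_cluster_or_None, new_virtual_cluster).
    """
    old = cluster_map.get(logical)
    if old is not None:
        # The old virtual cluster no longer backs any logical cluster.
        reverse_map.pop(old, None)
    new = (page, write_pointers[page])
    cluster_map[logical] = new
    reverse_map[new] = logical
    write_pointers[page] += size  # pointer advances by the write data size
    return old, new
```

A first write to a logical cluster creates a new mapping, and a later overwrite moves the mapping to the next position on the page rather than updating the data in place.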

Also, in the cache memory write process, when a new write destination virtual page 302 is selected, data will be written to that virtual page 302, so a chunk page 60 is newly allocated to it. The page mapping table 413 is updated accordingly. That is, the correspondence between the new virtual page 302 and the chunk page 60 allocated to it is registered in the page mapping table 413.

FIG. 27 shows the flow of the destage process. The destage process stores data from the cache memory area to the SMR-HDD 51.

The I/O control program 401 selects the dirty data to be destaged (S2701). For dirty data whose storage destination is a sequential zone, the I/O control program 401 selects, for each sequential zone, the dirty data with the smaller storage destination address first. This is because, when the storage destinations of multiple pieces of dirty data are in a sequential zone, those pieces must be written sequentially.

The I/O control program 401 destages the dirty data selected in S2701 to the SMR-HDD 51 (S2702). When the storage destination (destage destination) is a sequential zone, the dirty data selected in S2701 is written in ascending order of storage destination address, starting from the address indicated by the drive write pointer 1504 of that sequential zone. After the destage, the drive write pointer 1504 is updated.
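The ordering constraint of S2701 and S2702 can be illustrated with the following sketch. The function `plan_destage` and its tuple layout are hypothetical. Stopping at the first gap, so that data beyond the current write pointer waits for a later pass, is an assumption of this sketch and is not stated in the embodiment.

```python
from collections import defaultdict


def plan_destage(dirty, write_pointers):
    """Order dirty data for sequential zones and advance the write pointers.

    dirty:          list of (zone, lba, nblocks) tuples
    write_pointers: dict zone -> drive write pointer (updated in place)
    Returns the writes in the order they would be issued.
    """
    by_zone = defaultdict(list)
    for zone, lba, nblocks in dirty:
        by_zone[zone].append((lba, nblocks))

    plan = []
    for zone, items in by_zone.items():
        pointer = write_pointers[zone]
        for lba, nblocks in sorted(items):   # smallest destination address first
            if lba != pointer:
                break                        # assumed: a gap defers the rest
            plan.append((zone, lba, nblocks))
            pointer += nblocks
        write_pointers[zone] = pointer       # drive write pointer after destage
    return plan
```

With a write pointer at LBA 100 and dirty data at LBAs 108, 100, and 124 (8 blocks each), the sketch issues the writes for 100 and 108 and leaves the data at 124 for a later pass.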

Note that when the destage target is parity, the parity is destaged after it has been generated. When the destage target is user data (the data on which the parity is based), the user data may be destaged after parity generation, or parity generation may be performed after the user data is destaged.

FIG. 16 shows the flow of the read process. The read process is started by the I/O control program 401 when the storage controller 110 receives a read request from the host 300.

The I/O control program 401 executes cache control to search for an upper cache slot storing the read data (the data requested by the read request) (S1601). The I/O control program 401 determines whether the read data exists in an upper cache slot (S1602).

If the determination result of S1602 is true (S1602: Y), the I/O control program 401 transmits the read data in the upper cache slot to the host 300 (S1607).

If the determination result of S1602 is false (S1602: N), the I/O control program 401 executes cache control to search for a lower cache slot storing the read data (S1603). The I/O control program 401 determines whether the read data exists in a lower cache slot (S1604). If the determination result of S1604 is true (S1604: Y), the I/O control program 401 copies the read data from the lower cache slot to an upper cache slot (S1606) and transmits the read data in that upper cache slot to the host 300 (S1607). If the determination result of S1604 is false (S1604: N), the I/O control program 401 executes staging control, which secures a lower cache slot and reads the read data from the SMR-HDD 51 into that lower cache slot (S1605). The I/O control program 401 then executes S1606 and S1607.
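The read flow of S1601 to S1607 amounts to a two-level cache lookup, sketched below as an illustration. The function `read` is hypothetical, the upper cache, the lower cache, and the SMR-HDD are modeled as dictionaries, and address translation and decompression are ignored.

```python
def read(key, upper, lower, disk):
    """Return (data, path) for a read request, following S1601 to S1607."""
    if key in upper:                 # S1601/S1602: found in an upper cache slot
        return upper[key], "upper hit"
    if key in lower:                 # S1603/S1604: found in a lower cache slot
        path = "lower hit"
    else:
        lower[key] = disk[key]       # S1605: stage from the SMR-HDD
        path = "staged"
    upper[key] = lower[key]          # S1606: copy lower slot -> upper slot
    return upper[key], path          # S1607: send to the host
```

After a staged read, the same data is served from the upper cache on the next request.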

In this embodiment, at least one of the following parity generation methods (1) to (3) is adopted. Each of the parity generation methods (1) to (3) is described below. In this embodiment, “parity generation” is a data operation on the cache memory area. Accordingly, in the following, the area in a cache stripe column that corresponds to a data stripe in the stripe column is called a “cache data stripe”, and the area in a cache stripe column that corresponds to a parity stripe in the stripe column is called a “cache parity stripe”.

FIG. 17 is a schematic diagram outlining the parity generation method (1).

As the parity generation method (1), the all-stripe write method is adopted, in which parity is generated when all the user data corresponding to all the data stripes 52D in a stripe column is present in the cache stripe column. In other words, the parity generation method (1) is a method in which no read-modify-write occurs, where the read-modify-write method reads user data from one of the data stripes 52D in the stripe column into the cache memory area, updates that user data, and updates the parity. Under the read-modify-write method, the parity is updated whenever user data is updated. If the parity is updated and stored multiple times, random writes can occur to the zone 50 that contains the parity stripe 52P; if that zone 50 is a sequential zone 50S, however, random writes are not possible.

For this reason, the all-stripe write method is adopted as the parity generation method (1). The parity generation method (1) may be adopted in particular when the zone 50 containing the parity stripe 52P is a sequential zone 50S. Thus, for example, if the RAID level of the chunk 303 containing the target stripe column is RAID 5, all the zones 50 containing the parcels 41 that constitute the chunk 303 are sequential zones 50S. Likewise, if the RAID level of the chunk 303 containing the target stripe column is RAID 4, at least one of the zones 50 containing the parcels 41 that constitute the chunk 303 is a sequential zone 50S. In the parity generation method (1), the parity (P) is generated based on all the user data in the cache stripe column.

As described above, the area in the cache memory area that corresponds to one stripe 52 can be called a “cache stripe”. Because the stripe 52 has a fixed size, the cache stripe also has a fixed size. The size of a cache stripe is P times the size of a cache slot 1201 (P is a natural number). When the data unit corresponding to a stripe column (all of its user data) exists in the cache memory area 430, parity is generated in the cache memory area 430 from that data unit, and the parity is written to the SMR-HDD 51 asynchronously. The data unit is written to the SMR-HDD 51 asynchronously with the cache memory write process of FIG. 15. The data stripes 52D and the parity stripe 52P may be partial areas of sequential zones 50S.

FIG. 18 shows the flow of parity generation processing according to the parity generation method (1).

If all the user data has been written to all the cache data stripes 1701D in the cache stripe column (S1801: Y), the I/O control program 401 generates parity in the cache memory area 430 based on all that user data (S1803). That is, the parity created from all the user data is stored in the cache parity stripe 1701P.

Even if at least one piece of user data has not yet been written to the cache data stripes 1701D in the cache stripe column (S1801: N), the I/O control program 401 generates parity in the cache memory area 430 (S1803) when another condition for generating parity is satisfied (S1802: Y). The other condition may be at least one of the following: free slots in the cache memory area 430 are exhausted (for example, the number of free slots has fallen below a threshold); the power supply to the storage system 100 has been cut off and the memory 112 is running on battery; the storage system 100 is being powered off; or one side of the duplexed memory 112 has failed so that dirty data can no longer be kept duplexed. In this case, the I/O control program 401 may write zero data (data whose bits are all 0) to the unwritten area of the data stripes 52D in the stripe column (the area indicated by the drive write pointer 1504), then generate parity from all the user data corresponding to the stripe column (for example, using the ASIC 116), and write the generated parity to the parity stripe 52P. The drive write pointer 1504 and the virtual page write pointer 1402 are updated by the amount of zero data written.
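The flow of S1801 to S1803, including the zero-data padding used when parity must be generated early, can be sketched as follows. The names `xor_blocks` and `full_stripe_parity` and the `force` flag (standing in for the "other condition" of S1802) are hypothetical, and XOR parity is assumed.

```python
from functools import reduce


def xor_blocks(blocks):
    """Byte-wise XOR of equally sized byte strings."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def full_stripe_parity(cache_stripes, stripe_size, force=False):
    """Return the parity for a cache stripe column, or None if it must wait.

    cache_stripes: list of bytes (possibly shorter than stripe_size) or None
                   for cache data stripes that have not been written yet.
    force:         True when another condition for generating parity holds
                   (S1802: Y), e.g. the cache is running out of free slots.
    """
    complete = all(s is not None and len(s) == stripe_size for s in cache_stripes)
    if not complete and not force:
        return None                                   # S1801: N and S1802: N
    # Unwritten areas are filled with zero data before the parity is computed.
    padded = [(s or b"").ljust(stripe_size, b"\x00") for s in cache_stripes]
    return xor_blocks(padded)                         # S1803
```

Because XOR with zero leaves a value unchanged, the parity of a zero-padded column equals the XOR of only the user data that was actually written.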

FIG. 19 is a schematic diagram outlining the parity generation method (2).

In the parity generation method (2), parity may be generated from the user data in the cache stripe column and written to the parity stripe 52P before all the user data corresponding to the stripe column has been written to the cache stripe column; alternatively, parity may be generated from all the user data and written to the parity stripe 52P after all the user data corresponding to the stripe column has been written to the cache stripe column. Parity generation “before all the user data corresponding to the stripe column has been written to the cache stripe column” may be a parity update by the read-modify-write method or a parity update by the all-stripe write method. Note that an unwritten area of a data stripe 52D must be treated as zero data (data whose bit values are all 0).

In the parity generation method (2), either of the following (2-1) and (2-2) can be adopted.
(2-1) The zone 50 containing the parity stripe 52P is a conventional zone 50C. Specifically, for example, the parity generation method (2) can be adopted when all the stripes 52 in a certain parcel 41 are parity stripes 52P, as in RAID 4 (FIG. 6), and a conventional zone 50C is used as the zone 50 containing the parity stripes 52P. Even when not all the stripes 52 constituting a parcel 41 are parity stripes 52P, the parity generation method (2) may be adopted if the zone 50 containing the parity stripe 52P is a conventional zone 50C. The parity generation method (2) can also be adopted at RAID levels other than RAID 4, provided that the RAID level does not distribute parity; RAID 1 is given as an example of such a RAID level.
(2-2) The parity stripe 52P is a stripe 52 based on a PDEV that provides a randomly writable logical area (typically an ordinary HDD or an SSD). The PDEV underlying the parity stripe 52P may be, for example, a flash memory device.

FIG. 20 is a schematic diagram outlining the parity generation method (3).

According to the parity generation method (3), a temporary storage area 2001 used as a temporary parity stripe 52P is prepared. The temporary storage area 2001 is provided, for example, in the cache memory area 430. In this embodiment there is one temporary storage area 2001 per chunk 303 (that is, per RAID group), but there may be one per multiple chunks 303. The temporary storage area 2001 is associated with a PDEV 2002 (for example, an SMR-HDD 51) that provides a randomly writable logical area (for example, a conventional zone 50C). The parity in the temporary storage area 2001 is stored (destaged), as appropriate, in the randomly writable logical area provided by that PDEV 2002. In other words, that randomly writable logical area is associated with the temporary storage area 2001 as the destage destination of the parity in the temporary storage area 2001.

According to the parity generation method (3), before all the user data corresponding to the stripe column has been written to the cache stripe column, parity is generated from the user data in the cache stripe column and stored in the temporary storage area 2001. Then, after all the user data corresponding to the stripe column has been written to the cache stripe column, parity is generated from all that user data and written to the temporary storage area 2001, after which the parity in the temporary storage area 2001 is written to the cache parity stripe 1701P.

In a so-called correction copy or correction read, if both of the following hold, namely that dirty data corresponding to the parity in the cache stripe column does not exist in the cache memory area, and that the parity in the stripe column has not yet been written, the correction copy or correction read can be performed using the parity in the temporary storage area 2001. Whether the parity in the stripe column has been written can be determined from the drive write pointer 1504; specifically, it corresponds to whether the drive write pointer has passed the LBA at the end of the parity stripe. Conversely, if dirty data corresponding to the parity in the cache stripe column exists in the cache memory area, the correction copy or the like may be executed using that dirty data. If no such dirty data exists in the cache memory area but the parity in the stripe column has already been written, the correction copy or the like may be executed using that parity.
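The three cases described for a correction copy or correction read can be summarized in a small decision function. This is only a sketch: `parity_source` and its return labels are hypothetical, and "written" is interpreted as the drive write pointer being greater than the last LBA of the parity stripe.

```python
def parity_source(dirty_parity_in_cache, drive_write_pointer, parity_stripe_end_lba):
    """Pick where the parity for a correction copy or correction read comes from."""
    if dirty_parity_in_cache:
        return "dirty data in cache"          # parity still dirty in the cache
    if drive_write_pointer > parity_stripe_end_lba:
        return "parity stripe on disk"        # parity already written to the drive
    return "temporary storage area"           # neither: use temporary storage area 2001
```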

In the parity generation method (3), any of the following (3-1) to (3-3) can be adopted.
(3-1) The destage destination area corresponding to the temporary storage area 2001 is an area in a conventional zone 50C of an SMR-HDD 51.
(3-2) The destage destination PDEV 2002 corresponding to the temporary storage area 2001 is a PDEV other than an SMR-HDD 51. For example, a flash memory device is adopted as such a PDEV.
(3-3) No destage destination PDEV corresponding to the temporary storage area 2001 exists.

FIG. 21 shows the configuration of the temporary storage area management table 422.

The temporary storage area management table 422 holds a chunk number 2101, disk position information 2102, and cache position information 2103 for each temporary storage area 2001. The chunk number 2101 is the number of the chunk 303 corresponding to the temporary storage area 2001. The disk position information 2102 indicates the position of the area, in the PDEV corresponding to the temporary storage area 2001, that corresponds to the temporary storage area 2001. The disk position information 2102 is, for example, a combination of a parcel number and an address within the parcel 41. The cache position information 2103 indicates the position of the area in the cache memory area 430 that corresponds to the temporary storage area 2001. The cache position information 2103 is, for example, a cache address.

In the parity generation method (3), the read-modify-write method and the all-stripe write method are adopted selectively. The flow of parity generation processing according to the parity generation method (3) is described below, but first the read-modify-write method and the all-stripe write method are explained. In that explanation, user data is written as “D” and parity as “P”.

For example, assume that old D1, old D2, old D3, and old P are stored in a stripe column, where P = (D1) XOR (D2) XOR (D3).

When only old D1 is updated to new D1, the read-modify-write method can be adopted. In the read-modify-write method, new P is obtained by the following formula.
New P = (old D1) XOR (new D1) XOR (old P)

In the all-stripe write method, on the other hand, new P is generated using D1 (new D1 or old D1), D2 (new D2 or old D2), and D3 (new D3 or old D3), as in the following examples. For example, when old D1, old D2, and old D3 are updated to new D1, new D2, and new D3, respectively, the all-stripe write method can be adopted. In that case, new P is obtained by the following formula.
New P = (new D1) XOR (new D2) XOR (new D3)

Also, for example, the all-stripe write method can be adopted even when only old D3 is updated to new D3. In that case, new P is obtained by the following formula.
New P = (old D1) XOR (old D2) XOR (new D3)
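The formulas above can be checked numerically. The following sketch uses arbitrary two-byte values chosen only for illustration and a hypothetical helper `xor`; it confirms that, when only D1 is updated, the read-modify-write formula and the all-stripe write formula yield the same new P.

```python
def xor(*blocks):
    """Byte-wise XOR of equally sized byte strings."""
    out = bytearray(len(blocks[0]))
    for block in blocks:
        for i, value in enumerate(block):
            out[i] ^= value
    return bytes(out)


# Arbitrary sample contents of the stripe column (illustrative values only).
old_d1, old_d2, old_d3 = b"\x11\x22", b"\x33\x44", b"\x55\x66"
old_p = xor(old_d1, old_d2, old_d3)      # P = D1 XOR D2 XOR D3

new_d1 = b"\xa0\x0b"                     # only D1 is updated

p_rmw = xor(old_d1, new_d1, old_p)       # read-modify-write formula
p_asw = xor(new_d1, old_d2, old_d3)      # all-stripe write formula

assert p_rmw == p_asw
```

The two results agree because XORing old D1 into old P cancels its contribution, leaving (old D2) XOR (old D3), to which new D1 is then added.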

FIG. 22 shows the flow of parity generation processing according to the parity generation method (3). In the following description, the read-modify-write method is written as “RMW” and the all-stripe write method as “ASW”. It is also assumed that D1, D2, D3, and P are stored in the target stripe column.

If the temporary storage area 2001 has not been initialized, the I/O control program 401 initializes it (S2201). Specifically, for example, the I/O control program 401 stores in the temporary storage area 2001 the parity (initial value) obtained by assuming that the data in all the data stripes 52D in the stripe column is zero data.

The I/O control program 401 refers to the virtual page write pointer management table 418 and determines whether the write pointer of the write destination virtual page 302 has passed the end of the stripe column (S2202).

If the determination result of S2202 is true (S2202: Y), the I/O control program 401 decides whether to select RMW or ASW (S2203). In S2203, the method that minimizes the number of disk accesses (the number of accesses to the SMR-HDD 51) is selected based on the number of pieces of user data present in the cache memory area 430, because the number of disk accesses differs depending on whether RMW or ASW is used. For example, if new D1, new D2, and new D3 all exist in the cache memory area 430, ASW is preferably selected, because the parity can then be generated without any disk access.

If RMW is selected in S2203 (S2203: RMW), the I/O control program 401 generates a parity in accordance with RMW, based on the parity in the temporary storage area 2001, and stores the generated parity in the cache parity stripe 1701P (S2204). If ASW is selected in S2203 (S2203: ASW), the I/O control program 401 generates a parity in accordance with ASW, based on all the user data in the cache stripe column, and stores the generated parity in the cache parity stripe 1701P (S2205).

Even if the determination result in S2202 is false (S2202: N), the I/O control program 401 decides whether to select RMW or ASW (S2206). In S2206, as in S2203, the method that minimizes the number of disk accesses is selected based on the number of pieces of user data present in the cache memory area 430.

If RMW is selected in S2206 (S2206: RMW), the I/O control program 401 generates a parity in accordance with RMW, based on the parity in the temporary storage area 2001, and stores the generated parity in the temporary storage area 2001 (S2207). If ASW is selected in S2206 (S2206: ASW), the I/O control program 401 generates a parity in accordance with ASW, based on the user data in the cache stripe column, and stores the generated parity in the temporary storage area 2001 (S2208).
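The flow of S2201 to S2208 can be sketched as follows. This is a simplified illustration that assumes a single XOR parity (as in RAID 5) and a tiny stripe size; the function names, the dictionary used for the temporary storage area, and the returned destination labels are all hypothetical and do not appear in the specification.

```python
STRIPE = 4  # stripe size in bytes, kept tiny for illustration


def xor_blocks(*blocks):
    """Byte-wise XOR of equally sized blocks."""
    out = bytearray(STRIPE)
    for block in blocks:
        for i, byte in enumerate(block):
            out[i] ^= byte
    return bytes(out)


def generate_parity(temp_area, past_end, method,
                    cached_data=None, old_data=None, new_data=None):
    """One pass of the FIG. 22 flow (sketch, XOR parity assumed).

    temp_area : dict standing in for the temporary storage area 2001
    past_end  : result of the S2202 check on the write pointer
    method    : "RMW" or "ASW", as decided in S2203 / S2206
    """
    # S2201: initialize with the parity of an all-zero stripe column.
    if temp_area.get("parity") is None:
        temp_area["parity"] = bytes(STRIPE)

    if method == "RMW":
        # Old parity comes from the temporary storage area.
        parity = xor_blocks(temp_area["parity"], old_data, new_data)
    else:
        # ASW: parity from the user data in the cache stripe column.
        parity = xor_blocks(*cached_data)

    if past_end:
        # S2204 / S2205: the stripe column is complete.
        return "cache_parity_stripe", parity
    # S2207 / S2208: keep the intermediate parity for later updates.
    temp_area["parity"] = parity
    return "temporary_storage_area", parity
```

Under these assumptions, writing D1, D2, and D3 one at a time with RMW (old data taken as zero) ends with the same parity that ASW produces from all three stripes at once.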

FIG. 23 shows the flow of the usable capacity reporting process. The area management program 402 starts this process when the storage controller 110 receives a capacity inquiry from the host 300.

The area management program 402 calculates the total size of the data stripes 52D (S2301) and reports the calculated total size to the host 300 as the usable capacity (S2302). In other words, the usable capacity does not include the capacity of the unused areas 53 in the zones 50. The calculation in S2301 may be the calculation of (Equation 1) described above.
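As a rough illustration of S2301 and S2302, the sketch below sums the data stripe capacity per chunk and separately counts the unused areas that are excluded from the report. It does not reproduce (Equation 1); it assumes that each stripe column takes one stripe from each parcel of the chunk, so that the data capacity of a chunk is the parcel size multiplied by the number of data stripes per stripe column. The `Chunk` type and its field names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    zone_size: int                 # size of each zone backing a parcel
    parcel_size: int               # size of each parcel (<= zone_size)
    parcels: int                   # parcels per chunk (data + parity)
    data_stripes_per_column: int   # e.g. 3 for a 3D+1P layout


def usable_capacity(chunks):
    """Total size of all data stripes (parity and unused areas excluded)."""
    return sum(c.parcel_size * c.data_stripes_per_column for c in chunks)


def unused_capacity(chunks):
    """Total size of the zone remainders that are not part of any parcel."""
    return sum((c.zone_size - c.parcel_size) * c.parcels for c in chunks)
```

For one 3D+1P chunk with `zone_size=256` and `parcel_size=240`, the raw zone capacity is 1024, of which 720 is reported as usable, 240 holds parity, and 64 is unused.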

Embodiment 2 is described below. The description focuses on the differences from Embodiment 1; points in common with Embodiment 1 are omitted or simplified.

FIG. 24 shows an example of a storage hierarchy according to Embodiment 2.

Embodiment 2 does not employ distributed RAID. A RAID group composed of a plurality of SMR-HDDs 51 is employed as the RAID group.

Embodiment 3 is described below. The description focuses on the differences from Embodiments 1 and 2; points in common with Embodiments 1 and 2 are omitted or simplified.

FIG. 25 shows an example of address mapping according to Embodiment 3.

In Embodiment 3, no virtual volume 141 exists below the logical volume 151. A lower cluster 2501, which is a partial area of a chunk page 60, is allocated to a logical cluster 701. In Embodiment 3, Log-Structured storage is performed on the chunk page 60. All parcels constituting a chunk that includes a chunk page 60 on which Log-Structured storage is performed may be parcels included in sequential zones.

The correspondence between logical clusters 701 and lower clusters 2501 is managed by a cluster mapping table (in which the virtual cluster identification information 1002 is replaced with lower cluster identification information). The size of a lower cluster 2501 may be an integer multiple of a block (for example, 512 B), and may be variable.
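The mapping between logical clusters and variable-size lower clusters under Log-Structured storage can be sketched as an append-only page with a lookup table. The class below is an illustrative assumption, not the cluster mapping table of the specification: the class name, the block-based units, and the `(offset, length)` representation of a lower cluster are hypothetical.

```python
class ChunkPageLog:
    """Append-only chunk page with a logical-to-lower cluster map (sketch)."""

    def __init__(self, page_size_blocks):
        self.page_size = page_size_blocks
        self.write_pointer = 0   # next block to be written in the page
        self.mapping = {}        # logical cluster id -> (offset, length)

    def write(self, logical_cluster, length_blocks):
        """Append a lower cluster and remap the logical cluster to it."""
        if length_blocks <= 0:
            raise ValueError("length must be a positive number of blocks")
        if self.write_pointer + length_blocks > self.page_size:
            raise RuntimeError("chunk page is full")
        lower_cluster = (self.write_pointer, length_blocks)
        # An overwrite never updates in place: the logical cluster is
        # remapped to the newly appended lower cluster.
        self.mapping[logical_cluster] = lower_cluster
        self.write_pointer += length_blocks
        return lower_cluster
```

Because every write lands at the write pointer, the page is only ever written sequentially, which is what allows all of its parcels to reside in sequential zones. Space held by superseded lower clusters is not reclaimed in this sketch.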

In Embodiment 3, the RAID group may be a RAID group according to the distributed RAID described with reference to FIG. 3 (that is, a RAID group composed of a plurality of parcels 41 residing in different SMR-HDDs 51), or may be the normal RAID group described with reference to FIG. 24 (a RAID group composed of a plurality of HDDs).

Embodiments 1 to 3 have been described above and can be summarized as follows. The summary below may include matters that do not appear in the above description and, conversely, may omit matters that do appear in it.

<Summary>

To increase the storage capacity of a storage system, SMR-HDDs may be adopted in place of normal HDDs. In an SMR-HDD, contiguous logical storage areas called zones are defined. A storage area that includes at least part of a zone is allocated to a logical volume provided by the storage system. At least one of the following problems therefore arises.
(Problem 1) The relationship between a zone and an area belonging to a storage tier above the zone must be determined.
(Problem 2) A storage system generally adopts a RAID configuration. Some zones are sequential zones, but for a RAID level (RAID configuration) that produces parity, the set of user data and parity corresponding to one stripe column is not necessarily written sequentially.
(Problem 3) If the storage system has a capacity expansion function such as Thin Provisioning, the write destination in an allocated page may lie beyond the write pointer of the sequential zone on which the page is based.

Problems 1 to 3 are solved, for example, as follows. In the embodiments, the plurality of PDEVs of the storage system 100 are typically a plurality of SMR-HDDs 51, but they may include two or more PDEVs other than SMR-HDDs 51, for example, PDEVs that provide a randomly writable logical area, such as normal HDDs or SSDs.

<Solution to Problem 1>
The storage controller 110 determines the size of a parcel 41, which is provided as an area including stripes 52, based on the size of the zone 50 that includes the parcel 41, the size of a chunk page 60 (a logical area that is associated with a virtual page 302 or a logical page 301 and includes stripes 52), and the number of data stripes (the number of data stripes 52D in one stripe column) according to the RAID level of the chunk (RAID group) that includes the parcel 41. If the parcel size is smaller than the zone size, the difference between the zone 50 and the parcel 41 becomes an unused area 53 (an area in which neither user data nor parity is stored). Specifically, the parcel size is determined according to (Equation 2) described above.
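One way to read this alignment is sketched below. It is not a reproduction of (Equation 2): it assumes that each chunk page is spread evenly over the data stripes of a stripe column, so that one chunk page occupies `chunk_page_size / data_stripes` in each parcel, and that the parcel is the largest whole multiple of that share that fits in the zone. The function name and the integer units are hypothetical.

```python
def parcel_size(zone_size, chunk_page_size, data_stripes):
    """Return (parcel_size, unused_size) for one zone (illustrative only).

    Assumption: a chunk page contributes chunk_page_size / data_stripes
    to each parcel, and a parcel holds a whole number of such shares.
    """
    if chunk_page_size % data_stripes != 0:
        raise ValueError("chunk page must divide evenly over data stripes")
    share = chunk_page_size // data_stripes      # per-parcel part of a page
    pages_per_parcel = zone_size // share        # whole pages that fit
    size = pages_per_parcel * share
    return size, zone_size - size
```

For example, with `zone_size=256`, `chunk_page_size=42`, and `data_stripes=3`, each page takes 14 units per parcel, 18 pages fit, the parcel size is 252, and the remaining 4 units form the unused area.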

<Solution to Problem 2>
(First solution) For a stripe column associated with a parity stripe 52P included in a sequential zone 50S, the storage controller 110 performs writing according to the all-stripe-write method.
(Second solution) The storage controller 110 adopts a conventional zone 50C as the zone 50 that includes the parity stripe 52P.
(Third solution) For a stripe column that includes a parity stripe 52P included in a sequential zone 50S, the storage controller 110 stores (saves) the parity in the temporary storage area 2001 until all the user data in the cache stripe column has been updated. That is, when part of the old user data of the stripe column is updated to new user data, the storage controller 110 generates a new parity using the old user data, the new user data, and the old parity (the parity in the temporary storage area 2001), and stores the new parity in the temporary storage area 2001. When all the user data in the cache stripe column has become new user data, the storage controller 110 stores the new parity based on all the new user data in the parity stripe 52P. The temporary storage area 2001 may be provided in the cache memory area 430. The destage destination of the temporary storage area 2001 is a PDEV that provides a randomly writable logical area. A "randomly writable logical area" is, for example, a conventional zone 50C in an SMR-HDD 51, or a logical area provided by a randomly writable PDEV such as an SSD or a normal HDD.
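The third solution relies on the incremental parity being equal to the full-stripe parity once every data stripe has been written. Assuming a single XOR parity as in RAID 5 (the symbols below are introduced here for illustration), the read-modify-write update and its result for a stripe column holding D1, D2, and D3 can be written as:

```latex
% Read-modify-write update of the parity held in the temporary storage area
P_{\mathrm{new}} = P_{\mathrm{old}} \oplus D_{\mathrm{old}} \oplus D_{\mathrm{new}}

% Starting from the all-zero initial value and writing D_1, D_2, D_3 in turn
% (the old data of an unwritten stripe is taken to be zero):
P_0 = 0, \qquad
P_1 = P_0 \oplus 0 \oplus D_1 = D_1, \qquad
P_2 = P_1 \oplus 0 \oplus D_2 = D_1 \oplus D_2, \qquad
P_3 = P_2 \oplus 0 \oplus D_3 = D_1 \oplus D_2 \oplus D_3
```

The final value is the same parity that the all-stripe-write method computes directly from D1, D2, and D3, so the parity stripe 52P needs to be written only once, after the stripe column is complete.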

<Solution to Problem 3>
As the virtual page 302 to be allocated to the logical volume 151, the storage controller 110 selects a virtual page 302 whose write pointer points to the first LBA of that virtual page 302, and allocates the selected virtual page 302 to the logical volume 151. If write data can be written to at least one partially written virtual page 302 (a virtual page 302 whose write pointer does not point to the first LBA), the storage controller 110 appends the write data to that partially written virtual page 302. In other words, as long as write data can be written to at least one partially written virtual page 302, the storage controller 110 refrains from allocating a new virtual page 302 to the logical volume 151. In an embodiment in which no virtual volume 141 exists, "virtual page 302" can be read as "chunk page 60", and a chunk page write pointer may be managed for each chunk page 60.
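The allocation policy described above can be sketched as follows. The data structure and function are hypothetical illustrations, not the tables or programs of the specification: a page is modeled only by its size and write pointer, and "can be written" is taken to mean that the write fits between the write pointer and the end of the page.

```python
from dataclasses import dataclass


@dataclass
class VirtualPage:
    page_id: int
    size: int
    write_pointer: int = 0   # 0 means the pointer is at the first LBA


def select_write_page(allocated, free, write_len):
    """Choose the page to append to, allocating a new one only if needed."""
    # Prefer an already allocated page that still has room.
    for page in allocated:
        if page.write_pointer + write_len <= page.size:
            return page
    # Otherwise allocate a page whose write pointer is at its first LBA.
    for page in free:
        if page.write_pointer == 0 and write_len <= page.size:
            free.remove(page)
            allocated.append(page)
            return page
    raise RuntimeError("no virtual page can accept the write")
```

The caller is expected to advance `write_pointer` by the written length after the append. Because a new page is allocated only when no allocated page can take the write, writes to a page always start at its write pointer and never beyond it.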

Several embodiments have been described above, but they are examples given to explain the present invention and are not intended to limit the scope of the present invention to these embodiments. The present invention can also be implemented in various other forms.

100: Storage system

Claims

A storage system connected to a host system, comprising:
a plurality of storage devices including a plurality of zone-providing storage devices; and
a storage controller that is connected to the plurality of storage devices, provides a logical volume to the host system, and allocates, directly or indirectly to the logical volume, at least one chunk page among a plurality of chunk pages based on one or more chunks,
wherein each of the plurality of zone-providing storage devices provides a plurality of zones,
each of the plurality of zones is a contiguous logical area predefined in the zone-providing storage device that provides the zone, and
when configuring a first chunk, which is at least one of the one or more chunks, using two or more zones provided by two or more zone-providing storage devices, the storage controller sets, for each of the two or more zones, an area aligned with two or more chunk pages based on the first chunk as a parcel that is a component of the chunk.

For each of the two or more zones, if there is an area that is not aligned with the two or more chunk pages based on the first chunk, the storage controller sets the unaligned area as an unused area that is not a component of the chunk,
The storage system according to claim 1.

Each of the one or more chunks is composed of a plurality of parcels constituting a plurality of stripe columns,
Each of the plurality of parcels is composed of a plurality of stripes,
Each of the plurality of stripe rows is composed of two or more stripes respectively included in two or more parcels in a chunk constituting the stripe row,
The stripes include at least a data stripe, out of a data stripe (a stripe in which data is stored) and a parity stripe (a stripe in which parity is stored),
Each of the plurality of chunk pages corresponds to a plurality of data stripes and does not correspond to any parity stripe;
The storage system according to claim 2.

For the first chunk, the storage controller determines the size of the parcels in the first chunk based on the size of the zone, the size of the chunk page, and the number of data stripes included in one stripe column according to the RAID level of the first chunk,
The storage system according to claim 3.

The storage controller reports available capacity;
The usable capacity is a capacity based on a parcel size for each of the one or more chunks.
The storage system according to claim 1.

The storage controller has a cache memory area;
After all the data corresponding to a stripe column that includes a write-destination parity stripe has been written to the area of the cache memory area that corresponds to that stripe column, the storage controller generates a parity and stores the parity in the write-destination parity stripe;
The zone including the parcel including the write destination parity stripe is a sequential zone that is a zone where random writing is impossible and sequential writing is possible.
The storage system according to claim 1.

If a predetermined parity generation condition is satisfied before all the data corresponding to the stripe column including the write-destination parity stripe has been written to all the data stripes in the stripe column, the storage controller assumes that zero data is written to each unwritten data stripe, generates a parity based on all the data corresponding to the stripe column, and stores the parity in the write-destination parity stripe,
The storage system according to claim 6.

The storage controller generates a parity for the stripe column that includes the write-destination parity stripe, before or after the data of all the data stripes constituting the stripe column is written, and stores the parity in the write-destination parity stripe;
The zone including the parcel including the write destination parity stripe is a conventional zone in which random writing is possible.
The storage system according to claim 1.

The storage controller has a temporary storage area;
The storage controller:
before the data of all the data stripes constituting the stripe column including the write-destination parity stripe is written, updates the parity for the stripe column based on the parity in the temporary storage area and stores the updated parity in the temporary storage area; and
after the data of all the data stripes constituting the stripe column including the write-destination parity stripe is written, stores the parity for the stripe column in the write-destination parity stripe,
The zone including the parcel including the write destination parity stripe is a sequential zone that is a zone where random writing is impossible and sequential writing is possible.
The storage system according to claim 1.

Each of the two or more zones each including two or more parcels constituting the first chunk is a sequential zone that is a zone where random writing is impossible and sequential writing is possible.
The storage system according to claim 1.

The storage controller manages virtual volumes under the logical volume;
The virtual volume is composed of a plurality of virtual pages to which a chunk page can be allocated,
Each of the plurality of virtual pages employs Log-Structured storage,
The storage controller allocates, to the logical volume, a virtual page for which the position indicated by its virtual page write pointer matches the head of the virtual page,
The storage system according to claim 1.

The storage controller has a cache memory area composed of a plurality of cache slots;
The cache management executed by the storage controller includes higher-level cache management, which manages the correspondence between areas in the logical volume and slots in the cache memory area, and lower-level cache management, which manages the correspondence between areas in the virtual volume and slots in the cache memory area;
The storage controller executes data copy between a cache slot according to the higher-level cache management and a cache slot according to the lower-level cache management for data input / output with respect to the logical volume.
The storage system according to claim 11.

Each of two or more chunk pages based on the first chunk employs Log-Structured storage,
The storage controller allocates, to the logical volume, a chunk page in which the position pointed to by the chunk page write pointer matches the beginning of the two or more chunk pages based on the first chunk.
The storage system according to claim 1.

Each of the two or more zone providing storage devices is SMR (Shingled Magnetic Recording) -HDD (Hard Disk Drive).
The storage system according to claim 1.

A storage control method comprising:
when configuring a first chunk, which is at least one of one or more chunks, using two or more zones provided by two or more zone-providing storage devices, setting, for each of the two or more zones, an area aligned with two or more chunk pages based on the first chunk as a parcel that is a component of the chunk, each of the two or more zones being a contiguous logical area predefined in the zone-providing storage device that provides the zone; and
allocating at least one of the two or more chunk pages based on the first chunk directly or indirectly to a logical volume.