
WO2020044407A1 - Learning device, learning method, and learning program - Google Patents

Learning device, learning method, and learning program

Info

Publication number
WO2020044407A1
WO2020044407A1, PCT/JP2018/031585, JP2018031585W
Authority
WO
WIPO (PCT)
Prior art keywords
unit
learning
weight
inference
storage unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2018/031585
Other languages
French (fr)
Japanese (ja)
Inventor
誠也 柴田
芙美代 鷹野
竹中 崇
浩明 井上
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to PCT/JP2018/031585 priority Critical patent/WO2020044407A1/en
Publication of WO2020044407A1 publication Critical patent/WO2020044407A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/09Supervised learning
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0464Convolutional networks [CNN, ConvNet]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Definitions

  • the present invention relates to a learning device, a learning method, and a learning program.
  • the learning data is data used for learning the discriminant model.
  • machine learning for example, parameters of arithmetic expressions and discriminants used in a predetermined learning device are adjusted based on the relationship between input and output indicated by the learning data.
  • the learning device is, for example, a discrimination model that performs discrimination regarding one or a plurality of labels when data is input.
  • Non-Patent Document 1 describes an example of a learning computation circuit and a learning method for efficiently executing deep learning of a neural network, particularly with low power consumption.
  • Non-Patent Document 2 describes an example of a learning method that shortens the learning time in deep learning with a CNN (Convolutional Neural Network) by dividing the plurality of convolutional layers into layers whose weights are fixed and layers whose weights are updated (extended function layers), thereby restricting the learning range.
  • Non-Patent Document 3 describes an optimization example of an accelerator design based on an FPGA (Field-Programmable Gate Array) as an example of a circuit configuration for learning operations in machine learning.
  • FIG. 10 is an explanatory diagram showing an example of a general learning method and a circuit configuration for learning in a neural network including one or more intermediate layers between an input layer and an output layer.
  • the large-scale learning circuit 70 learns the entire neural network as a predetermined discriminant model in order to support a learning algorithm for general use.
  • the speech balloon attached to the large-scale learning circuit 70 shown in FIG. 10 schematically describes the processing direction and the processing range in the learning process of the neural network.
  • a unit 71 corresponding to a neuron in the neural network is represented by an ellipse.
  • the line segment 72 (the line connecting the units 71 shown in FIG. 10) represents the connection between the units 71. Further, an arrow 73 (a rightward thick line arrow shown in FIG. 10) indicates an inference process and a range of the inference process. An arrow 74 (a thick left arrow shown in FIG. 10) indicates a parameter update process and a range of the parameter update process.
  • the parameter update process is an example of a learning process.
  • FIG. 10 shows an example of a feedforward neural network in which the input to each unit 71 is the output of the unit 71 in the preceding layer.
  • the input to each unit 71 may include the output of the unit 71 in the previous layer at the previous time, as in a recurrent neural network.
  • even when the input to each unit 71 includes the output of the unit 71 in the preceding layer at the previous time, the direction of the inference processing is still regarded as the direction from the input layer to the output layer (forward direction).
  • the input to each unit 71 is not limited to the above example.
  • the inference processing performed in a predetermined order from the input layer is also called “forward propagation”.
  • the direction of the parameter update processing is not particularly limited.
  • the direction of the parameter update process may be a direction (reverse direction) from the output layer to the input layer.
  • the parameter updating process shown in FIG. 10 is an example of a process executed by the back propagation method.
  • the parameter update processing is not limited to the processing executed by the back propagation method.
  • the parameter update processing may be executed by STDP (Spike Timing Dependent Plasticity).
  • model learning methods in deep learning include the following learning methods. First, after inputting learning data to the input layer, an inference process of calculating the output of each unit 71 in the forward direction in each layer up to the output layer is performed (forward propagation: see the arrow 73 shown in FIG. 10).
  • Next, based on an error calculated from the output of the output layer (final output) and the input-output relationship indicated by the learning data, a parameter update process is performed to update the parameters used to calculate the output of each unit 71 in each layer (back propagation: see arrow 74 shown in FIG. 10). As shown in FIG. 10, the parameter update processing is performed by following each layer from the output layer to the first layer in the reverse direction. The parameter updating process is performed so that the calculated error is minimized.
  • when the entire model is the learning target, the parameter updating process updates the parameters used to calculate the output of each unit 71 in all the layers (first to n-th layers) subsequent to the input layer.
  • the parameter to be updated is, for example, the weight of the connection between the units 71 that connects each unit 71 in the layer and the unit 71 in another layer.
  • FIG. 10 shows a large-scale learning circuit 70 that performs the above-described inference processing and parameter updating processing with high calculation accuracy as an example of realizing an arithmetic circuit that performs learning.
  • FIG. 11 is an explanatory diagram showing an example of input / output of the unit 71 and connection with another unit 71 when focusing on one unit 71.
  • FIG. 11A shows an example of input / output of one unit 71.
  • FIG. 11B shows an example of a connection between the units 71 arranged in two layers.
  • f () in Expression (1A) represents an activation function.
  • a in the formula (1B) represents an intercept.
  • w1 to w4 in Equation (1B) represent parameters such as weights corresponding to the respective inputs (x1 to x4).
  • the intercept a in the equation (2B) can be regarded as a coefficient of a constant term having a value of 1 (that is, one of the parameters).
  • the intercept a is omitted.
  • k in the expression (2C) represents an input to each unit 71 in the layer, more specifically, an identifier of another unit 71 that performs the input.
  • L in Equation (2D) represents a layer identifier.
  • wi,k in Expression (2D) represents a parameter of each unit i in the L-th layer. More specifically, wi,k corresponds to the weight of the connection between each unit i and another unit k (the connection between units 71).
  • the calculation in which a certain unit 71 calculates the output z from the input x corresponds to the inference processing in the unit 71.
  • parameters for example, weight w
  • the inference process is, for example, a process performed by a monitoring system in operation, or the like, to determine whether an object in an image is a specific object.
  • the calculation for obtaining the parameters of the unit 71 corresponds to the parameter updating process in the unit 71.
  • FIG. 12 shows an example of an inference apparatus that performs inference processing.
  • FIG. 12 is a block diagram illustrating a configuration example of a general inference apparatus.
  • the inference device 80 illustrated in FIG. 12 includes a weight memory 81, a weight load unit 82, and a calculation unit 83.
  • the weight memory 81 has a function of storing the weight matrix W.
  • the weight loading unit 82 has a function of loading the weight matrix W stored in the weight memory 81 from the weight memory 81.
  • the weight loading unit 82 inputs the loaded weight matrix W to the calculation unit 83.
  • the calculation unit 83 has a function of performing the above inference processing using the input weight matrix W.
  • FIG. 13 shows an example of a learning device that performs a parameter updating process.
  • FIG. 13 is a block diagram illustrating a configuration example of a general learning device.
  • the learning device 90 illustrated in FIG. 13 includes a weight memory 91, a weight load unit 92, a calculation unit 93, and a weight storage unit 94.
  • the weight memory 91 has a function of storing the weight matrix W.
  • the weight loading unit 92 has a function of loading the weight matrix W stored in the weight memory 91 from the weight memory 91.
  • the weight load unit 92 inputs the loaded weight matrix W to the calculation unit 93.
  • the calculation unit 93 has a function of performing the above-described parameter update processing using the input weight matrix W.
  • the operation unit 93 inputs the weight matrix W updated in the parameter update process to the weight storage unit 94.
  • the weight storage unit 94 has a function of writing the weight matrix W updated by the calculation unit 93 into the weight memory 91.
  • the weight storage unit 94 updates the weight matrix W stored in the weight memory 91 to the input weight matrix W.
  • the weight storage unit 94 may have a function of temporarily storing the weight matrix W.
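  • For illustration only, the conventional single-memory arrangement of FIG. 12 and FIG. 13 can be sketched in software as follows; the class, the ReLU activation, and the gradient callback are assumptions made for this sketch, not the circuits themselves.

```python
import numpy as np

class SingleMemoryDevice:
    """Sketch of the conventional devices of FIGS. 12 and 13:
    one weight memory shared by inference and parameter update."""

    def __init__(self, weight_matrix):
        self.weight_memory = weight_matrix           # weight memory 81 / 91

    def infer(self, x):
        w = self.weight_memory                       # weight load unit 82 / 92
        return np.maximum(0.0, w @ x)                # calculation unit 83 (activation assumed)

    def learn(self, x, grad_fn, lr=0.01):
        w = self.weight_memory                       # weight load unit 92
        w = w - lr * grad_fn(w, x)                   # calculation unit 93: parameter update
        self.weight_memory = w                       # weight storage unit 94
        # With a single weight memory, inference has to wait until this write
        # finishes, so inference and learning cannot run in parallel.
```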
  • Non-Patent Documents 1 to 3 do not describe a method of executing inference processing and learning processing in parallel.
  • an object of the present invention is to provide a learning device, a learning method, and a learning program that can solve the above-described problem and can execute inference processing and learning processing in parallel.
  • the learning apparatus according to the present invention includes a first storage unit that stores the parameters of each unit used in an inference process of calculating, in a predetermined order, an output for discrimination data of each unit of a discrimination model in which a plurality of layers each composed of one or more units are combined in a layered manner, and a second storage unit that stores parameters to be updated in a learning process of updating at least a part of the parameters of each unit based on the output of each unit for learning data.
  • the learning method according to the present invention stores, in a first storage unit, the parameters of each unit used in an inference process of calculating, in a predetermined order, an output for discrimination data of each unit of a discrimination model in which a plurality of layers each composed of one or more units are combined in a layered manner, and stores, in a second storage unit, the parameters to be updated in a learning process of updating at least a part of the parameters of each unit based on the output of each unit for learning data.
  • the learning program according to the present invention causes a computer to execute a first storage process of storing, in a first storage unit, the parameters of each unit used in an inference process of calculating, in a predetermined order, an output for discrimination data of each unit of a discrimination model in which a plurality of layers each composed of one or more units are combined in a layered manner, and a second storage process of storing, in a second storage unit, the parameters to be updated in a learning process of updating at least a part of the parameters of each unit based on the output of each unit for learning data.
  • the inference processing and the learning processing can be executed in parallel.
  • FIG. 1 is a block diagram illustrating a configuration example of a first embodiment of a learning device according to the present invention.
  • FIG. 2 is a block diagram illustrating a configuration example of the calculation unit 140.
  • FIG. 3 is a block diagram showing another configuration example of the first embodiment of the learning device according to the present invention.
  • FIG. 4 is a flowchart showing the operation of the inference process and the learning process performed by the learning device 100 of the first embodiment.
  • FIG. 5 is a block diagram illustrating a configuration example of a second embodiment of the learning device according to the present invention.
  • FIG. 6 is a block diagram showing another configuration example of the second embodiment of the learning device according to the present invention.
  • FIG. 7 is a flowchart showing the operation of the inference process and the learning process performed by the learning device 200 of the second embodiment.
  • FIG. 8 is an explanatory diagram showing an example of a hardware configuration of the learning device according to the present invention.
  • FIG. 9 is a block diagram showing the outline of the learning device according to the present invention.
  • FIG. 10 is an explanatory diagram showing an example of a general learning method and a circuit configuration for learning in a neural network including one or more intermediate layers between an input layer and an output layer.
  • FIG. 11 is an explanatory diagram showing an example of input/output of the unit 71 and coupling with another unit 71 when focusing on one unit 71.
  • FIG. 12 is a block diagram showing a configuration example of a general inference device.
  • FIG. 13 is a block diagram showing a configuration example of a general learning device.
  • FIG. 1 is a block diagram illustrating a configuration example of a first embodiment of a learning device according to the present invention.
  • the learning device 100 includes an inference weight memory 110, an updating weight memory 120, a weight load selection unit 130, a calculation unit 140, a weight storage unit 150, a weight update control unit 160, a weight copy unit 170, and a control unit 180.
  • the learning device 100 may have a plurality of weight memories as a solution.
  • the function of each component of the learning device 100 that performs the inference process and the learning process in parallel will be described.
  • the inference weight memory 110 has a function of storing a weight matrix W (parameter group) used for inference processing.
  • the updating weight memory 120 has a function of storing the weight matrix W used for the learning process.
  • the weight load selection unit 130 has a function of loading the weight matrix W from one of the inference weight memory 110 and the updating weight memory 120. When the inference processing is performed, the weight load selection unit 130 loads the weight matrix W from the inference weight memory 110.
  • the weight load selecting unit 130 loads the weight matrix W from the updating weight memory 120.
  • the weight load selection unit 130 inputs the loaded weight matrix W to the calculation unit 140.
  • the operation unit 140 has a function of performing the above inference processing using the weight matrix W loaded from the inference weight memory 110.
  • the arithmetic unit 140 has a function of performing the above-described learning process using the weight matrix W loaded from the updating weight memory 120. That is, since different weight matrices W are used, the arithmetic unit 140 can execute inference processing and learning processing in parallel.
  • the arithmetic unit 140 executes the inference process of calculating, in a predetermined order, the output for discrimination data of each unit 71 of the discrimination model in which a plurality of layers each composed of one or more units 71 are combined in a layered manner.
  • the inference weight memory 110 stores the weight (weight matrix W) of each unit 71 used in the inference processing.
  • the weight of each unit 71 is a parameter of each unit 71 in the present embodiment.
  • the discrimination model is, for example, a neural network.
  • the arithmetic unit 140 executes a learning process of updating at least a part of the weight of each unit 71 based on the output of each unit 71 for the learning data.
  • the updating weight memory 120 stores the update target weight (weight matrix W) in the learning process.
  • the weight load selector 130 loads the weight from the inference weight memory 110 or the updating weight memory 120.
  • the operation unit 140 performs inference processing or learning processing using the loaded weights.
  • the operation unit 140 inputs the weight matrix W updated in the learning process to the weight storage unit 150.
  • the weight storage unit 150 has a function of writing the weight matrix W updated by the arithmetic unit 140 into the updating weight memory 120.
  • the weight storage unit 150 updates the weight matrix W stored in the updating weight memory 120 to the input weight matrix W.
  • the weight storage unit 150 may have a function of temporarily storing the weight matrix W.
  • the weight storage unit 150 stores the update target weight (weight matrix W) of each unit 71 in the learning process in the updating weight memory 120.
  • the weight storage unit 150 stores the weight matrix W in the updating weight memory 120, so that the updated weight matrix W is used in the next learning process.
  • When the learning is completed, the arithmetic unit 140 notifies the weight update control unit 160 that the learning is completed. Upon receiving the notification, the weight update control unit 160 instructs the weight copy unit 170 to copy the weight matrix W stored in the updating weight memory 120 to the inference weight memory 110, that is, to replace the weight matrix W stored in the inference weight memory 110 with the copied weight matrix W.
  • Accordingly, the weight copy unit 170 copies the weight matrix W stored in the updating weight memory 120 to the inference weight memory 110, replacing the weight matrix W stored in the inference weight memory 110 with the copied weight matrix W.
  • the case where the learning is completed is a case where a learning process including a weight update process using a predetermined number of learning data is repeatedly executed a predetermined number of times.
  • the weight copy unit 170 updates the weight stored in the inference weight memory 110 to the update target weight stored in the updating weight memory 120 corresponding to the weight.
  • the weight copy unit 170 stores the weight (weight matrix W) of each unit 71 used in the inference processing in the inference weight memory 110.
  • the weight copy unit 170 stores the weight matrix W in the inference weight memory 110, so that the inference processing after learning is completed uses the learned model.
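  • For illustration only, the dual-memory arrangement of FIG. 1 can be sketched in software as follows; the class, the ReLU activation, and the gradient callback are assumptions made for this sketch, not the circuit itself.

```python
import numpy as np
from copy import deepcopy

class LearningDevice100:
    """Sketch of FIG. 1: separate inference and updating weight memories, so the
    inference weights stay stable while the updating weights are being learned."""

    def __init__(self, weights):
        self.inference_weight_memory = deepcopy(weights)   # inference weight memory 110
        self.updating_weight_memory = deepcopy(weights)    # updating weight memory 120

    def _load(self, for_learning):
        # weight load selection unit 130: pick the memory for the requested process
        return self.updating_weight_memory if for_learning else self.inference_weight_memory

    def infer(self, x):
        w = self._load(for_learning=False)
        return np.maximum(0.0, w @ x)                        # operation unit 140 (activation assumed)

    def learn_step(self, x, grad_fn, lr=0.01):
        w = self._load(for_learning=True)
        self.updating_weight_memory = w - lr * grad_fn(w, x)  # operation unit 140 + weight storage unit 150

    def on_learning_completed(self):
        # weight update control unit 160 -> weight copy unit 170:
        # replace the inference weights with the learned weights
        self.inference_weight_memory = deepcopy(self.updating_weight_memory)
```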
  • the control unit 180 has a function of controlling the weight load selection unit 130, the calculation unit 140, and the weight update control unit 160.
  • the control unit 180 notifies the weight load selection unit 130 and the calculation unit 140 which of the inference process and the learning process is to be executed based on the type of the input image.
  • When the learning is completed, the control unit 180 notifies the weight update control unit 160 via the arithmetic unit 140 that the learning is completed.
  • FIG. 2 is a block diagram showing a configuration example of the arithmetic unit 140.
  • the arithmetic unit 140 includes an inference weight register 141, an updating weight register 142, an arithmetic unit weight load selection unit 143, an arithmetic unit 144, and an arithmetic unit weight storage unit 145.
  • the learning device 100 includes a plurality of arithmetic units corresponding to each unit 71.
  • the inference weight register 141 has a function of storing the weight w of the unit 71 corresponding to the operation unit 140 in the weight matrix W input from the operation unit 140 and loaded from the inference weight memory 110.
  • the updating weight register 142 has a function of storing the weight w of the unit 71 corresponding to the arithmetic unit 140, of the weight matrix W input from the arithmetic unit 140 and loaded from the updating weight memory 120.
  • the arithmetic unit weight load selection unit 143 has a function of loading the weight w from one of the inference weight register 141 and the updating weight register 142. When the inference processing is performed, the arithmetic unit weight load selector 143 loads the weight w from the inference weight register 141.
  • the arithmetic unit weight load selector 143 loads the weight w from the updating weight register 142.
  • the computing unit weight load selection unit 143 inputs the loaded weight w to the computing unit 144.
  • the arithmetic unit 144 has a function of performing the above-described inference processing operation using the weight w loaded from the inference weight register 141.
  • the arithmetic unit 144 has a function of performing the above-described arithmetic operation for the learning process using the weight w loaded from the updating weight register 142.
  • the arithmetic unit 144 inputs the weight w updated through the operation for the learning process to the arithmetic unit weight storage unit 145.
  • the arithmetic unit weight storage unit 145 has a function of updating the weight w stored in the updating weight register 142 to the input weight w.
  • the arithmetic unit 140 shown in FIG. 2 includes an inference weight register 141 that stores the parameter (weight w) of the unit 71 corresponding to the arithmetic unit 140 used in the inference process, and an updating weight register 142 that stores the parameter (weight w) of the corresponding unit 71 to be updated in the learning process.
  • the inference weight register 141 and the updating weight register 142 may store one column or the like of the weight matrix W instead of one weight w.
  • since the arithmetic unit 140 includes the inference weight register 141 and the updating weight register 142, the number of times the weight matrix W is read from the inference weight memory 110 and the updating weight memory 120 is reduced. That is, since the power consumed for reading is reduced, the cost of switching between the inference processing and the learning processing is reduced.
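  • For illustration only, the two-register arrangement of FIG. 2 can be sketched as follows; the per-unit data layout and the names are assumptions made for this sketch.

```python
class ArithmeticUnit:
    """Sketch of FIG. 2: the weights needed by one unit 71 are kept in local
    registers, so the shared weight memories are read less often when the
    device switches between inference and learning."""

    def __init__(self):
        self.inference_weight_register = None    # inference weight register 141
        self.updating_weight_register = None     # updating weight register 142

    def load_registers(self, w_inference_row, w_updating_row):
        self.inference_weight_register = w_inference_row
        self.updating_weight_register = w_updating_row

    def compute(self, x, for_learning):
        # arithmetic unit weight load selection unit 143 picks a register;
        # arithmetic unit 144 performs the multiply-accumulate of Equation (2C)
        w = self.updating_weight_register if for_learning else self.inference_weight_register
        return sum(wi * xi for wi, xi in zip(w, x))
```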
  • the configuration of the calculation unit 140 of the present embodiment may be a configuration other than the configuration illustrated in FIG. 2.
  • the arithmetic unit 140 may have only one register that stores the weight w of the unit 71 corresponding to the arithmetic unit 140 in the input weight matrix W.
  • the weight load selecting unit 130 loads the weight matrix W from the inference weight memory 110 or the updating weight memory 120 according to the process to be executed.
  • FIG. 3 is a block diagram showing another configuration example of the first embodiment of the learning device according to the present invention.
  • the learning device 101 illustrated in FIG. 3 includes the components included in the learning device 100 illustrated in FIG. 1 and a fixed-layer weight memory 190.
  • a layer close to the input layer, that is, a shallow layer, has a feature that its parameters do not change largely even if the parameter update processing is performed a plurality of times.
  • When the parameters (weights in this example) used to calculate the output of each unit 71 in some of the first to n-th layers are fixed, the fixed-layer weight memory 190 stores the weight matrix related to the fixed weights. In addition, the inference weight memory 110 and the updating weight memory 120 store weight matrices for the weights that are not fixed.
  • the weight load selection unit 130 shown in FIG. 3 loads the weight matrices from the inference weight memory 110 and the fixed-layer weight memory 190, respectively.
  • FIG. 4 is a flowchart illustrating an operation of an inference process and a learning process performed by the learning device 100 according to the first embodiment.
  • an image is input to the control unit 180 of the learning device 100 (Step S101).
  • the control unit 180 determines the type of the input image (Step S102).
  • If the input image is a learning image, the control unit 180 notifies the weight load selection unit 130 and the arithmetic unit 140 that the process to be executed is the learning process (step S103).
  • In the process of step S103, the control unit 180 inputs the input learning image to the calculation unit 140.
  • the control unit 180 inputs predetermined input data obtained from the learning image, such as the feature amount of the learning image extracted by the unit 71 of the previous layer, to the arithmetic unit 140.
  • the weight load selector 130 loads the weight matrix W from the updating weight memory 120 (step S104).
  • the weight load selection unit 130 inputs the loaded weight matrix W to the calculation unit 140.
  • the arithmetic unit 140 performs a learning process using the loaded weight matrix W (step S105).
  • the calculation unit 140 inputs the weight matrix W updated in the learning process to the weight storage unit 150.
  • the weight storage unit 150 updates the weight matrix W stored in the updating weight memory 120 to the weight matrix W updated in the learning process (step S106).
  • Next, the control unit 180 determines whether the learning has been completed (step S107). If it is determined that the learning has not been completed (No in step S107), the learning device 100 ends the learning process on the input learning image.
  • If it is determined that the learning has been completed (Yes in step S107), the control unit 180 notifies the weight update control unit 160 via the arithmetic unit 140 that the learning has been completed. Next, the weight update control unit 160 activates the weight copy unit 170 (step S108).
  • Next, the weight copy unit 170 copies the weight matrix W stored in the updating weight memory 120 to the inference weight memory 110, replacing the weight matrix W stored in the inference weight memory 110 with the copied weight matrix W (step S109). After the replacement, the learning device 100 ends the learning process on the input learning image.
  • If the input image is a determination image, the control unit 180 notifies the weight load selection unit 130 and the arithmetic unit 140 that the process to be executed is the inference process (step S110).
  • In the process of step S110, the control unit 180 inputs the input determination image to the calculation unit 140.
  • the control unit 180 inputs predetermined input data obtained from the discrimination image, such as the feature amount of the discrimination image extracted by the unit 71 of the previous layer, to the calculation unit 140.
  • the weight load selecting unit 130 loads the weight matrix W from the inference weight memory 110 (Step S111).
  • the weight load selection unit 130 inputs the loaded weight matrix W to the calculation unit 140.
  • the weight load selection unit 130 of the learning device 101 shown in FIG. 3 also loads a weight matrix from the fixed-layer weight memory 190 in step S111.
  • the arithmetic unit 140 performs an inference process using the loaded weight matrix W (step S112). After executing the inference processing, the learning device 100 ends the inference processing on the input determination image. Note that the learning device 100 may collectively execute inference processing on a plurality of determination images.
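  • For illustration only, the flow of steps S101 to S112 in FIG. 4 can be sketched as follows, reusing the LearningDevice100 sketch shown earlier; the argument names and the externally supplied completion flag are assumptions made for this sketch.

```python
def process_image(device, image, image_type, grad_fn, learning_completed=False):
    # S101-S102: an image is input and its type is determined by the control unit
    if image_type == "learning":
        device.learn_step(image, grad_fn)        # S103-S106: load W, learn, store updated W
        if learning_completed:                   # S107: has learning been completed?
            device.on_learning_completed()       # S108-S109: copy learned W to the inference memory
        return None
    return device.infer(image)                   # S110-S112: load inference W and infer
```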
  • the learning device 100 can execute the inference process and the learning process illustrated in FIG. 4 in parallel. Therefore, the weight load selection unit 130 may load the weight matrix W obtained from the weight memory when the process to be executed is switched.
  • the timing at which the weight load selector 130 loads the weight matrix W is not particularly limited.
  • the learning device 100 of the present embodiment includes an inference weight memory 110 and an updating weight memory 120. Therefore, even while the weight matrix W stored in the updating weight memory 120 is being updated, the calculation unit 140 performs the inference process using the weight matrix W stored in the inference weight memory 110. I can do it.
  • since the learning device 100 includes the inference weight memory 110 and the updating weight memory 120, the weight matrix W used for inference processing is kept in a consistent state even while learning is in progress. That is, the learning device 100 can execute the inference process and the learning process in parallel.
  • FIG. 5 is a block diagram illustrating a configuration example of a second embodiment of the learning device according to the present invention.
  • the learning device 200 includes a first weight memory 210, a second weight memory 220, a weight load selection unit 230, a calculation unit 240, a weight store selection unit 250, a weight update control unit 260, and a control unit 270.
  • the learning device 200 may have a plurality of weight memories as a solution.
  • the roles of the plurality of weight memories are not fixed.
  • the learning device 200 has a configuration in which the roles of a plurality of weight memories are interchangeable.
  • the function of each component of the learning device 200 that performs the inference process and the learning process in parallel will be described.
  • the first weight memory 210 and the second weight memory 220 have a function of storing a weight matrix W used for inference processing or learning processing.
  • the weight load selection unit 230 has a function of loading the weight matrix W from one of the first weight memory 210 and the second weight memory 220.
  • the weight load selection unit 230 holds a memory address of a weight memory in which a weight matrix W for inference processing is stored and a memory address of a weight memory in which a weight matrix W for learning processing is stored.
  • the held memory address is a memory address of either the first weight memory 210 or the second weight memory 220.
  • the weight load selection unit 230 loads the weight matrix W for inference processing from either the first weight memory 210 or the second weight memory 220 according to the stored memory address.
  • the weight load selecting unit 230 loads the weight matrix W for learning processing from either the first weight memory 210 or the second weight memory 220.
  • the weight load selection unit 230 inputs the loaded weight matrix W to the calculation unit 240.
  • the operation unit 240 has the same function as the operation unit 140 of the first embodiment. Note that the configuration of the calculation unit 240 of the present embodiment may be the configuration shown in FIG. 2.
  • the weight store selection unit 250 has a function of updating the weight matrix W stored in the first weight memory 210 or the second weight memory 220 with the input weight matrix W.
  • the weight store selection unit 250 holds a memory address of a weight memory in which a weight matrix W for inference processing is stored and a memory address of a weight memory in which a weight matrix W for learning processing is stored.
  • the weight store selection unit 250 updates the weight matrix W stored in the first weight memory 210 or the second weight memory 220 to the input weight matrix W according to the stored memory address.
  • When the learning is completed, the arithmetic unit 240 notifies the weight update control unit 260 that the learning has been completed. Upon receiving the notification, the weight update control unit 260 instructs the weight load selection unit 230 and the weight store selection unit 250 to exchange the roles of the held memory addresses.
  • That is, the weight update control unit 260 instructs them to treat the memory address of the weight memory in which the weight matrix W for inference processing is stored as the memory address of the weight memory in which the weight matrix W for learning processing is stored.
  • Similarly, the weight update control unit 260 instructs them to treat the memory address of the weight memory in which the weight matrix W for learning processing is stored as the memory address of the weight memory in which the weight matrix W for inference processing is stored.
  • the case where the learning is completed is a case where a learning process including a weight update process using a predetermined number of learning data is repeatedly executed a predetermined number of times.
  • In other words, after the learning is completed, the weight update control unit 260 instructs the weight load selection unit 230 to load the weights used in the inference process from the weight memory in which the weight matrix W for the learning process has been stored.
  • Similarly, the weight update control unit 260 instructs the weight load selection unit 230 to load the update-target weights for the learning process from the weight memory in which the weight matrix W for the inference process has been stored. Therefore, the weight copy unit 170 of the first embodiment is unnecessary in the learning device 200 of the present embodiment.
  • The weight load selection unit 230 and the weight store selection unit 250 may receive from the weight update control unit 260, before the learning is completed, an instruction to switch the roles of the memory addresses, and may then switch the roles at a predetermined timing such as after the learning is completed.
  • That is, the weight update control unit 260 may notify the weight load selection unit 230 and the weight store selection unit 250 of the memory address of the role exchange destination in advance, before the learning is completed.
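  • For illustration only, the role-exchange scheme of the second embodiment can be sketched as follows; the class, the ReLU activation, and the gradient callback are assumptions made for this sketch. Swapping the two indices plays the role of exchanging the held memory addresses, so no copy corresponding to the weight copy unit 170 is needed.

```python
import numpy as np

class LearningDevice200:
    """Sketch of FIG. 5: two interchangeable weight memories whose roles
    (inference / updating) are exchanged when learning is completed."""

    def __init__(self, weights):
        self.memories = [np.array(weights), np.array(weights)]  # first / second weight memory 210, 220
        self.inference_idx, self.updating_idx = 0, 1             # roles held by selectors 230 and 250

    def infer(self, x):
        w = self.memories[self.inference_idx]                    # weight load selection unit 230
        return np.maximum(0.0, w @ x)                            # operation unit 240 (activation assumed)

    def learn_step(self, x, grad_fn, lr=0.01):
        w = self.memories[self.updating_idx]
        self.memories[self.updating_idx] = w - lr * grad_fn(w, x)  # weight store selection unit 250

    def on_learning_completed(self):
        # weight update control unit 260: exchange the roles of the two memories
        self.inference_idx, self.updating_idx = self.updating_idx, self.inference_idx
```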
  • the control unit 270 has the same function as the control unit 180 of the first embodiment.
  • FIG. 6 is a block diagram showing another configuration example of the learning apparatus according to the second embodiment of the present invention.
  • the learning device 201 illustrated in FIG. 6 includes the components included in the learning device 200 illustrated in FIG. 5 and a fixed-layer weight memory 280.
  • the learning device 201 of the present embodiment also does not need to update all parameters (weights in this example) used for calculating the output of each unit 71 in the parameter update processing (learning processing).
  • the fixed layer weight memory 280 stores a weight matrix related to fixed weights. Further, the first weight memory 210 and the second weight memory 220 store weight matrices for weights that are not fixed.
  • FIG. 7 is a flowchart illustrating an operation of an inference process and a learning process performed by the learning device 200 according to the second embodiment.
  • First, an image is input to the control unit 270 of the learning device 200 (step S201).
  • the control unit 270 determines the type of the input image (Step S202).
  • If the input image is a learning image, the control unit 270 notifies the weight load selection unit 230 and the arithmetic unit 240 that the process to be executed is the learning process (step S203).
  • In the process of step S203, the control unit 270 inputs the input learning image to the calculation unit 240.
  • the control unit 270 inputs predetermined input data obtained from the learning image, such as the feature amount of the learning image extracted by the unit 71 of the previous layer, to the arithmetic unit 240.
  • the weight load selection unit 230 loads the weight matrix W from the weight memory in which the weight matrix W for the learning process is stored (Step S204).
  • the weight load selection unit 230 inputs the loaded weight matrix W to the calculation unit 240.
  • the arithmetic unit 240 performs a learning process using the loaded weight matrix W (step S205).
  • the calculation unit 240 inputs the weight matrix W updated in the learning process to the weight store selection unit 250.
  • the weight store selection unit 250 updates the weight matrix W for the learning process stored in the weight memory with the weight matrix W updated in the learning process (step S206).
  • control unit 270 determines whether or not the learning has been completed (step S207). If it is determined that the learning has not been completed (No in step S207), the learning device 200 ends the learning process on the input learning image.
  • When it is determined that the learning has been completed (Yes in step S207), the control unit 270 notifies the weight update control unit 260 via the arithmetic unit 240 that the learning has been completed.
  • the weight update control unit 260 instructs the weight load selection unit 230 and the weight store selection unit 250 to exchange the role of the stored memory address (step S208).
  • the weight load selector 230 and the weight store selector 250 swap the roles of the stored memory addresses (step S209). After the replacement, the learning device 200 ends the learning process on the input learning image.
  • If the input image is a determination image, the control unit 270 notifies the weight load selection unit 230 and the arithmetic unit 240 that the process to be executed is the inference process (step S210).
  • In the process of step S210, the control unit 270 inputs the input determination image to the calculation unit 240.
  • the control unit 270 inputs predetermined input data obtained from the discrimination image, such as the feature amount of the discrimination image extracted by the unit 71 of the previous layer, to the calculation unit 240.
  • the weight load selection unit 230 loads the weight matrix W from the weight memory storing the weight matrix W for inference processing (step S211).
  • the weight load selection unit 230 inputs the loaded weight matrix W to the calculation unit 240.
  • the weight load selector 230 of the learning device 201 shown in FIG. 6 also loads the weight matrix from the fixed-layer weight memory 280 in the process of step S211.
  • the calculation unit 240 performs an inference process using the loaded weight matrix W (step S212). After executing the inference processing, the learning device 200 ends the inference processing on the input discriminating image. Note that the learning device 200 may collectively execute inference processing on a plurality of determination images.
  • the learning device 200 can execute the inference process and the learning process illustrated in FIG. 7 in parallel. Therefore, the weight load selection unit 230 may load the weight matrix W obtained from the weight memory when the process to be executed is switched.
  • the timing at which the weight load selector 230 loads the weight matrix W is not particularly limited.
  • the learning device 200 of the present embodiment includes a first weight memory 210 and a second weight memory 220. Therefore, for example, even while the weighting matrix W for learning processing stored in the first weight memory 210 is being updated, the arithmetic unit 240 performs the processing for inference processing stored in the second weight memory 220. Inference processing can be performed using the weight matrix W. That is, the learning device 200 can execute the inference process and the learning process in parallel.
  • the learning device 200 of the present embodiment can update the weight matrix W for inference processing to the learned weight matrix W without using the weight copy unit 170 of the first embodiment. That is, the learning device 200 can use the learned weight matrix W more quickly than in the first embodiment.
  • FIG. 8 is an explanatory diagram showing an example of a hardware configuration of the learning device according to the present invention.
  • the learning device illustrated in FIG. 8 includes a processor 108, a main storage device 102, an auxiliary storage device 103, an interface 104, an output device 105, and an input device 106. Further, the processor 108 may include various arithmetic and processing devices such as the CPU 109 and the GPU 107.
  • the operation of the learning device may be stored in the auxiliary storage device 103 in the form of a program.
  • the CPU 109 reads the program from the auxiliary storage device 103, expands the program in the main storage device 102, and executes a predetermined process in the learning device according to the expanded program.
  • the CPU 109 is an example of an information processing device that operates according to a program.
  • the learning device may include, for example, an MPU (Micro Processing Unit), an MCU (Memory Control Unit), or a GPU (Graphics Processing Unit) in addition to the CPU (Central Processing Unit).
  • FIG. 8 illustrates an example in which the learning apparatus further includes a GPU 107 in addition to the CPU 109.
  • the auxiliary storage device 103 is an example of a non-transitory tangible medium.
  • Other examples of non-transitory tangible media include a magnetic disk, a magneto-optical disk, a CD-ROM (Compact Disk Read Only Memory), a DVD-ROM (Digital Versatile Disk Read Only Memory) connected via the interface 104, A semiconductor memory and the like are included.
  • When the program is distributed to the learning device, the learning device that has received the distributed program may expand the program in the main storage device 102 and execute the predetermined process according to the expanded program.
  • the program may be for realizing a part of a predetermined process in the learning device. Further, the program may be a difference program for implementing a predetermined process in the learning device, which is used in combination with another program already stored in the auxiliary storage device 103.
  • the interface 104 transmits and receives information to and from other devices.
  • the output device 105 presents information to a user. Further, the input device 106 receives input of information from a user.
  • some elements shown in FIG. 8 can be omitted depending on the processing content in the learning device. For example, if the learning device does not present information to the user, the output device 105 can be omitted. Also, for example, if the learning device does not accept information input from the user, the input device 106 can be omitted.
  • Some or all of the components of the learning device may be realized by general-purpose or special-purpose circuitry, processors, or the like, or combinations thereof. These may be constituted by a single chip, or may be constituted by a plurality of chips connected via a bus. In addition, some or all of the components may be realized by a combination of the above-described circuits and the like and a program.
  • the plurality of information processing devices or circuits may be centrally arranged or distributed.
  • the information processing device, the circuit, and the like may be realized as a form in which each is connected via a communication network, such as a client and server system or a cloud computing system.
  • FIG. 9 is a block diagram showing an outline of a learning device according to the present invention.
  • the learning device 10 according to the present invention includes a first storage unit 11 (for example, the inference weight memory 110) that stores the parameters of each unit used in an inference process of calculating, in a predetermined order, an output for discrimination data of each unit of a discrimination model in which a plurality of layers each composed of one or more units are combined in a layered manner, and a second storage unit 12 (for example, the updating weight memory 120) that stores parameters to be updated in a learning process of updating at least a part of the parameters of each unit based on the output of each unit for learning data.
  • the learning device can execute inference processing and learning processing in parallel.
  • the learning device 10 may include a load unit (for example, the weight load selection unit 130) that loads parameters from the first storage unit 11 or the second storage unit 12, and an execution unit (for example, the operation unit 140) that executes the inference process or the learning process using the loaded parameters.
  • the learning device can execute inference processing and learning processing in parallel.
  • the learning device 10 may include a plurality of execution units corresponding to the respective units, and each execution unit may include a third storage unit (for example, the inference weight register 141) that stores the parameter of the corresponding unit used in the inference processing and a fourth storage unit (for example, the updating weight register 142) that stores the parameter of the corresponding unit to be updated in the learning process.
  • the learning device can reduce the time for switching between execution of the calculation for the inference process and execution of the calculation for the learning process.
  • the learning device 10 may include an update unit (for example, the weight copy unit 170) that, after the learning process using a predetermined number of learning data is completed, updates the parameters stored in the first storage unit 11 to the corresponding update-target parameters stored in the second storage unit 12.
  • the learning device can execute the inference process using the learned model.
  • the learning device 10 may include an instruction unit (for example, the weight update control unit 260) that, after the learning process using a predetermined number of learning data is completed, instructs the load unit to load the parameters used in the inference process from the second storage unit 12 and to load the parameters to be updated in the learning process from the first storage unit 11.
  • the learning device can omit the process of copying the learned model to the inference weight memory.
  • the learning device 10 may include a fifth storage unit (for example, the fixed-layer weight memory 190) that stores the fixed parameters among the parameters of each unit, and the first storage unit 11 and the second storage unit 12 may store the parameters that are not fixed among the parameters of each unit.
  • the learning device can support a discrimination model in which the weight of each unit in some layers is fixed.
  • the discrimination model may be a neural network.
  • the learning device can execute deep learning.
  • Reference signs: 10 Learning device; 11 First storage unit; 12 Second storage unit; 70 Large-scale learning circuit; 71 Unit; 80 Inference device; 81, 91 Weight memory; 82, 92 Weight load unit; 83, 93, 140, 240 Operation unit; 94, 150 Weight storage unit; 102 Main storage device; 103 Auxiliary storage device; 104 Interface; 105 Output device; 106 Input device; 107 GPU; 108 Processor; 109 CPU; 110 Inference weight memory; 120 Updating weight memory; 130, 230 Weight load selection unit; 141 Inference weight register; 142 Updating weight register; 143 Operation unit weight load selection unit; 144 Operation unit; 145 Operation unit weight storage unit; 160, 260 Weight update control unit; 170 Weight copy unit; 180, 270 Control unit; 190, 280 Fixed-layer weight memory; 210 First weight memory; 220 Second weight memory; 250 Weight store selection unit

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Image Analysis (AREA)

Abstract

This learning device 10 comprises: a first storage unit 11 which stores parameters of each unit used in inference processing for calculating in a predetermined order an output for the discrimination data of each unit of a discrimination model in which a plurality of layers each composed of one or more units are combined in a layer shape; and a second storage unit 12 which stores parameters to be updated in the learning processing for updating at least a part of the parameters of each unit on the basis of the output of each unit for learning data.

Description

Learning device, learning method, and learning program

The present invention relates to a learning device, a learning method, and a learning program.

With the spread of machine learning, further innovation is required to cope with ever-changing situations. In order to cope with a situation that changes from moment to moment, it is necessary to incorporate the various raw data acquired in the environment of use into learning as learning data. The learning data is data used for learning the discrimination model.

In learning using learning data (machine learning), for example, the parameters of arithmetic expressions and discriminants used in a predetermined learning device are adjusted based on the relationship between input and output indicated by the learning data. The learning device is, for example, a discrimination model that performs discrimination regarding one or a plurality of labels when data is input.

Regarding the relationship between computational resources and computational accuracy in machine learning, for example, Non-Patent Document 1 describes an example of a learning computation circuit and a learning method for efficiently executing deep learning of a neural network, particularly with low power consumption.

Non-Patent Document 2 describes an example of a learning method that shortens the learning time in deep learning with a CNN (Convolutional Neural Network) by dividing the plurality of convolutional layers into layers whose weights are fixed and layers whose weights are updated (extended function layers), thereby restricting the learning range.

As an example of a circuit configuration for learning operations in machine learning, Non-Patent Document 3 describes an optimization example of an accelerator design based on an FPGA (Field-Programmable Gate Array).

The outline of the learning method will be described below. FIG. 10 is an explanatory diagram showing an example of a general learning method and a circuit configuration for learning in a neural network including one or more intermediate layers between an input layer and an output layer.

In the example shown in FIG. 10, the large-scale learning circuit 70 learns the entire neural network, which is a predetermined discrimination model, in order to support general-purpose learning algorithms.

The speech balloon attached to the large-scale learning circuit 70 shown in FIG. 10 schematically shows the direction and the range of processing in the learning process of the neural network. In the balloon, a unit 71 corresponding to a neuron of the neural network is represented by an ellipse.

A line segment 72 (a line connecting the units 71 shown in FIG. 10) represents a connection between units 71. An arrow 73 (a rightward thick arrow shown in FIG. 10) represents the inference process and the range of the inference process. An arrow 74 (a leftward thick arrow shown in FIG. 10) represents the parameter update process and the range of the parameter update process. The parameter update process is an example of a learning process.

FIG. 10 shows an example of a feedforward neural network in which the input to each unit 71 is the output of the units 71 in the preceding layer. For example, when time-series information is held, the input to each unit 71 may include the output of the units 71 of the preceding layer at the previous time, as in a recurrent neural network.

Even when the input to each unit 71 includes the output of the units 71 of the preceding layer at the previous time, the direction of the inference processing is still regarded as the direction from the input layer to the output layer (forward direction). The input to each unit 71 is not limited to the above examples.

The inference processing performed in a predetermined order from the input layer is also called "forward propagation". On the other hand, the direction of the parameter update processing is not particularly limited. For example, as in the parameter update processing shown in FIG. 10, the direction of the parameter update processing may be the direction from the output layer to the input layer (reverse direction).

The parameter update processing shown in FIG. 10 is an example of processing executed by the error back-propagation method. However, the parameter update processing is not limited to processing executed by the back-propagation method. For example, the parameter update processing may be executed by STDP (Spike Timing Dependent Plasticity).

Not limited to neural networks, examples of model learning methods in deep learning include the following. First, after learning data is input to the input layer, an inference process of calculating the output of each unit 71 in the forward direction in each layer up to the output layer is performed (forward propagation: see the arrow 73 shown in FIG. 10).

Next, based on an error calculated from the output of the output layer (final output) and the input-output relationship indicated by the learning data, a parameter update process is performed to update the parameters used to calculate the output of each unit 71 in each layer (back propagation: see the arrow 74 shown in FIG. 10). As shown in FIG. 10, the parameter update processing is performed by tracing each layer from the output layer to the first layer in the reverse direction. The parameter update processing is performed so that the calculated error is minimized.

As shown in FIG. 10, when the entire model is the learning target, the parameter update process updates the parameters used to calculate the output of each unit 71 in all the layers after the input layer (the first to n-th layers). The parameter to be updated is, for example, the weight of the connection between units 71 that couples each unit 71 in a layer to a unit 71 in another layer.

A learned model having a high recognition rate is generated by repeatedly executing the parameter update processing described above a plurality of times, for example while the learning data is changed. FIG. 10 shows the large-scale learning circuit 70, which performs the above inference processing and parameter update processing with high calculation accuracy, as an example of an arithmetic circuit that performs learning.

 FIG. 11 is an explanatory diagram showing an example of the input/output of a unit 71 and its connections with other units 71 when focusing on one unit 71. FIG. 11(a) shows an example of the input/output of one unit 71. FIG. 11(b) shows an example of connections between units 71 arranged in two layers.

 As shown in FIG. 11(a), when four inputs (x1 to x4) and one output (z) are given to one unit 71, the operation of the unit 71 is expressed, for example, by Expression (1A).

 z = f(u)   ... Expression (1A)
 where u = a + w1x1 + w2x2 + w3x3 + w4x4   ... Expression (1B)

 Note that f() in Expression (1A) represents an activation function. In Expression (1B), a represents an intercept, and w1 to w4 represent parameters, such as weights, corresponding to the respective inputs (x1 to x4).
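
 As a concrete illustration of Expressions (1A) and (1B), the following is a minimal sketch in Python. The choice of a sigmoid as the activation function f and all of the numeric values are assumptions made only for this example; they are not taken from the document.

```python
import math

def unit_output(a, w, x):
    # u = a + w1*x1 + w2*x2 + w3*x3 + w4*x4   (Expression (1B))
    u = a + sum(wi * xi for wi, xi in zip(w, x))
    # z = f(u), with a sigmoid used here as an example activation function f
    return 1.0 / (1.0 + math.exp(-u))

# Hypothetical example values
a = 0.1
w = [0.5, -0.3, 0.8, 0.2]    # w1 to w4
x = [1.0, 2.0, 0.5, -1.0]    # x1 to x4
z = unit_output(a, w, x)     # Expression (1A)
print(z)
```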

 On the other hand, as shown in FIG. 11(b), when the units 71 are connected between two layers, focusing on the latter layer, the output (z1 to z3) of each unit 71 in that layer for the inputs (x1 to x4) to each unit 71 is expressed, for example, as follows.

 zi = f(ui)   ... Expression (2A)
 where ui = a + wi,1x1 + wi,2x2 + wi,3x3 + wi,4x4   ... Expression (2B)

 In Expression (2A), i is the identifier of a unit 71 within the same layer (i = 1 to 3 in this example). The intercept a in Expression (2B) can also be regarded as the coefficient of a constant term having the value 1 (that is, as one of the parameters).

 In the following, Expression (2B) may be simplified and written as
 ui = Σ(wi,k * xk)   ... Expression (2C)
 Note that the intercept a is omitted in Expression (2C). In addition, k in Expression (2C) represents an input to each unit 71 in the layer, more specifically, the identifier of the other unit 71 that provides the input.

 When the input to each unit 71 in a layer consists only of the outputs of the units 71 of the preceding layer, the simplified expression above can also be written as
 ui^(L) = Σ(wi,k^(L) * xk^(L-1))   ... Expression (2D)

 Note that L in Expression (2D) represents the identifier of a layer, and wi,k in Expression (2D) represents a parameter of each unit i in the L-th layer. More specifically, wi,k corresponds to the weight of the connection between unit i and another unit k (a connection between units 71).

 In the following, the function (activation function) that determines the output value of a unit 71 may be simplified and written as z = Σ(w*x), without distinguishing individual units 71.

 The above set of weights is written in vector form as follows.

 wi = [wi,1, wi,2, ..., wi,k]^T   ... Expression (3)

 Expression (3) is called a weight vector. Further, let x = [x1, x2, ..., xk]^T be the input vector, which is the set of inputs to a certain layer, and let W be the weight matrix obtained by arranging the weight vectors side by side; then the output vector z is expressed as f(W^T x). Note that the following relationship holds between the output vector z and the activation function.

 z = f(u) = [f(u1), f(u2), ..., f(un)]   ... Expression (4)
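
 The matrix form above can be checked numerically. The following is a minimal sketch assuming NumPy and a ReLU activation, with three units and four inputs as in the i = 1 to 3, k = 1 to 4 example; the concrete values are arbitrary and only for illustration.

```python
import numpy as np

def relu(u):
    return np.maximum(u, 0.0)

# Hypothetical example: 4 inputs (k = 1..4), 3 units (i = 1..3).
# Each column of W is one weight vector wi = [wi,1, ..., wi,4]^T.
W = np.array([[ 0.2, -0.1,  0.4],
              [ 0.5,  0.3, -0.2],
              [-0.3,  0.8,  0.1],
              [ 0.1, -0.4,  0.6]])     # shape (4, 3)
x = np.array([1.0, 0.5, -1.0, 2.0])    # input vector x = [x1, ..., x4]^T

u = W.T @ x        # ui = Σ(wi,k * xk)                      (Expression (2C))
z = relu(u)        # z = f(W^T x) = [f(u1), f(u2), f(u3)]   (Expression (4))
print(z)
```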

 In the above example, the calculation in which a unit 71 obtains the output z from the input x corresponds to the inference process in the unit 71. In the inference process, the parameters (for example, the weights w) are fixed. The inference process is, for example, a process executed by a monitoring system in operation to determine whether an object in an image is a specific object. On the other hand, the calculation that obtains the parameters of the unit 71 corresponds to the parameter update process in the unit 71.

 FIG. 12 shows an example of an inference device that performs the inference process. FIG. 12 is a block diagram showing a configuration example of a general inference device. The inference device 80 shown in FIG. 12 includes a weight memory 81, a weight load unit 82, and a computation unit 83.

 The weight memory 81 has a function of storing the weight matrix W. The weight load unit 82 has a function of loading the weight matrix W stored in the weight memory 81 from the weight memory 81.

 The weight load unit 82 inputs the loaded weight matrix W to the computation unit 83. The computation unit 83 has a function of performing the above inference process using the input weight matrix W.

 Next, FIG. 13 shows an example of a learning device that performs the parameter update process. FIG. 13 is a block diagram showing a configuration example of a general learning device. The learning device 90 shown in FIG. 13 includes a weight memory 91, a weight load unit 92, a computation unit 93, and a weight store unit 94.

 The weight memory 91 has a function of storing the weight matrix W. The weight load unit 92 has a function of loading the weight matrix W stored in the weight memory 91 from the weight memory 91.

 The weight load unit 92 inputs the loaded weight matrix W to the computation unit 93. The computation unit 93 has a function of performing the above parameter update process using the input weight matrix W.

 The computation unit 93 inputs the weight matrix W updated in the parameter update process to the weight store unit 94. The weight store unit 94 has a function of writing the weight matrix W updated by the computation unit 93 into the weight memory 91.

 Specifically, the weight store unit 94 updates the weight matrix W stored in the weight memory 91 to the input weight matrix W. In writing the weight matrix W, the weight store unit 94 may have a function of temporarily holding the weight matrix W.
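
 To make the division of roles in FIGS. 12 and 13 concrete, the following is a minimal software sketch. The gradient-descent-style update rule, the callable grad_fn (assumed to return a gradient with the same shape as W), and the class and method names are all assumptions made for illustration; they are not the document's circuit design.

```python
import numpy as np

class InferenceDevice80:                        # FIG. 12
    def __init__(self, W):
        self.weight_memory = W                  # weight memory 81
    def infer(self, x):
        W = self.weight_memory                  # weight load unit 82 loads W
        return np.maximum(W.T @ x, 0.0)         # computation unit 83 performs the inference process

class LearningDevice90:                         # FIG. 13
    def __init__(self, W):
        self.weight_memory = W                  # weight memory 91
    def learn(self, x, grad_fn, lr=0.01):
        W = self.weight_memory                  # weight load unit 92 loads W
        W_updated = W - lr * grad_fn(W, x)      # computation unit 93 performs the parameter update process
        self.weight_memory = W_updated          # weight store unit 94 writes the updated W back
```

 Because each device in this sketch reads and writes a single weight_memory, it also illustrates the conflict discussed below: an inference that reads the memory while an update is writing it would use weights that are in the middle of being updated.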

Y. H. Chen, et al., "Eyeriss: An Energy-Efficient Reconfigurable Accelerator for Deep Convolutional Neural Networks", IEEE Journal of Solid-State Circuits, vol. 52, no. 1, Jan. 2017, pp. 127-138.
Wei Liu, et al., "SSD: Single Shot MultiBox Detector", arXiv:1512.02325v5, Dec. 2016.
Chen Zhang, et al., "Optimizing FPGA-based Accelerator Design for Deep Convolutional Neural Networks", In ACM FPGA 2015, pp. 160-170.

 Consider designing a learning device capable of executing the above inference process and learning process in parallel. If the learning device has only one weight memory for storing the weight matrix W, the learning device cannot execute the inference process and the learning process in parallel.

 The reason is as follows. If only one weight memory exists, there is only one place in the learning device where the weight matrix W is stored. Therefore, the learning device may use a weight matrix W that is in the middle of being updated for the inference process.

 As described above, fixed parameters are normally used in the inference process; therefore, if the learning device uses a weight matrix W that is being updated in the inference process, normal inference can no longer be performed. Non-Patent Documents 1 to 3 do not describe a method of executing the inference process and the learning process in parallel.

[Object of the invention]
 Therefore, an object of the present invention is to provide a learning device, a learning method, and a learning program that solve the above-described problem and can execute the inference process and the learning process in parallel.

 A learning device according to the present invention includes: a first storage unit that stores parameters of each unit used in an inference process of calculating, in a predetermined order, an output for discrimination data of each unit of a discrimination model in which a plurality of layers each composed of one or more units are connected in layers; and a second storage unit that stores parameters to be updated in a learning process of updating at least some of the parameters of each unit based on the output of each unit for learning data.

 A learning method according to the present invention includes: storing, in a first storage unit, parameters of each unit used in an inference process of calculating, in a predetermined order, an output for discrimination data of each unit of a discrimination model in which a plurality of layers each composed of one or more units are connected in layers; and storing, in a second storage unit, parameters to be updated in a learning process of updating at least some of the parameters of each unit based on the output of each unit for learning data.

 A learning program according to the present invention causes a computer to execute: a first storage process of storing, in a first storage unit, parameters of each unit used in an inference process of calculating, in a predetermined order, an output for discrimination data of each unit of a discrimination model in which a plurality of layers each composed of one or more units are connected in layers; and a second storage process of storing, in a second storage unit, parameters to be updated in a learning process of updating at least some of the parameters of each unit based on the output of each unit for learning data.

 According to the present invention, the inference process and the learning process can be executed in parallel.

FIG. 1 is a block diagram showing a configuration example of a first embodiment of a learning device according to the present invention.
FIG. 2 is a block diagram showing a configuration example of the computation unit 140.
FIG. 3 is a block diagram showing another configuration example of the first embodiment of the learning device according to the present invention.
FIG. 4 is a flowchart showing the operations of the inference process and the learning process performed by the learning device 100 of the first embodiment.
FIG. 5 is a block diagram showing a configuration example of a second embodiment of the learning device according to the present invention.
FIG. 6 is a block diagram showing another configuration example of the second embodiment of the learning device according to the present invention.
FIG. 7 is a flowchart showing the operations of the inference process and the learning process performed by the learning device 200 of the second embodiment.
FIG. 8 is an explanatory diagram showing an example of a hardware configuration of the learning device according to the present invention.
FIG. 9 is a block diagram showing an outline of the learning device according to the present invention.
FIG. 10 is an explanatory diagram showing an example of a general learning method and of a circuit configuration for learning in a neural network including one or more intermediate layers between the input layer and the output layer.
FIG. 11 is an explanatory diagram showing an example of the input/output of a unit 71 and its connections with other units 71 when focusing on one unit 71.
FIG. 12 is a block diagram showing a configuration example of a general inference device.
FIG. 13 is a block diagram showing a configuration example of a general learning device.

Embodiment 1.
[Description of configuration]
 Hereinafter, embodiments of the present invention will be described with reference to the drawings. FIG. 1 is a block diagram showing a configuration example of the first embodiment of the learning device according to the present invention.

 As shown in FIG. 1, the learning device 100 includes an inference weight memory 110, an updating weight memory 120, a weight load selection unit 130, a computation unit 140, a weight store unit 150, a weight update control unit 160, a weight copy unit 170, and a control unit 180.

 Note that the unidirectional arrows shown in the block diagrams indicate the direction in which data flows. However, the possibility of data flowing in both directions at the locations where the arrows are drawn is not excluded.

 In order for the learning device 100 to execute the inference process and the learning process in parallel, providing the learning device 100 with a plurality of weight memories can be considered as a solution. The function of each component of the learning device 100, which executes the inference process and the learning process in parallel, is described below.

 The inference weight memory 110 has a function of storing the weight matrix W (a group of parameters) used for the inference process. The updating weight memory 120 has a function of storing the weight matrix W used for the learning process.

 The weight load selection unit 130 has a function of loading the weight matrix W from either the inference weight memory 110 or the updating weight memory 120. When the inference process is performed, the weight load selection unit 130 loads the weight matrix W from the inference weight memory 110.

 When the learning process is performed, the weight load selection unit 130 loads the weight matrix W from the updating weight memory 120. The weight load selection unit 130 inputs the loaded weight matrix W to the computation unit 140.

 The computation unit 140 has a function of performing the above inference process using the weight matrix W loaded from the inference weight memory 110. The computation unit 140 also has a function of performing the above learning process using the weight matrix W loaded from the updating weight memory 120. That is, because different weight matrices W are used, the computation unit 140 can execute the inference process and the learning process in parallel.

 Specifically, the computation unit 140 executes the inference process of calculating, in a predetermined order, the output for discrimination data of each unit 71 of a discrimination model in which a plurality of layers each composed of one or more units 71 are connected in layers.

 The inference weight memory 110 stores the weights (the weight matrix W) of each unit 71 used in the inference process. The weight of each unit 71 is the parameter of each unit 71 in this embodiment. The discrimination model is, for example, a neural network.

 The computation unit 140 also executes the learning process of updating at least some of the weights of each unit 71 based on the output of each unit 71 for learning data. The updating weight memory 120 stores the weights to be updated (the weight matrix W) in the learning process.

 The weight load selection unit 130 loads weights from the inference weight memory 110 or the updating weight memory 120. The computation unit 140 executes the inference process or the learning process using the loaded weights.

 The computation unit 140 inputs the weight matrix W updated in the learning process to the weight store unit 150. The weight store unit 150 has a function of writing the weight matrix W updated by the computation unit 140 into the updating weight memory 120.

 Specifically, the weight store unit 150 updates the weight matrix W stored in the updating weight memory 120 to the input weight matrix W. In writing the weight matrix W, the weight store unit 150 may have a function of temporarily holding the weight matrix W.

 That is, the weight store unit 150 stores the weights to be updated (the weight matrix W) of each unit 71 in the learning process in the updating weight memory 120. Because the weight store unit 150 stores the weight matrix W in the updating weight memory 120, the updated weight matrix W is used in the next learning process.

 When the learning is completed, the computation unit 140 notifies the weight update control unit 160 that the learning has been completed. Upon receiving the notification, the weight update control unit 160 instructs the weight copy unit 170 to copy the weight matrix W stored in the updating weight memory 120 to the inference weight memory 110 and to replace the weight matrix W stored in the inference weight memory 110 with the copied weight matrix W.

 Upon receiving the instruction, the weight copy unit 170 copies the weight matrix W stored in the updating weight memory 120 to the inference weight memory 110 and replaces the weight matrix W stored in the inference weight memory 110 with the copied weight matrix W.

 The learning is completed, for example, when the learning process, including the weight update process using a predetermined number of pieces of learning data, has been repeatedly executed a predetermined number of times. When the learning is completed, the weight copy unit 170 updates the weights stored in the inference weight memory 110 to the corresponding weights to be updated stored in the updating weight memory 120.

 That is, the weight copy unit 170 stores the weights (the weight matrix W) of each unit 71 used in the inference process in the inference weight memory 110. Because the weight copy unit 170 stores the weight matrix W in the inference weight memory 110, the learned model is used in the inference process after the learning is completed.

 The control unit 180 has a function of controlling the weight load selection unit 130, the computation unit 140, and the weight update control unit 160. Based on the type of the input image, the control unit 180 notifies the weight load selection unit 130 and the computation unit 140 of whether the inference process or the learning process is to be executed.

 When the learning is completed, the control unit 180 notifies the weight update control unit 160 via the computation unit 140 that the learning has been completed.
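
 A minimal software sketch of how the components of FIG. 1 divide the work is given below. The simplified gradient-style update step, the callable grad_fn, and the class and method names are assumptions made only for illustration; the actual device is a hardware configuration.

```python
import numpy as np

class LearningDevice100:
    def __init__(self, W_init):
        W_init = np.array(W_init, dtype=float)
        self.inference_weights = W_init.copy()    # inference weight memory 110
        self.updating_weights = W_init.copy()     # updating weight memory 120

    def infer(self, x):
        W = self.inference_weights                # weight load selection unit 130 (inference side)
        return np.maximum(W.T @ x, 0.0)           # computation unit 140: inference process

    def learn(self, x, grad_fn, lr=0.01):
        W = self.updating_weights                 # weight load selection unit 130 (learning side)
        W_new = W - lr * grad_fn(W, x)            # computation unit 140: learning process
        self.updating_weights = W_new             # weight store unit 150 writes back

    def on_learning_complete(self):
        # weight update control unit 160 triggers the weight copy unit 170:
        # the learned weights replace the weights used for the inference process.
        self.inference_weights = self.updating_weights.copy()
```

 Because infer() reads only inference_weights while learn() reads and writes only updating_weights, the two can proceed without interfering with each other, which is the point of having two weight memories.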

 FIG. 2 is a block diagram showing a configuration example of the computation unit 140. As shown in FIG. 2, the computation unit 140 includes an inference weight register 141, an updating weight register 142, an arithmetic-unit weight load selection unit 143, an arithmetic unit 144, and an arithmetic-unit weight store unit 145.

 When the computation unit 140 has the configuration shown in FIG. 2, the learning device 100 includes a plurality of computation units each corresponding to a unit 71.

 The inference weight register 141 has a function of storing, out of the weight matrix W loaded from the inference weight memory 110 and input to the computation unit 140, the weight w of the unit 71 corresponding to the computation unit 140.

 The updating weight register 142 has a function of storing, out of the weight matrix W loaded from the updating weight memory 120 and input to the computation unit 140, the weight w of the unit 71 corresponding to the computation unit 140.

 The arithmetic-unit weight load selection unit 143 has a function of loading the weight w from either the inference weight register 141 or the updating weight register 142. When the inference process is performed, the arithmetic-unit weight load selection unit 143 loads the weight w from the inference weight register 141.

 When the learning process is performed, the arithmetic-unit weight load selection unit 143 loads the weight w from the updating weight register 142. The arithmetic-unit weight load selection unit 143 inputs the loaded weight w to the arithmetic unit 144.

 The arithmetic unit 144 has a function of performing the computation for the above inference process using the weight w loaded from the inference weight register 141. The arithmetic unit 144 also has a function of performing the computation for the above learning process using the weight w loaded from the updating weight register 142.

 The arithmetic unit 144 inputs the weight w updated through the computation for the learning process to the arithmetic-unit weight store unit 145. The arithmetic-unit weight store unit 145 has a function of updating the weight w stored in the updating weight register 142 to the input weight w.

 As described above, the computation unit 140 shown in FIG. 2 includes the inference weight register 141, which stores the parameter (weight w) of the unit 71 corresponding to the computation unit 140 that is used in the inference process, and the updating weight register 142, which stores the parameter (weight w) of the unit 71 corresponding to the computation unit 140 that is to be updated in the learning process.

 Note that the inference weight register 141 and the updating weight register 142 may each store, for example, one column of the weight matrix W instead of a single weight w.

 Because the computation unit 140 includes the inference weight register 141 and the updating weight register 142, the number of times the weight matrix W is read out from the inference weight memory 110 and the updating weight memory 120 is reduced. That is, because the power consumed by readouts is reduced, the cost of switching between the inference process and the learning process is reduced.

 Note that the configuration of the computation unit 140 of this embodiment may be a configuration other than that shown in FIG. 2. For example, the computation unit 140 may have only one register that stores, out of the input weight matrix W, the weight w of the unit 71 corresponding to the computation unit 140. When the computation unit 140 has only one register, the weight load selection unit 130 loads the weight matrix W from the inference weight memory 110 or the updating weight memory 120 according to the process to be executed.
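
 The register-level structure of FIG. 2 can be sketched per unit as follows. Holding both weights locally means the weight memories only need to be read when the registers are (re)loaded; the class, field, and mode names are assumptions made for illustration.

```python
class ComputationUnit140:                      # one computation unit of FIG. 2, assigned to one unit 71
    def __init__(self):
        self.inference_weight = None           # inference weight register 141
        self.updating_weight = None            # updating weight register 142

    def load_registers(self, w_inference, w_updating):
        # The registers are filled from the weight memories once,
        # so subsequent computations do not re-read the memories.
        self.inference_weight = list(w_inference)
        self.updating_weight = list(w_updating)

    def compute(self, x, mode):
        # arithmetic-unit weight load selection unit 143 chooses the register by mode
        w = self.inference_weight if mode == "inference" else self.updating_weight
        u = sum(wk * xk for wk, xk in zip(w, x))   # arithmetic unit 144
        return max(u, 0.0)                         # example activation (assumption)

    def store_updated(self, w_new):
        self.updating_weight = list(w_new)         # arithmetic-unit weight store unit 145
```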

 FIG. 3 is a block diagram showing another configuration example of the first embodiment of the learning device according to the present invention. The learning device 101 shown in FIG. 3 includes the components of the learning device 100 shown in FIG. 1 and a fixed-layer weight memory 190.

 In the parameter update process (learning process), not all of the parameters used to calculate the output of each unit 71 need to be updated. For example, a layer close to the input layer, that is, a shallow layer, has the characteristic that its parameters do not change significantly even when the parameter update process is performed a plurality of times.

 When the parameters (weights in this example) used to calculate the output of each unit 71 in some of the first to n-th layers are fixed, the fixed-layer weight memory 190 stores the weight matrix for the fixed weights. The inference weight memory 110 and the updating weight memory 120 store the weight matrices for the weights that are not fixed.

 When the inference process is performed, the weight load selection unit 130 shown in FIG. 3 loads weight matrices from both the inference weight memory 110 and the fixed-layer weight memory 190.

[Description of operation]
 Hereinafter, the operations of the inference process and the learning process of the learning device 100 of this embodiment will be described with reference to FIG. 4. FIG. 4 is a flowchart showing the operations of the inference process and the learning process performed by the learning device 100 of the first embodiment.

 First, an image is input to the control unit 180 of the learning device 100 (step S101). The control unit 180 determines the type of the input image (step S102).

 When determining that the input image is a learning image ("learning image" in step S102), the control unit 180 notifies the weight load selection unit 130 and the computation unit 140 that the process to be executed is the learning process (step S103).

 In step S103, the control unit 180 inputs the input learning image to the computation unit 140. Alternatively, the control unit 180 inputs, to the computation unit 140, predetermined input data obtained from the learning image, such as a feature amount of the learning image extracted by the units 71 of the preceding layers.

 Next, the weight load selection unit 130 loads the weight matrix W from the updating weight memory 120 (step S104). The weight load selection unit 130 inputs the loaded weight matrix W to the computation unit 140.

 Next, the computation unit 140 executes the learning process using the loaded weight matrix W (step S105). The computation unit 140 inputs the weight matrix W updated in the learning process to the weight store unit 150.

 Next, the weight store unit 150 updates the weight matrix W stored in the updating weight memory 120 to the weight matrix W updated in the learning process (step S106).

 Next, the control unit 180 determines whether the learning has been completed (step S107). When determining that the learning has not been completed (No in step S107), the learning device 100 ends the learning process for the input learning image.

 When determining that the learning has been completed (Yes in step S107), the control unit 180 notifies the weight update control unit 160 via the computation unit 140 that the learning has been completed. Next, the weight update control unit 160 activates the weight copy unit 170 (step S108).

 Next, the weight copy unit 170 copies the weight matrix W stored in the updating weight memory 120 to the inference weight memory 110 and replaces the weight matrix W stored in the inference weight memory 110 with the copied weight matrix W (step S109). After the replacement, the learning device 100 ends the learning process for the input learning image.

 When determining that the input image is a discrimination image ("discrimination image" in step S102), the control unit 180 notifies the weight load selection unit 130 and the computation unit 140 that the process to be executed is the inference process (step S110).

 In step S110, the control unit 180 inputs the input discrimination image to the computation unit 140. Alternatively, the control unit 180 inputs, to the computation unit 140, predetermined input data obtained from the discrimination image, such as a feature amount of the discrimination image extracted by the units 71 of the preceding layers.

 Next, the weight load selection unit 130 loads the weight matrix W from the inference weight memory 110 (step S111). The weight load selection unit 130 inputs the loaded weight matrix W to the computation unit 140.

 Note that the weight load selection unit 130 of the learning device 101 shown in FIG. 3 also loads a weight matrix from the fixed-layer weight memory 190 in step S111.

 Next, the computation unit 140 executes the inference process using the loaded weight matrix W (step S112). After executing the inference process, the learning device 100 ends the inference process for the input discrimination image. Note that the learning device 100 may collectively execute the inference process for a plurality of discrimination images.

 The learning device 100 can execute the inference process and the learning process shown in FIG. 4 in parallel. Therefore, the weight load selection unit 130 may load the required weight matrix W from the corresponding weight memory when the process to be executed is switched. The timing at which the weight load selection unit 130 loads the weight matrix W is not particularly limited.
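
 Using the LearningDevice100 sketch given above, the flow of FIG. 4 can be summarized as follows. The helper names (is_learning_image, learning_complete) are hypothetical and stand in for the judgments made by the control unit 180 in steps S102 and S107.

```python
def process_image(device, image, is_learning_image, grad_fn, learning_complete):
    if is_learning_image(image):              # S102: "learning image"
        x = image                             # S103: the image (or features from preceding layers)
        device.learn(x, grad_fn)              # S104-S106: load from the updating weight memory,
                                              #            learn, and store the updated weights
        if learning_complete():               # S107
            device.on_learning_complete()     # S108-S109: copy the updated weights into
                                              #            the inference weight memory
    else:                                     # S102: "discrimination image"
        x = image                             # S110
        return device.infer(x)                # S111-S112: load from the inference weight memory and infer
```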

[Description of effects]
 The learning device 100 of this embodiment includes the inference weight memory 110 and the updating weight memory 120. Therefore, even while the weight matrix W stored in the updating weight memory 120 is being updated, the computation unit 140 can execute the inference process using the weight matrix W stored in the inference weight memory 110.

 Because the learning device 100 includes the inference weight memory 110 and the updating weight memory 120, the weight matrix W for the inference process is kept in a reliable state. That is, the learning device 100 can execute the inference process and the learning process in parallel.

 Furthermore, when the computation unit 140 has the configuration shown in FIG. 2, the power consumed by readouts is reduced, and thus the cost of switching between the inference process and the learning process is reduced.

Embodiment 2.
[Description of configuration]
 Next, a second embodiment of the learning device according to the present invention will be described with reference to the drawings. FIG. 5 is a block diagram showing a configuration example of the second embodiment of the learning device according to the present invention.

 As shown in FIG. 5, the learning device 200 includes a first weight memory 210, a second weight memory 220, a weight load selection unit 230, a computation unit 240, a weight store selection unit 250, a weight update control unit 260, and a control unit 270.

 As in the first embodiment, in order for the learning device 200 to execute the inference process and the learning process in parallel, providing the learning device 200 with a plurality of weight memories can be considered as a solution. In the learning device 200 of this embodiment, the roles of the plurality of weight memories are not fixed.

 That is, the learning device 200 adopts a configuration in which the roles of the plurality of weight memories can be exchanged. The function of each component of the learning device 200, which executes the inference process and the learning process in parallel, is described below.

 The first weight memory 210 and the second weight memory 220 have a function of storing the weight matrix W used for the inference process or the learning process.

 The weight load selection unit 230 has a function of loading the weight matrix W from either the first weight memory 210 or the second weight memory 220.

 The weight load selection unit 230 holds the memory address of the weight memory in which the weight matrix W for the inference process is stored and the memory address of the weight memory in which the weight matrix W for the learning process is stored. Each held memory address is the memory address of either the first weight memory 210 or the second weight memory 220.

 According to the held memory addresses, the weight load selection unit 230 loads the weight matrix W for the inference process from either the first weight memory 210 or the second weight memory 220.

 Similarly, according to the held memory addresses, the weight load selection unit 230 loads the weight matrix W for the learning process from either the first weight memory 210 or the second weight memory 220. The weight load selection unit 230 inputs the loaded weight matrix W to the computation unit 240.

 The computation unit 240 has the same functions as the computation unit 140 of the first embodiment. Note that the configuration of the computation unit 240 of this embodiment may also be the configuration shown in FIG. 2.

 The weight store selection unit 250 has a function of updating the weight matrix W stored in the first weight memory 210 or the second weight memory 220 to the input weight matrix W.

 The weight store selection unit 250 holds the memory address of the weight memory in which the weight matrix W for the inference process is stored and the memory address of the weight memory in which the weight matrix W for the learning process is stored.

 According to the held memory addresses, the weight store selection unit 250 updates the weight matrix W stored in the first weight memory 210 or the second weight memory 220 to the input weight matrix W.

 When the learning is completed, the computation unit 240 notifies the weight update control unit 260 that the learning has been completed. Upon receiving the notification, the weight update control unit 260 instructs the weight load selection unit 230 and the weight store selection unit 250 to exchange the roles of the held memory addresses.

 Specifically, the weight update control unit 260 instructs them to treat the memory address of the weight memory in which the weight matrix W for the inference process is stored as the memory address of the weight memory in which the weight matrix W for the learning process is stored.

 At the same time, the weight update control unit 260 instructs them to treat the memory address of the weight memory in which the weight matrix W for the learning process is stored as the memory address of the weight memory in which the weight matrix W for the inference process is stored.

 The learning is completed, for example, when the learning process, including the weight update process using a predetermined number of pieces of learning data, has been repeatedly executed a predetermined number of times. In other words, when the learning is completed, the weight update control unit 260 instructs the weight load selection unit 230 to load the weights used in the inference process from the weight memory in which the weight matrix W for the learning process has been stored.

 Similarly, when the learning is completed, the weight update control unit 260 instructs the weight load selection unit 230 to load the weights to be updated in the learning process from the weight memory in which the weight matrix W for the inference process has been stored. Therefore, the learning device 200 of this embodiment does not require the weight copy unit 170 of the first embodiment.

 Note that the weight load selection unit 230 and the weight store selection unit 250 may receive the instruction to exchange the roles of the memory addresses from the weight update control unit 260 in advance, before the learning is completed, and may exchange the roles of the memory addresses at a predetermined timing, such as after the learning is completed.

 When the weight load selection unit 230 and the weight store selection unit 250 receive the instruction in advance, the weight update control unit 260 may notify the weight load selection unit 230 and the weight store selection unit 250 in advance, before the learning is completed, of the memory addresses to which the roles are to be exchanged.

 The control unit 270 has the same functions as the control unit 180 of the first embodiment.
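
 The role exchange of the second embodiment amounts to double buffering: instead of copying the weight matrix, only the references (memory addresses) held by the load and store selection units are swapped. The following minimal sketch makes this concrete under the same simplified assumptions as the earlier sketches (the names and the update rule are illustrative only).

```python
import numpy as np

class LearningDevice200:
    def __init__(self, W_init):
        W_init = np.array(W_init, dtype=float)
        self.memories = [W_init.copy(), W_init.copy()]  # first weight memory 210, second weight memory 220
        self.inference_idx = 0    # address held for the inference-process weight matrix
        self.learning_idx = 1     # address held for the learning-process weight matrix

    def infer(self, x):
        W = self.memories[self.inference_idx]           # weight load selection unit 230 (inference side)
        return np.maximum(W.T @ x, 0.0)                 # computation unit 240

    def learn(self, x, grad_fn, lr=0.01):
        W = self.memories[self.learning_idx]            # weight load selection unit 230 (learning side)
        self.memories[self.learning_idx] = W - lr * grad_fn(W, x)   # weight store selection unit 250

    def on_learning_complete(self):
        # weight update control unit 260: exchange the roles of the held addresses; no copy is needed
        self.inference_idx, self.learning_idx = self.learning_idx, self.inference_idx
```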

 FIG. 6 is a block diagram showing another configuration example of the second embodiment of the learning device according to the present invention. The learning device 201 shown in FIG. 6 includes the components of the learning device 200 shown in FIG. 5 and a fixed-layer weight memory 280.

 The learning device 201 of this embodiment also does not need to update all of the parameters (weights in this example) used to calculate the output of each unit 71 in the parameter update process (learning process). The fixed-layer weight memory 280 stores the weight matrix for the fixed weights. The first weight memory 210 and the second weight memory 220 store the weight matrices for the weights that are not fixed.

[Description of operation]
 Hereinafter, the operations of the inference process and the learning process of the learning device 200 of this embodiment will be described with reference to FIG. 7. FIG. 7 is a flowchart showing the operations of the inference process and the learning process performed by the learning device 200 of the second embodiment.

 First, an image is input to the control unit 270 of the learning device 200 (step S201). The control unit 270 determines the type of the input image (step S202).

 When determining that the input image is a learning image ("learning image" in step S202), the control unit 270 notifies the weight load selection unit 230 and the computation unit 240 that the process to be executed is the learning process (step S203).

 In step S203, the control unit 270 inputs the input learning image to the computation unit 240. Alternatively, the control unit 270 inputs, to the computation unit 240, predetermined input data obtained from the learning image, such as a feature amount of the learning image extracted by the units 71 of the preceding layers.

 Next, the weight load selection unit 230 loads the weight matrix W from the weight memory in which the weight matrix W for the learning process is stored (step S204). The weight load selection unit 230 inputs the loaded weight matrix W to the computation unit 240.

 Next, the computation unit 240 executes the learning process using the loaded weight matrix W (step S205). The computation unit 240 inputs the weight matrix W updated in the learning process to the weight store selection unit 250.

 Next, the weight store selection unit 250 updates the weight matrix W for the learning process stored in the weight memory with the weight matrix W updated in the learning process (step S206).

 Next, the control unit 270 determines whether the learning has been completed (step S207). When determining that the learning has not been completed (No in step S207), the learning device 200 ends the learning process for the input learning image.

 When determining that the learning has been completed (Yes in step S207), the control unit 270 notifies the weight update control unit 260 via the computation unit 240 that the learning has been completed.

 Next, the weight update control unit 260 instructs the weight load selection unit 230 and the weight store selection unit 250 to exchange the roles of the held memory addresses (step S208).

 Next, the weight load selection unit 230 and the weight store selection unit 250 exchange the roles of the held memory addresses (step S209). After the exchange, the learning device 200 ends the learning process for the input learning image.

 When determining that the input image is a discrimination image ("discrimination image" in step S202), the control unit 270 notifies the weight load selection unit 230 and the computation unit 240 that the process to be executed is the inference process (step S210).

 In step S210, the control unit 270 inputs the input discrimination image to the computation unit 240. Alternatively, the control unit 270 inputs, to the computation unit 240, predetermined input data obtained from the discrimination image, such as a feature amount of the discrimination image extracted by the units 71 of the preceding layers.

 Next, the weight load selection unit 230 loads the weight matrix W from the weight memory in which the weight matrix W for the inference process is stored (step S211). The weight load selection unit 230 inputs the loaded weight matrix W to the computation unit 240.

 Note that the weight load selection unit 230 of the learning device 201 shown in FIG. 6 also loads a weight matrix from the fixed-layer weight memory 280 in step S211.

 Next, the computation unit 240 executes the inference process using the loaded weight matrix W (step S212). After executing the inference process, the learning device 200 ends the inference process for the input discrimination image. Note that the learning device 200 may collectively execute the inference process for a plurality of discrimination images.

 The learning device 200 can execute the inference process and the learning process shown in FIG. 7 in parallel. Therefore, the weight load selection unit 230 may load the required weight matrix W from the corresponding weight memory when the process to be executed is switched. The timing at which the weight load selection unit 230 loads the weight matrix W is not particularly limited.
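
 Mirroring the sketch given for FIG. 4, the flow of FIG. 7 differs only in what happens when the learning completes: the memory roles are swapped (steps S208 and S209) instead of copied. The helper names are again hypothetical.

```python
def process_image_embodiment2(device, image, is_learning_image, grad_fn, learning_complete):
    if is_learning_image(image):             # S202: "learning image"
        device.learn(image, grad_fn)         # S203-S206: load, learn, and store via the learning-side address
        if learning_complete():              # S207
            device.on_learning_complete()    # S208-S209: exchange the inference/learning memory roles
    else:                                    # S202: "discrimination image"
        return device.infer(image)           # S210-S212: load via the inference-side address and infer
```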

[Description of effects]
 The learning device 200 of this embodiment includes the first weight memory 210 and the second weight memory 220. Therefore, for example, even while the weight matrix W for the learning process stored in the first weight memory 210 is being updated, the computation unit 240 can execute the inference process using the weight matrix W for the inference process stored in the second weight memory 220. That is, the learning device 200 can execute the inference process and the learning process in parallel.

 Furthermore, the learning device 200 of this embodiment can update the weight matrix W for the inference process to the learned weight matrix W without using the weight copy unit 170 of the first embodiment. That is, compared with the first embodiment, the learning device 200 can start using the learned weight matrix W more quickly.

 Hereinafter, a specific example of the hardware configuration of the learning device of each embodiment will be described. FIG. 8 is an explanatory diagram showing an example of the hardware configuration of the learning device according to the present invention.

 The learning device shown in FIG. 8 includes a processor 108, a main storage device 102, an auxiliary storage device 103, an interface 104, an output device 105, and an input device 106. The processor 108 may include various arithmetic and processing devices such as a CPU 109 and a GPU 107.

 When implemented as shown in FIG. 8, the operations of the learning device may be stored in the auxiliary storage device 103 in the form of a program. When the program is stored in the auxiliary storage device 103, the CPU 109 reads the program from the auxiliary storage device 103, loads it into the main storage device 102, and executes the predetermined processing of the learning device according to the loaded program.

 Note that the CPU 109 is an example of an information processing device that operates according to a program. In addition to a CPU (Central Processing Unit), the learning device may include, for example, an MPU (Micro Processing Unit), an MCU (Memory Control Unit), or a GPU (Graphics Processing Unit). FIG. 8 shows an example in which the learning device further includes the GPU 107 in addition to the CPU 109.

 The auxiliary storage device 103 is an example of a non-transitory tangible medium. Other examples of the non-transitory tangible medium include a magnetic disk, a magneto-optical disk, a CD-ROM (Compact Disk Read Only Memory), a DVD-ROM (Digital Versatile Disk Read Only Memory), and a semiconductor memory connected via the interface 104.

 When the program to be stored in the auxiliary storage device 103 is distributed to the learning device through a communication line instead of being stored in the auxiliary storage device 103, the learning device that has received the distribution may load the distributed program into the main storage device 102 and execute the predetermined processing.

 The program may be a program for realizing a part of the predetermined processing in the learning device. Furthermore, the program may be a difference program that realizes the predetermined processing in the learning device in combination with another program already stored in the auxiliary storage device 103.

 The interface 104 transmits and receives information to and from other devices. The output device 105 presents information to the user. The input device 106 accepts input of information from the user.

 Depending on the processing content of the learning device, some of the elements shown in FIG. 8 can be omitted. For example, if the learning device does not present information to the user, the output device 105 can be omitted. Likewise, if the learning device does not accept information input from the user, the input device 106 can be omitted.

 Some or all of the above components may be realized by general-purpose or dedicated circuitry, processors, or a combination thereof. These may be configured by a single chip or by a plurality of chips connected via a bus. Some or all of the above components may also be realized by a combination of the above-described circuitry and a program.

 When some or all of the above components are realized by a plurality of information processing devices, circuits, or the like, the plurality of information processing devices, circuits, or the like may be arranged in a centralized manner or in a distributed manner. For example, the information processing devices, circuits, and the like may be realized in a form in which they are connected to each other via a communication network, such as a client-and-server system or a cloud computing system.

 Next, an outline of the present invention will be described. FIG. 9 is a block diagram showing the outline of the learning device according to the present invention. The learning device 10 according to the present invention includes a first storage unit 11 (for example, the inference weight memory 110) that stores the parameters of each unit used in an inference process of computing, in a predetermined order, the output for discrimination data of each unit of a discrimination model in which a plurality of layers, each composed of one or more units, are coupled in a layered manner, and a second storage unit 12 (for example, the updating weight memory 120) that stores the parameters to be updated in a learning process of updating at least some of the parameters of each unit based on the output of each unit for learning data.

 With such a configuration, the learning device can execute the inference process and the learning process in parallel.
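 A rough software analogue of this structure is sketched below; the data layout and names (LearningDevice, inference_params, updating_params) are assumptions made only for illustration. The point is that the parameters read by the inference process and the parameters rewritten by the learning process live in separate stores.

```python
import copy

class LearningDevice:
    """Illustrative sketch: per-unit parameters are kept twice, once for the
    inference process (first storage unit) and once as the update target of
    the learning process (second storage unit)."""

    def __init__(self, initial_params):
        # initial_params: mapping from unit (or layer) name to a parameter array.
        self.inference_params = copy.deepcopy(initial_params)  # first storage unit
        self.updating_params = copy.deepcopy(initial_params)   # second storage unit

    def infer(self, forward_fn, x):
        # The inference process only reads the first storage unit.
        return forward_fn(self.inference_params, x)

    def learn(self, update_fn, batch):
        # The learning process only rewrites the second storage unit.
        self.updating_params = update_fn(self.updating_params, batch)
```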

 The learning device 10 may further include a load unit (for example, the weight load selection unit 130) that loads parameters from the first storage unit 11 or the second storage unit 12, and an execution unit (for example, the arithmetic unit 140) that executes the inference process or the learning process using the loaded parameters.

 With such a configuration, the learning device can execute the inference process and the learning process in parallel.

 The learning device 10 may also include a plurality of execution units corresponding to the respective units, and each execution unit may include a third storage unit (for example, the inference weight register 141) that stores the parameter of the unit corresponding to that execution unit used in the inference process, and a fourth storage unit (for example, the updating weight register 142) that stores the parameter of the unit corresponding to that execution unit that is the update target in the learning process.

 With such a configuration, the learning device can shorten the time required to switch between executing operations for the inference process and executing operations for the learning process.
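 The per-unit pair of registers could be pictured as in the hedged sketch below (the field and method names are illustrative assumptions); keeping both copies next to the arithmetic element is what lets the execution unit switch between inference and learning without first reloading from the shared weight memories.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class ExecutionUnitSketch:
    """Illustrative sketch of one execution unit holding an inference-side
    register (third storage unit) and an updating-side register (fourth
    storage unit) for the weights of its corresponding unit."""
    inference_weights: List[float] = field(default_factory=list)  # third storage unit
    updating_weights: List[float] = field(default_factory=list)   # fourth storage unit

    def compute(self, inputs, mode):
        # Select the register matching the requested processing, so switching
        # between inference and learning needs no access to the weight memory.
        w = self.inference_weights if mode == "inference" else self.updating_weights
        return sum(wi * xi for wi, xi in zip(w, inputs))
```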

 The learning device 10 may also include an update unit (for example, the weight copy unit 170) that, after the learning process using a predetermined number of pieces of learning data is completed, updates the parameters stored in the first storage unit 11 to the corresponding update-target parameters stored in the second storage unit 12.

 With such a configuration, the learning device can execute the inference process using the learned model.
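 A minimal sketch of that update step, reusing the hypothetical LearningDevice fields sketched above, copies the learned parameters over the inference-side parameters once a predetermined number of learning samples has been consumed:

```python
import copy

def copy_after_training(device, update_fn, batches, samples_per_round):
    """Illustrative sketch of the update (weight copy) step: after a
    predetermined number of learning samples, the learned parameters replace
    the parameters held for the inference process."""
    consumed = 0
    for batch in batches:
        device.updating_params = update_fn(device.updating_params, batch)
        consumed += len(batch)
        if consumed >= samples_per_round:
            # Overwrite the first storage unit with the learned parameters.
            device.inference_params = copy.deepcopy(device.updating_params)
            consumed = 0
```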

 The learning device 10 may also include an instruction unit (for example, the weight update control unit 260) that, after the learning process using a predetermined number of pieces of learning data is completed, instructs the load unit to load the parameters used in the inference process from the second storage unit 12 and the update-target parameters in the learning process from the first storage unit 11.

 With such a configuration, the learning device can omit the process of copying the learned model to the inference weight memory.
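 Rather than copying data, the roles of the two storage units can simply be exchanged. The sketch below (the roles dictionary is an illustrative assumption) shows the idea of redirecting subsequent loads instead of moving parameters:

```python
def swap_roles(roles):
    """Illustrative sketch: after a learning round, the store holding the
    freshly learned parameters becomes the source for inference loads and the
    other store becomes the update target, with no parameters copied."""
    roles["inference"], roles["learning"] = roles["learning"], roles["inference"]
    return roles

# Usage sketch: all loads are directed through the current role mapping.
roles = {"inference": "first storage unit", "learning": "second storage unit"}
roles = swap_roles(roles)  # inference now loads from the second storage unit
```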

 The learning device 10 may also include a fifth storage unit (for example, the fixed-layer weight memory 190) that stores the fixed parameters among the parameters of each unit, and the first storage unit 11 and the second storage unit 12 may each store the parameters that are not fixed among the parameters of each unit.

 With such a configuration, the learning device can support a discrimination model in which the weights of the units in some layers are fixed.
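 A hedged sketch of partitioning the parameters in this way is given below; the frozen_layers predicate is an assumption made for illustration and corresponds to the layers whose weights are kept fixed during learning:

```python
def split_parameters(all_params, frozen_layers):
    """Illustrative sketch: fixed-layer weights go to a separate store (fifth
    storage unit); only the remaining weights are duplicated into the
    inference-side and updating-side stores."""
    fixed_store = {k: v for k, v in all_params.items() if k in frozen_layers}
    trainable = {k: v for k, v in all_params.items() if k not in frozen_layers}
    inference_store = dict(trainable)   # first storage unit
    updating_store = dict(trainable)    # second storage unit
    return fixed_store, inference_store, updating_store
```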

 The discrimination model may also be a neural network.

 With such a configuration, the learning device can perform deep learning.

 Although the present invention has been described above with reference to the exemplary embodiments and examples, the present invention is not limited to the above exemplary embodiments and examples. Various changes that can be understood by those skilled in the art can be made to the configuration and details of the present invention within the scope of the present invention.

10, 100, 101, 200, 201 learning device
11 first storage unit
12 second storage unit
70 large-scale learning circuit
71 unit
80 inference device
81, 91 weight memory
82, 92 weight load unit
83, 93, 140, 240 arithmetic unit
94, 150 weight store unit
102 main storage device
103 auxiliary storage device
104 interface
105 output device
106 input device
107 GPU
108 processor
109 CPU
110 inference weight memory
120 updating weight memory
130, 230 weight load selection unit
141 inference weight register
142 updating weight register
143 arithmetic element weight load selection unit
144 arithmetic element
145 arithmetic element weight store unit
160, 260 weight update control unit
170 weight copy unit
180, 270 control unit
190, 280 fixed-layer weight memory
210 first weight memory
220 second weight memory
250 weight store selection unit

Claims (10)

1. A learning device comprising:
a first storage unit which stores parameters of each unit of a discrimination model in which a plurality of layers each composed of one or more units are coupled in a layered manner, the parameters being used in an inference process of computing, in a predetermined order, an output of each unit for discrimination data; and
a second storage unit which stores parameters to be updated in a learning process of updating at least some of the parameters of each unit based on an output of each unit for learning data.
2. The learning device according to claim 1, further comprising:
a load unit which loads a parameter from the first storage unit or the second storage unit; and
an execution unit which executes the inference process or the learning process using the loaded parameter.
3. The learning device according to claim 2, comprising a plurality of execution units corresponding to the respective units, wherein each execution unit includes a third storage unit which stores a parameter of the unit corresponding to the execution unit used in the inference process, and a fourth storage unit which stores a parameter of the unit corresponding to the execution unit that is an update target in the learning process.
4. The learning device according to any one of claims 1 to 3, further comprising an update unit which, after the learning process using a predetermined number of pieces of learning data is completed, updates the parameters stored in the first storage unit to the corresponding update-target parameters stored in the second storage unit.
5. The learning device according to claim 2 or 3, further comprising an instruction unit which, after the learning process using a predetermined number of pieces of learning data is completed, instructs the load unit to load the parameters used in the inference process from the second storage unit and the update-target parameters in the learning process from the first storage unit.
6. The learning device according to any one of claims 1 to 5, further comprising a fifth storage unit which stores fixed parameters among the parameters of each unit, wherein the first storage unit and the second storage unit each store parameters that are not fixed among the parameters of each unit.
7. A learning method comprising:
storing, in a first storage unit, parameters of each unit of a discrimination model in which a plurality of layers each composed of one or more units are coupled in a layered manner, the parameters being used in an inference process of computing, in a predetermined order, an output of each unit for discrimination data; and
storing, in a second storage unit, parameters to be updated in a learning process of updating at least some of the parameters of each unit based on an output of each unit for learning data.
8. The learning method according to claim 7, further comprising:
loading a parameter from the first storage unit or the second storage unit; and
executing the inference process or the learning process using the loaded parameter.
9. A learning program for causing a computer to execute:
a first storage process of storing, in a first storage unit, parameters of each unit of a discrimination model in which a plurality of layers each composed of one or more units are coupled in a layered manner, the parameters being used in an inference process of computing, in a predetermined order, an output of each unit for discrimination data; and
a second storage process of storing, in a second storage unit, parameters to be updated in a learning process of updating at least some of the parameters of each unit based on an output of each unit for learning data.
10. The learning program according to claim 9, causing the computer to further execute:
a load process of loading a parameter from the first storage unit or the second storage unit; and
the inference process or the learning process in which the loaded parameter is used.
PCT/JP2018/031585 2018-08-27 2018-08-27 Learning device, learning method, and learning program Ceased WO2020044407A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/JP2018/031585 WO2020044407A1 (en) 2018-08-27 2018-08-27 Learning device, learning method, and learning program

Publications (1)

Publication Number Publication Date
WO2020044407A1 (en) 2020-03-05

Family

ID=69642753


Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05274455A (en) * 1992-03-27 1993-10-22 Toshiba Corp Neural network device
JPH1091604A (en) * 1996-09-10 1998-04-10 Toshiba Corp Function learning device
US20180053086A1 (en) * 2016-08-22 2018-02-22 Kneron Inc. Artificial neuron and controlling method thereof


Legal Events

Date Code Title Description

121 Ep: the epo has been informed by wipo that ep was designated in this application
    Ref document number: 18931895; Country of ref document: EP; Kind code of ref document: A1

NENP Non-entry into the national phase
    Ref country code: DE

122 Ep: pct application non-entry in european phase
    Ref document number: 18931895; Country of ref document: EP; Kind code of ref document: A1

NENP Non-entry into the national phase
    Ref country code: JP