JP7725065B2 - Learning model device, computing device production system, computing method, computing device production method, and program

Info

Publication number: JP7725065B2
Application number: JP2021197852A
Authority: JP
Inventors: 智晴長尾; 直規葛谷
Original assignee: Yokohama National University NUC
Current assignee: Yokohama National University NUC
Priority date: 2021-12-06
Filing date: 2021-12-06
Publication date: 2025-08-19
Anticipated expiration: 2041-12-06
Also published as: JP2023083885A

Description

The present invention relates to a learning model device, a computing device production system, a computing method, a computing device production method, and a program.

A binary neural network has been proposed that uses nodes that each receive multiple binary data inputs, compare the sum of those inputs with a threshold, and calculate a binary output value (see, for example, Patent Document 1).

Patent Document 1: JP 2019-61496 A

A single-layer binary neural network is limited in the output values it can produce for given input values; for example, it cannot express exclusive OR. Increasing the number of layers in a binary neural network allows a greater variety of output values, but makes the network structure more complex. Increasing the number of layers can also reduce learning accuracy and learning speed.
It is desirable for a learning model with nodes that use binary data to be able to take on a relatively wide variety of output values without increasing the number of node layers.
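The exclusive OR limitation mentioned above can be checked by brute force. The following is a minimal sketch, assuming a node that outputs 1 when a weighted sum of its binary inputs reaches a threshold; the restriction of weights to -1 or +1, the candidate thresholds, and the names `threshold_node` and `reachable_truth_tables` are illustrative assumptions and are not taken from Patent Document 1.

```python
from itertools import product

INPUTS = [(0, 0), (0, 1), (1, 0), (1, 1)]

def threshold_node(x, weights, threshold):
    """Output 1 when the weighted sum of the binary inputs reaches the threshold."""
    total = sum(w * xi for w, xi in zip(weights, x))
    return 1 if total >= threshold else 0

def reachable_truth_tables():
    """Collect every truth table one such node can realise on two binary inputs."""
    tables = set()
    thresholds = [t / 2 for t in range(-5, 6)]  # -2.5, -2.0, ..., 2.5
    for weights in product((-1, 1), repeat=2):  # assumed binary weights
        for threshold in thresholds:
            tables.add(tuple(threshold_node(x, weights, threshold) for x in INPUTS))
    return tables

XOR = (0, 1, 1, 0)  # outputs for INPUTS in order
AND = (0, 0, 0, 1)
tables = reachable_truth_tables()
print("XOR reachable:", XOR in tables)
print("AND reachable:", AND in tables)
```

Under these assumptions AND appears among the reachable truth tables while XOR does not, which is the kind of restriction the background section describes.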

One example of an object of the present invention is to provide a learning model device, a computing device production system, a computing method, a computing device production method, and a program that enable a learning model having nodes that use binary data to take on a relatively wide variety of output values without increasing the number of node layers.

According to a first aspect of the present invention, a learning model device includes a node that receives a binary vector as input. Among the points on a hypersurface in a real space having one more dimension than the binary vector, the node takes the point whose coordinate values include each element of the binary vector when the binary vector is treated as coordinate values in a subspace of that real space. The node then determines a binarized output value based on the element of that point's coordinate values other than the elements given by the binary vector.

According to a second aspect of the present invention, a computing device production system includes a learning model system, a learning control unit, and a setting unit. The learning model system includes a node that receives a binary vector as input. Among the points on a hypersurface in a real space having one more dimension than the binary vector, the node takes the point whose coordinate values include each element of the binary vector when the binary vector is treated as coordinate values in a subspace of that real space. The node then determines an output value based on the element of that point's coordinate values other than the elements given by the binary vector. The learning control unit controls learning of the learning model system. The setting unit generates a lookup table indicating the relationship between input and output at the node of the learning model system after learning, and sets the generated lookup table in a template of a computing device.

According to a third aspect of the present invention, a computing method includes a computer receiving a binary vector as input and determining a binarized output value. Among the points on a hypersurface in a real space having one more dimension than the binary vector, the computer takes the point whose coordinate values include each element of the binary vector when the binary vector is treated as coordinate values in a subspace of that real space, and determines the binarized output value based on the element of that point's coordinate values other than the elements given by the binary vector.

According to a fourth aspect of the present invention, a computing device production method includes: training a learning model system that includes a node; generating a lookup table indicating the relationship between input and output at the node of the learning model system after training; and setting the generated lookup table in a template of a computing device. The node receives a binary vector as input. Among the points on a hypersurface in a real space having one more dimension than the binary vector, the node takes the point whose coordinate values include each element of the binary vector when the binary vector is treated as coordinate values in a subspace of that real space, and determines an output value based on the element of that point's coordinate values other than the elements given by the binary vector.

According to a fifth aspect of the present invention, a program causes a computer to receive a binary vector as input and to determine a binarized output value. Among the points on a hypersurface in a real space having one more dimension than the binary vector, the computer takes the point whose coordinate values include each element of the binary vector when the binary vector is treated as coordinate values in a subspace of that real space, and determines the binarized output value based on the element of that point's coordinate values other than the elements given by the binary vector.

According to a sixth aspect of the present invention, a program causes a computer to execute: training a learning model system that includes a node; generating a lookup table indicating the relationship between input and output at the node of the learning model system after training; and setting the generated lookup table in a template of a computing device. The node receives a binary vector as input. Among the points on a hypersurface in a real space having one more dimension than the binary vector, the node takes the point whose coordinate values include each element of the binary vector when the binary vector is treated as coordinate values in a subspace of that real space, and determines an output value based on the element of that point's coordinate values other than the elements given by the binary vector.

The learning model device, computing device production system, computing method, computing device production method, and program described above enable a learning model having nodes that use binary data to take on a relatively wide variety of output values without increasing the number of node layers.

A diagram showing an example of the configuration of a computing device production system according to an embodiment.
A diagram showing an example of data input and output in a table-type node according to the embodiment.
A diagram showing an example of the configuration of a hypersurface-type node according to the embodiment.
A diagram showing a first example of a binary operation used to check the operation of the hypersurface processing unit using a B-Spline surface according to the embodiment.
A diagram showing a first example of a B-Spline surface obtained in the operation check.
A diagram showing a second example of a binary operation used to check the operation of the hypersurface processing unit using a B-Spline surface according to the embodiment.
A diagram showing a second example of a B-Spline surface obtained in the operation check.
A diagram showing a third example of a binary operation used to check the operation of the hypersurface processing unit using a B-Spline surface according to the embodiment.
A diagram showing a third example of a B-Spline surface obtained in the operation check.
A diagram showing a fourth example of a binary operation used to check the operation of the hypersurface processing unit 111 using a B-Spline surface according to the embodiment.
A diagram showing a fourth example of a B-Spline surface obtained in the operation check.
A flowchart showing an example of the procedure by which the computing device production system 1 according to the embodiment generates a computing device.
A diagram showing an example of data input and output when the table-type node according to the embodiment outputs a binary vector.
A diagram showing an example of data input and output when the hypersurface-type node according to the embodiment outputs a binary vector.
A diagram showing an example of the configuration of the hypersurface-type node in a learning model device when one learning model device according to the embodiment includes one hypersurface-type node.
A diagram showing a first example of the configuration of the hypersurface-type nodes in a learning model device when one learning model device according to the embodiment includes multiple hypersurface-type nodes.
A diagram showing a second example of the configuration of the hypersurface-type nodes in a learning model device when one learning model device according to the embodiment includes multiple hypersurface-type nodes.
A diagram showing the configuration of a convolutional neural network used in an experiment according to the embodiment.
A diagram showing the recognition rates obtained as experimental results.
A diagram showing an example of the configuration of an FPGA.
A diagram showing an example of the configuration of a computing device when a lookup table is shared by multiple table-type nodes according to the embodiment.
A schematic block diagram showing an example of the configuration of a computer according to at least one embodiment.

Embodiments of the present invention are described below; these embodiments do not limit the invention as claimed. Furthermore, not all combinations of the features described in the embodiments are necessarily essential to the solution provided by the invention.
Fig. 1 is a diagram showing an example of the configuration of a computing device production system according to an embodiment. In the configuration shown in Fig. 1, the computing device production system 1 includes a learning model device 100, a learning control unit 300, and a setting unit 400. The learning model device 100 includes a hypersurface-type node 110.
Fig. 1 also shows a computing device 200. The computing device 200 includes a table-type node 210. The computing device 200 may be configured as part of the computing device production system 1 or as a device external to the computing device production system 1.

The computing device production system 1 trains a learning model having nodes that use binary data and, for each node of the learning model, acquires a lookup table indicating the relationship between input values and output values. The computing device production system 1 produces the computing device 200 by setting the acquired lookup table in the table-type node 210 of a template of the computing device 200. The template of the computing device 200 referred to here has no lookup table set, but is otherwise the same as the computing device 200.
In the computing device production system 1, the learning model device 100 has the function of the learning model, and the hypersurface-type node 110 corresponds to a node that uses binary data.
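Acquiring a lookup table from a trained node amounts to recording the node's output for every possible binary input. The following is a minimal sketch of that enumeration; the function name `build_lookup_table` and the stand-in `trained_node` are hypothetical and only illustrate the idea.

```python
from itertools import product

def build_lookup_table(node, num_inputs):
    """Record the node's output for every possible binary input vector."""
    return {bits: node(bits) for bits in product((0, 1), repeat=num_inputs)}

def trained_node(bits):
    """Stand-in for a trained node; any callable from a bit tuple to 0 or 1 would do."""
    return bits[0] ^ bits[1]

table = build_lookup_table(trained_node, 2)
for bits, output in table.items():
    print(bits, "->", output)
```

A table with n inputs has 2^n rows, so this direct enumeration is practical only when the number of inputs per node is small.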

The learning model referred to here has parameters whose values are adjustable and, for an input data value, outputs an output data value that depends on the input data value and the parameter values. Adjusting the parameter values of the learning model is referred to as learning of the learning model. Learning of the learning model in the learning model device 100 is also referred to as learning of the learning model device 100.
An operation on binary data is also referred to as a binary data operation. An operation here may consist of determining an output value by referring to a lookup table.

The computing device 200 performs binary data operations using table-type nodes 210. The number of table-type nodes 210 in the computing device 200 is not limited to a specific number and can be any number of one or more. In particular, the computing device 200 may include the same number of table-type nodes 210 as the number of hypersurface-type nodes 110 in the learning model device 100, and the table-type nodes 210 may be connected in the same network structure as the hypersurface-type nodes 110 in the learning model device 100. This allows the lookup table that the computing device production system 1 acquires for each hypersurface-type node 110 through learning of the learning model device 100 to be set in the corresponding table-type node 210 as is.

The computing device 200 can be used for various operations that handle binary data. For example, it can be used for bit operations, logical operations, or binary image processing, but its uses are not limited to these.
The table-type node 210 refers to a lookup table and outputs the output value that the lookup table associates with the input binary vector.

Fig. 2 is a diagram showing an example of data input and output in the table-type node 210. Fig. 2 shows a table-type node 210 with two inputs and one output. However, the number of inputs to the table-type node 210 is not limited to a specific number and can be any number of one or more. Furthermore, as described later, the table-type node 210 may have multiple outputs.

In the example of Fig. 2, the table-type node 210 receives two binary data items x_0 and x_1 as input and outputs the binary data value that the lookup table associates with the input data values. The two binary data items x_0 and x_1 are an example of a binary vector. In the example of Fig. 2, the output data value associated with given input data values in the lookup table is the output data value shown in the same row as those input data values.

In the table-type node 210, an arbitrary output data value can be set in the lookup table for each combination of input data values. In this respect, the table-type node 210 has high expressive power. For example, a single node of a binary neural network cannot perform an exclusive OR operation, whereas a single table-type node 210 can.
Furthermore, because the table-type node 210 determines the output data value for the input data values by referring to a lookup table, it can output data in a relatively short time and with relatively low power consumption, even when the input/output relationship corresponds to a complex operation.
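As an illustration of this expressive power, the following minimal sketch models a table-type node as a dictionary lookup configured for exclusive OR. The class name `TableNode` is hypothetical; the patent does not prescribe any particular software representation.

```python
class TableNode:
    """Hypothetical table-type node: maps a binary input tuple to a stored output."""

    def __init__(self, table):
        self.table = dict(table)

    def __call__(self, *bits):
        # A single lookup replaces whatever computation produced the table.
        return self.table[tuple(bits)]

# One node suffices for exclusive OR because every row can be set freely.
xor_node = TableNode({(0, 0): 0, (0, 1): 1, (1, 0): 1, (1, 1): 0})
print([xor_node(a, b) for a in (0, 1) for b in (0, 1)])
```

Changing the function computed by the node only requires replacing the stored rows, not the node itself.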

On the other hand, the lookup table used by the table-type node 210 represents a correspondence between discrete input values and discrete output values, and cannot be differentiated as a function. For this reason, a learning method that uses the derivative of a function, such as backpropagation, cannot be applied to the computing device 200.
Therefore, the computing device production system 1 trains the learning model device 100 and reflects the learning results in the computing device 200.

The learning model device 100 performs binary data operations using hypersurface-type nodes 110. The learning model device 100 may include one hypersurface-type node 110 or multiple hypersurface-type nodes 110. When the learning model device 100 includes multiple hypersurface-type nodes 110, data may be passed between the hypersurface-type nodes 110. In this case, the data input/output relationships in the learning model device 100 can be represented as a directed graph, as in the case of a neural network.
The learning model device 100 is an example of a learning model system.
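The directed-graph view of connected nodes can be sketched as follows. This is a minimal illustration, assuming an acyclic graph in which each node is any callable on a tuple of bits; the names `network` and `evaluate`, the signal names, and the choice of two XOR nodes are hypothetical.

```python
from graphlib import TopologicalSorter

def xor(bits):
    return bits[0] ^ bits[1]

# Hypothetical network: node name -> (node function, names of its input signals).
network = {
    "n0": (xor, ("a", "b")),
    "n1": (xor, ("n0", "c")),
}

def evaluate(network, inputs):
    """Evaluate every node once, in an order that respects the graph's edges."""
    values = dict(inputs)
    dependencies = {name: set(sources) for name, (_, sources) in network.items()}
    for name in TopologicalSorter(dependencies).static_order():
        if name in network:  # external inputs already have values
            node, sources = network[name]
            values[name] = node(tuple(values[s] for s in sources))
    return values

result = evaluate(network, {"a": 1, "b": 0, "c": 1})
print(result["n1"])
```

Here the second node consumes the output of the first, so the pair computes the parity of three input bits.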

Fig. 3 is a diagram showing an example of the configuration of the hypersurface-type node 110. In the configuration shown in Fig. 3, the hypersurface-type node 110 includes a hypersurface processing unit 111 and a threshold calculation unit 112.
Fig. 3 shows a hypersurface-type node 110 with two inputs and one output. However, the number of inputs and the number of outputs of the hypersurface-type node 110 are not limited to specific numbers. For example, the hypersurface-type nodes 110 of the learning model device 100 and the table-type nodes 210 of the computing device 200 may be in one-to-one correspondence, with each corresponding pair of a hypersurface-type node 110 and a table-type node 210 receiving the same number of inputs and producing the same number of outputs.

The hypersurface-type node 110 performs the binary data operation performed by the learning model device 100, or a binary data operation that forms part of it. Specifically, the hypersurface-type node 110 receives an input vector that is a binary vector; that is, it receives one or more binary data values. The hypersurface-type node 110 then performs a binary data operation on the input vector using the hypersurface processing unit 111 and the threshold calculation unit 112, and determines and outputs an output value.

The hypersurface processing unit 111 considers the points on a hypersurface in a real space having one more dimension than the input vector. Among these points, it detects the coordinate values of the point whose coordinate values include each element of the input vector when the input vector is treated as coordinate values in a subspace of that real space. The hypersurface processing unit 111 then outputs the value of the element of the detected coordinate values other than the elements given by the input vector.

In the example of Fig. 3, the two input data values x_0 and x_1 are an example of an input vector. In this case, the input vector has two dimensions. The x_0-x_1-z coordinate space shown in Fig. 3 is an example of a three-dimensional real space, which has one more dimension than the input vector. Here, z is the output value of the hypersurface processing unit 111 and serves as the input data value to the threshold calculation unit 112.

In this way, the real space having one more dimension than the input vector can be a coordinate space that combines input coordinate axes, which are the per-element coordinate axes when each element value of the input vector is treated as a real value, with an output coordinate axis, which is the coordinate axis of the output value of the hypersurface processing unit 111.
The real space having one more dimension than the input vector is also referred to as the input/output real space.

In the example of Fig. 3, the curved surface shown in the x_0-x_1-z coordinate space is an example of a hypersurface in a three-dimensional real space (the input/output real space), which has one more dimension than the input vector.
The x_0-x_1 coordinate plane, which consists of the x_0 coordinate and the x_1 coordinate, is an example of a subspace of the x_0-x_1-z coordinate space. By treating the input data values x_0 and x_1, which are binary data values, as real data values, the input vector can be treated as coordinate values on the x_0-x_1 coordinate plane. For example, when x_0 = 1 and x_1 = 0, the input vector (1, 0) can be treated as the coordinate values (1, 0) on the x_0-x_1 coordinate plane.

Once the coordinate values (1, 0) on the x_0-x_1 coordinate plane are determined in this way, the point whose coordinate values include the elements x_0 = 1 and x_1 = 0 is uniquely identified among the points on the curved surface shown in the x_0-x_1-z coordinate space. On the curved surface shown in Fig. 3, the z coordinate value for x_0 = 1 and x_1 = 0 is z = 0, so the coordinate values (1, 0, 0) are identified.

For this purpose, the hypersurface used in the input/output real space is one on which the point whose coordinate values include each element of the input vector, when the input vector is treated as coordinate values in a subspace of the input/output real space, is uniquely determined.
An example of such a hypersurface is a B-Spline hypersurface whose control points are defined as follows: each value that the input vector can take is treated as coordinate values in the subspace of the real space, and those coordinate values are combined with an output coordinate value set for each value that the input vector can take.

In the example of Fig. 3, each of the input data values x_0 and x_1 can take the value 0 or 1. Therefore, the values that the input vector can take are (x_0, x_1) = (0, 0), (0, 1), (1, 0), and (1, 1). In the example of Fig. 3, the points with coordinate values (x_0, x_1, z) = (0, 0, 0), (0, 1, 0), (1, 0, 0), and (1, 1, 1), which combine the values that the input vector can take with the output values, are set as control points and are indicated by white circles (○). In the example of Fig. 3, a B-Spline surface based on these control points is used as the curved surface in the x_0-x_1-z coordinate space.
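The relationship between control points and surface height can be sketched in code. The following is a minimal sketch that assumes a degree-1 (bilinear) B-Spline patch with one control point above each corner of the unit square; the degree, the function name `surface_height`, and the dictionary layout are assumptions made for illustration, and the embodiment may use a different B-Spline formulation.

```python
def surface_height(x0, x1, heights):
    """Evaluate a degree-1 (bilinear) B-Spline patch at (x0, x1).

    heights[(i, j)] is the z coordinate of the control point above corner (i, j).
    """
    basis0 = (1.0 - x0, x0)  # degree-1 basis functions along x_0
    basis1 = (1.0 - x1, x1)  # degree-1 basis functions along x_1
    return sum(basis0[i] * basis1[j] * heights[(i, j)]
               for i in (0, 1) for j in (0, 1))

# Control-point heights taken from the Fig. 3 example.
fig3_heights = {(0, 0): 0.0, (0, 1): 0.0, (1, 0): 0.0, (1, 1): 1.0}

for x0 in (0, 1):
    for x1 in (0, 1):
        print((x0, x1), surface_height(x0, x1, fig3_heights))
```

Under this assumption the patch passes through its control points, so each binary input returns the height stored for that corner, while intermediate real-valued inputs give interpolated heights.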

Among the elements of the coordinate values of each control point, the output coordinate value can be treated as a learning parameter value. Changing the output coordinate values as learning parameter values changes the output values of the hypersurface processing unit 111 for the values that the input vector can take.
Furthermore, a B-Spline hypersurface can be expressed as a differentiable function that takes the input vector as an argument and outputs the output coordinate value. This makes it possible to apply a learning method that uses the derivative of a function, such as backpropagation, to learning the learning parameter values included in the coordinate values of the control points.
However, the hypersurface in the input/output real space is not limited to a B-Spline hypersurface. It can be any of various hypersurfaces for which the relationship between the input vector and the output data value in the hypersurface processing unit 111 changes according to the learning parameter values and which are expressed by differentiable functions.
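To illustrate why differentiability matters for learning, the following minimal sketch computes the partial derivatives of the surface height with respect to each control-point height and compares them with a finite-difference estimate. It assumes a degree-1 (bilinear) patch with one control point per corner; the names `surface_height`, `height_gradient`, and `numeric_gradient` are hypothetical.

```python
def surface_height(x0, x1, heights):
    """Bilinear patch: weighted sum of the four control-point heights."""
    b0, b1 = (1.0 - x0, x0), (1.0 - x1, x1)
    return sum(b0[i] * b1[j] * heights[(i, j)] for i in (0, 1) for j in (0, 1))

def height_gradient(x0, x1):
    """Analytic partial derivatives of z with respect to each control-point height."""
    b0, b1 = (1.0 - x0, x0), (1.0 - x1, x1)
    return {(i, j): b0[i] * b1[j] for i in (0, 1) for j in (0, 1)}

def numeric_gradient(x0, x1, heights, eps=1e-6):
    """Finite-difference estimate of the same derivatives, for comparison."""
    base = surface_height(x0, x1, heights)
    grad = {}
    for key in heights:
        bumped = dict(heights)
        bumped[key] += eps
        grad[key] = (surface_height(x0, x1, bumped) - base) / eps
    return grad

heights = {(0, 0): 0.0, (0, 1): 0.0, (1, 0): 0.0, (1, 1): 1.0}
print(height_gradient(1, 0))
print(numeric_gradient(0.5, 0.5, heights))
```

With this assumed patch, a binary input affects only the height of its own corner, so each learning parameter receives a gradient only from the input combination it represents.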

The hypersurface processing unit 111 outputs the value of the element, other than the elements given by the input vector, of the coordinate values identified as those of a point on the hypersurface. In the above example, the hypersurface processing unit 111 outputs the z coordinate value "0" from the identified coordinate values (x_0, x_1, z) = (1, 0, 0).
Note that the output value of the hypersurface processing unit 111 is a real value and can take values other than those corresponding to the two values of binary data.

During forward propagation in the learning model device 100, the threshold calculation unit 112 binarizes the output value of the hypersurface processing unit 111 using a step function. For example, the threshold calculation unit 112 may compare the output value of the hypersurface processing unit 111 with a threshold and output one of the two binary values according to the comparison result. The threshold in this case may be a fixed value or may be variable as a learning parameter value.
An example of forward propagation in the learning model device 100 is the execution of a binary data operation by the learning model device 100.

一方、閾値演算部１１２は、学習モデル装置１００における逆伝播のときは、ステップ関数を微分可能な関数で近似する。ステップ関数を近似する微分可能な関数の例としてシグモイド関数、および、双曲線正接関数（Hyperbolic Tangent Function）を挙げることができるが、閾値演算部１１２が用いる関数はこれらに限定されない。 On the other hand, during backpropagation in the learning model device 100, the threshold calculation unit 112 approximates the step function with a differentiable function. Examples of differentiable functions that approximate the step function include a sigmoid function and a hyperbolic tangent function, but the functions used by the threshold calculation unit 112 are not limited to these.

学習モデル装置１００における逆伝播のときの例として、誤差逆伝播法における学習パラメータの補正量算出時が挙げられる。閾値演算部１１２が、ステップ関数を微分可能な関数で近似することで、誤差逆伝播法など関数の微分を用いる学習手法を学習モデル装置１００の学習に適用することができる。 An example of backpropagation in the learning model device 100 is when calculating the amount of correction for learning parameters in the backpropagation method. By having the threshold calculation unit 112 approximate the step function with a differentiable function, learning methods that use function differentiation, such as the backpropagation method, can be applied to learning in the learning model device 100.
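The forward and backward behavior of the threshold calculation unit 112 described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the threshold value 0.5, the sigmoid steepness `GAIN`, the choice of `>=` for the comparison, and all function names are assumptions introduced here.

```python
import math

THRESHOLD = 0.5  # assumed fixed threshold (the text also allows a learnable one)
GAIN = 4.0       # assumed steepness of the sigmoid surrogate


def forward(z):
    """Forward propagation: binarize the real-valued input with a step function."""
    return 1 if z >= THRESHOLD else 0


def surrogate(z):
    """Sigmoid that approximates the step function around THRESHOLD."""
    return 1.0 / (1.0 + math.exp(-GAIN * (z - THRESHOLD)))


def backward(z, upstream_grad):
    """Backpropagation: use the sigmoid's derivative in place of the step's."""
    s = surrogate(z)
    return upstream_grad * GAIN * s * (1.0 - s)
```

The step function has a zero derivative almost everywhere, so substituting the sigmoid's derivative only in the backward pass is what lets a gradient reach the learning parameters of the hypersurface.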

学習時の少なくとも一部の期間の間、閾値演算部１１２が、データの二値化を行わず、超曲面型ノード１１０が実数値のデータを出力するようにしてもよい。例えば、学習開始時から所定の条件が成立するまでの学習の初期の段階では、閾値演算部１１２がデータの二値化を行わず、所定の条件が成立した初期段階終了後は、閾値演算部１１２がデータの二値化を行うようにしてもよい。
これにより、学習が比較的速く進むことが期待され、また、学習結果が局所解に陥る可能性が比較的低いことが期待される。 For at least part of the learning period, the threshold calculation unit 112 may skip binarization so that the hypersurface-type node 110 outputs real-valued data. For example, in the initial stage of learning, from the start of learning until a predetermined condition is satisfied, the threshold calculation unit 112 may leave the data unbinarized, and once the condition is satisfied and the initial stage ends, the threshold calculation unit 112 may binarize the data.
This is expected to enable learning to proceed relatively quickly, and also to reduce the likelihood that the learning results will fall into a local optimum.

図３の例の場合、入出力実数空間における超曲面は、式（１）のように表される。 In the example shown in Figure 3, the hypersurface in the input/output real space is expressed as in equation (1).

ここでは、ｘ_０およびｘ_１は、それぞれ、二値データ値による超曲面型ノード１１０への入力値を実数値として扱う場合の、その実数値をとる実数変数とする。ｚは、超曲面処理部１１１の出力値をとる実数変数とする。超曲面処理部１１１の出力値は、ｘ_０ｘ_１ｚ座標空間における曲面に含まれる点のうち、超曲面型ノード１１０への入力値をｘ_０座標値およびｘ_１座標値として扱った場合の、それらの座標値を含むｘ_０ｘ_１ｚ座標値を有する点の、そのｘ_０ｘ_１ｚ座標値のうちのｚ座標値である。 Here, x_0 and x_1 are real variables that take the binary input values to the hypersurface-type node 110, treated as real values. z is a real variable that takes the output value of the hypersurface processing unit 111. That output value is obtained as follows: the input values to the hypersurface-type node 110 are treated as the x_0 and x_1 coordinate values, the point on the surface in the x_0 x_1 z coordinate space that has those coordinate values is identified, and the z coordinate value of that point is the output.

ｗ_０、ｗ_１、ｗ_２、ｗ_３は、それぞれ学習パラメータとして用いられる実数変数である。上記のように、これら学習パラメータの値が、B-Spline曲面の制御点におけるｚ座標値として用いられていてもよい。
ｆは、微分可能な関数である。
超曲面処理部１１１と閾値演算部１１２とを組み合わせた超曲面型ノード１１０全体による演算は、式（２）のように表される。 w_0, w_1, w_2, and w_3 are real variables used as learning parameters. As described above, the values of these learning parameters may be used as the z coordinate values of the control points of the B-Spline surface.
f is a differentiable function.
The calculation performed by the hypersurface-type node 110 as a whole, which combines the hypersurface processing unit 111 and the threshold calculation unit 112, is expressed as in equation (2).

ここでは、ｙは、二値データ値による超曲面型ノード１１０の出力値を実数値として扱う場合の、その実数値をとる実数変数とする。
学習モデル装置１００における逆伝播のときは、ｆ_Ｒは、微分可能な関数である。
超曲面型ノード１１０の出力値の正解値をｙ^＊で表し、超曲面型ノード１１０の出力値ｙとその正解値ｙ^＊との誤差Ｅを式（３）のように定義する。 Here, y is a real variable that takes the binary output value of the hypersurface-type node 110, treated as a real value.
During backpropagation in the learning model device 100, f_R is a differentiable function.
The correct value of the output of the hypersurface-type node 110 is denoted by y^*, and the error E between the output value y of the hypersurface-type node 110 and the correct value y^* is defined as in equation (3).

学習係数をαとして、学習パラメータｗ_０の補正量Δｗ_０は、Δｗ_０＝－α（∂Ｅ／∂ｗ_０）と算出することができる。式（３）および式（２）を用いてＥの記載を書き換えると、学習パラメータｗ_０の補正量Δｗ_０は、式（４）を用いて算出することができる。 With α as the learning coefficient, the correction amount Δw_0 of the learning parameter w_0 can be calculated as Δw_0 = -α(∂E/∂w_0). Rewriting E using equations (3) and (2), the correction amount Δw_0 of the learning parameter w_0 can be calculated using equation (4).

学習パラメータｗ_１、ｗ_２、ｗ_３についても同様である。
このように、超曲面型ノード１１０によれば、関数の微分を用いる学習手法を用いることができる。 The same applies to the learning parameters w_1, w_2, and w_3.
In this way, the hypersurface-type node 110 allows a learning method based on function differentiation to be used.
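Equations (1) to (4) are referenced in the text but their bodies are not reproduced here. The following is a reconstruction from the surrounding definitions only, not the published typography. In particular, the squared-error form of E in (3) is an assumption (the text only says that E is the error between y and y^*), and f_R' denotes the derivative of the differentiable approximation of the step function used during backpropagation.

```latex
\begin{align}
z &= f(x_0, x_1, w_0, w_1, w_2, w_3) \tag{1} \\
y &= f_R(z) = f_R\bigl(f(x_0, x_1, w_0, w_1, w_2, w_3)\bigr) \tag{2} \\
E &= \tfrac{1}{2}\,(y - y^{*})^{2} \tag{3, assumed form} \\
\Delta w_0 &= -\alpha\,\frac{\partial E}{\partial w_0}
            = -\alpha\,(y - y^{*})\, f_R'(z)\, \frac{\partial f}{\partial w_0} \tag{4}
\end{align}
```

Under this assumed E, line (4) follows from (2) and (3) by the chain rule, and the corrections for w_1, w_2, and w_3 have the same form with the corresponding partial derivative of f.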

学習制御部３００は、学習モデル装置１００の学習を制御する。例えば、学習制御部３００は、データベースなど他の装置から学習データ（Training Data）を取得し、得られた学習データを用いて学習モデル装置１００に学習を行わせる。学習によって、学習モデル装置１００の学習パラメータ値が調整される。 The learning control unit 300 controls the learning of the learning model device 100. For example, the learning control unit 300 acquires training data from another device, such as a database, and causes the learning model device 100 to perform learning using the acquired training data. Through learning, the learning parameter values of the learning model device 100 are adjusted.

設定部４００は、学習後の学習モデル装置１００の超曲面型ノード１１０における入力値と出力値との関係を示すルックアップテーブルを生成し、生成したルックアップテーブルを演算装置２００のテンプレートのテーブル型ノード２１０に設定する。
例えば、設定部４００は、図３の例における学習モデル装置１００について、入力ベクトルがとり得る全ての値（ｘ_０，ｘ_１）＝（０，０）、（０，１）、（１，０）、（１，１）のそれぞれについて出力値ｙを観測し、図２の例におけるルックアップテーブルを生成する。そして、設定部４００は、生成したルックアップテーブルを、図２の例のように演算装置２００のテンプレートのテーブル型ノード２１０に設定する。 The setting unit 400 generates a lookup table showing the relationship between input values and output values in the hypersurface-type node 110 of the learning model device 100 after learning, and sets the generated lookup table in the table-type node 210 of the template of the computing device 200.
For example, for the learning model device 100 in the example of Fig. 3, the setting unit 400 observes the output value y for each of the possible values of the input vector, (x_0, x_1) = (0, 0), (0, 1), (1, 0), and (1, 1), and generates the lookup table in the example of Fig. 2. The setting unit 400 then sets the generated lookup table in the table-type node 210 of the template of the computing device 200, as in the example of Fig. 2.
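The table generation performed by the setting unit 400 amounts to evaluating the trained node once for every possible input vector. A minimal sketch follows; `build_lookup_table` and `trained_node` are hypothetical names, and `trained_node` is only a stand-in that behaves like EXOR, not an actual trained hypersurface-type node.

```python
from itertools import product


def build_lookup_table(node, n_inputs):
    """Observe the node's output for every possible binary input vector."""
    return {bits: node(*bits) for bits in product((0, 1), repeat=n_inputs)}


def trained_node(x0, x1):
    """Hypothetical stand-in for a trained node (behaves like EXOR)."""
    return int(x0 != x1)


# Four rows, one per input vector (0,0), (0,1), (1,0), (1,1).
table = build_lookup_table(trained_node, n_inputs=2)
```

Once the table exists, the deployed node no longer needs the hypersurface or its parameters; it only needs to index the table with its input bits.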

学習制御部３００と、設定部４００と、学習モデル装置１００とが、別々の装置として構成されていてもよい。この場合、これら各装置がパソコン（Personal Computer）などのコンピュータを用いて構成されていてもよい。あるいは、学習制御部３００、設定部４００、または、学習モデル装置１００の何れか１つ以上が、ＡＳＩＣ（Application Specific Integrated Circuit）またはＦＰＧＡ（Field Programmable Gate Array）を用いて構成されるなど、その装置専用のハードウェアを用いて構成されていてもよい。 The learning control unit 300, setting unit 400, and learning model device 100 may be configured as separate devices. In this case, each of these devices may be configured using a computer such as a personal computer. Alternatively, one or more of the learning control unit 300, setting unit 400, or learning model device 100 may be configured using hardware dedicated to that device, such as an ASIC (Application Specific Integrated Circuit) or FPGA (Field Programmable Gate Array).

あるいは、学習制御部３００、設定部４００、および、学習モデル装置１００のうち何れか２つ以上が、一体的に構成されていてもよい。例えば、学習制御部３００と、設定部４００と、学習モデル装置１００とが同一の装置に組み込まれていてもよい。この場合も、装置がコンピュータを用いて構成されていてもよいし、その装置専用のハードウェアを用いて構成されていてもよい。 Alternatively, any two or more of the learning control unit 300, setting unit 400, and learning model device 100 may be configured integrally. For example, the learning control unit 300, setting unit 400, and learning model device 100 may be incorporated into the same device. In this case, too, the device may be configured using a computer, or may be configured using hardware dedicated to that device.

演算装置２００についても、コンピュータを用いて構成されていてもよいし、演算装置２００専用のハードウェアを用いて構成されていてもよい。後述するように、演算装置２００は、特にＦＰＧＡへの実装に適した構成となっていると考えられ、演算装置２００が、ＦＰＧＡを用いて構成されていてもよい。 The computing device 200 may also be configured using a computer, or using hardware dedicated to the computing device 200. As described below, the configuration of the computing device 200 is considered particularly suitable for implementation in an FPGA, and the computing device 200 may be configured using an FPGA.

また、装置の運用時に追加学習を行う場合など、学習モデル装置１００を運用に用いるようにしてもよい。この場合、演算装置生産システム１が、設定部４００と演算装置２００とを備えていなくてもよい。 Furthermore, the learning model device 100 may be used for operation, such as when additional learning is performed during device operation. In this case, the computing device production system 1 does not need to be equipped with the setting unit 400 and the computing device 200.

B-Spline曲面を用いて超曲面処理部１１１による二値演算の学習について動作確認をおこなったところ、良好な結果が得られた。
図４は、B-Spline曲面を用いた超曲面処理部１１１の動作確認に用いた二値演算の第１の例を示す図である。
図４に示すような論理演算の「ＡＮＤ」（論理積）の入出力データを用いて動作確認を行い、図５に示すような曲面を得られた。 When the operation of the hypersurface processing unit 111 was checked using a B-Spline surface for learning binary operations, good results were obtained.
FIG. 4 is a diagram showing a first example of binary operations used to check the operation of the hypersurface processing unit 111 using a B-Spline surface.
Operation was confirmed using input and output data of the logical operation "AND" (logical product) as shown in FIG. 4, and a curved surface as shown in FIG. 5 was obtained.

図５は、動作確認で得られたB-Spline曲面の第１の例を示す図である。
図５に示す曲面では、ｘ_０＝０、ｘ_１＝０のときのｚの値は、およそ０になっている。また、ｘ_０＝０、ｘ_１＝１のときのｚの値は、およそ０になっている。また、ｘ_０＝１、ｘ_１＝０のときのｚの値は、およそ０になっている。また、ｘ_０＝１、ｘ_１＝１のときのｚの値は、およそ１になっている。
このように、図４に示す論理回路の「ＡＮＤ」の入出力と同様の入出力を示す曲面を得られた。 FIG. 5 is a diagram showing a first example of a B-Spline surface obtained in the operation check.
On the curved surface shown in Fig. 5, the value of z is approximately 0 when x_0 = 0 and x_1 = 0, approximately 0 when x_0 = 0 and x_1 = 1, approximately 0 when x_0 = 1 and x_1 = 0, and approximately 1 when x_0 = 1 and x_1 = 1.
In this way, a curved surface showing inputs and outputs similar to the inputs and outputs of "AND" in the logic circuit shown in FIG. 4 was obtained.

図６は、B-Spline曲面を用いた超曲面処理部１１１の動作確認に用いた二値演算の第２の例を示す図である。
図６に示すような論理演算の「ＯＲ」（論理和）の入出力データを用いて動作確認を行い、図７に示すような曲面を得られた。 FIG. 6 is a diagram showing a second example of binary operations used to check the operation of the hypersurface processing unit 111 using a B-Spline surface.
Operation was confirmed using input and output data of the logical operation "OR" (logical sum) as shown in FIG. 6, and a curved surface as shown in FIG. 7 was obtained.

図７は、動作確認で得られたB-Spline曲面の第２の例を示す図である。
図７に示す曲面では、ｘ_０＝０、ｘ_１＝０のときのｚの値は、およそ０になっている。また、ｘ_０＝０、ｘ_１＝１のときのｚの値は、およそ１になっている。また、ｘ_０＝１、ｘ_１＝０のときのｚの値は、およそ１になっている。また、ｘ_０＝１、ｘ_１＝１のときのｚの値は、およそ１になっている。
このように、図６に示す論理回路の「ＯＲ」の入出力と同様の入出力を示す曲面を得られた。 FIG. 7 is a diagram showing a second example of a B-Spline surface obtained in the operation check.
On the curved surface shown in Fig. 7, the value of z is approximately 0 when x_0 = 0 and x_1 = 0, approximately 1 when x_0 = 0 and x_1 = 1, approximately 1 when x_0 = 1 and x_1 = 0, and approximately 1 when x_0 = 1 and x_1 = 1.
In this way, a curved surface showing inputs and outputs similar to the "OR" inputs and outputs of the logic circuit shown in FIG. 6 was obtained.

図８は、B-Spline曲面を用いた超曲面処理部１１１の動作確認に用いた二値演算の第３の例を示す図である。
図８に示すような論理演算の「ＥＸＯＲ」（排他的論理和）の入出力データを用いて動作確認を行い、図９に示すような曲面を得られた。 FIG. 8 is a diagram showing a third example of binary operations used to check the operation of the hypersurface processing unit 111 using a B-Spline surface.
Operation was confirmed using input and output data of the logical operation "EXOR" (exclusive OR) as shown in FIG. 8, and a curved surface as shown in FIG. 9 was obtained.

図９は、動作確認で得られたB-Spline曲面の第３の例を示す図である。
図９に示す曲面では、ｘ_０＝０、ｘ_１＝０のときのｚの値は、およそ０になっている。また、ｘ_０＝０、ｘ_１＝１のときのｚの値は、およそ１になっている。また、ｘ_０＝１、ｘ_１＝０のときのｚの値は、およそ１になっている。また、ｘ_０＝１、ｘ_１＝１のときのｚの値は、およそ０になっている。
このように、図８に示す論理回路の「ＥＸＯＲ」の入出力と同様の入出力を示す曲面を得られた。 FIG. 9 is a diagram showing a third example of a B-Spline surface obtained in the operation check.
On the curved surface shown in Fig. 9, the value of z is approximately 0 when x_0 = 0 and x_1 = 0, approximately 1 when x_0 = 0 and x_1 = 1, approximately 1 when x_0 = 1 and x_1 = 0, and approximately 0 when x_0 = 1 and x_1 = 1.
In this way, a curved surface showing inputs and outputs similar to the inputs and outputs of "EXOR" in the logic circuit shown in FIG. 8 was obtained.

図１０は、B-Spline曲面を用いた超曲面処理部１１１の動作確認に用いた二値演算の第４の例を示す図である。
図１０に示すような論理演算の「ＮＯＴｘ_０」（入力信号ｘ_０の否定）の入出力データを用いて動作確認を行い、図１１に示すような曲面を得られた。 FIG. 10 is a diagram showing a fourth example of binary operations used to check the operation of the hypersurface processing unit 111 using a B-Spline surface.
Operation was confirmed using input and output data of the logical operation "NOT x_0" (negation of input signal x_0) as shown in FIG. 10, and a curved surface as shown in FIG. 11 was obtained.

図１１は、動作確認で得られたB-Spline曲面の第４の例を示す図である。
図１１に示す曲面では、ｘ_０＝０、ｘ_１＝０のときのｚの値は、およそ１になっている。また、ｘ_０＝０、ｘ_１＝１のときのｚの値は、およそ１になっている。また、ｘ_０＝１、ｘ_１＝０のときのｚの値は、およそ０になっている。また、ｘ_０＝１、ｘ_１＝１のときのｚの値は、およそ０になっている。
このように、図１０に示す論理回路の「ＮＯＴｘ_０」の入出力と同様の入出力を示す曲面を得られた。 FIG. 11 is a diagram showing a fourth example of a B-Spline surface obtained in the operation check.
On the curved surface shown in Fig. 11, the value of z is approximately 1 when x_0 = 0 and x_1 = 0, approximately 1 when x_0 = 0 and x_1 = 1, approximately 0 when x_0 = 1 and x_1 = 0, and approximately 0 when x_0 = 1 and x_1 = 1.
In this way, a curved surface showing inputs and outputs similar to the inputs and outputs of "NOT x_0" in the logic circuit shown in FIG. 10 was obtained.
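The kind of operation check described for AND, OR, EXOR, and NOT x_0 can be imitated with a small gradient-descent sketch. It rests on several assumptions that are not in the source: the surface is a degree-1 tensor-product B-Spline patch (equivalent to bilinear interpolation) with one control value per corner of the unit square, the error is the squared error, only the real-valued surface is trained (no threshold unit), and the learning rate, epoch count, and all names are hypothetical.

```python
def basis(x0, x1):
    """Degree-1 B-Spline (bilinear) basis values for the four corner control points."""
    return [(1 - x0) * (1 - x1), (1 - x0) * x1, x0 * (1 - x1), x0 * x1]


def surface(x0, x1, w):
    """z = f(x0, x1, w0..w3): weighted sum of the basis functions."""
    return sum(wk * bk for wk, bk in zip(w, basis(x0, x1)))


def train(truth_table, lr=0.5, epochs=50):
    """Gradient descent on the squared error; dz/dw_k is the k-th basis value."""
    w = [0.5] * 4
    for _ in range(epochs):
        for (x0, x1), target in truth_table.items():
            err = surface(x0, x1, w) - target
            for k, bk in enumerate(basis(x0, x1)):
                w[k] -= lr * err * bk
    return w


EXOR = {(0, 0): 0, (0, 1): 1, (1, 0): 1, (1, 1): 0}
w = train(EXOR)
outputs = {xy: int(surface(*xy, w) >= 0.5) for xy in EXOR}
```

Because each binary input vector sits exactly on one control point in this setup, every target can be fitted independently of the others, which is why a single node of this kind can represent EXOR even though a single linear threshold unit cannot.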

図１２は、演算装置生産システム１が演算装置２００を生成する処理の手順の例を示すフローチャートである。
図１２の処理で、学習モデル装置１００は、学習制御部３００の制御に従って学習を行い、学習パラメータ値を調整する（ステップＳ１１）。学習モデル装置１００は、学習制御部３００の制御に従って、超曲面型ノード１１０を含む学習モデル装置１００の学習を行う。特に、複数の超曲面型ノード１１０がネットワークを構成している場合、ニューラルネットワークの学習の場合ように、ネットワーク全体の学習を行う。
上述したように、学習パラメータが、B-Spline超曲面の制御点の座標の要素となっていてもよい。 FIG. 12 is a flowchart showing an example of a processing procedure by which the computing device production system 1 generates the computing device 200.
In the process of FIG. 12, the learning model device 100 performs learning under the control of the learning control unit 300 and adjusts the learning parameter values (step S11). This learning covers the learning model device 100 including the hypersurface-type nodes 110. In particular, when multiple hypersurface-type nodes 110 form a network, the entire network is trained, as in neural network learning.
As described above, the learning parameters may be elements of the coordinates of the control points of the B-Spline hypersurface.

次に、設定部４００は、学習完了後の超曲面型ノード１１０における入力値と出力値との関係を示すルックアップテーブルを生成し、生成したルックアップテーブルを、演算装置２００のテンプレートのテーブル型ノード２１０に設定する（ステップＳ１２）。
学習モデル装置１００が複数の超曲面型ノード１１０を備える場合、超曲面型ノード１１０とテーブル型ノード２１０とが一対一に対応付けられるように、演算装置２００を構成しておく。具体的には、演算装置２００が備えるテーブル型ノード２１０の個数を、学習モデル装置１００が備える超曲面型ノード１１０の個数と同数にし、テーブル型ノード２１０が、超曲面型ノード１１０が構成するネットワークと同じ構造のネットワークを構成するようにしておく。設定部４００は、超曲面型ノード１１０毎に、その超曲面型ノード１１０における入力値と出力値との関係を示すルックアップテーブルを生成し、その超曲面型ノード１１０と一対一に対応付けられるテーブル型ノード２１０に、生成したルックアップテーブルを設定する。
ステップＳ１２の後、演算装置生産システム１は、図１２の処理を終了する。 Next, the setting unit 400 generates a lookup table showing the relationship between input values and output values in the hypersurface-type node 110 after learning is complete, and sets the generated lookup table in the table-type node 210 of the template of the computing device 200 (step S12).
When the learning model device 100 includes a plurality of hypersurface-type nodes 110, the computing device 200 is configured so that the hypersurface-type nodes 110 and the table-type nodes 210 are in one-to-one correspondence. Specifically, the computing device 200 is given the same number of table-type nodes 210 as the learning model device 100 has hypersurface-type nodes 110, and the table-type nodes 210 are arranged to form a network with the same structure as the network formed by the hypersurface-type nodes 110. For each hypersurface-type node 110, the setting unit 400 generates a lookup table indicating the relationship between input values and output values in that node and sets it in the table-type node 210 that corresponds to that node.
After step S12, the computing device production system 1 ends the process of FIG. 12.

超曲面型ノード１１０およびテーブル型ノード２１０が、それぞれ二値ベクトルを出力するようにしてもよい。すなわち、超曲面型ノード１１０およびテーブル型ノード２１０が、それぞれ複数の二値データを出力するようにしてもよい。 The hypersurface node 110 and the table node 210 may each output a binary vector. In other words, the hypersurface node 110 and the table node 210 may each output multiple binary data.

図１３は、テーブル型ノード２１０が二値ベクトルを出力する場合の、データの入出力の例を示す図である。
図１３の例で、テーブル型ノード２１０ｂは、Ｎ次元（Ｎは、正の整数）の二値ベクトル（ｘ_０，ｘ_１，・・・，ｘ_Ｎ－１）の入力を受け、Ｍ次元（Ｍは、正の整数）の二値ベクトル（ｙ_０，ｙ_１，・・・，ｙ_Ｍ－１）を出力する。テーブル型ノード２１０ｂは、テーブル型ノード２１０の例に該当する。 FIG. 13 is a diagram showing an example of data input/output when the table type node 210 outputs a binary vector.
In the example of FIG. 13, the table-type node 210b receives an N-dimensional binary vector (x_0, x_1, ..., x_{N-1}) as input, where N is a positive integer, and outputs an M-dimensional binary vector (y_0, y_1, ..., y_{M-1}), where M is a positive integer. The table-type node 210b is an example of the table-type node 210.

この場合、テーブル型ノード２１０ｂは、入力ベクトルである二値ベクトル（ｘ_０，ｘ_１，・・・，ｘ_Ｎ－１）がとり得る値毎に、出力ベクトルである二値ベクトル（ｙ_０，ｙ_１，・・・，ｙ_Ｍ－１）の値を示すルックアップテーブルを備える。入力ベクトルである二値ベクトル（ｘ_０，ｘ_１，・・・，ｘ_Ｎ－１）がとり得る値は、２^Ｎ通りであり、ルックアップテーブルは、２^Ｎ行分のデータを示す。 In this case, the table-type node 210b has a lookup table that gives the value of the output binary vector (y_0, y_1, ..., y_{M-1}) for each possible value of the input binary vector (x_0, x_1, ..., x_{N-1}). The input vector can take 2^N different values, so the lookup table holds 2^N rows of data.
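An N-input, M-output table-type node of this kind can be sketched as an array of 2^N rows indexed by the input bits. The class name `TableNode`, the convention that x_0 is the most significant bit of the row index, and the half-adder contents are assumptions made for illustration.

```python
class TableNode:
    """N-input, M-output lookup-table node holding 2**N rows of output bits."""

    def __init__(self, n_inputs, rows):
        if len(rows) != 2 ** n_inputs:
            raise ValueError("a table for N inputs needs exactly 2**N rows")
        self.n_inputs = n_inputs
        self.rows = [tuple(row) for row in rows]

    def __call__(self, bits):
        # Pack the input bits into a row index, x_0 first (assumed bit order).
        index = 0
        for bit in bits:
            index = (index << 1) | bit
        return self.rows[index]


# Example contents: a half adder, N = 2 inputs, M = 2 outputs (carry, sum).
half_adder = TableNode(2, [(0, 0), (0, 1), (0, 1), (1, 0)])
```

Evaluation is a single indexed read regardless of how complex the learned function is, which matches the text's remark that this structure suits FPGA implementation.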

図１４は、超曲面型ノード１１０が二値ベクトルを出力する場合の、データの入出力の例を示す図である。
図１４の例で、超曲面型ノード１１０ｂは、テーブル型ノード２１０ｂの場合と同じ二値ベクトル（ｘ_０，ｘ_１，・・・，ｘ_Ｎ－１）の入力を受け、テーブル型ノード２１０ｂの場合と同じ二値ベクトル（ｙ_０，ｙ_１，・・・，ｙ_Ｍ－１）を出力する。
この場合の超曲面は、例えば、式（５）のように表される。 FIG. 14 is a diagram showing an example of data input/output when the hypersurface node 110 outputs a binary vector.
In the example of Figure 14, the hypersurface-type node 110b receives the same binary vector (x_0, x_1, ..., x_{N-1}) as the table-type node 210b and outputs the same binary vector (y_0, y_1, ..., y_{M-1}) as the table-type node 210b.
In this case, the hypersurface is expressed as, for example, equation (5).

Ｌは、超曲面としてB-Spline超曲面を用いる場合の学習パラメータの個数を表す正の整数である。入力ベクトルがとり得る値毎にB-Spline超曲面の制御点が設けられ、１つの制御点につき１つの学習パラメータが設けられる。このため、Ｌの値は式（６）のように表される。 L is a positive integer representing the number of learning parameters when a B-Spline hypersurface is used as the hypersurface. A control point of the B-Spline hypersurface is provided for each possible value of the input vector, and one learning parameter is provided for each control point. Therefore, the value of L is expressed as in equation (6).

この場合も、微分可能な関数ｆを得ることができ、関数の微分を用いる学習手法を適用し得る。例えば、式（５）の関数ｆが、Ｎ個の入力変数ｘ_０、ｘ_１、・・・、ｘ_Ｎ－１、および、１つの出力変数ｙ_ｉ（ここでは、ｉは０≦ｉ≦Ｍ－１の整数）の各変数の座標軸を持つＮ＋１次元座標空間におけるＭ個の超曲面で表されていてもよい。 In this case, too, a differentiable function f can be obtained, and a learning method based on function differentiation can be applied. For example, the function f in equation (5) may be represented by M hypersurfaces, each in an (N+1)-dimensional coordinate space whose coordinate axes correspond to the N input variables x_0, x_1, ..., x_{N-1} and one output variable y_i (where i is an integer such that 0 ≤ i ≤ M-1).
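Equations (5) and (6) are referenced in the text but their bodies are not reproduced here. A reading consistent with the surrounding description is given below as a reconstruction, not as the published form. The indexing of the learning parameters as w_0 to w_{L-1} is an assumption. The value in (6) follows the text literally: one control point per possible input value (2^N of them) and one learning parameter per control point. If each of the M outputs instead had its own scalar per control point, the count would be M times larger.

```latex
\begin{align}
(y_0, y_1, \ldots, y_{M-1}) &= f(x_0, x_1, \ldots, x_{N-1},\; w_0, w_1, \ldots, w_{L-1}) \tag{5} \\
L &= 2^{N} \tag{6}
\end{align}
```

For the two-input example of Fig. 3 this gives L = 2^2 = 4, which agrees with the four learning parameters w_0 to w_3 used there.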

あるいは、テーブル型ノード２１０が出力する二値データの個数分だけ、学習モデル装置１００に１出力の超曲面型ノード１１０が設けられていてもよい。図１３に例示されるＮ入力Ｍ出力のテーブル型ノード２１０ｂに対応付けて、学習モデル装置１００に、Ｎ入力１出力の超曲面型ノード１１０がＭ個設けられていてもよい。学習完了後に設定部４００が、これらＭ個の超曲面型ノード１１０における入力値と出力値との関係を、図１３に例示されるようなＮ入力Ｍ出力のルックアップテーブルに纏め、得られたルックアップテーブルをテーブル型ノード２１０ｂに設定するようにしてもよい。 Alternatively, the learning model device 100 may be provided with one-output hypersurface nodes 110, the number of which is equal to the number of binary data output by the table-type node 210. The learning model device 100 may be provided with M N-input, one-output hypersurface nodes 110, corresponding to the N-input, M-output table-type node 210b illustrated in FIG. 13. After learning is complete, the setting unit 400 may compile the relationships between input and output values in these M hypersurface-type nodes 110 into an N-input, M-output lookup table, such as the one illustrated in FIG. 13, and set the resulting lookup table in the table-type node 210b.

学習モデル装置１００における超曲面型ノード１１０の構成について、幾つかのバリエーションが考えられる。
図１５は、１つの学習モデル装置１００が１つの超曲面型ノード１１０を備える場合の、学習モデル装置１００における超曲面型ノード１１０の構成の例を示す図である。図１５に示す構成で、学習モデル装置１００ｃは、１つの超曲面型ノード１１０ｃを備える。
学習モデル装置１００ｃは、学習モデル装置１００の例に該当する。超曲面型ノード１１０ｃは、超曲面型ノード１１０の例に該当する。 There are several possible variations in the configuration of the hypersurface node 110 in the learning model device 100.
Fig. 15 is a diagram showing an example of the configuration of the hypersurface type node 110 in a learning model device 100 when one learning model device 100 includes one hypersurface type node 110. In the configuration shown in Fig. 15, a learning model device 100c includes one hypersurface type node 110c.
The learning model device 100c is an example of the learning model device 100. The hypersurface type node 110c is an example of the hypersurface type node 110.

学習モデル装置１００ｃは、４つの二値データｘ_０、ｘ_１、ｘ_２およびｘ_３の入力を受けて、３つの二値データｙ_０、ｙ_１およびｙ_２を出力する。これに応じて、超曲面型ノード１１０ｃは、４つの二値データｘ_０、ｘ_１、ｘ_２およびｘ_３の入力を受けて、３つの二値データｙ_０、ｙ_１およびｙ_２を出力する。 The learning model device 100c receives four binary data inputs x_0, x_1, x_2, and x_3 and outputs three binary data y_0, y_1, and y_2. Accordingly, the hypersurface-type node 110c receives the four binary data x_0, x_1, x_2, and x_3 as input and outputs the three binary data y_0, y_1, and y_2.

このように、学習モデル装置１００が１つの超曲面型ノード１１０を備えるようにしてもよい。そして、超曲面型ノード１１０が、学習モデル装置１００への入力データの入力を受けて、学習モデル装置１００の出力データを出力するようにしてもよい。
この学習モデル装置１００に対応する演算装置２００も、この学習モデル装置１００と同様の構成とすることができる。具体的には、演算装置２００が、１つのテーブル型ノード２１０を備えるようにしてもよい。そして、テーブル型ノード２１０が、演算装置２００への入力データの入力を受けて、演算装置２００の出力データを出力するようにしてもよい。 In this way, the learning model device 100 may be configured to include one hypersurface node 110. The hypersurface node 110 may then receive input data to the learning model device 100 and output output data from the learning model device 100.
The computing device 200 corresponding to this learning model device 100 can have the same configuration as the learning model device 100. Specifically, the computing device 200 may include one table-type node 210, and that table-type node 210 may receive the input data to the computing device 200 and output the output data of the computing device 200.

あるいは、上述したように、学習モデル装置１００が出力データの個数の超曲面型ノード１１０を備えるようにしてもよい。そして、学習完了後に、設定部４００が、複数の超曲面型ノード１１０における入力値と出力値との関係を、１つのルックアップテーブルに纏め、得られたルックアップテーブルを１つのテーブル型ノード２１０に設定するようにしてもよい。 Alternatively, as described above, the learning model device 100 may be provided with hypersurface nodes 110 equal to the number of output data. Then, after learning is complete, the setting unit 400 may compile the relationships between input values and output values in multiple hypersurface nodes 110 into a single lookup table, and set the obtained lookup table in a single table-type node 210.

例えば、図１５の例で、学習モデル装置１００が、１つの超曲面型ノード１１０ｃに変えて、３つの超曲面型ノード１１０を備えるようにしてもよい。この場合、３つの超曲面型ノード１１０のそれぞれに、４つの二値データｘ_０、ｘ_１、ｘ_２およびｘ_３を入力する。出力データについては、超曲面型ノード１１０毎に異なる二値データを１つずつ出力するようにする。具体的には、１つ目の超曲面型ノード１１０が二値データｙ_０を出力し、２つ目の超曲面型ノード１１０が二値データｙ_１を出力し、３つ目の超曲面型ノード１１０が二値データｙ_２を出力するようにする。
学習完了後に、設定部４００が、３つの超曲面型ノード１１０における入力値と出力値との関係を、１つのルックアップテーブルに纏め、得られたルックアップテーブルを１つのテーブル型ノード２１０に設定するようにしてもよい。 For example, in the example of Figure 15, the learning model device 100 may include three hypersurface-type nodes 110 instead of the single hypersurface-type node 110c. In this case, the four binary data x_0, x_1, x_2, and x_3 are input to each of the three hypersurface-type nodes 110, and each hypersurface-type node 110 outputs one binary datum that differs from node to node. Specifically, the first hypersurface-type node 110 outputs binary data y_0, the second hypersurface-type node 110 outputs binary data y_1, and the third hypersurface-type node 110 outputs binary data y_2.
After the learning is completed, the setting unit 400 may compile the relationships between the input values and output values of the three hypersurface type nodes 110 into one lookup table, and set the obtained lookup table to one table type node 210.
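Compiling several single-output nodes into one multi-output lookup table can be sketched as follows. The three node functions are hypothetical stand-ins for trained 4-input, 1-output hypersurface-type nodes, and `merge_nodes` is a name introduced here.

```python
from itertools import product


def merge_nodes(nodes, n_inputs):
    """Build one table whose row for each input vector collects every node's output."""
    return {
        bits: tuple(node(bits) for node in nodes)
        for bits in product((0, 1), repeat=n_inputs)
    }


# Hypothetical stand-ins for three trained nodes producing y_0, y_1, y_2.
def node_y0(bits):
    return int(all(bits))


def node_y1(bits):
    return int(any(bits))


def node_y2(bits):
    return sum(bits) % 2


# 2**4 = 16 rows, each holding the 3-bit output vector (y_0, y_1, y_2).
table = merge_nodes([node_y0, node_y1, node_y2], n_inputs=4)
```

All nodes must share the same input vector for this merge to be valid, which is the arrangement the text describes.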

図１６は、１つの学習モデル装置１００が複数の超曲面型ノード１１０を備える場合の、学習モデル装置１００における超曲面型ノード１１０の構成の第１の例を示す図である。図１６に示す構成で、学習モデル装置１００ｄは、超曲面型ノード１１０ｄ－１、超曲面型ノード１１０ｄ－２、超曲面型ノード１１０ｄ－３、および、超曲面型ノード１１０ｄ－４を備える。
学習モデル装置１００ｄは、学習モデル装置１００の例に該当する。超曲面型ノード１１０ｄ－１、超曲面型ノード１１０ｄ－２、超曲面型ノード１１０ｄ－３、および、超曲面型ノード１１０ｄ－４は、それぞれ、超曲面型ノード１１０の例に該当する。 Fig. 16 is a diagram showing a first example of the configuration of the hypersurface type nodes 110 in a learning model device 100 when one learning model device 100 is equipped with a plurality of hypersurface type nodes 110. In the configuration shown in Fig. 16, a learning model device 100d is equipped with a hypersurface type node 110d-1, a hypersurface type node 110d-2, a hypersurface type node 110d-3, and a hypersurface type node 110d-4.
The learning model device 100d is an example of the learning model device 100. The hypersurface type node 110d-1, the hypersurface type node 110d-2, the hypersurface type node 110d-3, and the hypersurface type node 110d-4 are examples of the hypersurface type node 110, respectively.

図１６は、超曲面型ノード１１０が構成するネットワークの構造が固定の場合の例を示している。例えば、学習モデル装置１００の設計者など人が、ネットワークの構造を予め決定し、学習モデル装置１００に実装しておく。
このように、１つの学習モデル装置１００が複数の超曲面型ノード１１０を備え、これら複数の超曲面型ノード１１０がネットワークを構成していてもよい。この場合のネットワークの構造は、ニューラルネットワークの場合と同様、いろいろな構造とすることができる。 FIG. 16 shows an example in which the structure of the network formed by the hypersurface-type nodes 110 is fixed. For example, a person such as a designer of the learning model device 100 determines the network structure in advance and implements it in the learning model device 100.
In this way, one learning model device 100 may include a plurality of hypersurface-type nodes 110 that form a network. As with a neural network, the network in this case can take various structures.

この学習モデル装置１００に対応する演算装置２００も、この学習モデル装置１００と同様の構成とすることができる。具体的には、演算装置２００が、学習モデル装置１００が備える超曲面型ノード１１０の個数と同じ個数のテーブル型ノード２１０を備えるようにする。そして、テーブル型ノード２１０が、超曲面型ノード１１０が構成するネットワークと同じ構造のネットワークを構成するようにする。 The computing device 200 corresponding to this learning model device 100 can be configured in the same way as the learning model device 100. Specifically, the computing device 200 is given the same number of table-type nodes 210 as the learning model device 100 has hypersurface-type nodes 110, and the table-type nodes 210 are arranged to form a network with the same structure as the network formed by the hypersurface-type nodes 110.

図１７は、１つの学習モデル装置１００が複数の超曲面型ノード１１０を備える場合の、学習モデル装置１００における超曲面型ノード１１０の構成の第２の例を示す図である。図１７に示す構成で、学習モデル装置１００ｅは、超曲面型ノード１１０ｅ－１から超曲面型ノード１１０ｅ－１２を備える。
学習モデル装置１００ｅは、学習モデル装置１００の例に該当する。超曲面型ノード１１０ｅ－１から超曲面型ノード１１０ｅ－１２の各々は、超曲面型ノード１１０の例に該当する。 FIG. 17 is a diagram showing a second example of the configuration of the hypersurface-type nodes 110 in the learning model device 100 when one learning model device 100 includes multiple hypersurface-type nodes 110. In the configuration shown in FIG. 17, the learning model device 100e includes hypersurface-type nodes 110e-1 to 110e-12.
The learning model device 100e is an example of the learning model device 100. Each of the hypersurface-type nodes 110e-1 to 110e-12 is an example of the hypersurface-type node 110.

図１７は、超曲面型ノード１１０が構成するネットワークの構造が学習時に可変である場合の例を示している。この場合、ネットワークの構造を機械学習で決定するようにしてもよい。ネットワークの構造の学習手法について、例えば、遺伝的プログラミングの手法を用いるようにしてもよい。 Figure 17 shows an example where the structure of the network formed by the hypersurface type nodes 110 is variable during learning. In this case, the network structure may be determined by machine learning. A genetic programming technique, for example, may be used as a method for learning the network structure.

さらに例えば、学習制御部３００が、遺伝的プログラミングの一種であるCartesian genetic programming (CGP)の手法を用いてネットワーク構造を探索する場合について考える。この場合、学習制御部３００は、超曲面型ノード１１０が構成するネットワークの構造を、あるネットワーク構造に仮設定する。仮設定されるネットワーク構造は、ネットワーク構造の候補といえる。 Furthermore, consider the case where the learning control unit 300 searches for a network structure using Cartesian genetic programming (CGP), a type of genetic programming. In this case, the learning control unit 300 provisionally sets the structure of the network made up of the hypersurface type nodes 110 to a certain network structure. The provisionally set network structure can be considered a candidate network structure.

そして、学習制御部３００は、仮設定したネットワーク構造の評価値を計算する。具体的には、学習制御部３００は、仮設定による超曲面型ノード１１０のネットワークの学習を行い、学習結果の評価スコア（例えば、認識率）を算出して、仮設定したネットワーク構造の評価値とする。 Then, the learning control unit 300 calculates an evaluation value for the provisionally set network structure. Specifically, the learning control unit 300 trains the network of hypersurface-type nodes 110 under the provisional setting, calculates an evaluation score (e.g., recognition rate) for the learning result, and uses that score as the evaluation value of the provisionally set network structure.

学習制御部３００は、仮設定するネットワーク構造を変化させ、そのネットワーク構造の評価値を算出することを、ネットワーク構造の学習の終了条件として予め定められている条件が成立するまで繰り返す。
ネットワーク構造を変化させる際、学習制御部３００は、ネットワーク構造を変化させる度合いを、評価値に基づいて決定することができる。例えば、評価値が所定の評価閾値以上によい評価を示す場合、学習制御部３００が、ネットワーク構造における１つのエッジ（Edge）のみ変化させるなど、ネットワーク構造の変化の度合いを比較的小さくするようにしてもよい。一方、例えば、評価値が所定の評価閾値未満の低い評価を示す場合、学習制御部３００が、ネットワーク構造における１０個のエッジを変化させるなど、ネットワーク構造の変化の度合いを比較的大きくするようにしてもよい。
ただし、超曲面型ノード１１０が構成するネットワークの構造の学習手法は、特定の方法に限定されない。 The learning control unit 300 repeatedly changes the provisionally set network structure and calculates the evaluation value of the network structure until a predetermined condition is met as a condition for terminating the learning of the network structure.
When changing the network structure, the learning control unit 300 can determine the degree of change based on the evaluation value. For example, if the evaluation value indicates a good evaluation, equal to or greater than a predetermined evaluation threshold, the learning control unit 300 may make the change relatively small, such as changing only one edge in the network structure. On the other hand, if the evaluation value indicates a low evaluation, below the predetermined evaluation threshold, the learning control unit 300 may make the change relatively large, such as changing ten edges in the network structure.
However, the method for learning the structure of the network formed by the hypersurface type nodes 110 is not limited to a specific method.
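The search loop with an evaluation-dependent degree of change can be sketched as below. This is a simplified illustration in the spirit of CGP, not the patent's procedure: the genome encoding (each node lists the indices of earlier signals it reads), the rule of keeping the best candidate so far, the threshold 0.9, and `toy_evaluate` are all assumptions. In the text, the evaluation value would come from training the candidate network and measuring a score such as the recognition rate.

```python
import random


def random_genome(n_inputs, n_nodes, arity, rng):
    """Each node reads `arity` earlier signals (network inputs or earlier nodes)."""
    return [[rng.randrange(n_inputs + i) for _ in range(arity)]
            for i in range(n_nodes)]


def mutate(genome, n_inputs, n_edges, rng):
    """Rewire up to n_edges edges while keeping the network feed-forward."""
    child = [list(node) for node in genome]
    for _ in range(n_edges):
        i = rng.randrange(len(child))
        j = rng.randrange(len(child[i]))
        child[i][j] = rng.randrange(n_inputs + i)
    return child


def search(evaluate, n_inputs=4, n_nodes=12, arity=2,
           good=0.9, steps=100, seed=0):
    rng = random.Random(seed)
    best = random_genome(n_inputs, n_nodes, arity, rng)
    best_score = evaluate(best)
    for _ in range(steps):
        # Small change when the evaluation is good, large change when it is poor.
        n_edges = 1 if best_score >= good else 10
        candidate = mutate(best, n_inputs, n_edges, rng)
        score = evaluate(candidate)
        if score >= best_score:
            best, best_score = candidate, score
    return best, best_score


def toy_evaluate(genome):
    """Hypothetical stand-in score: fraction of edges wired to signal 0."""
    edges = [src for node in genome for src in node]
    return sum(1 for src in edges if src == 0) / len(edges)


best, best_score = search(toy_evaluate)
```

The fixed step count stands in for the predetermined termination condition mentioned in the text.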

畳み込みニューラルネットワークにB-Spline超曲面を用いた学習の手法を適用して文字認識の実験を行ったところ、良好な結果が得られた。
図１８は、実験に用いた畳み込みニューラルネットワークの構成を示す図である。図１８に示すように、実験では、第１畳み込み層と、プーリング層と、第２畳み込み層とを備え、ソフトマックス関数を用いてクラスを選択する畳み込みニューラルネットワークを用いた。 We conducted character recognition experiments by applying a learning method using B-Spline hypersurfaces to a convolutional neural network, and obtained good results.
Fig. 18 shows the configuration of the convolutional neural network used in the experiment. As shown in Fig. 18, the experiment used a convolutional neural network that includes a first convolutional layer, a pooling layer, and a second convolutional layer and selects classes using a softmax function.

学習データとして、ＭＮＩＳＴ（Modified National Institute of Standards and Technology）で示される手書きの数字の画像を、８ピクセル×８ピクセルに縮小し二値化した画像のデータセットを用いた。
第１畳み込み層では、８ピクセル×８ピクセルの入力画像データに対し、３ピクセル×３ピクセルの部分画像毎に畳み込み演算を行い、パディングは無し（Zero Padding）として、６ピクセル×６ピクセルの画像データを生成する。
第１畳み込み層は、１つの画像データの入力を受けて１０個の画像データを出力する。これら１０個の画像データは、「０」から「９」の１０個のクラスそれぞれについての特徴量として扱われる。
このように、第１畳み込み層は、３ピクセル×３ピクセルの画像パッチを用いた畳み込みによる９次元のデータの入力を受けて、１０次元のデータ（１０チャンネルのデータ）を出力する。 As training data, a dataset of images of handwritten numbers specified by the Modified National Institute of Standards and Technology (MNIST) was used, which were reduced to 8 pixels x 8 pixels and binarized.
In the first convolutional layer, a convolution operation is performed on each 3 pixel x 3 pixel partial image of 8 pixel x 8 pixel input image data, and 6 pixel x 6 pixel image data is generated with no padding (zero padding).
The first convolutional layer receives one image data input and outputs ten image data pieces, which are treated as features for each of the ten classes from "0" to "9."
In this way, the first convolutional layer receives 9-dimensional data input by convolution using a 3 pixel x 3 pixel image patch, and outputs 10-dimensional data (10-channel data).
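The sizes stated for the first convolutional layer can be checked with a short sketch. The helper names below are hypothetical, and a stride of 1 is an assumption that is consistent with the stated 8 x 8 input and 6 x 6 output.

```python
def conv_output_size(input_size, kernel_size, stride=1):
    """Output width of a convolution applied without padding."""
    return (input_size - kernel_size) // stride + 1


def extract_patches(image, kernel_size=3):
    """Return every kernel_size x kernel_size patch of a square image as a flat list."""
    out = conv_output_size(len(image), kernel_size)
    patches = []
    for row in range(out):
        for col in range(out):
            patch = [
                image[row + dr][col + dc]
                for dr in range(kernel_size)
                for dc in range(kernel_size)
            ]
            patches.append(patch)
    return patches
```

For an 8 x 8 binarized image this yields 36 patches (6 x 6 positions) of 9 binary values each, and each patch is the 9-dimensional input that is mapped to the 10 output channels.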

なお、実験では、従来側の畳み込みニューラルネットワークを用いる場合と、B-Spline超曲面を用いて、かつ、ノードの出力データを二値化せずに実数データとする場合と、B-Spline超曲面を用いて、かつ、ノードの出力データを二値化する場合とを比較した。
従来側の畳み込みニューラルネットワークでは、活性化関数としてＲｅＬＵ（Rectified Linear Unit、正規化線形関数）を用いて、ノードの出力データは実数データとした。 In the experiment, we compared three cases: using a conventional convolutional neural network; using a B-Spline hypersurface with the node output data kept as real-valued data without binarization; and using a B-Spline hypersurface with the node output data binarized.
In the conventional convolutional neural network, ReLU (Rectified Linear Unit) was used as the activation function, and the output data of the nodes was real number data.

一方、B-Spline超曲面を用いる場合は、B-Spline超曲面（B-Spline関数）が、活性化関数の意味合いを含むといえる。一般的なニューラルネットワークではノード毎に、線形処理と活性化関数による処理とが行われるのに対し、超曲面型ノード１１０によれば、これら２つの処理の組み合わせに相当する処理を、B-Spline超曲面を用いた処理で行うことができる。 On the other hand, when a B-Spline hypersurface is used, the B-Spline hypersurface (B-Spline function) can be said to have the semantics of an activation function. In a typical neural network, linear processing and processing using an activation function are performed for each node, but with the hypersurface node 110, processing equivalent to a combination of these two processes can be performed using a B-Spline hypersurface.

出力データを二値化する場合については、超曲面型ノード１１０における閾値演算部１１２の場合と同様、B-Spline超曲面を用いて得られる値を二値化した。
なお、B-Spline超曲面を用いて、かつ、ノードの出力データを二値化する場合の畳み込みニューラルネットワークで、二値化されたデータを実数データの形式で出力している。このデータを１ビットデータで出力するようにしても、同等の認識率を得られると考えられる。 When binarizing the output data, the values obtained using a B-Spline hypersurface are binarized in the same manner as in the case of the threshold calculation unit 112 in the hypersurface type node 110.
In addition, in the convolutional neural network where the B-Spline hypersurface is used and the output data of the nodes is binarized, the binarized data is output in the form of real number data. It is believed that the same recognition rate can be obtained even if this data is output as 1-bit data.

プーリング層では、最大プーリング（Max Pooling）にて、６ピクセル×６ピクセルの画像データを３ピクセル×３ピクセルの画像データに縮小する。
プーリング層は、１０個の画像データの入力を受けて１０個の画像データを出力する。 In the pooling layer, image data of 6 pixels x 6 pixels is reduced to image data of 3 pixels x 3 pixels by max pooling.
The pooling layer receives 10 pieces of image data as input and outputs 10 pieces of image data.

第２畳み込み層では、３ピクセル×３ピクセルの画像データに対して、３ピクセル×３ピクセル単位でDepthwise畳み込みを行い、パディングは無しとして、１ピクセル×１ピクセルの画像データを出力する。
第２畳み込み層での、従来側の畳み込みニューラルネットワークに用いる活性化関数、および、B-Spline超曲面を用いるノードにおける次元数は、第１畳み込み層の場合と同様とした。 In the second convolutional layer, depthwise convolution is performed on 3 pixel x 3 pixel image data in 3 pixel x 3 pixel units, and 1 pixel x 1 pixel image data is output without padding.
In the second convolutional layer, the activation function used in the conventional convolutional neural network and the number of dimensions in the node using the B-Spline hypersurface were the same as in the first convolutional layer.

第２畳み込み層は、１０個の画像データの入力を受けて１０個の画像データを出力する。したがって、第２畳み込み層は、「０」から「９」の１０個のクラスそれぞれについて、スカラのスコアを出力する。
ソフトマックス関数が、「０」から「９」の１０個のクラスのうちスコアが最大のクラスを選択することで、クラス推定が行われる。 The second convolutional layer receives 10 pieces of image data as input and outputs 10 pieces of image data. Therefore, the second convolutional layer outputs a scalar score for each of the 10 classes from "0" to "9."
The softmax function selects the class with the largest score from the 10 classes from "0" to "9," thereby performing class estimation.
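The pooling layer, the second convolutional layer, and the class selection can be sketched as a shape walk-through. A 2 x 2 pooling window with stride 2 is an assumption inferred from the 6 x 6 to 3 x 3 reduction, and each `node` is a hypothetical callable that maps the nine pooled values of one channel to a scalar score. Because softmax is monotonic, picking the largest raw score selects the same class as picking the largest softmax probability.

```python
def max_pool_2x2(channel):
    """Reduce a square channel by taking the maximum over non-overlapping 2 x 2 windows."""
    size = len(channel) // 2
    return [
        [
            max(
                channel[2 * r][2 * c],
                channel[2 * r][2 * c + 1],
                channel[2 * r + 1][2 * c],
                channel[2 * r + 1][2 * c + 1],
            )
            for c in range(size)
        ]
        for r in range(size)
    ]


def depthwise_score(channel, node):
    """Apply one per-channel node to the whole 3 x 3 channel, giving a 1 x 1 output."""
    flat = [value for row in channel for value in row]
    return node(flat)


def predict(channels, nodes):
    """Return (predicted class index, per-class scores) for ten 6 x 6 channels."""
    scores = [
        depthwise_score(max_pool_2x2(channel), node)
        for channel, node in zip(channels, nodes)
    ]
    best = max(range(len(scores)), key=scores.__getitem__)
    return best, scores
```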

図１９は、実験結果として得られた認識率を示す図である。
図１９に示す実験結果で、B-Spline超曲面を用いて、かつ、ノードの出力データを二値化せずに実数データとする場合、および、B-Spline超曲面を用いて、かつ、ノードの出力データを二値化する場合の何れも、従来側の畳み込みニューラルネットワークを用いる場合よりも高い認識率が得られた。 FIG. 19 is a diagram showing the recognition rate obtained as a result of the experiment.
In the experimental results shown in FIG. 19, both B-Spline hypersurface configurations, the one whose node output data was kept as real-valued data without binarization and the one whose node output data was binarized, achieved a higher recognition rate than the conventional convolutional neural network.

設定部４００が、演算装置２００をＦＰＧＡに実装するようにしてもよい。
図２０は、ＦＰＧＡの構成の例を示す図である。図２０に示す構成で、ＦＰＧＡは、コンフィギャラブルロジックブロック（Configurable Logic Block；ＣＬＢ）と、スイッチングブロック（Switching Block）とを備えるコンフィギャラブルロジックブロックは、ベーシックロジックエレメント（Basic Logic Element）を備える。ベーシックロジックエレメントは、ＬＵＴ(Lookup Table)と、フリップフロップ（Flip Flop；ＦＦ）と、マルチプレクサ（Multiplexer；ＭＵＸ）とを備える。 The setting unit 400 may implement the arithmetic device 200 in an FPGA.
Fig. 20 is a diagram showing an example of the configuration of an FPGA. In the configuration shown in Fig. 20, the FPGA includes a configurable logic block (CLB) and a switching block. The configurable logic block includes a basic logic element (BLE). The BLE includes a lookup table (LUT), a flip-flop (FF), and a multiplexer (MUX).

ベーシックロジックエレメントでは、ルックアップテーブルが、入力データ（I/P's）の入力を受けて入力データの値に応じた値のデータをフリップフロップおよびマルチプレクサに出力する。
フリップフロップは、クロック信号（CLK）が入力されるタイミングで、ルックアップテーブルからのデータ値を記憶する。リセット信号（RST）が入力された場合、フリップフロップは、記憶しているデータをリセットする。フリップフロップは、記憶しているデータをマルチプレクサに出力する。 In a basic logic element, a lookup table receives input data (I/P's) and outputs data of a value corresponding to the value of the input data to a flip-flop and a multiplexer.
The flip-flop stores the data value from the look-up table when a clock signal (CLK) is input. When a reset signal (RST) is input, the flip-flop resets the stored data. The flip-flop outputs the stored data to the multiplexer.

マルチプレクサは、ルックアップテーブルからのデータ、および、フリップフロップからのデータの入力を受けて、１つの出力データ（O/P）を出力する。例えば、マルチプレクサは、制御信号の入力を受けて、ルックアップテーブルからのデータ、または、フリップフロップからのデータの何れか一方を出力する。
スイッチングブロックは、コンフィギャラブルロジックブロック間のデータ線の接続の有無（On/Off）を切り替える。 The multiplexer receives data from the lookup table and data from the flip-flop as inputs and outputs one output data (O/P). For example, the multiplexer receives a control signal and outputs either data from the lookup table or data from the flip-flop.
The switching block switches the connection/non-connection (On/Off) of the data lines between the configurable logic blocks.
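The behavior of a basic logic element described above can be modeled in software roughly as follows. This is a behavioral sketch only: the class and method names are hypothetical, the multiplexer control is reduced to a single flag, and the most significant index bit is assumed to be the first input.

```python
class BasicLogicElement:
    """Behavioral model of a LUT, a flip-flop, and a 2-to-1 multiplexer."""

    def __init__(self, lut, use_registered=False):
        self.lut = lut                        # list of 2**k output bits
        self.use_registered = use_registered  # multiplexer control signal
        self.flip_flop = 0                    # value stored by the flip-flop
        self._lut_out = 0                     # most recent LUT output

    def evaluate(self, inputs):
        """Feed input bits to the LUT and return the multiplexer output."""
        index = 0
        for bit in inputs:
            index = (index << 1) | bit
        self._lut_out = self.lut[index]
        return self.output()

    def clock(self):
        """Clock signal: the flip-flop stores the current LUT output."""
        self.flip_flop = self._lut_out

    def reset(self):
        """Reset signal: the flip-flop clears its stored value."""
        self.flip_flop = 0

    def output(self):
        """Multiplexer: select the flip-flop value or the direct LUT output."""
        return self.flip_flop if self.use_registered else self._lut_out
```

With `use_registered=False` the element is purely combinational, which corresponds to the case in which the multiplexer passes the lookup table output through.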

演算装置２００をＦＰＧＡに実装する場合、設定部４００が、超曲面型ノード１１０における入力値と出力値との関係に基づいて生成したルックアップテーブルを、ベーシックロジックエレメントのルックアップテーブルに設定するようにしてもよい。この場合、マルチプレクサがルックアップテーブルからのデータを出力するようにすることで、ベーシックロジックエレメントに、超曲面型ノード１１０が行う演算と同様の演算を行わせることができる。 When the arithmetic device 200 is implemented in an FPGA, the setting unit 400 may set a lookup table generated based on the relationship between input and output values in the hypersurface node 110 as the lookup table of the basic logic element. In this case, by having the multiplexer output data from the lookup table, the basic logic element can be made to perform the same calculation as the calculation performed by the hypersurface node 110.
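Generating a lookup table from the input/output relationship of a trained node amounts to enumerating every possible binary input. A minimal sketch, assuming the trained node is available as a hypothetical callable `node` that takes a tuple of bits and returns one output bit:

```python
from itertools import product


def build_lookup_table(node, num_inputs):
    """Tabulate a binary node over all 2**num_inputs input combinations."""
    return {bits: node(bits) for bits in product((0, 1), repeat=num_inputs)}
```

The resulting dictionary has one row per input combination, so its size grows as 2 to the power of the number of inputs; this is why such tables are practical for nodes with few inputs.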

図１７の例のように、学習モデル装置１００が、超曲面型ノード１１０が構成するネットワークの構造を学習によって決定する場合、設定部４００が、スイッチングブロックの設定を調整することで、超曲面型ノード１１０が構成するネットワークと同様の構造のネットワークをＦＰＧＡに実装するようにしてもよい。 As in the example of Figure 17, when the learning model device 100 determines the structure of the network formed by the hypersurface node 110 through learning, the setting unit 400 may adjust the settings of the switching block so that a network with a structure similar to the network formed by the hypersurface node 110 is implemented in the FPGA.

複数のテーブル型ノード２１０に共用でルックアップテーブルが設けられていてもよい。
図２１は、複数のテーブル型ノード２１０に共用でルックアップテーブルが設けられる場合の、演算装置２００の構成の例を示す図である。図２１に示す構成で、演算装置２００ｆは、テーブル型ノード２１０ｆ－１からテーブル型ノード２１０ｆ－４と、テーブル記憶部２２０ｆ－１およびテーブル記憶部２２０ｆ－２とを備える。 A lookup table may be provided in common for multiple table type nodes 210.
Fig. 21 is a diagram showing an example of the configuration of a computing device 200 when a lookup table is provided to be shared by multiple table type nodes 210. In the configuration shown in Fig. 21, a computing device 200f includes table type nodes 210f-1 to 210f-4 and table storage units 220f-1 and 220f-2.

テーブル型ノード２１０ｆ－１からテーブル型ノード２１０ｆ－４を総称してテーブル型ノード２１０ｆとも表記する。テーブル記憶部２２０ｆ－１およびテーブル記憶部２２０ｆ－２を総称してテーブル記憶部２２０ｆとも表記する。
演算装置２００ｆは、演算装置２００の例に該当する。テーブル型ノード２１０ｆは、テーブル型ノード２１０の変形例に該当する。テーブル型ノード２１０ｆとテーブル記憶部２２０ｆとの組み合わせは、テーブル型ノード２１０の例に該当する。
テーブル記憶部２２０ｆは、ルックアップテーブルを記憶する。 The table type nodes 210f-1 to 210f-4 are also collectively referred to as table type nodes 210f. The table storage units 220f-1 and 220f-2 are also collectively referred to as table storage units 220f.
The computing device 200f corresponds to an example of the computing device 200. The table type node 210f corresponds to a modified example of the table type node 210. The combination of the table type node 210f and the table storage unit 220f corresponds to an example of the table type node 210.
The table storage unit 220f stores a lookup table.

テーブル型ノード２１０ｆの各々は、テーブル型ノード２１０ｆ自らはルックアップテーブルを備えず、テーブル記憶部２２０が記憶するルックアップテーブルを参照する。それ以外の点では、テーブル型ノード２１０ｆはテーブル型ノード２１０と同様である。
テーブル記憶部２２０ｆ－１が記憶するルックアップテーブルは、テーブル型ノード２１０ｆ－１とテーブル型ノード２１０ｆ－２とが共用で参照するルックアップテーブルとなっている。テーブル記憶部２２０ｆ－２が記憶するルックアップテーブルは、テーブル型ノード２１０ｆ－３とテーブル型ノード２１０ｆ－４とが共用で参照するルックアップテーブルとなっている。 Each of the table type nodes 210f does not have its own lookup table, but refers to a lookup table stored in the table storage unit 220f. In other respects, the table type node 210f is similar to the table type node 210.
The lookup table stored in the table storage unit 220f-1 is a lookup table that is commonly referenced by the table type node 210f-1 and the table type node 210f-2. The lookup table stored in the table storage unit 220f-2 is a lookup table that is commonly referenced by the table type node 210f-3 and the table type node 210f-4.
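The arrangement in which nodes hold no table of their own and instead refer to a table storage unit can be sketched as follows. The class names are hypothetical stand-ins for the table storage unit 220f and the table type node 210f, and the table contents are an arbitrary example.

```python
class TableStorage:
    """Holds one lookup table that several nodes may reference."""

    def __init__(self, table):
        self.table = table  # dict mapping input bit tuples to an output bit


class SharedTableNode:
    """A node without its own table; it looks outputs up in a shared storage."""

    def __init__(self, storage):
        self.storage = storage

    def __call__(self, bits):
        return self.storage.table[tuple(bits)]


# Two nodes referencing one stored table, as 210f-1 and 210f-2 reference 220f-1.
storage_1 = TableStorage({(0, 0): 0, (0, 1): 1, (1, 0): 1, (1, 1): 0})
node_1 = SharedTableNode(storage_1)
node_2 = SharedTableNode(storage_1)
```

Only one copy of the table exists in memory regardless of how many nodes reference it.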

学習モデル装置１００の学習が完了し、設定部４００が、超曲面型ノード１１０毎にルックアップテーブルを生成した際に、「似ているルックアップテーブル」を１つのルックアップテーブルに纏め、纏められた１つのルックアップテーブルをテーブル記憶部２２０ｆに記憶させるようにしてもよい。そして、纏められる前のルックアップテーブルを参照することになっていた超曲面型ノード１１０が、纏められたルックアップテーブルを共用で参照するように、設定部４００が、各超曲面型ノード１１０のルックアップテーブルの参照先を設定するようにしてもよい。 When learning by the learning model device 100 is complete and the setting unit 400 has generated a lookup table for each hypersurface node 110, the "similar lookup tables" may be aggregated into a single lookup table, and the aggregated lookup table may be stored in the table storage unit 220f. The setting unit 400 may then set the reference destination for the lookup table of each hypersurface node 110 so that the hypersurface nodes 110 that were to reference the lookup tables before the aggregation now share the aggregated lookup table.

「似ているルックアップテーブル」の判定条件として、例えば、ルックアップテーブルで「共通する行」のうち所定の閾値の割合以上（例えば、９０％以上）の行で出力値が同じである、といった条件を用いるようにしてもよい。
テーブル型ノード２１０への入力データと、ルックアップテーブルに示される入力データとの対応関係をテーブル型ノード２１０毎に設定可能な場合、「共通する行」の設定方法が複数通りあり、この点でルックアップテーブルを共用化できる可能性が高くなる。 As a criterion for determining whether a lookup table is similar, for example, a condition may be used in which the output values of the "common rows" in the lookup table are the same for a predetermined threshold percentage or more (for example, 90% or more).
If the correspondence between the input data to the table-type node 210 and the input data shown in the lookup table can be set for each table-type node 210, there are multiple ways to set the "common rows," which increases the possibility of sharing the lookup table.
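The similarity criterion can be written down directly. In this sketch the tables are dictionaries keyed by input bit tuples, "common rows" are taken to be the keys present in both tables, and the default threshold of 0.9 mirrors the 90% example above; the function names are hypothetical.

```python
def agreement_ratio(table_a, table_b):
    """Fraction of common rows on which two lookup tables give the same output."""
    common = table_a.keys() & table_b.keys()
    if not common:
        return 0.0
    same = sum(1 for key in common if table_a[key] == table_b[key])
    return same / len(common)


def are_similar(table_a, table_b, threshold=0.9):
    """True if the tables agree on at least `threshold` of their common rows."""
    return agreement_ratio(table_a, table_b) >= threshold
```

Note that merging tables that agree on fewer than all of their common rows changes the output of at least one node for some inputs, so the threshold trades memory savings against fidelity to the trained model.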

図２１の例で、テーブル型ノード２１０ｆ－１への入力データは（ｘ_１，ｘ_２）であり、テーブル型ノード２１０ｆ－２への入力データは（ｘ_０，ｘ_１）である。また、テーブル記憶部２２０ｆ－１が記憶するルックアップテーブルにおける入力データが（Ｉ_０，Ｉ_１）であるものとする。また、テーブル型ノード２１０ｆ－１が、ｘ_１をＩ_０に対応付け、ｘ_２をＩ_１に対応付けてルックアップテーブルを参照するように、ルックアップテーブルにおける出力値が設定されるものとする。 In the example of Fig. 21, the input data to the table type node 210f-1 is (x_1, x_2), and the input data to the table type node 210f-2 is (x_0, x_1). It is also assumed that the input data in the lookup table stored in the table storage unit 220f-1 is (I_0, I_1), and that the output values in the lookup table are set so that the table type node 210f-1 refers to the lookup table by associating x_1 with I_0 and x_2 with I_1.

テーブル型ノード２１０ｆ－２が、ｘ_０をＩ_０に対応付け、ｘ_１をＩ_１に対応付けるようにしてもよいし、あるいは、ｘ_１をＩ_０に対応付け、ｘ_０をＩ_１に対応付けるようにしてもよい。これら２通りの対応付けのうち少なくとも何れか一方の対応付けで判定条件が満たされれば、テーブル型ノード２１０ｆ－１が参照するルックアップテーブルと、テーブル型ノード２１０ｆ－２が参照するルックアップテーブルとを共用化することができる。 The table type node 210f-2 may associate x_0 with I_0 and x_1 with I_1, or may associate x_1 with I_0 and x_0 with I_1. If the determination condition is satisfied by at least one of these two associations, the lookup table referenced by the table type node 210f-1 and the lookup table referenced by the table type node 210f-2 can be shared.
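Trying each association in turn can be sketched as below for the case in which the node and the shared table have the same number of inputs. The function names are hypothetical, tables are dictionaries keyed by bit tuples, and the agreement test is the 90% example criterion given earlier.

```python
from itertools import permutations


def permute_table(table, assignment):
    """Re-key a node's table so that node input i feeds table input assignment[i]."""
    remapped = {}
    for bits, output in table.items():
        key = [0] * len(bits)
        for node_position, table_position in enumerate(assignment):
            key[table_position] = bits[node_position]
        remapped[tuple(key)] = output
    return remapped


def find_sharing_assignment(shared_table, node_table, threshold=0.9):
    """Return the first input assignment under which the tables are similar, or None."""
    num_inputs = len(next(iter(node_table)))
    for assignment in permutations(range(num_inputs)):
        remapped = permute_table(node_table, assignment)
        agree = sum(1 for key, out in remapped.items() if shared_table[key] == out)
        if agree / len(remapped) >= threshold:
            return assignment
    return None
```

For two inputs the loop tries exactly the two associations described above: the identity `(0, 1)` and the swap `(1, 0)`.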

テーブル型ノード２１０ｆへの入力データの個数が、ルックアップテーブルに示される入力データの個数よりも少ない場合、「共通する行」の設定方法の場合の数がさらに増え、ルックアップテーブルを共用化できる可能性がさらに高くなる。
例えば、テーブル型ノード２１０ｆ－１の出力データをｖ_１と表記すると、テーブル型ノード２１０ｆ－３の入力データは、（ｘ_０，ｖ_１）の２つである。また、テーブル型ノード２１０ｆ－４の入力データは、（ｙ_０，ｘ_２，ｘ_３）の３つである。
なお、ここでは、「入力データの個数」は、入力ベクトルの個数ではなく、個々の二値データの個数（したがって、入力ベクトルの要素数）を指すものとする。 If the number of input data to the table-type node 210f is less than the number of input data shown in the lookup table, the number of possible ways of setting the "common rows" increases further, which further increases the possibility that the lookup table can be shared.
For example, if the output data of the table type node 210f-1 is expressed as v_1, the input data of the table type node 210f-3 is the two values (x_0, v_1), and the input data of the table type node 210f-4 is the three values (y_0, x_2, x_3).
It should be noted that the "number of input data" here refers not to the number of input vectors but to the number of individual binary data (hence, the number of elements of the input vector).

また、テーブル記憶部２２０ｆ－２が記憶するルックアップテーブルにおける入力データが（Ｉ_２，Ｉ_３，Ｉ_４）の３つであるものとする。また、テーブル型ノード２１０ｆ－４が、ｙ_０をＩ_２に対応付け、ｘ_２をＩ_３に対応付け、ｘ_３をＩ_４に対応付けてルックアップテーブルを参照するように、ルックアップテーブルにおける出力値が設定されるものとする。 It is also assumed that the input data in the lookup table stored in the table storage unit 220f-2 consists of the three values (I_2, I_3, I_4), and that the output values in the lookup table are set so that the table-type node 210f-4 refers to the lookup table by associating y_0 with I_2, x_2 with I_3, and x_3 with I_4.

テーブル型ノード２１０ｆ－３が入力データ（ｘ_０，ｖ_１）をルックアップテーブルにおける入力データ（Ｉ_２，Ｉ_３，Ｉ_４）に対応付ける方法は、（ｘ_０，ｖ_１）＝（Ｉ_２，Ｉ_３）、（Ｉ_２，Ｉ_４）、（Ｉ_３，Ｉ_２）、（Ｉ_３，Ｉ_４）、（Ｉ_４，Ｉ_２）、（Ｉ_４，Ｉ_３）の６通りある。これら６通りの対応付けのうち少なくとも何れか１通りの対応付けで判定条件が満たされれば、テーブル型ノード２１０ｆ－３が参照するルックアップテーブルと、テーブル型ノード２１０ｆ－４が参照するルックアップテーブルとを共用化することができる。 There are six ways in which the table type node 210f-3 can associate its input data (x_0, v_1) with the input data (I_2, I_3, I_4) in the lookup table: (x_0, v_1) = (I_2, I_3), (I_2, I_4), (I_3, I_2), (I_3, I_4), (I_4, I_2), or (I_4, I_3). If the determination condition is met with at least one of these six associations, the lookup table referenced by the table type node 210f-3 and the lookup table referenced by the table type node 210f-4 can be shared.
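The count of six follows from choosing an ordered pair of distinct table inputs out of three, which is 3 x 2 = 6. A small sketch that enumerates the associations; the function name and the `first_index` parameter, used only to print labels starting at I_2, are illustrative assumptions.

```python
from itertools import permutations


def input_assignments(num_node_inputs, num_table_inputs, first_index=0):
    """List every way to map node inputs onto distinct table inputs, as label tuples."""
    labels = [f"I_{first_index + i}" for i in range(num_table_inputs)]
    return list(permutations(labels, num_node_inputs))


# Node 210f-3 has two inputs; the table in storage 220f-2 has inputs I_2, I_3, I_4.
assignments = input_assignments(2, 3, first_index=2)
```

How the table input left unassigned is treated when rows are compared is not specified in this passage, so the sketch stops at the enumeration.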

また、複数のテーブル型ノード２１０ｆが参照するルックアップテーブルを共用化する場合、共用化されるルックアップテーブルに、図１３の例のように出力値の列を複数設け、テーブル型ノード２１０ｆ毎の出力値を記載するようにしてもよい。
例えば、３つの１出力のテーブル型ノード２１０ｆを共用化する場合、ルックアップテーブルに出力値の列を３列設け、各テーブル型ノード２１０ｆの出力値を記載するようにしてもよい。
この場合、テーブル型ノード２１０ｆがルックアップテーブルを参照する際の検索キーとして用いられる入力データの記載を共用化することができ、この点で、ルックアップテーブルの記憶に必要なメモリ容量を削減することができる。 Furthermore, when a lookup table referenced by multiple table-type nodes 210f is shared, the shared lookup table may have multiple columns of output values, as in the example of Figure 13, and the output values for each table-type node 210f may be entered.
For example, when three table-type nodes 210f each having one output are shared, three columns of output values may be provided in the lookup table, and the output value of each table-type node 210f may be written therein.
In this case, the description of the input data used as a search key when the table type node 210f refers to the lookup table can be shared, and in this respect, the memory capacity required to store the lookup table can be reduced.
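A shared table with one output column per node can be sketched as follows. The function names are hypothetical, and the three example tables (AND, OR, XOR) are arbitrary; the point is that the input keys are stored once while each node reads its own column.

```python
def merge_tables(tables):
    """Combine single-output tables with identical keys into one multi-column table."""
    keys = set(tables[0])
    if any(set(table) != keys for table in tables):
        raise ValueError("all tables must list the same input rows")
    return {key: tuple(table[key] for table in tables) for key in sorted(keys)}


def read_output(merged, bits, column):
    """Look up one node's output: the row is chosen by the inputs, the column by the node."""
    return merged[tuple(bits)][column]


rows = [(a, b) for a in (0, 1) for b in (0, 1)]
and_table = {r: r[0] & r[1] for r in rows}
or_table = {r: r[0] | r[1] for r in rows}
xor_table = {r: r[0] ^ r[1] for r in rows}
merged = merge_tables([and_table, or_table, xor_table])
```

Three separate tables would store the four input rows three times; the merged table stores them once.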

以上のように、超曲面型ノード１１０は、二値ベクトルの入力を受け、入力された二値ベクトルの次元数よりも１次元多い次元数の実数空間である入出力実数空間における超曲面に含まれる点のうち、入力された二値ベクトルを入出力実数空間の部分空間における座標値として扱った場合のその座標値の各要素を含む座標値を有する点の座標値の要素のうち、入力された二値ベクトルによる座標値の要素以外の要素に基づいて、二値化された出力値を決定する。 As described above, the hypersurface node 110 receives a binary vector as input, and determines a binarized output value based on the coordinate elements of points contained in a hypersurface in the input/output real space, which is a real space with one more dimension than the number of dimensions of the input binary vector, that have coordinate values that include each element of the coordinate values when the input binary vector is treated as coordinate values in a subspace of the input/output real space, other than the elements of the coordinate values of the input binary vector.

超曲面型ノード１１０によれば、超曲面で入力値と出力値との関係を表すことで、真理値表と同等の表現力を得られる。学習モデル装置１００によれば、この点で、二値データを用いる超曲面型ノード１１０を備える学習モデルが、超曲面型ノード１１０の層数を増やす必要なしに、比較的多様な出力値をとり得るようにすることができる。 The hypersurface node 110 can express the relationship between input and output values using a hypersurface, achieving expressive power equivalent to that of a truth table. In this respect, the learning model device 100 enables a learning model equipped with a hypersurface node 110 that uses binary data to be able to take a relatively wide variety of output values without the need to increase the number of layers of the hypersurface node 110.

また、超曲面型ノード１１０に設けられる超曲面は、超曲面型ノード１１０に入力される二値ベクトルがとり得る値の各々を、入出力実数空間の部分空間における座標値として扱った場合のその座標値と、学習パラメータ値との組み合わせによる、入出力実数空間における座標値の点を制御点とするB-Spline超曲面である。 Furthermore, the hypersurface provided in the hypersurface type node 110 is a B-Spline hypersurface whose control points are points in the input/output real space. Each control point is obtained by taking one of the values that the binary vector input to the hypersurface type node 110 can take, treating it as a coordinate value in a subspace of the input/output real space, and combining that coordinate value with a learning parameter value.

超曲面型ノード１１０によれば、このようにB-Spline超曲面を用いることで微分可能な関数を得られ、誤差逆伝播法など関数の微分を用いる学習手法を適用することができる。
また、超曲面型ノード１１０への入力データ値をB-Spline超曲面の制御点として用いることで、制御点の出力座標値が入力データ値に対する出力データ値を示すようなる。超曲面型ノード１１０によれば、この点で、入力データ値と出力データ値との関係を比較的容易に把握することができる。 According to the hypersurface node 110, a differentiable function can be obtained by using a B-Spline hypersurface in this way, and a learning method that uses the differentiation of a function, such as the backpropagation method, can be applied.
Furthermore, by using the input data values to the hypersurface node 110 as control points of the B-Spline hypersurface, the output coordinate values of the control points indicate the output data values for the input data values. In this respect, the hypersurface node 110 makes it relatively easy to grasp the relationship between the input data values and the output data values.
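The relationship between control points and outputs can be illustrated with a minimal sketch that assumes a degree-1 tensor-product B-Spline basis, for which the surface over the unit hypercube reduces to multilinear interpolation of the control-point outputs. The degree, the function name, and the example weights are assumptions made for illustration; this passage does not state the degree of the B-Spline.

```python
from itertools import product


def multilinear_surface(weights, point):
    """Evaluate a degree-1 tensor-product surface whose control points sit on binary vertices.

    `weights` maps each binary vertex to its learned output coordinate;
    `point` is a tuple of real coordinates in [0, 1].
    """
    total = 0.0
    for vertex in product((0, 1), repeat=len(point)):
        basis = 1.0
        for v, x in zip(vertex, point):
            basis *= x if v else (1.0 - x)  # linear basis function per dimension
        total += basis * weights[vertex]
    return total


# Learned output coordinates that encode exclusive OR with a single node.
xor_weights = {(0, 0): -1.0, (0, 1): 1.0, (1, 0): 1.0, (1, 1): -1.0}
```

Under this assumption the surface passes through each control point at the corresponding binary input, so the learned value at a vertex is the pre-threshold output for that input, and thresholding at 0 reproduces exclusive OR without adding layers.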

また、超曲面処理部１１１は、超曲面に含まれる点のうち、超曲面型ノード１１０に入力される二値ベクトルを入出力実数空間の部分空間における座標値として扱った場合のその座標値の各要素を含む座標値を有する点の前記座標値の要素のうち、超曲面型ノード１１０に入力される二値ベクトルによる座標値の要素以外の要素の値を取得する。
閾値演算部１１２は、学習モデル装置１００における順伝播のときは、超曲面処理部１１１が取得した値をステップ関数で二値化し、超曲面処理部１１１における逆伝播のときは、ステップ関数を微分可能な関数で近似する。 Furthermore, the hypersurface processing unit 111 acquires the values of elements of the coordinate values of points included in the hypersurface that include each element of the coordinate values when the binary vector input to the hypersurface type node 110 is treated as a coordinate value in a subspace of the input/output real number space, other than the elements of the coordinate values of the binary vector input to the hypersurface type node 110.
When performing forward propagation in the learning model device 100, the threshold calculation unit 112 binarizes the value acquired by the hypersurface processing unit 111 using a step function, and when performing backward propagation in the hypersurface processing unit 111, it approximates the step function with a differentiable function.

このように、閾値演算部１１２が、閾値関数を切り替えることで、順伝播のときは超曲面型ノード１１０が二値データを出力するようにし、かつ、誤差逆伝播法など関数の微分を用いる学習手法を適用することができる。 In this way, the threshold calculation unit 112 switches the threshold function, allowing the hypersurface node 110 to output binary data during forward propagation, and also enabling the application of learning methods that use function differentiation, such as backpropagation.
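Switching the threshold function between the forward and backward passes can be sketched as below. The use of a sigmoid as the differentiable approximation, the `sharpness` parameter, and the function names are assumptions; this passage says only that the step function is approximated by a differentiable function.

```python
import math


def step_forward(value, threshold=0.0):
    """Forward pass: binarize with a step function."""
    return 1 if value >= threshold else 0


def surrogate_gradient(value, threshold=0.0, sharpness=1.0):
    """Backward pass: derivative of a sigmoid standing in for the step function."""
    s = 1.0 / (1.0 + math.exp(-sharpness * (value - threshold)))
    return sharpness * s * (1.0 - s)


def backward(value, upstream_gradient, threshold=0.0):
    """Chain rule through the threshold unit using the surrogate derivative."""
    return upstream_gradient * surrogate_gradient(value, threshold)
```

The forward output is always 0 or 1, while the gradient passed backward is nonzero everywhere and largest near the threshold, which is what lets derivative-based learning adjust the control points.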

また、学習モデル装置１００は、二値ベクトルの入力を受け、入力された二値ベクトルの次元数よりも１次元多い次元数の実数空間である入出力実数空間における超曲面に含まれる点のうち、入力された二値ベクトルを入出力実数空間の部分空間における座標値として扱った場合のその座標値の各要素を含む座標値を有する点の座標値の要素のうち、入力された二値ベクトルによる座標値の要素以外の要素に基づいて、二値化された出力値を決定する。
学習制御部３００は、学習モデル装置１００の学習を制御する。
設定部４００は、学習後の学習モデル装置１００のノードにおける入力値と出力値との関係を示すルックアップテーブルを生成し、生成したルックアップテーブルを演算装置２００のテンプレートに設定する。 Furthermore, the learning model device 100 receives an input of a binary vector, and determines a binarized output value based on elements of the coordinate values of points included in a hypersurface in an input/output real space, which is a real space with one more dimension than the number of dimensions of the input binary vector, that have coordinate values including each element of the coordinate values when the input binary vector is treated as a coordinate value in a subspace of the input/output real space, other than the elements of the coordinate values of the input binary vector.
The learning control unit 300 controls the learning of the learning model device 100.
The setting unit 400 generates a lookup table indicating the relationship between input values and output values at the nodes of the learning model device 100 after learning, and sets the generated lookup table in the template of the computing device 200.

演算装置生産システム１によれば、超曲面で超曲面処理部１１１における入力値と出力値との関係を表すことで、真理値表と同等の表現力を得られる。演算装置生産システム１によれば、この点で、二値データを用いる超曲面型ノード１１０を備える学習モデルが、超曲面型ノード１１０の層数を増やす必要なしに、比較的多様な出力値をとり得るようにすることができる。
また、演算装置生産システム１によれば、学習で得られたルックアップテーブルを演算装置２００のテンプレートに設定して演算装置２００を生産することで、演算装置２００は、ルックアップテーブルを参照して二値演算を行うことができる。演算装置生産システム１によれば、テーブル型ノード２１０は、ルックアップテーブルを参照して入力データ値に対する出力データ値を決定する点で、複雑な演算に相当する入出力の場合でも、比較的短時間で、かつ、比較的小さい消費電力で、データを出力することができる。 According to the computing device production system 1, it is possible to obtain expressive power equivalent to that of a truth table by using a hypersurface to represent the relationship between input values and output values in the hypersurface processing unit 111. In this respect, according to the computing device production system 1, it is possible to make it possible for a learning model including hypersurface type nodes 110 that use binary data to take on a relatively wide variety of output values without the need to increase the number of layers of the hypersurface type nodes 110.
Furthermore, according to the computing device production system 1, the computing device 200 is produced by setting the lookup table obtained by learning in the template of the computing device 200, so that the computing device 200 can perform binary operations by referring to the lookup table. Because the table-type node 210 determines the output data value for an input data value by referring to the lookup table, it can output data in a relatively short time and with relatively low power consumption, even when the input/output relationship corresponds to a complex operation.

また、演算装置２００は、Field Programmable Gate Arrayを用いて構成される。
演算装置生産システム１によれば、既存のＦＧＰＡを演算装置２００のテンプレートとして用いることができ、演算装置２００のテンプレートを別途生成する必要がない。このように、演算装置生産システム１によれば、演算装置２００を生産するための負担が比較的小さい。 The arithmetic device 200 is configured using a field programmable gate array.
According to the computing device production system 1, an existing FPGA can be used as a template for the computing device 200, and there is no need to separately generate a template for the computing device 200. In this way, according to the computing device production system 1, the burden of producing the computing device 200 is relatively small.

図２２は、少なくとも１つの実施形態に係るコンピュータの構成の例を示す概略ブロック図である。図２２に示す構成で、コンピュータ７００は、ＣＰＵ７１０と、主記憶装置７２０と、補助記憶装置７３０と、インタフェース７４０とを備える。 Figure 22 is a schematic block diagram illustrating an example of a computer configuration according to at least one embodiment. In the configuration shown in Figure 22, the computer 700 includes a CPU 710, a main memory device 720, an auxiliary memory device 730, and an interface 740.

上記の学習モデル装置１００、演算装置２００、演算装置２００ｆ、学習制御部３００、および、設定部４００のうち何れか１つ以上が、コンピュータ７００に実装されてもよい。その場合、上述した各処理部の動作は、プログラムの形式で補助記憶装置７３０に記憶されている。ＣＰＵ７１０は、プログラムを補助記憶装置７３０から読み出して主記憶装置７２０に展開し、当該プログラムに従って上記処理を実行する。また、ＣＰＵ７１０は、プログラムに従って、上述した各記憶部に対応する記憶領域を主記憶装置７２０に確保する。 Any one or more of the above-mentioned learning model device 100, calculation device 200, calculation device 200f, learning control unit 300, and setting unit 400 may be implemented in the computer 700. In this case, the operation of each of the above-mentioned processing units is stored in the auxiliary storage device 730 in the form of a program. The CPU 710 reads the program from the auxiliary storage device 730, expands it in the main storage device 720, and executes the above-mentioned processing in accordance with the program. The CPU 710 also allocates memory areas in the main storage device 720 corresponding to each of the above-mentioned memory units in accordance with the program.

学習モデル装置１００がコンピュータ７００に実装される場合、超曲面型ノード１１０の動作は、プログラムの形式で補助記憶装置７３０に記憶されている。ＣＰＵ７１０は、プログラムを補助記憶装置７３０から読み出して主記憶装置７２０に展開し、当該プログラムに従って上記処理を実行する。 When the learning model device 100 is implemented in a computer 700, the operation of the hypersurface node 110 is stored in the auxiliary storage device 730 in the form of a program. The CPU 710 reads the program from the auxiliary storage device 730, expands it into the main storage device 720, and executes the above-mentioned processing in accordance with the program.

また、ＣＰＵ７１０は、プログラムに従って、学習モデル装置１００が処理を行うための記憶領域を主記憶装置７２０に確保する。学習モデル装置１００と他の装置との通信は、インタフェース７４０が通信機能を有し、ＣＰＵ７１０の制御に従って動作することで実行される。
学習モデル装置１００とユーザとのインタラクションは、インタフェース７４０が入力デバイスおよび出力デバイスを有し、ＣＰＵ７１０の制御に従って出力デバイスにて情報をユーザに提示し、入力デバイスにてユーザ操作を受け付けることで実行される。 Furthermore, the CPU 710, in accordance with the program, allocates a memory area in the main memory device 720 for the learning model device 100 to perform processing. Communication between the learning model device 100 and other devices is performed by the interface 740, which has a communication function and operates under the control of the CPU 710.
Interaction between the learning model device 100 and the user is carried out by the interface 740 having an input device and an output device, presenting information to the user via the output device under the control of the CPU 710, and accepting user operations via the input device.

演算装置２００がコンピュータ７００に実装される場合、テーブル型ノード２１０の動作は、プログラムの形式で補助記憶装置７３０に記憶されている。ＣＰＵ７１０は、プログラムを補助記憶装置７３０から読み出して主記憶装置７２０に展開し、当該プログラムに従って上記処理を実行する。 When the computing device 200 is implemented in a computer 700, the operation of the table type node 210 is stored in the auxiliary storage device 730 in the form of a program. The CPU 710 reads the program from the auxiliary storage device 730, expands it into the main storage device 720, and executes the above processing in accordance with the program.

また、ＣＰＵ７１０は、プログラムに従って、演算装置２００が処理を行うための記憶領域を主記憶装置７２０に確保する。演算装置２００と他の装置との通信は、インタフェース７４０が通信機能を有し、ＣＰＵ７１０の制御に従って動作することで実行される。
演算装置２００とユーザとのインタラクションは、インタフェース７４０が入力デバイスおよび出力デバイスを有し、ＣＰＵ７１０の制御に従って出力デバイスにて情報をユーザに提示し、入力デバイスにてユーザ操作を受け付けることで実行される。 Furthermore, the CPU 710 allocates a storage area in the main storage device 720 for the arithmetic device 200 to perform processing in accordance with the program. Communication between the arithmetic device 200 and other devices is performed by the interface 740, which has a communication function and operates under the control of the CPU 710.
Interaction between the computing device 200 and the user is carried out by the interface 740 having an input device and an output device, presenting information to the user via the output device under the control of the CPU 710, and accepting user operations via the input device.

演算装置２００ｆがコンピュータ７００に実装される場合、テーブル型ノード２１０ｆの動作は、プログラムの形式で補助記憶装置７３０に記憶されている。ＣＰＵ７１０は、プログラムを補助記憶装置７３０から読み出して主記憶装置７２０に展開し、当該プログラムに従って上記処理を実行する。 When the calculation device 200f is implemented in the computer 700, the operation of the table type node 210f is stored in the auxiliary storage device 730 in the form of a program. The CPU 710 reads the program from the auxiliary storage device 730, loads it into the main storage device 720, and executes the above processing in accordance with the program.

また、ＣＰＵ７１０は、プログラムに従って、テーブル記憶部２２０ｆに対応する記憶領域を主記憶装置７２０に確保する。演算装置２００ｆと他の装置との通信は、インタフェース７４０が通信機能を有し、ＣＰＵ７１０の制御に従って動作することで実行される。
演算装置２００ｆとユーザとのインタラクションは、インタフェース７４０が入力デバイスおよび出力デバイスを有し、ＣＰＵ７１０の制御に従って出力デバイスにて情報をユーザに提示し、入力デバイスにてユーザ操作を受け付けることで実行される。 Furthermore, the CPU 710, in accordance with the program, allocates a storage area corresponding to the table storage unit 220f in the main storage device 720. Communication between the calculation device 200f and other devices is performed by the interface 740, which has a communication function and operates under the control of the CPU 710.
Interaction between the computing device 200f and the user is carried out by the interface 740 having an input device and an output device, presenting information to the user via the output device under the control of the CPU 710, and accepting user operations via the input device.

学習制御部３００がコンピュータ７００に実装される場合、学習制御部３００の動作は、プログラムの形式で補助記憶装置７３０に記憶されている。ＣＰＵ７１０は、プログラムを補助記憶装置７３０から読み出して主記憶装置７２０に展開し、当該プログラムに従って上記処理を実行する。 When the learning control unit 300 is implemented in the computer 700, the operation of the learning control unit 300 is stored in the auxiliary storage device 730 in the form of a program. The CPU 710 reads the program from the auxiliary storage device 730, loads it into the main storage device 720, and executes the above-mentioned processing in accordance with the program.

また、ＣＰＵ７１０は、プログラムに従って、学習制御部３００が処理を行うための記憶領域を主記憶装置７２０に確保する。学習制御部３００と他の装置との通信は、インタフェース７４０が通信機能を有し、ＣＰＵ７１０の制御に従って動作することで実行される。
学習制御部３００とユーザとのインタラクションは、インタフェース７４０が入力デバイスおよび出力デバイスを有し、ＣＰＵ７１０の制御に従って出力デバイスにて情報をユーザに提示し、入力デバイスにてユーザ操作を受け付けることで実行される。 Furthermore, the CPU 710 allocates a storage area in the main storage device 720 for the learning control unit 300 to perform processing in accordance with the program. Communication between the learning control unit 300 and other devices is performed by the interface 740, which has a communication function and operates under the control of the CPU 710.
Interaction between the learning control unit 300 and the user is carried out by the interface 740 having an input device and an output device, presenting information to the user via the output device under the control of the CPU 710, and accepting user operations via the input device.

設定部４００がコンピュータ７００に実装される場合、設定部４００の動作は、プログラムの形式で補助記憶装置７３０に記憶されている。ＣＰＵ７１０は、プログラムを補助記憶装置７３０から読み出して主記憶装置７２０に展開し、当該プログラムに従って上記処理を実行する。 When the setting unit 400 is implemented in the computer 700, the operation of the setting unit 400 is stored in the auxiliary storage device 730 in the form of a program. The CPU 710 reads the program from the auxiliary storage device 730, loads it into the main storage device 720, and executes the above-mentioned processing in accordance with the program.

Furthermore, in accordance with the program, the CPU 710 allocates a storage area in the main storage device 720 for the setting unit 400 to perform processing. Communication between the setting unit 400 and other devices is performed by the interface 740, which has a communication function and operates under the control of the CPU 710.
Interaction between the setting unit 400 and the user is carried out through the interface 740, which has an input device and an output device: under the control of the CPU 710, it presents information to the user via the output device and accepts user operations via the input device.

Note that a program for realizing all or part of the functions of the learning model device 100, the computing device 200, the computing device 200f, the learning control unit 300, and the setting unit 400 may be recorded on a computer-readable recording medium, and the processing of each unit may be performed by having a computer system read and execute the program recorded on this recording medium. The term "computer system" here includes an OS (Operating System) and hardware such as peripheral devices.
A "computer-readable recording medium" refers to portable media such as flexible disks, magneto-optical disks, ROMs (Read Only Memory), and CD-ROMs (Compact Disc Read Only Memory), and to storage devices such as hard disks built into a computer system. The program may realize only part of the functions described above, or may realize those functions in combination with a program already recorded in the computer system.

An embodiment of the present invention has been described above in detail with reference to the drawings. The specific configuration is not limited to this embodiment, however, and also includes design modifications and the like that do not depart from the gist of the present invention.

1 Computing device production system
100 Learning model device
110 Hypersurface-type node
111 Hypersurface processing unit
112 Threshold calculation unit
200, 200f Computing device
210, 210f Table-type node
220, 220f Table storage unit
300 Learning control unit
400 Setting unit

Claims

1. A learning model device comprising a node that receives a binary vector as input and determines a binarized output value based on an element of the coordinate values of a point on a hypersurface in a real number space having one more dimension than the binary vector, the point being one whose coordinate values include each element of the binary vector when the binary vector is treated as coordinate values in a subspace of the real number space, and the element being one other than the elements of the binary vector.

2. The learning model device according to claim 1, wherein the hypersurface is a B-spline hypersurface whose control points are points in the real number space, the coordinate values of each control point being determined by combining a learning parameter value with the coordinate values obtained when one of the values that the binary vector can take is treated as coordinate values in the subspace of the real number space.
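
As an illustration of the node recited above, the following is a minimal sketch and not the patented implementation. It assumes a degree-1 (multilinear) tensor-product surface, which is the simplest B-spline case, with one control point above each vertex of the unit hypercube; the extra coordinate of each control point plays the role of the learning parameter. The names `multilinear_height`, `node_output`, `xor_heights`, and the zero threshold are hypothetical.

```python
from itertools import product


def multilinear_height(x, heights):
    """Evaluate a degree-1 tensor-product surface over the unit hypercube.

    x       : sequence of n coordinates in [0, 1]
    heights : dict mapping each binary n-tuple to the extra (learnable)
              coordinate of the control point above that vertex
    """
    total = 0.0
    for vertex in product((0, 1), repeat=len(x)):
        # Basis weight: product of x_i (bit is 1) or 1 - x_i (bit is 0).
        weight = 1.0
        for xi, bit in zip(x, vertex):
            weight *= xi if bit else 1.0 - xi
        total += weight * heights[vertex]
    return total


def node_output(bits, heights, threshold=0.0):
    """Binarize the surface height above the binary input vector."""
    return 1 if multilinear_height(bits, heights) >= threshold else 0


# Hypothetical parameter values (assumed, not from the source) with which
# a single node reproduces exclusive OR.
xor_heights = {(0, 0): -1.0, (0, 1): 1.0, (1, 0): 1.0, (1, 1): -1.0}
xor_table = {b: node_output(b, xor_heights) for b in product((0, 1), repeat=2)}
```

With these assumed heights the single node maps (0, 1) and (1, 0) to 1 and the other two inputs to 0, which is the exclusive-OR behavior that a single threshold-on-sum node cannot express.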

3. The learning model device according to claim 1 or 2, wherein the node comprises:
a hypersurface processing unit that acquires the value of the element, other than the elements of the binary vector, of the coordinate values of the point on the hypersurface whose coordinate values include each element of the binary vector when the binary vector is treated as coordinate values in the subspace of the real number space; and
a threshold calculation unit that, in forward propagation in the learning model device, binarizes the value acquired by the hypersurface processing unit using a step function and, in backpropagation in the learning model device, approximates the step function with a differentiable function.
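
The forward/backward asymmetry of the threshold calculation unit can be sketched as follows. This is a minimal illustration under stated assumptions: the claim says only "a differentiable function", so the choice of a sigmoid as the surrogate, the squared-error loss, and the names `step`, `surrogate_grad`, and `train_height` are all hypothetical.

```python
import math


def step(z):
    """Forward pass: hard threshold at zero."""
    return 1.0 if z >= 0.0 else 0.0


def surrogate_grad(z, slope=1.0):
    """Backward pass: derivative of sigmoid(slope * z), used in place of the
    step function's derivative, which is zero almost everywhere."""
    s = 1.0 / (1.0 + math.exp(-slope * z))
    return slope * s * (1.0 - s)


def train_height(height, target, lr=0.5, epochs=20):
    """Gradient descent on the squared error of the binarized output.

    `height` stands in for the value produced by the hypersurface
    processing unit for one fixed binary input (an assumption made to keep
    the sketch to a single scalar parameter).
    """
    for _ in range(epochs):
        y = step(height)                          # forward: exact step
        dloss_dy = 2.0 * (y - target)             # d/dy of (y - target)^2
        grad = dloss_dy * surrogate_grad(height)  # backward: surrogate
        height -= lr * grad
    return height


trained = train_height(height=-0.5, target=1.0)
```

Because the forward pass still uses the exact step function, the trained node's outputs remain strictly binary; only the gradient signal is smoothed.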

4. A computing device production system comprising a learning model system, a learning control unit, and a setting unit, wherein:
the learning model system includes a node that receives a binary vector as input and determines a binarized output value based on an element of the coordinate values of a point on a hypersurface in a real number space having one more dimension than the binary vector, the point being one whose coordinate values include each element of the binary vector when the binary vector is treated as coordinate values in a subspace of the real number space, and the element being one other than the elements of the binary vector;
the learning control unit controls learning of the learning model system; and
the setting unit generates a lookup table indicating the relationship between input values and output values at the node of the learning model system after learning, and sets the generated lookup table in a template of a computing device.

5. The computing device production system according to claim 4, wherein the computing device is configured using a field-programmable gate array (FPGA).
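
The lookup-table generation and setting recited above can be sketched as follows. This is a hedged illustration, not the setting unit's actual procedure: `trained_node` is a stand-in for a node after learning, `TableNode` is a hypothetical model of a table-type node in the template, and the packing order in `pack_table` (input bits read most significant first, bit i of the word holding the output for address i) is an assumption that would need to match the target FPGA toolchain.

```python
from itertools import product


def build_lookup_table(node, n_inputs):
    """Enumerate every binary input and record the trained node's output."""
    return {bits: node(bits) for bits in product((0, 1), repeat=n_inputs)}


def pack_table(table):
    """Pack the table into one integer (assumed bit order, see lead-in)."""
    word = 0
    for bits, out in table.items():
        address = int("".join(str(b) for b in bits), 2)
        word |= out << address
    return word


class TableNode:
    """Hypothetical table-type node: an empty slot that receives a table."""

    def __init__(self):
        self.table = None

    def set_table(self, table):
        self.table = dict(table)

    def __call__(self, bits):
        return self.table[tuple(bits)]


def trained_node(bits):
    """Stand-in for a node after learning (here: exclusive OR)."""
    return bits[0] ^ bits[1]


table = build_lookup_table(trained_node, n_inputs=2)
table_node = TableNode()
table_node.set_table(table)
packed = pack_table(table)
```

Since a node with n binary inputs has only 2^n possible inputs, the table is an exact replacement for the trained node, and inference needs no real-valued arithmetic.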

6. A computing method comprising, by a computer: receiving an input of a binary vector; and determining a binarized output value based on an element of the coordinate values of a point on a hypersurface in a real number space having one more dimension than the binary vector, the point being one whose coordinate values include each element of the binary vector when the binary vector is treated as coordinate values in a subspace of the real number space, and the element being one other than the elements of the binary vector.

7. A method for producing a computing device, comprising:
training a learning model system including a node that receives a binary vector as input and determines an output value based on an element of the coordinate values of a point on a hypersurface in a real number space having one more dimension than the binary vector, the point being one whose coordinate values include each element of the binary vector when the binary vector is treated as coordinate values in a subspace of the real number space, and the element being one other than the elements of the binary vector;
generating a lookup table indicating the relationship between inputs and outputs at the node of the learning model system after learning; and
setting the generated lookup table in a template of the computing device.

8. A program for causing a computer to execute: receiving an input of a binary vector; and determining a binarized output value based on an element of the coordinate values of a point on a hypersurface in a real number space having one more dimension than the binary vector, the point being one whose coordinate values include each element of the binary vector when the binary vector is treated as coordinate values in a subspace of the real number space, and the element being one other than the elements of the binary vector.

9. A program for causing a computer to execute:
training a learning model system including a node that receives a binary vector as input and determines an output value based on an element of the coordinate values of a point on a hypersurface in a real number space having one more dimension than the binary vector, the point being one whose coordinate values include each element of the binary vector when the binary vector is treated as coordinate values in a subspace of the real number space, and the element being one other than the elements of the binary vector;
generating a lookup table indicating the relationship between inputs and outputs at the node of the learning model system after learning; and
setting the generated lookup table in a template of a computing device.