WO2021095519A1 - Information processing device - Google Patents
- Publication number
- WO2021095519A1 (PCT/JP2020/040345)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- model
- data
- encoder
- information processing
- learning
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/09—Supervised learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
- G06N3/0455—Auto-encoder networks; Encoder-decoder networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0475—Generative networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/094—Adversarial learning
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
Definitions
- This disclosure relates to an information processing device.
- Non-Patent Document 1 discloses a multi-task adversarial network (MTAN).
- In MTAN, feature representations of images labeled with content and style are generalized with respect to style factors through adversarial learning on style identification of the images.
- With MTAN, it is possible to generalize the feature representation of an image without causing confusion in the content identification of an image containing an unknown style factor.
- The present disclosure has been made to solve the above-mentioned problems, and its purpose is to make it possible to determine, based on the input data to a trained model, whether or not the trained model can exhibit the performance obtained by machine learning.
- the information processing device includes an encoder and a determination unit.
- the encoder includes an encoder model that outputs features of a plurality of dimensions from training data contained in at least one data set.
- the determination unit receives the specific feature amount of at least one input data from the encoder model, and outputs the goodness of fit of the at least one input data to the at least one data set.
- the encoder model is adapted by machine learning to distribute features in specific subspaces of the feature space defined by multiple dimensions.
- the determination unit calculates the goodness of fit of at least one input data to at least one data set from the positional relationship between the specific feature amount and the specific subspace.
- By referring to the goodness of fit calculated by the determination unit, it can be determined whether the input data is suitable for the at least one environment (known environment) in which the training data was acquired.
- As a result, it can be determined whether the trained model, adapted by machine learning using the training data acquired in the known environment, functions normally with respect to the input data. That is, it is possible to determine, based on the input data to the trained model, whether or not the trained model can exhibit the performance obtained by machine learning.
- The determination unit may back-calculate, via the encoder model, the difference between the specific feature amount and the feature amount extracted from the training data by the encoder model as the amount of change of the at least one input data, and may output a method of changing the conditions under which the at least one input data is acquired so as to reduce the absolute value of the change amount.
- the conditions of the environment in which the input data is acquired can be adapted to the conditions of the known environment.
- the trained model can exhibit the performance obtained in machine learning even for the input data acquired in the unknown environment.
- the information processing device may further include a storage unit and a learning unit. At least one data set may be stored in the storage unit.
- the learning unit may adapt the encoder model so that the features are distributed in a specific subspace by machine learning using at least one data set.
- With this configuration, an encoder model that has not yet undergone machine learning can be turned into a trained, adapted encoder model by the learning unit.
- the information processing device includes a storage unit, an encoder, and a learning unit. At least one data set is stored in the storage unit.
- the encoder includes an encoder model that extracts features of multiple dimensions from the training data contained in the dataset.
- the learning unit adapts the encoder model so that the features are distributed in a specific subspace of the feature space defined by a plurality of dimensions by machine learning using a data set.
- With this configuration, an encoder model that has not yet undergone machine learning can be turned into a trained, adapted encoder model by the learning unit.
- the information processing device may further include a decoder.
- the decoder may include a decoder model that decodes the features from the encoder.
- the learning unit may adapt the encoder model and the decoder model so that the features follow a standard normal distribution by machine learning using at least one data set.
- With this configuration, the encoder model and the decoder model are adapted into a variational autoencoder (VAE) by machine learning, and the encoder model is adapted so that the features extracted from the training data are distributed uniformly in the specific subspace.
- the information processing device may further include a classifier.
- the classifier may include a discriminative model that identifies which of the at least one environment the training data is contained in.
- The machine learning may be adversarial learning performed between the discriminative model and the encoder model.
- In the adversarial learning, the discriminative model may be optimized to maximize the probability that it succeeds in identifying the correct environment in which the training data was acquired, and the encoder model may be optimized to maximize the probability that the discriminative model fails in that identification.
- The adversarial learning removes, from the features extracted from the training data by the encoder model, the bias peculiar to the environment in which the data contained in each of the at least one data set was acquired. Since the features output from the encoder model emphasize the features common to the known environments, it becomes easy to determine, based on the input data to the trained model, whether or not the trained model can exhibit the performance obtained by machine learning.
- the specific subspace may be a spherical surface centered on the origin of the feature space.
- With this configuration, through the machine learning of the encoder, a clear difference is easily produced between the distance from the origin of the feature amounts of data acquired in an environment compatible with the known environments and that of data acquired in an incompatible environment. As a result, it is possible to determine more clearly, based on the input data to the trained model, whether or not the trained model can exhibit the performance obtained by machine learning.
- FIG. 8 shows the distribution of the features output from the encoder model of FIG. 5 when the inference processing performed by the main purpose model is two-class logistic regression using a sigmoid function. FIG. 9 is a flowchart showing the flow of another example of the conformity determination process performed by the determination unit. FIG. 10 is a block diagram showing the configuration of the information processing device according to modification 1 of the embodiment. FIG. 11 is a block diagram showing the configuration of the information processing device according to modification 2 of the embodiment. FIG. 12 is a schematic view showing an example of the overall configuration of the visual inspection system including the information processing device. FIG. 13 is a schematic diagram showing an image example of the work acquired by the imaging unit. FIG. 14 is a schematic configuration diagram of the information processing device.
- FIG. 1 is a block diagram showing a configuration of an information processing device 100 according to an embodiment.
- The information processing device 100 includes a storage unit Stg, a learning unit Ln, an encoder Enc, a decoder Dec, a discriminator Dsc, a main purpose processor Mpr, and a determination unit Jdg.
- Data sets E1 to En are stored in the storage unit Stg.
- the encoder Enc includes an encoder model Mc.
- the decoder Dec includes a decoder model Md.
- the classifier Dsc includes a discriminative model Me.
- the main purpose processor Mpr includes a main purpose model Mm.
- Each of the encoder model Mc, the decoder model Md, the discriminative model Me, and the main purpose model Mm includes a neural network.
- the decoder Dec may not be included in the information processing device 100.
- the data sets E1 to En include training data dt1 to dtn.
- The training data dt1 to dtn are labeled according to the data sets E1 to En to which they belong, and include correct answer data corresponding to the output of the main purpose processor Mpr.
- the labels attached to each of the data sets E1 to En correspond to the environment in which the data included in each data set was acquired.
- The environment, also called a domain, is determined by the conditions under which the data is acquired.
- the difference in the conditions can be a factor (bias) that makes the distribution of the features of the learning data of the same attribute different.
- the condition includes, for example, the date and time when the learning data was acquired, the place where the learning data was acquired, the set value of the device from which the learning data was acquired, the model of the device, and the like.
- the "n" in the datasets En and dtn is a natural number of 2 or more.
- the encoder model Mc extracts features of a plurality of dimensions from the training data included in the data set Ds.
- the decoder model Md receives a feature amount from the encoder model Mc and decodes the feature amount.
- the discriminative model Me receives the feature amount from the encoder model Mc and identifies the data set including the data from which the feature amount is extracted.
- the main object model Mm receives a feature amount from the encoder model Mc and makes an inference such as pattern recognition for the data from which the feature amount is extracted. Functions of the main purpose model Mm include, for example, image recognition, natural language processing, and voice recognition.
- The learning unit Ln adapts the encoder model Mc and the decoder model Md into a variational autoencoder by machine learning using the data set Ds, so that the features are distributed in a specific subspace of the feature space defined by the dimensions of the features output from the encoder model Mc.
- A ring loss can be used as a loss function in the optimization of the encoder model Mc, and the specific subspace can be made spherical by adapting the encoder model Mc and the decoder model Md into the variational autoencoder.
- A predetermined value may be used for the radius of the spherical surface, or the radius may be learned in the process of optimizing the encoder model Mc.
- Cross entropy may be used as the loss function to make the specific subspace a hyperplane.
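As a concrete illustration of the loss that pulls the features onto a spherical subspace, the ring loss penalizes the deviation of each feature vector's norm from a target radius. The following is a minimal sketch in plain Python; the weight value and function names are illustrative assumptions, not values taken from the disclosure.

```python
import math

def ring_loss(features, radius, weight=0.01):
    """Ring loss: mean squared deviation of each feature vector's L2 norm
    from the target radius of the spherical subspace (weight is assumed)."""
    norms = [math.sqrt(sum(x * x for x in f)) for f in features]
    return weight * 0.5 * sum((n - radius) ** 2 for n in norms) / len(norms)

# Features already on the sphere of radius 2 incur zero loss;
# a feature one unit inside the sphere is penalized.
on_sphere = [[2.0, 0.0], [0.0, 2.0]]
off_sphere = [[1.0, 0.0]]
print(ring_loss(on_sphere, radius=2.0))   # 0.0
print(ring_loss(off_sphere, radius=2.0))  # 0.005
```

Minimizing this term alongside the other losses drives the extracted features toward the spherical surface of the given radius.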
- the learning unit Ln performs machine learning on the encoder model Mc and the decoder model Md as shown in FIG.
- the encoder model Mc receives the learning data dtz and outputs the feature amount fz to the decoder model Md.
- the decoder model Md decodes the feature amount fz into data dcz.
- The learning unit Ln performs backpropagation so as to minimize the error Ls1 between the data dcz and the learning data dtz and to make the feature fz follow a standard normal distribution, and updates the weights and biases of the neural networks included in each of the encoder model Mc and the decoder model Md.
- As a result, the encoder model Mc and the decoder model Md are adapted into a trained variational autoencoder.
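The optimization of FIG. 2 can be sketched as the usual variational-autoencoder objective: the reconstruction error Ls1 plus a KL term that pulls the encoder's latent Gaussian toward the standard normal. This is a minimal sketch assuming a diagonal-Gaussian encoder output (mean and log-variance); the squared-error reconstruction is an assumption, since the disclosure only names the error Ls1.

```python
import math

def vae_loss(x, x_decoded, mu, log_var):
    """Reconstruction error Ls1 plus the KL divergence between the latent
    distribution N(mu, sigma^2) and the standard normal N(0, I)."""
    recon = sum((a - b) ** 2 for a, b in zip(x, x_decoded))            # Ls1
    kl = -0.5 * sum(1.0 + lv - m * m - math.exp(lv)
                    for m, lv in zip(mu, log_var))
    return recon + kl

# A perfect reconstruction whose latent code already matches N(0, I)
# contributes zero loss; a shifted mean is penalized by the KL term.
x = [1.0, 2.0]
print(vae_loss(x, x, mu=[0.0, 0.0], log_var=[0.0, 0.0]))  # 0.0
print(vae_loss([0.0], [0.0], mu=[1.0], log_var=[0.0]))    # 0.5
```

Backpropagating this loss through both models yields the weight and bias updates described above.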
- The learning unit Ln performs adversarial learning, as shown in FIG. 3, between the encoder model Mc and the discriminative model Me.
- the probability Ps is the probability that the discriminative model Me succeeds in identifying the correct data set Ex from which the training data dtx has been acquired.
- the probability Pf is the probability that the discriminative model Me fails to identify the correct data set Ex.
- The learning unit Ln optimizes the discriminative model Me so that the probability Ps is maximized. That is, the learning unit Ln performs backpropagation so as to minimize the error between the correct answer data corresponding to the label of the correct data set Ex and the output of the discriminative model Me, and updates the weights and biases of the neural network included in the discriminative model Me.
- The learning unit Ln optimizes the encoder model Mc so that the probability Pf is maximized. That is, the learning unit Ln performs backpropagation so as to maximize the error between the correct answer data corresponding to the label of the correct data set Ex and the output of the discriminative model Me, and updates the weights and biases of the neural network included in the encoder model Mc.
- In other words, the encoder model Mc is optimized to reduce the accuracy with which the discriminative model Me identifies the data set containing the training data, while the discriminative model Me is optimized to improve that accuracy.
- As a result, the bias specific to the environment in which the data contained in each of the data sets E1 to En was acquired is removed from the features output from the encoder model Mc.
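The opposing updates of FIG. 3 can be summarized by the sign of a single error term: the discriminator Me minimizes the cross-entropy of the environment label (raising Ps), while the encoder Mc minimizes its negation (raising Pf). This is an illustrative sketch only; the function names are assumptions, and the actual models update neural-network weights by backpropagation.

```python
import math

def cross_entropy(probs, label):
    """Error between the correct environment label and Me's output:
    the negative log-probability assigned to the correct data set."""
    return -math.log(probs[label])

def discriminator_loss(probs, label):
    # Minimized when training Me: drives up Ps, the probability of
    # correctly identifying the data set from which the data was acquired.
    return cross_entropy(probs, label)

def encoder_adversarial_loss(probs, label):
    # Minimized when training Mc: the negated error drives up Pf, i.e.
    # Mc is rewarded when Me cannot tell the environments apart.
    return -cross_entropy(probs, label)

probs = [0.7, 0.2, 0.1]  # Me's belief over three environments E1..E3
print(discriminator_loss(probs, 0) == -encoder_adversarial_loss(probs, 0))  # True
```

The two losses are exact negations, which is what makes the learning adversarial: any step that helps Me identify the environment hurts Mc, and vice versa.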
- the learning unit Ln performs machine learning as shown in FIG. 4 on the main object model Mm using the data set Ds.
- the encoder model Mc receives the learning data dty and outputs the feature amount fy to the main object model Mm.
- the main object model Mm receives the feature quantity fy and outputs the inference result dcy.
- the learning unit Ln performs backpropagation so as to minimize the error Ls2 between the inference result dcy and the correct answer data day, and updates the weight and bias of the neural network included in the main object model Mm.
- the main purpose model Mm becomes a trained model fitted to the datasets E1 to En.
- FIG. 5 is a block diagram showing the flow of inference processing by the trained encoder model Mc and the trained main purpose model Mm.
- The feature amount f_un of the data dt_un, which belongs to a data set E_un containing data acquired in an unknown environment, is input to the trained main purpose model Mm via the trained encoder model Mc.
- Depending on the degree of deviation between the conditions under which the data sets E1 to En were acquired and the conditions under which the data contained in the data set E_un was acquired, the main purpose model Mm may not function normally for the feature amount.
- The determination unit Jdg outputs the goodness of fit ad_un of the data dt_un to the data sets E1 to En, calculated from the positional relationship between the feature amount f_un and the specific subspace in the feature space.
- By referring to the goodness of fit ad_un, it is possible to clearly identify whether or not the data dt_un conforms to the data sets E1 to En.
- As a mode of output of the determination result, whether or not the trained main purpose model Mm can exhibit its performance for the data dt_un may be output; whether or not the trained main purpose model Mm can exhibit its performance for the environment determined by the conditions under which the data included in the data set E_un was acquired may be output from the goodness of fit of a plurality of data included in the data set E_un; or the reliability of the output result of the trained main purpose model Mm may be output from the goodness of fit of the data dt_un.
- FIG. 6 shows the distribution of the features output from the encoder model Mc of FIG. 5 when the inference processing performed by the main object model Mm is a logistic regression of multi-class classification using the softmax function.
- The feature space is drawn as a two-dimensional plane defined by the dimensions x1 and x2 for convenience of explanation, but the features output from the encoder model Mc may have three or more dimensions.
- the features output from the encoder model Mc are plotted as points.
- the specific subspace Sb1 is drawn as a circle with a radius R1. The same applies to FIG. 8 which will be described later.
- The characteristics of data conforming to the data sets E1 to En are reflected in the distribution of the features extracted from those data, and the norm of the features (the distance from the origin in the feature space) often has a certain magnitude. Therefore, the feature amounts of data acquired in environments conforming to the data sets E1 to En are often distributed in the vicinity of the specific subspace Sb1. On the other hand, the characteristics of data acquired in environments incompatible with the data sets E1 to En are often not reflected in the distribution of the extracted features, and their norms are often relatively small. Therefore, the feature amounts of such data are distributed near the origin.
- The distance δ between the feature amount f_un and the specific subspace Sb1 represents the proximity of the data dt_un to the data sets E1 to En. Therefore, the determination unit Jdg outputs the goodness of fit ad_un of the data dt_un to the data sets E1 to En based on the distance δ.
- The distance δ is the distance between the point P_un corresponding to the feature amount f_un and the intersection Psb between the specific subspace Sb1 and the straight line passing through the point P_un and the origin.
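Because Psb lies where the ray from the origin through P_un meets the sphere Sb1, the distance δ reduces to the absolute difference between the norm of f_un and the radius R1. The following is a minimal sketch under that geometric assumption:

```python
import math

def distance_to_sphere(f, radius):
    """Distance delta between a feature vector f (point P_un) and the
    spherical subspace Sb1, measured along the ray from the origin
    through P_un to the intersection Psb."""
    norm = math.sqrt(sum(x * x for x in f))
    return abs(norm - radius)

f_un = [3.0, 4.0]                      # norm 5
print(distance_to_sphere(f_un, 5.0))   # 0.0: f_un lies on Sb1
print(distance_to_sphere(f_un, 3.0))   # 2.0: two units outside Sb1
```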
- FIG. 7 is a flowchart showing the flow of the conformity determination process performed by the determination unit Jdg. In the following, each step is simply referred to as S. As shown in FIG. 7, the determination unit Jdg determines in S11 whether or not the distance δ is shorter than the threshold value δth. The threshold value δth can be appropriately determined by an actual machine experiment or a simulation.
- When the distance δ is shorter than the threshold value δth (YES in S11), the determination unit Jdg sets, in S12, a value (for example, TRUE) indicating that the data dt_un conforms to the data sets E1 to En to the goodness of fit ad_un, and the process proceeds to S14.
- Otherwise (NO in S11), the determination unit Jdg sets, in S13, a value (for example, FALSE) indicating that the data dt_un is incompatible with the data sets E1 to En to the goodness of fit ad_un, and the process proceeds to S14.
- The determination unit Jdg outputs the goodness of fit ad_un in S14 and ends the process.
- The case where the goodness of fit ad_un is set to one of two binary values indicating conformity or nonconformity has been described, but the goodness of fit ad_un may instead be set to a continuous value (for example, a percentage) according to the distance δ.
- The determination unit Jdg may output a method of changing the conditions under which the data dt_un is acquired in order to adapt the data dt_un to the data sets E1 to En.
- The determination unit Jdg back-calculates, by backpropagation through the encoder model Mc, the difference between the feature amount f_un and the feature amount extracted by the encoder model Mc from the training data of the data sets E1 to En, as the amount of change of the data input to the encoder model Mc.
- The determination unit Jdg then outputs a method of changing the conditions necessary for reducing the absolute value of the change amount. By executing the change method, the user can acquire data that fits the data sets E1 to En and for which the trained main purpose model Mm can exhibit its performance.
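For a toy linear encoder, the back-calculation above can be sketched as gradient descent on the squared feature gap; the actual device would obtain the same gradient by backpropagation through the trained encoder model Mc. The matrix, learning rate, and step count below are illustrative assumptions.

```python
# Toy linear "encoder" f(x) = W x standing in for the neural encoder Mc.
W = [[1.0, 0.0],
     [0.0, 2.0]]

def feature(x):
    return [sum(W[i][j] * x[j] for j in range(2)) for i in range(2)]

def input_change(x, f_ref, lr=0.1, steps=200):
    """Back-calculate the change to the input that shrinks the gap
    between its feature and a reference feature f_ref, i.e. the
    suggested change of the data-acquisition conditions."""
    x = list(x)
    x0 = list(x)
    for _ in range(steps):
        gap = [a - b for a, b in zip(feature(x), f_ref)]
        # Gradient of ||f(x) - f_ref||^2 with respect to x: 2 W^T gap.
        grad = [2.0 * sum(W[i][j] * gap[i] for i in range(2)) for j in range(2)]
        x = [xi - lr * g for xi, g in zip(x, grad)]
    return [a - b for a, b in zip(x, x0)]

dx = input_change([1.0, 1.0], f_ref=[2.0, 2.0])
print([round(v, 3) for v in dx])  # approximately [1.0, 0.0]
```

The returned change says how the input (and hence the acquisition conditions behind it) would have to shift for its feature to match the reference extracted from the known environments.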
- FIG. 8 shows the distribution of the features output from the encoder model Mc of FIG. 5 when the inference processing performed by the main object model Mm is a logistic regression of two-class classification using a sigmoid function.
- The sphere on which the features of data conforming to the data sets E1 to En are distributed and the sphere on which the features of non-conforming data are distributed can be clearly separated in the optimization of the encoder model Mc using the ring loss.
- The feature amounts of data conforming to the data sets E1 to En are distributed near the spherical surface Sb1 with radius R1, and the feature amounts of non-conforming data are distributed near the spherical surface Sb2 with radius R2.
- Therefore, the goodness of fit ad_un of the data dt_un to the data sets E1 to En can be calculated depending on whether the norm R of the feature amount f_un of the data dt_un is larger than the threshold value Rth, which is set larger than the radius R2.
- the threshold value Rth can be appropriately determined by an actual machine experiment or a simulation.
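The rule then reduces to a single norm comparison. This is a minimal sketch; representing the result as a Python boolean follows the TRUE/FALSE example values given for the goodness of fit.

```python
import math

def goodness_of_fit(f_un, r_th):
    """Conformity rule of FIG. 9: the data dt_un is judged to conform to
    the data sets E1 to En when the norm R of its feature f_un exceeds
    the threshold Rth, chosen above the radius R2 of the sphere on which
    non-conforming features are distributed."""
    norm = math.sqrt(sum(x * x for x in f_un))
    return norm > r_th

print(goodness_of_fit([3.0, 4.0], r_th=4.0))  # True  (norm 5, near Sb1)
print(goodness_of_fit([0.5, 0.5], r_th=4.0))  # False (small norm, near Sb2)
```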
- FIG. 9 is a flowchart showing the flow of another example of the conformity determination process performed by the determination unit Jdg of FIG.
- the flowchart shown in FIG. 9 is a flowchart in which S11 in FIG. 7 is replaced with S21.
- the determination unit Jdg determines in S21 whether or not the norm R is larger than the threshold value Rth.
- When the norm R is larger than the threshold value Rth (YES in S21), the determination unit Jdg determines that the data dt_un conforms to the data sets E1 to En, performs S12 and S14, and ends the process.
- Otherwise (NO in S21), the determination unit Jdg determines that the data dt_un is incompatible with the data sets E1 to En, performs S13 and S14, and ends the process.
- The conformity determination process by the determination unit Jdg is not limited to rule-based determination methods as shown in FIGS. 7 and 9, and may be a determination method using a trained model optimized by machine learning.
- Feature amounts distributed in the specific subspace Sb1, and invalid data in which noise is added to those feature amounts, can be used as learning data for the machine learning of the determination unit Jdg.
- FIG. 10 is a block diagram showing the configuration of the information processing device 100A according to the first modification of the embodiment.
- In the information processing device 100A, the determination unit Jdg is removed from the information processing device 100 of FIG. 1, and the encoder Enc, the decoder Dec, the discriminator Dsc, and the main purpose processor Mpr are replaced by an encoder EncA, a decoder DecA, a discriminator DscA, and a main purpose processor MprA, respectively.
- the information processing apparatus 100A performs learning processing as shown in FIGS. 2 to 4 on the encoder model Mc, the decoder model Md, the identification model Me, and the main object model Mm.
- FIG. 11 is a block diagram showing the configuration of the information processing device 100B according to the second modification of the embodiment.
- In the information processing device 100B, the storage unit Stg, the decoder Dec, and the discriminator Dsc are removed from the information processing device 100 of FIG. 1, and the encoder Enc and the main purpose processor Mpr are replaced by an encoder EncB and a main purpose processor MprB, respectively.
- the encoder EncB and the main purpose processor MprB include an encoder model Mc and a main purpose model Mm adapted by the information processing apparatus 100A, respectively.
- The information processing device 100B performs inference processing using the encoder model Mc and the main purpose model Mm, and the conformity determination processing shown in FIG. 7 or FIG. 9.
- FIG. 12 is a schematic view showing an example of the overall configuration of the visual inspection system 1 including the information processing apparatus 100 of FIG.
- the information processing device 100 functions as an image processing device.
- The visual inspection system 1 detects and classifies defects in an inspection target 2 (hereinafter also referred to as "work 2") conveyed on a production line or the like, based on an image obtained by imaging the work 2. That is, the main purpose processor Mpr of the information processing device 100 functions as an image classifier in the visual inspection system 1.
- the work 2 is conveyed in a predetermined direction by a conveying mechanism 6 such as a belt conveyor.
- the imaging unit 8 is arranged at a fixed position with respect to the work 2.
- the illumination light source 9 is arranged at a fixed relative position with respect to the image pickup unit 8.
- the illumination light source 9 illuminates at least the field of view of the imaging unit 8 (the range in which the work 2 can be located).
- the imaging unit 8 images the moving work 2.
- the image data obtained by the imaging unit 8 is transmitted to the information processing device 100.
- The orientation of the imaging unit 8 and the light amount, number of installations, and arrangement positions of the illumination light sources 9 can be conditions that determine the environment in which the learning data is acquired. It is preferable that the light amount, number of installations, arrangement positions, and the like be optimized so that the imaging is not disturbed by the surrounding lighting environment.
- The detection sensor 4 includes a light receiving unit 4a and a light projecting unit 4b arranged on the same optical axis, and detects the arrival of the work 2 by detecting, at the light receiving unit 4a, that the light emitted from the light projecting unit 4b has been blocked by the work 2.
- the detection signal of the detection sensor 4 (hereinafter, also referred to as “trigger signal”) is output to the PLC (Programmable Logic Controller) 5.
- the PLC 5 receives a trigger signal from the detection sensor 4 and the like, and controls the transport mechanism 6 itself.
- the visual inspection system 1 further includes an information processing device 100, a display 102, and a mouse 104.
- the information processing device 100 is connected to the PLC 5, the imaging unit 8, the display 102, and the mouse 104.
- The imaging unit 8 includes, in addition to an optical system such as a lens, an image sensor partitioned into a plurality of pixels, such as a CCD (Charge-Coupled Device) or CMOS (Complementary Metal-Oxide-Semiconductor) sensor.
- the information processing device 100 is a computer having a general-purpose architecture.
- the information processing device 100 realizes various functions such as machine learning and defect classification by executing a pre-installed program.
- The program that realizes the functions of the information processing apparatus 100 may execute processing by calling necessary modules, in a predetermined arrangement and at predetermined timing, from among the program modules provided as part of the OS (Operating System).
- In that case, the program itself does not include the above-mentioned modules, and the processing is executed in cooperation with the OS. The program may thus be in a form that does not include some of such modules.
- the program that realizes the function of the information processing apparatus 100 may be provided by being incorporated into a part of another program. Even in that case, the program itself does not include the modules included in the other programs to be combined as described above, and the processing is executed in cooperation with the other programs. That is, the program that realizes the function of the information processing device 100 may be in a form incorporated in such another program. Note that some or all of the functions provided by executing the program may be implemented as a dedicated hardware circuit.
- FIG. 13 is a schematic diagram for explaining an image example of the work 2 acquired by the imaging unit 8 of FIG.
- FIG. 13A is an image of the work 2 having no defects.
- FIG. 13B is an image of the work 2 having the chipped D1.
- FIG. 13C is an image of the work 2 having the scratch D2.
- Defects that can occur in the work 2 are not limited to chips and scratches, but also include, for example, dents and deformations.
- the information processing apparatus 100 classifies the images of the work 2 according to the types of defects occurring in the work 2 (no defects, chips, scratches, dents, deformations, etc.).
- FIG. 14 is a schematic configuration diagram of the information processing device 100 of FIG.
- The information processing apparatus 100 includes a processor 110 as an arithmetic processing unit, a main memory 112 and a hard disk 114 as storage units, a camera interface 116, an input interface 118, a display controller 120, a PLC interface 122, a communication interface 124, and a data reader/writer 126. These units are connected to each other via a bus 128 so as to be capable of data communication.
- the processor 110 includes a CPU (Central Processing Unit).
- the processor 110 may further include a GPU (Graphics Processing Unit).
- the processor 110 expands the programs (codes) stored in the hard disk 114 into the main memory 112 and executes them in a predetermined order to perform various operations.
- the main memory 112 is typically a volatile storage device such as a DRAM (Dynamic Random Access Memory). In addition to the program read from the hard disk 114, the main memory 112 holds image data acquired by the imaging unit 8, data indicating the processing result of the image data, work data, and the like.
- the hard disk 114 is a non-volatile magnetic storage device.
- the hard disk 114 stores data sets E1 to En, a main purpose model Mm, an encoder model Mc, a decoder model Md, and an identification model Me, a machine learning program Pg1, and a defect identification program Pg2.
- the main object model Mm functions as an image identification model.
- Various setting values and the like may be stored in the hard disk 114.
- the program installed on the hard disk 114 is distributed in a state of being stored in a memory card 106 or the like, as will be described later.
- Instead of the hard disk 114, a semiconductor storage device such as a flash memory may be adopted.
- Each of the plurality of training data included in the data set Ds is an image of the work 2 labeled for each defect type.
- each of the plurality of learning data is also labeled according to the environment in which each learning data was acquired.
- the image of the work 2 may be an image taken by the image pickup unit 8 of FIG. 12 or an image taken by another image pickup device.
- The conditions under which the learning data is acquired include, for example, the orientation of the imaging unit 8 and the amount of light, the number, and the arrangement of the illumination light sources 9.
- The machine learning program Pg1 refers to the data set Ds, the encoder model Mc, the decoder model Md, the discriminative model Me, and the main purpose model Mm.
- the processor 110 that executes the machine learning program Pg1 realizes the encoder Enc and decoder Dec of FIG. 2, the encoder Enc and discriminator Dsc of FIG. 3, and the main purpose processor Mpr of FIG.
- By executing the machine learning program Pg1, the processor 110 adapts each of the encoder model Mc, the decoder model Md, the discriminative model Me, and the main purpose model Mm into a trained model.
- the processor 110 that executes the defect identification program Pg2 realizes the encoder Enc of FIG. 5, the main purpose processor Mpr, and the determination unit Jdg.
- the processor 110 identifies defects in the image acquired by the imaging unit 8 by executing the defect identification program Pg2, and outputs the identification result to the display 102.
- the processor 110 outputs to the display 102 the goodness of fit of the environment in which the image is acquired to the data sets E1 to En.
- the goodness of fit is calculated by the determination unit Jdg according to the degree of similarity between the condition from which the training data is acquired and the condition from which the data input to the main purpose processor Mpr in the inference process is acquired.
- The determination unit Jdg may output to the display 102 whether or not the input data fits the environment determined by the conditions under which the learning data was acquired.
- The processor 110 determines how to change the conditions under which the input data is acquired in order to adapt the input data to the data sets E1 to En, and outputs the change method to the display 102.
- the change method includes, for example, changing the amount of light of the illumination light source 9, the number of installations, the arrangement, or the orientation of the imaging unit 8.
- The camera interface 116 mediates data transmission between the processor 110 and the imaging unit 8. That is, the camera interface 116 is connected to the imaging unit 8, which images the work 2 and generates image data. More specifically, the camera interface 116 can be connected to one or more imaging units 8 and includes an image buffer 116a for temporarily accumulating image data from the imaging unit 8. When image data for at least one frame is accumulated in the image buffer 116a, the camera interface 116 transfers the accumulated data to the main memory 112. Further, the camera interface 116 gives an imaging command to the imaging unit 8 according to an internal command generated by the processor 110.
- the input interface 118 mediates data transmission between the processor 110 and input units such as a mouse 104, a keyboard, and a touch panel. That is, the input interface 118 receives an operation command given by the user operating the input unit.
- the display controller 120 is connected to a display 102, which is a typical example of a display device, and notifies the user of the result of image processing in the processor 110 and the like. That is, the display controller 120 is connected to the display 102 and controls the display on the display 102.
- the display 102 is, for example, a liquid crystal display, an organic EL (Electro Luminescence) display, or other display device.
- the PLC interface 122 mediates data transmission between the processor 110 and the PLC 5. More specifically, the PLC interface 122 transmits information related to the state of the production line controlled by the PLC 5 and information related to the work to the processor 110.
- the communication interface 124 mediates data transmission between the processor 110 and a console (or a personal computer or server device).
- the communication interface 124 typically comprises Ethernet (registered trademark), USB (Universal Serial Bus), or the like.
- A program downloaded from a distribution server or the like may be installed in the information processing device 100 via the communication interface 124.
- the data reader / writer 126 mediates data transmission between the processor 110 and the memory card 106, which is a recording medium. That is, the memory card 106 is distributed in a state in which a program or the like executed by the information processing device 100 is stored, and the data reader / writer 126 reads the program from the memory card 106. Further, the data reader / writer 126 writes the image data acquired by the imaging unit 8 and / or the processing result in the information processing apparatus 100 to the memory card 106 in response to the internal command of the processor 110.
- The memory card 106 is, for example, a general-purpose semiconductor storage device such as CF (Compact Flash) or SD (Secure Digital), a magnetic storage medium such as a flexible disk, or an optical storage medium such as a CD-ROM (Compact Disk Read Only Memory).
- Another output device such as a printer may be connected to the information processing device 100, if necessary.
- the system to which the information processing device according to the embodiment can be applied is not limited to the visual inspection system.
- Examples of the system include an automatic driving system that detects objects such as pedestrians and vehicles, and a medical diagnosis system.
- In the automatic driving system, the presence or absence of an obstacle such as a person or another automobile is identified in image data taken from the driver's seat of the automobile.
- Conditions that determine the environment in which the image was acquired include, for example, the time zone, the weather, and the season.
- In the medical diagnosis system, for example, daily photographs, X-ray images, and CT (Computed Tomography) images of patients are input to identify the patient's illness and physical condition.
- Conditions that determine the environment in which the data was obtained include, for example, the device from which the data was obtained, the patient's age, gender, physique, constitution, nationality, and medical history.
- In the medical diagnosis system, it is also determined whether or not a diagnosis for the patient is possible.
- In a face recognition system, a person is identified from a face image, and the device that acquired the face image and the age, gender, and nationality of the person are included in the conditions that determine the environment in which the face image was acquired.
- In a voice recognition system, a person is identified from voice, and the device that acquired the voice and the age, gender, and nationality of the speaker are included in the conditions that determine the environment in which the voice was acquired.
- In a system that identifies the meaning of a character string, the language from which the character string is derived, the age, gender, and nationality of the person who wrote the character string, and the field of the content that the character string expresses are included in the conditions that determine the environment in which the character string was acquired.
- In the traffic condition monitoring system, the presence or absence of traffic congestion is identified based on a road image, and the location of the road and the season, time zone, weather, and road surface condition when the road image was acquired are included in the conditions that determine the environment in which the road image was acquired.
- In a robot, actions are identified based on image and voice recognition, and the location where the robot is placed and the performance and type of the sensors provided in the robot are included in the conditions that determine the environment in which the images and sounds are acquired.
- According to the information processing apparatus according to the embodiment, it is possible to identify an environment in which it is difficult for the trained model to exhibit the performance obtained by machine learning.
- An information processing device (100, 100B) comprising: an encoder (Enc) including an encoder model (Mc) that outputs feature amounts (fx) of a plurality of dimensions (x1, x2) from training data (dtx) included in at least one data set (E1 to En); and
- a determination unit (Jdg) that receives a specific feature amount (f_un) of at least one input data (dt_un) from the encoder model (Mc) and outputs the goodness of fit (ad_un) of the at least one input data to the at least one data set (E1 to En), wherein
- the encoder model (Mc) is adapted by machine learning so that the feature amounts (fx) are distributed in a specific subspace (Sb1) of the feature amount space defined by the plurality of dimensions (x1, x2), and
- the determination unit (Jdg) calculates the goodness of fit (ad_un) from the positional relationship between the specific feature amount (f_un) and the specific subspace (Sb1).
- The information processing apparatus (100, 100B) according to configuration 1, wherein the determination unit (Jdg) back-calculates, via the encoder model (Mc), the difference between the specific feature amount (f_un) and the feature amount (fx) extracted from the training data (dtx) by the encoder model (Mc) as a change amount of the at least one input data (dt_un), and outputs a method of changing the conditions under which the at least one input data (dt_un) is acquired so as to reduce the absolute value of the change amount.
- The information processing apparatus (100) according to configuration 1 or 2, further comprising a storage unit (Stg) in which the at least one data set (E1 to En) is stored, and a learning unit (Ln) that adapts the encoder model (Mc) by machine learning using the at least one data set so that the feature amounts (fx) are distributed in the specific subspace (Sb1).
- The information processing apparatus (100, 100A) according to configuration 3 or 4, further comprising a decoder (Dec) including a decoder model (Md) that decodes the feature amount (fz), wherein the learning unit (Ln) adapts the encoder model (Mc) and the decoder model (Md) by machine learning using the at least one data set (E1 to En) so that the feature amount (fz) follows a standard normal distribution.
- The information processing apparatus (100, 100A) according to configuration 5, further comprising a classifier (Dsc) including a discriminative model (Me) that identifies which of the at least one data set (E1 to En) the training data (dty) is included in, wherein
- the machine learning is adversarial learning performed between the discriminative model (Me) and the encoder model (Mc),
- the discriminative model (Me) is optimized so as to maximize the probability (Ps) that the discriminative model (Me) succeeds in identifying the correct data set (Ey) including the training data (dty), and
- the encoder model (Mc) is optimized so as to maximize the probability that the discriminative model (Me) fails to identify the correct data set (Ey).
Abstract
Description
This disclosure relates to an information processing device.
Conventionally, a configuration is known in which a feature amount to be input to a trained model is extracted from input data. For example, Non-Patent Document 1 discloses a multi-task adversarial network (MTAN). In MTAN, the feature representation of an image labeled with both a content label and a style label is generalized with respect to style factors through adversarial learning in style identification of the image. According to MTAN, the feature representation of an image containing an unknown style factor can be generalized without causing confusion in content identification of the image.
Data acquired under conditions different from those under which the training data was acquired can be input to the trained model. Even if the attributes of the data are the same, the feature amounts of the data can differ greatly depending on the conditions under which the data was acquired. Depending on the degree of divergence between the two sets of conditions, the performance of the trained model on the input data can be significantly reduced. However, the MTAN disclosed in Non-Patent Document 1 does not take into account the performance degradation of the trained model caused by such a divergence.
The present disclosure has been made to solve the above-mentioned problems, and its object is to determine, based on the input data to a trained model, whether or not the trained model can exhibit the performance obtained by machine learning.
According to an example of the present disclosure, an information processing device includes an encoder and a determination unit. The encoder includes an encoder model that outputs feature amounts of a plurality of dimensions from training data contained in at least one data set. The determination unit receives a specific feature amount of at least one input data from the encoder model and outputs the goodness of fit of the at least one input data to the at least one data set. The encoder model is adapted by machine learning so as to distribute the feature amounts in a specific subspace of the feature amount space defined by the plurality of dimensions. The determination unit calculates the goodness of fit of the at least one input data to the at least one data set from the positional relationship between the specific feature amount and the specific subspace.
According to this disclosure, by referring to the goodness of fit calculated by the determination unit, it is possible to identify whether or not the input data fits at least one environment (a known environment) in which the training data was acquired. As a result, it is possible to identify whether or not a trained model adapted by machine learning using training data acquired in the known environment functions normally on the input data. That is, whether or not the trained model can exhibit the performance obtained by machine learning can be determined based on the input data to the trained model.
In the above disclosure, the determination unit may back-calculate, via the encoder model, the difference between the specific feature amount and the feature amount extracted from the training data by the encoder model as a change amount of the at least one input data, and may output a method of changing the conditions under which the at least one input data is acquired so as to reduce the absolute value of the change amount.
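The back-calculation described above can be illustrated numerically. In the sketch below, the encoder is a simple linear map f(x) = Wx standing in for the actual encoder model (which is a neural network); for a general differentiable encoder the same idea applies via gradient steps instead of an exact solve. All names and values here are hypothetical:

```python
import numpy as np

# Hypothetical stand-in for the trained encoder model Mc: f(x) = W @ x.
W = np.array([[1.0, 0.5],
              [0.2, 1.0]])

def encode(x):
    return W @ x

# Input data acquired under unknown conditions, its feature f_un, and a
# reference feature from the training data (data sets E1..En).
x_un = np.array([0.3, -0.2])
f_un = encode(x_un)
f_ref = np.array([1.0, 1.0])

# Back-calculate the input change dx that closes the feature-space gap.
# For a linear encoder this is exact: dx = W^{-1} (f_ref - f_un).
dx = np.linalg.solve(W, f_ref - f_un)

# Applying the change removes the gap in feature space, so the magnitude of
# dx indicates how far the acquisition conditions must be shifted.
gap_before = np.linalg.norm(f_ref - f_un)
gap_after = np.linalg.norm(f_ref - encode(x_un + dx))
```

The components of `dx` play the role of the change amount whose absolute value the output change method is meant to reduce.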
According to this disclosure, even if the input data does not fit the known environment, the conditions of the environment in which the input data is acquired can be adapted to those of the known environment. As a result, the trained model can exhibit the performance obtained by machine learning even on input data acquired in an unknown environment.
In the above disclosure, the information processing device may further include a storage unit and a learning unit. At least one data set may be stored in the storage unit. The learning unit may adapt the encoder model by machine learning using the at least one data set so that the feature amounts are distributed in the specific subspace.
According to this disclosure, an encoder model on which machine learning has not yet been performed can be adapted into a trained encoder model by the learning unit.
According to an example of the present disclosure, an information processing device includes a storage unit, an encoder, and a learning unit. At least one data set is stored in the storage unit. The encoder includes an encoder model that extracts feature amounts of a plurality of dimensions from training data contained in the data set. The learning unit adapts the encoder model by machine learning using the data set so that the feature amounts are distributed in a specific subspace of the feature amount space defined by the plurality of dimensions.
According to this disclosure, an encoder model on which machine learning has not yet been performed can be adapted into a trained encoder model by the learning unit.
In the above disclosure, the information processing device may further include a decoder. The decoder may include a decoder model that decodes the feature amounts from the encoder. The learning unit may adapt the encoder model and the decoder model by machine learning using the at least one data set so that the feature amounts follow a standard normal distribution.
According to this disclosure, the encoder model and the decoder model are adapted into a variational auto-encoder (VAE) by machine learning, and the encoder model is adapted so that the feature amounts extracted from the training data are distributed uniformly in the specific subspace. As a result, the position in the feature amount space of the feature amount extracted from the input data differs clearly depending on whether or not the input data fits the at least one data set. Consequently, whether or not the trained model can exhibit the performance obtained by machine learning can be determined more accurately based on the input data to the trained model.
In the above disclosure, the information processing device may further include a classifier. The classifier may include a discriminative model that identifies which of the at least one environment the training data belongs to. The machine learning may be adversarial learning performed between the discriminative model and the encoder model. In the adversarial learning, the discriminative model may be optimized so as to maximize the probability that the discriminative model succeeds in identifying the correct environment in which the training data was acquired, and the encoder model may be optimized so as to maximize the probability that the discriminative model fails to identify the correct environment.
According to this disclosure, the adversarial learning removes, from the feature amounts extracted from the training data by the encoder model, the bias peculiar to the environment in which the data contained in each of the at least one data set was acquired. Since the feature amounts output from the encoder model emphasize features common to the known environments, it becomes easier to determine, based on the input data to the trained model, whether or not the trained model can exhibit the performance obtained by machine learning.
In the above disclosure, the specific subspace may be a spherical surface centered on the origin of the feature amount space.
According to this disclosure, by making the specific subspace a spherical surface, machine learning of the encoder tends to produce a clear difference between the distance from the origin of the feature amounts of data acquired in an environment that fits the known environments and that of data acquired in an environment that does not. As a result, whether or not the trained model can exhibit the performance obtained by machine learning can be determined more clearly based on the input data to the trained model.
According to the present disclosure, it is possible to identify an environment in which it is difficult for the trained model to exhibit the performance obtained by machine learning.
Hereinafter, the embodiment will be described in detail with reference to the drawings. In principle, the same or corresponding parts in the drawings are designated by the same reference numerals, and their description is not repeated.
<Application example>
FIG. 1 is a block diagram showing the configuration of an information processing device 100 according to the embodiment. As shown in FIG. 1, the information processing device 100 includes a storage unit Stg, a learning unit Ln, an encoder Enc, a decoder Dec, a classifier Dsc, a main purpose processor Mpr, and a determination unit Jdg. Data sets E1 to En are stored in the storage unit Stg. The encoder Enc includes an encoder model Mc. The decoder Dec includes a decoder model Md. The classifier Dsc includes a discriminative model Me. The main purpose processor Mpr includes a main purpose model Mm. Each of the encoder model Mc, the decoder model Md, the discriminative model Me, and the main purpose model Mm includes a neural network. The decoder Dec does not have to be included in the information processing device 100.
The data sets E1 to En contain training data dt1 to dtn, respectively. The training data dt1 to dtn are labeled with the data sets E1 to En to which they belong, and include correct answer data corresponding to the output of the main purpose processor Mpr. The label attached to each of the data sets E1 to En corresponds to the environment in which the data contained in that data set was acquired. The environment, also called a domain, is determined by the conditions under which the data is acquired. Differences in these conditions can be a factor (bias) that causes the feature amounts of training data of the same attribute to be distributed differently. The conditions include, for example, the date and time when the training data was acquired, the place where it was acquired, the settings of the device that acquired it, and the model of that device. The "n" in En and dtn is a natural number of 2 or more.
The encoder model Mc extracts feature amounts of a plurality of dimensions from the training data included in the data set Ds. The decoder model Md receives a feature amount from the encoder model Mc and decodes it. The discriminative model Me receives a feature amount from the encoder model Mc and identifies the data set containing the data from which the feature amount was extracted. The main purpose model Mm receives a feature amount from the encoder model Mc and performs inference, such as pattern recognition, on the data from which the feature amount was extracted. The functions of the main purpose model Mm include, for example, image recognition, natural language processing, and voice recognition.
By machine learning using the data set Ds, the learning unit Ln adapts the encoder model Mc and the decoder model Md into a variational auto-encoder so that the feature amounts are distributed in a specific subspace of the feature amount space defined by the dimensions of the feature amounts output from the encoder model Mc. By using a ring loss as the loss function in the optimization of the encoder model Mc, and by adapting the encoder model Mc and the decoder model Md into an auto-encoder, the specific subspace can be made a spherical surface. The radius of the spherical surface may be a predetermined value, or may be learned in the process of optimizing the encoder model Mc. Cross entropy may be used as the loss function to make the specific subspace a hyperplane.
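The ring-loss idea above can be sketched minimally, under the assumption that the loss penalizes the deviation of each feature vector's norm from the target radius (function and variable names are illustrative, not from the patent):

```python
import numpy as np

R = 1.0  # target radius of the spherical subspace (may also be learned)

def ring_loss(features, radius=R):
    """Mean squared deviation of each feature vector's norm from the radius.

    Minimizing this term pushes the encoder's outputs onto a sphere of the
    given radius centered at the origin of the feature space."""
    norms = np.linalg.norm(features, axis=1)
    return np.mean((norms - radius) ** 2)

on_sphere = np.array([[1.0, 0.0], [0.0, -1.0]])   # norms exactly R
off_sphere = np.array([[0.1, 0.0], [3.0, 4.0]])   # norms 0.1 and 5.0

loss_on = ring_loss(on_sphere)
loss_off = ring_loss(off_sphere)  # ((0.1-1)^2 + (5-1)^2) / 2 = 8.405
```

Features already on the sphere incur zero loss, so during training the gradient only acts on features whose norm deviates from the radius.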
The learning unit Ln performs machine learning on the encoder model Mc and the decoder model Md as shown in FIG. 2. The encoder model Mc receives training data dtz and outputs a feature amount fz to the decoder model Md. The decoder model Md decodes the feature amount fz into data dcz. The learning unit Ln performs backpropagation so as to minimize the error Ls1 between the data dcz and the training data dtz and so that the feature amount fz follows a standard normal distribution, and updates the weights and biases of the neural networks included in the encoder model Mc and the decoder model Md. By this machine learning, the encoder model Mc and the decoder model Md are adapted into a trained auto-encoder.
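The two objectives of this step — minimizing the reconstruction error Ls1 and making the feature amount fz follow a standard normal distribution — can be sketched as the loss terms of a variational auto-encoder. This is a sketch under the common assumption of a diagonal-Gaussian feature distribution; the names are illustrative:

```python
import numpy as np

def vae_losses(x, x_rec, mu, log_var):
    """Two terms minimized in the Fig. 2 training step.

    ls1: reconstruction error between input x and decoded output x_rec.
    kl:  KL divergence of the diagonal Gaussian N(mu, exp(log_var)) from
         N(0, I), which pulls the feature distribution toward standard normal.
    """
    ls1 = np.mean((x - x_rec) ** 2)
    kl = 0.5 * np.sum(mu ** 2 + np.exp(log_var) - log_var - 1.0)
    return ls1, kl

x = np.array([0.5, -0.5])
# Perfect reconstruction with features already standard-normal: both terms 0.
ls1, kl = vae_losses(x, x, mu=np.zeros(2), log_var=np.zeros(2))
```

In an actual training loop, the sum of these terms would be backpropagated through both the decoder and the encoder to update their weights and biases.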
The learning unit Ln performs adversarial learning between the encoder model Mc and the discriminative model Me as shown in FIG. 3. In FIG. 3, the probability Ps is the probability that the discriminative model Me succeeds in identifying the correct data set Ex from which the training data dtx was acquired. The probability Pf is the probability that the discriminative model Me fails to identify the correct data set Ex.
As shown in FIG. 3, the learning unit Ln optimizes the discriminative model Me so that the probability Ps is maximized. That is, the learning unit Ln performs backpropagation so as to minimize the error between the correct answer data corresponding to the label of the correct data set Ex and the output of the discriminative model Me, and updates the weights and biases of the neural network included in the discriminative model Me.
Conversely, the learning unit Ln optimizes the encoder model Mc so that the probability Pf is maximized. That is, the learning unit Ln performs backpropagation so as to maximize the error between the correct answer data corresponding to the label of the correct data set Ex and the output of the discriminative model Me, and updates the weights and biases of the neural network included in the encoder model Mc.
In the adversarial learning, the encoder model Mc is optimized so as to reduce the accuracy with which the discriminative model Me identifies the data set containing the training data, while the discriminative model Me is optimized so as to improve that accuracy. Through the adversarial learning, the environment-specific bias of the data contained in each of the data sets E1 to En is removed from the feature amounts output from the encoder model Mc.
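The opposing objectives — the discriminative model Me maximizing Ps and the encoder model Mc maximizing Pf — can be sketched as a pair of cross-entropy-style losses. A real implementation would alternate gradient updates between the two models; all names here are illustrative:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - np.max(z))
    return e / e.sum()

def discriminator_loss(logits, correct):
    """Minimized for the discriminative model Me: drives Ps, the probability
    assigned to the correct data set, toward 1."""
    return -np.log(softmax(logits)[correct])

def encoder_adversarial_loss(logits, correct):
    """Minimized for the encoder model Mc: drives Pf = 1 - Ps toward 1,
    i.e. makes the data set unidentifiable from the features."""
    return -np.log(1.0 - softmax(logits)[correct])

confident = np.array([5.0, 0.0, 0.0])   # discriminator sure it is data set E1
uncertain = np.array([0.0, 0.0, 0.0])   # discriminator cannot tell

# The discriminator's loss falls as Ps rises, while the encoder's adversarial
# loss rises -- the two objectives pull in opposite directions.
d_conf, d_unc = discriminator_loss(confident, 0), discriminator_loss(uncertain, 0)
e_conf, e_unc = encoder_adversarial_loss(confident, 0), encoder_adversarial_loss(uncertain, 0)
```

At the equilibrium of this game the discriminator's output carries no information about the data set, which is exactly the removal of the environment-specific bias described above.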
The learning unit Ln performs machine learning on the main purpose model Mm using the data set Ds, as shown in FIG. 4. The encoder model Mc receives training data dty and outputs a feature amount fy to the main purpose model Mm. The main purpose model Mm receives the feature amount fy and outputs an inference result dcy. The learning unit Ln performs backpropagation so as to minimize the error Ls2 between the inference result dcy and the correct answer data day, and updates the weights and biases of the neural network included in the main purpose model Mm. By this machine learning, the main purpose model Mm becomes a trained model fitted to the data sets E1 to En.
FIG. 5 is a block diagram showing the flow of inference processing by the trained encoder model Mc and the trained main purpose model Mm. As shown in FIG. 5, when a feature amount f_un of data dt_un from a data set E_un containing data acquired in an unknown environment is input to the trained main purpose model Mm via the trained encoder model Mc, the main purpose model Mm may not function normally on that feature amount, depending on the degree of divergence between the conditions under which the data sets E1 to En were acquired and the conditions under which the data contained in the data set E_un was acquired.
Therefore, the information processing device 100 outputs, for the feature f_un derived from the data set E_un, the goodness of fit ad_un of the data dt_un to the data sets E1 to En, based on the positional relationship between the feature f_un and the specific subspace in the feature space. By referring to the goodness of fit ad_un, whether or not the data dt_un fits the data sets E1 to En can be clearly identified. According to the information processing device 100, whether or not the trained main purpose model Mm can exhibit the performance obtained through machine learning can be determined on the basis of the data dt_un. As the form of output of the determination result, whether or not the trained main purpose model Mm can exhibit its performance for the data dt_un may be output, or it may be output, from the goodness of fit of a plurality of data items contained in the data set E_un, whether or not the trained main purpose model Mm can exhibit its performance for the environment determined by the conditions under which the data contained in the data set E_un was acquired. The reliability of the output of the trained main purpose model Mm may also be output from the goodness of fit of the data dt_un.
FIG. 6 shows the distribution of the features output from the encoder model Mc of FIG. 5 when the inference processing performed by the main purpose model Mm is multi-class logistic regression using the softmax function. In FIG. 6, the feature space is drawn as a two-dimensional plane defined by dimensions x1 and x2 for convenience of explanation, but the features output from the encoder model Mc may have three or more dimensions. In FIG. 6, the features output from the encoder model Mc are plotted as points. The specific subspace Sb1 is drawn as a circle of radius R1. The same applies to FIG. 8, described later.
Referring to FIG. 6, the characteristics of data that fits the data sets E1 to En are reflected in the distribution of the features extracted from that data, and the norm of such a feature (its distance from the origin in the feature space) is often of a certain magnitude. Therefore, the features of data acquired in an environment that fits the data sets E1 to En tend to be distributed in the vicinity of the specific subspace Sb1. In contrast, the characteristics of data acquired in an environment that does not fit the data sets E1 to En are often not reflected in the distribution of the extracted features, and the norm of such a feature tends to be relatively small. Therefore, the features of data acquired in an environment that does not fit the data sets E1 to En are distributed near the origin.
The distance δ between the feature f_un of the data dt_un and the specific subspace Sb1 represents the proximity of the data dt_un to the data sets E1 to En. The determination unit Jdg therefore outputs the goodness of fit ad_un of the data dt_un to the data sets E1 to En on the basis of the distance δ. In FIG. 6, the distance δ is the distance between the point P_un corresponding to the feature f_un and the intersection Psb of the specific subspace Sb1 with the straight line passing through the point P_un and the origin.
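When the specific subspace Sb1 is a sphere of radius R1 centered at the origin (as in Structure 7), the intersection Psb lies on the ray from the origin through P_un, so δ reduces to the difference between the feature's norm and R1. A minimal sketch under that assumption:

```python
import numpy as np

def distance_to_subspace(f_un, R1):
    """Distance δ between the point P_un for feature f_un and the point Psb
    where the line through P_un and the origin crosses the sphere Sb1 of
    radius R1 centered at the origin."""
    return abs(np.linalg.norm(f_un) - R1)
```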
FIG. 7 is a flowchart showing the flow of the fit determination processing performed by the determination unit Jdg of FIG. 5. Hereinafter, each step is simply denoted by S. As shown in FIG. 7, the determination unit Jdg determines in S11 whether or not the distance δ is smaller than a threshold value δth. The threshold value δth can be determined as appropriate through experiments on actual equipment or through simulation.
When the distance δ is smaller than the threshold value δth (YES in S11), the determination unit Jdg sets the goodness of fit ad_un in S12 to a value indicating that the data dt_un fits the data sets E1 to En (for example, TRUE), and advances the processing to S14. When the distance δ is equal to or greater than the threshold value δth (NO in S11), the determination unit Jdg sets the goodness of fit ad_un in S13 to a value indicating that the data dt_un does not fit the data sets E1 to En (for example, FALSE), and advances the processing to S14.
The determination unit Jdg outputs the goodness of fit ad_un in S14 and ends the processing. Although FIG. 7 has been described for the case where the goodness of fit ad_un is set to one of two values indicating fit or non-fit, the goodness of fit ad_un may instead be set to a continuous value (for example, a percentage) according to the distance δ.
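The S11 to S14 decision, including the continuous-valued variant mentioned above, might look like the following; the linear fall-off used for the percentage is an illustrative assumption:

```python
def goodness_of_fit(delta, delta_th, continuous=False):
    """Map the distance delta to the goodness of fit ad_un: either the
    TRUE/FALSE decision of S11-S13, or a continuous percentage that
    decreases with delta."""
    if not continuous:
        return delta < delta_th   # TRUE: fits E1..En, FALSE: does not fit
    return max(0.0, 100.0 * (1.0 - delta / delta_th))
```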
If NO in S11, the determination unit Jdg may output a method of changing the conditions under which the data dt_un was acquired in order to make the data dt_un fit the data sets E1 to En. In this case, the determination unit Jdg back-calculates, by backpropagation, the difference between the feature f_un and the features extracted by the encoder model Mc from the training data of the data sets E1 to En, as an amount of change to the data input to the encoder model Mc. The determination unit Jdg outputs the method of changing the conditions that is needed to reduce the absolute value of this amount of change. By carrying out this change, the user becomes able to acquire data that fits the data sets E1 to En and for which the trained main purpose model Mm can exhibit its performance.
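The patent back-calculates the required input change by backpropagation through the encoder model Mc. As a dependency-free sketch, the same direction can be obtained with a finite-difference gradient of the feature mismatch; `encoder` here is any callable standing in for Mc, and the squared-error mismatch is an assumption of this sketch:

```python
import numpy as np

def input_change_direction(encoder, x, f_target, eps=1e-5):
    """Numerically back-calculate how the input data should change so that
    its feature moves toward the training-data features f_target. The patent
    uses backpropagation through Mc; a central finite-difference gradient is
    used here to keep the sketch dependency-free."""
    grad = np.zeros_like(x)
    for i in range(x.size):
        d = np.zeros_like(x)
        d[i] = eps
        hi = np.sum((encoder(x + d) - f_target) ** 2)
        lo = np.sum((encoder(x - d) - f_target) ** 2)
        grad[i] = (hi - lo) / (2 * eps)
    return -grad  # step direction that reduces the feature mismatch
```

Moving the input a small step along the returned direction reduces the gap between its feature and the training-data features; the determination unit then translates that required change into acquisition-condition adjustments.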
FIG. 8 shows the distribution of the features output from the encoder model Mc of FIG. 5 when the inference processing performed by the main purpose model Mm is two-class logistic regression using the sigmoid function. Referring to FIG. 8, in the two-class case, the sphere near which the features of data fitting the data sets E1 to En are distributed and the sphere near which the features of non-fitting data are distributed can be clearly separated by optimizing the encoder model Mc with a ring loss. In FIG. 8, the features of data fitting the data sets E1 to En are distributed near the sphere Sb1 of radius R1, and the features of data not fitting the data sets E1 to En are distributed near the sphere Sb2 of radius R2. In such a case, the goodness of fit ad_un of the data dt_un to the data sets E1 to En can be calculated according to whether or not the norm R of the feature f_un of the data dt_un is larger than a threshold value Rth that is larger than the radius R2. The threshold value Rth can be determined as appropriate through experiments on actual equipment or through simulation.
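The ring loss referred to above penalizes the squared deviation of feature norms from a target radius (after Zheng et al., "Ring Loss", CVPR 2018), and the resulting norm-based fit check is a single comparison. The weight `lam` and the exact loss form are assumptions of this sketch:

```python
import numpy as np

def ring_loss(features, R1, lam=1.0):
    """Ring loss that pulls the norms of fitting features onto the sphere
    of radius R1 during optimization of the encoder model Mc."""
    norms = np.linalg.norm(features, axis=1)
    return lam * np.mean((norms - R1) ** 2)

def fits_by_norm(f_un, Rth):
    """Norm-based check of FIG. 9 (S21): the data fits E1..En when the
    norm R of its feature exceeds the threshold Rth."""
    return np.linalg.norm(f_un) > Rth
```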
FIG. 9 is a flowchart showing the flow of another example of the fit determination processing performed by the determination unit Jdg of FIG. 5. The flowchart shown in FIG. 9 is obtained by replacing S11 of FIG. 7 with S21.
As shown in FIG. 9, the determination unit Jdg determines in S21 whether or not the norm R is larger than the threshold value Rth. When the norm R is larger than the threshold value Rth (YES in S21), the determination unit Jdg determines that the data dt_un fits the data sets E1 to En, performs S12 and S14, and ends the processing. When the norm R is equal to or smaller than the threshold value Rth (NO in S21), the determination unit Jdg determines that the data dt_un does not fit the data sets E1 to En, performs S13 and S14, and ends the processing.
The fit determination processing by the determination unit Jdg is not limited to rule-based determination methods such as those shown in FIGS. 7 and 9, and may instead be a determination method using a trained model optimized by machine learning. In this case, the features distributed in the specific subspace Sb1, together with invalid data obtained by adding noise to those features, can be used as the training data for the machine learning of the determination unit Jdg.
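Constructing the training data for such a learned determination unit, with features in Sb1 as positive examples and noise-added copies as negative examples, can be sketched as follows; the Gaussian noise model is an assumption of this sketch:

```python
import numpy as np

def make_fit_classifier_data(features_sb1, noise_scale, rng):
    """Build training data for a learned determination unit Jdg: features
    lying in the specific subspace Sb1 serve as positive examples, and the
    same features with added noise serve as negative (unfit) examples."""
    noise = rng.normal(scale=noise_scale, size=features_sb1.shape)
    X = np.vstack([features_sb1, features_sb1 + noise])
    y = np.concatenate([np.ones(len(features_sb1)),
                        np.zeros(len(features_sb1))])
    return X, y
```

Any binary classifier trained on `(X, y)` then plays the role of the rule-based thresholds of FIGS. 7 and 9.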
[Modifications of the information processing device according to the embodiment]
The information processing device 100 has been described for the case where the machine learning and the inference by the trained models fitted through that machine learning are performed in the same device. The two may instead be performed in separate devices.
FIG. 10 is a block diagram showing the configuration of an information processing device 100A according to a first modification of the embodiment. The configuration of the information processing device 100A is obtained from the information processing device 100 of FIG. 1 by removing the determination unit Jdg and replacing the encoder Enc, the decoder Dec, the discriminator Dsc, and the main purpose processor Mpr with an encoder EncA, a decoder DecA, a discriminator DscA, and a main purpose processor MprA, respectively. Referring to FIG. 10, the information processing device 100A performs the learning processing shown in FIGS. 2 to 4 on the encoder model Mc, the decoder model Md, the discriminative model Me, and the main purpose model Mm.
FIG. 11 is a block diagram showing the configuration of an information processing device 100B according to a second modification of the embodiment. The configuration of the information processing device 100B is obtained from the information processing device 100 of FIG. 1 by removing the storage unit Stg, the decoder Dec, and the discriminator Dsc, and replacing the encoder Enc and the main purpose processor Mpr with an encoder EncB and a main purpose processor MprB, respectively. Referring to FIGS. 10 and 11, the encoder EncB and the main purpose processor MprB include the encoder model Mc and the main purpose model Mm, respectively, as fitted by the information processing device 100A. In the information processing device 100B, inference processing using the encoder model Mc and the main purpose model Mm, and fit determination processing such as that shown in FIG. 7 or FIG. 9, are performed.
[Specific example of the information processing device 100]
FIG. 12 is a schematic view showing an example of the overall configuration of a visual inspection system 1 including the information processing device 100 of FIG. 1. In the visual inspection system 1, the information processing device 100 functions as an image processing device. As shown in FIG. 12, the visual inspection system 1 is incorporated into a production line or the like and classifies defects of an inspection target 2 (hereinafter also referred to as the "workpiece 2") on the basis of images obtained by imaging the workpiece 2. That is, the main purpose processor Mpr of the information processing device 100 functions as an image classifier in the visual inspection system 1.
In the visual inspection system 1, the workpiece 2 is conveyed in a predetermined direction by a conveyance mechanism 6 such as a belt conveyor. The imaging unit 8 is arranged at a fixed position with respect to the workpiece 2, and an illumination light source 9 is arranged at a fixed position relative to the imaging unit 8. The illumination light source 9 illuminates at least the field of view of the imaging unit 8 (the range in which the workpiece 2 can be located). The imaging unit 8 images the moving workpiece 2, and the resulting image data is transmitted to the information processing device 100. The orientation of the imaging unit 8, as well as the light intensity, number, and placement of the illumination light sources 9, can be conditions that determine the environment in which the training data is acquired. The light intensity, number, placement, and so on are preferably optimized so as not to be disturbed by the surrounding lighting environment.
The arrival of the workpiece 2 within the field of view of the imaging unit 8 is detected by detection sensors 4 arranged at both ends of the conveyance mechanism 6. Specifically, each detection sensor 4 includes a light receiving unit 4a and a light projecting unit 4b arranged on the same optical axis, and detects the arrival of the workpiece 2 when the light receiving unit 4a detects that the light emitted from the light projecting unit 4b is blocked by the workpiece 2. The detection signal of the detection sensor 4 (hereinafter also referred to as a "trigger signal") is output to a PLC (Programmable Logic Controller) 5.
The PLC 5 receives the trigger signal from the detection sensor 4 and the like, and also controls the conveyance mechanism 6 itself.
The visual inspection system 1 further includes the information processing device 100, a display 102, and a mouse 104. The information processing device 100 is connected to the PLC 5, the imaging unit 8, the display 102, and the mouse 104.
As an example, the imaging unit 8 includes, in addition to an optical system such as a lens, an image sensor partitioned into a plurality of pixels, such as a CCD (Charge Coupled Device) or CMOS (Complementary Metal Oxide Semiconductor) sensor.
The information processing device 100 is a computer having a general-purpose architecture. The information processing device 100 realizes various functions such as machine learning and defect classification by executing pre-installed programs. When such a general-purpose computer is used, an OS (Operating System) for providing the basic functions of the computer may be installed in addition to the application for providing the functions of the information processing device 100. In this case, the program that realizes the functions of the information processing device 100 may be one that calls the necessary modules, among the program modules provided as part of the OS, in a predetermined sequence at predetermined timings to execute the processing. That is, the program itself need not include such modules, and the processing is executed in cooperation with the OS. The program may thus take a form that does not include some of these modules.
Further, the program that realizes the functions of the information processing device 100 may be provided incorporated into part of another program. In that case as well, the program itself does not include the modules contained in the other program with which it is combined, and the processing is executed in cooperation with that other program. That is, the program that realizes the functions of the information processing device 100 may take a form incorporated into such another program. Part or all of the functions provided by executing the program may instead be implemented as dedicated hardware circuits.
FIG. 13 is a schematic diagram for explaining examples of images of the workpiece 2 acquired by the imaging unit 8 of FIG. 12. FIG. 13(a) is an image of a workpiece 2 with no defect. FIG. 13(b) is an image of a workpiece 2 having a chip D1. FIG. 13(c) is an image of a workpiece 2 having a scratch D2. The defects that can occur in the workpiece 2 are not limited to chips and scratches, and also include, for example, dents and deformations. The information processing device 100 classifies images of the workpiece 2 according to the type of defect occurring in the workpiece 2 (no defect, chip, scratch, dent, deformation, and so on).
FIG. 14 is a schematic configuration diagram of the information processing device 100 of FIG. 12. As shown in FIG. 14, the information processing device 100 includes a processor 110 as an arithmetic processing unit, a main memory 112 and a hard disk 114 as storage units, a camera interface 116, an input interface 118, a display controller 120, a PLC interface 122, a communication interface 124, and a data reader/writer 126. These units are connected so as to be capable of data communication with one another via a bus 128.
The processor 110 includes a CPU (Central Processing Unit), and may further include a GPU (Graphics Processing Unit). The processor 110 loads programs (code) stored on the hard disk 114 into the main memory 112 and executes them in a predetermined order to perform various operations.
The main memory 112 is typically a volatile storage device such as a DRAM (Dynamic Random Access Memory). In addition to the programs read from the hard disk 114, the main memory 112 holds image data acquired by the imaging unit 8, data indicating the processing results of the image data, workpiece data, and the like.
The hard disk 114 is a nonvolatile magnetic storage device. The hard disk 114 stores the data sets E1 to En, the main purpose model Mm, the encoder model Mc, the decoder model Md, the discriminative model Me, a machine learning program Pg1, and a defect identification program Pg2. In the visual inspection system 1, the main purpose model Mm functions as an image classification model. Various setting values and the like may also be stored on the hard disk 114. The programs installed on the hard disk 114 are distributed in a state stored on a memory card 106 or the like, as described later. A semiconductor storage device such as a flash memory may be employed in addition to, or instead of, the hard disk 114.
Each of the plurality of training data items included in the data set Ds is an image of the workpiece 2 labeled by defect type. Each training data item is also given a label corresponding to the environment in which it was acquired. The image of the workpiece 2 may be an image captured by the imaging unit 8 of FIG. 12 or an image captured by another imaging device. The conditions under which the training data was acquired include, for example, the orientation of the imaging unit 8 and the light intensity, number, and placement of the illumination light sources 9.
The machine learning program Pg1 refers to the data set Ds, the encoder model Mc, the decoder model Md, the discriminative model Me, and the main purpose model Mm. The processor 110 executing the machine learning program Pg1 realizes the encoder Enc and decoder Dec of FIG. 2, the encoder Enc and discriminator Dsc of FIG. 3, and the main purpose processor Mpr of FIG. 4. By executing the machine learning program Pg1, the processor 110 fits each of the encoder model Mc, the decoder model Md, the discriminative model Me, and the main purpose model Mm into a trained model.
The defect identification program Pg2 refers to the encoder model Mc and the main purpose model Mm. The processor 110 executing the defect identification program Pg2 realizes the encoder Enc, the main purpose processor Mpr, and the determination unit Jdg of FIG. 5. By executing the defect identification program Pg2, the processor 110 identifies defects in the images acquired by the imaging unit 8 and outputs the identification results to the display 102. The processor 110 also outputs to the display 102 the goodness of fit, to the data sets E1 to En, of the environment in which the image was acquired. This goodness of fit is calculated by the determination unit Jdg according to the similarity between the conditions under which the training data was acquired and the conditions under which the data input to the main purpose processor Mpr in the inference processing was acquired. Depending on whether or not the goodness of fit exceeds a predetermined threshold value, the determination unit Jdg may output to the display 102 whether or not the input data fits the environment determined by the conditions under which the training data was acquired.
When the input data to the main purpose processor Mpr does not fit the data sets E1 to En, the processor 110 outputs to the display 102 a method of changing the conditions under which the input data was acquired so as to make the input data fit the data sets E1 to En. The change method includes, for example, changing the light intensity, number, or placement of the illumination light sources 9, or the orientation of the imaging unit 8.
The camera interface 116 mediates data transmission between the processor 110 and the imaging unit 8. That is, the camera interface 116 connects the imaging unit 8, which images the workpiece 2 and generates image data. More specifically, the camera interface 116 can be connected to one or more imaging units 8 and includes an image buffer 116a for temporarily storing a plurality of image data items from the imaging units 8. When at least one frame of image data has accumulated in the image buffer 116a, the camera interface 116 transfers the accumulated data to the main memory 112. The camera interface 116 also issues imaging commands to the imaging unit 8 in accordance with internal commands generated by the processor 110.
The input interface 118 mediates data transmission between the processor 110 and input units such as the mouse 104, a keyboard, or a touch panel. That is, the input interface 118 accepts operation commands given by the user operating an input unit.
The display controller 120 is connected to the display 102, a typical example of a display device, and notifies the user of the results of image processing in the processor 110 and the like. That is, the display controller 120 is connected to the display 102 and controls the display on the display 102. The display 102 is, for example, a liquid crystal display, an organic EL (Electro Luminescence) display, or another display device.
The PLC interface 122 mediates data transmission between the processor 110 and the PLC 5. More specifically, the PLC interface 122 transmits to the processor 110 information on the state of the production line controlled by the PLC 5, information on the workpieces, and the like.
The communication interface 124 mediates data transmission between the processor 110 and a console (or a personal computer or server device) or the like. The communication interface 124 typically consists of Ethernet (registered trademark), USB (Universal Serial Bus), or the like. As described later, instead of installing a program stored on the memory card 106 into the information processing device 100, a program downloaded from a distribution server or the like via the communication interface 124 may be installed into the information processing device 100.
The data reader/writer 126 mediates data transmission between the processor 110 and the memory card 106, which is a recording medium. That is, the memory card 106 is distributed in a state storing programs to be executed by the information processing device 100 and the like, and the data reader/writer 126 reads programs from the memory card 106. In response to internal commands of the processor 110, the data reader/writer 126 also writes image data acquired by the imaging unit 8 and/or processing results of the information processing device 100 to the memory card 106. The memory card 106 consists of a general-purpose semiconductor storage device such as CF (Compact Flash) or SD (Secure Digital), a magnetic storage medium such as a flexible disk, an optical storage medium such as a CD-ROM (Compact Disc Read Only Memory), or the like.
Another output device such as a printer may also be connected to the information processing device 100 as necessary.
The systems to which the information processing device according to the embodiment can be applied are not limited to visual inspection systems. Examples of such systems include automated driving systems that detect objects such as pedestrians and vehicles, and medical diagnosis systems.
In an automated driving system, for example, the presence or absence of obstacles such as people or other vehicles is identified in image data captured from the driver's seat of a vehicle. Conditions that determine the environment in which the image data is acquired include the time of day, the weather, and the season in which the image was captured. In the automated driving system, during actual automated driving, whether or not the presence or absence of obstacles can be determined for images captured from the driver's seat is determined in real time.
In a medical diagnosis system, for example, everyday photographs, X-ray images, and CT (Computed Tomography) images of a patient are input, and the diseases affecting the patient, the patient's physical condition, and the like are identified. Conditions that determine the environment in which the data was acquired include, for example, the device that acquired the data and the patient's age, sex, build, constitution, nationality, and medical history. At the time of actual diagnosis by the medical diagnosis system, whether or not diagnosis is possible for the patient is determined.
Other systems to which the information processing device according to the embodiment can be applied include face recognition systems, speech recognition systems, natural language processing systems, traffic condition monitoring systems that perform congestion prediction and the like, and robot systems that perform reinforcement learning. In a face recognition system, a person is identified from a face image, and the device that acquired the face image and the person's age, sex, nationality, and so on are included in the conditions that determine the environment in which the face image was acquired. In a speech recognition system, a person is identified from speech, and the device that acquired the speech and the age, sex, nationality, and so on of the person who uttered it are included in the conditions that determine the environment in which the speech was acquired. In a natural language processing system, the meaning of a character string is identified, and the language from which the character string derives, the age, sex, and nationality of the person who wrote it, and the field of the content it expresses are included in the conditions that determine the environment.
In a traffic condition monitoring system, the presence or absence of congestion is identified on the basis of road images, and the location of the road, together with the season, time of day, weather, road surface condition, and so on at which the road image was acquired, is included in the conditions that determine the environment in which the road image was acquired. In a robot system, actions are identified on the basis of image and speech recognition, and the place where the robot is located, as well as the performance and types of the sensors provided in the robot, are included in the conditions that determine the environment in which the images and speech are acquired.
As described above, according to the information processing device of the embodiment, it is possible to identify environments in which it is difficult for a trained model to exhibit the performance obtained through machine learning.
<Additional notes>
The embodiments described above include the following technical ideas.
(Structure 1)
An information processing device (100, 100B) comprising:
an encoder (Enc) including an encoder model (Mc) that outputs features (fx) of a plurality of dimensions (x1, x2) from training data (dtx) included in at least one data set (E1 to En); and
a determination unit (Jdg) that receives a specific feature (f_un) of at least one input data item (dt_un) from the encoder model (Mc) and outputs a goodness of fit (ad_un) of the at least one input data item to the at least one data set (E1 to En),
wherein the encoder model (Mc) has been fitted by machine learning so as to distribute the features (fx) in a specific subspace (Sb1) of the feature space defined by the plurality of dimensions (x1, x2), and
the determination unit (Jdg) calculates the goodness of fit (ad_un) from the positional relationship (δ) between the specific feature (f_un) and the specific subspace (Sb1).
(Structure 2)
The information processing device (100, 100B) according to Structure 1, wherein the determination unit (Jdg) back-calculates, through the encoder model (Mc), the difference between the specific feature (f_un) and the features (fx) extracted from the training data (dtx) by the encoder model (Mc) as an amount of change to the at least one input data item (dt_un), and outputs a method of changing the conditions under which the at least one input data item (dt_un) was acquired in order to reduce the absolute value of the amount of change.
(Structure 3)
The information processing device (100) according to Structure 1 or 2, further comprising:
a storage unit (Stg, 114) in which the at least one data set (E1 to En) is stored; and
a learning unit (Ln) that fits the encoder model (Mc) so as to distribute the features (fx) in the specific subspace (Sb1) by machine learning using the at least one data set (E1 to En).
(Structure 4)
An information processing device (100A) comprising:
a storage unit (Stg) in which at least one data set (E1 to En) is stored;
an encoder (Enc) including an encoder model (Mc) that extracts features (fx) of a plurality of dimensions (x1, x2) from training data (dtx) included in the at least one data set (E1 to En); and
a learning unit (Ln),
wherein the learning unit (Ln) fits the encoder model (Mc), by machine learning using the at least one data set (E1 to En), so as to distribute the features (fx) in a specific subspace (Sb1) of the feature space defined by the plurality of dimensions (x1, x2).
(Structure 5)
The information processing device (100, 100A) according to Structure 3 or 4, further comprising a decoder (Dec) including a decoder model (Md) that decodes the features (fz),
wherein the learning unit (Ln) fits the encoder model (Mc) and the decoder model (Md), by machine learning using the at least one data set (E1 to En), so that the features (fz) follow a standard normal distribution.
(Structure 6)
The information processing device (100, 100A) according to Structure 5, further comprising a discriminator (Dsc) including a discriminative model (Me) that identifies which of the at least one data set (E1 to En) the training data (dty) is included in,
wherein the machine learning is adversarial learning performed between the discriminative model (Me) and the encoder model (Mc), and
in the adversarial learning, the discriminative model (Me) is optimized so as to maximize the probability Ps that the discriminative model (Me) succeeds in identifying the correct data set (Ey) containing the training data (dty), while the encoder model (Mc) is optimized so as to maximize the probability that the discriminative model (Me) fails to identify the correct data set (Ey).
(Structure 7)
The information processing device (100, 100A, 100B) according to any one of Structures 1 to 6, wherein the specific subspace (Sb1) is a sphere centered at the origin of the feature space.
The embodiments disclosed herein should be considered in all respects as illustrative and not restrictive. The scope of the present invention is indicated by the claims rather than by the above description, and is intended to include all modifications within the meaning and scope equivalent to the claims.
1 visual inspection system, 2 workpiece, 4 detection sensor, 4a light receiving unit, 4b light projecting unit, 6 conveyance mechanism, 8 imaging unit, 9 illumination light source, 14, Jdg determination unit, 100, 100A, 100B information processing device, 102 display, 104 mouse, 106 memory card, 110 processor, 112 main memory, 114 hard disk, 116 camera interface, 116a image buffer, 118 input interface, 120 display controller, 122 PLC interface, 124 communication interface, 126 data reader/writer, 128 bus, Ds data set, Dec, DecA decoder, Dsc, DscA discriminator, Enc, EncA, EncB encoder, Ln learning unit, Mc encoder model, Md decoder model, Me discriminative model, Mm main purpose model, Mpr, MprA, MprB main purpose processor.
Claims (7)
1. An information processing device comprising:
an encoder including an encoder model that outputs features of a plurality of dimensions from training data included in at least one data set; and
a determination unit that receives a specific feature of at least one input data item from the encoder model and outputs a goodness of fit of the at least one input data item to the at least one data set,
wherein the encoder model has been fitted by machine learning so as to distribute the features in a specific subspace of the feature space defined by the plurality of dimensions, and
the determination unit calculates the goodness of fit from the positional relationship between the specific feature and the specific subspace.
2. The information processing device according to claim 1, wherein the determination unit back-calculates, through the encoder model, the difference between the specific feature and the features extracted from the training data by the encoder model as an amount of change to the at least one input data item, and outputs a method of changing the conditions under which the at least one input data item was acquired in order to reduce the absolute value of the amount of change.
3. The information processing device according to claim 1 or 2, further comprising:
a storage unit in which the at least one data set is stored; and
a learning unit that fits the encoder model so as to distribute the features in the specific subspace by machine learning using the at least one data set.
4. An information processing device comprising:
a storage unit in which at least one data set is stored;
an encoder including an encoder model that extracts features of a plurality of dimensions from training data included in the at least one data set; and
a learning unit,
wherein the learning unit fits the encoder model, by machine learning using the at least one data set, so as to distribute the features in a specific subspace of the feature space defined by the plurality of dimensions.
5. The information processing device according to claim 3 or 4, further comprising a decoder including a decoder model that decodes the features,
wherein the learning unit fits the encoder model and the decoder model, by machine learning using the at least one data set, so that the features follow a standard normal distribution.
6. The information processing device according to claim 5, further comprising a discriminator including a discriminative model that identifies which of the at least one data set the training data is included in,
wherein the machine learning is adversarial learning performed between the discriminative model and the encoder model, and
in the adversarial learning, the discriminative model is optimized so as to maximize the probability that the discriminative model succeeds in identifying the correct data set containing the training data, while the encoder model is optimized so as to maximize the probability that the discriminative model fails to identify the correct data set.
7. The information processing device according to any one of claims 1 to 6, wherein the specific subspace is a sphere centered at the origin of the feature space.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2019-206542 | 2019-11-14 | ||
| JP2019206542A JP7409027B2 (en) | 2019-11-14 | 2019-11-14 | Information processing device |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2021095519A1 true WO2021095519A1 (en) | 2021-05-20 |
Family
ID=75912301
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/JP2020/040345 Ceased WO2021095519A1 (en) | 2019-11-14 | 2020-10-28 | Information processing device |
Country Status (2)
| Country | Link |
|---|---|
| JP (1) | JP7409027B2 (en) |
| WO (1) | WO2021095519A1 (en) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP7485512B2 (en) * | 2019-12-26 | 2024-05-16 | キヤノンメディカルシステムズ株式会社 | Medical information processing device, medical information processing method, and medical information processing program |
| JP2024018792A (en) | 2022-07-29 | 2024-02-08 | 株式会社日立産機システム | Inspection equipment and method |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2019073923A1 (en) * | 2017-10-10 | 2019-04-18 | 国立大学法人岐阜大学 | Anomalous item determination method |
| US20190147357A1 (en) * | 2017-11-16 | 2019-05-16 | Red Hat, Inc. | Automatic detection of learning model drift |
| JP2019139277A (en) * | 2018-02-06 | 2019-08-22 | オムロン株式会社 | Evaluation device, motion control device, evaluation method, and evaluation program |
| JP2019159823A (en) * | 2018-03-13 | 2019-09-19 | 富士通株式会社 | Learning program, learning method and learning device |
- 2019-11-14: JP application JP2019206542A filed; patent JP7409027B2 (status: active)
- 2020-10-28: PCT application PCT/JP2020/040345 filed; publication WO2021095519A1 (status: ceased)
Non-Patent Citations (1)
| Title |
|---|
| KAWACHI, YUTA ET AL.: "Supervised anomaly detection using hyperspherical surface latent space autoencoder", IPSJ SIG Technical Reports: Computer Vision and Image Media (CVIM), vol. 2018-CV, no. 1, 19 September 2018 (2018-09-19), pages 1-8, ISSN: 2188-8701, Retrieved from the Internet: <URL:https://ipsj.ixsq.nii.ac.jp/ej/?action=repository_uri&item_id=191346&file_id=1&file_no=1> * |
Also Published As
| Publication number | Publication date |
|---|---|
| JP2021081814A (en) | 2021-05-27 |
| JP7409027B2 (en) | 2024-01-09 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US12315053B2 (en) | Image classification through label progression | |
| US11455807B2 (en) | Training neural networks for vehicle re-identification | |
| US12456055B2 (en) | Weakly-supervised object detection using one or more neural networks | |
| US12361221B2 (en) | Grammar transfer using one or more neural networks | |
| US12131536B2 (en) | Video surveillance with neural networks | |
| US20170351905A1 (en) | Learning model for salient facial region detection | |
| US10762440B1 (en) | Sensor fusion and deep learning | |
| US9121751B2 (en) | Weighing platform with computer-vision tracking | |
| US20230126842A1 (en) | Model prediction confidence utilizing drift | |
| US10943099B2 (en) | Method and system for classifying an input data set using multiple data representation source modes | |
| WO2021095519A1 (en) | Information processing device | |
| KR102479671B1 (en) | Method for providing parts information of vehicle | |
| US11928011B2 (en) | Enhanced drift remediation with causal methods and online model modification | |
| US20230126294A1 (en) | Multi-observer, consensus-based ground truth | |
| Tang et al. | Image semantic recognition and segmentation algorithm of colorimetric sensor array based on deep convolutional neural network | |
| EP4105893B1 (en) | Dynamic artifical intelligence camera model update | |
| US20250128726A1 (en) | Apparatus and method for providing alarm according to behavior of driver in driving monitoring system | |
| US20210209473A1 (en) | Generalized Activations Function for Machine Learning | |
| KR20250058713A (en) | Apparatus and method for detecting drowsiness of driver in driving monitoring system | |
| US11354793B2 (en) | Object detection with missing annotations in visual inspection | |
| KR102589573B1 (en) | Apparatus and method for deciding of photographing area | |
| US20240193903A1 (en) | Detecting Portions of Images Indicative of the Presence of an Object | |
| US20230128081A1 (en) | Automated identification of training datasets | |
| Meena et al. | Hybrid neural network architecture for multi-label object recognition using feature fusion | |
| US12488282B2 (en) | Unsupervised data characterization utilizing drift |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | 121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 20887161; Country of ref document: EP; Kind code of ref document: A1 |
| | NENP | Non-entry into the national phase | Ref country code: DE |
| | 122 | Ep: pct application non-entry in european phase | Ref document number: 20887161; Country of ref document: EP; Kind code of ref document: A1 |