JP6840627B2

JP6840627B2 - Hyperparameter evaluation method, computer and program

Info

Publication number: JP6840627B2
Application number: JP2017117620A
Authority: JP
Inventors: 鈴木　麻由美; 麻由美鈴木; 琢磨柴原
Original assignee: Hitachi Ltd
Current assignee: Hitachi Ltd
Priority date: 2017-06-15
Filing date: 2017-06-15
Publication date: 2021-03-10
Anticipated expiration: 2037-06-15
Also published as: JP2019003408A

Description

本発明は、機械学習を用いてデータ解析を行う装置に関する。 The present invention relates to an apparatus that performs data analysis using machine learning.

機械学習技術は、ヘルスケアや金融、産業などのさまざまな分野で蓄積されたデータの解析に対して、新しい社会的価値を創出する技術として注目されている。機械学習には、support vector machineやdeep learningといったアルゴリズムの種類、および各アルゴリズムにおいてモデルを決定するために必要なパラメータがあり、ハイパーパラメータと呼ばれる。 Machine learning technology is attracting attention as a technology that creates new social value for the analysis of data accumulated in various fields such as healthcare, finance, and industry. Machine learning has the types of algorithms such as support vector machine and deep learning, and the parameters required to determine the model in each algorithm, and is called hyperparameters.

ハイパーパラメータの組み合わせは数万通り以上考えられる。解析したい問題に機械学習を適用する際には、ハイパーパラメータの組み合わせをいくつも試行し、最も精度が高くなる組み合わせを発見する必要がある。加えて、ハイパーパラメータの組み合わせは解析する問題ごとに最適な組み合わせが異なるため、問題やデータが変わるごとに行う必要がある。 There are more than tens of thousands of combinations of hyperparameters. When applying machine learning to the problem you want to analyze, you need to try a number of hyperparameter combinations and find the one with the highest accuracy. In addition, since the optimum combination of hyperparameters differs depending on the problem to be analyzed, it is necessary to perform it every time the problem or data changes.

また、ハイパーパラメータの探索は各ハイパーパラメータがもたらす効果から探索領域の見当をつけて行うことにより探索時間を短縮可能であるため、専門家がハイパーパラメータの探索を行うことが現実的である。但し、専門家が探索を行った場合においても、最適なハイパーパラメータの発見までに半年程度の時間を要する場合がある。 In addition, since it is possible to shorten the search time by estimating the search area from the effect of each hyperparameter, it is realistic for an expert to search for hyperparameters. However, even if an expert searches, it may take about half a year to find the optimum hyperparameters.

そのため、目標精度を達成できなかった場合には、探索開始から半年経った後に、目標精度を達成できなかった要因を考察し、問題の設定やデータの質が不適切だったなどの結論を得ることになる。 Therefore, if the target accuracy cannot be achieved, six months after the start of the search, the factors that failed to achieve the target accuracy are considered, and the conclusion is obtained that the problem setting and data quality are inappropriate. It will be.

また、日々蓄積されるデータが増加した都度、データに追加したい場合においても、追加するデータに対応したモデルが完成するのは半年後となる。このように、機械学習を実社会に適用する場合、専門性の高さとハイパーパラメータ探索の複雑さは課題の１つである。 In addition, even if it is desired to add data to the data each time the data accumulated daily increases, the model corresponding to the data to be added will be completed in half a year. Thus, when applying machine learning to the real world, high expertise and complexity of hyperparameter search are one of the problems.

先行技術として、専門家がハイパーパラメータ探索のノウハウをフローとして体系化させたcheat sheetが知られている（非特許文献１）。cheat sheetを活用することでアルゴリズムの種類の決定が可能になると考えられる。また、別の先行技術として、予め用意しておいたハイパーパラメータの組み合わせを自動で網羅的に試行し、最も予測精度が高かったハイパーパラメータを提示するサービスの製品化も行われている（非特許文献２）。 As a prior art, a cheat sheet is known in which an expert systematizes the know-how of hyperparameter search as a flow (Non-Patent Document 1). It is thought that the type of algorithm can be determined by using the cheat sheet. In addition, as another prior art, a service has been commercialized that automatically and comprehensively tries a combination of hyperparameters prepared in advance and presents the hyperparameters with the highest prediction accuracy (non-patented). Document 2).

“scikit-learn cheat sheet”、［online］、［平成29年5月 8日検索］、インターネット〈 URL：http://scikit-learn.org/stable/tutorial/machine_learning_map/〉“Scikit-learn cheat sheet”, [online], [Search on May 8, 2017], Internet <URL: http://scikit-learn.org/stable/tutorial/machine_learning_map/> “Sap predictive analytics whitepaper”、［online］、［平成29年5月 8日検索］、インターネット〈https://www.sap.com/japan/documents/2016/02/c4d70a6c-617c-0010-82c7-eda71af511fa.html〉“Sap predictive analytics whitepaper”, [online], [Search on May 8, 2017], Internet <https://www.sap.com/japan/documents/2016/02/c4d70a6c-617c-0010-82c7- eda71af511fa.html>

解析したい問題に機械学習を適用する際には、ハイパーパラメータの組み合わせをいくつも試行し、最も精度が高くなる組み合わせを発見する必要がある。上記作業には、半年程度の時間と専門家が必要になる。 When applying machine learning to the problem you want to analyze, you need to try a number of hyperparameter combinations and find the one with the highest accuracy. The above work requires about half a year and specialists.

ハイパーパラメータの組み合わせを自動で網羅的に試行する場合、時間は同様にかかるものの専門家は不要となる。また、解析対象のデータが外部に持ち出せない場合、解析対象データを保管する施設に解析用計算機を搬入する必要がある。従来では機械学習のハイパーパラメータ探索には大規模な解析用計算機の使用が前提となっているが、該当施設に解析用計算機の搬入ができない可能性がある。さらに、既存の計算器を用いた場合に要する時間はさらに長くなる可能性がある。 When trying to combine hyperparameters automatically and comprehensively, it takes time as well, but no specialist is required. In addition, if the data to be analyzed cannot be taken out, it is necessary to bring the analysis computer to the facility that stores the data to be analyzed. Conventionally, the use of a large-scale analysis computer has been a prerequisite for machine learning hyperparameter search, but there is a possibility that the analysis computer cannot be brought into the facility. Moreover, the time required when using an existing computer may be even longer.

本発明の目的は、ハイパーパラメータ探索の工程において、専門家および高性能な解析用計算器を必要とせずに、データの特徴から各ハイパーパラメータの組み合わせにおいて期待される予測性能を推定することで、達成可能と見込まれる予測性能およびハイパーパラメータの組み合わせを迅速に提示可能にする計算機を提供することにある。 An object of the present invention is to estimate the predicted performance expected for each combination of hyperparameters from the characteristics of data without the need for an expert and a high-performance computer for analysis in the process of hyperparameter search. The purpose is to provide a computer that can quickly present a combination of predictive performance and hyperparameters that are expected to be achievable.

本発明は、プロセッサとメモリを有する計算機が、解析対象データに適用可能なハイパーパラメータを評価するハイパーパラメータの評価方法であって、前記計算機が、予め設定された事前データと複数のハイパーパラメータから２次特徴量生成部と精度予測部を生成する第１のステップと、前記計算機が、前記解析対象データを受け付ける第２のステップと、前記２次特徴量生成部が、前記解析対象データと複数のハイパーパラメータを入力として２次特徴量を算出する第３のステップと、前記精度予測部が、前記２次特徴量を入力としてハイパーパラメータ毎の期待予測精度を算出する第４のステップと、を含む。 The present invention is a hyperparameter evaluation method in which a computer having a processor and a memory evaluates hyperparameters applicable to the data to be analyzed, and the computer uses preset preset data and a plurality of hyperparameters. The first step of generating the next feature quantity generation unit and the accuracy prediction unit, the second step of the computer accepting the analysis target data, and the secondary feature quantity generation unit include the analysis target data and a plurality. It includes a third step of calculating a secondary feature amount by inputting a hyperparameter, and a fourth step of calculating an expected prediction accuracy for each hyperparameter by the accuracy prediction unit using the secondary feature amount as an input. ..

本発明によれば、解析対象データに対し、最高の精度となると予測されたハイパーパラメータの組み合わせおよび前記最高の予測精度を迅速に提示することが可能となる。これにより、機械学習のハイパーパラメータ探索において、半年などの長期間を要することなく、かつ専門家および高性能な解析用の計算機を不要とすることが期待できる。 According to the present invention, it is possible to quickly present a combination of hyperparameters predicted to have the highest accuracy and the highest prediction accuracy for the data to be analyzed. As a result, it can be expected that the hyperparameter search of machine learning does not require a long period of time such as half a year, and does not require a specialist and a high-performance computer for analysis.

本発明の実施例１を示し、異常検知装置の構成の一例を示すブロック図である。It is a block diagram which shows Example 1 of this invention and shows an example of the structure of an abnormality detection apparatus. 本発明の実施例１を示し、ソフトウェアの構成の一例を示すブロック図である。It is a block diagram which shows Example 1 of this invention and shows an example of the structure of software. 本発明の実施例１を示し、期待予測精度予測器の処理の一例を示すフローチャートである。FIG. 5 is a flowchart showing Example 1 of the present invention and showing an example of processing of an expected prediction accuracy predictor. 本発明の実施例１を示し、２次特徴量生成器および精度予測器における処理の一例を示す説明図である。It is explanatory drawing which shows Example 1 of this invention and shows an example of processing in a secondary feature quantity generator and accuracy predictor. 本発明の実施例１を示し、学習器で行われる処理の一例を示すフローチャートである。FIG. 5 is a flowchart showing Example 1 of the present invention and showing an example of processing performed by the learner. 本発明の実施例１を示し、結果を保存する処理の一例を示すフローチャートである。It is a flowchart which shows Example 1 of this invention and shows an example of the process which saves a result. 本発明の実施例１を示し、画面表示の一例を示す図である。It is a figure which shows Example 1 of this invention and shows an example of screen display. 本発明の実施例１を示し、２次特徴量生成器および精度予測器の生成の一例を示す説明図である。It is explanatory drawing which shows Example 1 of this invention and shows an example of the generation of a secondary feature quantity generator and an accuracy predictor. 本発明の実施例２を示し、異常検知装置の機能の一例を示す説明図である。It is explanatory drawing which shows Example 2 of this invention and shows an example of the function of the abnormality detection apparatus. 本発明の実施例３を示し、処理の一例を示すフローチャートである。It is a flowchart which shows Example 3 of this invention and shows an example of processing. 本発明の実施例３を示し、データ解析手法の処理の一例を示すフローチャートである。FIG. 3 is a flowchart showing Example 3 of the present invention and showing an example of processing of a data analysis method. 本発明の実施例４を示し、異常検知装置の機能の一例を示す説明図である。It is explanatory drawing which shows Example 4 of this invention and shows an example of the function of the abnormality detection apparatus.

本発明の第一の実施形態を、図面を用いて説明する。以下、本発明の実施形態を説明するための全図において、基本的に同一機能を有するものは同一符号を付し、その繰り返しの説明は省略する。 The first embodiment of the present invention will be described with reference to the drawings. Hereinafter, in all the drawings for explaining the embodiment of the present invention, those having basically the same function are designated by the same reference numerals, and the repeated description thereof will be omitted.

まず、本発明の実施例１の異常検知装置を説明する。本実施例１の異常検知装置１００は、データ解析のためのモデルを機械学習により生成し、生成されたモデルによって異常があるデータの検知を行う例を示す。 First, the abnormality detection device according to the first embodiment of the present invention will be described. The abnormality detection device 100 of the first embodiment shows an example in which a model for data analysis is generated by machine learning, and data having an abnormality is detected by the generated model.

図１は、本実施形態の異常検知装置１００のハードウェア構成の一例を示すブロック図である。本実施例１の異常検知装置１００は、ＣＰＵ１０２と、メモリ１０３と、入出力インターフェイス１０４と、通信装置１０５と、ドライブ装置１０６と、を有する。 FIG. 1 is a block diagram showing an example of the hardware configuration of the abnormality detection device 100 of the present embodiment. The abnormality detection device 100 of the first embodiment includes a CPU 102, a memory 103, an input / output interface 104, a communication device 105, and a drive device 106.

これらはデータバス１０１によって相互に接続されている。ＣＰＵ１０２は、制御装置と演算装置からなり、異常検知装置１００の制御や、演算や情報転送をつかさどる中央処理装置である。メモリ１０３は、異常検知装置１００が処理すべきデジタルデータを一定期間保持することが可能である記憶装置である。 These are interconnected by a data bus 101. The CPU 102 includes a control device and an arithmetic unit, and is a central processing unit that controls the abnormality detection device 100, performs arithmetic operations, and transfers information. The memory 103 is a storage device capable of holding digital data to be processed by the abnormality detection device 100 for a certain period of time.

入出力インターフェイス１０４は、装置の外部に接続する機器との情報の入出力に使用するインターフェイスである。入出力インターフェイス１０４に、キーボードやマウスなどの入力装置１０７と、ディスプレイなどの出力装置１０８を接続することが可能である。 The input / output interface 104 is an interface used for inputting / outputting information to / from a device connected to the outside of the device. An input device 107 such as a keyboard or mouse and an output device 108 such as a display can be connected to the input / output interface 104.

通信装置１０５は、インターネットなどのネットワークに接続するケーブルを通信装置１０５に接続することで、インターネットなどのネットワークへの接続を可能にする装置である。ドライブ装置１０６は、情報が書き込まれているまたは書き込み可能である空のディスクメディアやＨＤＤなどの記憶媒体１０９を含み、書き込まれている情報の読み出し、および情報の書き込みを行う装置である。 The communication device 105 is a device that enables connection to a network such as the Internet by connecting a cable that connects to a network such as the Internet to the communication device 105. The drive device 106 is a device that includes a blank disk medium or a storage medium 109 such as an HDD on which information is written or writable, and reads the written information and writes the information.

メモリ１０３には、本実施例１の異常検知装置１００の機能を実現させるために必要な、ＣＰＵ１０２の演算処理用のプログラム２００と各種データが予め格納されている。ＣＰＵ１０２が、メモリ１０３に格納されたプログラム２００を実行することによって、異常検知装置１００の機能を実現する各種処理が行われる。 The memory 103 stores in advance a program 200 for arithmetic processing of the CPU 102 and various data necessary for realizing the function of the abnormality detection device 100 of the first embodiment. By executing the program 200 stored in the memory 103, the CPU 102 performs various processes for realizing the functions of the abnormality detection device 100.

なお、ＣＰＵ１０２が実行するプログラム２００はドライブ装置１０６に接続された前記記憶媒体１０９に格納しておき、そのプログラムを読み込んでメモリ１０３に格納するようにしてもよい。 The program 200 executed by the CPU 102 may be stored in the storage medium 109 connected to the drive device 106, and the program may be read and stored in the memory 103.

図２はメモリ１０３、または前記記憶媒体１０９格納されている演算処理用のプログラム２００の実行により実装される機能の一例を示すブロック図である。期待予測精度予測器２１０は、２次特徴量生成器２１１および精度予測器２１２からなる。なお、２次特徴量生成器２１１は、２次特徴量生成モジュールとしてもよい。また、精度予測器２１２は、精度予測モジュールとしてもよい。 FIG. 2 is a block diagram showing an example of a function implemented by executing the operation processing program 200 stored in the memory 103 or the storage medium 109. The expected prediction accuracy predictor 210 includes a secondary feature amount generator 211 and an accuracy predictor 212. The secondary feature amount generator 211 may be used as a secondary feature amount generation module. Further, the accuracy predictor 212 may be an accuracy prediction module.

２次特徴量生成器２１１は、解析対象データとハイパーパラメータの組み合わせから構成された１次特徴量を用いて、２次特徴量を生成する機能を含む。精度予測器２１２は、２次特徴量生成器２１１において生成した２次特徴量を入力として、ハイパーパラメータの期待予測精度を推定する機能を有する。なお、ハイパーパラメータは、前述したとおり、機械学習のアルゴリズムの種類、および各アルゴリズムにおいてモデルを決定するために必要なパラメータを含む情報である。 The secondary feature amount generator 211 includes a function of generating a secondary feature amount using a primary feature amount composed of a combination of analysis target data and hyperparameters. The accuracy predictor 212 has a function of estimating the expected prediction accuracy of hyperparameters by inputting the secondary feature amount generated by the secondary feature amount generator 211. As described above, the hyperparameters are information including the types of machine learning algorithms and the parameters required to determine the model in each algorithm.

期待予測精度は、当該ハイパーパラメータと解析対象データを機械学習に適用し、生成したモデルの精度を評価した場合に、患者と非患者の識別などにおいて正しく識別できた確率などで表現することができる。なお、機械学習のモデルについては、予め設定されたモデルを使用する。 Expected prediction accuracy can be expressed by the probability of being able to correctly distinguish between patients and non-patients when the hyperparameters and data to be analyzed are applied to machine learning and the accuracy of the generated model is evaluated. .. As the machine learning model, a preset model is used.

学習器２２０は、任意のハイパーパラメータの組み合わせによる学習を行い、データ解析のためのモデルを生成する機能を含む。結果表示器２３０は、ユーザインターフェイスによる操作や期待予測精度予測器２１０の結果および学習器２２０の結果を表示する機能を含む。 The learner 220 includes a function of performing learning by a combination of arbitrary hyperparameters and generating a model for data analysis. The result display 230 includes a function of displaying the operation by the user interface, the result of the expected prediction accuracy predictor 210, and the result of the learner 220.

２次特徴量生成器２１１と、精度予測器２１２と、学習器２２０と、結果表示器２３０の各機能部はプログラム２００としてメモリ１０３にロードされる。なお、本実施例１では、学習器２２０をソフトウェアによって実現する例を示すが、ハードウェアによって実現してもよい。 Each functional unit of the secondary feature amount generator 211, the accuracy predictor 212, the learner 220, and the result display 230 is loaded into the memory 103 as a program 200. In the first embodiment, the learning device 220 is realized by software, but it may be realized by hardware.

ＣＰＵ１０２は、各機能部のプログラムに従って処理することによって、所定の機能を提供する機能部として稼働する。例えば、ＣＰＵ１０２は、学習プログラムに従って処理することで学習器２２０として機能する。他のプログラムについても同様である。さらに、ＣＰＵ１０２は、各プログラムが実行する複数の処理のそれぞれの機能を提供する機能部としても稼働する。計算機及び計算機システムは、これらの機能部を含む装置及びシステムである。 The CPU 102 operates as a functional unit that provides a predetermined function by processing according to the program of each functional unit. For example, the CPU 102 functions as a learner 220 by processing according to a learning program. The same applies to other programs. Further, the CPU 102 also operates as a functional unit that provides each function of a plurality of processes executed by each program. A computer and a computer system are devices and systems including these functional parts.

異常検知装置１００の各機能を実現するプログラム、テーブル等の情報は、ドライブ装置１０６や不揮発性半導体メモリ、ハードディスクドライブ、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）等の記憶デバイス、または、ＩＣカード、ＳＤカード、ＤＶＤ等の計算機読み取り可能な非一時的データ記憶媒体に格納することができる。 Information such as programs and tables that realize each function of the abnormality detection device 100 can be obtained from a drive device 106, a non-volatile semiconductor memory, a hard disk drive, a storage device such as an SSD (Solid State Drive), or an IC card, SD card, DVD. It can be stored in a non-temporary data storage medium that can be read by a computer such as.

次に、本実施例１のプログラム２００を実行した際の処理の流れを説明する。図３は、本実施例１の異常検知装置１００で行われる処理の一例を示すフローチャートである。異常検知装置１００は、プログラムの実行により、解析対象データに対し、最高の精度となると予測されたハイパーパラメータの組み合わせおよび前記最高の予測精度を瞬時（または迅速）に提示する。 Next, the flow of processing when the program 200 of the first embodiment is executed will be described. FIG. 3 is a flowchart showing an example of processing performed by the abnormality detection device 100 of the first embodiment. The abnormality detection device 100 instantly (or quickly) presents the combination of hyperparameters predicted to be the highest accuracy and the highest prediction accuracy to the data to be analyzed by executing the program.

異常検知装置１００は、期待予測精度予測器２１０を機能させて、ドライブ装置１０６に接続された記憶媒体１０９から、メモリ１０３に解析対象データを読み込む（ステップＳ３０１）。なお、異常検知装置１００は、入力装置１０７等から解析対象データの指定を受け付けて、記憶媒体１０９からメモリ１０３に読み込む。 The abnormality detection device 100 makes the expected prediction accuracy predictor 210 function and reads the analysis target data into the memory 103 from the storage medium 109 connected to the drive device 106 (step S301). The abnormality detection device 100 receives the designation of the analysis target data from the input device 107 or the like, and reads the data from the storage medium 109 into the memory 103.

続いて、異常検知装置１００の期待予測精度予測器２１０は、解析対象データとハイパーパラメータセットからなる１次特徴量を用いて、２次特徴量生成器２１１により２次特徴量を生成する（ステップＳ３０２）。 Subsequently, the expected prediction accuracy predictor 210 of the abnormality detection device 100 generates a secondary feature amount by the secondary feature amount generator 211 using the primary feature amount composed of the analysis target data and the hyperparameter set (step). S302).

ここで、ハイパーパラメータセットとは、ハイパーパラメータの組み合わせを意味する。例えば、解析用のアルゴリズムの種類はｄｅｅｐｌｅａｒｎｉｎｇ、層の数は１０、ユニット（ニューロン）数は５００、といった組み合わせで構成されるハイパーパラメータを含むセットである。 Here, the hyperparameter set means a combination of hyperparameters. For example, the type of algorithm for analysis is deep learning, the number of layers is 10, the number of units (neurons) is 500, and so on.

なお、解析対象データに適用可能なハイパーパラメータの予測精度を評価するハイパーパラメータセットは、予め用意された複数のハイパーパラメータで構成される。あるいは、オペレータなどが入力装置１０７を介して指定した複数のハイパーパラメータを受け付けて、ハイパーパラメータセットを構成するようにしてもよい。 The hyperparameter set for evaluating the prediction accuracy of hyperparameters applicable to the data to be analyzed is composed of a plurality of hyperparameters prepared in advance. Alternatively, a plurality of hyperparameters specified by an operator or the like via the input device 107 may be accepted to form a hyperparameter set.

１次特徴量は、解析対象データとハイパーパラメータセットＡ、解析対象データとハイパーパラメータセットＢ、というように、予め定めておいた複数個のハイパーパラメータセットが自動的に解析対象データと結合されたデータセットである。本実施例１では１次特徴量は、予め生成されたものとする。 As for the primary feature quantity, a plurality of predetermined hyperparameter sets such as the analysis target data and the hyperparameter set A and the analysis target data and the hyperparameter set B are automatically combined with the analysis target data. It is a data set. In the first embodiment, it is assumed that the primary feature amount is generated in advance.

次に、期待予測精度予測器２１０は、後述するように生成された２次特徴量を用いて、精度予測器２１２により期待予測精度を予測する（ステップＳ３０３）。最後に、結果表示器２３０は、予測した期待予測精度のうち、前記期待予測精度が高かったもののうち上位数個（予め設定された順位閾値以上）におけるハイパーパラメータセットおよび前記期待予測精度を出力装置１０８に表示し、処理を終了する（ステップＳ３０４）。 Next, the expected prediction accuracy predictor 210 predicts the expected prediction accuracy by the accuracy predictor 212 using the secondary features generated as described later (step S303). Finally, the result display 230 outputs the hyperparameter set and the expected prediction accuracy of the top several (preset preset rank threshold values or more) of the predicted expected prediction accuracy having the highest expected prediction accuracy. Displayed on 108, the process ends (step S304).

図４は、前記ステップＳ３０２の、２次特徴量生成および、前記ステップＳ３０３の、期待予測精度の予測における処理を説明するための模式図である。１次特徴量は、上述の通り、解析対象データと１つのハイパーパラメータセットの組み合わせとして複数のサンプルが生成される。 FIG. 4 is a schematic diagram for explaining the process of generating the secondary feature amount in step S302 and predicting the expected prediction accuracy in step S303. As for the primary feature quantity, as described above, a plurality of samples are generated as a combination of the data to be analyzed and one hyperparameter set.

ハイパーパラメータセットが、１０個あった場合で以下説明を進める。１０サンプルの１次特徴量に対して、前記ステップＳ３０２の２次特徴量生成により２次特徴量が生成され、前記ステップＳ３０３の期待予測精度予測により期待予測精度が予測され、１０サンプルそれぞれに対して期待予測精度が１個ずつ提示される。 The following description will be given when there are 10 hyperparameter sets. For the primary feature amount of 10 samples, the secondary feature amount is generated by the secondary feature amount generation in step S302, the expected prediction accuracy is predicted by the expected prediction accuracy prediction in step S303, and for each of the 10 samples. The expected prediction accuracy is presented one by one.

前記ステップＳ３０４で示したように、提示された１０個の期待予測精度を高い順に並び替えを行い、期待予測精度の高い順にハイパーパラメータセットおよび前記期待予測精度を表示し終了する。 As shown in step S304, the ten presented expected prediction accuracy are rearranged in descending order, and the hyperparameter set and the expected prediction accuracy are displayed in descending order of the expected prediction accuracy, and the process ends.

本実施例１では、予め特徴量に用いるハイパーパラメータセットを定めておいたが、オペレータによりハイパーパラメータセットを追加、決定可能としてもよい。また、期待予測精度のうち、予測精度が高かったもののうち上位数個におけるハイパーパラメータセットおよび予測精度の表示を行って処理の終了としたが、これに限定されるものではない。 In the first embodiment, the hyperparameter set used for the feature quantity is defined in advance, but the hyperparameter set may be added and determined by the operator. Further, among the expected prediction accuracy, the hyperparameter set and the prediction accuracy of the top several of the highest prediction accuracy are displayed to end the processing, but the processing is not limited to this.

例えば、期待予測精度予測器２１０による結果を結果表示器２３０が出力装置１０８にハイパーパラメータセットを表示し、オペレータが表示された中から、ハイパーパラメータセットを１つ選択し、選択したハイパーパラメータセットによる学習を行いデータ解析のためのモデルを生成する機能があってもよい。 For example, the result display 230 displays the result by the expected prediction accuracy predictor 210 on the output device 108, and the operator selects one hyperparameter set from the displayed ones, and the result is based on the selected hyperparameter set. There may be a function of training and generating a model for data analysis.

また、オペレータが任意のパラメータセットを選択せず、上位１個のみ、または上位５個すべてなどにおいて自動的に学習を行うようにしてもよい。さらに、全ての１次特徴量に対する期待予測精度およびそのハイパーパラメータセットを指定したあるいは特定のメモリ空間へ保存してもよいし、オペレータが指定した特定のデータのみを指定したあるいは特定のメモリ空間へ保存してもよい。 Further, the operator may not select an arbitrary parameter set, and may automatically perform learning on only the top one or all the top five parameters. Furthermore, the expected prediction accuracy for all primary features and their hyperparameter sets may be specified or stored in a specific memory space, or only specific data specified by the operator may be specified or stored in a specific memory space. You may save it.

続いて、期待予測精度予測器２１０が出力したハイパーパラメータセットのうち、オペレータが指定した１つまたは複数個のハイパーパラメータセットによる学習を学習器２２０によって行いデータ解析のためのモデルを生成する。 Subsequently, among the hyperparameter sets output by the expected prediction accuracy predictor 210, learning by one or a plurality of hyperparameter sets specified by the operator is performed by the learner 220 to generate a model for data analysis.

図５は学習器２２０で行われる処理の一例を示すフローチャートである。学習器２２０は、指定されたハイパーパラメータを用いて学習を行いデータ解析のためのモデルを生成する。この処理は、オペレータからの指示などに基づいて開始される。なお、モデルの生成手法は周知ないし公知の技術を適用することができる。 FIG. 5 is a flowchart showing an example of processing performed by the learner 220. The learner 220 performs learning using the designated hyperparameters and generates a model for data analysis. This process is started based on an instruction from the operator or the like. A well-known or known technique can be applied to the model generation method.

学習器２２０は、まず、メモリ１０３から入力装置１０７等で指定された解析対象データを読み出す（ステップＳ５０１）。続いて、学習器２２０は、事前にメモリ１０３に保存したハイパーパラメータを読み出すかどうかを判定する（ステップＳ５０２）。 First, the learner 220 reads out the analysis target data designated by the input device 107 or the like from the memory 103 (step S501). Subsequently, the learner 220 determines whether or not to read the hyperparameters stored in the memory 103 in advance (step S502).

なお、事前にメモリ１０３に保存されたハイパーパラメータは、例えば、図３のステップＳ３０４で期待予測精度が表示されたハイパーパラメータである。 The hyperparameters stored in the memory 103 in advance are, for example, the hyperparameters whose expected prediction accuracy is displayed in step S304 of FIG.

ステップＳ５０２の判定は、学習器２２０が出力装置１０８にメモリ１０３内のハイパーパラメータセットを使用するか否かのボタンを表示し、入力装置１０７から受け付けた応答に基づいて実施する。メモリ１０３内のハイパーパラメータセットを使用する場合には、ステップＳ５０４へ進みそうでない場合にはステップＳ５０３へ進む。 The determination in step S502 is performed based on the response received from the input device 107 by displaying a button on the output device 108 whether or not the learner 220 uses the hyperparameter set in the memory 103. When the hyperparameter set in the memory 103 is used, the process proceeds to step S504, and if not, the process proceeds to step S503.

ステップＳ５０３では、学習器２２０が出力装置１０８にユーザインターフェイスを出力し、学習させるハイパーパラメータセットを入力装置１０７から受け付ける。一方、メモリ１０３から読み出す場合、学習器２２０は、学習させるハイパーパラメータセットを選択するために、メモリ１０３に格納されているハイパーパラメータセットを読み出す（ステップＳ５０４）。 In step S503, the learner 220 outputs the user interface to the output device 108, and receives the hyperparameter set to be learned from the input device 107. On the other hand, when reading from the memory 103, the learner 220 reads the hyperparameter set stored in the memory 103 in order to select the hyperparameter set to be learned (step S504).

学習器２２０は、ステップＳ５０４にて読み出したハイパーパラメータセットの中から、学習させるハイパーパラメータセットを選択する（ステップＳ５０５）。この際、選択するハイパーパラメータセットは１つのみでもよいし、複数であってもよい。ハイパーパラメータセットの選択は、学習器２２０が入力装置１０７からオペレータの指令を受け付けても良い。 The learner 220 selects the hyperparameter set to be learned from the hyperparameter set read in step S504 (step S505). At this time, only one hyperparameter set may be selected, or a plurality of hyperparameter sets may be selected. For the selection of the hyperparameter set, the learner 220 may accept the operator's command from the input device 107.

続いて、学習器２２０は、選択されたハイパーパラメータセットと、ステップＳ５０１にて読み出した解析対象データを用いて、所定の学習を行って解析モデルの生成を行う（ステップＳ５０６）。 Subsequently, the learner 220 performs predetermined learning using the selected hyperparameter set and the analysis target data read in step S501 to generate an analysis model (step S506).

学習器２２０は、学習に用いるデータは解析対象データのうち、たとえば５０％のみを用い、生成したモデルの評価用に残りの５０％を用いて、１０フォールドクロスバリデーション等による精度の評価などを行う。解析対象データの残りの５０％等を利用してクロスバリデーションを実施することで、学習結果の過学習を抑制することが可能となる。 The learner 220 uses, for example, only 50% of the data to be analyzed as the data used for learning, and uses the remaining 50% for evaluation of the generated model to evaluate the accuracy by 10-fold cross-validation or the like. .. By performing cross-validation using the remaining 50% or the like of the data to be analyzed, it is possible to suppress overfitting of the learning result.

最後に、学習器２２０は生成したモデルの予測精度を結果として出力装置１０８に表示し、終了する（ステップＳ５０７）。ステップＳ５０７では、上述のように予測精度の高い順に学習により生成したモデルまたはハイパーパラメータセットを出力装置１０８に表示することができる。 Finally, the learner 220 displays the prediction accuracy of the generated model on the output device 108 as a result, and ends (step S507). In step S507, the model or hyperparameter set generated by learning in descending order of prediction accuracy as described above can be displayed on the output device 108.

本実施例１におけるプログラムの実行により、解析対象データに対し、最高の精度となると予測されたハイパーパラメータセットおよび前記最高の精度を瞬時に提示することが可能となる。 By executing the program in the first embodiment, it is possible to instantly present the hyperparameter set predicted to be the highest accuracy and the highest accuracy to the data to be analyzed.

次に、処理結果の保存および、オペレータが指定した１つまたは複数個のハイパーパラメータセットによる学習を行いデータ解析のためのモデルの生成処理における結果の保存について説明する。 Next, saving of the processing result and saving of the result in the model generation process for data analysis by learning with one or a plurality of hyperparameter sets specified by the operator will be described.

図６は、結果を保存する処理の一例を示すフローチャートである。本実施例１の保存処理は、オペレータがユーザインターフェイス等により保存ボタンを押下するなどの信号をＣＰＵ１０２に送信することにより精度予測器２１２や学習器２２０で開始される。学習器２２０の場合を用いて説明する。 FIG. 6 is a flowchart showing an example of the process of saving the result. The saving process of the first embodiment is started by the accuracy predictor 212 and the learning device 220 by transmitting a signal such as pressing the save button by the operator through the user interface or the like to the CPU 102. The case of the learner 220 will be described.

まず、学習器２２０は、保存する対象およびメモリ空間を選択する（ステップＳ６０１）。続いて、学習器２２０は、選択した対象を選択したメモリ空間または特定のメモリ空間へ保存する（ステップＳ６０２）。最後に、学習器２２０は、保存が成功した旨のメッセージを出力装置１０８に表示し、処理を終了する（ステップＳ６０３）。 First, the learner 220 selects a storage target and a memory space (step S601). Subsequently, the learner 220 saves the selected target in the selected memory space or a specific memory space (step S602). Finally, the learner 220 displays a message to the effect that the saving is successful on the output device 108, and ends the process (step S603).

図７は、以上の処理において出力装置１０８であるディスプレイ等に表示される画面の一例を示す図である。本実施例１における結果表示器２３０の実行により出力装置１０８には表示画面７００が表示される。 FIG. 7 is a diagram showing an example of a screen displayed on a display or the like which is an output device 108 in the above processing. By executing the result display 230 in the first embodiment, the display screen 700 is displayed on the output device 108.

異常検知装置１００のオペレータは、入力装置１０７を操作してＩｍｐｏｒｔｆｉｌｅボタン７０１を押下することで、ファイル選択画面（図示省略）からファイルを選択することにより、解析対象データを選択することができる。 The operator of the abnormality detection device 100 can select the data to be analyzed by operating the input device 107 and pressing the Import file button 701 to select a file from the file selection screen (not shown).

選択した解析対象データの名前は、Ｉｍｐｏｒｔｆｉｌｅｎａｍｅ欄７０５に出力される。その後、オペレータがＥｘｐｅｃｔｅｄｐｒｅｄｉｃｔｉｏｎボタン７０２を押下することで、解析対象データが期待予測精度予測器２１０に入力され、図３のフローチャートに代表されるような処理が行われ、結果表示器２３０によって結果表示領域７０８に結果が表示される。 The name of the selected analysis target data is output to the Import file name field 705. After that, when the operator presses the Branch prediction button 702, the analysis target data is input to the expected prediction accuracy predictor 210, processing as represented by the flowchart of FIG. 3 is performed, and the result display 230 displays the result. The result is displayed in area 708.

結果表示領域７０８に表示された期待予測精度予測器２１０の結果は、予測した期待予測精度のうち、予測精度が高かったもの上位数個におけるハイパーパラメータセットおよび前記期待予測精度を、前記期待予測精度が高いものから順に表によりまとめて表示したものなどが考えられる。 The result of the expected prediction accuracy predictor 210 displayed in the result display area 708 is the hyper parameter set and the expected prediction accuracy in the top few of the predicted expected prediction accuracy with the highest prediction accuracy, and the expected prediction accuracy. It is conceivable that the items with the highest value are displayed in a table in order.

オペレータは、結果表示領域７０８に表示されているラジオボタン７０９（あるいはチェックボックス等）により、任意のハイパーパラメータセットを選択することが可能である。オペレータは、ハイパーパラメータセットを選択した後、Ｓａｖｅボタン７０４を押下すると、保存したいメモリ空間を指定可能な画面が表示される。 The operator can select any hyperparameter set by the radio button 709 (or check box or the like) displayed in the result display area 708. When the operator presses the Save button 704 after selecting the hyperparameter set, a screen on which the memory space to be saved can be specified is displayed.

オペレータは、入力装置１０７を介してメモリ空間を指定し、実行ボタンを押下すると、指定したメモリ空間に選択した前記期待予測精度およびそのハイパーパラメータセットが保存される。保存した先のメモリ空間は、Ｅｘｐｏｒｔｆｉｌｅｎａｍｅ７０６などに出力される。 When the operator specifies a memory space via the input device 107 and presses the execute button, the selected expected prediction accuracy and its hyperparameter set are saved in the designated memory space. The saved destination memory space is output to an Export file name 706 or the like.

続いて、オペレータが学習させるハイパーパラメータセットを選択したい際には、Ｉｍｐｏｒｔｆｉｌｅボタン７０１を押下することで、ファイル選択画面（図示省略）から選択することにより、ハイパーパラメータセットを選択することができる。 Subsequently, when the operator wants to select the hyperparameter set to be learned, the hyperparameter set can be selected by pressing the Import file button 701 and selecting from the file selection screen (not shown).

Ｉｍｐｏｒｔｆｉｌｅボタン７０１を押下した際には、まず、選択したいファイルが解析対象データなのか、またはハイパーパラメータセットなのかを選択する機能があってもよい。選択したハイパーパラメータセットの名前は、Ｉｍｐｏｒｔｈｙｐｅｒ−ｐａｒａｍｅｔｅｒｎａｍｅ７０７に出力される。 When the Import file button 701 is pressed, there may be a function of first selecting whether the file to be selected is the data to be analyzed or the hyperparameter set. The name of the selected hyperparameter set is output to Import hyper-parameter name 707.

結果表示領域７０８に、前記ハイパーパラメータセットが表形式などで表示され、学習させるハイパーパラメータセットを、前記表に付随して表示されているラジオボタン７０９やチェックボックス等により選択可能とすることが考えられる。 It is conceivable that the hyperparameter set is displayed in the result display area 708 in a tabular format or the like, and the hyperparameter set to be trained can be selected by the radio button 709 or the check box displayed accompanying the table. Be done.

続いて、オペレータが、入力装置１０７を介してＴｒａｉｎｉｎｇボタン７０３を押下することで、選択したハイパーパラメータセットと、Ｉｍｐｏｒｔｆｉｌｅｎａｍｅ欄７０５に記載されている解析対象データを用いて、図５のフローチャートに代表されるような処理が行われ、結果表示領域７０８にモデルの評価結果が表示される。 Subsequently, when the operator presses the Training button 703 via the input device 107, the selected hyperparameter set and the analysis target data described in the Import file name column 705 are used in the flowchart of FIG. Processing as represented is performed, and the evaluation result of the model is displayed in the result display area 708.

前記モデルの評価結果を保存したい場合、入力装置１０７を介してＳａｖｅボタン７０４を押下すると、保存したいメモリ空間を指定可能な画面が表示される。オペレータは入力装置１０７を介してメモリ空間を指定し、実行ボタンを押下すると、指定したメモリ空間に前記モデル評価結果およびそのハイパーパラメータを含むパラメータセットが保存される。保存した先のメモリ空間は、Ｅｘｐｏｒｔｆｉｌｅｎａｍｅ７０６などに出力される。 When it is desired to save the evaluation result of the model, pressing the Save button 704 via the input device 107 displays a screen on which the memory space to be saved can be specified. When the operator specifies a memory space via the input device 107 and presses the execute button, the model evaluation result and a parameter set including the hyperparameters thereof are saved in the designated memory space. The saved destination memory space is output to an Export file name 706 or the like.

このようにして、２次特徴量生成器２１１および精度予測器２１２から構成される期待予測精度予測器２１０により、半年などの長期間を要さず、かつ専門家および高性能な解析用計算器を不要とし、学習器２２０で生成したモデルを適用することが可能となる。 In this way, the expected prediction accuracy predictor 210 composed of the secondary feature generator 211 and the accuracy predictor 212 does not require a long period of time such as half a year, and is an expert and high-performance computer for analysis. Is unnecessary, and the model generated by the learner 220 can be applied.

異常検知装置１００において、上記モデルを適用することで、最新のデータを用いた予測精度の高い異常検知が可能となる。異常検知装置１００で検知する異常の例としては、医療用画像診断装置により取得された画像から読影可能な疾病の検知や、遺伝子検査装置により取得された遺伝子発現データ配列を用いた疾病の検知や、生化学検査装置により取得された成分分析結果を用いた疾病の検知や、医薬品開発において行われた実験により取得された薬物の活性値を用いた薬剤候補成分の検知などが考えられる。 By applying the above model to the abnormality detection device 100, it is possible to detect an abnormality with high prediction accuracy using the latest data. Examples of abnormalities detected by the abnormality detection device 100 include detection of diseases that can be read from images acquired by a medical diagnostic imaging device, detection of diseases using a gene expression data sequence acquired by a genetic testing device, and detection of diseases. , Detection of diseases using component analysis results obtained by biochemical testing equipment, detection of drug candidate components using drug activity values obtained by experiments conducted in drug development, etc. can be considered.

また、ヘルスケア関連領域以外においては、機器や人に取り付けられたセンサーより取得されたデータを用いた異常動作の検知や、クレジットカードの審査の際にカード申請者から提出された年収や借金の有無等のデータを用いた例外の検知などが考えられる。 In areas other than healthcare-related areas, abnormal behavior is detected using data acquired from sensors attached to devices and people, and annual income and debt submitted by card applicants during credit card examinations. Exception detection using data such as presence / absence can be considered.

図８は、２次特徴量生成器２１１および精度予測器２１２の生成の一例の説明図である。以下では、２次特徴量生成を行う２次特徴量生成器２１１と、および期待予測精度の予測を行う精度予測器２１２の生成について図８を用いて説明する。２次特徴量生成器２１１および精度予測器２１２は、解析対象データを用いて期待予測精度を提示するより前に、予め生成しておく。 FIG. 8 is an explanatory diagram of an example of generation of the secondary feature amount generator 211 and the accuracy predictor 212. In the following, the generation of the secondary feature amount generator 211 that generates the secondary feature amount and the accuracy predictor 212 that predicts the expected prediction accuracy will be described with reference to FIG. The secondary feature quantity generator 211 and the accuracy predictor 212 are generated in advance before the expected prediction accuracy is presented using the analysis target data.

まず、２次特徴量生成器２１１の生成について説明する。２次特徴量生成器２１１の入力は１次特徴量であり、出力は２次特徴量である。１次特徴量は、予め設定された事前データとハイパーパラメータセットからなる。 First, the generation of the secondary feature amount generator 211 will be described. The input of the secondary feature generator 211 is the primary feature, and the output is the secondary feature. The primary feature quantity consists of preset preset data and a hyperparameter set.

ここで、事前データとは、たとえば、クレジットカードの審査に用いられる年収や借金の有無等のデータや、病院における血糖値を含む血液検査の結果などである。事前データは、オープンデータなどを活用し、大量に収集しておくことが考えられる。 Here, the prior data is, for example, data such as annual income and the presence or absence of debt used for credit card examination, and the result of a blood test including a blood glucose level in a hospital. It is conceivable to collect a large amount of advance data by utilizing open data.

収集しておく事前データは、欠損値を含むデータや、性別のように連続でない値が特徴量に含まれるデータ、など様々なバリエーションに富んでいることが望ましい。 It is desirable that the prior data to be collected is rich in various variations such as data including missing values and data in which non-continuous values such as gender are included in the features.

本実施例１では、予め用意した１つの事前データに対して、複数のハイパーパラメータセットで、精度予測を予め行った結果を事前精度予測結果（予測精度）として記憶媒体１０９等に保持しておく。 In the first embodiment, the result of performing accuracy prediction in advance with a plurality of hyperparameter sets for one pre-data prepared in advance is stored in a storage medium 109 or the like as a pre-accuracy prediction result (prediction accuracy). ..

つまり、事前データが１０個、ハイパーパラメータセットが５個であった場合には、５０サンプルの1次特徴量を用意することが可能である。５０サンプルの１次特徴量それぞれに対して、事前精度予測結果が存在することになる。 That is, when there are 10 prior data and 5 hyperparameter sets, it is possible to prepare a primary feature amount of 50 samples. Pre-accuracy prediction results will exist for each of the primary features of 50 samples.

事前精度予測結果は、ハイパーパラメータセットと事前データを機械学習に適用した場合に、例えば患者と非患者の識別であれば正しく２者を識別できた確率などで表現することができる。事前精度予測結果には、ハイパーパラメータセット内に記載のアルゴリズムおよびそのハイパーパラメータを用いた機械学習を行い、１０フォールドバリデーションなどを用いた精度評価結果を用いてもよい。 The pre-accuracy prediction result can be expressed by the probability that the two can be correctly distinguished when the hyperparameter set and the pre-data are applied to machine learning, for example, in the case of distinguishing between a patient and a non-patient. As the pre-accuracy prediction result, the algorithm described in the hyperparameter set and machine learning using the hyperparameters may be performed, and the accuracy evaluation result using 10 fold validation or the like may be used.

２次特徴量生成器２１１は、事前精度予測結果を精度予測器２１２により予測可能となるような２次特徴量を生成可能となるように設計する。 The secondary feature amount generator 211 is designed so that the secondary feature amount can be generated so that the pre-accuracy prediction result can be predicted by the accuracy predictor 212.

次に、精度予測器２１２の生成について説明する。精度予測器２１２の入力は２次特徴量生成器２１１が出力した２次特徴量であり、精度予測器２１２の出力は事前精度予測結果である。精度予測器２１２は、事前精度予測結果を、２次特徴量から正しく予測可能となるように設計する。 Next, the generation of the accuracy predictor 212 will be described. The input of the accuracy predictor 212 is the secondary feature amount output by the secondary feature amount generator 211, and the output of the accuracy predictor 212 is the pre-accuracy prediction result. The accuracy predictor 212 is designed so that the pre-accuracy prediction result can be correctly predicted from the secondary features.

２次特徴量生成器２１１および精度予測器２１２の設計は、異常検知装置１００の学習器２２０を用いて行ってもよい。期待予測精度予測器２１０が、学習器２２０を用いて２次特徴量生成器２１１および精度予測器２１２を生成しても良い。 The design of the secondary feature amount generator 211 and the accuracy predictor 212 may be performed by using the learner 220 of the abnormality detection device 100. The expected prediction accuracy predictor 210 may use the learner 220 to generate the secondary feature amount generator 211 and the accuracy predictor 212.

前記２次特徴量生成器２１１および精度予測器２１２の設計に、時間を要したとしても、解析対象データへ適用する際には学習を行わないため、上述のように半年などの長期間を要さず、かつ専門家および高性能な解析用計算器を不要とする本発明の効果には影響を及ぼさない。 Even if it takes time to design the secondary feature generator 211 and the accuracy predictor 212, it takes a long period of time such as half a year as described above because learning is not performed when applying the data to the analysis target data. It does not affect the effectiveness of the present invention, which does not require specialists and high-performance analytical computers.

また、２次特徴量生成器２１１および精度予測器２１２の設計を実現する手法は、機械学習に限定しない。さらに、２次特徴量は、２次特徴量生成器２１１を用いずに、２次特徴量を明示的に与えてもよい。明示的に与える２次特徴量として、事前データおよび解析対象データにおける、サンプル数、特徴量の次元数、欠損値の割合、年齢のような連続値の特徴量の割合、性別のような離散値の特徴量の割合、などが考えられる。 Further, the method for realizing the design of the secondary feature amount generator 211 and the accuracy predictor 212 is not limited to machine learning. Further, as the secondary feature amount, the secondary feature amount may be explicitly given without using the secondary feature amount generator 211. As the secondary features to be explicitly given, the number of samples, the number of dimensions of the features, the proportion of missing values, the proportion of continuous features such as age, and the discrete values such as gender in the preliminary data and the data to be analyzed. The ratio of the feature amount of is considered.

以上のように、異常検知装置１００は、予め用意した事前データについて複数のハイパーパラメータセットのそれぞれについて、精度の予測を実施して事前精度予測結果を算出する。そして、異常検知装置１００は、ハイパーパラメータセット毎に事前精度予測結果を出力（目標値）として、２次特徴量を入力とする精度予測器２１２を生成し、事前データとハイパーパラメータセットを入力とし、精度予測器２１２の入力とした２次特徴量を出力する２次特徴量生成器２１１を生成する。なお、２次特徴量生成器２１１および精度予測器２１２は、異常検知装置１００において生成されることに限定しない。予め、高性能な解析用計算機などを用いて生成しておいてもよい。 As described above, the abnormality detection device 100 predicts the accuracy of each of the plurality of hyperparameter sets for the pre-data prepared in advance, and calculates the pre-accuracy prediction result. Then, the abnormality detection device 100 generates an accuracy predictor 212 that inputs the secondary feature amount as an output (target value) of the pre-accuracy prediction result for each hyperparameter set, and inputs the pre-data and the hyperparameter set. , Generates a secondary feature amount generator 211 that outputs a secondary feature amount as an input of the accuracy predictor 212. The secondary feature amount generator 211 and the accuracy predictor 212 are not limited to being generated by the abnormality detection device 100. It may be generated in advance using a high-performance computer for analysis or the like.

本実施例１では、予め用意した複数のハイパーパラメータセットについて事前データから事前精度予測結果をハイパーパラメータ毎に算出し、事前データとハイパーパラメータセットを入力すると事前精度予測結果を出力するような２次特徴量生成器２１１及び精度予測器２１２を含む期待予測精度予測器２１０を生成する。 In the first embodiment, the pre-accuracy prediction result is calculated for each hyperparameter from the pre-data for a plurality of hyperparameter sets prepared in advance, and when the pre-data and the hyper-parameter set are input, the pre-accuracy prediction result is output. The expected prediction accuracy predictor 210 including the feature quantity generator 211 and the accuracy predictor 212 is generated.

そして、異常検知装置１００は、解析対象データと複数のハイパーパラメータセットの組み合わせを入力として期待予測精度予測器２１０により期待予測精度を算出し、解析対象のデータに対して、最高の精度となると予測されたハイパーパラメータの組み合わせと、最高の精度の値を迅速に評価することが可能となる。 Then, the abnormality detection device 100 calculates the expected prediction accuracy by the expected prediction accuracy predictor 210 by inputting the combination of the analysis target data and the plurality of hyperparameter sets, and predicts that the highest accuracy will be obtained with respect to the analysis target data. It is possible to quickly evaluate the combination of hyperparameters and the value with the highest accuracy.

これにより、機械学習のハイパーパラメータ探索において、半年などの長期間を要することなく、かつ専門家および高性能な解析用の計算機を不要とすることが期待できる。なお、期待予測精度は、当該ハイパーパラメータを用いてモデルを生成した場合に達成可能と見込まれる予測精度である。 As a result, it can be expected that the hyperparameter search of machine learning does not require a long period of time such as half a year, and does not require a specialist and a high-performance computer for analysis. The expected prediction accuracy is the prediction accuracy that is expected to be achievable when a model is generated using the hyperparameters.

次に、実施例２の異常検知装置１００について説明する。実施例１の装置では、期待予測精度を予測する装置としていた。一方、実施例２の異常検知装置１００は、予測した期待予測精度を用いて、解析対象データの予測精度を向上させるために必要なデータ成分を特定する機能を有する例を示す。 Next, the abnormality detection device 100 of the second embodiment will be described. The device of the first embodiment is a device that predicts the expected prediction accuracy. On the other hand, the abnormality detection device 100 of the second embodiment shows an example having a function of specifying a data component necessary for improving the prediction accuracy of the data to be analyzed by using the predicted expected prediction accuracy.

図９は、実施例２で実現する機能を説明する概念図である。本実施例２において、具体的な説明を行うため、２次特徴量は２次特徴量生成器２１１を用いずに、２次特徴量を明示的に与えたものとして説明する。 FIG. 9 is a conceptual diagram illustrating the function realized in the second embodiment. In the second embodiment, in order to give a specific explanation, the secondary feature amount will be described as if the secondary feature amount is explicitly given without using the secondary feature amount generator 211.

まず、事前データにおいて、いずれかのハイパーパラメータセットにおいて予測精度が８５％以上であったデータ群をグループＡとし、７０％以下だったデータ群をグループＢとする。 First, in the prior data, the data group having a prediction accuracy of 85% or more in any of the hyperparameter sets is referred to as group A, and the data group having a prediction accuracy of 70% or less is referred to as group B.

各グループの中で、２次特徴量のそれぞれの項目における平均値を算出する。解析対象データを期待予測精度予測器２１０に適用し、得られた期待予測精度が７０％など、所望の値より低かった場合に、グループＡの各２次特徴量の差と比較し、差異が大きい項目を選択する。 Within each group, the average value of each item of the secondary feature amount is calculated. When the data to be analyzed is applied to the expected prediction accuracy predictor 210 and the obtained expected prediction accuracy is lower than the desired value such as 70%, the difference is compared with the difference between the secondary features of Group A. Select a large item.

異常検知装置１００は、選択された項目に関して、グループＡの値に近づけるために、解析対象データをどう変更するとよいかの提言を提示する。たとえば、サンプル数がグループＡの平均値は１万個、解析対象データの値が１００個であった場合、サンプル数を増やす旨の指示を提示する、などである。 The anomaly detection device 100 presents a proposal on how to change the analysis target data in order to bring the selected item closer to the value of group A. For example, when the average value of the number of samples in group A is 10,000 and the value of the data to be analyzed is 100, an instruction to increase the number of samples is presented.

これにより、異常検知装置１００は、予測精度を向上させるために、具体的にデータをどう変更すればよいかを、オペレータに提示することが可能となる。提示する項目数は、予め個数を設定しておいてもよいし、２次特徴量の項目ごとに提示するようにする差異の閾値を設けておいてもよい。 As a result, the abnormality detection device 100 can present to the operator how to specifically change the data in order to improve the prediction accuracy. The number of items to be presented may be set in advance, or a threshold value of difference may be set so as to be presented for each item of the secondary feature amount.

また、予測精度に応じたグループを２つだけではなく、ハイパーパラメータセットによる予測精度の違いの観点から、ハイパーパラメータセットごとにグループＡおよびＢを設けてもよい。あるいは、２次特徴量の中に、ハイパーパラメータセットを含めてもよい。 Further, not only two groups according to the prediction accuracy but also groups A and B may be provided for each hyperparameter set from the viewpoint of the difference in prediction accuracy depending on the hyperparameter set. Alternatively, the hyperparameter set may be included in the secondary features.

さらに、いずれかのハイパーパラメータセットにおいて予測精度が８５％以上であったデータ群をグループＡ、７０％以下だったデータ群をグループＢとしたが、閾値とする数値は８５％を９０％や、７０％を６０％など、任意の値に変更してもよいし、グループを２つだけではなく、３つや４つなど増やしてもよい。 Furthermore, the data group whose prediction accuracy was 85% or more in any of the hyperparameter sets was group A, and the data group whose prediction accuracy was 70% or less was group B. 70% may be changed to any value such as 60%, and the number of groups may be increased to 3 or 4 instead of 2 groups.

また、上記では、２次特徴量を明示的に与えた場合において説明したが、２次特徴量生成器２１１を用いて生成した２次特徴量を用いてもよいし、１次特徴量を用いてもよい。また、解析対象データを期待予測精度予測器２１０に適用し、得られた期待予測精度が７０％など、所望の値より低かった場合に、グループＡの各２次特徴量の差と比較し、差異が大きい項目を選択したが、異なる手法で予測精度が低い原因と考えられる項目を選択してもよい。 Further, in the above description, the case where the secondary feature amount is explicitly given has been described, but the secondary feature amount generated by using the secondary feature amount generator 211 may be used, or the primary feature amount may be used. You may. Further, when the data to be analyzed is applied to the expected prediction accuracy predictor 210 and the obtained expected prediction accuracy is lower than a desired value such as 70%, it is compared with the difference in each secondary feature amount of Group A. Although the item with a large difference is selected, the item considered to be the cause of the low prediction accuracy may be selected by a different method.

また、２次特徴量の差をグループＡおよびＢと比較するのではなく、事前データの特徴量または１次特徴量または２次特徴量から、グループＡおよびＢのどちらであるのかを識別する機械学習モデルを生成し、解析対象データの特徴量または１次特徴量または２次特徴量を入力し、識別されたグループから解析対象データの評価を行ってもよい。また、グループＡおよびＢのどちらであるのかを識別する方法は、機械学習ではなく、マルコフ連鎖モンテカルロ法などの周知または公知の統計的な手法を用いても良い。 Further, instead of comparing the difference in the secondary features with the groups A and B, a machine that identifies whether the feature is in groups A or B from the features in the prior data, the primary features, or the secondary features. A learning model may be generated, a feature amount or a primary feature amount or a secondary feature amount of the analysis target data may be input, and the analysis target data may be evaluated from the identified group. Further, as a method for identifying which of the groups A and B is, a well-known or known statistical method such as a Markov chain Monte Carlo method may be used instead of machine learning.

以上のように、本実施例２の異常検知装置１００では、期待予測精度が閾値以下の解析対象データについて、期待予測精度が閾値を超える解析対象データの各２次特徴量の差と比較し、差異が大きい項目を選択し、解析対象データを改善する指示を出力することができる。これにより、予測精度を向上させるために必要なデータの成分を特定することが可能となる。 As described above, in the abnormality detection device 100 of the second embodiment, the analysis target data whose expected prediction accuracy is equal to or less than the threshold is compared with the difference in each secondary feature amount of the analysis target data whose expected prediction accuracy exceeds the threshold. It is possible to select an item with a large difference and output an instruction to improve the data to be analyzed. This makes it possible to identify the components of the data required to improve the prediction accuracy.

次に、実施例３の異常検知装置１００について説明する。実施例１の異常検知装置１００では、期待予測精度を予測する装置としていた。実施例３の異常検知装置１００は、解析対象データのデータ解析を行う前に、当該解析対象データが解析に値するデータであるか否かを判断する機能を有する例を示す。 Next, the abnormality detection device 100 of the third embodiment will be described. The abnormality detection device 100 of the first embodiment is a device that predicts the expected prediction accuracy. The abnormality detection device 100 of the third embodiment shows an example having a function of determining whether or not the analysis target data is worthy of analysis before performing data analysis of the analysis target data.

機械学習によって高精度な予測が不可能な解析対象データには、重要な特徴量が含まれている割合が少なく、解析を行っても何も得られる情報がない可能性がある。たとえば、現在、糖尿病と診断するために必要な項目とされている早朝空腹時血糖値や随時血糖値などを医学研究により診断に必要な値であることを明らかにしたい場合において、早朝空腹時血糖値や随時血糖値が解析対象データに含まれていない場合、明らかにしたい糖尿病と診断するために必要な項目が含まれていないため、解析対象データの解析を行っても糖尿病と診断するために必要な項目を見つけ出すことは不可能である。 The data to be analyzed, which cannot be predicted with high accuracy by machine learning, contains a small proportion of important features, and there is a possibility that no information can be obtained by analysis. For example, when it is desired to clarify by medical research that the early morning fasting blood glucose level and the occasional blood glucose level, which are currently necessary items for diagnosing diabetes, are the values necessary for the diagnosis, the early morning fasting blood glucose level. If the value or occasional blood glucose level is not included in the analysis target data, the items necessary for diagnosing diabetes that you want to clarify are not included, so even if you analyze the analysis target data, you can diagnose diabetes. It is impossible to find the item you need.

ここで、解析対象データの解析とは、医学研究であれば周知または公知のロジスティック回帰など、各分野においてゴールドスタンダードとして行われてきた解析方法や、特定の目的に沿うように開発された解析方法などを用いた解析である。このように、解析対象データには、解析目標に十分寄与するような重要な特徴量が含まれている必要がある。 Here, the analysis of the data to be analyzed is an analysis method that has been performed as a gold standard in each field, such as logistic regression that is well known or known in medical research, and an analysis method that has been developed to meet a specific purpose. It is an analysis using. As described above, the data to be analyzed needs to include important features that sufficiently contribute to the analysis target.

しかし、未知の事象の解析を行う場合に、前記重要な特徴量が解析対象データに含まれていることを知ることはできない。そこで、解析を行う前に、前記実施例１の図３のフローチャートで実現されるような期待予測精度予測器２１０を適用することが考えられる。 However, when analyzing an unknown event, it is not possible to know that the important feature amount is included in the analysis target data. Therefore, before performing the analysis, it is conceivable to apply the expected prediction accuracy predictor 210 as realized by the flowchart of FIG. 3 of the first embodiment.

機械学習による予測精度においても、前記重要な特徴量の有無が精度を左右する。期待予測精度が所定の閾値を超えて高い場合には、解析対象データに前記重要な特徴量が含まれており、期待予測精度が所定の閾値以下の低い場合には、解析対象データに前記重要な特徴量が含まれていないと判断することが可能である。異常検知装置１００は、期待予測精度が閾値以下の場合には、解析を行っても有意な結果が得られないと判定して、データの解析を中止、またはオペレータに提示することができる。 Even in the prediction accuracy by machine learning, the presence or absence of the important feature amount affects the accuracy. When the expected prediction accuracy is higher than the predetermined threshold, the analysis target data contains the important feature amount, and when the expected prediction accuracy is lower than the predetermined threshold, the analysis target data is said to be important. It is possible to judge that the characteristic amount is not included. When the expected prediction accuracy is equal to or less than the threshold value, the abnormality detection device 100 can determine that no significant result can be obtained even if the analysis is performed, and can stop the analysis of the data or present it to the operator.

図１０は、本実施例３を実現する処理の一例を示すフローチャートである。まず、異常検知装置１００は、メモリ１０３に解析対象データを読み出す（ステップＳ１００１）。続いて、異常検知装置１００は、解析対象データとハイパーパラメータセットからなる１次特徴量を参照して、２次特徴量生成器２１１により２次特徴量を生成する（ステップＳ１００２）。 FIG. 10 is a flowchart showing an example of processing for realizing the third embodiment. First, the abnormality detection device 100 reads the analysis target data into the memory 103 (step S1001). Subsequently, the abnormality detection device 100 refers to the primary feature amount composed of the analysis target data and the hyperparameter set, and generates the secondary feature amount by the secondary feature amount generator 211 (step S1002).

次に、異常検知装置１００は、生成した２次特徴量を用いて、精度予測器２１２により期待予測精度を予測する（ステップＳ１００３）。続いて、異常検知装置１００は、予測した期待予測精度から、データ解析を行うか否かの判定を行う（ステップＳ１００４）。 Next, the abnormality detection device 100 predicts the expected prediction accuracy by the accuracy predictor 212 using the generated secondary feature amount (step S1003). Subsequently, the abnormality detection device 100 determines whether or not to perform data analysis based on the predicted expected prediction accuracy (step S1004).

異常検知装置１００は、期待予測精度が所定の閾値以上であればステップＳ１００５へ進み、データ解析を行わないと判断した場合は、終了する。 If the expected prediction accuracy is equal to or higher than a predetermined threshold value, the abnormality detection device 100 proceeds to step S1005, and if it determines that the data analysis is not performed, the abnormality detection device 100 ends.

データ解析を行うと判断した場合、異常検知装置１００は、後述するデータ解析を行ってから処理を終了する（ステップＳ１００５）。 When it is determined that the data analysis is to be performed, the abnormality detection device 100 performs the data analysis described later and then ends the process (step S1005).

ステップＳ１００４における期待予測精度の閾値は任意に設定可能としてもよいし、特定の値にしておいてもよい。また、データ解析手法の１つとして、以下の様な手法を使用してもよい。 The threshold value of the expected prediction accuracy in step S1004 may be arbitrarily set or may be set to a specific value. Further, as one of the data analysis methods, the following methods may be used.

図１１はデータ解析手法の１例を示すフローチャートである。この処理は、図１０のステップＳ１００５で行われる。 FIG. 11 is a flowchart showing an example of a data analysis method. This process is performed in step S1005 of FIG.

まず、異常検知装置１００は、メモリ１０３に解析対象データを読み出す（ステップＳ１１０１）。異常検知装置１００は、読みだした解析対象データの各特徴量と、後述するサンプルごとに記載されているラベルに相関関係があるかどうかを、非線形相関により算出し、相関係数が所定の閾値を超えたもののうち、上位の特徴量を選択する（ステップＳ１１０２）。 First, the abnormality detection device 100 reads the analysis target data into the memory 103 (step S1101). The anomaly detection device 100 calculates by non-linear correlation whether or not there is a correlation between each feature amount of the read analysis target data and the label described for each sample described later, and the correlation coefficient is a predetermined threshold value. Among those exceeding the above, the higher feature amount is selected (step S1102).

ここで、解析対象データの各特徴量とは、クレジットカードの審査に用いられる年収や借金の有無等のデータや、病院における血糖値を含む血液検査の結果などである。また、サンプルごとに記載されているラベルとは、例えば、解析対象データのレコード毎に予め付加されたもので、当該データは患者のデータであるか否か、あるいは機器が故障した状態のデータであるか否か、などの異常を検知した状態とそうでない（異常が検知されていない）状態とを識別する情報である。 Here, each feature amount of the data to be analyzed is data such as annual income and the presence or absence of debt used for credit card examination, and the result of a blood test including a blood glucose level in a hospital. Further, the label described for each sample is, for example, a label added in advance for each record of the data to be analyzed, and the data is data on whether or not the data is patient data or data in a state where the device has failed. This is information that distinguishes between a state in which an abnormality such as presence / absence is detected and a state in which an abnormality is not detected (no abnormality is detected).

また、上記では、相関係数の算出に非線形相関を用いたが、線形相関を用いても良いし、相関係数以外の手法によりラベルと解析対象データの各特徴量間に存在する関係性の高さを評価してもよい。 Further, in the above, the non-linear correlation is used for the calculation of the correlation coefficient, but the linear correlation may be used, or the relationship existing between the label and each feature amount of the data to be analyzed by a method other than the correlation coefficient. The height may be evaluated.

続いて、異常検知装置１００は、ステップＳ１１０２により選択した解析対象データの各特徴量の１個ずつに対して、機械学習により精度予測を行う（ステップＳ１１０３）。機械学習による精度予測は、前記実施例１で予測した精度が高いハイパーパラメータセットを採用することで、処理を迅速に行うことができる。 Subsequently, the abnormality detection device 100 predicts the accuracy of each feature amount of the analysis target data selected in step S1102 by machine learning (step S1103). The accuracy prediction by machine learning can be performed quickly by adopting the hyperparameter set with high accuracy predicted in the first embodiment.

さらに、異常検知装置１００は、ステップＳ１１０２により選択した解析対象データの各特徴量において２個ずつのペアの組み合わせを生成する（ステップＳ１１０４）。 Further, the abnormality detection device 100 generates a combination of two pairs in each feature amount of the analysis target data selected in step S1102 (step S1104).

異常検知装置１００は、上記ステップ１１０４Ｓにより生成した組み合わせに対し、機械学習により精度予測を行う（ステップＳ１１０５）。この精度予測も、前記実施例１で予測した精度が高いハイパーパラメータセットを採用することで、処理を迅速に行うことができる。 The abnormality detection device 100 predicts the accuracy of the combination generated in step 1104S by machine learning (step S1105). This accuracy prediction can also be performed quickly by adopting the hyperparameter set with high accuracy predicted in the first embodiment.

異常検知装置１００は、ステップＳ１００３およびステップＳ１００５において、算出した２種類の予測精度および１０フォールドクロスバリデーション結果を用いた重要ペア特徴量の選択規則（後述）に従い、重要ペア特徴量を選択する（ステップＳ１１０６）。 The abnormality detection device 100 selects the important pair feature amount according to the selection rule (described later) of the important pair feature amount using the two types of prediction accuracy calculated and the 10-fold cross-validation result in step S1003 and step S1005 (step). S1106).

異常検知装置１００は、上記ステップＳ１１０６で選択した複数の重要ペア特徴量同士の関係性を選択して処理を終了する（ステップＳ１１０７）。 The abnormality detection device 100 selects the relationship between the plurality of important pair feature quantities selected in step S1106 and ends the process (step S1107).

ここで、重要ペア特徴量とは１つの解析対象データの特徴量では予測精度は高くならないものの、解析対象データの特徴量をペアにすることで期待予測精度が向上するような組み合わせとなる特徴量を指す。 Here, the important pair feature amount is a feature amount that is a combination that improves the expected prediction accuracy by pairing the feature amount of the analysis target data, although the prediction accuracy does not increase with the feature amount of one analysis target data. Point to.

重要ペア特徴量選択の規則について説明する。異常検知装置１００は、上記ステップＳ１１０３において算出した単独の期待予測精度と、ステップＳ１１０５において算出したペアの期待予測精度の差が所定の閾値を上回るような解析対象データの特徴量のペアを１次重要特徴量として選択する。 The rules for selecting important pair features will be described. The anomaly detection device 100 primary sets a pair of feature quantities of analysis target data such that the difference between the single expected prediction accuracy calculated in step S1103 and the expected prediction accuracy of the pair calculated in step S1105 exceeds a predetermined threshold value. Select as an important feature.

ここで、単独の予測精度はペアとした解析対象データの特徴量の２種類に関して存在するが、値が高い方とペア予測精度の差を算出することで、ペアにすることで予測精度が向上するような組み合わせを選択しやすくなると考えられる。 Here, there are two types of individual prediction accuracy for the feature amount of the data to be analyzed as a pair, but the prediction accuracy is improved by making a pair by calculating the difference between the higher value and the pair prediction accuracy. It is thought that it will be easier to select such a combination.

次に、異常検知装置１００は、１次重要特徴量となったペアにおいて、ステップＳ１１０３およびステップＳ１１０５において算出した１０フォールドクロスバリデーションの平均値が閾値（例えば７０％）以上であったペアのみを重要特徴量として選択する。 Next, the abnormality detection device 100 is important only for the pair in which the average value of 10 fold cross validation calculated in step S1103 and step S1105 is equal to or more than the threshold value (for example, 70%) among the pairs that have become the primary important features. Select as a feature quantity.

クロスバリデーションは、１０でなく５などの数値でもよい。また、クロスバリデーションの平均値の閾値は７０％ではなく９０％などでもよいし、回帰の場合は実際の数値や決定係数などであってもよい。 The cross-validation may be a numerical value such as 5 instead of 10. Further, the threshold value of the average value of cross validation may be 90% instead of 70%, and in the case of regression, it may be an actual numerical value or a coefficient of determination.

異常検知装置１００は、クロスバリデーションの値を用いることで、予測精度の確からしさを判断することが可能になる。クロスバリデーションの値を用いずに、実行してもよいが、過学習により単独の予測精度およびペアの予測精度が高く、実際には予測が容易ではないペアを重要ペア特徴量として選択してしまうことの回避に対する難易度は向上することが予想される。 The abnormality detection device 100 can determine the certainty of the prediction accuracy by using the cross-validation value. It may be executed without using the cross-validation value, but overfitting selects a pair as an important pair feature quantity, which has high single prediction accuracy and pair prediction accuracy and is not actually easy to predict. It is expected that the difficulty of avoiding this will increase.

ステップＳ１１０６における、選択した複数の重要ペア特徴量同士の関係性について説明する。例えば、選択された重要ペア特徴量が、特徴量Ａと特徴量Ｂ、特徴量Ｂと特徴量Ｃ、特徴量Ｃと特徴量Ａであった場合を考える。 The relationship between the plurality of selected important pair feature quantities in step S1106 will be described. For example, consider the case where the selected important pair feature amounts are feature amount A and feature amount B, feature amount B and feature amount C, and feature amount C and feature amount A.

特徴量Ａ、特徴量Ｂおよび特徴量Ｃは、互いに重要特徴量であることから、これら３つの特徴量で精度を予測した場合において、単独で予測した場合より予測精度は向上すると考えられる。さらには、これら３つの特徴量で精度を予測した場合において、ペアで予測した場合より期待予測精度が向上する可能性もある。 Since the feature amount A, the feature amount B, and the feature amount C are important feature amounts to each other, it is considered that the prediction accuracy is improved when the accuracy is predicted by these three feature amounts as compared with the case where the prediction is made independently. Furthermore, when the accuracy is predicted by these three features, the expected prediction accuracy may be improved as compared with the case where the prediction is made in pairs.

このように重要ペア特徴量を抽出することにより、２つの特徴量だけでなく、３つ以上の特徴量において、特徴量を増やすことで予測精度が向上する関係にある特徴量を発見することが可能になる。また、上記ステップ１１０４Ｓにより生成した組み合わせである２種の特徴量に対して、非線形相関を実施し、相関が低かったペアをより重要な重要ペア特徴量であると示してもよい。特徴量間に相関がまったくないにも関わらず、図１１に示すフローにより重要ペア特徴量であるとされた特徴量は、線形な手法を用いた解析では発見されづらい特徴量と考えることができる。このような解析手法を用いることで、医学研究で用いられることが多い線形の手法では発見できなかった関係性を発見可能となることが期待できる。また、前記データ解析手法は、解析対象データに適用する以外にも、解析対象データを参照して類似するデータを生成する技術によって生成されたデータに対して適用してもよい。 By extracting the important pair features in this way, it is possible to discover not only two features but also features having a relationship of improving prediction accuracy by increasing the features in three or more features. It will be possible. In addition, a non-linear correlation may be performed on the two types of features that are the combination generated in step 1104S, and the pair with a low correlation may be indicated as a more important important pair feature. Although there is no correlation between the features, the features that are considered to be important pair features by the flow shown in FIG. 11 can be considered to be difficult to find by the analysis using the linear method. .. By using such an analysis method, it can be expected that relationships that could not be found by the linear method often used in medical research can be discovered. In addition to applying the data analysis method to the data to be analyzed, the data analysis method may be applied to data generated by a technique for generating similar data by referring to the data to be analyzed.

以上のように、本実施例３によれば、解析対象データに前記重要な特徴量が含まれており、期待予測精度が低い場合には、解析対象データに前記重要な特徴量が含まれていないと判断することが可能である。この場合、解析を行っても有意な結果が得られないと判定して、データの解析を中止することができる。 As described above, according to the third embodiment, the analysis target data includes the important feature amount, and when the expected prediction accuracy is low, the analysis target data includes the important feature amount. It is possible to judge that there is no such thing. In this case, it can be determined that no significant result can be obtained even if the analysis is performed, and the analysis of the data can be stopped.

また、本実施例３によれば、２次特徴量をペアにすることで予測精度が向上するような組み合わせを重要ペア特徴量として抽出することが可能となる。 Further, according to the third embodiment, it is possible to extract as an important pair feature amount a combination in which the prediction accuracy is improved by pairing the secondary feature amount.

次に、実施例４の異常検知装置１００について説明する。実施例１の装置では、期待予測精度を予測する装置として機能する例を示した。本実施例４の異常検知装置１００は、解析対象データの精製手法の良し悪しを指標（期待予測精度）に基づいて判定する機能を有する例を示す。 Next, the abnormality detection device 100 of the fourth embodiment will be described. In the device of the first embodiment, an example of functioning as a device for predicting the expected prediction accuracy is shown. The abnormality detection device 100 of the fourth embodiment shows an example having a function of determining the quality of the purification method of the data to be analyzed based on an index (expected prediction accuracy).

解析対象データは、取得したデータをそのまま使用することも可能であるが、データ精製（クレンジング）により真に重要（あるいは有用）な特徴量を抽出したデータを使用することもある。 As the data to be analyzed, the acquired data can be used as it is, but the data obtained by extracting the truly important (or useful) features by data purification (cleansing) may be used.

たとえば、特徴量の数が１０万個を超えるような場合、１０万個のすべてが意味のある特徴量ではなく、解析したい問題に対してまったく無関係の特徴量も多く含まれている可能性が高い。 For example, if the number of features exceeds 100,000, not all 100,000 are meaningful features, and there may be many features that are completely unrelated to the problem to be analyzed. high.

そこで、解析に対して無関係の特徴量が多く含まれる場合には、期待予測精度に悪影響を与える可能性がある。また、特徴量の中に、文字が含まれる場合や、画像と検査数値の様な種類の違う特徴量が含まれる場合などにおいては、同一の特徴量ベクトルとして扱えるように前処理を行う必要がある。 Therefore, if a large number of features irrelevant to the analysis are included, the expected prediction accuracy may be adversely affected. In addition, when characters are included in the features, or when different types of features such as images and inspection values are included, it is necessary to perform preprocessing so that they can be treated as the same feature vector. is there.

これらの事前処理には様々な手法や、様々な観点における重要特徴量の決定方法が存在しうるため、解析対象データを１パターンだけでなく、ひとつの解析対象データについて異なる種類の事前処理を施したデータを複数パターン生成し、いずれのパターンが最も期待予測精度が高くなるのか試すこととなる。 Since there may be various methods and methods for determining important features from various viewpoints in these preprocessing, different types of preprocessing are applied not only to one pattern of analysis target data but also to one analysis target data. It is necessary to generate a plurality of patterns of the generated data and test which pattern has the highest expected prediction accuracy.

そこで、本実施例４を用いることで、複数種類の解析対象データを、図２のフローチャートに代表されるような期待予測精度予測器２１０に適用し、入力した複数種類の解析対象データの中から、期待予測精度が高い順に提示することで、どの解析対象データ生成方法が良好であったかを、瞬時に判断することを可能とする。 Therefore, by using the fourth embodiment, a plurality of types of analysis target data are applied to the expected prediction accuracy predictor 210 as represented by the flowchart of FIG. 2, and the input multiple types of analysis target data are selected. By presenting in descending order of expected prediction accuracy, it is possible to instantly determine which analysis target data generation method was good.

図１２は本実施例４を説明する図である。ひとつのオリジナルの解析対象データから、たとえば３種類のデータ精製（またはクレンジング）方法である手法Ａ、手法Ｂおよび手法Ｃを用いて、３種類の解析対象データとして解析対象データＡ、解析対象データＢ、解析対象データＣを精製する。 FIG. 12 is a diagram illustrating the fourth embodiment. From one original analysis target data, for example, using three types of data purification (or cleansing) methods, method A, method B, and method C, analysis target data A and analysis target data B are used as three types of analysis target data. , The data C to be analyzed is purified.

解析対象データＡ〜Ｃのそれぞれに対して、前記実施例１の図２に示したフローチャートに示した期待予測精度予測器２１０を適用し、最も期待予測精度が高かった精度を提示する。３種の解析対象データ（データＡ〜Ｃ）における期待予測精度を比較し、最も期待予測精度が高かった解析対象データを精製した手法が３つのデータ精製手法の中で最も良い手法であると考えられる。 The expected prediction accuracy predictor 210 shown in the flowchart shown in FIG. 2 of the first embodiment is applied to each of the analysis target data A to C, and the accuracy with the highest expected prediction accuracy is presented. It is considered that the method of comparing the expected prediction accuracy of the three types of analysis target data (data A to C) and purifying the analysis target data with the highest expected prediction accuracy is the best method among the three data purification methods. Be done.

本実施例４の異常検知装置１００は、最も期待予測精度が高かった精製方法（データクレンジング）を、解析対象データの前処理として採用することができる。 The abnormality detection device 100 of the fourth embodiment can adopt the purification method (data cleansing) having the highest expected prediction accuracy as the preprocessing of the data to be analyzed.

この知見を活かし、新たなデータ精製手法Ｄなどを実施して、同様の手順により他のデータ精製手法と比較することができる。期待予測精度予測器２１０を用いることで、複数のデータ精製手法の中からもっともよいデータ精製の手法を選択し、データ解析を行いたい場合においても、半年などの長期間を要さず、かつ専門家および高性能な解析用計算器を不要とすることが可能となる。 Utilizing this knowledge, a new data purification method D or the like can be implemented and compared with other data purification methods by the same procedure. By using the Expected Prediction Accuracy Predictor 210, even if you want to select the best data purification method from a plurality of data purification methods and perform data analysis, it does not require a long period of time such as half a year and is specialized. It makes it possible to eliminate the need for homes and high-performance analytical calculators.

＜まとめ＞
なお、本発明は上記した実施例に限定されるものではなく、様々な変形例が含まれる。例えば、上記した実施例は本発明を分かりやすく説明するために詳細に記載したものであり、必ずしも説明した全ての構成を備えるものに限定されるものではない。また、ある実施例の構成の一部を他の実施例の構成に置き換えることが可能であり、また、ある実施例の構成に他の実施例の構成を加えることも可能である。また、各実施例の構成の一部について、他の構成の追加、削除、又は置換のいずれもが、単独で、又は組み合わせても適用可能である。 <Summary>
The present invention is not limited to the above-described examples, and includes various modifications. For example, the above-described embodiment is described in detail in order to explain the present invention in an easy-to-understand manner, and is not necessarily limited to the one including all the configurations described. Further, it is possible to replace a part of the configuration of one embodiment with the configuration of another embodiment, and it is also possible to add the configuration of another embodiment to the configuration of one embodiment. Further, for a part of the configurations of each embodiment, any of addition, deletion, or replacement of other configurations can be applied alone or in combination.

また、上記の各構成、機能、処理部、及び処理手段等は、それらの一部又は全部を、例えば集積回路で設計する等によりハードウェアで実現してもよい。また、上記の各構成、及び機能等は、プロセッサがそれぞれの機能を実現するプログラムを解釈し、実行することによりソフトウェアで実現してもよい。各機能を実現するプログラム、テーブル、ファイル等の情報は、メモリや、ハードディスク、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）等の記録装置、または、ＩＣカード、ＳＤカード、ＤＶＤ等の記録媒体に置くことができる。 Further, each of the above configurations, functions, processing units, processing means and the like may be realized by hardware by designing a part or all of them by, for example, an integrated circuit. Further, each of the above configurations, functions, and the like may be realized by software by the processor interpreting and executing a program that realizes each function. Information such as programs, tables, and files that realize each function can be stored in a memory, a hard disk, a recording device such as an SSD (Solid State Drive), or a recording medium such as an IC card, an SD card, or a DVD.

また、制御線や情報線は説明上必要と考えられるものを示しており、製品上必ずしも全ての制御線や情報線を示しているとは限らない。実際には殆ど全ての構成が相互に接続されていると考えてもよい。 In addition, the control lines and information lines indicate those that are considered necessary for explanation, and do not necessarily indicate all the control lines and information lines in the product. In practice, it can be considered that almost all configurations are interconnected.

１００異常検知装置
１０１データバス
１０２ＣＰＵ
１０３メモリ
１０４入出力インターフェイス
１０５通信装置
１０６ドライブ装置
１０７入力装置
１０８出力装置
１０９記憶媒体
２１０期待予測精度予測器
２１１２次特徴量生成器
２１２精度予測器
２２０学習器
２３０結果表示器 100 Anomaly detection device 101 Data bus 102 CPU
103 Memory 104 Input / output interface 105 Communication device 106 Drive device 107 Input device 108 Output device 109 Storage medium 210 Expected prediction accuracy predictor 211 Secondary feature quantity generator 212 Accuracy predictor 220 Learner 230 Result display

Claims

A computer with a processor and memory is a hyperparameter evaluation method that evaluates hyperparameters applicable to the data to be analyzed.
A first step in which the computer generates a secondary feature amount generation unit and an accuracy prediction unit from preset preset data and a plurality of hyperparameters.
The second step in which the computer receives the data to be analyzed, and
A third step in which the secondary feature amount generation unit calculates the secondary feature amount by inputting the analysis target data and a plurality of hyperparameters.
The fourth step in which the accuracy prediction unit calculates the expected prediction accuracy for each hyperparameter by inputting the secondary feature amount, and
A method for evaluating hyperparameters, which comprises.

The hyperparameter evaluation method according to claim 1.
The third step is
A method for evaluating hyperparameters, which comprises inputting the analysis target data and a plurality of hyperparameters as primary feature quantities and calculating secondary feature quantities.

The hyperparameter evaluation method according to claim 1.
The fourth step is
A method for evaluating hyperparameters, which comprises calculating the expected prediction accuracy expected to be achievable for each hyperparameter by inputting the secondary feature amount.

The hyperparameter evaluation method according to claim 1.
A fifth step in which the computer outputs the calculated expected prediction accuracy for each hyperparameter, and
The sixth step in which the computer accepts the selected hyperparameters among the output hyperparameters, and
A seventh step in which the computer generates a model for analyzing the analysis target data based on the selected hyperparameters.
A method for evaluating hyperparameters, which further comprises.

The hyperparameter evaluation method according to claim 1.
The computer acquires the secondary features and the expected prediction accuracy of the plurality of analysis target data, and for the analysis target data whose expected prediction accuracy is equal to or less than the threshold, each of the analysis target data whose expected prediction accuracy exceeds the threshold. The eighth step of selecting an item with a large difference compared to the difference in the secondary feature amount, and
A method for evaluating hyperparameters, which further comprises.

The hyperparameter evaluation method according to claim 1.
A ninth step in which the computer determines that when the expected prediction accuracy is equal to or less than a predetermined threshold value, no significant result can be obtained even if data analysis is performed on the analysis target data.
A method for evaluating hyperparameters, which further comprises.

The hyperparameter evaluation method according to claim 1.
The second step is
Including the step of performing data cleansing on the received data to be analyzed.
The tenth step of acquiring the expected prediction accuracy after the computer performs a plurality of data cleansing on the data to be analyzed and selecting the data cleansing with the highest expected prediction accuracy.
A method for evaluating hyperparameters, which further comprises.

The hyperparameter evaluation method according to claim 1.
The first step includes a step of predicting the accuracy of each of the plurality of hyperparameters when applied to the analysis of the prior data and calculating the preliminary accuracy prediction result.
A step of generating an accuracy prediction unit that outputs the pre-accuracy prediction result for each hyperparameter and inputs a secondary feature amount, and
A step of generating a secondary feature amount generation unit that inputs the prior data and a plurality of hyperparameters and outputs the secondary feature amount input of the accuracy prediction unit, and
A method for evaluating hyperparameters, which comprises.

The hyperparameter evaluation method according to claim 4.
The eleventh step in which the computer outputs the hyperparameters and the generated model in the order of the calculated expected prediction accuracy.
A method for evaluating hyperparameters, which further comprises.

A computer that evaluates hyperparameters applicable to the data to be analyzed, including the processor and memory.
Expected prediction accuracy prediction unit including a secondary feature amount generation unit generated from preset preset data and a plurality of hyperparameters, and an accuracy prediction unit generated from preset preset data and a plurality of hyperparameters. Have and
The expected prediction accuracy prediction unit receives the analysis target data, and receives the analysis target data.
The secondary feature amount generation unit calculates the secondary feature amount by inputting the analysis target data and a plurality of hyperparameters.
A computer characterized in that the accuracy prediction unit calculates the expected prediction accuracy for each hyperparameter by inputting the secondary feature amount.

The calculator according to claim 10.
A computer characterized in that the secondary feature amount generation unit calculates the secondary feature amount by inputting the analysis target data and a plurality of hyperparameters as the primary feature amount.

The calculator according to claim 10.
A computer characterized in that the accuracy prediction unit calculates an expected prediction accuracy that is expected to be achievable for each hyperparameter by inputting the secondary feature amount.

The calculator according to claim 10.
The accuracy prediction unit outputs the calculated expected prediction accuracy for each hyperparameter.
It is characterized by further having a learning unit that accepts the selected hyperparameters among the output hyperparameters and generates a model for analyzing the analysis target data based on the selected hyperparameters. Computer to be.

The calculator according to claim 10.
The expected prediction accuracy prediction unit acquires the secondary feature amount and the expected prediction accuracy of a plurality of analysis target data, and analyzes the analysis target data whose expected prediction accuracy is equal to or less than the threshold, and the expected prediction accuracy exceeds the threshold. A computer characterized in that an item having a large difference is selected by comparing with the difference of each secondary feature amount of the target data.

A program for controlling a computer with a processor and memory.
The first step of generating a secondary feature amount generation unit and an accuracy prediction unit from preset preset data and a plurality of hyperparameters, and
The second step of accepting the data to be analyzed and
A third step of inputting the analysis target data and a plurality of hyperparameters to the secondary feature amount generation unit to calculate the secondary feature amount, and
The fourth step of inputting the secondary feature amount into the accuracy prediction unit and calculating the expected prediction accuracy for each hyperparameter, and
A program for causing the computer to execute.