WO2022003762A1

WO2022003762A1 - Question answering device, question answering method, and question answering program

Info

Publication number: WO2022003762A1
Application number: PCT/JP2020/025482
Authority: WO
Inventors: 淳史大塚; 京介西田; 光甫西田; 久子浅野; 準二富田
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: NTT Inc
Priority date: 2020-06-29
Filing date: 2020-06-29
Publication date: 2022-01-06
Anticipated expiration: 2022-12-29
Also published as: JP7704245B2; JP7468654B2; JPWO2022003762A1; JP2024071511A; JP2025123543A

Abstract

When outputting an answer to a question by machine reading, the present invention outputs the answer in a plurality of output formats. This question answering device comprises: a calculation unit which receives, as inputs, a question and a related document used to answer the question, and calculates information indicating the relevance between the question and the related document; a plurality of output units which receive, as an input thereof, the information indicating the relevance as calculated by the calculation unit, and output answers to the question in different output formats; and a selection unit which selects a predetermined number of answers from among the answers output by the plurality of output units in the respective output formats thereof.

Description

Question answering device, question answering method and question answering program

This disclosure relates to a question answering device, a question answering method, and a question answering program.

Machine reading comprehension is known as a question answering technology that automatically responds to questions entered by a user in natural language. Machine reading comprehension takes as input a user's question and a related document written in natural language (referred to as a "passage"), and outputs a response to the question based on information extracted from the passage.

A response to a question produced by machine reading comprehension can take various output formats, for example:
- a format that outputs an answer sentence produced by sentence generation, based on information extracted from the passage;
- a format that outputs a label such as YES/NO, based on information extracted from the passage;
- a format that outputs a question generated from information extracted from the passage (a question for narrowing down the answer).

Kyosuke Nishida, Itsumi Saito, Kosuke Nishida, Kazutoshi Shinoda, Atsushi Otsuka, Hisako Asano, and Junji Tomita, "Multi-style generative reading comprehension", In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL2019), pp. 2273-2284, 2019.
Kosuke Nishida, Kyosuke Nishida, Masaaki Nagata, Itsumi Saito, Atushi Otuka, Hisako Asano, and Junji Tomita, "Answering while Summarizing: Multi-task Learning for Multi-hop QA with Evidence Extraction", In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL2019), pp. 2273-2284, 2019.
Atsushi Otsuka, Kyosuke Nishida, Itsumi Saito, Hisako Asano, Junji Tomita, and Tetsuji Sato, "Proposal of a question answering method by machine reading comprehension focusing on clarification of question intention", Transactions of the Japanese Society for Artificial Intelligence, vol. 34, no. 5 A-J14, pp. 1-12, 2019.

To realize machine reading comprehension that can output in any of the above output formats, one conceivable approach is to combine machine reading comprehension models, one for each output format.

However, if as many machine reading comprehension models as there are output formats are loaded into memory and executed, a large amount of the device's resources is consumed, which makes this approach impractical on devices with limited resources.

An object of the present disclosure is to provide a question answering device, a question answering method, and a question answering program capable of outputting in a plurality of output formats when outputting a response to a question by machine reading comprehension.

According to one aspect of the present disclosure, a question answering device includes:
a calculation unit that takes as input a question and a related document used in responding to the question, and calculates information indicating the relevance between the question and the related document;
a plurality of output units that each take as input the information indicating the relevance calculated by the calculation unit, and output responses to the question in mutually different output formats; and
a selection unit that selects a predetermined number of responses from among the responses output by the plurality of output units in their respective output formats.

According to the present disclosure, it is possible to provide a question answering device, a question answering method, and a question answering program capable of outputting in a plurality of output formats when outputting a response to a question by machine reading comprehension.

FIG. 1 is a diagram showing an example of the hardware configuration of the question answering device.
FIG. 2 is a diagram showing the functional configuration of the question answering unit of a comparative example.
FIG. 3 is a diagram showing an example of the functional configuration of the question answering unit of the question answering device according to the first embodiment.
FIG. 4 is a first diagram showing an operation example in the learning phase of the question answering device according to the first embodiment.
FIG. 5 is a second diagram showing an operation example in the learning phase of the question answering device according to the first embodiment.
FIG. 6 is a third diagram showing an operation example in the learning phase of the question answering device according to the first embodiment.
FIG. 7 is a flowchart showing the flow of question answering processing by the question answering device according to the first embodiment.
FIG. 8 is a diagram showing an example of the functional configuration of the question answering device according to the second embodiment.
FIG. 9 is a first diagram showing an operation example in the learning phase of the question answering device according to the second embodiment.
FIG. 10 is a second diagram showing an operation example in the learning phase of the question answering device according to the second embodiment.
FIG. 11 is a third diagram showing an operation example in the learning phase of the question answering device according to the second embodiment.
FIG. 12 is a flowchart showing the flow of question answering processing by the question answering device according to the second embodiment.

Each embodiment is described below with reference to the accompanying drawings. In this specification and the drawings, components having substantially the same functional configuration are given the same reference numerals, and duplicate descriptions are omitted.

[First Embodiment]
<Hardware configuration of the question answering device>
First, the hardware configuration of the question answering device according to the first embodiment will be described. FIG. 1 is a diagram showing an example of the hardware configuration of the question answering device.

As shown in FIG. 1, the question answering device 100 includes a processor 101, a memory 102, an auxiliary storage device 103, an I/F (Interface) device 104, a communication device 105, and a drive device 106. These hardware components of the question answering device 100 are connected to one another via a bus 107.

The processor 101 includes various arithmetic devices such as a CPU (Central Processing Unit) and a GPU (Graphics Processing Unit). The processor 101 reads various programs (not shown) installed in the auxiliary storage device 103 into the memory 102 and executes them.

The memory 102 includes main storage devices such as a ROM (Read Only Memory) and a RAM (Random Access Memory). The processor 101 and the memory 102 form a so-called computer, and the computer realizes various functions when the processor 101 executes the programs read into the memory 102.

For example, in the present embodiment, the computer formed by the processor 101 and the memory 102 realizes the question answering unit 110 when the processor 101 executes the question answering program read into the memory 102. As described later, the question answering unit 110 is configured to keep down the consumption of computer resources in the question answering device 100.

The auxiliary storage device 103 stores various programs and the various data used when those programs are executed by the processor 101. For example, in the present embodiment, the auxiliary storage device 103 includes a learning data set storage unit 120 and a passage storage unit 130, which store various data (learning data sets and passages (related documents written in natural language)).

The I/F device 104 is a connection device that connects the input device 140 and the output device 141 to the question answering device 100. The I/F device 104 receives questions addressed to the question answering device 100 via the input device 140, and outputs the responses that the question answering device 100 generates for those questions via the output device 141. The input device 140 here includes, for example, a device that converts an input question into voice data and a device that converts it into text data. Likewise, the output device 141 includes, for example, a device that outputs a response as voice data and a device that outputs a response as text data.

The communication device 105 is a communication device for communicating with other devices via a network.

The drive device 106 is a device into which a recording medium 142 is set. The recording medium 142 here includes media that record information optically, electrically, or magnetically, such as a CD-ROM, a flexible disk, or a magneto-optical disk. The recording medium 142 may also include semiconductor memory that records information electrically, such as a ROM or a flash memory.

The various programs installed in the auxiliary storage device 103 are installed, for example, by setting a distributed recording medium 142 in the drive device 106 and having the drive device 106 read the programs recorded on that medium. Alternatively, the programs may be installed by being downloaded from a network via the communication device 105.

Similarly, the various data stored in the storage units of the auxiliary storage device 103 are stored, for example, by setting a distributed recording medium 142 in the drive device 106 and having the drive device 106 read the data recorded on that medium. Alternatively, the data may be stored by being downloaded from a network via the communication device 105.

<Functional configuration of the question answering unit>
Next, the functional configuration of the question answering unit 110 will be described in detail. To make the characteristics of this configuration clear, the description begins with a comparative example: a question answering unit built by combining as many machine reading comprehension models as there are output formats.

(1) Functional configuration of the question answering unit of the comparative example
FIG. 2 is a diagram showing the functional configuration of the question answering unit of the comparative example. So that a response to a question produced by machine reading comprehension can be output in a plurality of output formats, the question answering unit 200 of the comparative example includes:
- an input unit 210;
- a plurality of machine reading comprehension models (in the example of FIG. 2, a first machine reading comprehension model 221 to a third machine reading comprehension model 223); and
- a selection unit 230.

The input unit 210 inputs the input question and a passage (a related document written in natural language) to each of the plurality of machine reading comprehension models.

The first machine reading comprehension model 221 outputs, as a first response, an answer sentence produced by sentence generation based on information extracted from the passage.

The second machine reading comprehension model 222 outputs, as a second response, a label such as YES/NO generated based on information extracted from the passage.

The third machine reading comprehension model 223 outputs, as a third response, a question generated based on information extracted from the passage (a question generated to narrow down the answer, referred to as a "revised question").

The selection unit 230 selects and outputs a predetermined number of responses from among the responses output by the first machine reading comprehension model 221 to the third machine reading comprehension model 223.

As the question answering unit 200 of the comparative example shows, combining as many machine reading comprehension models as there are output formats makes it possible to output responses to a question in a plurality of output formats. On the other hand, the question answering unit 200 of the comparative example has the following problems.
- Because as many machine reading comprehension models as there are output formats are loaded into memory and executed, a large amount of the computer resources in the question answering device 100 is consumed.
- When a predetermined number of responses is selected from among the responses output by the first machine reading comprehension model 221 to the third machine reading comprehension model 223, an appropriate response cannot be selected. This is because simply comparing the first to third responses does not establish which response is better; some selection index must be calculated.

In contrast, the question answering unit 110 of the question answering device 100 according to the first embodiment has a configuration that solves these problems, as described in detail below.

(2) Functional configuration of the question answering unit
FIG. 3 is a diagram showing an example of the functional configuration of the question answering unit of the question answering device according to the first embodiment. So that responses to a question produced by machine reading comprehension can be output in a plurality of output formats while keeping down the consumption of computer resources, and so that an appropriate response can be selected from among the responses in the plurality of output formats, the question answering unit 110 includes:
- an input unit 310;
- an understanding layer 320, which functions as an input layer;
- a first output layer 321, a second output layer 322, and a third output layer 323, which function as output layers;
- an output determination layer 324, which functions as an output layer; and
- a selection unit 330.

Of these, the input unit 310 inputs the input question and a passage (a related document written in natural language) to the understanding layer 320.

The understanding layer 320 is an example of the calculation unit. It takes the question and the passage as input, calculates information indicating the relevance between the question and the passage in terms of deep learning vector representations, and outputs a state vector or a state tensor. Any structure may be adopted for the understanding layer 320 as long as it can take a question and a passage as input and calculate information indicating the relevance between them.

For example, the understanding layer 320 can adopt BiDAF, which uses an RNN (see Non-Patent Document 1), or Transformer-based BERT (see Non-Patent Document 2). However, the understanding layer 320 must be structured so that its output matches the format of the state vector or state tensor expected as input by the output layers downstream of it (the first output layer 321 to the third output layer 323 and the output determination layer 324).

The first output layer 321 to the third output layer 323 are examples of the output units. Each takes as input the information indicating the relevance between the question and the passage (the state vector or state tensor) output from the understanding layer 320, and outputs a response (the output result of machine reading comprehension).

In the case of FIG. 3, the first output layer 321 outputs an answer sentence produced by sentence generation as the first response. The second output layer 322 outputs a label such as YES/NO as the second response. The third output layer 323 outputs a revised question for narrowing down the answer to the input question as the third response.

The first output layer 321 to the third output layer 323 may have any deep learning structure for outputting the first to third responses. The output formats of the first output layer 321 to the third output layer 323 are not limited to answer sentences, labels, and revised questions; responses in other output formats may also be output.

Although the example of FIG. 3 provides first to third output layers, the number of output layers is arbitrary, and a plurality of output layers having the same structure may be provided. For example, two sentence-generation decoder structures may be provided, with one output layer trained to generate answer sentences by sentence generation and the other trained to generate revised questions by sentence generation.

The output determination layer 324 is an example of the index calculation unit, and calculates a probability distribution over the responses output from the first output layer 321 to the third output layer 323.

Specifically, the output determination layer 324 has a softmax layer whose number of dimensions corresponds to the number N of output formats (N = 3 in the example of FIG. 3). The N-dimensional softmax layer receives the state vector or state tensor from the understanding layer 320 and calculates the probability distribution over the responses.
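The N-dimensional softmax described above can be sketched as follows. This is a minimal illustration, not the disclosed implementation: the linear projection from the state vector to N logits, the names `format_distribution`, `weights`, and `biases`, and all numeric values are assumptions introduced only for this example.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def format_distribution(state, weights, biases):
    """Project a state vector to N logits (assumed linear layer), then apply softmax."""
    logits = [
        sum(w * s for w, s in zip(row, state)) + b
        for row, b in zip(weights, biases)
    ]
    return softmax(logits)


# Hypothetical 2-dimensional state vector and N = 3 output formats
# (answer sentence, label, revised question).
state = [1.0, -0.5]
weights = [[0.8, 0.1], [0.2, 0.4], [-0.3, 0.9]]
biases = [0.0, 0.0, 0.0]
probs = format_distribution(state, weights, biases)
```

Each entry of `probs` corresponds to one output layer, and the entries sum to 1, which is what allows them to serve as a common selection index across output formats.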

This configuration is effective when the number N of output formats is fixed. If an output format has to be added, however, the number of dimensions of the softmax layer no longer matches, so the entire question answering unit 110 must be retrained.

From among the responses output by the first output layer 321 to the third output layer 323, the selection unit 330 selects the M responses with the highest probabilities in the distribution calculated by the output determination layer 324 (M is a predetermined number; M = 1 in the example of FIG. 3) and outputs them to the user via the output device 141.
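The top-M selection performed by the selection unit can be sketched as below. The function name `select_top_m` and the sample responses and probabilities are hypothetical; the sketch only assumes that each response is paired with the probability assigned to its output layer.

```python
def select_top_m(responses, probs, m=1):
    """Return the m responses whose output layers received the highest probability."""
    if len(responses) != len(probs):
        raise ValueError("one probability is required per response")
    ranked = sorted(range(len(responses)), key=lambda i: probs[i], reverse=True)
    return [responses[i] for i in ranked[:m]]


# Hypothetical responses from three output layers and their probabilities.
responses = ["Plan A covers X.", "YES", "Which plan do you mean?"]
probs = [0.2, 0.7, 0.1]
best = select_top_m(responses, probs, m=1)
top_two = select_top_m(responses, probs, m=2)
```

With M = 1, as in the example of FIG. 3, only the response from the most probable output layer is returned.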

In this way, the question answering unit 110 is configured such that:
- a shared understanding layer is provided for the plurality of output formats (the input layer is shared);
- the input layer (understanding layer) is separated from the output layers (the first output layer 321 to the third output layer 323 and the output determination layer 324), and each layer is modularized; and
- the selection unit 330 selects the response to be finally output, using as a selection index the probability distribution over the responses calculated by the output determination layer 324 provided among the output layers.
As a result, when outputting a response to a question by machine reading comprehension, the question answering unit 110 can output in a plurality of output formats and, in addition, can:
- keep down the consumption of computer resources by sharing the input layer; and
- select an appropriate response by calculating a selection index.
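The overall structure summarized above (one shared understanding layer, several output layers, an output determination layer, and a selection step) might be wired together as in the following sketch. The class name `SharedEncoderQA` and the toy stand-in functions are hypothetical; in a real system each callable would be a trained neural network module. The point of the example is that the understanding layer runs once per question, regardless of how many output formats exist.

```python
class SharedEncoderQA:
    """Sketch of a question answering unit with a shared understanding layer."""

    def __init__(self, encoder, heads, determiner, m=1):
        self.encoder = encoder        # understanding layer, shared by all heads
        self.heads = heads            # output layers, one per output format
        self.determiner = determiner  # output determination layer
        self.m = m                    # number of responses to select

    def answer(self, question, passage):
        state = self.encoder(question, passage)  # computed once
        responses = [head(state) for head in self.heads]
        probs = self.determiner(state)
        ranked = sorted(range(len(responses)), key=lambda i: probs[i], reverse=True)
        return [responses[i] for i in ranked[: self.m]]


# Toy stand-ins (assumptions for illustration only).
calls = {"encoder": 0}


def toy_encoder(question, passage):
    calls["encoder"] += 1
    return {"question": question, "passage": passage}


toy_heads = [
    lambda state: "Plan A covers X.",         # answer sentence
    lambda state: "YES",                      # label
    lambda state: "Which plan do you mean?",  # revised question
]


def toy_determiner(state):
    return [0.1, 0.7, 0.2]


qa = SharedEncoderQA(toy_encoder, toy_heads, toy_determiner, m=1)
result = qa.answer("Is X covered?", "X is covered under plan A.")
```

In the comparative example, by contrast, each of the three models would encode the question and passage separately.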

<Learning method of the question answering unit>
Next, the learning method of the question answering unit 110 will be described. In the learning phase, in which learning processing is performed on the question answering unit 110, the question answering unit 110 of the question answering device 100 according to the present embodiment first provides a comparison/change unit 410 in place of the selection unit 330. The question answering unit 110 then performs learning processing on each of the layers provided in it (the understanding layer 320, the first output layer 321 to the third output layer 323, and the output determination layer 324).

For this purpose, the question answering unit 110 of the question answering device 100 uses the learning data sets that are used when performing learning processing on the individual machine reading comprehension models (the first machine reading comprehension model 221 to the third machine reading comprehension model 223).

Specifically, the question answering unit 110 of the question answering device 100 first sets, for a learning data set, a flag indicating which output layer's learning processing the data set is to be used for.

Next, the question answering unit 110 stores correct answer data for the output determination layer 324 in the learning data set. For example, it stores, as the correct answer data for the output determination layer 324, vector data in which the value of the dimension corresponding to the set flag is "1" and the values of the other dimensions are "0".
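Constructing this correct answer vector from the flag can be sketched as follows. The function name `determination_target` and the zero-based `flag_index` convention are assumptions made for this example.

```python
def determination_target(flag_index, num_formats):
    """Build the one-hot vector: 1 at the flagged output layer, 0 elsewhere."""
    if not 0 <= flag_index < num_formats:
        raise ValueError("flag_index must identify one of the output layers")
    return [1 if i == flag_index else 0 for i in range(num_formats)]


# A data set flagged for the first output layer, with N = 3 output formats.
target_first = determination_target(0, 3)
# A data set flagged for the second output layer.
target_second = determination_target(1, 3)
```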

Next, the question answering unit 110 calculates a learning loss between the response output from the output layer corresponding to the set flag and the corresponding correct answer data, and updates the parameters of that output layer and of the understanding layer based on the calculated learning loss. In doing so, the question answering unit 110 ignores the responses output from the output layers other than the one corresponding to the set flag.

The question answering unit 110 also calculates a learning loss between the vector data output from the output determination layer and the corresponding correct answer data, and updates the parameters of the output determination layer based on the calculated learning loss.
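The two learning losses described above can be sketched as below. The source does not specify the loss functions, so cross-entropy for the output determination layer and the toy `exact_match_loss` for the flagged output layer are assumptions, as are the names `step_losses` and `flag_index`. The sketch only computes the losses; the parameter updates they would drive are noted in comments.

```python
import math


def cross_entropy(probs, one_hot):
    """Cross-entropy between a predicted distribution and a one-hot target."""
    return -sum(t * math.log(max(p, 1e-12)) for p, t in zip(probs, one_hot))


def step_losses(responses, reference, determination_probs, flag_index, response_loss):
    # Only the flagged output layer is compared with its correct answer data;
    # responses from the other output layers are ignored.
    # This loss would update the flagged output layer and the understanding layer.
    layer_loss = response_loss(responses[flag_index], reference)
    # This loss would update the output determination layer.
    target = [1 if i == flag_index else 0 for i in range(len(determination_probs))]
    determination_loss = cross_entropy(determination_probs, target)
    return layer_loss, determination_loss


def exact_match_loss(response, reference):
    """Toy stand-in for a sequence loss: 0 on exact match, 1 otherwise."""
    return 0.0 if response == reference else 1.0


layer_loss, determination_loss = step_losses(
    responses=["Plan A covers X.", "NO", "Which plan do you mean?"],
    reference="Plan A covers X.",
    determination_probs=[0.7, 0.2, 0.1],
    flag_index=0,
    response_loss=exact_match_loss,
)
```

Because the unflagged responses never enter a loss, a data set prepared for one output format can be reused without correct answer data for the other formats.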

FIG. 4 is a first diagram showing an operation example in the learning phase of the question answering device according to the first embodiment. In the case of FIG. 4, it is assumed that the question answering unit 110 has set a flag indicating that the learning data set 400 is "a learning data set used for the learning processing of the first output layer 321".

As shown in FIG. 4, the learning data set 400 includes "input data", "correct answer data of the first output layer", and "correct answer data of the output determination layer" as information items.
- "Input data" stores a question and a passage.
- "Correct answer data of the first output layer" stores the correct answer data for the answer sentence to be produced by sentence generation based on information extracted from the corresponding passage.
- "Correct answer data of the output determination layer" contains a "first dimension" to a "third dimension", and stores vector data in which the value of the first dimension is "1" and the values of the second and third dimensions are "0".
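One record of such a learning data set could be represented as in the following sketch. The class name `LearningExample`, its field names, and the sample question, passage, and answer are hypothetical; only the three information items and the one-hot vector come from the description of FIG. 4.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class LearningExample:
    question: str                    # input data: the question
    passage: str                     # input data: the passage
    target_layer: int                # flag: index of the output layer being trained
    reference: str                   # correct answer data for that output layer
    determination_target: List[int]  # correct answer data for the output determination layer


# Hypothetical record for a data set flagged for the first output layer.
example_400 = LearningExample(
    question="Is X covered?",
    passage="X is covered under plan A.",
    target_layer=0,
    reference="Yes, X is covered under plan A.",
    determination_target=[1, 0, 0],
)
```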

In FIG. 4, when the input unit 310 inputs the input data (a pair of a question and a passage) of the learning data set 400 to the understanding layer 320, the first output layer 321 to the third output layer 323 output the first response to the third response as machine reading comprehension results. In addition, the output determination layer 324 outputs vector data of N dimensions (N = 3 in the example of FIG. 4).

The comparison/change unit 410 calculates a learning loss between the first response (answer sentence) output from the first output layer 321 and the answer sentence stored in the "correct answer data of the first output layer" of the learning data set 400. The comparison/change unit 410 then updates the parameters of the first output layer 321 and the understanding layer 320 based on the calculated learning loss.

Similarly, the comparison/change unit 410 calculates a learning loss between the N-dimensional vector data (N = 3 in the example of FIG. 4) output from the output determination layer 324 and the first-dimension to third-dimension vector data stored in the "correct answer data of the output determination layer" of the learning data set 400. The comparison/change unit 410 then updates the parameters of the output determination layer 324 based on the calculated learning loss.

Meanwhile, FIG. 5 is a second diagram showing an operation example in the learning phase of the question answering device according to the first embodiment. In the case of FIG. 5, it is assumed that the question answering unit 110 has set a flag indicating that the learning data set 500 is "the learning data set used for the learning process for the second output layer 322".

As shown in FIG. 5, the learning data set 500 includes "input data", "correct answer data of the second output layer", and "correct answer data of the output determination layer" as information items.
- The "input data" stores a question and a passage.
- The "correct answer data of the second output layer" stores the correct answer data of a label, such as YES/NO, to be generated based on information extracted from the corresponding passage.
- The "correct answer data of the output determination layer" contains a "first dimension" to a "third dimension" and stores vector data in which the value of the second dimension is "1" and the values of the first and third dimensions are "0".

In FIG. 5, when the input unit 310 inputs the input data (a pair of a question and a passage) of the learning data set 500 to the understanding layer 320, the first output layer 321 to the third output layer 323 output the first response to the third response as machine reading comprehension results. In addition, the output determination layer 324 outputs vector data of N dimensions (N = 3 in the example of FIG. 5).

The comparison/change unit 410 calculates a learning loss between the second response (label) output from the second output layer 322 and the label stored in the "correct answer data of the second output layer" of the learning data set 500. The comparison/change unit 410 then updates the parameters of the second output layer 322 and the understanding layer 320 based on the calculated learning loss.

Similarly, the comparison/change unit 410 calculates a learning loss between the N-dimensional vector data (N = 3 in the example of FIG. 5) output from the output determination layer 324 and the first-dimension to third-dimension vector data stored in the "correct answer data of the output determination layer" of the learning data set 500. The comparison/change unit 410 then updates the parameters of the output determination layer 324 based on the calculated learning loss.

Meanwhile, FIG. 6 is a third diagram showing an operation example in the learning phase of the question answering device according to the first embodiment. In the case of FIG. 6, it is assumed that the question answering unit 110 has set a flag indicating that the learning data set 600 is "the learning data set used for the learning process for the third output layer 323".

As shown in FIG. 6, the learning data set 600 includes "input data", "correct answer data of the third output layer", and "correct answer data of the output determination layer" as information items.
- The "input data" stores a question and a passage.
- The "correct answer data of the third output layer" stores the correct answer data of the revised question to be generated based on information extracted from the corresponding passage.
- The "correct answer data of the output determination layer" contains a "first dimension" to a "third dimension" and stores vector data in which the value of the third dimension is "1" and the values of the first and second dimensions are "0".

In FIG. 6, when the input unit 310 inputs the input data (a pair of a question and a passage) of the learning data set 600 to the understanding layer 320, the first output layer 321 to the third output layer 323 output the first response to the third response as machine reading comprehension results. In addition, the output determination layer 324 outputs vector data of N dimensions (N = 3 in the example of FIG. 6).

The comparison/change unit 410 calculates a learning loss between the third response (revised question) output from the third output layer 323 and the revised question stored in the "correct answer data of the third output layer" of the learning data set 600. The comparison/change unit 410 then updates the parameters of the third output layer 323 and the understanding layer 320 based on the calculated learning loss.

Similarly, the comparison/change unit 410 calculates a learning loss between the N-dimensional vector data (N = 3 in the example of FIG. 6) output from the output determination layer 324 and the first-dimension to third-dimension vector data stored in the "correct answer data of the output determination layer" of the learning data set 600. The comparison/change unit 410 then updates the parameters of the output determination layer 324 based on the calculated learning loss.

In this way, the question answering unit 110 of the question answering device 100 uses the learning data sets 400 to 600 to perform the learning process sequentially on the understanding layer 320, the first output layer 321 to the third output layer 323, and the output determination layer 324.
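The sequence of FIGS. 4 to 6 can be summarized by listing which layers have their parameters updated for each learning data set. The sketch below is only a bookkeeping illustration; the layer names are hypothetical labels built from the reference numerals in the text, and the 1-based flag values are an assumed encoding.

```python
# Layers that are trained regardless of the flag (hypothetical labels).
SHARED = ("understanding_layer_320", "output_determination_layer_324")
# Output layer selected by each flag value (assumed 1-based flags).
HEADS = {1: "output_layer_321", 2: "output_layer_322", 3: "output_layer_323"}


def layers_updated(flag):
    """Layers whose parameters change when training on a data set with this flag."""
    return {HEADS[flag], *SHARED}


# (learning data set reference numeral, flag), in the order given in the text.
SCHEDULE = [(400, 1), (500, 2), (600, 3)]

for dataset, flag in SCHEDULE:
    print(dataset, sorted(layers_updated(flag)))
```

Each pass touches the shared layers plus exactly one output layer, which is why the outputs of the other two output layers can be ignored for that data set.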

<Flow of question answering process>
Next, the flow of the question answering process performed by the question answering device 100 will be described. FIG. 7 is a flowchart showing the flow of the question answering process performed by the question answering device according to the first embodiment. Of the steps shown, steps S701 to S703 represent processing in the learning phase, and steps S704 to S707 represent processing in the response phase.

In step S701, the question answering unit 110 performs the learning process on the understanding layer 320, the first output layer 321, and the output determination layer 324 using the learning data set 400.

In step S702, the question answering unit 110 performs the learning process on the understanding layer 320, the second output layer 322, and the output determination layer 324 using the learning data set 500.

In step S703, the question answering unit 110 performs the learning process on the understanding layer 320, the third output layer 323, and the output determination layer 324 using the learning data set 600.

In step S704, the input unit 310 of the question answering unit 110 accepts a question and a passage and inputs them to the understanding layer 320.

In step S705, the first output layer 321 to the third output layer 323 take the state vector output from the understanding layer 320 as input and output the first response to the third response.

In step S706, the output determination layer 324 of the question answering unit 110 takes the state vector output from the understanding layer 320 as input and outputs a selection index by calculating a probability distribution over the first response to the third response.

In step S707, the selection unit 330 of the question answering unit 110 selects the top M responses, where M is predetermined, based on the selection index output from the output determination layer 324, and outputs the selected responses.
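Steps S706 and S707 can be illustrated with the sketch below. It assumes that the probability distribution is obtained by applying a softmax to three raw values from the output determination layer; the softmax, the function names, and the sample values are assumptions made for illustration, not details stated in this passage.

```python
import math


def softmax(logits):
    """Convert raw values into a probability distribution (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def select_top_m(responses, logits, m):
    """Return the m responses with the highest probability, best first."""
    probs = softmax(logits)
    ranked = sorted(range(len(responses)), key=lambda i: probs[i], reverse=True)
    return [(responses[i], probs[i]) for i in ranked[:m]]


# Hypothetical first to third responses and determination-layer outputs.
responses = ["answer sentence", "YES", "revised question"]
for response, prob in select_top_m(responses, [2.0, 0.5, 1.0], m=2):
    print(response, round(prob, 3))
```

Setting `m=1` returns only the single most probable output format, while a larger `m` lets the device present several formats at once.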

<Summary>
As is clear from the above description, the question answering device 100 according to the first embodiment:
- has an understanding layer that takes a question and a passage as input and calculates information indicating the relevance between the question and the passage;
- has a first output layer to a third output layer that each take the information indicating the relevance calculated by the understanding layer as input and output a first response to a third response in mutually different output formats; and
- has an output determination layer that calculates a probability distribution over the first response to the third response based on the information indicating the relevance output by the understanding layer, and further has a selection unit that selects a predetermined number of responses using that probability distribution as a selection index.

As a result, the question answering device 100 according to the first embodiment can output responses to a question in a plurality of output formats while suppressing the consumption of computer resources and selecting an appropriate response.

That is, according to the first embodiment, it is possible to provide a highly feasible question answering device, question answering method, and question answering program that can output responses in a plurality of output formats when a response to a question is output by machine reading comprehension.

[Second Embodiment]
In the first embodiment described above, the question answering unit was configured on the premise that the number of output formats is fixed, so that adding a new output format requires the relearning process to be performed on the entire question answering unit 110.

In contrast, in the second embodiment, the question answering unit is configured so that the relearning process need not be performed on the entire question answering unit 110 even when a new output format is added. The second embodiment is described below, focusing on the differences from the first embodiment.

<Functional configuration of question answering unit>
First, the functional configuration of the question answering unit of the question answering device according to the second embodiment will be described. FIG. 8 is a diagram showing an example of the functional configuration of the question answering unit of the question answering device according to the second embodiment.

The difference from the functional configuration shown in FIG. 3 is that the question answering unit 800 of FIG. 8 is provided with a first output determination layer 801 to a third output determination layer 803, which calculate individual scores as selection indices for the first output layer 321 to the third output layer 323, respectively. In addition, in the question answering unit 800 of FIG. 8, the function of the selection unit 810 differs from that of the selection unit 330 of FIG. 3.

The first output determination layer 801 is an example of an index calculation unit. It has a logit layer that receives the state vector of the first output layer 321 and calculates a scalar value from 0 to 1.0 as the first score.

Similarly, the second output determination layer 802 is an example of an index calculation unit. It has a logit layer that receives the state vector of the second output layer 322 and calculates a scalar value from 0 to 1.0 as the second score.

Similarly, the third output determination layer 803 is an example of an index calculation unit. It has a logit layer that receives the state vector of the third output layer 323 and calculates a scalar value from 0 to 1.0 as the third score.

The selection unit 810 selects and outputs the responses corresponding to the top M scores, where M is predetermined, based on the first score to the third score calculated by the first output determination layer 801 to the third output determination layer 803.
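The per-layer scoring and top-M selection of the second embodiment can be sketched as follows. The sketch assumes that each output determination layer maps a raw value to the range 0 to 1.0 with a sigmoid; the sigmoid, the function names, and the sample values are assumptions, since the text states only that a logit layer yields a scalar from 0 to 1.0.

```python
import math


def sigmoid(x):
    """Map a raw value to a score in the range 0 to 1.0."""
    return 1.0 / (1.0 + math.exp(-x))


def select_by_scores(responses, logits, m):
    """Score each response independently and return the m best, highest first."""
    scores = [sigmoid(z) for z in logits]
    ranked = sorted(range(len(responses)), key=lambda i: scores[i], reverse=True)
    return [(responses[i], scores[i]) for i in ranked[:m]]


# Hypothetical first to third responses and one raw value per determination layer.
responses = ["answer sentence", "YES", "revised question"]
for response, score in select_by_scores(responses, [1.2, -0.3, 2.5], m=2):
    print(response, round(score, 3))
```

Unlike a shared probability distribution, these scores need not sum to 1, because each one is computed from its own output layer alone.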

In this way, the question answering device 100 according to the second embodiment is provided with the first output determination layer 801 to the third output determination layer 803, which calculate an individual score for each of the first output layer 321 to the third output layer 323. As a result, according to the second embodiment, even when a new output format is added, it suffices to perform the learning process on the newly added output layer and output determination layer and on the understanding layer. The output layers and output determination layers that have already been trained need not be relearned.
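One way to see why individual scores make it easier to add an output format is to compare them with a single shared distribution. The sketch below is an illustration under assumptions (softmax for the shared distribution, sigmoid for the individual scores, arbitrary sample values), not a description of the disclosed training procedure: it shows that appending a fourth raw value changes every entry of a softmax but leaves the existing sigmoid scores untouched.

```python
import math


def softmax(logits):
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


old_logits = [2.0, 0.5, 1.0]       # three existing output formats (sample values)
new_logits = old_logits + [1.5]    # a hypothetical fourth format is added

# Shared distribution: the first three probabilities all shrink.
shared_before = softmax(old_logits)
shared_after = softmax(new_logits)[:3]

# Individual scores: the first three scores are exactly as before.
independent_before = [sigmoid(z) for z in old_logits]
independent_after = [sigmoid(z) for z in new_logits][:3]

print(shared_before != shared_after)
print(independent_before == independent_after)
```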

<Learning method of question answering unit>
Next, the learning method of the question answering unit 800 will be described. In the learning phase in which the learning process is performed on the question answering unit 110, the question answering unit 800 of the question answering device 100 according to the present embodiment first installs a comparison/change unit 910 in place of the selection unit 810. The question answering unit 800 then performs the learning process on each layer installed in the question answering unit 800 (the understanding layer 320, the first output layer 321 to the third output layer 323, and the first output determination layer 801 to the third output determination layer 803).

In doing so, as in the first embodiment, the question answering device 100 uses the learning data sets that are used when performing the learning process on the individual machine reading comprehension models (the first machine reading comprehension model 221 to the third machine reading comprehension model 223).

Specifically, the question answering unit 800 of the question answering device 100 first sets a flag indicating which output layer the learning data set is to be used to train.

Next, the question answering unit 800 of the question answering device 100 stores, in the learning data set, correct answer data for the first score to the third score calculated by the first output determination layer 801 to the third output determination layer 803. For example, the question answering unit 800 stores, in the learning data set, correct answer data in which the score corresponding to the set flag is "1.0".

Next, the question answering unit 800 of the question answering device 100 calculates a learning loss between the response output from the output layer corresponding to the set flag and the corresponding correct answer data and, based on the calculated learning loss, updates the parameters of that output layer and of the understanding layer. At this time, the question answering unit 800 ignores the responses output from the output layers other than the one corresponding to the set flag.

The question answering unit 800 of the question answering device 100 also calculates a learning loss between the score output from the output determination layer corresponding to the set flag and the corresponding correct answer data and, based on the calculated learning loss, updates the parameters of that output determination layer. At this time, the question answering device 100 ignores the scores output from the output determination layers other than the one corresponding to the set flag.
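The score loss for the flagged output determination layer can be sketched as below. Binary cross-entropy is used here purely as an assumed loss, since the text does not name one; the function names, the 0-based flag, and the sample scores are likewise hypothetical. The target value of 1.0 follows the example given for the correct answer data.

```python
import math


def binary_cross_entropy(score, target, eps=1e-12):
    """Loss between a score in [0, 1] and a target in [0, 1]."""
    score = min(max(score, eps), 1.0 - eps)  # avoid log(0)
    return -(target * math.log(score) + (1.0 - target) * math.log(1.0 - score))


def flagged_score_loss(scores, flag, target=1.0):
    """Loss for the flagged determination layer only; other scores are ignored."""
    return binary_cross_entropy(scores[flag], target)


# Hypothetical first to third scores; the data set is flagged for the first layer.
loss = flagged_score_loss([0.5, 0.9, 0.1], flag=0)
print(loss)
```

Because only `scores[flag]` enters the loss, changing the second or third score leaves the result unchanged, which mirrors the statement that the other output determination layers are ignored.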

FIG. 9 is a first diagram showing an operation example in the learning phase of the question answering device according to the second embodiment. In the case of FIG. 9, it is assumed that the question answering unit 800 has set a flag indicating that the learning data set 900 is "the learning data set used for the learning process for the first output layer 321 and the first output determination layer 801".

As shown in FIG. 9, the learning data set 900 includes "input data", "correct answer data of the first output layer", and "correct answer data of the output determination layer" as information items.
- The "input data" stores a question and a passage.
- The "correct answer data of the first output layer" stores the correct answer data of the answer sentence to be produced by sentence generation based on information extracted from the corresponding passage.
- The "correct answer data of the output determination layer" stores the correct answer data of the score (first score) output from the first output determination layer 801.

In FIG. 9, when the input unit 310 inputs the input data (a pair of a question and a passage) of the learning data set 900 to the understanding layer 320, the first output layer 321 to the third output layer 323 output the first response to the third response as machine reading comprehension results. In addition, the first output determination layer 801 to the third output determination layer 803 output the first score to the third score.

The comparison/change unit 910 calculates a learning loss between the first response (answer sentence) output from the first output layer 321 and the answer sentence stored in the "correct answer data of the first output layer" of the learning data set 900. The comparison/change unit 910 then updates the parameters of the first output layer 321 and the understanding layer 320 based on the calculated learning loss.

Similarly, the comparison/change unit 910 calculates a learning loss between the first score output from the first output determination layer 801 and the value stored as the first score in the "correct answer data of the output determination layer" of the learning data set 900. The comparison/change unit 910 then updates the parameters of the first output determination layer 801 based on the calculated learning loss.

Meanwhile, FIG. 10 is a second diagram showing an operation example in the learning phase of the question answering device according to the second embodiment. In the case of FIG. 10, it is assumed that the question answering unit 800 has set a flag indicating that the learning data set 1000 is "the learning data set used for the learning process for the second output layer 322 and the second output determination layer 802".

As shown in FIG. 10, the learning data set 1000 includes "input data", "correct answer data of the second output layer", and "correct answer data of the output determination layer" as information items.
- The "input data" stores a question and a passage.
- The "correct answer data of the second output layer" stores the correct answer data of a label, such as YES/NO, to be generated based on information extracted from the corresponding passage.
- The "correct answer data of the output determination layer" stores the correct answer data of the score (second score) output from the second output determination layer 802.

In FIG. 10, when the input unit 310 inputs the input data (a pair of a question and a passage) of the learning data set 1000 to the understanding layer 320, the first output layer 321 to the third output layer 323 output the first response to the third response as machine reading comprehension results. In addition, the first output determination layer 801 to the third output determination layer 803 output the first score to the third score.

The comparison/change unit 910 calculates a learning loss between the second response (label) output from the second output layer 322 and the label stored in the "correct answer data of the second output layer" of the learning data set 1000. The comparison/change unit 910 then updates the parameters of the second output layer 322 and the understanding layer 320 based on the calculated learning loss.

Similarly, the comparison/change unit 910 calculates a learning loss between the second score output from the second output determination layer 802 and the value stored as the second score in the "correct answer data of the output determination layer" of the learning data set 1000. The comparison/change unit 910 then updates the parameters of the second output determination layer 802 based on the calculated learning loss.

Meanwhile, FIG. 11 is a third diagram showing an operation example in the learning phase of the question answering device according to the second embodiment. In the case of FIG. 11, it is assumed that the question answering unit 800 has set a flag indicating that the learning data set 1100 is "the learning data set used for the learning process for the third output layer 323 and the third output determination layer 803".

As shown in FIG. 11, the learning data set 1100 includes "input data", "correct answer data of the third output layer", and "correct answer data of the output determination layer" as information items.
- The "input data" stores a question and a passage.
- The "correct answer data of the third output layer" stores the correct answer data of the revised question to be generated based on information extracted from the corresponding passage.
- The "correct answer data of the output determination layer" stores the correct answer data of the score (third score) output from the third output determination layer 803.

In FIG. 11, when the input unit 310 inputs the input data (a pair of a question and a passage) of the learning data set 1100 to the understanding layer 320, the first output layer 321 to the third output layer 323 output the first response to the third response as machine reading comprehension results. In addition, the first output determination layer 801 to the third output determination layer 803 output the first score to the third score.

The comparison/change unit 910 calculates a learning loss between the third response (revised question) output from the third output layer 323 and the revised question stored in the "correct answer data of the third output layer" of the learning data set 1100. The comparison/change unit 910 then updates the parameters of the third output layer 323 and the understanding layer 320 based on the calculated learning loss.

Similarly, the comparison/change unit 910 calculates a learning loss between the third score output from the third output determination layer 803 and the value stored as the third score in the "correct answer data of the output determination layer" of the learning data set 1100. The comparison/change unit 910 then updates the parameters of the third output determination layer 803 based on the calculated learning loss.

<Flow of question answering process>
Next, the flow of the question answering process performed by the question answering device 100 according to the second embodiment will be described. FIG. 12 is a flowchart showing the flow of the question answering process performed by the question answering device according to the second embodiment. The differences from the flowchart described with reference to FIG. 7 in the first embodiment are steps S1201 to S1203.

In step S1201, the question answering unit 110 performs the learning process on the understanding layer 320, the first output layer 321, and the first output determination layer 801 using the learning data set 900.

　ステップＳ１２０２において、質問応答部１１０は、学習用データセット１０００を用いて、理解層３２０、第２の出力層３２２、第２の出力判断層８０２について学習処理を行う。 In step S1202, the question answering unit 110 performs learning processing on the understanding layer 320, the second output layer 322, and the second output determination layer 802 using the learning data set 1000.

　ステップＳ１２０３において、質問応答部１１０は、学習用データセット１１００を用いて、理解層３２０、第３の出力層３２３、第３の出力判断層８０３について学習処理を行う。 In step S1203, the question answering unit 110 performs learning processing on the understanding layer 320, the third output layer 323, and the third output determination layer 803 using the learning data set 1100.
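The order of steps S1201 to S1203 can be sketched as a simple schedule, shown here only as an illustration. The layer labels, the `run_learning` helper, and the `train_layers` callback are hypothetical names; the actual learning routine is not specified at this level of the description.

```python
from collections import Counter

# Each entry: (step, learning data set number, layers trained in that step).
LEARNING_STEPS = [
    ("S1201", 900, ["understanding_320", "output_321", "judgment_801"]),
    ("S1202", 1000, ["understanding_320", "output_322", "judgment_802"]),
    ("S1203", 1100, ["understanding_320", "output_323", "judgment_803"]),
]


def run_learning(steps, train_layers):
    """Run the steps in order and count how many times each layer is trained."""
    trained = Counter()
    for _step_id, data_set_id, layers in steps:
        train_layers(data_set_id, layers)  # hypothetical per-step training routine
        trained.update(layers)
    return trained


# Stand-in training routine that only records what it was asked to train.
log = []
counts = run_learning(
    LEARNING_STEPS, lambda data_set_id, layers: log.append((data_set_id, tuple(layers)))
)
```

In this schedule the shared understanding layer 320 appears in all three steps, while each output layer and its output judgment layer appear in exactly one step.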

<Summary>
As is clear from the above description, the question answering device 100 according to the second embodiment:
- Has an understanding layer that takes a question and a passage as inputs and calculates information indicating the relevance between the question and the passage.
- Has first to third output layers that each take the information indicating the relevance calculated by the understanding layer as input and output first to third responses in mutually different output formats.
- Has first to third output judgment layers that receive the state vectors of the first to third output layers and calculate individual scores (first to third scores) for the first to third output layers. It further has a selection unit that selects a predetermined number of responses using the first to third scores calculated by the first to third output judgment layers as selection indices.
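The role of the selection unit can be sketched as follows. This is a minimal illustration that assumes responses with higher scores are preferred; the description states only that the first to third scores serve as selection indices. The `Candidate` type, the `select_responses` function, and the example responses and score values are hypothetical.

```python
from typing import List, NamedTuple


class Candidate(NamedTuple):
    source: str     # which output layer produced the response
    response: str   # the response text in that layer's output format
    score: float    # score from the corresponding output judgment layer


def select_responses(candidates: List[Candidate], num_responses: int) -> List[Candidate]:
    """Return the predetermined number of responses, highest score first (assumed rule)."""
    ranked = sorted(candidates, key=lambda c: c.score, reverse=True)
    return ranked[:num_responses]


candidates = [
    Candidate("first output layer 321", "response A", 0.2),
    Candidate("second output layer 322", "response B", 0.9),
    Candidate("third output layer 323", "response C", 0.5),
]
selected = select_responses(candidates, num_responses=2)
```

Because each output layer has its own judgment layer, the selection step only needs one score per response and does not depend on how many output formats exist.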

As a result, like the first embodiment, the question answering device 100 according to the second embodiment can output responses to a question in a plurality of output formats while suppressing the consumption of computer resources and selecting an appropriate response. In addition, even when a new output format is added, the relearning process need not be performed on the entire question answering unit.

That is, the second embodiment can provide a more feasible question answering device, question answering method, and question answering program that can output responses in a plurality of output formats when a response to a question is output by machine reading comprehension.

[Other embodiments]
The first and second embodiments each describe installing a different output judgment layer. Whether to install the output judgment layer of the first embodiment or that of the second embodiment may be decided freely. For example, the decision may take into account the task and purpose to be set, the system configuration of the question answering device, and the like.

Further, the first and second embodiments have been described on the assumption that the learning phase and the response phase are executed in the same question answering device 100. However, the learning phase and the response phase may be executed by separate devices. In this case, the device that executes the response phase does not need to have the learning data set storage unit 120, and the comparison/change units 410 and 910 are not installed in it either.

It should be noted that the present invention is not limited to the configurations shown here, such as the configurations described in the above embodiments and their combinations with other elements. These points can be changed without departing from the spirit of the present invention and can be determined appropriately according to the form of application.

100: Question answering device
110: Question answering unit
120: Learning data set storage unit
130: Passage storage unit
310: Input unit
320: Understanding layer
321: First output layer
322: Second output layer
323: Third output layer
324: Output judgment layer
330: Selection unit
400 to 600: Learning data sets
801: First output judgment layer
802: Second output judgment layer
803: Third output judgment layer
810: Selection unit
900 to 1100: Learning data sets

Claims

A question answering device comprising:
a calculation unit that receives, as inputs, a question and a related document used to answer the question, and calculates information indicating the relevance between the question and the related document;
a plurality of output units that each receive, as input, the information indicating the relevance calculated by the calculation unit, and output responses to the question in mutually different output formats; and
a selection unit that selects a predetermined number of responses from among the responses output by the plurality of output units in their respective output formats.

The question answering device according to claim 1, further comprising an index calculation unit that receives the information indicating the relevance calculated by the calculation unit and calculates a probability distribution of the responses output by the plurality of output units in their respective output formats,
wherein the selection unit selects the predetermined number of responses based on the probability distribution of the responses.

The question answering device according to claim 1, further comprising a plurality of index calculation units that receive information calculated when the plurality of output units output their respective responses, and calculate a score for each of the plurality of output units,
wherein the selection unit selects the predetermined number of responses based on the score for each of the plurality of output units.

A question answering method comprising:
a calculation step of receiving, as inputs, a question and a related document used to answer the question, and calculating information indicating the relevance between the question and the related document;
a plurality of output steps of each receiving, as input, the information indicating the relevance calculated in the calculation step, and outputting responses to the question in mutually different output formats; and
a selection step of selecting a predetermined number of responses from among the responses output in their respective output formats in the plurality of output steps.

A question answering program for causing a computer to execute:
a calculation step of receiving, as inputs, a question and a related document used to answer the question, and calculating information indicating the relevance between the question and the related document;
a plurality of output steps of each receiving, as input, the information indicating the relevance calculated in the calculation step, and outputting responses to the question in mutually different output formats; and
a selection step of selecting a predetermined number of responses from among the responses output in their respective output formats in the plurality of output steps.