WO2024203621A1

WO2024203621A1 - Information processing method, information processing device, and program

Info

Publication number: WO2024203621A1
Application number: PCT/JP2024/010759
Authority: WO
Inventors: 啓輔前田
Original assignee: Sony Group Corp
Current assignee: Sony Group Corp
Priority date: 2023-03-29
Filing date: 2024-03-19
Publication date: 2024-10-03
Anticipated expiration: 2025-09-29
Also published as: JPWO2024203621A1

Abstract

The invention of the present disclosure acquires an operation parameter to be applied to a robot when there is an environmental change, proposes the acquired operation parameter to a user, and applies the proposed operation parameter to the robot when the proposal for the proposed operation parameter is accepted.

Description

Information processing method, information processing device, and program

　本開示は、情報処理方法、情報処理装置及びプログラムに関する。 This disclosure relates to an information processing method, an information processing device, and a program.

　ビデオカメラやマイクロフォン等の環境情報取得部と、現在位置検出部と、地図管理部とを備え、移動領域内の環境情報を参照しつつ自律的に移動するロボットの制御装置がある。 There is a control device for a robot that is equipped with an environmental information acquisition unit such as a video camera or microphone, a current position detection unit, and a map management unit, and moves autonomously while referring to environmental information within the movement area.

　ロボットの制御装置は、異なる環境に対応させてロボットを最適制御するための環境に適合した制御パラメータの格納手段を有し、現在位置検出手段からの信号に基づいて地図上の現在位置を特定すると共に、その領域に適合した制御パラメータを検索し、検索された制御パラメータによって当該ロボットの運動機能或いは環境情報取得部を制御する。 The robot control device has a storage means for control parameters adapted to the environment in order to optimally control the robot in response to different environments, and identifies the current position on the map based on a signal from the current position detection means, searches for control parameters adapted to that area, and controls the robot's movement functions or environmental information acquisition unit using the searched control parameters.

特開２００６－７５９４８号公報JP 2006-75948 A

　しかし、ロボットの動作パラメータは、時間変化とともに最適でなくなる可能性がある。例えば、導入時は低速で動かしていたが、慣れてきたら遅く感じる場合、子供も大きくなったので回避幅を小さくして良い場合、知らない人が来たので低速にしたい場合等である。 However, the robot's operating parameters may no longer be optimal over time. For example, a robot may move slowly when first introduced, but as the user becomes accustomed to it, it may feel slow, or the child may grow older and the avoidance range may be reduced, or an unfamiliar person may approach and the robot may want to move at a slower speed.

　また、新規の制御パラメータを設定するにしてもそのための根拠がなく精度の良いロボットの制御パラメータを設定することが出来ないという課題があった。 In addition, there was a problem in that even if new control parameters were to be set, there was no basis for doing so, making it impossible to set accurate control parameters for the robot.

　本開示は、上記問題に鑑みてなされたものであり、時間変化があっても精度の良いロボットの制御パラメータを設定することができる情報処理方法、情報処理装置及びプログラムに関する。 The present disclosure has been made in consideration of the above problems, and relates to an information processing method, information processing device, and program that can set robot control parameters with high accuracy even if they change over time.

　本開示の情報処理方法は、環境変化がある場合に、ロボットに適用される変更する動作パラメータを取得し、前記取得した前記動作パラメータをユーザに提案し、前記提案した前記動作パラメータの提案が受け付けられた場合に、前記提案された前記動作パラメータを前記ロボットに適用する。 The information processing method disclosed herein acquires operation parameters to be changed and applied to the robot when an environmental change occurs, proposes the acquired operation parameters to a user, and applies the proposed operation parameters to the robot when the proposed operation parameters are accepted.

実施の形態に係る情報処理システムの機能ブロック図である。1 is a functional block diagram of an information processing system according to an embodiment. 実施の形態に係るロボットの動作パラメータの一例を示す図である。FIG. 4 is a diagram showing an example of operation parameters of a robot according to an embodiment. 実施の形態に係るロボットの環境情報の一例を示す図である。FIG. 4 is a diagram showing an example of environmental information of a robot according to an embodiment; 実施の形態に係るサーバにおけるパターン化を説明するための図である。FIG. 11 is a diagram for explaining patterning in a server according to an embodiment. 実施の形態に係るアクションポリシーパターン化の実装例について説明するための図である。11 is a diagram for explaining an implementation example of action policy patterning according to an embodiment. FIG. 実施の形態に係るアクションポリシーのパターン化を説明するためのフローチャートである。11 is a flowchart for explaining patterning of action policies according to an embodiment; 実施の形態に係るアクションポリシー確認部の動作を説明するためのフローチャートである。10 is a flowchart illustrating an operation of an action policy checking unit according to the embodiment. 実施の形態に係る環境変化観測部の動作を説明するためのフローチャートである。5 is a flowchart illustrating an operation of an environmental change observing unit according to the embodiment. 実施の形態に係る動作パラメータ提案部の動作を説明するためのフローチャートである。10 is a flowchart for explaining an operation of an operation parameter proposing unit according to the embodiment. 実施の形態に係るアクションポリシーパターン化部の動作を説明するためのフローチャートである。10 is a flowchart illustrating an operation of an action policy patterning unit according to the embodiment. 実施の形態に係るアクションポリシースコア推定部の動作を説明するためのフローチャートである。10 is a flowchart illustrating an operation of an action policy score estimation unit according to the embodiment. 実施の形態に係る情報処理システム１のロボット及びサーバの演算部の機能を実現するコンピュータの一例を示すハードウェア構成図である。FIG. 2 is a hardware configuration diagram showing an example of a computer that realizes the functions of the robot and the processing unit of the server of the information processing system 1 according to the embodiment.

　以下、添付図面を参照しながら、本開示の好適な実施の形態について詳細に説明する。本明細書及び図面において、実質的に同一の機能構成を有する構成要素については、同一の符号を付することにより重複説明を省略する。説明は以下の順序で行うものとする。 Below, a preferred embodiment of the present disclosure will be described in detail with reference to the attached drawings. In this specification and drawings, components having substantially the same functional configuration will be denoted by the same reference numerals to avoid duplicated explanations. The explanation will be given in the following order.

　１．情報処理システム１の機能ブロック図
　　１．１．ロボット２の機能ブロック図
　　１．２．サーバ３の機能ブロック図
　　１．３．動作パラメータ
　　１．４．環境情報
　　１．５．アクションポリシーパターン化
　　１．６．アクションポリシーパターン化の実装例
　２．情報処理システム１の動作
　　２．１．アクションポリシー確認部１３の動作
　　２．２．環境変化観測部１６の動作
　　２．３．動作パラメータ提案部１５の動作
　　２．４．アクションポリシーパターン化部３２の動作
　　２．５．アクションポリシースコア推定部３５の動作
　３．効果
　４．ハードウェア構成 1. Functional block diagram of information processing system 1 1.1. Functional block diagram of robot 2 1.2. Functional block diagram of server 3 1.3. Operation parameters 1.4. Environmental information 1.5. Action policy patterning 1.6. Implementation example of action policy patterning 2. Operation of information processing system 1 2.1. Operation of action policy confirmation unit 13 2.2. Operation of environmental change observation unit 16 2.3. Operation of operation parameter proposal unit 15 2.4. Operation of action policy patterning unit 32 2.5. Operation of action policy score estimation unit 35 3. Effects 4. Hardware configuration

　＜１．情報処理システム１の機能ブロック図＞
　図１は、実施の形態に係る情報処理システム１の機能ブロック図である。図１に示すように、情報処理システム１は、ロボット２及びサーバ３を有する。 <1. Functional block diagram of information processing system 1>
1 is a functional block diagram of an information processing system 1 according to an embodiment. As shown in FIG. 1, the information processing system 1 includes a robot 2 and a server 3.

　図１では、ロボット２及びサーバ３は、ネットワーク（図示せず）を介して接続される。図１では、１台のロボット２を示しているが、ロボット２は複数台あっても良い。複数台のロボット２は、それぞれロボット２の動作パラメータの制御装置を有する。 In FIG. 1, the robot 2 and the server 3 are connected via a network (not shown). Although one robot 2 is shown in FIG. 1, there may be multiple robots 2. Each of the multiple robots 2 has a control device for the operating parameters of the robot 2.

　　＜１．１．ロボット２の機能ブロック図＞
　以下、ロボット２の動作パラメータの制御装置の機能ブロック図について図１を参照して説明する。 <1.1. Functional block diagram of robot 2>
A functional block diagram of the control device for the motion parameters of the robot 2 will be described below with reference to FIG.

　ロボット２は、ユーザＩ／Ｆ１１、動作パラメータ変更部１２、アクションポリシー確認部１３、アクションポリシーデータベース１４、動作パラメータ提案部１５、環境変化観測部１６、動作パラメータデータベース１７、制御部１８、アクチュエータ１９、センサ２０、認識部２１、環境情報データベース２２を有する。 The robot 2 has a user I/F 11, an operation parameter change unit 12, an action policy confirmation unit 13, an action policy database 14, an operation parameter proposal unit 15, an environmental change observation unit 16, an operation parameter database 17, a control unit 18, an actuator 19, a sensor 20, a recognition unit 21, and an environmental information database 22.

（ユーザＩ／Ｆ１１）
　ユーザＩ／Ｆ１１は、ロボット２の動作パラメータの制御装置への入出力インターフェイスであり、例えば、ＰＤＡ（Ｐｅｒｓｏｎａｌ　Ｄｉｇｉｔａｌ　Ａｓｓｉｓｔａｎｔ）、携帯電話等のコンピュータであっても良い。ユーザＩ／Ｆ１１は、ロボット２の動作パラメータの変更依頼を受け付け、受け付けた変更依頼に対応する変更後の動作パラメータを出力する。「変更依頼」としては、例えば、ロボット２の「移動速度を上げろ」、「移送速度を下げろ」等である。ユーザＩ／Ｆ１１は、このような「変更依頼」に対応する変更後の動作パラメータを動作パラメータ変更部１２に出力する。なお、図１においては、ユーザＩ／Ｆ１１からは、「変更依頼」に対応する変更後の動作パラメータが出力されるが、図１では説明を分かりやすくするため、「変更依頼」として示している。 (User I/F 11)
The user I/F 11 is an input/output interface for the control device of the operation parameters of the robot 2, and may be, for example, a computer such as a PDA (Personal Digital Assistant) or a mobile phone. The user I/F 11 accepts a request to change the operation parameters of the robot 2, and outputs the changed operation parameters corresponding to the accepted change request. Examples of the "change request" include "increase the moving speed" and "decrease the transport speed" of the robot 2. The user I/F 11 outputs the changed operation parameters corresponding to such a "change request" to the operation parameter change unit 12. Note that, in FIG. 1, the user I/F 11 outputs the changed operation parameters corresponding to the "change request", but in FIG. 1, it is shown as a "change request" for ease of understanding.

（動作パラメータ変更部１２）
　動作パラメータ変更部１２は、ユーザＩ／Ｆ１１から出力された変更後の動作パラメータ（変更内容）をアクションポリシー確認部１３に出力する。動作パラメータの変更の例としては、ある値に設定する場合とある値を増減することが可能である。例えば、ある値に設定する場合の例として、「並進速度＝ゆっくり」と設定する場合がある。ある値を増減する場合の例として、「並進速度＋＝０．１ｍ／ｓ」と設定する場合がある。 (Operation parameter change unit 12)
The action parameter change unit 12 outputs the changed action parameters (change contents) output from the user I/F 11 to the action policy confirmation unit 13. As an example of changing the action parameters, it is possible to set a certain value or to increase or decrease a certain value. For example, as an example of setting a certain value, "translation speed = slow" may be set. As an example of increasing or decreasing a certain value, "translation speed + = 0.1 m/s" may be set.

　「アクションポリシー」は、「環境変化」と「動作パラメータ」との組み合わせを有する。例えば、環境変化が「貴重品：＋４個」の場合に、動作パラメータが「並進速度＝ゆっくり」という組み合わせである。また、「アクションポリシー」は、例えば、環境変化が「人が増えた」の場合、動作パラメータが「移動速度を低速にする」という組み合わせである。このアクションポリシーは、人混みではゆっくりロボット２を動作させるためのものである。アクションポリシーは、人の立場、役割、性格によって変わりうるため、パターン化、グループ化が可能である。 An "action policy" is a combination of an "environmental change" and an "action parameter." For example, when the environmental change is "valuables: +4," the action parameter is "translation speed = slow." Another "action policy" is, for example, when the environmental change is "increased people," the action parameter is "slow down the movement speed." This action policy is for making robot 2 move slowly in crowds. Action policies can change depending on people's position, role, and personality, so they can be patterned and grouped.

　アクションポリシーは、ユーザに対してそれぞれ複数でも構わない。例えば、環境変化が「知らない人：有」の場合に、動作パラメータが「音声出力＝ｔｒｕｅ、表示灯＝ｔｒｕｅ」という組み合わせである。他にも環境変化が「道幅＝狭い、子供：有」の場合に、動作パラメータが「よけ始める距離＝遠め、並進速度＝ゆっくり」という組み合わせである。また、動作パラメータ変更部１２は、動作パラメータ提案部１５から出力された受け付けられた提案の動作パラメータを受信し、動作パラメータデータベース１７に格納する。 There may be multiple action policies for each user. For example, when the environmental change is "Strangers: present", the action parameters may be a combination of "audio output = true, indicator light = true". Another example is when the environmental change is "road width = narrow, children: present", the action parameters may be a combination of "distance at which to start dodging = farther, translation speed = slower". The action parameter change unit 12 also receives the proposed action parameters that have been accepted and output from the action parameter suggestion unit 15, and stores them in the action parameter database 17.

　アクションポリシーは、具体的には、文字列または数列からなるデータである。アクションポリシーは、「環境変化」と「動作パラメータ」の組み合わせからなるため、「環境変化」は「文字列または数列」からなるデータで、「動作パラメータ」は「ロボット２の動作に対して影響を与える文字列または数列」からなるデータである。 Specifically, an action policy is data consisting of a character string or a number sequence. An action policy is made up of a combination of "environmental changes" and "operation parameters", so "environmental changes" are data consisting of "character strings or number sequences", and "operation parameters" are data consisting of "character strings or number sequences that affect the operation of robot 2".

　「環境変化」は、ロボット２の周囲の環境が変化したことであり、ロボット２の位置に関するデータ、時間に関する内容を含んでも良い。また、「環境変化」には、ロボット２の場所の意味情報、物体情報、走行領域の形状、実行予定の動作内容、気温/湿度/照度などの環境特性が含まれても良い。 "Environmental change" refers to a change in the environment around robot 2, and may include data on the position of robot 2 and information on time. "Environmental change" may also include semantic information on the location of robot 2, object information, the shape of the travel area, the operation to be performed, and environmental characteristics such as temperature, humidity, and illuminance.

　「環境変化」は、カメラ画像の全部または画像フレームの一部から取得した環境情報の時間差分から得られる。「環境情報」は、カメラ画像以外の、ＬｉＤＡＲ（Ｌｉｇｈｔ　Ｄｅｔｅｃｔｉｏｎ　Ａｎｄ　Ｒａｎｇｉｎｇ）、マイク、温湿度センサ、照度センサ、ＩＭＵ（Ｉｎｅｒｔｉａｌ　Ｍｅａｓｕｒｅｍｅｎｔ　Ｕｎｉｔ）や車輪エンコーダなどのセンサ情報から得られる。 "Environmental changes" are obtained from the time difference of environmental information obtained from all camera images or parts of image frames. "Environmental information" is obtained from sensor information other than camera images, such as LiDAR (Light Detection and Ranging), microphone, temperature and humidity sensor, illuminance sensor, IMU (Inertial Measurement Unit), and wheel encoder.

　「動作パラメータ」は、デフォルト値として工場出荷時に設定される。「動作パラメータ」は、「環境変化」と紐づいておらず、動作パラメータのみで存在する。「動作パラメータ」は、工場出荷後に、ユーザが追加で入力/修正可能である。 "Operation parameters" are set as default values when shipped from the factory. "Operation parameters" are not linked to "environmental changes" and exist as operational parameters alone. "Operation parameters" can be additionally input/modified by the user after shipping from the factory.

　なお、ロボット２が動作パラメータを追加入力/修正することもできるが、ユーザの特性（立場、役割、性格）によって、適切な動作パラメータが異なると考えられるため、動作パラメータは、ユーザの入力や環境変化などなにかの入力と紐づけられても良い。これにより、ロボット２がユーザによって適切な動作パラメータを設定することができる。 Note that the robot 2 can also add/modify movement parameters, but since appropriate movement parameters are thought to differ depending on the characteristics of the user (position, role, personality), the movement parameters may be linked to some input, such as user input or environmental changes. This allows the robot 2 to set appropriate movement parameters according to the user.

（動作パラメータデータベース１７等）
　動作パラメータデータベース１７は、動作パラメータ変更部１２により格納された受け付けられた提案の動作パラメータを制御部１８に出力する。制御部１８は、ロボット２の各アクチュエータ１９を制御する制御装置である。制御部１８は、動作パラメータデータベース１７から出力された動作パラメータを対応するアクチュエータ１９に出力する。アクチュエータ１９は、受信した動作パラメータに基づいて、ロボット２のアクチュエータ１９に対応する各部を制御する。 (Operation parameter database 17, etc.)
The operation parameter database 17 outputs the proposed operation parameters stored by the operation parameter changing unit 12 and accepted to the control unit 18. The control unit 18 is a control device that controls each actuator 19 of the robot 2. The control unit 18 outputs the operation parameters output from the operation parameter database 17 to the corresponding actuators 19. The actuators 19 control each unit of the robot 2 corresponding to the actuators 19 based on the received operation parameters.

（センサ２０等）
　センサ２０は、ロボット２の環境に関する情報を計測する。認識部２１は、センサ２０によって計測された環境に関する情報を受信し、環境情報を認識する。認識部２１は、環境情報を入力された環境に関連する情報と合わせて認識しても良い。認識部２１は、認識した環境情報を環境情報データベース２２に格納する。環境情報データベース２２は、格納した環境情報を環境変化観測部１６に出力する。 (Sensor 20, etc.)
The sensor 20 measures information related to the environment of the robot 2. The recognition unit 21 receives the information related to the environment measured by the sensor 20 and recognizes the environmental information. The recognition unit 21 may recognize the environmental information together with information related to the input environment. The recognition unit 21 stores the recognized environmental information in the environmental information database 22. The environmental information database 22 outputs the stored environmental information to the environmental change observation unit 16.

（環境変化観測部１６）
　環境変化観測部１６は、環境情報データベース２２から出力された環境情報に基づいて、環境変化を観測し、環境変化があったことをアクションポリシー確認部１３及び動作パラメータ提案部１５に出力する。 (Environmental change observation unit 16)
The environmental change observing unit 16 observes an environmental change based on the environmental information output from the environmental information database 22, and outputs a notification that an environmental change has occurred to the action policy confirming unit 13 and the operation parameter proposing unit 15.

　「環境変化」は、ロボット２の周囲の環境が変化したことであり、例えば、ロボット２の環境が変化してある値になったこと（以前の値は関係ない）、環境情報が閾値を上回った／下回ったこと、環境情報の変化量が閾値を上回った／下回ったこと等がある。 "Environmental change" refers to a change in the environment around robot 2, for example, the environment of robot 2 changing to a certain value (the previous value is irrelevant), environmental information exceeding/falling below a threshold, the amount of change in environmental information exceeding/falling below a threshold, etc.

　環境が変化してある値になったこととは、例えば、「知らない人：いない」から「知らない人：いる」に変化した場合や、「道幅：普通、広い」から「道幅：狭い」に変化した場合がある。 The environment may change to a certain value, for example, if it changes from "Strangers: None" to "Strangers: Present," or from "Road width: Normal, Wide" to "Road width: Narrow."

　環境情報が閾値を上回った／下回ったこととは、例えば、「貴重品との距離が１．０ｍを下回った」場合、「使用時間が６カ月を上回った」場合がある。 For example, environmental information exceeding/falling below a threshold value could mean "the distance to valuables is less than 1.0 m" or "the usage time exceeds six months."

　変化量が閾値を上回った／下回ったこととは、例えば、「子供：（普段）１人」から「子供：５人」に変化し、＋４人の変化が閾値を上回った場合がある。 An example of a change exceeding/falling below a threshold would be when the number of children changes from "children (usually) 1" to "children: 5," with the change of +4 exceeding the threshold.

（アクションポリシー確認部１３）
　アクションポリシー確認部１３は、環境変化が観測された場合、動作パラメータの変更が環境変化が原因なのかをユーザに質問する。アクションポリシー確認部１３は、動作パラメータの変更が環境変化が原因の場合、「環境変化」と「（変更依頼の）動作パラメータ」との組み合わせを有する「アクションポリシー」を取得してロボット２のアクションポリシーデータベース１４に保存する。アクションポリシー確認部１３は、動作パラメータの変更が環境変化が原因であって、変更理由が得られた場合には、変更理由を「アクションポリシー」に付加してとして保存する。アクションポリシー確認部１３は、動作パラメータの変更が環境変化によるものである場合、ユーザのアクションポリシーの環境変化を原因となる環境変化に更新する。 (Action Policy Checking Unit 13)
When an environmental change is observed, the action policy checking unit 13 asks the user whether the change in the operating parameters is caused by the environmental change. When the change in the operating parameters is caused by the environmental change, the action policy checking unit 13 acquires an "action policy" having a combination of the "environmental change" and the "operation parameters (of the change request)" and saves it in the action policy database 14 of the robot 2. When the change in the operating parameters is caused by the environmental change and the reason for the change is obtained, the action policy checking unit 13 adds the reason for the change to the "action policy" and saves it as such. When the change in the operating parameters is caused by the environmental change, the action policy checking unit 13 updates the environmental change in the user's action policy to the environmental change that is the cause.

　更新されたユーザのアクションポリシーが所属するアクションポリシーのパターンは、アクションポリシーのスコア（図９のステップＳ４２のアクションポリシーのスコアと同様）を有する。ユーザへの質問は、アクションポリシーのスコアの降順に行われる。 The action policy pattern to which the updated user's action policy belongs has an action policy score (similar to the action policy score in step S42 of FIG. 9). Questions are posed to the user in descending order of the action policy scores.

（アクションポリシーデータベース１４）
　アクションポリシーデータベース１４には、アクションポリシー確認部１３おいて、ユーザのアクションポリシーが更新されると、更新されたユーザのアクションポリシーが格納される。また、アクションポリシー実績をサーバ３に送信し、サーバ３からアクションポリシーのスコアを受信した場合には、その内容に基づいて新たなアクションポリシーの格納や格納済みのアクションポリシーのスコアを更新する。 (Action Policy Database 14)
When a user's action policy is updated by the action policy confirmation unit 13, the updated user's action policy is stored in the action policy database 14. In addition, when the action policy results are transmitted to the server 3 and an action policy score is received from the server 3, a new action policy is stored or the score of an action policy already stored is updated based on the contents of the action policy.

　つまり、アクションポリシーデータベース１４は、環境変化がある場合に、環境変化を含むロボットの変更する動作パラメータを格納する。また、アクションポリシーデータベース１４には、サーバ３からアクションポリシーのスコアがアクションポリシーに付加して格納される。 In other words, the action policy database 14 stores the operating parameters of the robot that are changed, including the environmental change, when there is an environmental change. Also, the action policy score from the server 3 is added to the action policy and stored in the action policy database 14.

　各ロボット２のアクションポリシーデータベース１４に格納されているデータ形式は、［アクションポリシー、採用回数、不採用回数、スコア］の集合となる。 The data format stored in the action policy database 14 of each robot 2 is a collection of [action policy, number of times adopted, number of times rejected, score].

　採用回数の増加／不採用回数の増加は、アクションポリシー確認部１３及び動作パラメータ提案部１５により行われる。採用回数及び不採用回数は、サーバ３のアクションポリシーパターン化部３２で所属するユーザのアクションポリシー実績を取得する際に使用される。 The increase in the number of adoptions/rejections is performed by the action policy confirmation unit 13 and the operation parameter proposal unit 15. The number of adoptions and rejections is used by the action policy patterning unit 32 of the server 3 when acquiring the action policy performance of the user.

（動作パラメータ提案部１５）
　動作パラメータ提案部１５は、環境変化がある場合に、アクションポリシーデータベース１４に格納されたロボット２に適用される変更する動作パラメータを取得する。動作パラメータ提案部１５は、提案した動作パラメータの提案が受け付けられた場合に、提案された動作パラメータを動作パラメータ変更部１２に出力し、動作パラメータの変更依頼を行う。 (Operation Parameter Proposal Unit 15)
When there is an environmental change, the operation parameter suggestion unit 15 acquires operation parameters to be changed and applied to the robot 2 stored in the action policy database 14. When the operation parameter suggestion unit 15 accepts the proposal of the proposed operation parameters, it outputs the proposed operation parameters to the operation parameter change unit 12 and makes a request to change the operation parameters.

　動作パラメータ提案部１５により取得される動作パラメータは、ユーザのアクションポリシーが所属するアクションポリシーのパターンの動作パラメータである。ユーザへの提案は、ユーザのアクションポリシーとの関連度が閾値以上のパターンの動作パラメータについて行われる。 The operation parameters acquired by the operation parameter suggestion unit 15 are the operation parameters of the pattern of the action policy to which the user's action policy belongs. Suggestions to the user are made for operation parameters of patterns whose relevance to the user's action policy is equal to or greater than a threshold value.

　動作パラメータ提案部１５は、提案した動作パラメータの提案が受け付けられた場合に、採用回数を増加する。また、動作パラメータ提案部１５は、提案した動作パラメータの提案が受け付けられない場合に、不採用回数を増加する。 The operation parameter suggestion unit 15 increases the number of times the proposed operation parameter is adopted when the proposed operation parameter is accepted. In addition, the operation parameter suggestion unit 15 increases the number of times the proposed operation parameter is rejected when the proposed operation parameter is not accepted.

　ユーザのアクションポリシーが所属するアクションポリシーのパターンは、アクションポリシーのスコアを有する。動作パラメータ提案部１５は、アクションポリシーのスコアの降順に取得した動作パラメータをユーザに提案する。 The action policy pattern to which the user's action policy belongs has an action policy score. The action parameter suggestion unit 15 suggests to the user the action parameters obtained in descending order of the action policy scores.

　　＜１．２．サーバ３の機能ブロック図＞
　サーバ３は、アクションポリシー実績データベース３１、アクションポリシーパターン化部３２、ユーザデータベース３３、パターンデータベース３４、アクションポリシースコア推定部３５を有する。 <1.2. Functional block diagram of server 3>
The server 3 includes an action policy result database 31 , an action policy patterning unit 32 , a user database 33 , a pattern database 34 , and an action policy score estimation unit 35 .

（アクションポリシー実績データベース３１）
　アクションポリシー実績データベース３１は、各ロボット２のアクションポリシーデータベース１４に格納されたアクションポリシー実績を格納する。アクションポリシー実績のデータのデータ形式は、［アクションポリシー、ユーザ、採用回数、不採用回数］の集合となる。 (Action policy performance database 31)
The action policy result database 31 stores the action policy results stored in the action policy database 14 of each robot 2. The data format of the action policy result data is a set of [action policy, user, number of times adopted, number of times rejected].

（ユーザデータベース３３）
　ユーザデータベース３３は、ユーザデータを格納する。ユーザデータのデータ形式は、［パターン、所属ユーザ（複数）］の集合となる。 (User database 33)
The user database 33 stores user data. The data format of the user data is a set of [pattern, belonging users (multiple)].

（パターンデータベース３４）
　パターンデータベース３４は、パターンデータを格納する。パターンデータの形式は、［ユーザ、所属パターン（複数）］の集合となる。 (Pattern Database 34)
The pattern database 34 stores pattern data. The format of the pattern data is a set of [user, belonging pattern (plural)].

（アクションポリシーパターン化部３２）
　アクションポリシーパターン化部３２は、アクションポリシー実績データベース３１、ユーザデータベース３３、パターンデータベース３４に格納されたデータに基づいて、アクションポリシーのパターン化を行う。アクションポリシーのパターン化については後述する。 (Action Policy Patterning Unit 32)
The action policy patterning unit 32 patterns the action policies based on the data stored in the action policy result database 31, the user database 33, and the pattern database 34. The patterning of the action policies will be described later.

　「アクションポリシーパターン」とは、「アクションポリシー」の集合であり、グループのことである。例えば、（子育て家庭のパターン）の場合、環境変化が「子供：有」の場合に、動作パラメータが「並進速度＝ゆっくり」の場合であり、環境変化が「子供：無」の場合に、動作パラメータが「並進速度＝速め」の場合である。例えば、（人共存工場のパターン）の場合、環境変化が「照度が低い」の場合に、動作パラメータが「並進速度＝速め、表示灯＝ｆａｌｓｅ」の場合であり、環境変化が「人：有」の場合に、動作パラメータが「並進速度＝ゆっくり、表示灯＝ｔｒｕｅ、音声出力＝ｔｒｕｅ」の場合である。 An "action policy pattern" is a collection or group of "action policies." For example, in the case of (a pattern of a family with children), if the environmental change is "children: present," the action parameter is "translational speed = slow," and if the environmental change is "children: absent," the action parameter is "translational speed = fast." For example, in the case of (a pattern of a factory where people coexist), if the environmental change is "low illumination," the action parameters are "translational speed = fast, indicator light = false," and if the environmental change is "people: present," the action parameters are "translational speed = slow, indicator light = true, audio output = true."

（アクションポリシースコア推定部３５）
　アクションポリシースコア推定部３５は、ユーザデータベース３３、パターンデータベース３４に格納されたデータに基づいて、アクションポリシーのスコアの推定を行う。アクションポリシーのスコアの推定については後述する。 (Action Policy Score Estimation Unit 35)
The action policy score estimation unit 35 estimates the score of the action policy based on the data stored in the user database 33 and the pattern database 34. The estimation of the score of the action policy will be described later.

　図１において、
・「変更依頼」／「変更内容」／「変更提案」のデータ形式
　＝「動作パラメータの変更内容」の集合である。
・「アクションポリシー確認」のデータ形式
　＝以下のどちらか
　　・「変更内容」は「環境変化」によるものか？
　　・　「変更内容」はなにか？
である。
・「回答」のデータ形式
　＝以下のどちらか
　　・「ＹＥＳ」「ＮＯ」
　　・「自由回答」
である。
・「受諾／拒否」のデータ形式
　＝「ＹＥＳ」／「ＮＯ」
である。
・「環境変化」のデータ形式
　＝「環境変化」の配列
である。
・「アクションポリシー実績」のデータ形式
　＝［アクションポリシー、採用回数、不採用回数］　の集合
である。
・「アクションポリシーのスコア」のデータ形式
　＝［アクションポリシー、スコア］の集合
である。 In FIG.
Data format of "change request"/"change content"/"change proposal" = a collection of "change content of operation parameters".
- Data format of "Action policy confirmation" = one of the following - Is the "change" due to an "environmental change"?
- What are the “changes”?
It is.
・"Answer" data format = Either of the following ・"YES" or "NO"
・"Free response"
It is.
- "Accept/Reject" data format = "YES"/"NO"
It is.
・Data format of "Environmental changes" = Array of "Environmental changes".
Data format of "action policy performance" = a set of [action policy, number of adoptions, number of rejections].
Data format of "action policy score" = a set of [action policy, score].

　　＜１．３．動作パラメータ＞
　図２は、実施の形態に係るロボット２の動作パラメータの一例を示す図である。図２に示すように、「曲がり角で一時停止、その場回転するか」の動作パラメータについては、データの例として「回転方法＝一時停止及びその場回転」があり、値のバリエーションとして「一時停止及びその場回転、停止せずに通過」があり、データ型として「整数型等」がある。 1.3. Operation parameters
2 is a diagram showing an example of the motion parameters of the robot 2 according to the embodiment. As shown in Fig. 2, for the motion parameter "Whether to pause at a corner and turn on the spot", an example of data is "Turn method = pause and turn on the spot", a variation of the value is "pause and turn on the spot, pass through without stopping", and a data type is "integer type, etc."

　「並進／回転速度」の動作パラメータについては、データの例として「並進速度＝０．８ｍ／ｓ、回転速度＝０．５ｒａｄ／ｓ、並進速度＝速め、回転速度＝ゆっくりめ」があり、データ型として「浮動小数型、整数型など（速め＝０、標準＝１など）」がある。 For the "translation/rotation speed" operation parameters, examples of data include "translation speed = 0.8 m/s, rotation speed = 0.5 rad/s, translation speed = fast, rotation speed = slow," and data types include "floating point type, integer type, etc. (fast = 0, standard = 1, etc.)."

　「並進／回転加速度」の動作パラメータについては、データの例として「並進加速度＝０．３ｍ／ｓ２、回転加速度＝０．２ｒａｄ／ｓ２、並進加速度＝俊敏、回転加速度＝なめらか」があり、データ型として「浮動小数型、整数型など（俊敏＝０、標準＝１など）」がある。なお、「並進／回転速度」、「並進／回転加速度」の動作パラメータの分類は、「速度」に分類される。 For the "translational/rotational acceleration" motion parameter, examples of data include "translational acceleration = 0.3 m/s2, rotational acceleration = 0.2 rad/s2, translational acceleration = agile, rotational acceleration = smooth," and data types include "floating point type, integer type, etc. (agile = 0, standard = 1, etc.)." The motion parameters "translational/rotational speed" and "translational/rotational acceleration" are classified as "speed."

　「与えられた経路からの最大逸脱幅」の動作パラメータについては、データの例として「最大逸脱幅＝２．０ｍ、最大逸脱幅＝広め」があり、データ型として「浮動小数型、整数型など（狭め＝０、標準＝１、広め＝２など）」がある。 For the operating parameter "maximum deviation from a given route," examples of data include "maximum deviation = 2.0 m, maximum deviation = wide," and data types include "floating point type, integer type, etc. (narrow = 0, standard = 1, wide = 2, etc.)."

　「障害物をよけ始める距離」の動作パラメータについては、データの例として「よけ始める距離＝３．０ｍ、よけ始める距離＝近め」があり、データ型として「浮動小数型、整数型など（遠め＝０、標準＝１）」がある。 For the operation parameter "distance at which to start avoiding obstacles," examples of data include "distance at which to start avoiding obstacles = 3.0 m, distance at which to start avoiding obstacles = closer," and data types include "floating point type, integer type, etc. (farther = 0, standard = 1)."

　「障害物の横を通る際の距離」の動作パラメータについては、データの例として「通過のオフセット距離＝０．５ｍ、通過のオフセット距離＝ギリギリ」があり、データ型として「浮動小数型、整数型など（遠め＝０、標準＝１）」がある。 For the operation parameter "distance when passing next to an obstacle," examples of data include "passing offset distance = 0.5 m, passing offset distance = just barely," and data types include "floating point type, integer type, etc. (farther = 0, standard = 1)."

　「障害物をよけるか」の動作パラメータについては、データの例として「よけるか＝ｆａｌｓｅ　（よけない＝止まって待つ）」があり、データ型として「ｂｏｏｌ型など」がある。 For the action parameter "Avoid obstacles?", an example of data is "Avoid = false (Do not avoid = stop and wait)" and an example of the data type is "Boolean type, etc.".

　「上記３種類の「障害物」を「特定の属性の物体」に置き換えたもの」の動作パラメータについては、データの例として「上記３つと同じ」であり、他のバリエーションとして「特定の属性＝知っている人、知らない人、子供、大人、お年寄り、壊れやすいもの、危険物、食品、貴重品など」があり、データ型として「上記３つと同じ」がある。なお、「障害物をよけ始める距離」、「障害物の横を通る際の距離」、「障害物をよけるか」、「上記３種類の「障害物」を「特定の属性の物体」に置き換えたもの」の動作パラメータの分類は、「回避挙動」に分類される。 As for the behavior parameters of "replacing the above three types of 'obstacles' with 'objects with specific attributes'", an example of data is "same as the above three", other variations include "specific attributes = known people, strangers, children, adults, elderly people, fragile items, dangerous items, food, valuables, etc.", and the data type is "same as the above three". Note that the classification of behavior parameters of "distance at which to start avoiding obstacles", "distance when passing by an obstacle", "whether to avoid obstacles", and "replacing the above three types of 'obstacles' with 'objects with specific attributes'" are classified as "avoidance behavior".

　「動作中に音声を出すか」の動作パラメータについては、データの例として「音声出力＝ｆａｌｓｅ」があり、データ型として「ｂｏｏｌ型など」がある。なお、「動作中に音声を出すか」の動作パラメータの分類は、「音声」に分類される。 For the operation parameter "Should sound be produced during operation?", an example of data is "audio output = false" and an example of a data type is "bool type, etc." The classification of the operation parameter "Should sound be produced during operation?" is classified as "audio."

　「動作中に表示灯や進行方法を示すか」の動作パラメータについては、データの例として「表示灯＝ｔｒｕｅ、進行方向表示＝ｆａｌｓｅ」があり、データ型として「ｂｏｏｌ型など」がある。なお、「動作中に表示灯や進行方法を示すか」の動作パラメータの分類は、「表示灯」に分類される。 For the operation parameter "Should the indicator light or direction of travel be displayed during operation?", an example of data is "indicator light = true, direction of travel display = false", and the data type is "Boolean, etc.". The classification of the operation parameter "Should the indicator light or direction of travel be displayed during operation?" is classified as "indicator light".

　　＜１．４．環境情報＞
　図３は、実施の形態に係るロボット２の環境情報の一例を示す図である。図３に示すように、「場所の種別」の環境情報については、データの例として「現在地の種別＝オフィス」があり、値のバリエーションとして「オフィス、倉庫、搬入口、廊下、キッチン、ダイニング、寝室」があり、データ型として「整数型など：オフィス＝１、走行＝２…」がある。 <1.4. Environmental information>
3 is a diagram showing an example of the environmental information of the robot 2 according to the embodiment. As shown in FIG. 3, the environmental information of the "type of place" is, for example, "type of current location = office"", with value variations of "office, warehouse, loading entrance, hallway, kitchen, dining room, bedroom" and data types of "integer type, etc.: office = 1, running = 2...".

　「場所の属性」の環境情報については、データの例として「静かにすべき＝ｔｒｕｅ、清潔にすべき＝ｆａｌｓｅ…」があり、値のバリエーションとして「静かにすべき、清潔にすべき、安全に気を付けるべき」があり、データ型として「ｂｏｏｌ型の連想配列など」がある。なお、「場所の属性」の環境情報の分類は、「意味情報」に分類される。 As for the environmental information of "place attributes", examples of data include "should be quiet = true, should be clean = false...", and value variations include "should be quiet, should be clean, should be careful about safety", and data types include "Boolean associative array, etc.". The classification of environmental information of "place attributes" is categorized as "semantic information".

　「特定の属性の物体の有無」、「特定の属性の物体の数」、「特定の属性の物体の（同カテゴリでの）割合」、「特定の属性の物体の最近傍距離」の環境情報については、データの例として「子供：有無＝有、数＝３、割合＝６０％、距離＝１．２ｍ、お年寄り：有無＝有、数＝２、割合＝４０％、距離＝３．５ｍ、壊れやすいもの：有無＝有、数＝３、割合＝３０％、距離＝１．４ｍ」がある。他のバリエーションとして「属性：知っている人、知らない人、子供、大人、お年寄り、壊れやすいもの、危険物、食品、貴重品、有無：有／無」がある。 As for environmental information such as "presence or absence of objects with a particular attribute," "number of objects with a particular attribute," "proportion of objects with a particular attribute (in the same category)," and "nearest distance of objects with a particular attribute," examples of data include "Children: Presence = Yes, Number = 3, Proportion = 60%, Distance = 1.2 m, Elderly: Presence = Yes, Number = 2, Proportion = 40%, Distance = 3.5 m, Fragile items: Presence = Yes, Number = 3, Proportion = 30%, Distance = 1.4 m." Other variations include "Attributes: known people, strangers, children, adults, elderly people, fragile items, dangerous items, food, valuables, presence/absence: yes/no."

　「特定の属性の物体の有無」については、データ型として「ｂｏｏｌ型など」があり、「特定の属性の物体の数」については、データ型として「浮動小数型など」があり、「特定の属性の物体の（同カテゴリでの）割合」については、データ型として「浮動小数型など」があり、「特定の属性の物体の最近傍距離」については、データ型として「浮動小数型など」がある。なお、「特定の属性の物体の有無」、「特定の属性の物体の数」、「特定の属性の物体の（同カテゴリでの）割合」、「特定の属性の物体の最近傍距離」の環境情報の分類は、「物体情報」に分類される。 For "presence or absence of objects with a specific attribute," the data type is "Boolean, etc.", for "number of objects with a specific attribute," the data type is "floating point, etc.", for "proportion of objects with a specific attribute (in the same category)," the data type is "floating point, etc.", and for "nearest distance of objects with a specific attribute," the data type is "floating point, etc." Note that the classification of environmental information for "presence or absence of objects with a specific attribute," "number of objects with a specific attribute," "proportion of objects with a specific attribute (in the same category)," and "nearest distance of objects with a specific attribute" are classified as "object information."

　「走行領域の属性」の環境情報については、データの例として「道幅が狭い＝ｔｒｕｅ、曲がり角の見通しがよい＝ｆａｌｓｅ、床が滑る＝ｆａｌｓｅ」があり、値のバリエーションとして「道幅が狭い、曲がり角の見通しが良い、床が滑る」があり、データ型として「ｂｏｏｌ型の連想配列など」がある。 For the environmental information of "driving area attributes", examples of data include "road is narrow = true, visibility around corners is good = false, floor is slippery = false", and value variations include "road is narrow, visibility around corners is good, floor is slippery", and data types include "bool-type associative array, etc.".

　「走行領域の特徴量」の環境情報については、データの例として「道幅＝２．３ｍ、摩擦係数＝ＸＸＸ」があり、データ型として「浮動小数型など」がある。なお、「走行領域の属性」、「走行領域の特徴量」の環境情報の分類は、「走行領域の形状」に分類される。 For the environmental information "Characteristics of the driving area", examples of data include "Road width = 2.3 m, Friction coefficient = XXX", and data types include "Floating point type, etc.". The classification of environmental information for "Attributes of the driving area" and "Characteristics of the driving area" is categorized as "Shape of the driving area".

　「取り扱う物体属性」の環境情報については、データの例として「積載物が危険物＝ｔｒｕｅ、操作対象が壊れやすいもの＝ｆａｌｓｅ」があり、データ型として「ｂｏｏｌ型の連想配列など」がある。なお、「取り扱う物体属性」の環境情報の分類は、「動作内容」に分類される。 For the environmental information of "handled object attributes", examples of data include "load is dangerous = true, operation target is fragile = false", and a data type is "Boolean associative array, etc." The classification of environmental information of "handled object attributes" is categorized as "operation content".

　「使用時間」の環境情報については、データの例として「使用時間＝１２４日」があり、データ型として「浮動小数型の連想配列など」がある。なお、「使用時間」の環境情報の分類は、「時間経過」に分類される。 For the "usage time" environmental information, an example of data is "usage time = 124 days" and an example of a data type is "floating-point associative array, etc." The classification of "usage time" environmental information is "elapsed time."

　「季節、月」の環境情報については、データの例として「現在の季節＝夏、月＝７」があり、データ型として「整数型の連想配列など」がある。なお、「季節、月」の環境情報の分類は、「季節」に分類される。 For "season, month" environmental information, an example of data is "current season = summer, month = 7" and an example of a data type is "integer associative array, etc." The classification of "season, month" environmental information is classified as "season."

　「環境情報」の環境情報については、データの例として「気温＝２１．４℃、湿度＝４５％、照度＝１８４ｌｘ、気温＝高い、湿度＝低い、照度＝低い」があり、データ型として「浮動小数型、整数型の連想配列など」がある。なお、「環境情報」の環境情報の分類は、「環境特性」に分類される。 For the environmental information in "Environmental Information", examples of data include "Temperature = 21.4°C, Humidity = 45%, Illuminance = 184 lx, Temperature = High, Humidity = Low, Illuminance = Low", and data types include "floating-point type, integer type associative array, etc." The classification of environmental information in "Environmental Information" is "Environmental Characteristics".

　　＜１．５．アクションポリシーパターン化＞
　次に、サーバ３におけるアクションポリシーパターン化部３２におけるアクションポリシーパターン化について説明する。 1.5. Action Policy Patterning
Next, the action policy patterning in the action policy patterning unit 32 in the server 3 will be described.

　図４は、実施の形態に係るサーバ３におけるパターン化を説明するための図である。図４に示すように、各ユーザは、所属するパターンと、アクションポリシーの集合を有する。各パターンは所属するユーザと、アクションポリシーの集合を有する。あるユーザのアクションポリシーは、複数のパターンの集合で表現できると仮定する。 FIG. 4 is a diagram for explaining patterning in server 3 according to an embodiment. As shown in FIG. 4, each user has a pattern to which it belongs and a set of action policies. Each pattern has a user to which it belongs and a set of action policies. It is assumed that the action policy of a certain user can be expressed as a set of multiple patterns.

　例えば、ユーザＡは、アクションポリシーＸを有し、アクションポリシーＸを有するパターン「１」に属する。ユーザＢは、アクションポリシーＸ、アクションポリシーＹ、アクションポリシーＺを有し、アクションポリシーＸを有するパターン「１」、アクションポリシーＹ、アクションポリシーＺを有するパターン「２」に属する。ユーザＣは、アクションポリシーＹ、アクションポリシーＺ、アクションポリシーＷを有し、アクションポリシーＹ、アクションポリシーＺを有するパターン「２」、アクションポリシーＺ、アクションポリシーＷを有するパターン「３」に属する。 For example, user A has action policy X and belongs to pattern "1" with action policy X. User B has action policy X, action policy Y, and action policy Z, and belongs to pattern "1" with action policy X and pattern "2" with action policy Y and action policy Z. User C has action policy Y, action policy Z, and action policy W, and belongs to pattern "2" with action policy Y and action policy Z, and pattern "3" with action policy Z and action policy W.

　「パターンの更新」については、所属するユーザのアクションポリシーの共通項を有するパターンを探索し、パターンとして持つアクションポリシーを更新する。 For "pattern updates," patterns that have common elements in the action policies of the users who belong to the group are searched for, and the action policies that are included as patterns are updated.

　「ユーザの更新」については、ユーザのアクションポリシーがどのパターンのアクションポリシーの集合として表現すると合致度が高いか計算し、ユーザの所属するパターンを合致度が高いパターンに更新する。 For "updating a user," it calculates which set of action policies represents the user's action policy with the highest degree of match, and updates the pattern to which the user belongs to with the highest degree of match.

　　＜１．６．アクションポリシーパターン化の実装例＞
　図５は、実施の形態に係るアクションポリシーパターン化の実装例について説明するための図である。図５において、これまでのアクションポリシーデータを行列Ｄで表現する。行列Ｄの各行は、ユーザがどのアクションポリシーを持っているかを表す。 <1.6. Example of action policy pattern implementation>
5 is a diagram for explaining an implementation example of action policy patterning according to an embodiment. In Fig. 5, the action policy data up to now is expressed in a matrix D. Each row of the matrix D indicates which action policy the user has.

　図５に示すように、ユーザＡはアクションポリシーＸを有し、ユーザＢはアクションポリシーＸ、アクションポリシーＹ、アクションポリシーＺを有し、ユーザＣはアクションポリシーＹ、アクションポリシーＺ、アクションポリシーＷを有する状態を示している（図４も合わせて参照）。 As shown in Figure 5, user A has action policy X, user B has action policy X, action policy Y, and action policy Z, and user C has action policy Y, action policy Z, and action policy W (also see Figure 4).

　各ユーザの構成パターンを行列Ｕで表現する。Ｕの各行はユーザがどのパターンから構成されるかを表す。図５に示すように、ユーザＡはパターン「１」、ユーザＢはパターン「１」、パターン「２」、ユーザＣはパターン「２」、パターン「３」を有する状態を示している（図４も合わせて参照）。 The configuration patterns of each user are represented by a matrix U. Each row of U indicates which pattern a user is configured with. As shown in Figure 5, user A has pattern "1", user B has pattern "1" and pattern "2", and user C has pattern "2" and pattern "3" (also see Figure 4).

　各パターンのアクションポリシーを行列Ｐで表現する。Ｐの各行はパターンがどのアクションポリシーから構成されるかを表す。図５に示すように、パターン「１」はアクションポリシーＸを有し、パターン「２」はアクションポリシーＹ、アクションポリシーＺを有し、パターン「３」はアクションポリシーＺ、アクションポリシーＷを有する状態を示している（図４も合わせて参照）。 The action policies for each pattern are represented by a matrix P. Each row of P indicates which action policies the pattern is composed of. As shown in Figure 5, pattern "1" has action policy X, pattern "2" has action policy Y and action policy Z, and pattern "3" has action policy Z and action policy W (also see Figure 4).

　行列Ｕ、行列Ｐ、行列Ｄは、ＵＰ＝Ｄ　という関係性を有する。アクションポリシーのパターン化は、以下の順序で行われる。 Matrix U, matrix P, and matrix D have the relationship UP=D. Action policies are patterned in the following order:

　図６は、実施の形態に係るアクションポリシーのパターン化を説明するためのフローチャートである。最初に行列Ｕを乱数で初期化する（ステップＳ１）。次に、行列方程式を解いて行列Ｐを求める（ステップＳ２）。次に、最小二乗法でＵＰ＝Ｄに近くなる行列Ｕを求める（ステップＳ３）。そして、行列Ｕが求められたかの判断が行われ（ステップＳ４）、行列Ｕが求められない場合（ステップＳ４のＮｏ）、ステップＳ２の処理に戻る。行列Ｕが求められた場合（ステップＳ４のＹｅｓ）、処理を終了する。 FIG. 6 is a flowchart for explaining the patterning of action policies according to an embodiment. First, matrix U is initialized with random numbers (step S1). Next, a matrix equation is solved to obtain matrix P (step S2). Next, matrix U that is close to UP=D is obtained using the least squares method (step S3). Then, a determination is made as to whether matrix U has been obtained (step S4). If matrix U has not been obtained (No in step S4), the process returns to step S2. If matrix U has been obtained (Yes in step S4), the process ends.

　＜２．情報処理システム１の動作＞
　　＜２．１．アクションポリシー確認部１３の動作＞
　図７は、実施の形態に係るアクションポリシー確認部１３の動作を説明するためのフローチャートである。図７に示すように、アクションポリシー確認部１３は、環境変化観測部１６より環境変化があったかを判断する（ステップＳ２１）。 2. Operation of Information Processing System 1
<2.1. Operation of action policy check unit 13>
7 is a flow chart for explaining the operation of the action policy checking unit 13 according to the embodiment. As shown in Fig. 7, the action policy checking unit 13 judges whether or not an environmental change has occurred from the environmental change observing unit 16 (step S21).

　ステップＳ２１において、環境変化がない場合（ステップＳ２１のＮＯ）、ステップＳ２７の処理に移る。ステップＳ２１において、環境変化がある場合（ステップＳ２１のＹＥＳ）、環境変化のあったアクションポリシー（「環境変化」＋「変更後の動作パラメータ」）を取得する（ステップＳ２２）。 If there is no environmental change in step S21 (NO in step S21), the process proceeds to step S27. If there is an environmental change in step S21 (YES in step S21), the action policy that has undergone the environmental change ("environmental change" + "changed operating parameters") is obtained (step S22).

　次に、アクションポリシー確認部１３は、取得されたアクションポリシーをアクションポリシーのスコアの降順に並び替えを行う（ステップＳ２３）。このアクションポリシーのスコアは、アクションポリシーに付加されたスコアである。アクションポリシーのスコアは、アクションポリシーとともにアクションポリシーデータベース１４に格納される。 Next, the action policy confirmation unit 13 sorts the obtained action policies in descending order of the action policy score (step S23). This action policy score is the score added to the action policy. The action policy score is stored in the action policy database 14 together with the action policy.

　アクションポリシー確認部１３は、全てのアクションポリシーについて、アクションポリシーの「変更内容」（動作パラメータの変更）は、「環境変化」によるものかをユーザに質問する（ステップＳ２４）。そして、ユーザからの質問に対する回答が「ＹＥＳ」であるかを判断する（ステップＳ２５）。 The action policy confirmation unit 13 asks the user whether the "changes" (changes to the operating parameters) of the action policies for all action policies are due to "environmental changes" (step S24). It then determines whether the user answers the question with "YES" (step S25).

　回答が「ＹＥＳ」である場合（ステップＳ２５のＹＥＳ）、ステップＳ２８の処理に移る。回答が「ＹＥＳ」でない場合（ステップＳ２５のＮＯ）、変更理由が何かをユーザに質問する（ステップＳ２６）。 If the answer is "YES" (YES in step S25), the process proceeds to step S28. If the answer is not "YES" (NO in step S25), the user is asked the reason for the change (step S26).

　ステップＳ２６の質問に対する回答として「環境変化」が得られたかの判断が行われる（ステップＳ２７）。回答として「環境変化」が得られない場合（ステップＳ２７のＮＯ）、処理を終了する。回答として「環境変化」が得られた場合（ステップＳ２７のＹＥＳ）、「変更内容」と「環境変化」の組み合わせの採用回数を１つ増加する（ステップＳ２８）。この「採用回数」は、実績値としてサーバ３のアクションポリシーパターン化部３２で使用される。 It is determined whether "environmental change" is obtained as the answer to the question in step S26 (step S27). If "environmental change" is not obtained as the answer (NO in step S27), the process ends. If "environmental change" is obtained as the answer (YES in step S27), the number of times the combination of "change content" and "environmental change" has been adopted is increased by one (step S28). This "number of times adopted" is used as an achievement value in the action policy patterning unit 32 of the server 3.

　　＜２．２．環境変化観測部１６の動作＞
　図８は、実施の形態に係る環境変化観測部１６の動作を説明するためのフローチャートである。図８に示すように、環境変化観測部１６は、現在時刻の環境情報を取得する（ステップＳ３１）。 <2.2. Operation of the environmental change observation unit 16>
8 is a flow chart for explaining the operation of the environmental change observing section 16 according to the embodiment. As shown in Fig. 8, the environmental change observing section 16 acquires environmental information at the current time (step S31).

　次に、所定の時間差について、現在時刻－時間差の環境情報を取得し（ステップＳ３２）、環境情報の差分を抽出する（ステップＳ３３）。次に、環境変化観測部１６は、環境情報の差分があるかを判断する（ステップＳ３４）。 Next, for a given time difference, environmental information for the current time minus the time difference is acquired (step S32), and the difference in the environmental information is extracted (step S33). Next, the environmental change observation unit 16 determines whether there is a difference in the environmental information (step S34).

　差分がないと判断された場合（ステップＳ３４のＮＯ）、ステップＳ３２の処理に戻る。差分があると判断された場合（ステップＳ３４のＹＥＳ）、環境変化集合に追加し（ステップＳ３５）、ステップＳ３６の処理に移る。 If it is determined that there is no difference (NO in step S34), the process returns to step S32. If it is determined that there is a difference (YES in step S34), it is added to the environmental change set (step S35), and the process proceeds to step S36.

　ステップＳ３６では、ステップＳ３５で追加された環境変化集合が出力され（ステップＳ３６）、処理を終了する。 In step S36, the set of environmental changes added in step S35 is output (step S36), and the process ends.

　　＜２．３．動作パラメータ提案部１５の動作＞
　図９は、実施の形態に係る動作パラメータ提案部１５の動作を説明するためのフローチャートである。 <2.3. Operation of the operation parameter proposing unit 15>
FIG. 9 is a flowchart for explaining the operation of the operation parameter proposing unit 15 according to the embodiment.

　図９に示すように、動作パラメータ提案部１５は、該当の「環境変化」を含むアクションポリシーをアクションポリシーデータベース１４から取得する（ステップＳ４１）。これにより、アクションポリシーは「環境変化」と「動作パラメータ」とを有するので、動作パラメータ提案部１５は、ロボット２に適用される変更する動作パラメータを取得する。次に、動作パラメータ提案部１５は、取得されたアクションポリシーをアクションポリシーのスコアの降順に並び替えを行う（ステップＳ４２）。 As shown in FIG. 9, the operation parameter suggestion unit 15 acquires an action policy including the relevant "environmental change" from the action policy database 14 (step S41). As a result, since the action policy has an "environmental change" and an "operation parameter", the operation parameter suggestion unit 15 acquires the operation parameters to be changed and applied to the robot 2. Next, the operation parameter suggestion unit 15 sorts the acquired action policies in descending order of the action policy scores (step S42).

　次に、関連度が閾値以上の全てのアクションポリシーについて、「環境変化」を理由に「変更内容」を適用するかをユーザに質問（提案）し（ステップＳ４３）、質問に対する回答が「ＹＥＳ」であるかを判断する（ステップＳ４４）。 Next, for all action policies whose relevance is equal to or greater than the threshold, the user is asked (suggested) whether to apply the "changes" due to the "environmental changes" (step S43), and it is determined whether the answer to the question is "YES" (step S44).

　回答が「ＹＥＳ」の場合（ステップＳ４４のＹＥＳ）、「変更内容」と「環境変化」との組み合わせの採用回数を増加し（ステップＳ４５）、「変更内容」を動作パラメータ変更部１２に送信して（ステップＳ４６）、処理を終了する。 If the answer is "YES" (YES in step S44), the number of times the combination of "change content" and "environmental change" is adopted is increased (step S45), the "change content" is sent to the operation parameter change unit 12 (step S46), and the process ends.

　回答が「ＮＯ」の場合（ステップＳ４４のＮＯ）、「変更内容」と「環境変化」との不採用回数を増加し（ステップＳ４７）、処理を終了する。 If the answer is "NO" (NO in step S44), the number of times "Changes" and "Environmental changes" are rejected is increased (step S47), and the process ends.

　　＜２．４．アクションポリシーパターン化部３２の動作＞
　図１０は、実施の形態に係るアクションポリシーパターン化部３２の動作を説明するためのフローチャートである。 <2.4. Operation of the action policy pattern generator 32>
FIG. 10 is a flowchart for explaining the operation of the action policy patterning unit 32 according to the embodiment.

　図１０に示すように、アクションポリシーパターン化部３２はパターンを取得する（ステップＳ５１）。そして、取得した各パターンについて、ステップＳ５２及びステップＳ５３の処理を行う。ステップＳ５２では、所属するユーザのアクションポリシー実績を取得する。各ユーザのアクションポリシーには、各ユーザのロボット２から送られてきた「採用回数」、「不採用回数」のデータが紐づけられている。アクションポリシーパターン化部３２は、「採用回数」、「不採用回数」のデータをアクションポリシー実績として使用する。ステップＳ５３では、共通部分を抽出して新たなパターンとする。 As shown in FIG. 10, the action policy patterning unit 32 acquires a pattern (step S51). Then, for each acquired pattern, steps S52 and S53 are performed. In step S52, the action policy track record of the user is acquired. The action policy of each user is linked to the data of "number of times adopted" and "number of times rejected" sent from the robot 2 of each user. The action policy patterning unit 32 uses the data of "number of times adopted" and "number of times rejected" as the action policy track record. In step S53, the common part is extracted to create a new pattern.

　次に、アクションポリシーパターン化部３２は、ユーザデータベース３３からユーザを取得し（ステップＳ５４）、各ユーザについて、ステップＳ５５～Ｓ５７の処理を行う。 Next, the action policy patterning unit 32 obtains users from the user database 33 (step S54) and performs steps S55 to S57 for each user.

　ステップＳ５５では、アクションポリシー実績をアクションポリシーデータベース１４から取得する。ステップＳ５６では、どのパターンの集合として表現するとアクションポリシー実績との合致度が高いかを計算する。ステップＳ５７では、最も合致度が高い集合を新たな所属パターンとして処理を終了する。 In step S55, the action policy results are obtained from the action policy database 14. In step S56, a calculation is made as to which set of patterns the action policy results match best when expressed as a set. In step S57, the set with the highest match is treated as the new pattern to which the action policy results belong, and the process ends.

　　＜２．５．アクションポリシースコア推定部３５の動作＞
　図１１は、実施の形態に係るアクションポリシースコア推定部３５の動作を説明するためのフローチャートである。 <2.5. Operation of the action policy score estimation unit 35>
FIG. 11 is a flowchart for explaining the operation of the action policy score estimation unit 35 according to the embodiment.

　図１１に示すように、アクションポリシースコア推定部３５はユーザデータベース３３からユーザを取得し（ステップＳ６１）、各ユーザについて、ステップＳ６２～Ｓ６４の処理を行う。 As shown in FIG. 11, the action policy score estimation unit 35 acquires users from the user database 33 (step S61) and performs the processes of steps S62 to S64 for each user.

　ステップＳ６２では、アクションポリシーのスコアを初期化する。ステップＳ６３では、ユーザの所属パターンを取得する。ステップＳ６４では、各所属パターンについて、所属パターンに含まれるアクションポリシーのスコアを増加する。 In step S62, the score of the action policy is initialized. In step S63, the user's belonging pattern is obtained. In step S64, for each belonging pattern, the score of the action policy included in the belonging pattern is increased.

　そして、アクションポリシーのスコアをユーザのロボット２のアクションポリシーデータベース１４に出力し（ステップＳ６５）、処理を終了する。 Then, the action policy score is output to the action policy database 14 of the user's robot 2 (step S65), and the process ends.

　＜３．効果＞
　従って、実施の形態に係る情報処理システム１によれば、環境変化に合わせて動作パラメータを提案することで、ロボット２が常に状況に適した動作となり効率性、快適性が向上する。 <3. Effects>
Therefore, according to the information processing system 1 of the embodiment, by proposing operation parameters in accordance with environmental changes, the robot 2 always operates in a manner appropriate to the situation, thereby improving efficiency and comfort.

　また、実施の形態に係る情報処理システム１によれば、ユーザの特性に応じた提案を行うことで、ユーザの快適性が損なわれない。 In addition, the information processing system 1 according to the embodiment makes suggestions based on the characteristics of the user, so the user's comfort is not compromised.

　＜４．ハードウェア構成＞
　図１２は、実施の形態に係る情報処理システム１のロボット２及びサーバ３の演算部を実現するコンピュータの一例を示すハードウェア構成図である。上述してきた実施の形態に係るロボット２及びサーバ３の演算装置を実現するコンピュータは、例えば図１２に示すような構成のコンピュータ１０００によって実現される。図１２は、演算装置の機能を実現するコンピュータ１０００の一例を示すハードウェア構成図である。コンピュータ１０００は、ＣＰＵ１１００、ＲＡＭ１２００、ＲＯＭ（Ｒｅａｄ　Ｏｎｌｙ　Ｍｅｍｏｒｙ）１３００、ＨＤＤ（Ｈａｒｄ　Ｄｉｓｋ　Ｄｒｉｖｅ）１４００、通信インターフェイス１５００、及び入出力インターフェイス１６００を有する。コンピュータ１０００の各部は、バス１０５０によって接続される。 4. Hardware Configuration
Fig. 12 is a hardware configuration diagram showing an example of a computer that realizes the calculation unit of the robot 2 and the server 3 of the information processing system 1 according to the embodiment. The computer that realizes the calculation device of the robot 2 and the server 3 according to the embodiment described above is realized by a computer 1000 having a configuration as shown in Fig. 12, for example. Fig. 12 is a hardware configuration diagram showing an example of the computer 1000 that realizes the function of the calculation device. The computer 1000 has a CPU 1100, a RAM 1200, a ROM (Read Only Memory) 1300, a HDD (Hard Disk Drive) 1400, a communication interface 1500, and an input/output interface 1600. Each part of the computer 1000 is connected by a bus 1050.

　ＣＰＵ１１００は、ＲＯＭ１３００又はＨＤＤ１４００に格納されたプログラムに基づいて動作し、各部の制御を行う。例えば、ＣＰＵ１１００は、ＲＯＭ１３００又はＨＤＤ１４００に格納されたプログラムをＲＡＭ１２００に展開し、各種プログラムに対応した処理を実行する。 The CPU 1100 operates based on the programs stored in the ROM 1300 or the HDD 1400 and controls each part. For example, the CPU 1100 loads the programs stored in the ROM 1300 or the HDD 1400 into the RAM 1200 and executes processes corresponding to the various programs.

　ＲＯＭ１３００は、コンピュータ１０００の起動時にＣＰＵ１１００によって実行されるＢＩＯＳ（Ｂａｓｉｃ　Ｉｎｐｕｔ　Ｏｕｔｐｕｔ　Ｓｙｓｔｅｍ）等のブートプログラムや、コンピュータ１０００のハードウェアに依存するプログラム等を格納する。 The ROM 1300 stores boot programs such as the BIOS (Basic Input Output System) that is executed by the CPU 1100 when the computer 1000 starts up, as well as programs that depend on the hardware of the computer 1000.

　ＨＤＤ１４００は、ＣＰＵ１１００によって実行されるプログラム、及び、かかるプログラムによって使用されるデータ等を非一時的に記録する、コンピュータが読み取り可能な記録媒体である。具体的には、ＨＤＤ１４００は、プログラムデータ１４５０の一例である本開示に係るアプリケーションプログラムを記録する記録媒体である。 HDD 1400 is a computer-readable recording medium that non-temporarily records programs executed by CPU 1100 and data used by such programs. Specifically, HDD 1400 is a recording medium that records application programs related to the present disclosure, which are an example of program data 1450.

　通信インターフェイス１５００は、コンピュータ１０００が外部ネットワーク１５５０（例えばインターネット）と接続するためのインターフェイスである。例えば、ＣＰＵ１１００は、通信インターフェイス１５００を介して、他の機器からデータを受信したり、ＣＰＵ１１００が生成したデータを他の機器へ送信したりする。 The communication interface 1500 is an interface for connecting the computer 1000 to an external network 1550 (e.g., the Internet). For example, the CPU 1100 receives data from other devices and transmits data generated by the CPU 1100 to other devices via the communication interface 1500.

　入出力インターフェイス１６００は、入出力デバイス１６５０とコンピュータ１０００とを接続するためのインターフェイスである。例えば、ＣＰＵ１１００は、入出力インターフェイス１６００を介して、キーボードやマウス等の入力デバイスからデータを受信する。また、ＣＰＵ１１００は、入出力インターフェイス１６００を介して、ディスプレイやスピーカーやプリンタ等の出力デバイスにデータを送信する。また、入出力インターフェイス１６００は、所定の記録媒体（メディア）に記録されたプログラム等を読み取るメディアインターフェイスとして機能してもよい。メディアとは、例えばＤＶＤ（Ｄｉｇｉｔａｌ　Ｖｅｒｓａｔｉｌｅ　Ｄｉｓｃ）、ＰＤ（Ｐｈａｓｅ　ｃｈａｎｇｅ　ｒｅｗｒｉｔａｂｌｅ　Ｄｉｓｋ）等の光学記録媒体、ＭＯ（Ｍａｇｎｅｔｏ－Ｏｐｔｉｃａｌ　ｄｉｓｋ）等の光磁気記録媒体、テープ媒体、磁気記録媒体、又は半導体メモリ等である。 The input/output interface 1600 is an interface for connecting the input/output device 1650 and the computer 1000. For example, the CPU 1100 receives data from an input device such as a keyboard or a mouse via the input/output interface 1600. The CPU 1100 also transmits data to an output device such as a display, a speaker, or a printer via the input/output interface 1600. The input/output interface 1600 may also function as a media interface that reads programs and the like recorded on a specific recording medium. Examples of media include optical recording media such as DVDs (Digital Versatile Discs) and PDs (Phase change rewritable Discs), magneto-optical recording media such as MOs (Magneto-Optical Disks), tape media, magnetic recording media, and semiconductor memories.

　ＨＤＤ１４００には、本開示に係るプログラムや、データが格納される。なお、ＣＰＵ１１００は、プログラムデータ１４５０をＨＤＤ１４００から読み取って実行するが、他の例として、外部ネットワーク１５５０を介して、他の装置からこれらのプログラムを取得してもよい。 The HDD 1400 stores the programs and data related to the present disclosure. The CPU 1100 reads and executes the program data 1450 from the HDD 1400, but as another example, the CPU 1100 may obtain these programs from other devices via the external network 1550.

　以上、添付図面を参照しながら本開示の好適な実施の形態について詳細に説明したが、本開示の技術的範囲はかかる例に限定されない。本開示の技術分野における通常の知識を有する者であれば、請求の範囲に記載された技術的思想の範疇内において、各種の変更例又は修正例に想到し得ることは明らかであり、これらについても、当然に本開示の技術的範囲に属するものと了解される。 The above describes in detail preferred embodiments of the present disclosure with reference to the attached drawings, but the technical scope of the present disclosure is not limited to such examples. It is clear that a person with ordinary knowledge in the technical field of the present disclosure can conceive of various modified or revised examples within the scope of the technical ideas described in the claims, and it is understood that these also naturally fall within the technical scope of the present disclosure.

　また、本明細書に記載された効果は、あくまで説明的又は例示的なものであって限定的ではない。つまり、本開示に係る技術は、上記の効果とともに、又は上記の効果に代えて、本明細書の記載から当業者には明らかな他の効果を奏しうる。 Furthermore, the effects described in this specification are merely descriptive or exemplary and are not limiting. In other words, the technology disclosed herein may achieve other effects that are apparent to a person skilled in the art from the description in this specification, in addition to or in place of the above effects.

　なお、本技術は以下のような構成も取ることができる。
（１）
　環境変化がある場合に、ロボットに適用される動作パラメータを取得し、
　前記取得した前記動作パラメータをユーザに提案し、
　前記提案した前記動作パラメータの提案が受け付けられた場合に、前記提案された前記動作パラメータを前記ロボットに適用する
情報処理方法。
（２）
　前記ユーザのアクションポリシーは、前記環境変化と前記動作パラメータとを有する
（１）に記載の情報処理方法。
（３）
　前記ロボットの動作パラメータを取得することは、
　前記ユーザの前記アクションポリシーが所属するアクションポリシーのパターンの動作パラメータを取得する
（１）又は（２）に記載の情報処理方法。
（４）
　前記ユーザへの提案は、
　前記ユーザのアクションポリシーとの関連度が閾値以上のパターンの動作パラメータについて行われる
（１）に記載の情報処理方法。
（５）
　前記提案した前記動作パラメータの提案が受け付けられた場合に、前記関連度を増加することを有する
（４）に記載の情報処理方法。
（６）
　前記提案した前記動作パラメータの提案が受け付けられない場合に、前記関連度を減少する
（４）に記載の情報処理方法。
（７）
　前記ユーザの前記アクションポリシーが所属するアクションポリシーのパターンは、前記アクションポリシーのスコアを有し、
　前記取得した前記動作パラメータをユーザに提案することは、
　前記アクションポリシーのスコアの降順に前記取得した前記動作パラメータをユーザに提案する
（３）に記載の情報処理方法。
（８）
　前記環境変化を観測すること
を有する（２）に記載の情報処理方法。
（９）
　前記環境変化が観測された場合、前記動作パラメータの変更が前記環境変化が原因なのかを前記ユーザに質問し、
　前記動作パラメータの変更が前記環境変化によるものである場合、前記ユーザのアクションポリシーの前記環境変化を前記原因となる前記環境変化に更新すること
を有する（８）に記載の情報処理方法。
（１０）
　前記更新された前記ユーザのアクションポリシーが所属するアクションポリシーのパターンは、アクションポリシーのスコアを有し、
　前記ユーザへの前記質問は、前記アクションポリシーのスコアの降順に行われる
（９）に記載の情報処理方法。
（１１）
　前記ユーザの前記アクションポリシーが所属するアクションポリシーのパターンは、前記ユーザの前記アクションポリシーと前記パターンの前記アクションポリシーとの最も合致度が高いパターンである
（３）又は（７）に記載の情報処理方法。
（１２）
　環境変化がある場合に、ロボットに適用される動作パラメータを取得する所得部と、
　前記取得した前記動作パラメータをユーザに提案する提案部と、
　前記提案した前記動作パラメータの提案が受け付けられた場合に、前記提案された前記動作パラメータを前記ロボットに適用する適用部と
を有する情報処理装置。
（１３）
　コンピュータに、
　環境変化がある場合に、ロボットに適用される動作パラメータを取得させ、
　前記取得した前記動作パラメータをユーザに提案させ、
　前記提案した前記動作パラメータの提案が受け付けられた場合に、前記提案された前記動作パラメータを前記ロボットに適用させる
プログラム。 The present technology can also be configured as follows.
(1)
Obtaining the operating parameters applied to the robot when there is an environmental change;
Suggesting the obtained operating parameters to a user;
An information processing method for applying the proposed operation parameters to the robot when the proposed operation parameters are accepted.
(2)
The information processing method according to (1), wherein the user's action policy includes the environmental change and the operation parameters.
(3)
Obtaining the motion parameters of the robot includes:
The information processing method according to (1) or (2), further comprising obtaining operation parameters of a pattern of an action policy to which the action policy of the user belongs.
(4)
The proposal to the user is:
The information processing method according to (1), wherein the processing is performed for operation parameters of a pattern having a degree of relevance to the user's action policy equal to or greater than a threshold value.
(5)
The information processing method according to (4), further comprising increasing the degree of association when the proposed operation parameter is accepted.
(6)
The information processing method according to (4), further comprising decreasing the degree of association when the proposed operation parameter is not accepted.
(7)
A pattern of action policies to which the action policy of the user belongs has a score of the action policy;
Proposing the obtained operating parameters to a user includes:
The information processing method according to (3), further comprising: suggesting the acquired operation parameters to a user in descending order of the scores of the action policies.
(8)
The information processing method according to (2), further comprising observing the environmental change.
(9)
If the environmental change is observed, asking the user whether the change in the operating parameter is due to the environmental change;
The information processing method according to (8), further comprising updating the environmental change in the user's action policy to the causative environmental change if the change in the operating parameter is due to the environmental change.
(10)
The pattern of action policies to which the updated user's action policy belongs has an action policy score;
The information processing method according to (9), wherein the questions to the user are asked in descending order of the scores of the action policies.
(11)
The information processing method according to any one of (3) to (7), wherein the pattern of action policies to which the action policy of the user belongs is a pattern having a highest degree of match between the action policy of the user and the action policy of the pattern.
(12)
An acquisition unit that acquires operation parameters to be applied to the robot when there is an environmental change;
a suggestion unit that suggests the acquired operation parameters to a user;
and an application unit that applies the proposed operation parameters to the robot when the proposal for the proposed operation parameters is accepted.
(13)
On the computer,
When there is an environmental change, the operating parameters applied to the robot are acquired,
having a user suggest the acquired operating parameters;
A program for applying the proposed operation parameters to the robot when the proposal for the proposed operation parameters is accepted.

　１　情報処理システム
　２　ロボット
　３　サーバ
　１１　ユーザＩ／Ｆ
　１２　動作パラメータ変更部
　１３　アクションポリシー確認部
　１４　アクションポリシーデータベース
　１５　動作パラメータ提案部
　１６　環境変化観測部
　１７　動作パラメータデータベース
　１８　制御部
　１９　アクチュエータ
　２０　センサ
　２１　認識部
　２２　環境情報データベース
　３１　アクションポリシー実績データベース
　３２　アクションポリシーパターン化部
　３３　ユーザデータベース
　３４　パターンデータベース
　３５　アクションポリシースコア推定部 Reference Signs List 1 Information processing system 2 Robot 3 Server 11 User I/F
REFERENCE SIGNS LIST 12 Operation parameter change unit 13 Action policy confirmation unit 14 Action policy database 15 Operation parameter proposal unit 16 Environmental change observation unit 17 Operation parameter database 18 Control unit 19 Actuator 20 Sensor 21 Recognition unit 22 Environmental information database 31 Action policy result database 32 Action policy patterning unit 33 User database 34 Pattern database 35 Action policy score estimation unit

Claims

Obtaining the operating parameters applied to the robot when there is an environmental change;
Suggesting the obtained operating parameters to a user;
An information processing method for applying the proposed operation parameters to the robot when the proposed operation parameters are accepted.

2. The information processing method according to claim 1, wherein the user's action policy includes the environmental changes and the operating parameters.

Obtaining the motion parameters of the robot includes:
3. The information processing method according to claim 2, further comprising acquiring operation parameters of a pattern of an action policy to which the action policy of the user belongs.

The proposal to the user is:
The information processing method according to claim 1 , wherein the method is performed for operation parameters of a pattern having a degree of association with the user's action policy equal to or greater than a threshold value.

The information processing method according to claim 4 , further comprising increasing the degree of association when the proposed operation parameter is accepted.

The information processing method according to claim 4 , further comprising the step of decreasing the degree of association when the proposed operation parameter is not accepted.

A pattern of action policies to which the action policy of the user belongs has a score of the action policy;
Proposing the obtained operating parameters to a user includes:
The information processing method according to claim 3 , wherein the acquired operation parameters are proposed to the user in descending order of the scores of the action policies.

The information processing method according to claim 2 , further comprising observing the environmental change.

If the environmental change is observed, asking the user whether the change in the operating parameter is due to the environmental change;
9. The information processing method according to claim 8, further comprising updating the environmental change in the user's action policy to the causative environmental change if the change in the operating parameter is due to the environmental change.

The pattern of action policies to which the updated user's action policy belongs has an action policy score;
The information processing method according to claim 9 , wherein the questions to the user are posed in descending order of the scores of the action policies.

4. The information processing method according to claim 3, wherein the pattern of action policies to which the action policy of the user belongs is a pattern having a highest degree of match between the action policy of the user and the action policy of the pattern.

An acquisition unit that acquires operation parameters to be applied to the robot when there is an environmental change;
a suggestion unit that suggests the acquired operation parameters to a user;
and an application unit that applies the proposed operation parameters to the robot when the proposal for the proposed operation parameters is accepted.

On the computer,
When there is an environmental change, the operating parameters applied to the robot are acquired,
having a user suggest the acquired operating parameters;
A program for applying the proposed operation parameters to the robot when the proposal for the proposed operation parameters is accepted.