
WO2025118687A1 - Dialogue processing method and apparatus, and electronic device - Google Patents

Dialogue processing method and apparatus, and electronic device

Info

Publication number
WO2025118687A1
Authority
WO
WIPO (PCT)
Prior art keywords
conversation
dialogue
user
card
interface
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/CN2024/113303
Other languages
French (fr)
Chinese (zh)
Inventor
王瑀
王宗波
郑江震
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd
Publication of WO2025118687A1

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481 Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • G06F3/0482 Interaction with lists of selectable items, e.g. menus
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/34 Browsing; Visualisation therefor
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/34 Browsing; Visualisation therefor
    • G06F16/345 Summarisation for human users
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484 Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/0485 Scrolling or panning
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/30 Semantic analysis

Definitions

  • the present application relates to the field of information technology (IT), and in particular to a dialogue processing method, a dialogue processing device, and an electronic device.
  • artificial intelligence systems with natural semantic understanding and natural conversation capabilities can be used on electronic devices, such as digital assistants or large language models. Users can have interactive conversations with this type of artificial intelligence system on electronic devices. However, during the conversation, users often need to review most of the historical conversation content to grasp the key points of the conversation, which is a poor experience.
  • the present application provides a conversation processing method, device, electronic device, computer storage medium and computer product, which can enable users to quickly grasp the key points of the conversation and improve the user experience.
  • the present application provides a method for processing a dialogue, including: displaying a first dialogue interface for a user to interact with an artificial intelligence system; and in response to a first operation triggered by the user, displaying a first dialogue shorthand card in the first dialogue interface, wherein the first dialogue shorthand card includes consensus information reached between the user and the artificial intelligence system in the first dialogue interface.
  • the user can quickly grasp the key points of the conversation through the dialogue shorthand card, thereby improving the user experience.
  • the first conversation shorthand card also includes: conversation status. In this way, the user can intuitively understand the conversation status, which improves the user experience.
  • the first operation includes one or more of the following: the user triggers an instruction, operation, or event for creating the first conversation shorthand card, or the conversation intention between the user and the artificial intelligence system is a preset intention.
  • the generation of conversation shorthand cards can be triggered in a variety of ways, enriching the generation methods of conversation shorthand cards.
  • the method further includes: marking the consensus information in the conversation content displayed on the first conversation interface, so as to help the user understand what information is recorded.
  • the method further includes: updating the consensus information included in the first conversation shorthand card when the consensus information in the conversation content displayed on the first conversation interface changes. In this way, the content in the conversation shorthand card can be kept up to date.
  • the method further includes: in response to the user's editing operation on the consensus information in the first conversation shorthand card, updating the consensus information contained in the first conversation shorthand card. In this way, the user can change the content in the conversation shorthand card based on his or her own needs, thereby improving the user experience.
  • the first conversation shorthand card also includes: an automated task control.
  • after displaying the first conversation shorthand card, the method further includes: in response to a trigger operation on the automated task control, displaying a task card in the first conversation interface, where the task card indicates the execution process of a task related to the consensus information; and after the task indicated by the task card is completed, displaying the task execution result in the first conversation shorthand card.
  • the method further includes: closing the first dialogue interface; in response to the user's operation of creating a new dialogue, displaying a second dialogue interface for the user to interact with the artificial intelligence system; and when the user enters at least one piece of consensus information in the second dialogue interface, displaying the first dialogue shorthand card in the second dialogue interface, where the first dialogue shorthand card is screened from the stored dialogue shorthand cards based on the consensus information in the second dialogue interface.
  • the first conversation shorthand card is displayed in the second conversation interface after the user agrees, in the second conversation interface, to continue the conversation from the first conversation interface.
  • the method further includes: when the user adds/modifies consensus information in the second conversation interface, the consensus information is added/modified in the first conversation shorthand card.
  • the content in the conversation shorthand card can be consistent with the user's latest conversation, thereby improving the user experience.
  • the method further includes: displaying a first prompt message in the first dialogue interface, the first prompt message being used to prompt that the first dialogue shorthand card has been created, so that the user can be informed that the dialogue shorthand card has been generated.
  • the present application provides a dialogue processing device, comprising: a first display module and a second display module.
  • the first display module is used to display a first dialogue interface for a user to interact with an artificial intelligence system.
  • the second display module is used to display a first dialogue shorthand card in the first dialogue interface in response to a first operation triggered by the user, wherein the first dialogue shorthand card includes: consensus information that the user and the artificial intelligence system have reached consensus on in the first dialogue interface.
  • the first operation includes one or more of the following: the user triggers an instruction, operation, or event for creating the first conversation shorthand card, or the conversation intention between the user and the artificial intelligence system is a preset intention.
  • the second display module is further used to: mark the consensus information in the conversation content displayed on the first conversation interface.
  • the second display module is further used to: update the consensus information included in the first conversation shorthand card when the consensus information in the conversation content displayed on the first conversation interface changes.
  • the second display module is further used to: in response to a user's editing operation on the consensus information in the first conversation shorthand card, update the consensus information included in the first conversation shorthand card.
  • the first dialogue shorthand card also includes: an automated task control.
  • the second display module is further used to: display a task card in the first dialogue interface in response to a trigger operation on the automated task control, the task card being used to indicate the execution process of a task related to the consensus information; after the task indicated by the task card is completed, display the task execution result in the first dialogue shorthand card.
  • the second display module is also used to close the first dialogue interface; in response to the user's operation of creating a new dialogue, display a second dialogue interface for the user to interact with the artificial intelligence system; when the user enters at least one consensus information in the second dialogue interface, display a first dialogue shorthand card in the second dialogue interface, the first dialogue shorthand card being screened from stored dialogue shorthand cards based on the consensus information in the second dialogue interface.
  • the first conversation shorthand card is displayed in the second conversation interface after the user agrees, in the second conversation interface, to continue the conversation from the first conversation interface.
  • the second display module is further used to: when the user adds/modifies the consensus information in the second dialogue interface, add/modify the consensus information in the first dialogue shorthand card.
  • the present application provides an electronic device, comprising: one or more processors; a memory.
  • the memory stores one or more computer programs, and the one or more computer programs include instructions.
  • the electronic device performs the method described in the first aspect or any possible implementation of the first aspect.
  • the present application provides a computer-readable storage medium, including computer program instructions, when the computer program instructions are executed by a computing device cluster, the computing device cluster executes the method described in the first aspect or any possible implementation of the first aspect.
  • the computing device cluster may include one or more computing devices.
  • the present application provides a computer program product including instructions, which, when executed by a computing device cluster, enables the computing device cluster to perform the method described in the first aspect or any possible implementation of the first aspect.
  • the computing device cluster may include one or more computing devices.
  • FIG. 1 is a schematic diagram of an application scenario provided by an embodiment of the present application.
  • FIG. 2 is a flow chart of a method for processing a conversation provided in an embodiment of the present application.
  • FIG. 3 is a schematic diagram of interface changes on an electronic device provided in an embodiment of the present application.
  • FIG. 4 is a schematic diagram of interface changes on an electronic device provided in an embodiment of the present application.
  • FIG. 5 is a schematic diagram of interface changes on an electronic device provided in an embodiment of the present application.
  • FIG. 6 is a schematic diagram of interface changes on an electronic device provided in an embodiment of the present application.
  • FIG. 7 is a schematic diagram of interface changes on an electronic device provided in an embodiment of the present application.
  • FIG. 8 is a schematic diagram of interface changes on an electronic device provided in an embodiment of the present application.
  • FIG. 9 is a schematic diagram of interface changes on an electronic device provided in an embodiment of the present application.
  • FIG. 10 is a schematic diagram of interface changes on an electronic device provided in an embodiment of the present application.
  • FIG. 11 is a schematic diagram of interface changes on an electronic device provided in an embodiment of the present application.
  • FIG. 12 is a schematic diagram of the structure of a conversation processing device provided in an embodiment of the present application.
  • FIG. 13 is a schematic diagram of the structure of an electronic device provided in an embodiment of the present application.
  • "A and/or B" in this document describes an association relationship between associated objects and indicates that three relationships are possible.
  • for example, "A and/or B" can mean: A exists alone, both A and B exist, or B exists alone.
  • the symbol "/" in this document indicates an "or" relationship between the associated objects; for example, A/B means A or B.
  • the terms "first" and "second" in the specification and claims are used to distinguish different objects rather than to describe a specific order of the objects.
  • for example, a first response message and a second response message are used to distinguish different response messages rather than to describe a specific order of the response messages.
  • words such as “exemplary” or “for example” are used to indicate examples, illustrations or descriptions. Any embodiment or design described as “exemplary” or “for example” in the embodiments of the present application should not be interpreted as being more preferred or more advantageous than other embodiments or designs. Specifically, the use of words such as “exemplary” or “for example” is intended to present related concepts in a specific way.
  • "multiple" means two or more.
  • for example, multiple processing units means two or more processing units, and multiple elements means two or more elements.
  • the artificial intelligence system can obtain conversation information after detecting a preset input input by the user, and extract the consensus information reached between the user and the artificial intelligence system from the conversation information, and fill the consensus information into a conversation shorthand card, and display the conversation shorthand card on the conversation interface.
  • the user can quickly grasp the key points of the conversation through the conversation shorthand card, thereby improving the user experience.
  • the artificial intelligence system can be deployed on an electronic device in the form of, but not limited to, a client; the client can be a desktop application, a mobile application, a Web application, or another Web-based application, etc.
  • Figure 1 shows a schematic diagram of an application scenario provided by an embodiment of the present application.
  • as shown in FIG. 1, user A is using an electronic device 100 and is interacting with an artificial intelligence system on the electronic device 100.
  • FIG2 shows a flow chart of a method for processing a conversation provided in an embodiment of the present application.
  • the method may be, but is not limited to, applied to a client.
  • the client may be, but is not limited to, related to an artificial intelligence system with natural semantic understanding and natural conversation capabilities.
  • the method for processing a conversation may include the following steps:
  • S201 Display a first dialogue interface for a user to interact with an artificial intelligence system.
  • the user can enter the dialogue interface of the artificial intelligence system by clicking on the application icon or web page of the client.
  • the client can display the first dialogue interface for the user to interact with the artificial intelligence system.
  • the user can have a dialogue with the artificial intelligence system in the first dialogue interface.
  • the user can also start the client through voice interaction to interact with the artificial intelligence system.
  • S202 In response to a first operation triggered by the user, display a first dialogue shorthand card in the first dialogue interface, where the first dialogue shorthand card includes: consensus information reached by the user and the artificial intelligence system in the first dialogue interface.
  • the user can interact with the artificial intelligence system, such as voice interaction, text interaction, etc.
  • the user can trigger certain instructions, operations, or events on the first dialogue interface, or express certain dialogue intentions there.
  • the client can display the first dialogue shorthand card in the first dialogue interface in response to the operation triggered by the user.
  • the first dialogue shorthand card includes: consensus information that the user and the artificial intelligence system have reached a consensus on in the first dialogue interface.
  • the first operation triggered by the user can be understood as an operation for creating a dialogue shorthand card.
  • the first operation in S202 may include one or more of the following:
  • the creation of the conversation shorthand card can be triggered by a preset creation statement.
  • the creation statement can be a system default setting or a user-defined one.
  • for example, the creation statement may be "Hey, listen", "Help me remember", etc.
  • as shown in FIG. 3(A), a conversation interface 31 is displayed on the electronic device 100, and the user can enter "Help me remember" in the conversation interface 31 to trigger the creation of a conversation shorthand card.
  • as shown in FIG. 3(B), the conversation shorthand card 32 can be displayed in the conversation interface 31.
  • the consensus information reached by the user and the artificial intelligence system is "record content", so this consensus information can be displayed in the conversation shorthand card 32.
  • prompt information (for example, the content shown in area 33 in FIG. 3(B)) can also be displayed in the conversation interface 31, and the prompt information is used to indicate that the conversation shorthand card has been created.
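The creation-statement trigger described above can be sketched as follows. This is a minimal illustration, not the applicant's implementation: the function name, the prefix-matching rule, and the normalization are assumptions; only the two example statements come from the text.

```python
# Default creation statements taken from the examples in the text.
DEFAULT_CREATION_STATEMENTS = ("hey, listen", "help me remember")


def is_creation_statement(user_input: str, custom_statements: tuple[str, ...] = ()) -> bool:
    """Return True if the input starts with a default or user-defined creation statement.

    Prefix matching on lowercased text is an assumption made for this sketch.
    """
    normalized = user_input.strip().lower()
    statements = DEFAULT_CREATION_STATEMENTS + tuple(s.lower() for s in custom_statements)
    return any(normalized.startswith(statement) for statement in statements)
```

A client could call `is_creation_statement` on each user input and start card creation when it returns `True`; user-defined statements are passed through `custom_statements`.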
  • the artificial intelligence system can analyze the received dialogue intention.
  • if the dialogue intention is a preset type of dialogue intention, the creation of a dialogue shorthand card can be triggered.
  • the preset type of dialogue intention can be, but is not limited to, a task intention that requires multiple steps and is carried out continuously, such as ordering food, scheduling, meetings, writing emails, etc.
  • a dialogue interface 41 is displayed on the electronic device 100, and the user can enter "Please help me book a nearby Sichuan restaurant" in the dialogue interface 41.
  • the user's dialogue input clearly indicates the need to book a restaurant.
  • a dialogue shorthand card 42 can be displayed in the dialogue interface 41.
  • the consensus information reached by the user and the artificial intelligence system is "book a restaurant", "Sichuan cuisine" and "near home", so this consensus information can be displayed in the dialogue shorthand card 42.
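The intent-based trigger above might look like the sketch below. The intent labels, the function names, and the keyword classifier are all hypothetical; in the described system the intent analysis is performed by the artificial intelligence system itself, and the toy classifier only stands in for it.

```python
from typing import Callable, Optional

# Assumed labels for the preset intent types named in the text
# (ordering food, scheduling, meetings, writing emails).
PRESET_INTENTS = {"order_food", "schedule", "meeting", "write_email"}


def should_create_card(user_input: str,
                       classify_intent: Callable[[str], Optional[str]]) -> bool:
    """Trigger card creation only when the recognized intent is a preset type."""
    return classify_intent(user_input) in PRESET_INTENTS


def keyword_classifier(text: str) -> Optional[str]:
    """Toy stand-in for the AI system's intent analysis (not a real model)."""
    lowered = text.lower()
    if "book" in lowered and "restaurant" in lowered:
        return "order_food"
    if "email" in lowered:
        return "write_email"
    return None
```

Passing the classifier in as a parameter keeps the trigger rule separate from whatever model actually recognizes the intent.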
  • the artificial intelligence system can display a card creation invitation on the conversation interface, and if the user confirms to accept the invitation through operation, a conversation shorthand card is created.
  • the card creation invitation can be displayed on the conversation interface in the form of a conversation inquiry, and correspondingly, the user's confirmation operation can be a conversation input, and the conversation input can be text input by keyboard or voice input or voice-to-text input, etc.
  • the card creation invitation can also be a pop-up window program, and correspondingly, the user's confirmation operation can be an interaction with the control in the pop-up window, or voice input, etc.
  • a conversation interface 51 is displayed on the electronic device 100.
  • if the artificial intelligence system has not received the user's conversation input for a period of time, it can send the user an invitation to create a conversation shorthand card, that is, "Do you need to book a Sichuan restaurant for you?" shown in area 52.
  • when the artificial intelligence system receives the user's conversation input "OK", the creation of the conversation shorthand card is triggered.
  • a conversation shorthand card 53 can be displayed in the conversation interface 51.
  • the consensus information reached between the user and the artificial intelligence system is "book a restaurant", "Sichuan cuisine" and "near home", so this consensus information can be displayed in the dialogue shorthand card 53.
  • the artificial intelligence system can also analyze the content of the conversation first and identify at least one user intent based on the content of the conversation. Then, a card creation invitation is sent based on the user intent, and a card creation invitation is only sent when there is a preset type of user intent.
  • the main difference between this method and the second method is that the identification of user intent here is more speculative, that is, the user does not express such intent, but it is inferred that the user has such intent based on the content of the conversation; while in the second method, the user expresses a clear intent.
  • a hidden menu may be provided in the dialogue interface. After the user calls out the hidden menu, the user may select a control in the menu for creating a dialogue shorthand card.
  • the hidden menu may also be replaced by a non-hidden menu.
  • as shown in FIG. 6(A), a dialogue interface 61 is displayed on the electronic device 100.
  • the user may call out a menu 62 and select a control 63 for creating a dialogue shorthand card.
  • a dialogue shorthand card 64 may then be displayed in the dialogue interface 61.
  • the operation for creating a conversation shorthand card can be a preset operation.
  • the operation can be a gesture operation of sliding up from the bottom of the conversation interface; on a computer, the operation can be clicking the back button, closing the page, etc.
  • the operation can also reuse other operations, such as: reusing the gesture of exiting the conversation interface, etc.
  • a conversation interface 71 is displayed on the electronic device 100.
  • the creation of a conversation shorthand card is triggered.
  • a conversation shorthand card 72 can be displayed in the conversation interface 71.
  • the exit of the conversation interface can be interrupted first, and then exit after the creation of the conversation shorthand card is completed; or the exit operation of the conversation interface is directly terminated, and only the creation of the conversation shorthand card is executed, and the conversation interface is exited after the user performs the swipe up exit again.
  • the user's operation on the electronic device can be transmitted to the artificial intelligence system through the electronic device.
  • the preset event may be an event of a preset type. For example, this event may be exiting the dialogue interface; or the artificial intelligence system being switched to the background, and so on.
  • a dialogue interface 81 is displayed on the electronic device 100.
  • the user slides a finger on the navigation bar at the bottom of the screen. This operation triggers application switching: the previous/next application is brought to the foreground, and the artificial intelligence system is switched to the background, which triggers the dialogue interface closing event.
  • when the dialogue interface closing event is detected, the dialogue interface closing operation is first interrupted so that a dialogue shorthand card can be created.
  • a dialogue shorthand card 82 may be displayed in the dialogue interface 81.
  • the dialogue interface closing operation may then be performed, and the device switches to the next application.
  • the event triggered by the user on the electronic device may be transmitted to the artificial intelligence system via the electronic device.
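The event handling described above (interrupt the close, create the card, then close) can be sketched as follows. The class, its attributes, and the handler name are hypothetical and only model the ordering of the steps, not any real UI framework.

```python
class DialogueInterface:
    """Minimal stand-in for a dialogue interface; all names are hypothetical."""

    def __init__(self) -> None:
        self.card_created = False
        self.closed = False
        self.actions: list[str] = []  # records the order of operations

    def create_shorthand_card(self) -> None:
        self.card_created = True
        self.actions.append("create_card")

    def close(self) -> None:
        self.closed = True
        self.actions.append("close")


def handle_close_event(interface: DialogueInterface) -> None:
    """On a close event, create the card first, then complete the close."""
    if not interface.card_created:
        interface.create_shorthand_card()
    interface.close()
```

This models the variant in which the exit is resumed once the card exists; the alternative described for the swipe-up gesture, where the exit is cancelled until the user repeats it, would simply omit the final `close()` call.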
  • the artificial intelligence system may first obtain the conversation content between the user and the artificial intelligence system in the first dialogue interface. Then, the obtained dialogue content is analyzed and the consensus information reached between the two is extracted. Next, the extracted consensus information is filled into the data template to generate a dialogue shorthand card. Finally, the client can display the dialogue shorthand card in the dialogue interface.
  • the consensus information may come from the user's answer, that is, it is identified from the user's answer, or it may be the information that the artificial intelligence system repeats to the user in a preset sentence pattern after the user answers. For example, the "XXXX, understand" sentence pattern in Figures 7 and 8 is used for such repetition, and the "XXXX" here is the consensus information.
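The generation steps and the "XXXX, understand" pattern above can be illustrated with a small sketch. The regular expression, the `(speaker, text)` turn format, and the `ShorthandCard` template are assumptions made for illustration; the source does not specify how the pattern is matched or what the data template contains.

```python
import re
from dataclasses import dataclass, field

# Assumed rendering of the "XXXX, understand" confirmation sentence pattern.
CONFIRMATION_PATTERN = re.compile(r"^(?P<info>.+?),\s*understand\.?$", re.IGNORECASE)


@dataclass
class ShorthandCard:
    """Hypothetical data template for a dialogue shorthand card."""
    title: str
    consensus: list[str] = field(default_factory=list)


def extract_consensus(turns: list[tuple[str, str]]) -> list[str]:
    """Collect the "XXXX" part of each confirmation repeated by the assistant."""
    found = []
    for speaker, text in turns:
        if speaker != "assistant":
            continue
        match = CONFIRMATION_PATTERN.match(text.strip())
        if match:
            found.append(match.group("info").strip())
    return found


def build_card(title: str, turns: list[tuple[str, str]]) -> ShorthandCard:
    """Fill the extracted consensus information into the card template."""
    return ShorthandCard(title=title, consensus=extract_consensus(turns))
```

Here only the assistant's confirmation statements are scanned; identifying consensus directly from the user's answers, which the text also allows, would need the AI system's own language understanding.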
  • the user can initiate at least one topic.
  • the user can first initiate a topic of ordering meals to the artificial intelligence system, and then initiate a topic of booking a room to the artificial intelligence system, and so on.
  • in this case, the conversation topic at the current time can be confirmed first, and then the consensus information can be extracted from the conversation content of each Topic related to that conversation topic.
  • the data in the dialogue context can be stored, for example locally or in the cloud, using a preset data structure. The data structure can be as follows:
  • Context: a context data structure that stores the context information contained in a conversation.
  • Conversational_Name: used to display the name of the saved conversation.
  • Conversational_ID: the unique ID of the conversation.
  • Conversational_LastUpdateTime: used to display the last update time of the saved conversation.
  • Conversational_CreateTime: used to record the creation time of the conversation.
  • Conversational_Intent: marks the main intent type of the conversation.
  • Conversational_Goal: marks the goal of the conversation.
  • Conversational_Consensus: records the conversation status of the current conversation: under discussion, completed, or abandoned.
  • Topic: a topic data structure used to store contextual information about a topic in a conversation.
  • a round of conversation between a user and the artificial intelligence system (usually one question and one answer) corresponds to one Topic.
  • Topic_Name: used to display the saved topic name.
  • Topic_LastUpdateTime: used to display the time when the topic was last updated.
  • Topic_CreateTime: used to record the creation time of the topic.
  • Topic_Intent: marks the intention of the topic.
  • Topic_Entity: records the entity parameter list corresponding to the topic intent; for example, when Topic_Name is "number of people", Topic_Entity can be "4 people".
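The Context and Topic structures listed above could be modelled roughly as follows. The field names mirror the source in snake_case; the Python types, the default values, and the `topics` list linking a Context to its Topics are assumptions, since the source does not state how the two structures are connected.

```python
from dataclasses import dataclass, field
from datetime import datetime
from enum import Enum


class ConversationStatus(Enum):
    """The three conversation states named for Conversational_Consensus."""
    UNDER_DISCUSSION = "under discussion"
    COMPLETED = "completed"
    ABANDONED = "abandoned"


@dataclass
class Topic:
    topic_name: str                    # Topic_Name, e.g. "number of people"
    topic_intent: str                  # Topic_Intent
    topic_entity: list[str] = field(default_factory=list)  # Topic_Entity, e.g. ["4 people"]
    topic_create_time: datetime = field(default_factory=datetime.now)
    topic_last_update_time: datetime = field(default_factory=datetime.now)


@dataclass
class Context:
    conversational_id: str             # Conversational_ID
    conversational_name: str           # Conversational_Name
    conversational_intent: str         # Conversational_Intent
    conversational_goal: str           # Conversational_Goal
    conversational_consensus: ConversationStatus = ConversationStatus.UNDER_DISCUSSION
    conversational_create_time: datetime = field(default_factory=datetime.now)
    conversational_last_update_time: datetime = field(default_factory=datetime.now)
    topics: list[Topic] = field(default_factory=list)  # assumed link, not in the source
```

For instance, a dinner-booking conversation would be one `Context` whose `topics` list gains one `Topic` per question-and-answer round.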
  • the artificial intelligence system can identify the topic name and intention in each round of dialogue, and thus can identify the conversation topic of the current topic based on at least one of the topic name and intention.
  • the artificial intelligence system can determine the relevance of the conversation topics of different topics (Topic), thereby determining a group of related topics, which are all related matters about the same topic.
  • Figures 7 and 8 contain multiple topics related to ordering, so that the current conversation topic is ordering, and then the consensus information is identified from the conversation corresponding to the current conversation topic, and the conversation shorthand card can be obtained.
  • the key-value pair structure can be used to store consensus information in each topic data structure.
  • the key parameter is "Topic_Name" and the value variable is "Topic_Entity".
  • the latest value variable can be used to replace the previous value variable. For example, in the meal-ordering example, if the consensus information is "the number of people dining is 4", then the key parameter of the consensus information is "number of diners", and the value variable is "4 people" or "number of people: 4 people". If the user then wants to modify it and says "help me change the number of people to 6 people", the value variable of the consensus information becomes "6 people" or "number of people: 6 people", and the key parameter remains unchanged.
  • a key-value pair check can be performed on new consensus information to determine whether consensus information with the same key parameter already exists. If it does, the value in the original key-value pair is overwritten with the new value, so that the content in the conversation shorthand card is kept up to date. For example, as shown in (C) of Figure 8, the user's latest conversation input is "changed to a family dinner with elderly and children"; the consensus information obtained from this round of conversation input is "family dinner, with elderly and children", and accordingly, the nature of the dinner is modified to "family dinner, with elderly and children". Besides this automatic updating, the user can also directly edit the content in the conversation shorthand card.
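The overwrite rule above amounts to an upsert on a key-value store. A minimal sketch, assuming the card's consensus information is held in a plain dictionary keyed by the key parameter (the function name is hypothetical):

```python
def upsert_consensus(card: dict[str, str], key: str, value: str) -> bool:
    """Store a consensus item, overwriting any existing value with the same key.

    Returns True if an existing entry was overwritten, False if a new one was added.
    """
    existed = key in card
    card[key] = value  # the latest value always replaces the previous one
    return existed
```

With the meal-ordering example, a card holding `{"number of people": "4 people"}` becomes `{"number of people": "6 people"}` after the user asks to change the number of people, and the key stays the same.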
  • the user can trigger the conversation shorthand card to enter the editing mode by triggering a preset operation (such as long pressing, single clicking, double clicking the conversation shorthand card, etc.); then, the user can edit the conversation shorthand card to modify the content of the conversation shorthand card.
  • an entry point for viewing modifications can be provided in the conversation interface to show the modification record of the conversation shorthand card; through this entry point, the user can view the conversation shorthand card as it was before the modification, or a comparison between the modified card and the original conversation shorthand card.
  • the recognition of consensus information is mainly performed per topic (Topic); specifically, it is based on the dialogue text of each topic. Exemplarily, the consensus information can be extracted directly from the repeated confirmation statement.
  • the answer keyword is the number of people "4 people”, and 4 people is the consensus information.
  • the consensus information is combined, the question keyword is the number of people dining, the answer keyword is 4 people, and the number of people dining has a content relevance with 4 people, then "The number of people dining is 4 people” or "The number of people: 4 people” is used as consensus information.
  • the content relevance between keywords is judged by the artificial intelligence system.
  • the end of a round of dialogue is determined by the artificial intelligence system.
  • in addition to asking questions and receiving answers, the artificial intelligence system also repeatedly confirms the key points of the user's answer before asking new questions. Therefore, a round of dialogue starts with a question and ends with the key points of the answer being repeatedly confirmed by the artificial intelligence system. That is, a round of dialogue includes questions and answers, or includes questions, answers and repeated confirmation statements.
  • the artificial intelligence system can also combine the scene information obtained by the electronic device to preset the consensus information so that the consensus information is expressed more clearly.
  • Scene information includes: current location information or current time information, etc.
  • the user's dialogue input is "Please help me find a Sichuan restaurant at 7 o'clock tomorrow night”
  • the consensus information includes "7 o'clock tomorrow night” and "Sichuan cuisine”.
  • the meaning of "7 o'clock tomorrow night" depends on the current time information. If "7 o'clock tomorrow night" is recorded directly on the dialogue shorthand card, the time it refers to can no longer be expressed accurately when the card is viewed at a later stage. Therefore, "7 o'clock tomorrow night" is rewritten in combination with the current time information (November 17), and the dining time information "November 18 19:00" is obtained as the consensus information.
  • the consensus information is empty, that is, no consensus information is filled into the conversation shorthand card, and correspondingly, the created conversation shorthand card is a blank card.
  • the client can display the conversation shorthand card on the conversation interface.
  • the conversation shorthand card can be displayed at the top of the conversation interface, or it can be displayed in other locations of the conversation interface.
  • the conversation shorthand card can also be displayed floating above the conversation content, so that it does not take up layout space in the conversation content, although it blocks part of that content.
  • the conversation shorthand card can also be used as an additional interactive interface that can be called up, and it is displayed on the conversation interface when the user calls it up or when its information is updated.
  • decorative marks can be added to the text corresponding to the consensus information in the conversation interface, and these marks can be displayed through the client. In this way, it helps users understand what information is recorded.
  • decorative marks may include: underline, shading or star marks, etc.
  • the text effect of the text corresponding to the consensus information can also be changed, such as italicizing it, changing the font color, changing the font size, etc. For example, as shown in (B) or (C) of Figure 8, the consensus information can be highlighted by bolding, italicizing and underlining to remind users that this content is associated with the conversation shorthand card.
  • the decorative mark/text effect of the text corresponding to the original consensus information can be cancelled in the dialogue interface, and the new consensus information can be marked.
  • the mark of "family dinner with elderly people” can be cancelled, and "family dinner with elderly people and children” can be marked.
  • in addition to displaying the consensus information reached between the user and the artificial intelligence system, the conversation shorthand card can also support the creation and execution of automated tasks based on its content.
  • the artificial intelligence system can create an automated task of automatically booking a restaurant based on the information recorded in the conversation shorthand card and according to the conversation intention, and the artificial intelligence system calls the local application (applications, applets, service cards, HTML cards, etc., which are presented as cards/windows at runtime, and can be included in the application concept) to create a task of booking a restaurant.
  • an automated task control can also be displayed.
  • the automated task control can be used to execute automated tasks on an electronic device.
  • the artificial intelligence system can create an automated task based on the consensus information.
  • the client can display a task card on the conversation interface.
  • the task card can be used to indicate the execution process of the automated task, and after further interaction by the user, the automated task is completed; at this time, the task execution result can be displayed in the conversation shorthand card.
  • as shown in FIG9 (A), a dialogue interface 91 is displayed on the electronic device 100. In the dialogue interface 91, the user slides a finger on the navigation bar at the bottom of the screen.
  • This operation triggers the switching of the application, switches the previous/next application to the front end, and switches the artificial intelligence system to the background, which triggers the dialogue interface closing event.
  • when the dialogue interface closing event is detected, the dialogue interface closing operation is first interrupted in order to create a dialogue shorthand card. Afterwards, as shown in FIG9 (B), a dialogue shorthand card 92 can be displayed in the dialogue interface 91. At the same time, an automated task control 93 for "booking a restaurant" is displayed in the dialogue shorthand card 92. The user can then click on the automated task control 93, after which the electronic device 100 can display the interface shown in FIG9 (C), that is, a task card 95 is displayed.
  • the user can click on "xx restaurant” to complete the reservation of the restaurant.
  • the electronic device 100 can display an interface as shown in FIG9 (D).
  • a status control 94 for indicating the task status can also be displayed on the dialogue shorthand card 92.
  • through the status control 94, the user can intuitively observe the completion status of the automated task.
  • the status control 94 can also display the status of the current conversation.
  • the current conversation status is usually associated with the inquiry statement of the artificial intelligence system, which is the intention of the current inquiry statement.
  • the conversation status can be represented by "Discussing (To be determined the number of people for the dinner party)", where "Discussing” means that the artificial intelligence system is discussing with the user, and "To be determined the number of people for the dinner party” means the intention of the current discussion.
  • the artificial intelligence system can automatically determine whether to support the creation of automated tasks based on the existing consensus information in the conversation shorthand card.
  • when the existing consensus information is not sufficient to support the creation of automated tasks (for example, the number of people at the dinner or the dishes have not been determined), the automated task control can be set to unavailable; only when the existing consensus information can support the creation of automated tasks is the automated task control set to available.
  • after the conversation shorthand card is created, it can be saved locally or in the cloud. Since the conversation shorthand card saves text information in a predetermined data structure instead of saving all the conversation content, it does not need to occupy token resources.
  • the artificial intelligence system can automatically match existing conversation shorthand cards to find out whether there are conversation shorthand cards with the same or similar intentions. If an existing conversation shorthand card is matched, it is directly called and displayed on the current conversation interface through the client. For example, referring to Figure 8, in Figure 8 (C), the user did not take the next step but ended the conversation; then, as shown in Figure 10 (A), when the user makes a reservation at a restaurant/hotel again, after the user triggers the generation of a "conversation shorthand card", the artificial intelligence system can first automatically match the conversation shorthand card created in Figure 8 and then ask the user whether to continue the last conversation, that is, display the interface shown in FIG10 (B).
  • the conversation shorthand card created in FIG8 can be displayed on the conversation interface 101, that is, display the interface shown in FIG10 (C).
  • the user only needs to make a simple modification to the conversation shorthand card to make a restaurant reservation through the automated task, without having to conduct multiple rounds of conversation interactions to repeatedly ask the user.
  • the user can choose to close the original conversation interface.
  • the artificial intelligence system can match them in combination with the scene information obtained by the electronic device to improve the accuracy of the match. For example, combined with the current location information, the previously created conversation shorthand card for booking a restaurant near the current location is matched.
  • the status information of historical conversation shorthand cards can be combined to match the cards, giving priority to conversation shorthand cards with a status of "under discussion". This avoids asking for the requirements from the beginning, and also avoids the lack of a conversational experience that would result from jumping straight to the modification stage.
  • users can also search for dialogue shorthand cards through dialogue input.
  • users can include at least one question attribute of the consensus information in the dialogue input.
  • the artificial intelligence system can match dialogue shorthand cards with the same or similar attributes in combination with the question attributes. For example: the user's dialogue input can be "Please help me book a Sichuan restaurant nearby", then the dialogue shorthand cards of historically booked restaurants can be matched; further, the dialogue shorthand cards of the same location (or the historical location closest to the current location) can be searched in combination with the user's current location; further, dialogue shorthand cards with Sichuan dishes as the reserved dishes can be searched.
  • the user can also add/modify new consensus information during the dialogue input.
  • when the artificial intelligence system displays the dialogue shorthand card, it can first modify the historical dialogue shorthand card based on the consensus information obtained by parsing the dialogue input, and then display it. For example, as shown in Figure 11, the artificial intelligence system matches the existing dialogue shorthand card according to the user's dialogue input, and the user's dialogue input describes new consensus information, "The dinner time is 7 o'clock tomorrow night". Therefore, the system does not present the matched dialogue shorthand card as is, but modifies the dinner time and then displays the card in the dialogue interface. If the user's dialogue input is "Help me find a restaurant near the company that has not been booked in July this year", the user has not expressed new consensus information, so the matched historical dialogue shorthand card can be directly called and displayed.
  • the embodiment of the present application also provides a dialogue processing device.
  • FIG12 shows a schematic diagram of the structure of a dialogue processing device provided by an embodiment of the present application.
  • the dialogue processing device 1200 includes: a first display module 1201 and a second display module 1202.
  • the first display module 1201 is used to display a first dialogue interface for a user to interact with an artificial intelligence system.
  • the second display module 1202 is used to display a first dialogue shorthand card in the first dialogue interface in response to a first operation triggered by a user, and the first dialogue shorthand card includes: consensus information that the user and the artificial intelligence system have reached a consensus on in the first dialogue interface.
  • the first operation includes one or more of the following: the user triggers an instruction, operation, or event for creating a first conversation note card, or the conversation intention between the user and the artificial intelligence system is a preset intention.
  • the second display module 1202 is further used to: mark the consensus information in the conversation content displayed on the first conversation interface.
  • the second display module 1202 is further used to: update the consensus information included in the first conversation shorthand card when the consensus information in the conversation content displayed on the first conversation interface changes.
  • the second display module 1202 is further used to: in response to a user's editing operation on the consensus information in the first conversation shorthand card, update the consensus information included in the first conversation shorthand card.
  • the first dialogue shorthand card also includes: an automated task control.
  • the second display module 1202 is further used to: in response to a trigger operation on the automated task control, display a task card in the first dialogue interface, the task card is used to indicate the execution process of the task related to the consensus information; after the task indicated by the task card is completed, display the task execution result in the first dialogue shorthand card.
  • the second display module 1202 is also used to: close the first dialogue interface; in response to the user's operation of creating a new dialogue, display a second dialogue interface for the user to interact with the artificial intelligence system; when the user enters at least one consensus information in the second dialogue interface, display a first dialogue shorthand card in the second dialogue interface, where the first dialogue shorthand card is based on the consensus information in the second dialogue interface and is screened from the stored dialogue shorthand cards.
  • the first conversation shorthand card is displayed in the second conversation interface when the user agrees to continue, in the second conversation interface, the conversation from the first conversation interface.
  • the second display module 1202 is further used to: add/modify consensus information in the first dialogue shorthand card when the user adds/modifies consensus information in the second dialogue interface.
  • the above-mentioned device is used to execute the method in the above-mentioned embodiment.
  • the implementation principle and technical effect of the corresponding program module in the device are similar to those described in the above-mentioned method.
  • the working process of the device can refer to the corresponding process in the above-mentioned method, which will not be repeated here.
  • the electronic devices in the embodiments of the present application can be mobile phones, tablet computers, desktop computers, laptop computers, handheld computers, notebook computers, ultra-mobile personal computers (UMPC), netbooks, as well as cellular phones, personal digital assistants (PDA), augmented reality (AR) devices, virtual reality (VR) devices, artificial intelligence (AI) devices, wearable devices, vehicle-mounted devices, smart home devices and/or smart city devices, etc.
  • Fig. 13 shows a schematic diagram of the structure of an electronic device provided in an embodiment of the present application.
  • the electronic device 1300 may include: a processor 1310, a memory 1320, a display screen 1330, a sensor module 1340 and a battery 1350.
  • the processor 1310 is the computing core and control core of the electronic device 1300.
  • the processor 1310 may include one or more processing units.
  • the processor 1310 may include one or more of an application processor (application processor, AP), a modem, a graphics processor (graphics processing unit, GPU), an image signal processor (image signal processor, ISP), a controller, a video codec, a digital signal processor (digital signal processor, DSP), a baseband processor, and/or a neural network processor (neural-network processing unit, NPU).
  • different processing units can be independent devices or integrated in one or more processors.
  • the NPU can be a neural network (neural-network, NN) computing processor, which can quickly process input information by drawing on the biological neural network structure, such as drawing on the transmission mode between neurons in the human brain, and can also continuously self-learn.
  • through the NPU, applications such as intelligent cognition of the electronic device 1300 can be realized, such as: image recognition, face recognition, voice recognition, text understanding, text recognition, etc.
  • the processor 1310 can be used to, but is not limited to, execute the method in the aforementioned embodiment.
  • the memory 1320 may store a program, and the program may be executed by the processor 1310, so that the processor 1310 executes the method performed by the electronic device 1300 provided in the embodiment of the present application.
  • the memory 1320 may also store data.
  • the processor 1310 may read the data stored in the memory 1320.
  • the memory 1320 and the processor 1310 may be provided separately.
  • the memory 1320 may also be integrated in the processor 1310.
  • the display screen 1330 can be used to display images, videos, etc.
  • the display screen 1330 includes a display panel.
  • the display panel can be a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode (AMOLED), a flexible light-emitting diode (FLED), a Mini LED, a Micro LED, a Micro OLED, a quantum dot light-emitting diode (QLED), etc.
  • the electronic device 1300 may further include a sensor module 1340.
  • the sensor module 1340 may include a touch sensor.
  • the touch sensor may be used to detect a touch operation on or near the electronic device 1300.
  • the touch sensor may be implemented in a variety of ways such as resistive, capacitive, infrared, and surface acoustic waves.
  • the touch sensor may be disposed on the display screen 1330, and the touch sensor and the display screen 1330 may together form a touchscreen, also known as a "touch-controlled screen".
  • the touch sensor may also be disposed on the surface of the electronic device 1300, at a different location from the display screen 1330.
  • the sensor module 1340 may also include a pressure sensor, a gyroscope sensor, an air pressure sensor, a magnetic sensor, an acceleration sensor, a distance sensor, a proximity light sensor, a fingerprint sensor, a temperature sensor, an ambient light sensor, or a bone conduction sensor, etc.
  • a battery 1350 may also be provided on the electronic device 1300.
  • the battery 1350 may be used to supply power to the electronic device 1300.
  • the structure shown in FIG. 13 of this embodiment does not constitute a specific limitation on the electronic device 1300.
  • the electronic device 1300 may include more or fewer components than shown in the figure, or combine some components, or split some components, or arrange the components differently.
  • the components shown in the figure may be implemented in hardware, software, or a combination of software and hardware.
  • the embodiment of the present application provides a computer-readable storage medium, which stores a computer program.
  • when the computer program runs on a computing device cluster including at least one computing device, the computing device cluster executes the method described in the above embodiment.
  • the computer-readable storage medium can be any available medium that can be stored by the computing device or a data storage device such as a data center containing one or more available media.
  • the available medium can be a magnetic medium (e.g., a floppy disk, a hard disk, a magnetic tape), an optical medium (e.g., a DVD), or a semiconductor medium (e.g., a solid-state hard disk), etc.
  • an embodiment of the present application provides a computer program product including instructions.
  • when the computer program product is run on a computing device cluster including at least one computing device, the computing device cluster executes the method in the above embodiment.
  • the processor in the embodiments of the present application may be a central processing unit (CPU), or may be another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field programmable gate array (FPGA) or another programmable logic device, a transistor logic device, a hardware component, or any combination thereof.
  • a general-purpose processor can be a microprocessor or any conventional processor.
  • the method steps in the embodiments of the present application can be implemented by hardware or by a processor executing software instructions.
  • the software instructions can be composed of corresponding software modules, which can be stored in random access memory (RAM), flash memory, read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), registers, hard disks, mobile hard disks, CD-ROMs, or any other form of storage medium known in the art.
  • An exemplary storage medium is coupled to a processor so that the processor can read information from the storage medium and write information to the storage medium.
  • the storage medium can also be a component of the processor.
  • the processor and the storage medium can be located in an ASIC.
  • the computer program product includes one or more computer instructions.
  • the computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable device.
  • the computer instructions may be stored in a computer-readable storage medium or transmitted through the computer-readable storage medium.
  • the computer instructions may be transmitted from one website, computer, server or data center to another website, computer, server or data center by wired (e.g., coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (e.g., infrared, wireless, microwave, etc.).
  • the computer-readable storage medium may be any available medium that a computer can access or a data storage device such as a server or data center that includes one or more available media integrated.
  • the available medium may be a magnetic medium (e.g., a floppy disk, a hard disk, a tape), an optical medium (e.g., a DVD), or a semiconductor medium (e.g., a solid state drive (SSD)), etc.
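
Returning to the scene-information example earlier in this description, where "7 o'clock tomorrow night" is rewritten as "November 18 19:00" using the current date of November 17, the conversion can be sketched in Python as follows. This is only an illustrative sketch: the function name `resolve_tomorrow_evening` and the choice of year are assumptions, and the application does not specify how the relative expression is parsed, so the sketch starts from an already-extracted hour.

```python
from datetime import datetime, timedelta


def resolve_tomorrow_evening(now: datetime, hour_pm: int) -> datetime:
    """Turn "<hour_pm> o'clock tomorrow night" into an absolute datetime."""
    if not 1 <= hour_pm <= 11:
        raise ValueError("hour_pm must be between 1 and 11")
    # "Tomorrow" is resolved against the device's current time (scene information).
    tomorrow = now + timedelta(days=1)
    # Evening hours are mapped onto the 24-hour clock; smaller units are cleared.
    return tomorrow.replace(hour=hour_pm + 12, minute=0, second=0, microsecond=0)


# The year is an assumption; the description only gives the date November 17.
now = datetime(2023, 11, 17, 10, 30)
dining_time = resolve_tomorrow_evening(now, 7)
```

Storing the absolute value rather than the relative phrase means the card still reads correctly when it is reopened on a later day.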

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

A dialogue processing method, comprising: displaying a first dialogue interface for performing interaction with an artificial intelligence system by a user; and in response to a first operation triggered by the user, displaying a first dialogue shorthand card in the first dialogue interface, wherein the first dialogue shorthand card comprises consensus information of a consensus that has been reached by the user and the artificial intelligence system in the first dialogue interface. In this way, during a dialogue with an artificial intelligence system, a user can quickly master key points of the dialogue by means of a dialogue shorthand card, thereby improving the user experience.

Description

A dialogue processing method, apparatus, and electronic device

This application claims priority to the Chinese patent application filed with the China National Intellectual Property Administration on December 4, 2023, with application number 202311647974.4 and entitled "A dialogue processing method, apparatus, and electronic device", the entire contents of which are incorporated herein by reference.

Technical Field

The present application relates to the field of information technology (IT), and in particular to a dialogue processing method, apparatus, and electronic device.

Background Art

Currently, artificial intelligence systems with natural semantic understanding and natural conversation capabilities, such as digital assistants or large language models, can be used on electronic devices. Users can have interactive conversations with this type of artificial intelligence system on electronic devices. However, during the conversation, users often need to review most of the historical conversation content to grasp the key points of the conversation, which makes for a poor experience.

Summary of the invention

The present application provides a dialogue processing method, apparatus, electronic device, computer storage medium, and computer product, which enable users to quickly grasp the key points of a conversation and improve the user experience.

In a first aspect, the present application provides a dialogue processing method, including: displaying a first dialogue interface for a user to interact with an artificial intelligence system; and in response to a first operation triggered by the user, displaying a first dialogue shorthand card in the first dialogue interface, where the first dialogue shorthand card includes consensus information on which the user and the artificial intelligence system have reached a consensus in the first dialogue interface. In this way, during a dialogue with the artificial intelligence system, the user can quickly grasp the key points of the conversation through the dialogue shorthand card, which improves the user experience.

In a possible implementation, the first conversation shorthand card also includes a conversation status. In this way, the user can intuitively understand the conversation status, which improves the user experience.

In a possible implementation, the first operation includes one or more of the following: the user triggers an instruction, operation, or event for creating the first conversation shorthand card, or the conversation intention between the user and the artificial intelligence system is a preset intention. In this way, the generation of conversation shorthand cards can be triggered in a variety of ways, which enriches the ways in which conversation shorthand cards can be generated.

In a possible implementation, the method further includes: marking the consensus information in the conversation content displayed on the first conversation interface. This helps the user understand what information has been recorded.

In a possible implementation, after displaying the first conversation shorthand card, the method further includes: updating the consensus information included in the first conversation shorthand card when the consensus information in the conversation content displayed on the first conversation interface changes. In this way, the content in the conversation shorthand card can be kept up to date.
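
The update behaviour described here can be illustrated with a minimal Python sketch. It assumes, as the embodiments elsewhere in this description suggest, that consensus information is held as key-value pairs per topic and that a new value for an existing key overwrites the old one. The class name `ShorthandCard`, the method `upsert`, and the `history` list are hypothetical names introduced only for illustration; they are not defined by the application.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class ShorthandCard:
    """Hypothetical container for one topic's consensus information."""
    topic: str
    consensus: dict[str, str] = field(default_factory=dict)
    # Each entry is (key, previous value or None, new value).
    history: list[tuple[str, str | None, str]] = field(default_factory=list)

    def upsert(self, key: str, value: str) -> None:
        """Store a value; an existing key is kept and its value is overwritten."""
        previous = self.consensus.get(key)
        if previous != value:
            # Record the change so that a modification record could be shown later.
            self.history.append((key, previous, value))
        self.consensus[key] = value


card = ShorthandCard(topic="ordering")
card.upsert("number of people dining", "4 people")
card.upsert("number of people dining", "6 people")  # same key, value overwritten
```

Keeping the previous value in `history` is one possible way to back a modification-record view; the application does not prescribe how such records are stored.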

In a possible implementation, after displaying the first conversation shorthand card, the method further includes: in response to the user's editing operation on the consensus information in the first conversation shorthand card, updating the consensus information contained in the first conversation shorthand card. In this way, the user can change the content in the conversation shorthand card based on their own needs, which improves the user experience.

In a possible implementation, the first conversation shorthand card also includes an automated task control. In this case, after displaying the first conversation shorthand card, the method further includes: in response to a trigger operation on the automated task control, displaying a task card in the first conversation interface, where the task card is used to indicate the execution process of the task related to the consensus information; and after the task indicated by the task card is completed, displaying the task execution result in the first conversation shorthand card. In this way, task automation can be achieved, which improves the user experience.
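
One way to decide whether the automated task control should be available is to check that the consensus information already contains every field the task needs, in line with the embodiment in which the control stays unavailable while items such as the number of diners are undetermined. The sketch below assumes a restaurant-booking task; the required field names and both function names are hypothetical and chosen only for illustration.

```python
# Hypothetical set of fields a restaurant-booking task needs before it can run.
REQUIRED_FIELDS = frozenset({"number of people dining", "dining time", "cuisine"})


def missing_fields(consensus: dict, required=REQUIRED_FIELDS) -> list:
    """Return the required keys that are absent or empty, in sorted order."""
    return sorted(key for key in required if not consensus.get(key))


def task_control_enabled(consensus: dict) -> bool:
    """The control is available only when no required field is missing."""
    return not missing_fields(consensus)


partial = {"dining time": "November 18 19:00"}
complete = {
    **partial,
    "number of people dining": "6 people",
    "cuisine": "Sichuan cuisine",
}

print(task_control_enabled(partial), missing_fields(partial))
print(task_control_enabled(complete))
```

The list returned by `missing_fields` could also drive a conversation status such as "Discussing", since it names what is still to be determined.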

In a possible implementation, the method further includes: closing the first dialogue interface; in response to an operation by which the user creates a new dialogue, displaying a second dialogue interface in which the user interacts with the artificial intelligence system; and when the user inputs at least one piece of consensus information in the second dialogue interface, displaying the first dialogue shorthand card in the second dialogue interface, where the first dialogue shorthand card is selected from the stored dialogue shorthand cards based on the consensus information in the second dialogue interface. In this way, the user can continue a previously unfinished dialogue without repeating earlier questions over multiple rounds of interaction, which improves the user experience.

In a possible implementation, the first dialogue shorthand card displayed in the second dialogue interface is displayed when the user agrees to continue, in the second dialogue interface, the dialogue from the first dialogue interface.

In a possible implementation, the method further includes: when the user adds or modifies consensus information in the second dialogue interface, adding or modifying the corresponding consensus information in the first dialogue shorthand card. In this way, the content of the dialogue shorthand card stays consistent with the user's latest dialogue, which improves the user experience.
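Exemplarily, the selection of a stored dialogue shorthand card based on the consensus information entered in the second dialogue interface can be sketched as follows. This is only a minimal illustration under assumptions: the application does not specify the selection rule, so the overlap count used here, the function name `select_card`, and the representation of a card as a set of strings are all hypothetical.

```python
from typing import Dict, Optional, Set


def select_card(stored_cards: Dict[str, Set[str]],
                new_items: Set[str]) -> Optional[str]:
    """Return the ID of the stored card sharing the most consensus items
    with the new dialogue, or None if no stored card shares any item.

    Assumption: each stored card is reduced to a set of consensus strings,
    and "selection" means picking the card with the largest overlap.
    """
    best_id: Optional[str] = None
    best_overlap = 0
    for card_id, items in stored_cards.items():
        overlap = len(items & new_items)
        if overlap > best_overlap:
            best_id, best_overlap = card_id, overlap
    return best_id


stored = {
    "dinner": {"Sichuan cuisine", "near home"},
    "trip": {"Beijing", "3 days"},
}
candidate = select_card(stored, {"Sichuan cuisine"})
```

In such a sketch, the returned card would only be displayed after the user agrees to continue the earlier dialogue.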

In a possible implementation, the method further includes: displaying first prompt information in the first dialogue interface, where the first prompt information indicates that the first dialogue shorthand card has been created. In this way, the user is informed that a dialogue shorthand card has been generated.

In a second aspect, the present application provides a dialogue processing apparatus, including a first display module and a second display module. The first display module is configured to display a first dialogue interface in which a user interacts with an artificial intelligence system. The second display module is configured to display, in response to a first operation triggered by the user, a first dialogue shorthand card in the first dialogue interface, where the first dialogue shorthand card includes consensus information on which the user and the artificial intelligence system have reached consensus in the first dialogue interface.

In a possible implementation, the first operation includes one or more of the following: the user triggers an instruction, operation, or event for creating the first dialogue shorthand card; or the dialogue intent between the user and the artificial intelligence system is a preset intent.

In a possible implementation, the second display module is further configured to mark the consensus information in the dialogue content displayed in the first dialogue interface.

In a possible implementation, after displaying the first dialogue shorthand card, the second display module is further configured to update the consensus information contained in the first dialogue shorthand card when the consensus information in the dialogue content displayed in the first dialogue interface changes.

In a possible implementation, after displaying the first dialogue shorthand card, the second display module is further configured to update the consensus information contained in the first dialogue shorthand card in response to an editing operation performed by the user on the consensus information in the first dialogue shorthand card.

In a possible implementation, the first dialogue shorthand card further includes an automated task control. In this case, after displaying the first dialogue shorthand card, the second display module is further configured to: in response to a trigger operation on the automated task control, display a task card in the first dialogue interface, where the task card indicates the execution process of a task related to the consensus information; and after the task indicated by the task card has been executed, display the task execution result in the first dialogue shorthand card.

In a possible implementation, the second display module is further configured to: close the first dialogue interface; in response to an operation by which the user creates a new dialogue, display a second dialogue interface in which the user interacts with the artificial intelligence system; and when the user inputs at least one piece of consensus information in the second dialogue interface, display the first dialogue shorthand card in the second dialogue interface, where the first dialogue shorthand card is selected from the stored dialogue shorthand cards based on the consensus information in the second dialogue interface.

In a possible implementation, the first dialogue shorthand card displayed in the second dialogue interface is displayed when the user agrees to continue, in the second dialogue interface, the dialogue from the first dialogue interface.

In a possible implementation, the second display module is further configured to add or modify the corresponding consensus information in the first dialogue shorthand card when the user adds or modifies consensus information in the second dialogue interface.

In a third aspect, the present application provides an electronic device, including one or more processors and a memory. The memory stores one or more computer programs, and the one or more computer programs include instructions. When the instructions are executed by the one or more processors, the electronic device is caused to perform the method described in the first aspect or any possible implementation of the first aspect.

In a fourth aspect, the present application provides a computer-readable storage medium including computer program instructions. When the computer program instructions are executed by a computing device cluster, the computing device cluster performs the method described in the first aspect or any possible implementation of the first aspect. The computing device cluster may include one or more computing devices.

In a fifth aspect, the present application provides a computer program product containing instructions. When the instructions are run by a computing device cluster, the computing device cluster is caused to perform the method described in the first aspect or any possible implementation of the first aspect. The computing device cluster may include one or more computing devices.

It can be understood that, for the beneficial effects of the second to fifth aspects, reference may be made to the related description of the first aspect; details are not repeated here.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic diagram of an application scenario provided by an embodiment of the present application;

FIG. 2 is a schematic flowchart of a dialogue processing method provided by an embodiment of the present application;

FIG. 3 is a schematic diagram of interface changes on an electronic device provided by an embodiment of the present application;

FIG. 4 is a schematic diagram of interface changes on an electronic device provided by an embodiment of the present application;

FIG. 5 is a schematic diagram of interface changes on an electronic device provided by an embodiment of the present application;

FIG. 6 is a schematic diagram of interface changes on an electronic device provided by an embodiment of the present application;

FIG. 7 is a schematic diagram of interface changes on an electronic device provided by an embodiment of the present application;

FIG. 8 is a schematic diagram of interface changes on an electronic device provided by an embodiment of the present application;

FIG. 9 is a schematic diagram of interface changes on an electronic device provided by an embodiment of the present application;

FIG. 10 is a schematic diagram of interface changes on an electronic device provided by an embodiment of the present application;

FIG. 11 is a schematic diagram of interface changes on an electronic device provided by an embodiment of the present application;

FIG. 12 is a schematic structural diagram of a dialogue processing apparatus provided by an embodiment of the present application;

FIG. 13 is a schematic structural diagram of an electronic device provided by an embodiment of the present application.

DETAILED DESCRIPTION

The term "and/or" herein describes an association between associated objects and indicates that three relationships may exist. For example, "A and/or B" may indicate three cases: A alone, both A and B, and B alone. The symbol "/" herein indicates an "or" relationship between the associated objects; for example, A/B means A or B.

The terms "first", "second", and the like in the specification and claims herein are used to distinguish between different objects, not to describe a particular order of the objects. For example, a first response message and a second response message are used to distinguish between different response messages, not to describe a particular order of the response messages.

In the embodiments of the present application, words such as "exemplary" or "for example" are used to indicate an example, illustration, or explanation. Any embodiment or design described as "exemplary" or "for example" in the embodiments of the present application should not be construed as more preferred or more advantageous than other embodiments or designs. Rather, such words are intended to present the related concepts in a concrete manner.

In the description of the embodiments of the present application, unless otherwise specified, "multiple" means two or more. For example, multiple processing units means two or more processing units, and multiple elements means two or more elements.

In the embodiments of the present application, while a user on an electronic device is in dialogue with an artificial intelligence system that has natural semantic understanding and natural dialogue capabilities, the artificial intelligence system may, after detecting a preset input from the user, obtain the dialogue information, extract from it the consensus information reached between the user and the artificial intelligence system, fill the consensus information into a dialogue shorthand card, and display the dialogue shorthand card in the dialogue interface. In this way, the user can quickly grasp the key points of the dialogue through the dialogue shorthand card, which improves the user experience. Exemplarily, the artificial intelligence system may be, but is not limited to being, deployed on the electronic device as a client, and the client may be a desktop application, a mobile application, a Web application, a Web-based application, or the like.

Exemplarily, FIG. 1 shows a schematic diagram of an application scenario provided by an embodiment of the present application. In this application scenario, as shown in FIG. 1(A), user A is using an electronic device 100, and, as shown in FIG. 1(B), user A is interacting with an artificial intelligence system on the electronic device 100.

Exemplarily, FIG. 2 shows a schematic flowchart of a dialogue processing method provided by an embodiment of the present application. The method may be, but is not limited to being, applied to a client. The client may be, but is not limited to being, associated with an artificial intelligence system that has natural semantic understanding and natural dialogue capabilities. As shown in FIG. 2, the dialogue processing method may include the following steps:

S201. Display a first dialogue interface in which a user interacts with an artificial intelligence system.

In this embodiment, the user can enter the dialogue interface of the artificial intelligence system by, for example, clicking the client's application icon or opening a web page. After the user chooses to interact with the artificial intelligence system, the client can display the first dialogue interface in which the user interacts with the artificial intelligence system. The user can then hold a dialogue with the artificial intelligence system in the first dialogue interface. In addition, the user can also start the client by voice in order to interact with the artificial intelligence system.

S202. In response to a first operation triggered by the user, display a first dialogue shorthand card in the first dialogue interface, where the first dialogue shorthand card includes consensus information on which the user and the artificial intelligence system have reached consensus in the first dialogue interface.

In this embodiment, after the first dialogue interface is displayed, the user can interact with the artificial intelligence system, for example by voice or by text. In addition, the user can trigger certain instructions, operations, or events in the first dialogue interface, or express certain dialogue intents. The client can then, in response to the operation triggered by the user, display the first dialogue shorthand card in the first dialogue interface. The first dialogue shorthand card includes consensus information on which the user and the artificial intelligence system have reached consensus in the first dialogue interface. Exemplarily, the first operation triggered by the user can be understood as an operation for creating a dialogue shorthand card.

In this way, during a dialogue with the artificial intelligence system, the user can quickly grasp the key points of the dialogue through the dialogue shorthand card, which improves the user experience.

In some embodiments, the first operation in S202 may include one or more of the following:

1) The user inputs a preset creation statement in the dialogue interface

When the user wants to actively create a dialogue shorthand card, the creation can be triggered by a preset creation statement. The creation statement may be a system default or may be user-defined. For example, the creation statement may be "Hey, listen" or "Help me take a note". For example, as shown in FIG. 3(A), a dialogue interface 31 is displayed on the electronic device 100, and the user can input "Help me take a note" in the dialogue interface 31 to trigger the creation of a dialogue shorthand card. Afterwards, as shown in FIG. 3(B), a dialogue shorthand card 32 can be displayed in the dialogue interface 31. In the dialogue interface 31, the consensus information reached between the user and the artificial intelligence system is the "recorded content", so this consensus information can be displayed in the dialogue shorthand card 32. In addition, so that the user knows that a dialogue shorthand card has been created, prompt information (for example, the content shown in area 33 in FIG. 3(B)) can also be displayed in the dialogue interface 31 to indicate that the dialogue shorthand card has been created.

2) The dialogue intent between the user and the artificial intelligence system is a preset intent

Each time the artificial intelligence system receives a dialogue input from the user, it can analyze the dialogue intent of that input; when the dialogue intent is of a preset type, the creation of a dialogue shorthand card can be triggered. Dialogue intents of a preset type may be, but are not limited to, task intents that require multiple steps and are carried out over time, such as ordering a meal, arranging a trip, scheduling a meeting, or writing an email. For example, as shown in FIG. 4(A), a dialogue interface 41 is displayed on the electronic device 100, and the user can input "Please book a nearby Sichuan restaurant for me" in the dialogue interface 41. The user's dialogue input explicitly states the need to book a restaurant; by analyzing the input, the system can determine that the dialogue intent is a meal-ordering intent, which triggers the creation of a dialogue shorthand card. Afterwards, as shown in FIG. 4(B), a dialogue shorthand card 42 can be displayed in the dialogue interface 41. In the dialogue interface 41, the consensus information reached between the user and the artificial intelligence system is "book a restaurant", "Sichuan cuisine", and "near home", so this consensus information can be displayed in the dialogue shorthand card 42.

3) After the client displays a card creation invitation, the user triggers a confirmation operation

During a dialogue, if no user input is received within a preset time, the artificial intelligence system can display a card creation invitation in the dialogue interface; if the user confirms acceptance of the invitation through an operation, a dialogue shorthand card is created. Exemplarily, the card creation invitation may be displayed in the dialogue interface as a dialogue question; correspondingly, the user's confirmation operation may be a dialogue input, which may be text typed on a keyboard, voice input, voice-to-text input, and so on. The card creation invitation may also be presented as a pop-up window; correspondingly, the user's confirmation operation may be an interaction with a control in the pop-up window, a voice input, and so on. For example, as shown in FIG. 5(A), the user is consulting the artificial intelligence system in a dialogue, and a dialogue interface 51 is displayed on the electronic device 100. Having received no dialogue input from the user for a period of time, the artificial intelligence system can send the user an invitation to create a dialogue shorthand card, namely the question "Would you like me to book a Sichuan restaurant for you?" shown in area 52. After the artificial intelligence system receives the user's dialogue input "OK", the creation of the dialogue shorthand card is triggered. Afterwards, as shown in FIG. 5(B), a dialogue shorthand card 53 can be displayed in the dialogue interface 51. In the dialogue interface 51, the consensus information reached between the user and the artificial intelligence system is "book a restaurant", "Sichuan cuisine", and "near home", so this consensus information can be displayed in the dialogue shorthand card 53.

In addition, the artificial intelligence system may first analyze the dialogue content and identify at least one user intent from it, and then send the card creation invitation according to the user intent; the invitation is sent only when a user intent of a preset type exists. The main difference from the second manner is that the user intent identified here is more speculative: the user has not expressed such an intent, and the system infers from the dialogue content that the user has it, whereas in the second manner the user has expressed the intent explicitly.

4) The user selects, from a menu in the dialogue interface, the control for creating a dialogue shorthand card

A hidden menu may be provided in the dialogue interface. After calling up the hidden menu, the user can select the control in the menu for creating a dialogue shorthand card. The hidden menu may also be replaced by a non-hidden menu. For example, as shown in FIG. 6(A), a dialogue interface 61 is displayed on the electronic device 100. In the dialogue interface 61, the user can call up a menu 62 and select a control 63 for creating a dialogue shorthand card. Afterwards, as shown in FIG. 6(B), a dialogue shorthand card 64 can be displayed in the dialogue interface 61.

5) The user triggers an operation for creating a dialogue shorthand card

The operation for creating a dialogue shorthand card may be a preset operation. For example, on a mobile phone, the operation may be a swipe-up gesture from the bottom of the dialogue interface; on a computer, the operation may be clicking the back button, closing the page, or the like. The operation may also reuse another operation, for example the gesture for exiting the dialogue interface. For example, as shown in FIG. 7(A), a dialogue interface 71 is displayed on the electronic device 100. In the dialogue interface 71, when the user performs a swipe-up gesture from the bottom of the screen, the creation of a dialogue shorthand card is triggered. Afterwards, as shown in FIG. 7(B), a dialogue shorthand card 72 can be displayed in the dialogue interface 71. In addition, if the operation for creating a dialogue shorthand card also triggers the exit of the dialogue interface, the exit may first be interrupted and resumed after the dialogue shorthand card has been created; alternatively, the exit operation may be cancelled outright so that only the creation of the dialogue shorthand card is performed, and the dialogue interface exits when the user performs the swipe-up exit gesture again. Exemplarily, the user's operation on the electronic device can be transmitted to the artificial intelligence system through the electronic device.

6) The user triggers a preset event

The preset event may be an event of a preset type. For example, the event may be exiting the dialogue interface, or the artificial intelligence system being switched to the background. For example, as shown in FIG. 8(A), a dialogue interface 81 is displayed on the electronic device 100. In the dialogue interface 81, the user slides a finger along the navigation bar at the bottom of the screen. This operation triggers an application switch that brings the previous/next application to the foreground and switches the artificial intelligence system to the background, which triggers a dialogue interface closing event. When the dialogue interface closing event is detected, the closing operation is first interrupted and the dialogue shorthand card is created. Afterwards, as shown in FIG. 8(B), a dialogue shorthand card 82 can be displayed in the dialogue interface 81. Finally, the dialogue interface closing operation can be performed and the device switches to the next application. Exemplarily, the event triggered by the user on the electronic device can be transmitted to the artificial intelligence system through the electronic device.
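Exemplarily, part of the trigger check described above can be sketched as follows. This is a minimal illustration under assumptions, not the implementation of this application: the names `Trigger`, `detect_trigger`, and `should_invite`, the intent labels, and the 30-second timeout are hypothetical, and the intent label is assumed to come from a separate intent recognizer that is not shown.

```python
from enum import Enum, auto
from typing import Optional


class Trigger(Enum):
    CREATION_STATEMENT = auto()  # manner 1: preset creation statement
    PRESET_INTENT = auto()       # manner 2: dialogue intent of a preset type


# Illustrative values only; the real statements and intent types are configurable.
CREATION_STATEMENTS = ("hey, listen", "help me take a note")
PRESET_INTENTS = {"order_meal", "arrange_trip", "meeting", "write_email"}


def detect_trigger(text: str, intent: Optional[str] = None) -> Optional[Trigger]:
    """Return the trigger matched by one dialogue input, or None."""
    normalized = text.strip().lower()
    if any(normalized.startswith(s) for s in CREATION_STATEMENTS):
        return Trigger.CREATION_STATEMENT
    if intent in PRESET_INTENTS:
        return Trigger.PRESET_INTENT
    return None


def should_invite(seconds_since_last_input: float, timeout: float = 30.0) -> bool:
    """Manner 3: offer a card creation invitation after a period of silence."""
    return seconds_since_last_input > timeout
```

The menu control, the preset gesture, and the preset event (manners 4 to 6) would arrive as user-interface events rather than text, so they are left out of this sketch.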

In some embodiments, in S202, after the user triggers the first operation, the artificial intelligence system may first obtain the dialogue content between the user and the artificial intelligence system in the first dialogue interface. It then analyzes the obtained dialogue content and extracts the consensus information on which the two have reached consensus. Next, it fills the extracted consensus information into a data template to generate the dialogue shorthand card. Finally, the client can display the dialogue shorthand card in the dialogue interface. In this embodiment, the consensus information may come from the user's answers, that is, be identified from the user's answers, or it may be information that the artificial intelligence system restates to the user in a preset sentence pattern after the user answers. For example, the sentence pattern "XXXX, understood" in FIG. 7 and FIG. 8 is such a restatement, and the "XXXX" in it is the consensus information.
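Exemplarily, extracting consensus information from restatements of the form "XXXX, understood" and filling it into a template can be sketched as follows. This is only a sketch under assumptions: the regular expression, the function names `extract_consensus` and `render_card`, and the plain-text card layout are hypothetical, and identifying consensus information directly from the user's answers is not covered.

```python
import re
from typing import List

# Matches a system reply of the form "<consensus>, understood" (optional period).
RESTATEMENT = re.compile(r"^(?P<info>.+?),\s*understood\.?$", re.IGNORECASE)


def extract_consensus(system_replies: List[str]) -> List[str]:
    """Collect the consensus part of every reply that uses the preset pattern."""
    found = []
    for reply in system_replies:
        match = RESTATEMENT.match(reply.strip())
        if match:
            found.append(match.group("info").strip())
    return found


def render_card(items: List[str], title: str = "Dialogue shorthand") -> str:
    """Fill the consensus items into a simple text template."""
    return "\n".join([title] + [f"- {item}" for item in items])


replies = [
    "Sichuan cuisine, understood.",
    "What time would you like?",
    "Near home, understood",
]
card_text = render_card(extract_consensus(replies))
```

Only replies that follow the preset pattern contribute to the card; the question in the middle of the list is ignored.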

In addition, in one dialogue interface between the user and the artificial intelligence system, the user can raise at least one topic. For example, the user can first raise the topic of ordering a meal with the artificial intelligence system and then raise the topic of booking a room, and so on. In this scenario, after the dialogue content in the dialogue interface is obtained, the subject currently being discussed can first be determined, and the consensus information can then be extracted from the dialogue content of the topics under that subject. In this embodiment, the data of the dialogue context can be stored in a preset data structure, for example locally or in the cloud. The data structure may be as follows:

该数据结构中的变量解释:Explanation of the variables in this data structure:

Context:情景数据结构体,储存了一段对话中包含的情境信息;Context: a context data structure that stores context information contained in a conversation;

Conversational_Name:用于显示保存的对话名称;Conversational_Name: used to display the name of the saved conversation;

Conversational_ID:对话的独特ID;Conversational_ID: The unique ID of the conversation;

Conversational_LastUpdateTime:用于显示保存的对话最后交流的时间; Conversational_LastUpdateTime: used to display the last time of the saved conversation;

Conversational_CreateTime:用于记录该对话的创建时间;Conversational_CreateTime: used to record the creation time of the conversation;

Conversational_Intent:标志对话的主要意图类型;Conversational_Intent: marks the main intent type of the conversation;

Conversational_Goal:标志对话要达到的目标;Conversational_Goal: marks the goal of the conversation;

Conversational_Consensus:记录当前对话的对话状态,包括讨论中、已完成、已放弃。Conversational_Consensus: records the conversation status of the current conversation, including under discussion, completed, and abandoned.

Topic: a topic data structure used to store the context information of one topic within a conversation. Generally, one round of dialogue between the user and the artificial intelligence system (usually one question and one answer) corresponds to one Topic;

Topic_Name: used to display the name of the saved topic;

Topic_LastUpdateTime: used to display the time when the topic was created;

Topic_CreateTime: used to record the time when the topic was created;

Topic_Intent: indicates the intent of the topic;

Topic_Entity: records the list of entity parameters corresponding to the topic intent. For example, when Topic_Name is the number of people, Topic_Entity can be 4 people.
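The variables explained above can be gathered into a minimal Python sketch. This is an illustration only, not the data structure of the embodiment: the lowercase attribute names mirror the variable names in the text, while the `ConsensusStatus` enumeration, the `topics` list that nests Topic records inside a Context, and all default values are assumptions made for the example.

```python
from dataclasses import dataclass, field
from datetime import datetime
from enum import Enum
from typing import List


class ConsensusStatus(Enum):
    """The three conversation statuses named in the text (enum itself is an assumption)."""
    UNDER_DISCUSSION = "under discussion"
    COMPLETED = "completed"
    ABANDONED = "abandoned"


@dataclass
class Topic:
    """Context information for one topic, typically one question-and-answer round."""
    topic_name: str                      # Topic_Name, e.g. "number of people"
    topic_intent: str                    # Topic_Intent
    topic_entity: List[str] = field(default_factory=list)  # Topic_Entity, e.g. ["4 people"]
    topic_create_time: datetime = field(default_factory=datetime.now)       # Topic_CreateTime
    topic_last_update_time: datetime = field(default_factory=datetime.now)  # Topic_LastUpdateTime


@dataclass
class Context:
    """Context information for one conversation."""
    conversational_name: str             # Conversational_Name
    conversational_id: str               # Conversational_ID
    conversational_intent: str           # Conversational_Intent
    conversational_goal: str             # Conversational_Goal
    conversational_consensus: ConsensusStatus = ConsensusStatus.UNDER_DISCUSSION
    conversational_create_time: datetime = field(default_factory=datetime.now)
    conversational_last_update_time: datetime = field(default_factory=datetime.now)
    # Assumption: topics are nested inside their conversation; the text does not
    # state how Topic records are linked to a Context.
    topics: List[Topic] = field(default_factory=list)
```

Under these assumptions, a meal-ordering conversation would hold one `Context` and append a `Topic` for each round, such as one whose `topic_name` is "number of people" and whose `topic_entity` is `["4 people"]`.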

In this embodiment, the artificial intelligence system can identify the topic name and the intent in each round of dialogue, and can therefore identify the conversation topic of the current topic based on at least one of the topic name and the intent. After the conversation topic of the current topic is determined at the current time, the artificial intelligence system can judge the relevance between the conversation topics of different topics (Topic) and thereby determine a group of associated topics, all of which concern matters related to the same conversation topic. For example, FIG. 7 and FIG. 8 contain multiple topics related to ordering a meal, so the current conversation topic is determined to be ordering a meal; the consensus information is then identified from the conversation corresponding to the current conversation topic, and the conversation shorthand card is obtained.

In addition, a key-value pair structure can be used in each topic data structure to store the consensus information. In the key-value pair structure, the key parameter is "Topic_Name" and the value variable is "Topic_Entity". When the key parameter in a topic is unchanged but the value variable changes, the latest value variable can replace the previous one. For example, in the meal-ordering example, if the consensus information is "the number of diners is 4", the key parameter of the consensus information is "number of diners" and the value variable is "4 people" or "number of people: 4 people". If the user later wishes to change this and says "help me change the number of people to 6", the value variable of the consensus information becomes "6 people" or "number of people: 6 people", while the key parameter remains unchanged. Furthermore, each time a new piece of consensus information is obtained, a key-value check can be performed to determine whether consensus information with the same key parameter already exists. If it does, the value in the original key-value pair is overwritten with the new value, so that the content of the conversation shorthand card stays up to date. For example, as shown in (C) of FIG. 8, the user's latest dialogue input is "change it to a family dinner, with elderly people and children"; the consensus information extracted from this round of input is "family dinner, with elderly people and children", and the nature of the dinner is accordingly modified to "family dinner, with elderly people and children".

Besides updating the content of the conversation shorthand card in this way, the user can also edit the content of the card directly. For example, the user can perform a preset operation (such as a long press, single click, or double click on the conversation shorthand card) to put the card into editing mode, and then edit the card to modify its content. Furthermore, so that the user can review what has been edited, after the user modifies the conversation shorthand card, a modification viewing entry can be provided in the dialogue interface for viewing the modification record of the card; through this entry, the user can view the card as it was before modification, or a comparison against the original card.
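The key-value overwrite described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions: the card is modeled as a plain dictionary, and the function name `upsert_consensus` is hypothetical and does not come from the embodiment.

```python
from typing import Dict, Optional


def upsert_consensus(card: Dict[str, str], key: str, value: str) -> Optional[str]:
    """Record one piece of consensus information on the card.

    If consensus information with the same key parameter already exists,
    its value is overwritten with the new value. The previous value is
    returned (None if the key is new) so that a caller could keep a
    modification record.
    """
    previous = card.get(key)
    card[key] = value
    return previous


# Usage following the meal-ordering example in the text.
card: Dict[str, str] = {}
upsert_consensus(card, "number of diners", "4 people")
replaced = upsert_consensus(card, "number of diners", "6 people")
```

Returning the overwritten value is a design choice of this sketch; it is one simple way to support the modification record mentioned above, not something the text prescribes.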

In this embodiment, the consensus information is identified mainly on a per-topic (Topic) basis, specifically from the dialogue text of each topic. For example, the consensus information can be extracted directly from the repeated confirmation statement. Alternatively, the consensus information can be obtained by distilling the content of the dialogue text. For example, a round of conversation may end with only "OK, understood" before the next question is asked. In that case the consensus information cannot be obtained from the closing sentence, so it is obtained from the content relevance between the keywords in the question and those in the user's answer: the keywords in the user's answer that respond to the question are taken as the consensus information. For example, if the question asks how many people will be dining and the answer keyword is "4 people", then "4 people" is the consensus information. Alternatively, the consensus information can be composed from the question and the answer together: the question keyword is the number of diners, the answer keyword is 4 people, and the two are related in content, so "the number of diners is 4" or "number of people: 4 people" is taken as the consensus information. The content relevance between keywords is judged by the artificial intelligence system.

In addition, the end of a round of dialogue is determined by the artificial intelligence system. Taking the conversations in FIG. 7 and FIG. 8 as an example, besides the question and the answer, the artificial intelligence system also repeats the key points of the user's answer back to the user for confirmation before raising a new question. A round of conversation therefore starts with a question and ends with the key points of the answer being repeated and confirmed by the artificial intelligence system. That is, a round of conversation includes a question and an answer, or a question, an answer, and a repeated confirmation statement.

In some cases, after obtaining the consensus information, the artificial intelligence system can also apply preset processing to it in combination with scene information obtained by the electronic device, so that the consensus information is expressed more clearly. The scene information includes current location information, current time information, and the like. For example, if the user's dialogue input is "help me look for a Sichuan restaurant for 7 o'clock tomorrow evening", the consensus information includes "7 o'clock tomorrow evening" and "Sichuan cuisine", where "7 o'clock tomorrow evening" is relative to the current time. If "7 o'clock tomorrow evening" were recorded on the conversation shorthand card as is, it would no longer express the time accurately when the user views the card later. Therefore, "7 o'clock tomorrow evening" is rewritten in combination with the current time information (November 17) to obtain the dining time "November 18, 19:00", which is used as the consensus information.
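The rewriting of a relative time into an absolute time can be illustrated with a short Python sketch. It assumes that the phrase has already been parsed into "tomorrow evening" plus an hour on the 12-hour clock; the function name `resolve_tomorrow_evening` is hypothetical, and the year 2024 in the usage lines is an assumption, since the text gives only the month and day.

```python
from datetime import date, datetime, time, timedelta


def resolve_tomorrow_evening(today: date, hour_12: int) -> datetime:
    """Resolve 'tomorrow evening at <hour>' against the current date.

    hour_12 is an evening hour on the 12-hour clock (1 to 11), so 7 means 19:00.
    """
    if not 1 <= hour_12 <= 11:
        raise ValueError("expected an evening hour between 1 and 11")
    return datetime.combine(today + timedelta(days=1), time(hour=hour_12 + 12))


# Usage: the current date is November 17 (the year is an assumed placeholder).
resolved = resolve_tomorrow_evening(date(2024, 11, 17), 7)
```

Storing the resolved value rather than the original phrase keeps the card meaningful when it is viewed on a later day.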

If no consensus information is identified from the current conversation content or from the conversation content of the current topic, the consensus information is empty; that is, no consensus information is filled into the conversation shorthand card, and the conversation shorthand card created is accordingly a blank card.

In some embodiments, after the conversation shorthand card is filled with the consensus information, the client can display the card on the dialogue interface. The card can be displayed at the top of the dialogue interface or at another position in it. The card can also be displayed floating above the conversation content, in which case it does not take up layout space from the conversation content but covers part of it. In addition, the card can serve as an additional interactive interface that can be called up, and is displayed on the dialogue interface when the user calls it up or when its information is updated.

After the consensus information is added to the conversation shorthand card, decorative marks can be added to the text in the dialogue interface that corresponds to the consensus information, and these marks are displayed by the client. This helps the user understand which information has been recorded. The decorative marks may include underlines, shading, star marks, and the like. The text corresponding to the consensus information can also be restyled, for example by italicizing it or changing its font color or font size. For example, as shown in (B) or (C) of FIG. 8, bold, italics, and underlining can be used to highlight the consensus information, to remind the user that this content is associated with the conversation shorthand card.

In addition, when new consensus information overwrites the original consensus information, the decorative marks or text effects on the text corresponding to the original consensus information can be removed in the dialogue interface, and the new consensus information is marked instead. For example, as shown in (C) of FIG. 8, the marking on "family dinner, with elderly people" can be removed, and "family dinner, with elderly people and children" is marked.

In some embodiments, in addition to displaying the consensus information reached between the user and the artificial intelligence system, the conversation shorthand card can also support creating and executing an automated task based on the card. Taking the restaurant booking described in FIG. 8 as an example, the artificial intelligence system can create an automated restaurant-booking task based on the information recorded in the conversation shorthand card and on the conversation intent, and can call a local application to create the booking task (applications, applets, service cards, HTML cards, and other programs presented as a card or window at runtime all fall within the concept of an application here). For example, in addition to the consensus information, an automated task control can be displayed on the conversation shorthand card. The automated task control can be used to execute an automated task on the electronic device. When the user clicks the automated task control, the artificial intelligence system can create an automated task based on the consensus information. The client can then display a task card on the dialogue interface. The task card can indicate the execution process of the automated task, and the automated task is completed after further interaction by the user; the task execution result can then be displayed in the conversation shorthand card.

For example, referring to FIG. 9, as shown in (A) of FIG. 9, a dialogue interface 91 is displayed on the electronic device 100. In the dialogue interface 91, the user slides a finger on the navigation bar at the bottom of the screen. This operation triggers application switching: the previous or next application is switched to the foreground and the artificial intelligence system is switched to the background, which triggers a dialogue interface closing event. When the dialogue interface closing event is detected, the closing operation is first interrupted and the conversation shorthand card is created. Then, as shown in (B) of FIG. 9, a conversation shorthand card 92 can be displayed in the dialogue interface 91, together with an automated task control 93 for "book a restaurant" displayed in the card. The user can then click the automated task control 93, after which the electronic device 100 can display the interface shown in (C) of FIG. 9, that is, a task card 95. In (C) of FIG. 9, the user can click "xx restaurant" to complete the reservation at that restaurant. The electronic device 100 can then display the interface shown in (D) of FIG. 9, in which a status control 94 indicating the task status can also be displayed on the conversation shorthand card 92.

Through the status control 94, the user can directly observe the completion status of the automated task. The status control 94 can also display the status of the current conversation. The current conversation status is usually associated with the inquiry statement of the artificial intelligence system and reflects the intent of the current inquiry. For example, if the inquiry is "How many people?", the conversation status can be shown as "under discussion (number of diners to be determined)", where "under discussion" indicates that the artificial intelligence system is in discussion with the user and "number of diners to be determined" indicates the intent currently being discussed.

In addition, the artificial intelligence system can automatically determine, based on the consensus information already in the conversation shorthand card, whether creation of the automated task is supported. When the existing consensus information is insufficient to support creating the automated task (for example, the number of diners and the cuisine have not been determined), the automated task control can be set to unavailable; the control is set to available only when the existing consensus information is sufficient to support creating the automated task.
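The availability check for the automated task control can be sketched as follows. The text does not say how sufficiency is judged, so this Python sketch assumes a fixed list of required keys per task; the names `REQUIRED_KEYS` and `task_control_enabled` and the two key strings are hypothetical.

```python
from typing import Dict, Iterable

# Assumption: a restaurant-booking task needs at least these two items.
REQUIRED_KEYS = ("number of diners", "cuisine")


def task_control_enabled(
    card: Dict[str, str], required_keys: Iterable[str] = REQUIRED_KEYS
) -> bool:
    """Return True only if every required item has a non-empty value on the card."""
    return all(card.get(key) for key in required_keys)


# Usage: the control stays unavailable until both items are known.
partial_card = {"cuisine": "Sichuan cuisine"}
full_card = {"number of diners": "4 people", "cuisine": "Sichuan cuisine"}
control_states = (task_control_enabled(partial_card), task_control_enabled(full_card))
```

In practice the artificial intelligence system could make this judgment itself rather than rely on a fixed list; the list is used here only to keep the example self-contained.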

In some embodiments, after the conversation shorthand card is created, it can be saved locally or in the cloud. Because the card stores text information in a predetermined data structure rather than the entire conversation content, it does not need to occupy token resources.

Furthermore, when the user triggers creation of a conversation shorthand card, the artificial intelligence system can automatically search the existing conversation shorthand cards for one with the same or a similar intent. If an existing card is matched, it is called up directly and displayed on the current dialogue interface by the client. For example, referring again to FIG. 8, suppose that in (C) of FIG. 8 the user did not proceed to the next step but ended the conversation. Then, as shown in (A) of FIG. 10, when the user books a restaurant again and triggers generation of a conversation shorthand card, the system can first automatically match the card created in FIG. 8 and then ask the user whether to continue the previous conversation, that is, display the interface shown in (B) of FIG. 10. After the user agrees to continue the previous conversation, the card created in FIG. 8 can be displayed on the dialogue interface 101, that is, the interface shown in (C) of FIG. 10 is displayed. In this way, the user only needs to make simple modifications to the card to book the restaurant through the automated task, without multiple further rounds of conversation in which the user is asked the same questions again. For example, after creating a new dialogue interface, the user can choose to close the original dialogue interface.

In addition, when matching conversation shorthand cards, the artificial intelligence system can take into account the scene information obtained by the electronic device to improve matching accuracy. For example, using the current location information, it can match a previously created card for booking a restaurant near the current location. The status information of historical cards can also be used in matching, with cards whose status is "under discussion" matched first. This avoids both asking for the requirements from scratch and jumping straight to the modification stage, which would leave the user with no conversational experience.

In addition, the user can also search for a conversation shorthand card through dialogue input. The dialogue input can carry a question attribute of at least one piece of consensus information, and when searching, the artificial intelligence system can use the question attribute to match cards with the same or similar attributes. For example, if the user's dialogue input is "please help me book a Sichuan restaurant nearby", historical cards for booking a restaurant can be matched; further, the user's current location can be used to find cards for the same location (or whose historical location is closest to the current location); further still, cards in which the reserved cuisine is Sichuan cuisine can be found.
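The card matching described in the preceding paragraphs (same intent, matching attributes, preference for cards that are "under discussion", and proximity to the current location) can be sketched in Python. This is a rough illustration under several assumptions: the `StoredCard` record, the planar coordinates in kilometers, exact-match comparison of intents and attributes, and the ranking order (status first, then distance) are all choices made for the example and are not specified in the text.

```python
from dataclasses import dataclass
from math import hypot
from typing import Dict, List, Optional, Tuple


@dataclass
class StoredCard:
    """Hypothetical stored form of a conversation shorthand card."""
    card_id: str
    intent: str
    status: str                       # "under discussion", "completed", or "abandoned"
    location: Tuple[float, float]     # assumed planar coordinates, in km
    consensus: Dict[str, str]


def match_card(
    cards: List[StoredCard],
    intent: str,
    current_location: Tuple[float, float],
    attributes: Optional[Dict[str, str]] = None,
) -> Optional[StoredCard]:
    """Pick the best stored card for a new request, or None if nothing matches."""
    wanted = attributes or {}
    candidates = [
        card for card in cards
        if card.intent == intent
        and all(card.consensus.get(key) == value for key, value in wanted.items())
    ]
    if not candidates:
        return None

    def rank(card: StoredCard) -> Tuple[bool, float]:
        dx = card.location[0] - current_location[0]
        dy = card.location[1] - current_location[1]
        # False sorts before True, so "under discussion" cards come first;
        # ties are broken by distance to the current location.
        return (card.status != "under discussion", hypot(dx, dy))

    return min(candidates, key=rank)
```

A real system would likely judge "same or similar" intents and attributes with the artificial intelligence system itself rather than by string equality; equality is used here only so that the example runs on its own.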

The user can also add or modify consensus information in the dialogue input. When displaying the conversation shorthand card, the artificial intelligence system can modify the historical card based on the consensus information parsed from the dialogue input and then display it. For example, as shown in FIG. 11, the artificial intelligence system matches an existing card according to the user's dialogue input, and the input describes new consensus information, "the dinner time is 7 o'clock tomorrow evening". The matched card is therefore not presented as is; its dinner time is modified first, and the card is then displayed in the dialogue interface. If the user's dialogue input is "help me check whether I booked a restaurant near the company in July this year", the user has expressed no new consensus information, so the matched historical card can simply be called up and displayed.

It can be understood that the sequence numbers of the steps in the above embodiments do not imply an order of execution. The execution order of the processes should be determined by their functions and internal logic, and the sequence numbers should not constitute any limitation on the implementation of the embodiments of the present application. In addition, the embodiments described above can be combined according to actual conditions, and the combined solutions remain within the scope of protection of the present application.

Based on the method in the above embodiments, an embodiment of the present application further provides a dialogue processing apparatus.

For example, FIG. 12 shows a schematic structural diagram of a dialogue processing apparatus provided by an embodiment of the present application. As shown in FIG. 12, the dialogue processing apparatus 1200 includes a first display module 1201 and a second display module 1202. The first display module 1201 is configured to display a first dialogue interface in which a user interacts with an artificial intelligence system. The second display module 1202 is configured to display, in response to a first operation triggered by the user, a first conversation shorthand card in the first dialogue interface, where the first conversation shorthand card includes consensus information on which the user and the artificial intelligence system have reached consensus in the first dialogue interface.

In some embodiments, the first operation includes one or more of the following: the user triggers an instruction, operation, or event for creating the first conversation shorthand card; or the conversation intent between the user and the artificial intelligence system is a preset intent.

In some embodiments, the second display module 1202 is further configured to mark the consensus information in the conversation content displayed in the first dialogue interface.

In some embodiments, after displaying the first conversation shorthand card, the second display module 1202 is further configured to update the consensus information included in the first conversation shorthand card when the consensus information in the conversation content displayed in the first dialogue interface changes.

In some embodiments, after displaying the first conversation shorthand card, the second display module 1202 is further configured to update the consensus information included in the first conversation shorthand card in response to an editing operation performed by the user on the consensus information in the card.

In some embodiments, the first conversation shorthand card further includes an automated task control. In this case, after displaying the first conversation shorthand card, the second display module 1202 is further configured to: display a task card in the first dialogue interface in response to a trigger operation on the automated task control, where the task card indicates the execution process of a task related to the consensus information; and display the task execution result in the first conversation shorthand card after the task indicated by the task card is completed.

In some embodiments, the second display module 1202 is further configured to: close the first dialogue interface; display, in response to an operation by the user to create a new conversation, a second dialogue interface in which the user interacts with the artificial intelligence system; and display the first conversation shorthand card in the second dialogue interface when the user enters at least one piece of consensus information in the second dialogue interface, where the first conversation shorthand card is selected from the stored conversation shorthand cards based on the consensus information in the second dialogue interface.

In some embodiments, the first conversation shorthand card is displayed in the second dialogue interface on condition that the user agrees to continue, in the second dialogue interface, the conversation from the first dialogue interface.

In some embodiments, the second display module 1202 is further configured to add or modify consensus information in the first conversation shorthand card when the user adds or modifies consensus information in the second dialogue interface.

It should be understood that the above apparatus is used to execute the method in the above embodiments. The implementation principles and technical effects of the corresponding program modules in the apparatus are similar to those described for the method, and for the working process of the apparatus, reference may be made to the corresponding process in the method; details are not repeated here.

It can be understood that the electronic device in the embodiments of the present application can be a mobile phone, a tablet computer, a desktop computer, a laptop computer, a handheld computer, a notebook computer, an ultra-mobile personal computer (UMPC), a netbook, a cellular phone, a personal digital assistant (PDA), an augmented reality (AR) device, a virtual reality (VR) device, an artificial intelligence (AI) device, a wearable device, a vehicle-mounted device, a smart home device, and/or a smart city device. The embodiments of the present application do not impose any special restriction on the specific type of the electronic device.

For example, FIG. 13 shows a schematic structural diagram of an electronic device provided by an embodiment of the present application. As shown in FIG. 13, the electronic device 1300 may include a processor 1310, a memory 1320, a display screen 1330, a sensor module 1340, and a battery 1350.

The processor 1310 is the computing core and control core of the electronic device 1300. The processor 1310 may include one or more processing units. For example, the processor 1310 may include one or more of an application processor (AP), a modem, a graphics processing unit (GPU), an image signal processor (ISP), a controller, a video codec, a digital signal processor (DSP), a baseband processor, and/or a neural-network processing unit (NPU). Different processing units can be independent devices or can be integrated into one or more processors. In this embodiment, the NPU can be a neural-network (NN) computing processor that processes input information quickly by drawing on the structure of biological neural networks, for example the transmission mode between neurons in the human brain, and that can also learn continuously by itself. Applications such as intelligent cognition of the electronic device 1300 can be implemented through the NPU, for example image recognition, face recognition, speech recognition, text understanding, and text recognition. For example, the processor 1310 can be used to, but is not limited to, execute the method in the foregoing embodiments.

The memory 1320 may store a program that can be run by the processor 1310, so that the processor 1310 executes the method performed by the electronic device 1300 as provided in the embodiments of the present application. The memory 1320 may also store data, and the processor 1310 can read the data stored in the memory 1320. The memory 1320 and the processor 1310 may be provided separately. Optionally, the memory 1320 may also be integrated in the processor 1310.

The display screen 1330 may be used to display images, videos, and the like. The display screen 1330 includes a display panel. The display panel may be a liquid crystal display (LCD), an organic light-emitting diode (OLED) panel, an active-matrix organic light-emitting diode (AMOLED) panel, a flexible light-emitting diode (FLED) panel, a Mini LED, Micro LED, or Micro OLED panel, a quantum dot light-emitting diode (QLED) panel, or the like.

The electronic device 1300 may further include a sensor module 1340. The sensor module 1340 may include a touch sensor, which may be used to detect a touch operation on or near the electronic device 1300. By way of example, the touch sensor may be implemented with resistive, capacitive, infrared, or surface-acoustic-wave technology. The touch sensor may be disposed on the display screen 1330, in which case the touch sensor and the display screen 1330 together form a touchscreen, also called a "touch-sensitive screen". The touch sensor may instead be disposed on a surface of the electronic device 1300 at a location different from that of the display screen 1330. In some embodiments, the sensor module 1340 may also include a pressure sensor, a gyroscope sensor, a barometric pressure sensor, a magnetic sensor, an acceleration sensor, a distance sensor, a proximity light sensor, a fingerprint sensor, a temperature sensor, an ambient light sensor, a bone conduction sensor, or the like.

The electronic device 1300 may also be provided with a battery 1350, which supplies power to the electronic device 1300.

It can be understood that the structure illustrated in FIG. 13 of this embodiment does not constitute a specific limitation on the electronic device 1300. In other embodiments, the electronic device 1300 may include more or fewer components than shown, may combine some components, may split some components, or may arrange the components differently. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.

Based on the methods in the foregoing embodiments, an embodiment of the present application provides a computer-readable storage medium storing a computer program. When the computer program runs on a computing device cluster that includes at least one computing device, the computing device cluster performs the methods described in the foregoing embodiments. By way of example, the computer-readable storage medium may be any available medium that a computing device can store to, or a data storage device, such as a data center, that contains one or more available media. The available medium may be a magnetic medium (for example, a floppy disk, a hard disk, or a magnetic tape), an optical medium (for example, a DVD), a semiconductor medium (for example, a solid-state drive), or the like.

Based on the methods in the foregoing embodiments, an embodiment of the present application provides a computer program product containing instructions. When the computer program product runs on a computing device cluster that includes at least one computing device, the computing device cluster performs the methods in the foregoing embodiments.

It can be understood that the processor in the embodiments of the present application may be a central processing unit (CPU), or may be another general-purpose processor, a digital signal processor (DSP), an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA) or another programmable logic device, a transistor logic device, a hardware component, or any combination thereof. A general-purpose processor may be a microprocessor or any conventional processor.

The method steps in the embodiments of the present application may be implemented in hardware, or by a processor executing software instructions. The software instructions may consist of corresponding software modules, which may be stored in random access memory (RAM), flash memory, read-only memory (ROM), programmable read-only memory (PROM), erasable programmable read-only memory (EPROM), electrically erasable programmable read-only memory (EEPROM), a register, a hard disk, a removable hard disk, a CD-ROM, or any other form of storage medium known in the art. An exemplary storage medium is coupled to the processor so that the processor can read information from, and write information to, the storage medium. The storage medium may also be a component of the processor. The processor and the storage medium may be located in an ASIC.

The foregoing embodiments may be implemented wholly or partly in software, hardware, firmware, or any combination thereof. When implemented in software, they may be implemented wholly or partly in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, the processes or functions described in the embodiments of the present application are generated wholly or partly. The computer may be a general-purpose computer, a special-purpose computer, a computer network, or another programmable apparatus. The computer instructions may be stored in a computer-readable storage medium or transmitted through the computer-readable storage medium. The computer instructions may be transmitted from one website, computer, server, or data center to another website, computer, server, or data center over a wired connection (for example, coaxial cable, optical fiber, or digital subscriber line (DSL)) or a wireless connection (for example, infrared, radio, or microwave). The computer-readable storage medium may be any available medium that a computer can access, or a data storage device, such as a server or data center, that integrates one or more available media. The available medium may be a magnetic medium (for example, a floppy disk, a hard disk, or a magnetic tape), an optical medium (for example, a DVD), a semiconductor medium (for example, a solid-state drive (SSD)), or the like.

It can be understood that the various reference numerals used in the embodiments of the present application serve only to distinguish elements for ease of description and are not intended to limit the scope of the embodiments of the present application.

Finally, it should be noted that the above embodiments are intended only to illustrate the technical solutions of the present application, not to limit them. Although the present application has been described in detail with reference to the foregoing embodiments, a person of ordinary skill in the art should understand that the technical solutions described in the foregoing embodiments may still be modified, and that some of their technical features may be replaced with equivalents; such modifications or replacements do not cause the essence of the corresponding technical solutions to depart from the protection scope of the technical solutions of the embodiments of the present application.

Claims (23)

1. A dialogue processing method, characterized by comprising:
displaying a first dialogue interface in which a user interacts with an artificial intelligence system;
in response to a first operation triggered by the user, displaying a first dialogue shorthand card in the first dialogue interface, wherein the first dialogue shorthand card includes consensus information on which the user and the artificial intelligence system have reached consensus in the first dialogue interface.

2. The method according to claim 1, characterized in that the first dialogue shorthand card further includes a dialogue status.

3. The method according to claim 1 or 2, characterized in that the first operation comprises one or more of the following:
the user triggers an instruction, operation, or event for creating the first dialogue shorthand card; or a dialogue intent between the user and the artificial intelligence system is a preset intent.

4. The method according to any one of claims 1 to 3, characterized by further comprising:
marking the consensus information in the dialogue content displayed in the first dialogue interface.

5. The method according to any one of claims 1 to 4, characterized by further comprising, after displaying the first dialogue shorthand card:
when the consensus information in the dialogue content displayed in the first dialogue interface changes, updating the consensus information included in the first dialogue shorthand card.

6. The method according to any one of claims 1 to 5, characterized by further comprising, after displaying the first dialogue shorthand card:
in response to an editing operation by the user on the consensus information in the first dialogue shorthand card, updating the consensus information included in the first dialogue shorthand card.

7. The method according to any one of claims 1 to 6, characterized in that the first dialogue shorthand card further includes an automated task control;
after displaying the first dialogue shorthand card, the method further comprises:
in response to a trigger operation on the automated task control, displaying a task card in the first dialogue interface, the task card indicating the execution process of a task related to the consensus information;
after the task indicated by the task card has been executed, displaying the task execution result in the first dialogue shorthand card.

8. The method according to any one of claims 1 to 7, characterized by further comprising:
closing the first dialogue interface;
in response to an operation by the user to create a new dialogue, displaying a second dialogue interface in which the user interacts with the artificial intelligence system;
when the user inputs at least one item of the consensus information in the second dialogue interface, displaying the first dialogue shorthand card in the second dialogue interface, the first dialogue shorthand card being selected from stored dialogue shorthand cards based on the consensus information in the second dialogue interface.

9. The method according to claim 8, characterized in that the first dialogue shorthand card displayed in the second dialogue interface is displayed when the user agrees to continue, in the second dialogue interface, the dialogue from the first dialogue interface.

10. The method according to claim 8, characterized by further comprising:
when the user adds or modifies the consensus information in the second dialogue interface, adding or modifying the consensus information in the first dialogue shorthand card.

11. The method according to any one of claims 1 to 10, characterized by further comprising:
displaying first prompt information in the first dialogue interface, the first prompt information indicating that the first dialogue shorthand card has been created.
12. A dialogue processing apparatus, characterized by comprising:
a first display module, configured to display a first dialogue interface in which a user interacts with an artificial intelligence system;
a second display module, configured to display, in response to a first operation triggered by the user, a first dialogue shorthand card in the first dialogue interface, wherein the first dialogue shorthand card includes consensus information on which the user and the artificial intelligence system have reached consensus in the first dialogue interface.

13. The apparatus according to claim 12, characterized in that the first operation comprises one or more of the following:
the user triggers an instruction, operation, or event for creating the first dialogue shorthand card; or a dialogue intent between the user and the artificial intelligence system is a preset intent.

14. The apparatus according to claim 12 or 13, characterized in that the second display module is further configured to:
mark the consensus information in the dialogue content displayed in the first dialogue interface.

15. The apparatus according to any one of claims 12 to 14, characterized in that the second display module, after displaying the first dialogue shorthand card, is further configured to:
when the consensus information in the dialogue content displayed in the first dialogue interface changes, update the consensus information included in the first dialogue shorthand card.

16. The apparatus according to any one of claims 12 to 15, characterized in that the second display module, after displaying the first dialogue shorthand card, is further configured to:
in response to an editing operation by the user on the consensus information in the first dialogue shorthand card, update the consensus information included in the first dialogue shorthand card.

17. The apparatus according to any one of claims 12 to 16, characterized in that the first dialogue shorthand card further includes an automated task control;
the second display module, after displaying the first dialogue shorthand card, is further configured to:
in response to a trigger operation on the automated task control, display a task card in the first dialogue interface, the task card indicating the execution process of a task related to the consensus information;
after the task indicated by the task card has been executed, display the task execution result in the first dialogue shorthand card.

18. The apparatus according to any one of claims 12 to 17, characterized in that the second display module is further configured to:
close the first dialogue interface;
in response to an operation by the user to create a new dialogue, display a second dialogue interface in which the user interacts with the artificial intelligence system;
when the user inputs at least one item of the consensus information in the second dialogue interface, display the first dialogue shorthand card in the second dialogue interface, the first dialogue shorthand card being selected from stored dialogue shorthand cards based on the consensus information in the second dialogue interface.

19. The apparatus according to claim 18, characterized in that the first dialogue shorthand card displayed in the second dialogue interface is displayed when the user agrees to continue, in the second dialogue interface, the dialogue from the first dialogue interface.

20. The apparatus according to claim 18, characterized in that the second display module is further configured to:
when the user adds or modifies the consensus information in the second dialogue interface, add or modify the consensus information in the first dialogue shorthand card.

21. An electronic device, characterized by comprising:
one or more processors;
a memory;
wherein the memory stores one or more computer programs, the one or more computer programs include instructions, and when the instructions are executed by the one or more processors, the electronic device performs the method according to any one of claims 1-12.

22. A computer-readable storage medium storing a computer program, wherein when the computer program runs on an electronic device, the electronic device performs the method according to any one of claims 1-12.

23. A computer program product, characterized in that when the computer program product runs on an electronic device, the electronic device performs the method according to any one of claims 1-12.
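
As an illustration only, the card behaviour recited in the method claims (a card holding consensus information and a dialogue status, updates to that information, and selection of a stored card when a new dialogue mentions an item of consensus information) could be sketched in Python as follows. The names `ShorthandCard`, `CardStore`, `update`, and `find_matching` are hypothetical and do not appear in the application, and the case-insensitive substring match is an assumed stand-in for whatever selection logic an actual implementation uses, which the claims do not specify.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class ShorthandCard:
    """Hypothetical model of a dialogue shorthand card."""

    card_id: str
    # Consensus information, e.g. {"destination": "Beijing"}.
    consensus: dict[str, str] = field(default_factory=dict)
    # Dialogue status carried on the card; the value is illustrative.
    status: str = "in_progress"

    def update(self, key: str, value: str) -> None:
        """Add or change one item of consensus information."""
        self.consensus[key] = value


class CardStore:
    """In-memory stand-in for wherever stored cards are kept (assumption)."""

    def __init__(self) -> None:
        self._cards: dict[str, ShorthandCard] = {}

    def save(self, card: ShorthandCard) -> None:
        self._cards[card.card_id] = card

    def find_matching(self, user_input: str) -> list[ShorthandCard]:
        """Return stored cards with at least one consensus value in the input.

        Assumed matching rule: case-insensitive substring search.
        """
        text = user_input.lower()
        return [
            card
            for card in self._cards.values()
            if any(v and v.lower() in text for v in card.consensus.values())
        ]
```

In this sketch, a second dialogue interface would call `find_matching` on each new user input and, if the user agrees to continue the earlier dialogue, display the returned card and route later additions or changes through `update`.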
Application PCT/CN2024/113303, priority date 2023-12-04, filing date 2024-08-20: Dialogue processing method and apparatus, and electronic device. Status: Pending. Publication: WO2025118687A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202311647974.4 2023-12-04
CN202311647974.4A CN120122853A (en) 2023-12-04 2023-12-04 A conversation processing method, device and electronic equipment

Publications (1)

Publication Number Publication Date
WO2025118687A1 (en) 2025-06-12

Family

ID=95919279

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2024/113303 Pending WO2025118687A1 (en) 2023-12-04 2024-08-20 Dialogue processing method and apparatus, and electronic device

Country Status (2)

Country Link
CN (1) CN120122853A (en)
WO (1) WO2025118687A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106919321A (en) * 2015-12-25 2017-07-04 阿里巴巴集团控股有限公司 A kind of information interacting method and device, user equipment
CN110046238A (en) * 2019-03-29 2019-07-23 华为技术有限公司 Talk with exchange method, graphic user interface, terminal device and the network equipment
CN112799764A (en) * 2021-01-29 2021-05-14 上海明略人工智能(集团)有限公司 Method, device, device and storage medium for generating task flow chart
CN113689253A (en) * 2021-07-09 2021-11-23 广州华多网络科技有限公司 Live scene order generation method and corresponding device, equipment and medium thereof
US20220301082A1 (en) * 2021-02-24 2022-09-22 Conversenowai Order Post to Enable Parallelized Order Taking Using Artificial Intelligence Engine(s)

Also Published As

Publication number Publication date
CN120122853A (en) 2025-06-10

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24899335

Country of ref document: EP

Kind code of ref document: A1