[go: up one dir, main page]

CN109800361A - A kind of method for digging of interest point name, device, electronic equipment and storage medium - Google Patents

A kind of method for digging of interest point name, device, electronic equipment and storage medium Download PDF

Info

Publication number
CN109800361A
CN109800361A CN201910110277.2A CN201910110277A CN109800361A CN 109800361 A CN109800361 A CN 109800361A CN 201910110277 A CN201910110277 A CN 201910110277A CN 109800361 A CN109800361 A CN 109800361A
Authority
CN
China
Prior art keywords
search
poi
interest
name
interest point
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910110277.2A
Other languages
Chinese (zh)
Inventor
张伟
朱重黎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201910110277.2A priority Critical patent/CN109800361A/en
Publication of CN109800361A publication Critical patent/CN109800361A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the invention discloses a kind of method for digging of interest point name, device, electronic equipment and storage mediums.The described method includes: obtaining the search sessions of each user in historical search log;The corresponding interest point name pair of each search sessions is determined according to the search sessions of each user;Wherein, the interest point name of the interest point name centering belongs to the same search sessions;Determine each interest point name to corresponding search characteristics in predetermined search characteristics library;According to the corresponding interest point name pair of each search sessions and each interest point name to corresponding search characteristics, the interest point name of each interest point name centering is excavated.The digging efficiency of interest point name not only can be improved, it can also be ensured that the accuracy rate of interest point name.

Description

一种兴趣点名称的挖掘方法、装置、电子设备及存储介质A method, device, electronic device and storage medium for mining the name of a point of interest

技术领域technical field

本发明实施例涉及互联网技术领域,尤其涉及一种兴趣点名称的挖掘方法、装置、电子设备及存储介质。Embodiments of the present invention relate to the field of Internet technologies, and in particular, to a method, device, electronic device, and storage medium for mining a name of a point of interest.

背景技术Background technique

在电子地图中有大量的定位点,例如,地图上标注的餐馆、酒店、景点、收费站等,这些定位点即为用户可能会进行查询或者想要到达的兴趣点(Point of Interest,POI)。There are a large number of positioning points in the electronic map, such as restaurants, hotels, scenic spots, toll stations, etc. marked on the map. These positioning points are the points of interest (POI) that users may query or want to reach. .

一般情况下,同一个兴趣点会有多个不同的名称,例如,某一个兴趣点可以有两个名称,分别为:“雍和宫”和“喇嘛庙”;此时,“雍和宫”可以作为该兴趣点的原始名称;该原始名称是在现有的电子地图中已经存在的名称;“喇嘛庙”可以作为该兴趣点的候选名称;该候选名称是在现有的电子地图中不存在的名称。在现有的电子地图数据库中,很可能没有包含一个兴趣点的所有名称。例如,对于原始名称为“雍和宫”的兴趣点,电子地图中的数据库中只存在“雍和宫”这个原始名称,而不存在“喇嘛庙”这个候选名称。那么,如果用户在使用电子地图中搜索名称为“喇嘛庙”的兴趣点时,由于电子地图中的数据库中不存在“喇嘛庙”这个候选名称,因此用户就无法查找到这个兴趣点。Under normal circumstances, the same POI will have multiple different names. For example, a POI can have two names, namely: "Lama Temple" and "Lama Temple"; in this case, "Lama Temple" can be used as the The original name of the POI; the original name is the name that already exists in the existing electronic map; "Lama Temple" can be used as the candidate name of the POI; the candidate name does not exist in the existing electronic map name. In the existing electronic map database, it is likely that not all the names of a point of interest are contained. For example, for the point of interest with the original name of "Lama Temple", only the original name of "Lama Temple" exists in the database in the electronic map, but the candidate name of "Lama Temple" does not exist. Then, if a user searches for a point of interest named "Lama Temple" in the electronic map, since the candidate name "Lama Temple" does not exist in the database in the electronic map, the user cannot find this point of interest.

为了提升用户的搜索体验,需要对各个兴趣点的候选名称进行挖掘,然后将挖掘出的候选名称补充到电子地图数据库中。在现有的兴趣点名称的挖掘方法中,通常采用人工方式挖掘兴趣点名称,不仅挖掘效率较低,而且准确率也无法保证。In order to improve the user's search experience, the candidate names of each point of interest need to be mined, and then the mined candidate names are added to the electronic map database. In the existing mining methods of POI names, the POI names are usually mined manually, which not only has low mining efficiency, but also cannot guarantee the accuracy.

发明内容SUMMARY OF THE INVENTION

有鉴于此,本发明实施例提供一种兴趣点名称的挖掘方法、装置、电子设备及存储介质,不仅可以提高兴趣点名称的挖掘效率,还可以保证兴趣点名称的准确率。In view of this, the embodiments of the present invention provide a method, device, electronic device and storage medium for mining POI names, which can not only improve the mining efficiency of POI names, but also ensure the accuracy of POI names.

第一方面,本发明实施例提供了一种兴趣点名称的挖掘方法,所述方法包括:In a first aspect, an embodiment of the present invention provides a method for mining POI names, the method comprising:

在历史搜索日志中获取各个用户的搜索会话;Obtain the search sessions of each user in the historical search log;

根据各个用户的搜索会话确定各个搜索会话对应的兴趣点名称对;其中,所述兴趣点名称对中的兴趣点名称属于同一个搜索会话;Determine the POI name pairs corresponding to each search session according to the search sessions of each user; wherein, the POI names in the POI name pairs belong to the same search session;

在预先确定的搜索特征库中确定各个兴趣点名称对对应的搜索特征;Determine the search feature corresponding to each point of interest name pair in the predetermined search feature library;

根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。According to the POI name pairs corresponding to each search session and the search features corresponding to each POI name pair, the POI names in each POI name pair are mined.

在上述实施例中,所述在历史搜索日志中获取各个用户的搜索会话,包括:In the above embodiment, the acquisition of the search sessions of each user in the historical search log includes:

在所述历史搜索日志中获取各个用户的全部POI搜索词和各个POI搜索词对应的搜索时间;Obtain all the POI search terms of each user and the search time corresponding to each POI search term in the historical search log;

根据各个POI搜索词对应的搜索时间将各个用户的全部POI搜索词划分到与其对应的搜索会话中;其中,各个搜索会话中的所有POI搜索词来自同一个用户。All POI search terms of each user are divided into corresponding search sessions according to the search time corresponding to each POI search term; wherein, all POI search terms in each search session are from the same user.

在上述实施例中,所述根据各个用户的搜索会话确定各个搜索会话对应的兴趣点名称对,包括:In the above-mentioned embodiment, the determining of the name pairs of points of interest corresponding to each search session according to each user's search session includes:

在各个用户的搜索会话中确定出各个搜索会话中的POI搜索词;Determine the POI search terms in each search session in each user's search session;

将各个搜索会话中每两个POI搜索词组合为一个兴趣点名称对,作为各个搜索会话对应的兴趣点名称对。Every two POI search terms in each search session are combined into a point-of-interest name pair, which is used as a point-of-interest name pair corresponding to each search session.

在上述实施例中,所述在预先确定的搜索特征库中确定各个兴趣点名称对对应的搜索特征,包括:In the above embodiment, determining the search features corresponding to each point of interest name pair in the predetermined search feature library includes:

在预先确定的搜索结果特征库中确定各个兴趣点名称对对应的搜索结果特征;其中,所述兴趣点名称对包括:第一兴趣点名称和第二兴趣点名称;Determine the search result feature corresponding to each POI name pair in a predetermined search result feature library; wherein, the POI name pair includes: a first POI name and a second POI name;

在预先确定的用户行为特征库中确定各个兴趣点名称对对应的用户行为特征;其中,所述用户行为特征包括:所述第一兴趣点名称的搜索热度特征以及各个兴趣点名称对的搜索共现特征。A user behavior feature corresponding to each POI name pair is determined in a predetermined user behavior feature database; wherein, the user behavior feature includes: the search popularity feature of the first POI name and the search commonality of each POI name pair. present feature.

在上述实施例中,所述根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘,包括:In the above embodiment, mining the POI names in each POI name pair according to the POI name pairs corresponding to each search session and the corresponding search features of each POI name pair, including:

将各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征输入到预先训练的判别模型中;Input the POI name pair corresponding to each search session and the search feature corresponding to each POI name pair into the pre-trained discriminant model;

通过所述判别模型获取到各个兴趣点名称对对应的判别概率值;The discrimination probability value corresponding to each point of interest name pair is obtained through the discriminant model;

根据各个兴趣点名称对对应的判别概率值和预先确定的判别阈值,确定各个兴趣点名称对对应的判别结果。According to the discrimination probability value corresponding to each interest point name pair and the predetermined discrimination threshold, the discrimination result corresponding to each interest point name pair is determined.

第二方面,本发明实施例提供了一种兴趣点名称的挖掘装置,所述装置包括:获取模块、确定模块和挖掘模块;其中,In a second aspect, an embodiment of the present invention provides a device for mining a point of interest name, the device includes: an acquisition module, a determination module, and a mining module; wherein,

所述获取模块,用于在历史搜索日志中获取各个用户的搜索会话;The obtaining module is used to obtain the search session of each user in the historical search log;

所述确定模块,用于根据各个用户的搜索会话确定各个搜索会话对应的兴趣点名称对;其中,所述兴趣点名称对中的兴趣点名称属于同一个搜索会话;在预先确定的搜索特征库中确定各个兴趣点名称对对应的搜索特征;The determining module is used to determine the POI name pairs corresponding to each search session according to the search sessions of each user; wherein, the POI names in the POI name pairs belong to the same search session; in a predetermined search feature database Determine the search feature corresponding to each point of interest name pair in ;

所述挖掘模块,用于根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。The mining module is configured to mine the POI names in each POI name pair according to the POI name pair corresponding to each search session and the corresponding search feature of each POI name pair.

在上述实施例中,所述获取模块包括:获取子模块和划分子模块;其中,In the above embodiment, the acquisition module includes: an acquisition sub-module and a division sub-module; wherein,

所述获取子模块,用于在所述历史搜索日志中获取各个用户的全部POI搜索词和各个POI搜索词对应的搜索时间;The acquisition submodule is used to acquire all the POI search terms of each user and the search time corresponding to each POI search term in the historical search log;

所述划分子模块,用于根据各个POI搜索词对应的搜索时间将各个用户的全部POI搜索词划分到与其对应的搜索会话中;其中,各个搜索会话中的所有POI搜索词来自同一个用户。The dividing submodule is used to divide all POI search terms of each user into the corresponding search sessions according to the search time corresponding to each POI search term; wherein, all POI search terms in each search session are from the same user.

在上述实施例中,所述确定模块,具体用于在各个用户的搜索会话中确定出各个搜索会话中的POI搜索词;将各个搜索会话中每两个POI搜索词组合为一个兴趣点名称对,作为各个搜索会话对应的兴趣点名称对。In the above embodiment, the determining module is specifically configured to determine the POI search terms in each search session in each user's search session; combine every two POI search terms in each search session into a point-of-interest name pair , as the POI name pairs corresponding to each search session.

在上述实施例中,所述确定模块,具体用于在预先确定的搜索结果特征库中确定各个兴趣点名称对对应的搜索结果特征;其中,所述兴趣点名称对包括:第一兴趣点名称和第二兴趣点名称;在预先确定的用户行为特征库中确定各个兴趣点名称对对应的用户行为特征;其中,所述用户行为特征包括:所述第一兴趣点名称的搜索热度特征以及各个兴趣点名称对的搜索共现特征。In the above embodiment, the determining module is specifically configured to determine the search result feature corresponding to each POI name pair in a predetermined search result feature library; wherein, the POI name pair includes: a first POI name and the name of the second point of interest; determine the corresponding user behavior characteristics of each point of interest name in the predetermined user behavior feature library; wherein, the user behavior characteristics include: the search popularity feature of the name of the first point of interest and each Search co-occurrence features for POI name pairs.

在上述实施例中,所述挖掘模块,具体用于将各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征输入到预先训练的判别模型中;通过所述判别模型获取到各个兴趣点名称对对应的判别概率值;根据各个兴趣点名称对对应的判别概率值和预先确定的判别阈值,确定各个兴趣点名称对对应的判别结果。In the above embodiment, the mining module is specifically configured to input the name pairs of interest points corresponding to each search session and the search features corresponding to the name pairs of each interest point into the pre-trained discriminant model; Discrimination probability value corresponding to each POI name pair; according to the corresponding discrimination probability value of each POI name pair and a predetermined discrimination threshold, determine the corresponding discrimination result of each POI name pair.

第三方面,本发明实施例提供了一种电子设备,包括:In a third aspect, an embodiment of the present invention provides an electronic device, including:

一个或多个处理器;one or more processors;

存储器,用于存储一个或多个程序,memory for storing one or more programs,

当所述一个或多个程序被所述一个或多个处理器执行,使得所述一个或多个处理器实现本发明任意实施例所述的兴趣点名称的挖掘方法。When the one or more programs are executed by the one or more processors, the one or more processors implement the method for mining POI names according to any embodiment of the present invention.

第四方面,本发明实施例提供了一种存储介质,其上存储有计算机程序,该程序被处理器执行时实现本发明任意实施例所述的兴趣点名称的挖掘方法。In a fourth aspect, an embodiment of the present invention provides a storage medium on which a computer program is stored, and when the program is executed by a processor, implements the method for mining a point of interest name described in any embodiment of the present invention.

本发明施例提出了一种兴趣点名称的挖掘方法、装置、电子设备及存储介质,先在历史搜索日志中获取各个用户的搜索会话;然后根据各个用户的搜索会话确定各个搜索会话对应的兴趣点名称对;在预先确定的搜索特征库中确定各个兴趣点名称对对应的搜索特征;再根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。也就是说,在本发明的技术方案中,可以根据各个用户的搜索会话确定各个搜索会话对应的兴趣点名称对;在确定出各个兴趣点名称对对应的搜索特征之后,再根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。而在现有的兴趣点名称的挖掘方法中,通常采用人工方式挖掘兴趣点名称,不仅挖掘效率较低,而且准确率也无法保证。因此,和现有技术相比,本发明实施例提出的兴趣点名称的挖掘方法、装置、电子装置及存储介质,不仅可以提高兴趣点名称的挖掘效率,还可以保证兴趣点名称的准确率;并且,本发明实施例的技术方案实现简单方便、便于普及,适用范围更广。The embodiments of the present invention provide a method, device, electronic device and storage medium for mining the name of a point of interest. First, the search sessions of each user are obtained from a historical search log; then the interest corresponding to each search session is determined according to the search sessions of each user. point name pair; determine the search feature corresponding to each point of interest name pair in a predetermined search feature library; then according to the point of interest name pair corresponding to each search session and the corresponding search feature of each point of interest name Mining for POI names in . That is to say, in the technical solution of the present invention, the point-of-interest name pair corresponding to each search session can be determined according to the search session of each user; after the search feature corresponding to each point-of-interest name pair is determined, the corresponding The POI name pairs of , and the corresponding search features of each POI name pair, and the POI names in each POI name pair are mined. However, in the existing mining methods of POI names, the POI names are usually mined manually, which not only has low mining efficiency, but also cannot guarantee the accuracy. Therefore, compared with the prior art, the mining method, device, electronic device and storage medium for POI names proposed in the embodiments of the present invention can not only improve the mining efficiency of POI names, but also ensure the accuracy of POI names; In addition, the technical solutions of the embodiments of the present invention are simple and convenient to implement, easy to popularize, and have wider application range.

附图说明Description of drawings

图1为本发明实施例一提供的兴趣点名称的挖掘方法的流程示意图;1 is a schematic flowchart of a method for mining POI names according to Embodiment 1 of the present invention;

图2为本发明实施例二提供的兴趣点名称的挖掘方法的流程示意图;2 is a schematic flowchart of a method for mining POI names according to Embodiment 2 of the present invention;

图3为本发明实施例三提供的兴趣点名称的挖掘方法的流程示意图;3 is a schematic flowchart of a method for mining POI names according to Embodiment 3 of the present invention;

图4为本发明实施例四提供的兴趣点名称的挖掘装置的第一结构示意图;FIG. 4 is a first structural schematic diagram of the excavation device for POI names according to Embodiment 4 of the present invention;

图5为本发明实施例四提供的兴趣点名称的挖掘装置的第二结构示意图;FIG. 5 is a second structural schematic diagram of the excavation device for POI names according to Embodiment 4 of the present invention;

图6为本发明实施例五提供的电子设备的结构示意图。FIG. 6 is a schematic structural diagram of an electronic device according to Embodiment 5 of the present invention.

具体实施方式Detailed ways

下面结合附图和实施例对本发明作进一步的详细说明。可以理解的是,此处所描述的具体实施例仅仅用于解释本发明,而非对本发明的限定。另外还需要说明的是,为了便于描述,附图中仅示出了与本发明相关的部分而非全部内容。The present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention. In addition, it should be noted that, for the convenience of description, the drawings only show some but not all of the contents related to the present invention.

实施例一Example 1

图1为本发明实施例一提供的兴趣点名称的挖掘方法的流程示意图,该方法可以由兴趣点名称的挖掘装置或者电子设备来执行,该装置或者电子设备可以由软件和/或硬件的方式实现,该装置或者电子设备可以集成在任何具有网络通信功能的智能设备中。如图1所示,兴趣点名称的挖掘方法可以包括以下步骤:1 is a schematic flowchart of a method for mining POI names according to Embodiment 1 of the present invention. The method may be executed by a POI name mining device or electronic device, and the device or electronic device may be implemented by software and/or hardware. Implementation, the apparatus or electronic device can be integrated in any smart device with network communication function. As shown in Figure 1, the mining method of POI names can include the following steps:

S101、在历史搜索日志中获取各个用户的搜索会话。S101. Acquire search sessions of each user from a historical search log.

在本发明的具体实施例中,电子设备可以在历史搜索日志中获取各个用户的搜索会话。具体地,电子设备可以先在历史搜索日志中获取各个用户的全部POI搜索词和各个POI搜索词对应的搜索时间;然后根据各个POI搜索词对应的搜索时间将各个用户的全部POI搜索词划分到与其对应的搜索会话中;其中,各个搜索会话中的所有POI搜索词来自同一个用户。例如,假设某一个用户的历史搜索日志包括N个POI搜索词,分别为:POI搜索词1、POI搜索词2、…、POI搜索词N;其中,N为大于1的自然数;其中,POI搜索词1对应的搜索时间为搜索时间1、POI搜索词2对应的搜索时间为搜索时间2、…、POI搜索词N对应的搜索时间为搜索时间N。在本发明的具体实施例中,如果两个连续的POI搜索词对应的搜索时间的差值小于或者等于预设阈值,表示这两个连续的POI搜索词属于同一个搜索会话;如果两个连续的POI搜索词对应的搜索时间的差值大于预设阈值,表示这两个联系的POI搜索词不属于同一个搜索会话。In a specific embodiment of the present invention, the electronic device may acquire the search sessions of each user in the historical search log. Specifically, the electronic device can first obtain all the POI search terms of each user and the search time corresponding to each POI search term in the historical search log; then, according to the search time corresponding to each POI search term, all POI search terms of each user are divided into in the corresponding search session; wherein, all POI search terms in each search session are from the same user. For example, suppose a user's historical search log includes N POI search terms, namely: POI search term 1, POI search term 2, ..., POI search term N; where N is a natural number greater than 1; where POI search term The search time corresponding to word 1 is search time 1, the search time corresponding to POI search word 2 is search time 2, ..., and the search time corresponding to POI search word N is search time N. In a specific embodiment of the present invention, if the difference between the search times corresponding to two consecutive POI search terms is less than or equal to a preset threshold, it means that the two consecutive POI search terms belong to the same search session; if the two consecutive POI search terms belong to the same search session; The difference between the search times corresponding to the POI search terms is greater than the preset threshold, indicating that the two related POI search terms do not belong to the same search session.

S102、根据各个用户的搜索会话确定各个搜索会话对应的兴趣点名称对;其中,兴趣点名称对中的兴趣点名称属于同一个搜索会话。S102. Determine, according to the search sessions of each user, a point-of-interest name pair corresponding to each search session; wherein, the point-of-interest names in the point-of-interest name pair belong to the same search session.

在本发明的具体实施例中,电子设备可以根据各个用户的搜索会话确定各个搜索会话对应的兴趣点名称对;其中,兴趣点名称对中的兴趣点名称属于同一个搜索会话。例如,假设某一个用户的历史搜索日志可以包括N个POI搜索词,分别为:POI搜索词1、POI搜索词2、…、POI搜索词N;其中,N为大于1的自然数;其中,POI搜索词1对应的搜索时间为搜索时间1、POI搜索词2对应的搜索时间为搜索时间2、…、POI搜索词N对应的搜索时间为搜索时间N。假设POI搜索词1对应的搜索时间1与搜索词2对应的搜索时间2的小于预设阈值,表示POI搜索词1和POI搜索词2属于同一个搜索会话,因此,可以将POI搜索词1和POI搜索词2确定为该搜索会话对应的兴趣点名称对。再假设POI搜索词2对应的搜索时间2与搜索词3对应的搜索时间3的小于预设阈值,表示POI搜索词2和POI搜索词3也属于同一个搜索会话,因此,还可以将POI搜索词2和POI搜索词3确定为该搜索会话对应的兴趣点名称对。In a specific embodiment of the present invention, the electronic device may determine the POI name pairs corresponding to each search session according to the search sessions of each user; wherein the POI names in the POI name pairs belong to the same search session. For example, suppose a user's historical search log may include N POI search terms, namely: POI search term 1, POI search term 2, ..., POI search term N; where N is a natural number greater than 1; where POI The search time corresponding to the search term 1 is the search time 1, the search time corresponding to the POI search term 2 is the search time 2, ..., and the search time corresponding to the POI search term N is the search time N. Assuming that the search time 1 corresponding to the POI search term 1 and the search time 2 corresponding to the search term 2 are less than the preset threshold, it means that the POI search term 1 and the POI search term 2 belong to the same search session. The POI search term 2 is determined as a point-of-interest name pair corresponding to the search session. It is further assumed that the search time 2 corresponding to the POI search term 2 and the search time 3 corresponding to the search term 3 are less than the preset threshold, indicating that the POI search term 2 and the POI search term 3 also belong to the same search session. Therefore, the POI search term can also be searched. Word 2 and POI search word 3 are determined as the point-of-interest name pair corresponding to the search session.

S103、在预先确定的搜索特征库中确定各个兴趣点名称对对应的搜索特征。S103: Determine the search feature corresponding to each point of interest name pair in a predetermined search feature library.

在本发明的具体实施例中,电子设备可以在预先确定的搜索特征库中确定各个兴趣点名称对对应的搜索特征。具体地,电子设备可以在预先确定的搜索结果特征库中确定各个兴趣点名称对对应的搜索结果特征;其中,兴趣点名称对包括:第一兴趣点名称和第二兴趣点名称;在本发明的具体实施例中,要求兴趣点名称对中,第二个POI搜索词能对应到电子地图数据库中的一个POI;确定第二个POI搜索词能对应到电子地图数据库中的一个POI的方法是:用户在搜索第二个POI搜索词时,通过POI搜索引擎可以获取到第二个POI搜索词的搜索结果,并且点击了第二个POI搜索词的搜索结果中的POI,点击的POI就作为第二POI搜索词对应的POI。另外,电子设备还可以在预先确定的用户行为特征库中确定各个兴趣点名称对对应的用户行为特征;其中,用户行为特征包括:第一兴趣点名称的搜索热度特征以及各个兴趣点名称对的搜索共现特征。In a specific embodiment of the present invention, the electronic device may determine a search feature corresponding to each point of interest name pair in a predetermined search feature library. Specifically, the electronic device may determine the search result feature corresponding to each POI name pair in the predetermined search result feature library; wherein, the POI name pair includes: the first POI name and the second POI name; in the present invention In the specific embodiment of , the name of the point of interest is required to be in the middle, and the second POI search term can correspond to a POI in the electronic map database; the method for determining that the second POI search term can correspond to a POI in the electronic map database is: : When the user searches for the second POI search term, the user can obtain the search result of the second POI search term through the POI search engine, and clicks the POI in the search result of the second POI search term, and the clicked POI is used as The POI corresponding to the second POI search term. In addition, the electronic device may also determine the user behavior characteristics corresponding to each POI name pair in a predetermined user behavior characteristic database; wherein, the user behavior characteristics include: the search popularity characteristics of the first POI name and the corresponding user behavior characteristics of each POI name pair. Search for co-occurrence features.

S104、根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。S104 , mining the POI names in each POI name pair according to the POI name pair corresponding to each search session and the search feature corresponding to each POI name pair.

在本发明的具体实施例中,电子设备可以根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。具体地,电子设备可以将各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征输入到预先训练的判别模型中;通过该判别模型可以获取到各个兴趣点名称对对应的判别概率值;根据各个兴趣点名称对对应的判别概率值和预先确定的判别阈值,确定各个兴趣点名称对对应的判别结果。例如,对于兴趣点名称对(POI搜索词1,POI搜索词2),POI搜索词2为该POI的原始名称;该原始名称是在现有的电子地图中已经存在的名称;POI搜索词1为该POI的疑似候选名称;该疑似候选名称是在现有的电子地图中准备加入的备选名称。在本步骤中,电子设备可以将兴趣点名称对(POI搜索词1,POI搜索词2)以及该兴趣点名称对对应的搜索特征输入到预先训练的判别模型中;假设通过该判别模型获取到该兴趣点名称对对应的判别概率值为80%;假设预先确定的判别阈值为50%;则可以确定出POI搜索词1为POI搜索词2对应的POI的候选名称。In a specific embodiment of the present invention, the electronic device may mine the POI names in each POI name pair according to the POI name pairs corresponding to each search session and the search features corresponding to each POI name pair. Specifically, the electronic device can input the name pairs of the points of interest corresponding to each search session and the search features corresponding to the name pairs of the points of interest into the pre-trained discriminant model; through the discriminant model, the corresponding discriminants of the name pairs of the respective points of interest can be obtained Probability value; according to the corresponding discrimination probability value of each interest point name pair and the predetermined discrimination threshold, determine the discrimination result corresponding to each interest point name pair. For example, for a point-of-interest name pair (POI search term 1, POI search term 2), POI search term 2 is the original name of the POI; the original name is a name that already exists in an existing electronic map; POI search term 1 It is the suspected candidate name of the POI; the suspected candidate name is the candidate name to be added to the existing electronic map. In this step, the electronic device may input the POI name pair (POI search term 1, POI search term 2) and the search feature corresponding to the POI name pair into the pre-trained discriminant model; it is assumed that the discriminant model obtains The corresponding discrimination probability value of the POI name pair is 80%; assuming that the predetermined discrimination threshold is 50%; it can be determined that POI search term 1 is a candidate name of POI corresponding to POI search term 2.

本发明实施例提出的兴趣点名称的挖掘方法,先在历史搜索日志中获取各个用户的搜索会话;然后根据各个用户的搜索会话确定各个搜索会话对应的兴趣点名称对;在预先确定的搜索特征库中确定各个兴趣点名称对对应的搜索特征;再根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。也就是说,在本发明的技术方案中,可以根据各个用户的搜索会话确定各个搜索会话对应的兴趣点名称对;在确定出各个兴趣点名称对对应的搜索特征之后,再根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。而在现有的兴趣点名称的挖掘方法中,通常采用人工方式挖掘兴趣点名称,不仅挖掘效率较低,而且准确率也无法保证。因此,和现有技术相比,本发明实施例提出的兴趣点名称的挖掘方法,不仅可以提高兴趣点名称的挖掘效率,还可以保证兴趣点名称的准确率;并且,本发明实施例的技术方案实现简单方便、便于普及,适用范围更广。The method for mining POI names proposed by the embodiment of the present invention first acquires the search sessions of each user in the historical search log; then determines the POI name pairs corresponding to each search session according to the search sessions of each user; The search feature corresponding to each POI name pair is determined in the database; and then the POI name in each POI name pair is mined according to the POI name pair corresponding to each search session and the corresponding search feature of each POI name pair. That is to say, in the technical solution of the present invention, the point-of-interest name pair corresponding to each search session can be determined according to the search session of each user; after the search feature corresponding to each point-of-interest name pair is determined, the corresponding The POI name pairs of , and the corresponding search features of each POI name pair, and the POI names in each POI name pair are mined. However, in the existing mining methods of POI names, the POI names are usually mined manually, which not only has low mining efficiency, but also cannot guarantee the accuracy. Therefore, compared with the prior art, the mining method for POI names proposed by the embodiments of the present invention can not only improve the mining efficiency of POI names, but also ensure the accuracy of POI names; The implementation of the scheme is simple and convenient, easy to popularize, and has a wider application range.

实施例二Embodiment 2

图2为本发明实施例二提供的兴趣点名称的挖掘方法的流程示意图。如图2所示,兴趣点名称的挖掘方法可以包括以下步骤:FIG. 2 is a schematic flowchart of a method for mining POI names according to Embodiment 2 of the present invention. As shown in Figure 2, the mining method of POI names can include the following steps:

S201、在历史搜索日志中获取各个用户的全部POI搜索词和各个POI搜索词对应的搜索时间。S201. Acquire all the POI search terms of each user and the search time corresponding to each POI search term from the historical search log.

在本发明的具体实施例中,电子设备可以在历史搜索日志中获取各个用户的全部POI搜索词和各个POI搜索词对应的搜索时间。例如,假设某一个用户的历史搜索日志可以包括N个POI搜索词,分别为:POI搜索词1、POI搜索词2、…、POI搜索词N;其中,N为大于1的自然数;其中,POI搜索词1对应的搜索时间为搜索时间1、POI搜索词2对应的搜索时间为搜索时间2、…、POI搜索词N对应的搜索时间为搜索时间N。In a specific embodiment of the present invention, the electronic device may acquire all the POI search terms of each user and the search time corresponding to each POI search term from the historical search log. For example, suppose a user's historical search log may include N POI search terms, namely: POI search term 1, POI search term 2, ..., POI search term N; where N is a natural number greater than 1; where POI The search time corresponding to the search term 1 is the search time 1, the search time corresponding to the POI search term 2 is the search time 2, ..., and the search time corresponding to the POI search term N is the search time N.

S202、根据各个POI搜索词对应的搜索时间将各个用户的全部POI搜索词划分到与其对应的搜索会话中;其中,各个搜索会话中的所有POI搜索词来自同一个用户。S202: Divide all the POI search terms of each user into their corresponding search sessions according to the search time corresponding to each POI search term; wherein, all POI search terms in each search session are from the same user.

在本发明的具体实施例中,电子设备可以根据各个POI搜索词对应的搜索时间将各个用户的全部POI搜索词划分到与其对应的搜索会话中;其中,各个搜索会话中的所有POI搜索词来自同一个用户。具体地,在本发明的具体实施例中,如果两个连续的POI搜索词对应的搜索时间的差值小于或者等于预设阈值,表示这两个连续的POI搜索词属于同一个搜索会话;如果两个连续的POI搜索词对应的搜索时间的差值大于预设阈值,表示这两个联系的POI搜索词不属于同一个搜索会话。例如,假设POI搜索词1对应的搜索时间1与搜索词2对应的搜索时间2的小于预设阈值,表示POI搜索词1和POI搜索词2属于同一个搜索会话,因此,在本步骤中,电子设备可以将POI搜索词1和POI搜索词2划分到该搜索会话中;再假设POI搜索词2对应的搜索时间2与搜索词3对应的搜索时间3的小于预设阈值,表示POI搜索词2和POI搜索词3也属于同一个搜索会话,因此,在本步骤中,电子设备也可以将POI搜索词2和POI搜索词3划分到该搜索会话中。In a specific embodiment of the present invention, the electronic device may divide all the POI search terms of each user into their corresponding search sessions according to the search time corresponding to each POI search term; wherein, all POI search terms in each search session come from the same user. Specifically, in a specific embodiment of the present invention, if the difference between the search times corresponding to two consecutive POI search terms is less than or equal to a preset threshold, it means that the two consecutive POI search terms belong to the same search session; if The difference between the search times corresponding to two consecutive POI search terms is greater than the preset threshold, indicating that the two related POI search terms do not belong to the same search session. For example, assuming that the search time 1 corresponding to the POI search term 1 and the search time 2 corresponding to the search term 2 are less than the preset threshold, it means that the POI search term 1 and the POI search term 2 belong to the same search session. Therefore, in this step, The electronic device can divide the POI search term 1 and the POI search term 2 into the search session; it is then assumed that the search time 2 corresponding to the POI search term 2 and the search time 3 corresponding to the search term 3 are less than the preset threshold, indicating that the POI search term 2 and POI search term 3 also belong to the same search session. Therefore, in this step, the electronic device may also divide POI search term 2 and POI search term 3 into the search session.

S203、在各个用户的搜索会话中确定出各个搜索会话中的POI搜索词。S203. Determine the POI search words in each search session in each user's search session.

在本发明的具体实施例中,电子设备可以在各个用户的搜索会话中确定出各个搜索会话中的POI搜索词。例如,假设某一个用户的搜索会话包括三个POI搜索词,分别为:POI搜索词q1、POI搜索词q2和POI搜索词q3;在本步骤中,电子设备可以在该搜索会话中确定出POI搜索词q1、POI搜索词q2和POI搜索词q3。In a specific embodiment of the present invention, the electronic device may determine the POI search terms in each search session of each user in each user's search session. For example, suppose a user's search session includes three POI search terms, namely: POI search term q1, POI search term q2, and POI search term q3; in this step, the electronic device can determine the POI search term in the search session Search term q1, POI search term q2, and POI search term q3.

S204、将各个搜索会话中每两个POI搜索词组合为一个兴趣点名称对,作为各个搜索会话对应的兴趣点名称对。S204. Combine every two POI search terms in each search session into a point-of-interest name pair, which is used as a point-of-interest name pair corresponding to each search session.

在本发明的具体实施例中,电子设备可以将各个搜索会话中每两个POI搜索词组合为一个兴趣点名称对,作为各个搜索会话对应的兴趣点名称对。例如,假设某一个用户的搜索会话包括三个POI搜索词,分别为:POI搜索词q1、POI搜索词q2和POI搜索词q3;在本步骤中,电子设备可以将POI搜索词q1和POI搜索词q2组合为一个兴趣点名称对,作为该搜索会话对应的兴趣点名称对;还可以将POI搜索词q2和POI搜索词q3组合为一个兴趣点名称对,作为该搜索会话对应的兴趣点名称对;还可以将POI搜索词q1和POI搜索词q3组合为一个兴趣点名称对,作为该搜索会话对应的兴趣点名称对。In a specific embodiment of the present invention, the electronic device may combine every two POI search terms in each search session into a point-of-interest name pair, as a point-of-interest name pair corresponding to each search session. For example, suppose a user's search session includes three POI search terms, namely: POI search term q1, POI search term q2, and POI search term q3; in this step, the electronic device can search POI search term q1 and POI search term The word q2 is combined into a point-of-interest name pair, which is the point-of-interest name pair corresponding to the search session; the POI search word q2 and the POI search word q3 can also be combined into a point-of-interest name pair, which is used as the point-of-interest name corresponding to the search session. Yes; the POI search term q1 and the POI search term q3 can also be combined into a point-of-interest name pair, which is used as a point-of-interest name pair corresponding to the search session.

S205、在预先确定的搜索特征库中确定各个兴趣点名称对对应的搜索特征。S205: Determine the search feature corresponding to each point of interest name pair in a predetermined search feature library.

在本发明的具体实施例中,电子设备可以在预先确定的搜索特征库中确定各个兴趣点名称对对应的搜索特征。具体地,电子设备可以在预先确定的搜索结果特征库中确定各个兴趣点名称对对应的搜索结果特征;其中,兴趣点名称对包括:第一兴趣点名称和第二兴趣点名称;在本发明的具体实施例中,要求兴趣点名称对中,第二个POI搜索词能对应到电子地图数据库中的一个POI;确定第二个POI搜索词能对应到电子地图数据库中的一个POI的方法是:用户在搜索第二个POI搜索词时,通过POI搜索引擎可以获取到第二个POI搜索词的搜索结果,并且点击了第二个POI搜索词的搜索结果中的POI,点击的POI就作为第二POI搜索词对应的POI。另外,电子设备还可以在预先确定的用户行为特征库中确定各个兴趣点名称对对应的用户行为特征;其中,用户行为特征包括:第一兴趣点名称的搜索热度特征以及各个兴趣点名称对的搜索共现特征。In a specific embodiment of the present invention, the electronic device may determine a search feature corresponding to each point of interest name pair in a predetermined search feature database. Specifically, the electronic device may determine the search result feature corresponding to each POI name pair in the predetermined search result feature library; wherein, the POI name pair includes: the first POI name and the second POI name; in the present invention In the specific embodiment of , the name of the point of interest is required to be in the middle, and the second POI search term can correspond to a POI in the electronic map database; the method for determining that the second POI search term can correspond to a POI in the electronic map database is: : When the user searches for the second POI search term, the user can obtain the search result of the second POI search term through the POI search engine, and clicks the POI in the search result of the second POI search term, and the clicked POI is used as The POI corresponding to the second POI search term. In addition, the electronic device may also determine the user behavior characteristics corresponding to each POI name pair in a predetermined user behavior characteristic database; wherein, the user behavior characteristics include: the search popularity characteristics of the first POI name and the corresponding user behavior characteristics of each POI name pair. Search for co-occurrence features.

S206、根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。S206 , mining the POI names in each POI name pair according to the POI name pair corresponding to each search session and the search feature corresponding to each POI name pair.

在本发明的具体实施例中,电子设备可以根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。具体地,电子设备可以将各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征输入到预先训练的判别模型中;通过该判别模型可以获取到各个兴趣点名称对对应的判别概率值;根据各个兴趣点名称对对应的判别概率值和预先确定的判别阈值,确定各个兴趣点名称对对应的判别结果。例如,对于兴趣点名称对(POI搜索词1,POI搜索词2),POI搜索词2为该POI的原始名称;该原始名称是在现有的电子地图中已经存在的名称;POI搜索词1为该POI的疑似候选名称;该疑似候选名称是在现有的电子地图中准备加入的备选名称。在本步骤中,电子设备可以将兴趣点名称对(POI搜索词1,POI搜索词2)以及该兴趣点名称对对应的搜索特征输入到预先训练的判别模型中;假设通过该判别模型获取到该兴趣点名称对对应的判别概率值为80%;假设预先确定的判别阈值为50%;则可以确定出POI搜索词1为POI搜索词2对应的POI的候选名称。In a specific embodiment of the present invention, the electronic device may mine the POI names in each POI name pair according to the POI name pairs corresponding to each search session and the search features corresponding to each POI name pair. Specifically, the electronic device can input the name pairs of the points of interest corresponding to each search session and the search features corresponding to the name pairs of the points of interest into the pre-trained discriminant model; through the discriminant model, the corresponding discriminants of the name pairs of the respective points of interest can be obtained Probability value; according to the corresponding discrimination probability value of each interest point name pair and the predetermined discrimination threshold, determine the discrimination result corresponding to each interest point name pair. For example, for a point-of-interest name pair (POI search term 1, POI search term 2), POI search term 2 is the original name of the POI; the original name is a name that already exists in an existing electronic map; POI search term 1 It is the suspected candidate name of the POI; the suspected candidate name is the candidate name to be added to the existing electronic map. In this step, the electronic device may input the POI name pair (POI search term 1, POI search term 2) and the search feature corresponding to the POI name pair into the pre-trained discriminant model; it is assumed that the discriminant model obtains The corresponding discrimination probability value of the POI name pair is 80%; assuming that the predetermined discrimination threshold is 50%; it can be determined that POI search term 1 is a candidate name of POI corresponding to POI search term 2.

本发明实施例提出的兴趣点名称的挖掘方法,先在历史搜索日志中获取各个用户的搜索会话;然后根据各个用户的搜索会话确定各个搜索会话对应的兴趣点名称对;在预先确定的搜索特征库中确定各个兴趣点名称对对应的搜索特征;再根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。也就是说,在本发明的技术方案中,可以根据各个用户的搜索会话确定各个搜索会话对应的兴趣点名称对;在确定出各个兴趣点名称对对应的搜索特征之后,再根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。而在现有的兴趣点名称的挖掘方法中,通常采用人工方式挖掘兴趣点名称,不仅挖掘效率较低,而且准确率也无法保证。因此,和现有技术相比,本发明实施例提出的兴趣点名称的挖掘方法,不仅可以提高兴趣点名称的挖掘效率,还可以保证兴趣点名称的准确率;并且,本发明实施例的技术方案实现简单方便、便于普及,适用范围更广。The method for mining POI names proposed by the embodiment of the present invention first acquires the search sessions of each user in the historical search log; then determines the POI name pairs corresponding to each search session according to the search sessions of each user; The search feature corresponding to each POI name pair is determined in the database; and then the POI name in each POI name pair is mined according to the POI name pair corresponding to each search session and the corresponding search feature of each POI name pair. That is to say, in the technical solution of the present invention, the point-of-interest name pair corresponding to each search session can be determined according to the search session of each user; after the search feature corresponding to each point-of-interest name pair is determined, the corresponding The POI name pairs of , and the corresponding search features of each POI name pair, and the POI names in each POI name pair are mined. However, in the existing mining methods of POI names, the POI names are usually mined manually, which not only has low mining efficiency, but also cannot guarantee the accuracy. Therefore, compared with the prior art, the mining method for POI names proposed by the embodiments of the present invention can not only improve the mining efficiency of POI names, but also ensure the accuracy of POI names; The implementation of the scheme is simple and convenient, easy to popularize, and has a wider application range.

实施例三Embodiment 3

图3为本发明实施例三提供的兴趣点名称的挖掘方法的流程示意图。如图3所示,兴趣点名称的挖掘方法可以包括以下步骤:FIG. 3 is a schematic flowchart of a method for mining POI names according to Embodiment 3 of the present invention. As shown in Figure 3, the mining method of POI names can include the following steps:

S301、在历史搜索日志中获取各个用户的全部POI搜索词和各个POI搜索词对应的搜索时间。S301. Acquire all the POI search terms of each user and the search time corresponding to each POI search term from the historical search log.

在本发明的具体实施例中,电子设备可以在历史搜索日志中获取各个用户的全部POI搜索词和各个POI搜索词对应的搜索时间。例如,假设某一个用户的历史搜索日志可以包括N个POI搜索词,分别为:POI搜索词1、POI搜索词2、…、POI搜索词N;其中,N为大于1的自然数;其中,POI搜索词1对应的搜索时间为搜索时间1、POI搜索词2对应的搜索时间为搜索时间2、…、POI搜索词N对应的搜索时间为搜索时间N。In a specific embodiment of the present invention, the electronic device may acquire all the POI search terms of each user and the search time corresponding to each POI search term from the historical search log. For example, suppose a user's historical search log may include N POI search terms, namely: POI search term 1, POI search term 2, ..., POI search term N; where N is a natural number greater than 1; where POI The search time corresponding to the search term 1 is the search time 1, the search time corresponding to the POI search term 2 is the search time 2, ..., and the search time corresponding to the POI search term N is the search time N.

S302、根据各个POI搜索词对应的搜索时间将各个用户的全部POI搜索词划分到与其对应的搜索会话中;其中,各个搜索会话中的所有POI搜索词来自同一个用户。S302. Divide all the POI search terms of each user into the corresponding search sessions according to the search time corresponding to each POI search term; wherein, all the POI search terms in each search session are from the same user.

在本发明的具体实施例中,电子设备可以根据各个POI搜索词对应的搜索时间将各个用户的全部POI搜索词划分到与其对应的搜索会话中;其中,各个搜索会话中的所有POI搜索词来自同一个用户。具体地,在本发明的具体实施例中,如果两个连续的POI搜索词对应的搜索时间的差值小于或者等于预设阈值,表示这两个连续的POI搜索词属于同一个搜索会话;如果两个连续的POI搜索词对应的搜索时间的差值大于预设阈值,表示这两个联系的POI搜索词不属于同一个搜索会话。例如,假设POI搜索词1对应的搜索时间1与搜索词2对应的搜索时间2的小于预设阈值,表示POI搜索词1和POI搜索词2属于同一个搜索会话,因此,在本步骤中,电子设备可以将POI搜索词1和POI搜索词2划分到该搜索会话中;再假设POI搜索词2对应的搜索时间2与搜索词3对应的搜索时间3的小于预设阈值,表示POI搜索词2和POI搜索词3也属于同一个搜索会话,因此,在本步骤中,电子设备也可以将POI搜索词2和POI搜索词3划分到该搜索会话中。In a specific embodiment of the present invention, the electronic device may divide all the POI search terms of each user into their corresponding search sessions according to the search time corresponding to each POI search term; wherein, all POI search terms in each search session come from the same user. Specifically, in a specific embodiment of the present invention, if the difference between the search times corresponding to two consecutive POI search terms is less than or equal to a preset threshold, it means that the two consecutive POI search terms belong to the same search session; if The difference between the search times corresponding to two consecutive POI search terms is greater than the preset threshold, indicating that the two related POI search terms do not belong to the same search session. For example, assuming that the search time 1 corresponding to the POI search term 1 and the search time 2 corresponding to the search term 2 are less than the preset threshold, it means that the POI search term 1 and the POI search term 2 belong to the same search session. Therefore, in this step, The electronic device can divide the POI search term 1 and the POI search term 2 into the search session; it is then assumed that the search time 2 corresponding to the POI search term 2 and the search time 3 corresponding to the search term 3 are less than the preset threshold, indicating that the POI search term 2 and POI search term 3 also belong to the same search session. Therefore, in this step, the electronic device may also divide POI search term 2 and POI search term 3 into the search session.

S303、在各个用户的搜索会话中确定出各个搜索会话中的POI搜索词。S303. Determine the POI search words in each search session in each user's search session.

在本发明的具体实施例中,电子设备可以在各个用户的搜索会话中确定出各个搜索会话中的POI搜索词。例如,假设某一个用户的搜索会话包括三个POI搜索词,分别为:POI搜索词q1、POI搜索词q2和POI搜索词q3;在本步骤中,电子设备可以在该搜索会话中确定出POI搜索词q1、POI搜索词q2和POI搜索词q3。In a specific embodiment of the present invention, the electronic device may determine the POI search terms in each search session of each user in each user's search session. For example, suppose a user's search session includes three POI search terms, namely: POI search term q1, POI search term q2, and POI search term q3; in this step, the electronic device can determine the POI search term in the search session Search term q1, POI search term q2, and POI search term q3.

S304、将各个搜索会话中每两个POI搜索词组合为一个兴趣点名称对,作为各个搜索会话对应的兴趣点名称对。S304. Combine every two POI search terms in each search session into a point-of-interest name pair, which is used as a point-of-interest name pair corresponding to each search session.

在本发明的具体实施例中,电子设备可以将各个搜索会话中每两个POI搜索词组合为一个兴趣点名称对,作为各个搜索会话对应的兴趣点名称对。例如,假设某一个用户的搜索会话包括三个POI搜索词,分别为:POI搜索词q1、POI搜索词q2和POI搜索词q3;在本步骤中,电子设备可以将POI搜索词q1和POI搜索词q2组合为一个兴趣点名称对,作为该搜索会话对应的兴趣点名称对;还可以将POI搜索词q2和POI搜索词q3组合为一个兴趣点名称对,作为该搜索会话对应的兴趣点名称对;还可以将POI搜索词q1和POI搜索词q3组合为一个兴趣点名称对,作为该搜索会话对应的兴趣点名称对。In a specific embodiment of the present invention, the electronic device may combine every two POI search terms in each search session into a point-of-interest name pair, as a point-of-interest name pair corresponding to each search session. For example, suppose a user's search session includes three POI search terms, namely: POI search term q1, POI search term q2, and POI search term q3; in this step, the electronic device can search POI search term q1 and POI search term The word q2 is combined into a point-of-interest name pair, which is the point-of-interest name pair corresponding to the search session; the POI search word q2 and the POI search word q3 can also be combined into a point-of-interest name pair, which is used as the point-of-interest name corresponding to the search session. Yes; the POI search term q1 and the POI search term q3 can also be combined into a point-of-interest name pair, which is used as a point-of-interest name pair corresponding to the search session.

S305、在预先确定的搜索结果特征库中确定各个兴趣点名称对对应的搜索结果特征。S305: Determine the search result feature corresponding to each POI name pair in a predetermined search result feature library.

在本发明的具体实施例中,电子设备可以在预先确定的搜索结果特征库中确定各个兴趣点名称对对应的搜索结果特征。通用搜索引擎的检索结果在相关性上效果很好,对于兴趣点名称对(qi,qj),如果qi是qj的候选名称,那么qi的搜索结果很可能包含qj代表的字符串;反之,搜索结果中出现qj代表的字符串的概率很低。因此,在本发明的具体实施例中,电子设备可以用qi作为通用搜索引擎的搜索词,在通用搜索引擎中获取搜索结果,然后在搜索结果中统计qj出现的次数,作为该兴趣点名称对对应的搜索结果特征。这里,通用搜索引擎是用于搜索POI和非POI的搜索引擎;POI搜索引擎是专门用于搜索POI的搜索引擎。In a specific embodiment of the present invention, the electronic device may determine the search result feature corresponding to each point of interest name pair in a predetermined search result feature library. The retrieval results of general search engines are very effective in terms of relevance. For POI name pairs (q i , q j ), if q i is a candidate name of q j , then the search results of q i are likely to contain the name represented by q j . string; on the contrary, the probability of the string represented by q j appearing in the search results is very low. Therefore, in a specific embodiment of the present invention, the electronic device can use qi as a search term of a general search engine, obtain search results in the general search engine, and then count the number of times q j appears in the search results as the point of interest The name pair corresponds to the corresponding search result feature. Here, a general search engine is a search engine for searching POIs and non-POIs; a POI search engine is a search engine specifically for searching for POIs.

S306、在预先确定的用户行为特征库中确定各个兴趣点名称对对应的用户行为特征;其中,用户行为特征包括:第一兴趣点名称的搜索热度特征以及各个兴趣点名称对的搜索共现特征。S306. Determine user behavior features corresponding to each POI name pair in a predetermined user behavior feature database; wherein, the user behavior features include: a search popularity feature of the first POI name and a search co-occurrence feature of each POI name pair .

在本发明的具体实施例中,电子设备可以在预先确定的用户行为特征库中确定各个兴趣点名称对对应的用户行为特征;其中,用户行为特征包括:第一兴趣点名称的搜索热度特征以及各个兴趣点名称对的搜索共现特征。具体地,对于兴趣点名称对(qi,qj),如果qi的搜索热度特征很低,那么表示qi和qj表示同一个POI的概率就会很低。另外,qi和qj在同一个搜索会话中出现过,那么qi和qj表示同一个POI的概率就会很高。在本发明的具体实施例中,各个兴趣点名称对的搜索共现特征可以包括:各个兴趣点名称对的共现次数、各个兴趣点名称对的转移概率和各个兴趣点名称对的互信息。具体地,对于兴趣点名称对(qi,qj),该兴趣点名称对的共现次数为:在某一历史时段内所有用户的搜索会话中,qi和qj共同出现的搜索会话的次数cnt(qi,qj);该兴趣点名称对的转移概率为:在某一历史时段内所有用户的搜索会话中,该兴趣点名称对的共现的搜索会话的次数cnt(qi,qj)与qj的出现的搜索会话的次数cnt(qj)的比值;公式表示为该兴趣点名称对的互信息表示qi和qj之间的相互依赖性;用公式表示为其中,cnt(qi,qj)为在某一历史时段内所有用户的搜索会话中,qi和qj共同出现的搜索会话的次数;cnt(qi)为在某一历史时段内所有用户的搜索会话中,qi出现的搜索会话的次数;cnt(qi)为在某一历史时段内所有用户的搜索会话中,qj出现的搜索会话的次数。In a specific embodiment of the present invention, the electronic device may determine the user behavior characteristics corresponding to each POI name pair in a predetermined user behavior characteristic database; wherein the user behavior characteristics include: the search popularity characteristics of the first POI name and Search co-occurrence features for each POI name pair. Specifically, for the point-of-interest name pair (qi i , q j ), if the search popularity feature of qi is very low , the probability that qi and q j represent the same POI will be very low. In addition, qi and q j have appeared in the same search session , so the probability that qi and q j represent the same POI will be high. In a specific embodiment of the present invention, the search co-occurrence feature of each POI name pair may include: co-occurrence times of each POI name pair, transition probability of each POI name pair, and mutual information of each POI name pair. Specifically, for the POI name pair (qi i , q j ), the co-occurrence times of the POI name pair are: in the search sessions of all users within a certain historical period, the search sessions in which qi and q j co -occur The number of times cnt(q i , q j ); the transition probability of this POI name pair is: in the search sessions of all users in a certain historical period, the number of co-occurrence search sessions of the POI name pair cnt(q The ratio of i , q j ) to the number of times cnt(q j ) of the search session where q j appeared; the formula is expressed as The mutual information of this POI name pair represents the interdependence between qi and q j ; it is expressed as Among them, cnt(q i , q j ) is the number of search sessions in which qi and q j co -occur in the search sessions of all users in a certain historical period; cnt(q i ) is all the search sessions in a certain historical period In the user's search sessions, the number of search sessions in which q i appears; cnt(q i ) is the number of times in which q j appears in the search sessions of all users in a certain historical period.

S307、根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。S307 , mining the POI names in each POI name pair according to the POI name pair corresponding to each search session and the search feature corresponding to each POI name pair.

在本发明的具体实施例中,电子设备可以根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。具体地,电子设备可以将各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征输入到预先训练的判别模型中;通过该判别模型可以获取到各个兴趣点名称对对应的判别概率值;根据各个兴趣点名称对对应的判别概率值和预先确定的判别阈值,确定各个兴趣点名称对对应的判别结果。例如,对于兴趣点名称对(POI搜索词1,POI搜索词2),POI搜索词2为该POI的原始名称;该原始名称是在现有的电子地图中已经存在的名称;POI搜索词1为该POI的疑似候选名称;该疑似候选名称是在现有的电子地图中准备加入的备选名称。在本步骤中,电子设备可以将兴趣点名称对(POI搜索词1,POI搜索词2)以及该兴趣点名称对对应的搜索特征输入到预先训练的判别模型中;假设通过该判别模型获取到该兴趣点名称对对应的判别概率值为80%;假设预先确定的判别阈值为50%;则可以确定出POI搜索词1为POI搜索词的候选名称。In a specific embodiment of the present invention, the electronic device may mine the POI names in each POI name pair according to the POI name pairs corresponding to each search session and the search features corresponding to each POI name pair. Specifically, the electronic device can input the name pairs of the points of interest corresponding to each search session and the search features corresponding to the name pairs of the points of interest into the pre-trained discriminant model; through the discriminant model, the corresponding discriminants of the name pairs of the respective points of interest can be obtained Probability value; according to the corresponding discrimination probability value of each interest point name pair and the predetermined discrimination threshold, determine the discrimination result corresponding to each interest point name pair. For example, for a point-of-interest name pair (POI search term 1, POI search term 2), POI search term 2 is the original name of the POI; the original name is a name that already exists in an existing electronic map; POI search term 1 It is the suspected candidate name of the POI; the suspected candidate name is the candidate name to be added to the existing electronic map. In this step, the electronic device may input the POI name pair (POI search term 1, POI search term 2) and the search feature corresponding to the POI name pair into the pre-trained discriminant model; it is assumed that the discriminant model obtains The corresponding discrimination probability value of the POI name pair is 80%; assuming that the predetermined discrimination threshold is 50%, it can be determined that the POI search term 1 is a candidate name of the POI search term.

本发明实施例提出的兴趣点名称的挖掘方法,先在历史搜索日志中获取各个用户的搜索会话;然后根据各个用户的搜索会话确定各个搜索会话对应的兴趣点名称对;在预先确定的搜索特征库中确定各个兴趣点名称对对应的搜索特征;再根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。也就是说,在本发明的技术方案中,可以根据各个用户的搜索会话确定各个搜索会话对应的兴趣点名称对;在确定出各个兴趣点名称对对应的搜索特征之后,再根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。而在现有的兴趣点名称的挖掘方法中,通常采用人工方式挖掘兴趣点名称,不仅挖掘效率较低,而且准确率也无法保证。因此,和现有技术相比,本发明实施例提出的兴趣点名称的挖掘方法,不仅可以提高兴趣点名称的挖掘效率,还可以保证兴趣点名称的准确率;并且,本发明实施例的技术方案实现简单方便、便于普及,适用范围更广。The method for mining POI names proposed by the embodiment of the present invention first acquires the search sessions of each user in the historical search log; then determines the POI name pairs corresponding to each search session according to the search sessions of each user; The search feature corresponding to each POI name pair is determined in the database; and then the POI name in each POI name pair is mined according to the POI name pair corresponding to each search session and the corresponding search feature of each POI name pair. That is to say, in the technical solution of the present invention, the point-of-interest name pair corresponding to each search session can be determined according to the search session of each user; after the search feature corresponding to each point-of-interest name pair is determined, the corresponding The POI name pairs of , and the corresponding search features of each POI name pair, and the POI names in each POI name pair are mined. However, in the existing mining methods of POI names, the POI names are usually mined manually, which not only has low mining efficiency, but also cannot guarantee the accuracy. Therefore, compared with the prior art, the mining method for POI names proposed by the embodiments of the present invention can not only improve the mining efficiency of POI names, but also ensure the accuracy of POI names; The implementation of the scheme is simple and convenient, easy to popularize, and has a wider application range.

实施例四Embodiment 4

图4为本发明实施例四提供的兴趣点名称的挖掘装置的第一结构示意图。如图4所示,本发明实施例所述的兴趣点名称的挖掘装置可以包括:获取模块401、确定模块402和挖掘模块403;其中,FIG. 4 is a schematic diagram of a first structure of an excavating device for a point of interest name provided in Embodiment 4 of the present invention. As shown in FIG. 4 , the device for mining POI names according to the embodiment of the present invention may include: an acquisition module 401, a determination module 402, and a mining module 403; wherein,

所述获取模块401,用于在历史搜索日志中获取各个用户的搜索会话;The obtaining module 401 is used to obtain the search session of each user in the historical search log;

所述确定模块402,用于根据各个用户的搜索会话确定各个搜索会话对应的兴趣点名称对;其中,所述兴趣点名称对中的兴趣点名称属于同一个搜索会话;在预先确定的搜索特征库中确定各个兴趣点名称对对应的搜索特征;The determining module 402 is configured to determine the POI name pairs corresponding to each search session according to the search sessions of each user; wherein, the POI names in the POI name pairs belong to the same search session; Determine the search feature corresponding to each POI name pair in the library;

所述挖掘模块403,用于根据各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征,对各个兴趣点名称对中的兴趣点名称进行挖掘。The mining module 403 is configured to mine the POI names in each POI name pair according to the POI name pair corresponding to each search session and the search feature corresponding to each POI name pair.

图5为本发明实施例四提供的兴趣点名称的挖掘装置的第二结构示意图。如图5所示,所述获取模块401包括:获取子模块4011和划分子模块4012;其中,FIG. 5 is a schematic diagram of the second structure of the excavation device for POI names according to Embodiment 4 of the present invention. As shown in FIG. 5, the acquisition module 401 includes: an acquisition sub-module 4011 and a division sub-module 4012; wherein,

所述获取子模块4011,用于在所述历史搜索日志中获取各个用户的全部POI搜索词和各个POI搜索词对应的搜索时间;The acquisition submodule 4011 is used to acquire all the POI search terms of each user and the search time corresponding to each POI search term in the historical search log;

所述划分子模块4012,用于根据各个POI搜索词对应的搜索时间将各个用户的全部POI搜索词划分到与其对应的搜索会话中;其中,各个搜索会话中的所有POI搜索词来自同一个用户。The dividing submodule 4012 is used to divide all the POI search terms of each user into their corresponding search sessions according to the search time corresponding to each POI search term; wherein, all POI search terms in each search session are from the same user .

进一步的,所述确定模块402,具体用于在各个用户的搜索会话中确定出各个搜索会话中的POI搜索词;将各个搜索会话中每两个POI搜索词组合为一个兴趣点名称对,作为各个搜索会话对应的兴趣点名称对。Further, the determining module 402 is specifically configured to determine the POI search terms in each search session in each user's search session; combine every two POI search terms in each search session into a point-of-interest name pair, as Point-of-interest name pairs corresponding to each search session.

进一步的,所述确定模块402,具体用于在预先确定的搜索结果特征库中确定各个兴趣点名称对对应的搜索结果特征;其中,所述兴趣点名称对包括:第一兴趣点名称和第二兴趣点名称;在预先确定的用户行为特征库中确定各个兴趣点名称对对应的用户行为特征;其中,所述用户行为特征包括:所述第一兴趣点名称的搜索热度特征以及各个兴趣点名称对的搜索共现特征。Further, the determining module 402 is specifically configured to determine the search result feature corresponding to each POI name pair in a predetermined search result feature library; wherein, the POI name pair includes: the first POI name and the first POI name pair. Two point of interest names; determine the user behavior characteristics corresponding to each point of interest name pair in a predetermined user behavior feature database; wherein, the user behavior characteristics include: the search popularity feature of the name of the first point of interest and each point of interest Search co-occurrence features for name pairs.

进一步的,所述挖掘模块403,具体用于将各个搜索会话对应的兴趣点名称对以及各个兴趣点名称对对应的搜索特征输入到预先训练的判别模型中;通过所述判别模型获取到各个兴趣点名称对对应的判别概率值;根据各个兴趣点名称对对应的判别概率值和预先确定的判别阈值,确定各个兴趣点名称对对应的判别结果。Further, the mining module 403 is specifically configured to input the name pairs of interest points corresponding to each search session and the search features corresponding to the name pairs of each interest point into the pre-trained discriminant model; obtain each interest point through the discriminant model. The point name pair corresponds to the discrimination probability value; the discrimination result corresponding to each interest point name pair is determined according to the discrimination probability value corresponding to each interest point name pair and the predetermined discrimination threshold.

上述兴趣点名称的挖掘装置可执行本发明任意实施例所提供的方法,具备执行方法相应的功能模块和有益效果。未在本实施例中详尽描述的技术细节,可参见本发明任意实施例提供的兴趣点名称的挖掘方法。The above-mentioned mining device for the name of the point of interest can execute the method provided by any embodiment of the present invention, and has corresponding functional modules and beneficial effects for executing the method. For technical details not described in detail in this embodiment, reference may be made to the method for mining POI names provided by any embodiment of the present invention.

实施例五Embodiment 5

图6为本发明实施例五提供的电子设备的结构示意图。图6示出了适于用来实现本发明实施方式的示例性电子设备的框图。图6显示的电子设备12仅仅是一个示例,不应对本发明实施例的功能和使用范围带来任何限制。FIG. 6 is a schematic structural diagram of an electronic device according to Embodiment 5 of the present invention. Figure 6 shows a block diagram of an exemplary electronic device suitable for use in implementing embodiments of the present invention. The electronic device 12 shown in FIG. 6 is only an example, and should not impose any limitations on the functions and scope of use of the embodiments of the present invention.

如图6所示,电子设备12以通用计算设备的形式表现。电子设备12的组件可以包括但不限于:一个或者多个处理器或者处理单元16,系统存储器28,连接不同系统组件(包括系统存储器28和处理单元16)的总线18。As shown in FIG. 6, the electronic device 12 takes the form of a general-purpose computing device. Components of electronic device 12 may include, but are not limited to, one or more processors or processing units 16 , system memory 28 , and a bus 18 connecting various system components including system memory 28 and processing unit 16 .

总线18表示几类总线结构中的一种或多种,包括存储器总线或者存储器控制器,外围总线,图形加速端口,处理器或者使用多种总线结构中的任意总线结构的局域总线。举例来说,这些体系结构包括但不限于工业标准体系结构(ISA)总线,微通道体系结构(MAC)总线,增强型ISA总线、视频电子标准协会(VESA)局域总线以及外围组件互连(PCI)总线。Bus 18 represents one or more of several types of bus structures, including a memory bus or memory controller, a peripheral bus, a graphics acceleration port, a processor, or a local bus using any of a variety of bus structures. By way of example, these architectures include, but are not limited to, Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MAC) bus, Enhanced ISA bus, Video Electronics Standards Association (VESA) local bus, and Peripheral Component Interconnect ( PCI) bus.

电子设备12典型地包括多种计算机系统可读介质。这些介质可以是任何能够被电子设备12访问的可用介质,包括易失性和非易失性介质,可移动的和不可移动的介质。Electronic device 12 typically includes a variety of computer system readable media. These media can be any available media that can be accessed by electronic device 12, including both volatile and non-volatile media, removable and non-removable media.

系统存储器28可以包括易失性存储器形式的计算机系统可读介质,例如随机存取存储器(RAM)30和/或高速缓存存储器32。电子设备12可以进一步包括其它可移动/不可移动的、易失性/非易失性计算机系统存储介质。仅作为举例,存储系统34可以用于读写不可移动的、非易失性磁介质(图6未显示,通常称为“硬盘驱动器”)。尽管图6中未示出,可以提供用于对可移动非易失性磁盘(例如“软盘”)读写的磁盘驱动器,以及对可移动非易失性光盘(例如CD-ROM,DVD-ROM或者其它光介质)读写的光盘驱动器。在这些情况下,每个驱动器可以通过一个或者多个数据介质接口与总线18相连。存储器28可以包括至少一个程序产品,该程序产品具有一组(例如至少一个)程序模块,这些程序模块被配置以执行本发明各实施例的功能。System memory 28 may include computer system readable media in the form of volatile memory, such as random access memory (RAM) 30 and/or cache memory 32 . Electronic device 12 may further include other removable/non-removable, volatile/non-volatile computer system storage media. For example only, storage system 34 may be used to read and write to non-removable, non-volatile magnetic media (not shown in FIG. 6, commonly referred to as a "hard drive"). Although not shown in Figure 6, a disk drive may be provided for reading and writing to removable non-volatile magnetic disks (eg "floppy disks"), as well as removable non-volatile optical disks (eg CD-ROM, DVD-ROM) or other optical media) to read and write optical drives. In these cases, each drive may be connected to bus 18 through one or more data media interfaces. Memory 28 may include at least one program product having a set (eg, at least one) of program modules configured to perform the functions of various embodiments of the present invention.

具有一组(至少一个)程序模块42的程序/实用工具40,可以存储在例如存储器28中,这样的程序模块42包括但不限于操作系统、一个或者多个应用程序、其它程序模块以及程序数据,这些示例中的每一个或某种组合中可能包括网络环境的实现。程序模块42通常执行本发明所描述的实施例中的功能和/或方法。A program/utility 40 having a set (at least one) of program modules 42, which may be stored, for example, in memory 28, such program modules 42 including, but not limited to, an operating system, one or more application programs, other program modules, and program data , each or some combination of these examples may include an implementation of a network environment. Program modules 42 generally perform the functions and/or methods of the described embodiments of the present invention.

电子设备12也可以与一个或多个外部设备14(例如键盘、指向设备、显示器24等)通信,还可与一个或者多个使得用户能与该电子设备12交互的设备通信,和/或与使得该电子设备12能与一个或多个其它计算设备进行通信的任何设备(例如网卡,调制解调器等等)通信。这种通信可以通过输入/输出(I/O)接口22进行。并且,电子设备12还可以通过网络适配器20与一个或者多个网络(例如局域网(LAN),广域网(WAN)和/或公共网络,例如因特网)通信。如图所示,网络适配器20通过总线18与电子设备12的其它模块通信。应当明白,尽管图6中未示出,可以结合电子设备12使用其它硬件和/或软件模块,包括但不限于:微代码、设备驱动器、冗余处理单元、外部磁盘驱动阵列、RAID系统、磁带驱动器以及数据备份存储系统等。The electronic device 12 may also communicate with one or more external devices 14 (eg, a keyboard, pointing device, display 24, etc.), with one or more devices that enable a user to interact with the electronic device 12, and/or with Any device (eg, network card, modem, etc.) that enables the electronic device 12 to communicate with one or more other computing devices. Such communication may take place through input/output (I/O) interface 22 . Also, the electronic device 12 may communicate with one or more networks (eg, a local area network (LAN), a wide area network (WAN), and/or a public network such as the Internet) through a network adapter 20 . As shown, network adapter 20 communicates with other modules of electronic device 12 via bus 18 . It should be appreciated that, although not shown in FIG. 6, other hardware and/or software modules may be used in conjunction with electronic device 12, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tapes drives and data backup storage systems, etc.

处理单元16通过运行存储在系统存储器28中的程序,从而执行各种功能应用以及数据处理,例如实现本发明实施例所提供的兴趣点名称的挖掘方法。The processing unit 16 executes various functional applications and data processing by running the programs stored in the system memory 28, for example, implementing the method for mining the name of the point of interest provided by the embodiment of the present invention.

实施例六Embodiment 6

本发明实施例六提供了一种计算机存储介质。Embodiment 6 of the present invention provides a computer storage medium.

本发明实施例的计算机可读存储介质,可以采用一个或多个计算机可读的介质的任意组合。计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子(非穷举的列表)包括:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本文件中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。The computer-readable storage medium of the embodiments of the present invention may adopt any combination of one or more computer-readable mediums. The computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium. The computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above. More specific examples (a non-exhaustive list) of computer readable storage media include: electrical connections having one or more wires, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), Erasable programmable read only memory (EPROM or flash memory), optical fiber, portable compact disk read only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the above. In this document, a computer-readable storage medium can be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.

计算机可读的信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读的信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。A computer-readable signal medium may include a propagated data signal in baseband or as part of a carrier wave, with computer-readable program code embodied thereon. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing. A computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device .

计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括——但不限于无线、电线、光缆、RF等等,或者上述的任意合适的组合。Program code embodied on a computer readable medium may be transmitted using any suitable medium, including - but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.

可以以一种或多种程序设计语言或其组合来编写用于执行本发明操作的计算机程序代码,所述程序设计语言包括面向对象的程序设计语言—诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言—诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括局域网(LAN)或广域网(WAN)—连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。Computer program code for carrying out operations of the present invention may be written in one or more programming languages, including object-oriented programming languages—such as Java, Smalltalk, C++, but also conventional Procedural programming language - such as the "C" language or similar programming language. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (eg, using an Internet service provider through Internet connection).

注意,上述仅为本发明的较佳实施例及所运用技术原理。本领域技术人员会理解,本发明不限于这里所述的特定实施例,对本领域技术人员来说能够进行各种明显的变化、重新调整和替代而不会脱离本发明的保护范围。因此,虽然通过以上实施例对本发明进行了较为详细的说明,但是本发明不仅仅限于以上实施例,在不脱离本发明构思的情况下,还可以包括更多其他等效实施例,而本发明的范围由所附的权利要求范围决定。Note that the above are only preferred embodiments of the present invention and applied technical principles. Those skilled in the art will understand that the present invention is not limited to the specific embodiments described herein, and various obvious changes, readjustments and substitutions can be made by those skilled in the art without departing from the protection scope of the present invention. Therefore, although the present invention has been described in detail through the above embodiments, the present invention is not limited to the above embodiments, and can also include more other equivalent embodiments without departing from the concept of the present invention. The scope is determined by the scope of the appended claims.

Claims (12)

1. A method for mining a point of interest name, the method comprising:
acquiring search sessions of all users from a historical search log;
determining interest point name pairs corresponding to the search sessions according to the search sessions of the users; wherein the interest point names in the interest point name pair belong to the same search session;
determining search features corresponding to the name pairs of the interest points in a predetermined search feature library;
and mining the interest point names in the interest point name pairs according to the interest point name pairs corresponding to the search sessions and the search characteristics corresponding to the interest point name pairs.
2. The method of claim 1, wherein obtaining search sessions of respective users in the historical search log comprises:
all POI search words of all the users and the search time corresponding to each POI search word are obtained from the historical search log;
dividing all POI search words of each user into corresponding search sessions according to the search time corresponding to each POI search word; wherein all POI search terms in each search session are from the same user.
3. The method of claim 1, wherein determining the interest point name pair corresponding to each search session according to the search session of each user comprises:
determining POI search words in each search session in the search session of each user;
and combining every two POI search words in each search session into an interest point name pair as the interest point name pair corresponding to each search session.
4. The method of claim 1, wherein determining the search feature corresponding to each interest point name pair in a predetermined search feature library comprises:
determining search result characteristics corresponding to the name pairs of the interest points in a predetermined search result characteristic library; wherein the interest point name pair comprises: a first point of interest name and a second point of interest name;
determining user behavior characteristics corresponding to the name pairs of the interest points in a predetermined user behavior characteristic library; wherein the user behavior characteristics include: the search heat characteristic of the first interest point name and the search co-occurrence characteristic of each interest point name pair.
5. The method of claim 1, wherein mining the interest point names in each pair of interest point names according to the pair of interest point names corresponding to each search session and the search features corresponding to each pair of interest point names comprises:
inputting the interest point name pairs corresponding to the search sessions and the search features corresponding to the interest point name pairs into a pre-trained discrimination model;
obtaining the discrimination probability value corresponding to each interest point name pair through the discrimination model;
and determining the discrimination result corresponding to each interest point name pair according to the discrimination probability value corresponding to each interest point name pair and a predetermined discrimination threshold value.
6. An apparatus for mining a point of interest name, the apparatus comprising: the system comprises an acquisition module, a determination module and a mining module; wherein,
the acquisition module is used for acquiring the search conversation of each user in the historical search log;
the determining module is used for determining interest point name pairs corresponding to the search sessions according to the search sessions of the users; wherein the interest point names in the interest point name pair belong to the same search session; determining search features corresponding to the name pairs of the interest points in a predetermined search feature library;
and the mining module is used for mining the interest point names in the interest point name pairs according to the interest point name pairs corresponding to the search sessions and the search characteristics corresponding to the interest point name pairs.
7. The apparatus of claim 6, wherein the obtaining module comprises: acquiring a submodule and a dividing submodule; wherein,
the acquisition sub-module is used for acquiring all POI search words of each user and search time corresponding to each POI search word in the historical search log;
the dividing submodule is used for dividing all POI search words of each user into corresponding search sessions according to the search time corresponding to each POI search word; wherein all POI search terms in each search session are from the same user.
8. The apparatus of claim 6, wherein:
the determining module is specifically configured to determine, in the search session of each user, a POI search term in each search session; and combining every two POI search words in each search session into an interest point name pair as the interest point name pair corresponding to each search session.
9. The apparatus of claim 6, wherein:
the determining module is specifically configured to determine search result features corresponding to the name pairs of the interest points in a predetermined search result feature library; wherein the interest point name pair comprises: a first point of interest name and a second point of interest name; determining user behavior characteristics corresponding to the name pairs of the interest points in a predetermined user behavior characteristic library; wherein the user behavior characteristics include: the search heat characteristic of the first interest point name and the search co-occurrence characteristic of each interest point name pair.
10. The apparatus of claim 6, wherein:
the mining module is specifically used for inputting the interest point name pairs corresponding to the search sessions and the search features corresponding to the interest point name pairs into a pre-trained discrimination model; obtaining the discrimination probability value corresponding to each interest point name pair through the discrimination model; and determining the discrimination result corresponding to each interest point name pair according to the discrimination probability value corresponding to each interest point name pair and a predetermined discrimination threshold value.
11. An electronic device, comprising:
one or more processors;
a memory for storing one or more programs,
when executed by the one or more processors, cause the one or more processors to implement the method of mining point of interest names according to any one of claims 1 to 5.
12. A storage medium on which a computer program is stored which, when being executed by a processor, implements a method of mining a point of interest name as claimed in any one of claims 1 to 5.
CN201910110277.2A 2019-02-11 2019-02-11 A kind of method for digging of interest point name, device, electronic equipment and storage medium Pending CN109800361A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910110277.2A CN109800361A (en) 2019-02-11 2019-02-11 A kind of method for digging of interest point name, device, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910110277.2A CN109800361A (en) 2019-02-11 2019-02-11 A kind of method for digging of interest point name, device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN109800361A true CN109800361A (en) 2019-05-24

Family

ID=66562011

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910110277.2A Pending CN109800361A (en) 2019-02-11 2019-02-11 A kind of method for digging of interest point name, device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN109800361A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110457706A (en) * 2019-08-15 2019-11-15 腾讯科技(深圳)有限公司 Interest point name preference pattern training method, application method, device and storage medium
CN111737537A (en) * 2020-07-21 2020-10-02 杭州欧若数网科技有限公司 POI recommendation method, device and medium based on graph database

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103455552A (en) * 2013-08-01 2013-12-18 百度在线网络技术(北京)有限公司 Point-of-interest mining method and device based on terms of interest
CN105224660A (en) * 2015-09-30 2016-01-06 北京奇虎科技有限公司 A kind of disposal route of map point of interest POI data and device
US20170068683A1 (en) * 2015-09-04 2017-03-09 Google Inc. Context based instant search suggestions
CN107291914A (en) * 2017-06-27 2017-10-24 达而观信息科技(上海)有限公司 A kind of method and system for generating search engine inquiry expansion word
CN107491537A (en) * 2017-08-23 2017-12-19 北京百度网讯科技有限公司 POI data excavation, information retrieval method, device, equipment and medium
CN108345702A (en) * 2018-04-10 2018-07-31 北京百度网讯科技有限公司 Entity recommends method and apparatus

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103455552A (en) * 2013-08-01 2013-12-18 百度在线网络技术(北京)有限公司 Point-of-interest mining method and device based on terms of interest
US20170068683A1 (en) * 2015-09-04 2017-03-09 Google Inc. Context based instant search suggestions
CN105224660A (en) * 2015-09-30 2016-01-06 北京奇虎科技有限公司 A kind of disposal route of map point of interest POI data and device
CN107291914A (en) * 2017-06-27 2017-10-24 达而观信息科技(上海)有限公司 A kind of method and system for generating search engine inquiry expansion word
CN107491537A (en) * 2017-08-23 2017-12-19 北京百度网讯科技有限公司 POI data excavation, information retrieval method, device, equipment and medium
CN108345702A (en) * 2018-04-10 2018-07-31 北京百度网讯科技有限公司 Entity recommends method and apparatus

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110457706A (en) * 2019-08-15 2019-11-15 腾讯科技(深圳)有限公司 Interest point name preference pattern training method, application method, device and storage medium
CN110457706B (en) * 2019-08-15 2023-08-22 腾讯科技(深圳)有限公司 Point-of-interest name selection model training method, using method, device and storage medium
CN111737537A (en) * 2020-07-21 2020-10-02 杭州欧若数网科技有限公司 POI recommendation method, device and medium based on graph database
CN111737537B (en) * 2020-07-21 2020-11-27 杭州欧若数网科技有限公司 POI recommendation method, device and medium based on graph database

Similar Documents

Publication Publication Date Title
CN113590645B (en) Searching method, searching device, electronic equipment and storage medium
US11709999B2 (en) Method and apparatus for acquiring POI state information, device and computer storage medium
US10885281B2 (en) Natural language document summarization using hyperbolic embeddings
CN110990520B (en) Address coding method and device, electronic equipment and storage medium
CN109325108B (en) Query processing method, device, server and storage medium
US11861516B2 (en) Methods and system for associating locations with annotations
CN107491537A (en) POI data excavation, information retrieval method, device, equipment and medium
CN108932218B (en) Instance extension method, device, equipment and medium
CN111611452A (en) Method, system, device and storage medium for ambiguity recognition of search text
CN110377750B (en) Comment generation method, comment generation device, comment generation model training device and storage medium
CN109376173A (en) A data query method, device, electronic device and storage medium
CN111552792B (en) Information query method and device, electronic equipment and storage medium
CN107704589A (en) Interest point failure method for digging, device, server and medium based on waybill
CN110413904A (en) Method, device, server and medium for processing point-of-interest address data
CN111198861A (en) Logic log processing method and device and electronic equipment
CN110287338B (en) Industry hot spot determination method, device, equipment and medium
CN119719297A (en) Question and answer generation method, device, equipment and storage medium
CN110704608A (en) Text theme generation method and device and computer equipment
CN109885610A (en) A method, device, electronic device and storage medium for extracting structured data
CN109800361A (en) A kind of method for digging of interest point name, device, electronic equipment and storage medium
CN107729347A (en) Acquisition methods, device, equipment and the computer-readable recording medium of synonymous label
CN110688467A (en) Named entity identification method, device, computer equipment and storage medium
CN110263140B (en) Method, device, electronic device and storage medium for mining subject words
CN114253992A (en) Data aggregation method, device, equipment and storage medium
CN113032650B (en) Library book management method, library book management device, library book management server and library book management storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190524