[go: up one dir, main page]

CN110689402A - Method, apparatus, electronic device, and readable storage medium for recommending merchants - Google Patents

Method, apparatus, electronic device, and readable storage medium for recommending merchants Download PDF

Info

Publication number
CN110689402A
CN110689402A CN201910833752.9A CN201910833752A CN110689402A CN 110689402 A CN110689402 A CN 110689402A CN 201910833752 A CN201910833752 A CN 201910833752A CN 110689402 A CN110689402 A CN 110689402A
Authority
CN
China
Prior art keywords
merchant
user
target
candidate
order
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910833752.9A
Other languages
Chinese (zh)
Inventor
孙正
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sankuai Online Technology Co Ltd
Original Assignee
Beijing Sankuai Online Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sankuai Online Technology Co Ltd filed Critical Beijing Sankuai Online Technology Co Ltd
Priority to CN201910833752.9A priority Critical patent/CN110689402A/en
Publication of CN110689402A publication Critical patent/CN110689402A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0631Recommending goods or services
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • G06Q30/0202Market predictions or forecasting for commercial activities
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0207Discounts or incentives, e.g. coupons or rebates

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Accounting & Taxation (AREA)
  • Development Economics (AREA)
  • Finance (AREA)
  • Economics (AREA)
  • Physics & Mathematics (AREA)
  • Marketing (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Game Theory and Decision Science (AREA)
  • Human Resources & Organizations (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Data Mining & Analysis (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The embodiment of the application provides a method and a device for recommending merchants, electronic equipment and a readable storage medium, aiming at enabling a server to recommend merchant information to a user more accurately and improving the operation efficiency of the user. The method comprises the following steps: obtaining user characteristics of a target user and merchant characteristics of a plurality of candidate merchants respectively; inputting the user characteristics and the merchant characteristics into a repurchase rate prediction model to obtain the repurchase rate of the target user for each candidate merchant in the candidate merchants; recommending a target merchant to the target user according to the repurchase rate of the target user for each candidate merchant in the candidate merchants, wherein the target merchant is at least one of the candidate merchants.

Description

推荐商家的方法、装置、电子设备及可读存储介质Method, apparatus, electronic device, and readable storage medium for recommending merchants

技术领域technical field

本申请实施例涉及互联网技术领域,尤其涉及一种推荐商家的方法、装置、电子设备及可读存储介质。The embodiments of the present application relate to the field of Internet technologies, and in particular, to a method, an apparatus, an electronic device, and a readable storage medium for recommending a merchant.

背景技术Background technique

随着互联挖技术的发展和智能终端设备的普及,越来越多的终端用户通过浏览器或客户端连接至电商平台或O2O(Online To Offline)平台等服务端,以实现网上购物、线上点餐、网上购票等线上交易活动。相关技术中,服务端会根据用户的历史浏览记录主动地向用户推荐商家信息,使用户在不执行搜索操作的情况下,快速地进入目标商家的主页,以此推广目标商家而获得收益并提高用户操作效率。With the development of Internet mining technology and the popularization of intelligent terminal devices, more and more end users connect to e-commerce platforms or O2O (Online To Offline) platforms and other servers through browsers or clients to realize online shopping, online shopping, and online shopping. Online transaction activities such as ordering meals and online ticket purchases. In the related art, the server will actively recommend business information to the user according to the user's historical browsing records, so that the user can quickly enter the homepage of the target business without performing a search operation, so as to promote the target business to obtain benefits and improve the performance of the target business. User operation efficiency.

目前的服务端在向用户推荐商家信息时,通常是根据各商家信息的历史点击率和/或历史转化率,确定历史点击率和/或历史转化率较高的目标商家,并将目标商家的信息推荐给用户。然而,采用目前的这种推荐方式,向用户推荐的商家信息仅能满足服务端当前收益的最大化和用户当前操作效率的最大化,而不能满足服务端长期收益的最大化和用户长期操作效率的最大化。When the current server recommends business information to users, it usually determines the target business with a higher historical click-through rate and/or historical conversion rate according to the historical click-through rate and/or historical conversion rate of each business information, and compares the target merchant's Information is recommended to users. However, with the current recommendation method, the business information recommended to the user can only satisfy the maximization of the current revenue of the server and the maximization of the user's current operation efficiency, but cannot satisfy the maximization of the long-term revenue of the server and the long-term operation efficiency of the user maximization.

发明内容SUMMARY OF THE INVENTION

本申请实施例提供一种推荐商家的方法、装置、电子设备及可读存储介质,旨在使服务端更准确地向用户推荐商家信息,以提高服务端长期收益和用户长期操作效率。Embodiments of the present application provide a method, apparatus, electronic device, and readable storage medium for recommending merchants, which aim to enable the server to more accurately recommend merchant information to users, so as to improve the long-term revenue of the server and the long-term operation efficiency of the user.

本申请实施例第一方面提供了一种推荐商家的方法,所述方法包括:A first aspect of the embodiments of the present application provides a method for recommending a merchant, and the method includes:

获得目标用户的用户特征和多个候选商家各自的商家特征;Obtain the user characteristics of the target user and the respective merchant characteristics of multiple candidate merchants;

将所述用户特征和所述多个商家特征输入复购率预测模型,得到所述目标用户针对所述多个候选商家中每个候选商家的复购率;Inputting the user characteristics and the multiple merchant characteristics into a repurchase rate prediction model to obtain the repurchase rate of the target user for each candidate merchant in the multiple candidate merchants;

根据所述目标用户针对所述多个候选商家中每个候选商家的复购率,向所述目标用户推荐目标商家,所述目标商家为所述多个候选商家中的至少一个。According to the repurchase rate of the target user for each candidate merchant of the plurality of candidate merchants, a target merchant is recommended to the target user, and the target merchant is at least one of the plurality of candidate merchants.

可选地,所述方法还包括构建复购率预测模型。Optionally, the method further includes constructing a repurchase rate prediction model.

可选地,所述构建复购率预测模型,包括:Optionally, the building a repurchase rate prediction model includes:

针对已下单的历史用户,获得该历史用户在从下单时间起的预设时间段内的商家浏览记录序列;For a historical user who has placed an order, obtain the sequence of business browsing records of the historical user within a preset time period from the time of placing the order;

针对所述商家浏览记录序列中的每次商家浏览记录,建立该次商家浏览记录对应的训练样本,所述训练样本包括:该次商家浏览记录对应的商家特征、用户特征以及用户复购情况对应的奖励值、下一次商家浏览记录对应的商家特征和用户特征;For each merchant browsing record in the merchant browsing record sequence, a training sample corresponding to the merchant browsing record is established, and the training sample includes: the merchant characteristics, user characteristics and user repurchase situations corresponding to the merchant browsing record. The reward value of , the merchant characteristics and user characteristics corresponding to the next merchant browsing record;

根据多次商家浏览记录各自对应的训练样本,构建训练样本集;Construct a training sample set according to the corresponding training samples of multiple business browsing records;

基于所述训练样本集,对预设强化学习模型进行训练,得到所述复购率预测模型。Based on the training sample set, a preset reinforcement learning model is trained to obtain the repurchase rate prediction model.

可选地,所述方法还包括:Optionally, the method further includes:

针对所述商家浏览记录序列中的每次商家浏览记录:For each merchant browsing record in the merchant browsing record sequence:

在所述历史用户针对该次商家浏览记录对应的商家下单的情况下,确定用户复购情况对应的奖励值是大于零的值;In the case that the historical user places an order for the merchant corresponding to the merchant's browsing record, it is determined that the reward value corresponding to the user's repurchase situation is a value greater than zero;

在所述历史用户针对该次商家浏览记录对应的商家未下单的情况下,确定用户复购情况对应的奖励值是不大零的值。In the case that the historical user has not placed an order for the merchant corresponding to the merchant's browsing record, it is determined that the reward value corresponding to the user's repurchase situation is a value not greater than zero.

可选地,在所述历史用户针对该次商家浏览记录对应的商家下单的情况下,确定用户复购情况对应的奖励值,包括:Optionally, in the case that the historical user places an order for the merchant corresponding to the merchant's browsing record, determining the reward value corresponding to the user's repurchase situation, including:

在所述历史用户针对该次商家浏览记录对应的商家下单的情况下,确定该历史用户本次下单时间与上次下单时间的下单时间差;In the case where the historical user places an order for the merchant corresponding to the merchant's browsing record, determine the time difference between the historical user's current ordering time and the last ordering time;

根据所述下单时间差以及预设的奖励值函数,确定用户复购情况对应的奖励值,该奖励值与所述下单时间差呈负相关。According to the order time difference and the preset reward value function, the reward value corresponding to the user's repurchase situation is determined, and the reward value is negatively correlated with the order time difference.

可选地,根据所述下单时间差以及预设的奖励值函数,确定用户复购情况对应的奖励值,包括:Optionally, according to the order time difference and a preset reward value function, determine the reward value corresponding to the user's repurchase situation, including:

按照以下公式确定用户复购情况对应的奖励值:The reward value corresponding to the user's repurchase situation is determined according to the following formula:

Figure BDA0002191553640000031
Figure BDA0002191553640000031

其中,r表示用户复购情况对应的奖励值,C表示调权系数,T表示所述下单时间差。Among them, r represents the reward value corresponding to the user's repurchase situation, C represents the weight adjustment coefficient, and T represents the time difference between the order placement.

可选地,所述用户特征包括以下至少一者:用户的消费偏好、用户的地理位置、用户画像以及行为特征。Optionally, the user characteristics include at least one of the following: the user's consumption preference, the user's geographic location, a user portrait, and a behavior characteristic.

可选地,所述商家特征包括以下至少一者:商家ID、品类ID、配送时长、销量、满减额度以及客单价。Optionally, the merchant characteristics include at least one of the following: merchant ID, category ID, delivery time, sales volume, full discount limit, and customer unit price.

本申请实施例第二方面提供一种推荐商家的装置,所述装置包括:A second aspect of an embodiment of the present application provides an apparatus for recommending a merchant, the apparatus comprising:

特征获得模块,用于获得目标用户的用户特征和多个候选商家各自的商家特征;The feature obtaining module is used to obtain the user feature of the target user and the respective merchant features of multiple candidate merchants;

复购率获得模块,用于将所述用户特征和所述多个商家特征输入复购率预测模型,得到所述目标用户针对所述多个候选商家中每个候选商家的复购率;a repurchase rate obtaining module, configured to input the user characteristics and the multiple merchant characteristics into a repurchase rate prediction model to obtain the repurchase rate of the target user for each candidate merchant in the multiple candidate merchants;

目标商家推荐模块,用于根据所述目标用户针对所述多个候选商家中每个候选商家的复购率,向所述目标用户推荐目标商家,所述目标商家为所述多个候选商家中的至少一个。A target merchant recommendation module, configured to recommend a target merchant to the target user according to the repurchase rate of the target user for each candidate merchant in the plurality of candidate merchants, and the target merchant is one of the plurality of candidate merchants at least one of.

可选地,所述装置还包括:Optionally, the device further includes:

模型构建模块,用于构建复购率预测模型。The model building module is used to build a repurchase rate prediction model.

可选地,所述模型构建模块包括:Optionally, the model building module includes:

商家浏览记录序列子模块,用于针对已下单的历史用户,获得该历史用户在从下单时间起的预设时间段内的商家浏览记录序列;The merchant browsing record sequence sub-module is used to obtain the merchant browsing record sequence of the historical user within the preset time period from the time of placing the order for the historical user who has placed the order;

训练样本建立子模块,用于针对所述商家浏览记录序列中的每次商家浏览记录,建立该次商家浏览记录对应的训练样本,所述训练样本包括:该次商家浏览记录对应的商家特征、用户特征以及用户复购情况对应的奖励值、下一次商家浏览记录对应的商家特征和用户特征;A training sample establishment sub-module is used to establish a training sample corresponding to the merchant browsing record for each merchant browsing record in the merchant browsing record sequence, and the training sample includes: the merchant feature corresponding to the merchant browsing record, User characteristics and reward value corresponding to the user's repurchase situation, merchant characteristics and user characteristics corresponding to the next merchant browsing record;

训练样本集构建子模块,用于根据多次商家浏览记录各自对应的训练样本,构建训练样本集;The training sample set construction sub-module is used to construct the training sample set according to the corresponding training samples of multiple merchant browsing records;

模型训练子模块,用于基于所述训练样本集,对预设强化学习模型进行训练,得到所述复购率预测模型。The model training sub-module is used for training a preset reinforcement learning model based on the training sample set to obtain the repurchase rate prediction model.

可选地,所述模型构建模块还包括:Optionally, the model building module further includes:

奖励值确定子模块,用于针对所述商家浏览记录序列中的每次商家浏览记录,在所述历史用户针对该次商家浏览记录对应的商家下单的情况下,确定用户复购情况对应的奖励值是大于零的值;在所述历史用户针对该次商家浏览记录对应的商家未下单的情况下,确定用户复购情况对应的奖励值是不大零的值。The reward value determination sub-module is used for each merchant browsing record in the merchant browsing record sequence, in the case that the historical user places an order for the merchant corresponding to the merchant browsing record, determine the user's repurchase situation corresponding to the situation. The reward value is a value greater than zero; in the case that the historical user has not placed an order for the merchant corresponding to the merchant's browsing record, it is determined that the reward value corresponding to the user's repurchase is a value not greater than zero.

可选地,所述奖励值确定子模块包括:Optionally, the reward value determination submodule includes:

下单时间差确定单元,用于在所述历史用户针对该次商家浏览记录对应的商家下单的情况下,确定该历史用户本次下单时间与上次下单时间的下单时间差;an order time difference determination unit, configured to determine the order time difference between the current order time of the historical user and the last order time when the historical user places an order for the merchant corresponding to the merchant browsing record;

奖励值确定单元,用于根据所述下单时间差以及预设的奖励值函数,确定用户复购情况对应的奖励值,该奖励值与所述下单时间差呈负相关。The reward value determination unit is configured to determine the reward value corresponding to the user's repurchase situation according to the order time difference and a preset reward value function, and the reward value is negatively correlated with the order time difference.

可选地,所述奖励值确定单元包括:Optionally, the reward value determination unit includes:

奖励值确定子单元,用于按照以下公式确定用户复购情况对应的奖励值:The reward value determination subunit is used to determine the reward value corresponding to the user's repurchase situation according to the following formula:

Figure BDA0002191553640000041
Figure BDA0002191553640000041

其中,r表示用户复购情况对应的奖励值,C表示调权系数,T表示所述下单时间差。Among them, r represents the reward value corresponding to the user's repurchase situation, C represents the weight adjustment coefficient, and T represents the time difference between the order placement.

可选地,所述用户特征包括以下至少一者:用户的消费偏好、用户的地理位置、用户画像以及行为特征。Optionally, the user characteristics include at least one of the following: the user's consumption preference, the user's geographic location, a user portrait, and a behavior characteristic.

可选地,所述商家特征包括以下至少一者:商家ID、品类ID、配送时长、销量、满减额度以及客单价。Optionally, the merchant characteristics include at least one of the following: merchant ID, category ID, delivery time, sales volume, full discount limit, and customer unit price.

本申请实施例第三方面提供一种计算机可读存储介质,其上存储有计算机程序,该计算机程序被处理器执行时,实现如本申请第一方面所述的方法中的步骤。A third aspect of the embodiments of the present application provides a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, implements the steps in the method described in the first aspect of the present application.

本申请实施例第四方面提供一种电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行时实现本申请第一方面所述的方法的步骤。A fourth aspect of the embodiments of the present application provides an electronic device, including a memory, a processor, and a computer program stored in the memory and running on the processor, where the processor implements the method described in the first aspect of the present application when executed. A step of.

采用本申请实施例提供的推荐商家的方法,首先获得目标用户的用户特征和多个候选商家各自的商家特征,然后将这些特征输入预先训练的复购率预测模型,得到目标用户针对多个候选商家中每个候选商家的复购率。其中,每个候选商家的复购率表征了该目标用户针对该候选商家的长远需求。复购率越高,该目标用户针对该候选商家的长远需求越大,目标用户针对该候选商家再次下单的概率越大。最后根据多个候选商家各自的复购率,向该目标用户推荐目标商家,从而满足用户的长远需求,提高服务端长期收益,并提高用户对服务端主页的长期操作效率。Using the method for recommending merchants provided by the embodiments of the present application, the user characteristics of the target user and the respective merchant characteristics of multiple candidate merchants are first obtained, and then these characteristics are input into the pre-trained repurchase rate prediction model to obtain the target user for multiple candidate merchants. The repurchase rate of each candidate merchant in the merchant. Among them, the repurchase rate of each candidate merchant represents the long-term demand of the target user for the candidate merchant. The higher the repurchase rate, the greater the long-term demand of the target user for the candidate merchant, and the greater the probability that the target user places an order for the candidate merchant again. Finally, according to the repurchase rates of multiple candidate merchants, the target merchant is recommended to the target user, so as to meet the user's long-term needs, improve the long-term revenue of the server, and improve the user's long-term operation efficiency on the server homepage.

附图说明Description of drawings

为了更清楚地说明本申请实施例的技术方案,下面将对本申请实施例的描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions of the embodiments of the present application more clearly, the following briefly introduces the drawings that are used in the description of the embodiments of the present application. Obviously, the drawings in the following description are only some embodiments of the present application. , for those of ordinary skill in the art, other drawings can also be obtained from these drawings without creative labor.

图1是本申请一实施例提出的复购率预测模型的训练流程图;Fig. 1 is the training flow chart of the repurchase rate prediction model proposed by an embodiment of the present application;

图2是本申请一实施例提出的训练样本示意图;2 is a schematic diagram of a training sample proposed by an embodiment of the present application;

图3是本申请一实施例提出的一种确定奖励值的方法示意图;3 is a schematic diagram of a method for determining a reward value proposed by an embodiment of the present application;

图4是本申请一实施例提出的推荐商家的方法的流程图;4 is a flowchart of a method for recommending a merchant proposed by an embodiment of the present application;

图5是本申请一实施例提出的服务端与客户端的交互示意图;5 is a schematic diagram of interaction between a server and a client according to an embodiment of the present application;

图6是本申请一实施例提供的推荐商家的装置的示意图。FIG. 6 is a schematic diagram of an apparatus for recommending a merchant provided by an embodiment of the present application.

具体实施方式Detailed ways

下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动的前提下所获得的所有其他实施例,均应属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, not all of the embodiments. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of this application.

在互联网技术领域,当通过浏览器或客户端用户进行服务端主页后,服务端会根据用户的历史浏览记录主动地向用户推荐商家信息,使用户在不执行搜索操作的情况下,快速地进入目标商家主页,提高用户操作效率。目前的服务端在向用户推荐商家信息时,通常是根据各商家信息的历史点击率和/或历史转化率,确定历史点击率和/或历史转化率较高的目标商家,并将目标商家的信息推荐给用户。然而,采用目前的这种推荐方式,向用户推荐的商家信息仅能满足服务端当前收益的最大化和用户当前操作效率的最大化,而不能满足服务端长期收益的最大化和用户长期操作效率的最大化。In the field of Internet technology, after the server homepage is displayed by a browser or client user, the server will actively recommend business information to the user according to the user's historical browsing records, so that the user can quickly enter the website without performing a search operation. Target business home page, improve user operation efficiency. When the current server recommends business information to users, it usually determines the target business with a higher historical click-through rate and/or historical conversion rate according to the historical click-through rate and/or historical conversion rate of each business information, and compares the target merchant's Information is recommended to users. However, with the current recommendation method, the business information recommended to the user can only satisfy the maximization of the current revenue of the server and the maximization of the user's current operation efficiency, but cannot satisfy the maximization of the long-term revenue of the server and the long-term operation efficiency of the user maximization.

有鉴于此,本申请实施例提出:首先获得目标用户的用户特征和多个候选商家各自的商家特征,然后根据这些特征确定目标用户针对多个候选商家中每个候选商家的复购率。最后根据多个候选商家各自的复购率,向该目标用户推荐目标商家,从而满足用户的长远需求,提高服务端长期收益,并提高用户对服务端主页的长期操作效率。In view of this, the embodiment of the present application proposes to first obtain the user characteristics of the target user and the respective merchant characteristics of multiple candidate merchants, and then determine the repurchase rate of the target user for each of the multiple candidate merchants according to these characteristics. Finally, according to the repurchase rates of multiple candidate merchants, the target merchant is recommended to the target user, so as to meet the user's long-term needs, improve the long-term revenue of the server, and improve the user's long-term operation efficiency on the server homepage.

此外,为了更智能地实施本申请实施例提出的上述方法,本申请实施例首先针对预设强化学习模型搜集训练样本,构建训练样本集,最后基于该训练样本集,对该预设强化学习模型进行训练,得到复购率预测模型。其中,预设强化学习模型可以是相关技术中常用的模型。该复购率预测模型可用于执行本申请实施例提出的上述方法中的部分或全部步骤。In addition, in order to more intelligently implement the above method proposed by the embodiment of the present application, the embodiment of the present application first collects training samples for the preset reinforcement learning model, constructs a training sample set, and finally, based on the training sample set, generates the preset reinforcement learning model for the preset reinforcement learning model. Perform training to obtain a repurchase rate prediction model. The preset reinforcement learning model may be a commonly used model in related technologies. The repurchase rate prediction model may be used to execute some or all of the steps in the above method proposed in the embodiments of the present application.

参考图1,图1是本申请一实施例提出的复购率预测模型的训练流程图。如图1所示,该训练流程包括以下步骤:Referring to FIG. 1 , FIG. 1 is a training flow chart of a repurchase rate prediction model proposed by an embodiment of the present application. As shown in Figure 1, the training process includes the following steps:

步骤S11:针对已下单的历史用户,获得该历史用户在从下单时间起的预设时间段内的商家浏览记录序列。Step S11: For a historical user who has placed an order, obtain a sequence of business browsing records of the historical user within a preset time period from the time of placing the order.

本实施例中,历史用户在从下单时间起的预设时间段内的商家浏览记录序列是指:以该历史用户的一次下单行为为起点,在预设时间段内,该历史用户连续多次进入服务端主页后,由服务端推送给该历史用户的商家信息,和/或,该历史用户主动搜索的商家信息所组成的商家信息序列。In this embodiment, the sequence of business browsing records of a historical user within a preset time period from the time of placing an order refers to: starting from an order placing behavior of the historical user, within the preset time period, the historical user has continuously After entering the home page of the server multiple times, the business information pushed by the server to the historical user, and/or a business information sequence composed of business information actively searched by the historical user.

商家浏览记录序列中的每次商家浏览记录除了包括商家信息外,还包括:该商家信息被浏览时或者被推送时的时间戳、以及用户针对该商家信息对应的商家是否下单的标记。In addition to the merchant information, each merchant browsing record in the merchant browsing record sequence also includes: a timestamp when the merchant information was browsed or pushed, and a mark indicating whether the user placed an order for the merchant corresponding to the merchant information.

示例地,假设某一历史用户在4月16日产生了下单行为,以4月16日为起点,以15天为预设时间段,参考表1,表1示意性地示出了该历史用户的商家浏览记录序列。For example, it is assumed that a historical user placed an order on April 16, with April 16 as the starting point and 15 days as the preset time period, referring to Table 1, which schematically shows the history A sequence of user's business browsing records.

表1历史用户的商家浏览记录序列表Table 1 List of business browsing records of historical users

序号serial number 时间time 商家信息Business information 是否下单Whether to place an order 11 4月17日11点23分April 17 at 11:23 c商家c Merchant 未下单No order 22 4月17日11点25分April 17 at 11:25 b商家b business 下单place an order 33 4月21日11点49分April 21 at 11:49 c商家c Merchant 未下单No order 44 4月22日18点33分April 22 at 18:33 d商家d Merchant 未下单No order 55 4月26日11点19分April 26 at 11:19 e商家e-merchants 未下单No order 66 4月26日11点24分April 26 at 11:24 d商家d Merchant 下单place an order 77 4月29日12点07分April 29 at 12:07 b商家b business 下单place an order 88 5月1日18点51分May 1st at 18:51 f商家f Merchant 未下单No order 99 5月1日19点15分May 1st at 19:15 f商家f Merchant 下单place an order

表1中的每一行代表一次商家浏览记录,以表1中的第一行为例,其表示该历史用户在4月17日11点23分浏览了或被推送了c商家信息,该历史用户没有在c商家下单。Each row in Table 1 represents a merchant browsing record. Taking the first row in Table 1 as an example, it means that the historical user browsed or was pushed the c-merchants information at 11:23 on April 17, and the historical user did not Place an order at c merchant.

步骤S12:针对所述商家浏览记录序列中的每次商家浏览记录,建立该次商家浏览记录对应的训练样本,所述训练样本包括:该次商家浏览记录对应的商家特征、用户特征以及用户复购情况对应的奖励值、下一次商家浏览记录对应的商家特征和用户特征。Step S12: For each merchant browsing record in the merchant browsing record sequence, establish a training sample corresponding to the merchant browsing record, and the training sample includes: merchant characteristics, user characteristics and user repetitions corresponding to the merchant browsing record. The reward value corresponding to the purchase situation, the merchant characteristics and user characteristics corresponding to the next merchant browsing record.

参考图2,图2是本申请一实施例提出的训练样本示意图。如图2所示,其中s表示用户特征,a表示商家特征,r表示奖励值。以图2中的第一个训练样本为例进行说明:其中s1表示第一个商家浏览记录对应的用户特征,a1表示第一个商家浏览记录对应的商家特征,r1表示第一个商家浏览记录中用户复购情况对应的奖励值,s2表示第二个商家浏览记录对应的用户特征,a2表示第二个商家浏览记录对应的商家特征。Referring to FIG. 2 , FIG. 2 is a schematic diagram of a training sample proposed by an embodiment of the present application. As shown in Figure 2, where s represents user characteristics, a represents merchant characteristics, and r represents reward value. Take the first training sample in Figure 2 as an example for illustration: where s1 represents the user feature corresponding to the first merchant browsing record, a1 represents the merchant feature corresponding to the first merchant browsing record, and r1 represents the first merchant browsing record The reward value corresponding to the user's repurchase situation, s2 represents the user feature corresponding to the second merchant's browsing record, and a2 represents the merchant's feature corresponding to the second merchant's browsing record.

沿用上述表1,以表1中的第2个商家浏览记录为例进行说明:该商家浏览记录对应的训练样本包括:在4月17日11点25分时b商家的商家特征、在4月17日11点25分时该历史用户的用户特征、该历史用户的下单行为所对应的奖励值、在4月21日11点49分时c商家的商家特征、以及在4月21日11点49分时该历史用户的用户特征。Following Table 1 above, take the second merchant browsing record in Table 1 as an example to illustrate: the training samples corresponding to this merchant browsing record include: merchant characteristics of merchant b at 11:25 on April 17, The user characteristics of the historical user at 11:25 on the 17th, the reward value corresponding to the order behavior of the historical user, the merchant characteristics of the merchant at 11:49 on April 21, and the merchant characteristics at 11:49 on April 21 User characteristics of the historical user at 49:00.

本实施例中,商家特征可以包括以下至少一者:商家ID、品类ID、配送时长、销量、满减额度以及客单价。具体地,商家特征可以是上述几者的向量化表示,如词向量。In this embodiment, the merchant characteristics may include at least one of the following: merchant ID, category ID, delivery time, sales volume, full discount limit, and customer unit price. Specifically, the merchant feature may be a vectorized representation of the above, such as word vectors.

本实施例中,用户特征可以包括以下至少一者:用户的消费偏好、用户的地理位置、用户画像以及行为特征。具体地,用户特征可以是上述几者的向量化表示,如词向量。In this embodiment, the user characteristics may include at least one of the following: the user's consumption preference, the user's geographic location, the user's portrait, and behavior characteristics. Specifically, the user feature may be a vectorized representation of the above, such as word vectors.

其中,用户的消费偏好是指用户对商品类别的偏好,以外卖业务为例,用户的消费偏好可以是用户的口味偏好。用户画像是指从海量的用户数据中抽取出的用户属性信息,该用户属性信息可以包括以下信息中的一种或多种:性别、职业、年龄段、收入水平、婚育情况、教育程度等基础属性,APP使用频率、下单概率等行为属性,以及外卖偏好、电影偏好、商品偏好等兴趣属性。用户的行为特征可以是指用户多次浏览商家信息时的下单率、用户使用客户端的频率、用户使用优惠券的概率等等。Among them, the user's consumption preference refers to the user's preference for commodity categories. For example, the takeaway business may be the user's taste preference. User portrait refers to user attribute information extracted from massive user data. The user attribute information can include one or more of the following information: gender, occupation, age group, income level, marital status, education level, etc. Basic attributes, behavioral attributes such as APP usage frequency and order probability, as well as interest attributes such as takeaway preferences, movie preferences, and commodity preferences. The behavioral characteristics of the user may refer to the order rate when the user browses the business information multiple times, the frequency of the user using the client, the probability of the user using coupons, and the like.

本实施例中,在针对每次商家浏览记录,建立该次商家浏览记录对应的训练样本时,可以从缓存的日志中获得该次商家浏览记录对应时间的商家特征和用户特征,以及从缓存的日志中获得下次商家浏览记录对应时间的商家特征和用户特征。In this embodiment, when a training sample corresponding to the merchant browsing record is established for each merchant browsing record, the merchant characteristics and user characteristics of the time corresponding to the merchant browsing record can be obtained from the cached log, and the corresponding time of the merchant browsing record can be obtained from the cached log. The business characteristics and user characteristics of the time corresponding to the next business browsing record are obtained from the log.

或者,在该历史用户于4月16日产生了下单行为后,在执行步骤S11时,针对该历史用户的每次商家浏览,记录当时的商家特征和用户特征,并将记录的商家特征和用户特征作为该次商家浏览记录中的部分信息。如此,在针对每次商家浏览记录,建立该次商家浏览记录对应的训练样本时,直接从该次商家浏览记录中读取商家特征和用户特征,并从下次商家浏览记录中读取商家特征和用户特征。Or, after the historical user has placed an order on April 16, when step S11 is executed, for each merchant browsing of the historical user, the current merchant characteristics and user characteristics are recorded, and the recorded merchant characteristics and user characteristics are recorded. User characteristics are part of the information in the business browsing record. In this way, when establishing a training sample corresponding to the merchant browsing record for each merchant browsing record, the merchant characteristics and user characteristics are directly read from the merchant browsing record, and the merchant characteristics are read from the next merchant browsing record. and user characteristics.

本实施例中,用户复购情况对应的奖励值,用于表征历史用户针对商家浏览记录中的商家是否下单。本申请在实施期间,如果历史用户针对商家浏览记录中的商家产生了下单行为,则对应的奖励值是一个较大的数值。如果历史用户针对商家浏览记录中的商家未产生下单行为,则对应的奖励值是一个较小的数值。In this embodiment, the reward value corresponding to the user's repurchase situation is used to represent whether the historical user places an order for the merchant in the merchant's browsing record. During the implementation of this application, if a historical user has placed an order for a merchant in the merchant's browsing record, the corresponding reward value is a larger value. If the historical user does not place an order for the merchant in the merchant's browsing record, the corresponding reward value is a small value.

本实施例中,针对所述商家浏览记录序列中的每次商家浏览记录,建立其训练样本时,为了确定该训练样本中的奖励值,一种可行的实施方式是:针对所述商家浏览记录序列中的每次商家浏览记录:在所述历史用户针对该次商家浏览记录对应的商家下单的情况下,确定用户复购情况对应的奖励值是大于零的值;在所述历史用户针对该次商家浏览记录对应的商家未下单的情况下,确定用户复购情况对应的奖励值是不大零的值。In this embodiment, when establishing a training sample for each merchant browsing record in the merchant browsing record sequence, in order to determine the reward value in the training sample, a feasible implementation is: for the merchant browsing record Each merchant browsing record in the sequence: when the historical user places an order for the merchant corresponding to the merchant browsing record, it is determined that the reward value corresponding to the user's repurchase situation is a value greater than zero; In the case that the merchant corresponding to the merchant browsing record has not placed an order, it is determined that the reward value corresponding to the user's repurchase situation is a value not greater than zero.

示例地,沿用上述表1,其中序号为1、3、4、5或8的商家浏览记录对应的训练样本中的奖励值不大于0,序号为2、6、7或9的商家浏览记录对应的训练样本中的奖励值大于0。Exemplarily, following Table 1 above, the reward value in the training samples corresponding to the merchant browsing records with the serial numbers 1, 3, 4, 5 or 8 is not greater than 0, and the merchant browsing records with the serial numbers 2, 6, 7 or 9 correspond to The reward value in the training samples is greater than 0.

其中,在历史用户针对某次商家浏览记录对应的商家下单的情况下,确定奖励值的具体方式可参考图3,图3是本申请一实施例提出的一种确定奖励值的方法示意图。如图3所示,该确定奖励值的方法包括以下步骤:Among them, when a historical user places an order for a merchant corresponding to a certain merchant browsing record, the specific method of determining the reward value can refer to FIG. As shown in Figure 3, the method for determining the reward value includes the following steps:

步骤S12-1:在所述历史用户针对该次商家浏览记录对应的商家下单的情况下,确定该历史用户本次下单时间与上次下单时间的下单时间差;Step S12-1: In the case that the historical user places an order for the merchant corresponding to the current merchant browsing record, determine the time difference between the historical user's current ordering time and the previous ordering time;

步骤S12-2:根据所述下单时间差以及预设的奖励值函数,确定用户复购情况对应的奖励值,该奖励值与所述下单时间差呈负相关。Step S12-2: Determine a reward value corresponding to the user's repurchase situation according to the order time difference and a preset reward value function, and the reward value is negatively correlated with the order time difference.

示例地,沿用上述表1,在序号为2的商家浏览记录中记录了历史用户的下单行为,该下单行为的下单时间(4月17日)与上次下单时间(4月16日)的时间差为1天。在序号为6的商家浏览记录中记录了历史用户的下单行为,该下单行为的下单时间(4月26日)与上次下单时间(4月17日)的时间差为9天。在经过上述步骤S12-2后,序号为2的商家浏览记录对应的训练样本中的奖励值大于序号为6的商家浏览记录对应的训练样本中的奖励值。For example, following Table 1 above, the ordering behavior of historical users is recorded in the merchant browsing record with serial number 2. The ordering time of the ordering behavior (April 17) and the last order time (April 16) are recorded. day) time difference is 1 day. The ordering behavior of historical users is recorded in the merchant browsing record with the serial number 6. The time difference between the ordering time (April 26) and the last ordering time (April 17) of the ordering behavior is 9 days. After the above step S12-2, the reward value in the training sample corresponding to the merchant browsing record with the serial number 2 is greater than the reward value in the training sample corresponding to the merchant browsing record with the serial number 6.

其中,奖励值函数可以形如:r=C/(T+1),其中r表示用户复购情况对应的奖励值,C表示调权系数,T表示所述下单时间差。调权系数C是一个大于0的正数,调权系数可根据训练需要进行手动更改。下单时间差的单位可以是分、小时、天等等,本申请对此不做限定。The reward value function can be in the form of: r=C/(T+1), where r represents the reward value corresponding to the user's repurchase situation, C represents the weight adjustment coefficient, and T represents the time difference for placing the order. The weighting coefficient C is a positive number greater than 0, and the weighting coefficient can be manually changed according to training needs. The unit of time difference for placing an order may be minutes, hours, days, etc., which is not limited in this application.

通过以上述步骤S12-1和步骤S12-2的方式确定奖励值,得到的奖励值不仅表征了历史用户是否复购(即下单),还表征了复购间隔时间。利用包括该奖励值的训练样本对强化学习模型进行训练,可以使训练所得的复购率预测模型不仅能针对用户的长远需求推荐商家信息,提高用户复购概率,帮助用户减少主动搜索、浏览的时间。该复购率预测模型还能区分出复购间隔长短因素,可以推荐使用户复购时间更短的商家,提高用户的下单频次,从而进一步提高推荐准确性,并进一步提高用户操作效率。By determining the reward value in the manner of the above steps S12-1 and S12-2, the obtained reward value not only represents whether the historical user repurchases (ie, places an order), but also represents the repurchase interval. Using the training samples including the reward value to train the reinforcement learning model can make the repurchase rate prediction model obtained from the training not only recommend business information for the user's long-term needs, but also improve the user's repurchase probability and help users reduce active searches and browsing. time. The repurchase rate prediction model can also distinguish factors of the length of the repurchase interval, and can recommend merchants that make the user repurchase shorter, increase the user's order frequency, further improve the recommendation accuracy, and further improve the user's operation efficiency.

步骤S13:根据多次商家浏览记录各自对应的训练样本,构建训练样本集。Step S13: Construct a training sample set according to the training samples corresponding to the multiple browsing records of the merchants.

本实施例中,训练样本集中包括多次商家浏览记录各自对应的训练样本,并且多个训练样本按照各自对应的商家浏览记录时间进行排序。如图2所示,训练样本1至训练样本4组成一个训练样本集,该训练样本集中各训练样本按照时间先后的排序为:训练样本1、训练样本2、训练样本3以及训练样本4。In this embodiment, the training sample set includes training samples corresponding to multiple merchant browsing records, and the multiple training samples are sorted according to their corresponding merchant browsing record times. As shown in FIG. 2 , training sample 1 to training sample 4 form a training sample set. The training samples in the training sample set are arranged in chronological order: training sample 1 , training sample 2 , training sample 3 and training sample 4 .

步骤S14:基于所述训练样本集,对预设强化学习模型进行训练,得到所述复购率预测模型。Step S14: Based on the training sample set, a preset reinforcement learning model is trained to obtain the repurchase rate prediction model.

本实施例中,基于训练样本集,对预设强化学习模型进行训练,得到复购率预测模型。其中,预设强化学习模型可以是DQN(Deep Q Network;深度Q网络)模型、DDPG(DeepDeterministic Policy Gradient;深度确定性策略梯度)模型或者DDQN(Double DQN;双重深度Q网络)模型等。该复购率预测模型可作为一种可选手段,用于执行本申请实施例提出的上述方法中的部分或全部步骤。In this embodiment, based on the training sample set, a preset reinforcement learning model is trained to obtain a repurchase rate prediction model. The preset reinforcement learning model may be a DQN (Deep Q Network; Deep Q Network) model, a DDPG (Deep Deterministic Policy Gradient; Deep Deterministic Policy Gradient) model, or a DDQN (Double DQN; Double Deep Q Network) model, or the like. The repurchase rate prediction model can be used as an optional means to execute some or all of the steps in the above method proposed in the embodiment of the present application.

参考图4,图4是本申请一实施例提出的推荐商家的方法的流程图。如图4所示,该方法包括以下步骤:Referring to FIG. 4 , FIG. 4 is a flowchart of a method for recommending a merchant proposed by an embodiment of the present application. As shown in Figure 4, the method includes the following steps:

步骤S41:获得目标用户的用户特征和多个候选商家各自的商家特征。Step S41: Obtain the user characteristics of the target user and the respective merchant characteristics of the multiple candidate merchants.

其中,用户特征包括以下至少一者:用户的消费偏好、用户的地理位置、用户画像以及行为特征。用户特征可以是上述几者中至少一者的抽象化表达,例如词向量。商家特征包括以下至少一者:商家ID、品类ID、配送时长、销量、满减额度以及客单价。商家特征可以是上述几者中至少一者的抽象化表达,例如词向量。The user characteristics include at least one of the following: the user's consumption preference, the user's geographic location, the user's portrait, and behavior characteristics. User features can be abstract representations of at least one of the above, such as word vectors. The merchant characteristics include at least one of the following: merchant ID, category ID, delivery time, sales volume, full discount limit, and customer unit price. The merchant feature can be an abstract representation of at least one of the above, such as a word vector.

其中,目标用户可以是指:正在打开客户端的用户,或者正在通过浏览器进入服务端主页的用户。以外卖类服务端为例,该服务端同时面向大量的用户,部分用户的终端设备上安装有外卖客户端,部分用户的终端设备上安装有浏览器。由于该外卖类服务端的用户访问并发量较大,在某一时刻,该外卖类服务端可能会同时存在多个目标用户,其中一部分目标用户正在打开外卖类客户端,另一部分目标用户正在通过浏览器进入该外卖类服务端主页。该外卖类服务端针对每个目标用户,分别执行上述步骤S41、下述步骤S42以及下述步骤S43。The target user may refer to: a user who is opening the client, or a user who is entering the homepage of the server through a browser. Take a takeaway server as an example, the server is oriented to a large number of users at the same time, some users have a takeaway client installed on their terminal devices, and some users have a browser installed on their terminal devices. Due to the large number of concurrent users accessing the food delivery server, at a certain moment, there may be multiple target users on the food delivery server at the same time, some of which are opening the food delivery client, and some target users are browsing The server enters the home page of the takeaway server. The takeaway server performs the above step S41 , the following step S42 and the following step S43 respectively for each target user.

候选商家可以是指:针对该目标用户,通过现有的筛选方式所筛选出的多个商家。例如根据配送距离、用户口味、销量、历史评价、或广告竞拍等方式筛选出的多个商家。The candidate merchants may refer to: multiple merchants selected by the existing screening methods for the target user. For example, multiple merchants are selected based on delivery distance, user taste, sales volume, historical evaluation, or advertising auction.

步骤S42:将所述用户特征和所述多个商家特征输入复购率预测模型,得到所述目标用户针对所述多个候选商家中每个候选商家的复购率。Step S42: Input the user characteristics and the multiple merchant characteristics into a repurchase rate prediction model to obtain the repurchase rate of the target user for each of the multiple candidate merchants.

其中,复购率预测模型可以是通过上述步骤S11至步骤S14的方式训练得到的复购率预测模型,也可以是通过其他方式训练得到的复购率预测模型,本步骤对复购率预测模型的来源和训练方式不做限定。Wherein, the repurchase rate prediction model may be the repurchase rate prediction model trained by the above steps S11 to S14, or may be the repurchase rate prediction model trained by other methods. In this step, the repurchase rate prediction model is The source and training method are not limited.

本实施例中,可以将用户特征和多个商家特征一对一组合,形成多个特征组合,每个特征组合中包括该用户特征和一个候选商家的商家特征。然后依次将多个特征组合输入复购率预测模型,得到该复购率预测模型针对每个特征组合输出的复购率,该复购率表征目标用户针对该特征组合对应候选商家的长远需求。复购率越高,该目标用户针对该候选商家的长远需求越大,目标用户针对该候选商家的下单概率越大。In this embodiment, the user feature and multiple merchant features can be combined one-to-one to form multiple feature combinations, and each feature combination includes the user feature and the merchant feature of a candidate merchant. Then, multiple feature combinations are input into the repurchase rate prediction model in turn, and the repurchase rate output by the repurchase rate prediction model for each feature combination is obtained. The higher the repurchase rate, the greater the long-term demand of the target user for the candidate merchant, and the higher the probability of the target user placing an order for the candidate merchant.

步骤S43:根据所述目标用户针对所述多个候选商家中每个候选商家的复购率,向所述目标用户推荐目标商家,所述目标商家为所述多个候选商家中的至少一个。Step S43: Recommend a target merchant to the target user according to the repurchase rate of the target user for each candidate merchant among the multiple candidate merchants, where the target merchant is at least one of the multiple candidate merchants.

示例地,可以直接将多个候选商家中复购率最大的候选商家作为目标商家,推荐给目标用户。For example, the candidate merchant with the largest repurchase rate among the multiple candidate merchants may be directly used as the target merchant and recommended to the target user.

或者,为了增加推荐的探索性,可以根据多个候选商家的复购率,确定每个候选商家的推荐概率,然后以每个候选商家的推荐概率向目标用户推荐该候选商家。例如,候选商家A、B、C各自对应的复购率分别为:0.2、0.6、0.4,则商家A的推荐概率等于0.2/(0.2+0.6+0.4)=0.17,商家B的推荐概率等于0.6/(0.2+0.6+0.4)=0.50,商家C的推荐概率等于0.4/(0.2+0.6+0.4)=0.33。如此,在向目标用户推荐候选商家时,以0.17的概率推荐商家A,以0.5的概率推荐商家B,以0.33的概率推荐商家C。换言之,每个候选商家都有被推荐的可能性,各候选商家被推荐的可能性按照大小排序为:商家B、商家C以及商家A。Alternatively, in order to increase the exploratory nature of the recommendation, the recommendation probability of each candidate merchant can be determined according to the repurchase rate of multiple candidate merchants, and then the candidate merchant can be recommended to the target user based on the recommendation probability of each candidate merchant. For example, the repurchase rates corresponding to candidate merchants A, B, and C are: 0.2, 0.6, and 0.4, respectively, then the recommendation probability of merchant A is equal to 0.2/(0.2+0.6+0.4)=0.17, and the recommendation probability of merchant B is equal to 0.6 /(0.2+0.6+0.4)=0.50, the recommendation probability of merchant C is equal to 0.4/(0.2+0.6+0.4)=0.33. In this way, when recommending candidate merchants to the target user, merchant A is recommended with a probability of 0.17, merchant B is recommended with a probability of 0.5, and merchant C is recommended with a probability of 0.33. In other words, each candidate merchant has a possibility of being recommended, and the possibility of being recommended for each candidate merchant is sorted as follows: merchant B, merchant C, and merchant A.

或者,在确定每个候选商家的复购率后,还可以根据每个候选商家的其他考虑因素,例如配送距离、用户口味、销量、历史评价、广告竞价、点击率(CTR)、或转化率(CVR)等,综合确定目标商家。Alternatively, after determining the repurchase rate of each candidate merchant, other considerations for each candidate merchant, such as delivery distance, user taste, sales, historical evaluation, ad bidding, click-through rate (CTR), or conversion rate may be used (CVR), etc., comprehensively determine the target merchants.

通过执行上述包括步骤S41至步骤S43的推荐商家的方法,首先获得目标用户的用户特征和多个候选商家各自的商家特征,然后将这些特征输入预先训练的复购率预测模型,得到目标用户针对多个候选商家中每个候选商家的复购率。其中,每个候选商家的复购率表征了该目标用户针对该候选商家的长远需求。复购率越高,该目标用户针对该候选商家的长远需求越大,目标用户针对该候选商家的下单概率越大。最后根据多个候选商家各自的复购率,向该目标用户推荐目标商家,从而满足用户的长远需求,提高服务端长期收益,并提高用户对服务端主页的长期操作效率。By executing the above-mentioned method for recommending merchants including steps S41 to S43, first obtain the user characteristics of the target user and the respective merchant characteristics of multiple candidate merchants, and then input these characteristics into the pre-trained repurchase rate prediction model to obtain the target user's target The repurchase rate of each candidate merchant among multiple candidate merchants. Among them, the repurchase rate of each candidate merchant represents the long-term demand of the target user for the candidate merchant. The higher the repurchase rate, the greater the long-term demand of the target user for the candidate merchant, and the higher the probability of the target user placing an order for the candidate merchant. Finally, according to the repurchase rates of multiple candidate merchants, the target merchant is recommended to the target user, so as to meet the user's long-term needs, improve the long-term revenue of the server, and improve the user's long-term operation efficiency on the server homepage.

参考图5,图5是本申请一实施例提出的服务端与客户端的交互示意图。图5中所示的服务端用于实施以上任一方法实施例中的推荐商家的方法。如图5所示:Referring to FIG. 5 , FIG. 5 is a schematic diagram of interaction between a server and a client according to an embodiment of the present application. The server shown in FIG. 5 is used to implement the method for recommending a merchant in any of the above method embodiments. As shown in Figure 5:

服务端主要包括数据存储模块和商家推荐模块。其中,数据存储模块中存储有用户下单日志、用户点击日志、以及商家曝光日志,还存储有商家特征和用户特征。商家推荐模块中包括复购率预测模型和投放服务接口。其中复购率预测模型用于接收目标用户的用户特征和候选商家的商家特征,并输出该目标用户针对该候选商家的复购率。投放服务接口将候选商家中的目标商家信息输出给客户端,并接收客户端针对目标商家的点击、下单等行为。The server mainly includes a data storage module and a merchant recommendation module. The data storage module stores user order logs, user click logs, and merchant exposure logs, as well as merchant features and user features. The merchant recommendation module includes a repurchase rate prediction model and a delivery service interface. The repurchase rate prediction model is used to receive the user characteristics of the target user and the merchant characteristics of the candidate merchant, and output the repurchase rate of the target user for the candidate merchant. The delivery service interface outputs the target merchant information in the candidate merchants to the client, and receives the client's actions such as clicking and placing orders for the target merchant.

具体地,如图5所示,数据存储模块依次通过分布式发布订阅消息系统Kafka和流处理框架Storm将用户下单日志、用户点击日志、以及商家曝光日志等日志信息送入KV(key-value)模块,数据存储模块依次通过数据仓库工具HIVE和分布式计算模型MapReduce将用户特征和商家特征送入KV模块。如此,该KV模块中存在多个由用户特征和商家特征构成的键值对。在线上应用时,复购率预测模型从KV模块中读取目标用户的用户特征和候选商家的商家特征,从而输出候选商家的复购率。Specifically, as shown in Figure 5, the data storage module sequentially sends log information such as user order logs, user click logs, and business exposure logs into KV (key-value ) module, the data storage module sends user features and merchant features into the KV module through the data warehouse tool HIVE and the distributed computing model MapReduce in turn. In this way, there are multiple key-value pairs composed of user characteristics and merchant characteristics in the KV module. When applied online, the repurchase rate prediction model reads the user characteristics of the target user and the merchant characteristics of the candidate merchants from the KV module, thereby outputting the repurchase rate of the candidate merchants.

此外,如图5所示,客户端还将用户的点击行为、下单行为等信息发送给数据存储模块,使得数据存储模块依据点击行为、下单行为等信息生成用户下单日志和用户点击日志。服务端该依据用户下单日志、用户点击日志、以及商家曝光日志等日志信息,构建训练样本集,用于对复购率预测模型进行训练更新。In addition, as shown in Figure 5, the client also sends the user's click behavior, ordering behavior and other information to the data storage module, so that the data storage module generates the user's order log and user click log based on the click behavior, ordering behavior and other information . The server should build a training sample set based on log information such as user order logs, user click logs, and merchant exposure logs, which are used to train and update the repurchase rate prediction model.

基于同一发明构思,本申请一实施例提供一种推荐商家的装置。参考图6,图6是本申请一实施例提供的推荐商家的装置的示意图。如图6所示,该装置包括:Based on the same inventive concept, an embodiment of the present application provides a device for recommending merchants. Referring to FIG. 6 , FIG. 6 is a schematic diagram of an apparatus for recommending a merchant provided by an embodiment of the present application. As shown in Figure 6, the device includes:

特征获得模块61,用于获得目标用户的用户特征和多个候选商家各自的商家特征;The feature obtaining module 61 is used to obtain the user feature of the target user and the respective merchant features of a plurality of candidate merchants;

复购率获得模块62,用于将所述用户特征和所述多个商家特征输入复购率预测模型,得到所述目标用户针对所述多个候选商家中每个候选商家的复购率;A repurchase rate obtaining module 62, configured to input the user characteristics and the multiple merchant characteristics into a repurchase rate prediction model to obtain the repurchase rate of the target user for each candidate merchant in the multiple candidate merchants;

目标商家推荐模块63,用于根据所述目标用户针对所述多个候选商家中每个候选商家的复购率,向所述目标用户推荐目标商家,所述目标商家为所述多个候选商家中的至少一个。A target merchant recommendation module 63, configured to recommend a target merchant to the target user according to the repurchase rate of the target user for each candidate merchant in the plurality of candidate merchants, where the target merchant is the plurality of candidate merchants at least one of the.

在一种可能的实施方式中,所述装置还包括:In a possible implementation, the device further includes:

模型构建模块,用于构建复购率预测模型。The model building module is used to build a repurchase rate prediction model.

在一种可能的实施方式中,所述模型构建模块包括:In a possible implementation, the model building module includes:

商家浏览记录序列子模块,用于针对已下单的历史用户,获得该历史用户在从下单时间起的预设时间段内的商家浏览记录序列;The merchant browsing record sequence sub-module is used to obtain the merchant browsing record sequence of the historical user within the preset time period from the time of placing the order for the historical user who has placed the order;

训练样本建立子模块,用于针对所述商家浏览记录序列中的每次商家浏览记录,建立该次商家浏览记录对应的训练样本,所述训练样本包括:该次商家浏览记录对应的商家特征、用户特征以及用户复购情况对应的奖励值、下一次商家浏览记录对应的商家特征和用户特征;A training sample establishment sub-module is used to establish a training sample corresponding to the merchant browsing record for each merchant browsing record in the merchant browsing record sequence, and the training sample includes: the merchant feature corresponding to the merchant browsing record, User characteristics and reward value corresponding to the user's repurchase situation, merchant characteristics and user characteristics corresponding to the next merchant browsing record;

训练样本集构建子模块,用于根据多次商家浏览记录各自对应的训练样本,构建训练样本集;The training sample set construction sub-module is used to construct the training sample set according to the corresponding training samples of multiple merchant browsing records;

模型训练子模块,用于基于所述训练样本集,对预设强化学习模型进行训练,得到所述复购率预测模型。The model training sub-module is used for training a preset reinforcement learning model based on the training sample set to obtain the repurchase rate prediction model.

在一种可能的实施方式中,所述模型构建模块还包括:In a possible implementation, the model building module further includes:

奖励值确定子模块,用于针对所述商家浏览记录序列中的每次商家浏览记录,在所述历史用户针对该次商家浏览记录对应的商家下单的情况下,确定用户复购情况对应的奖励值是大于零的值;在所述历史用户针对该次商家浏览记录对应的商家未下单的情况下,确定用户复购情况对应的奖励值是不大零的值。The reward value determination sub-module is used for each merchant browsing record in the merchant browsing record sequence, in the case that the historical user places an order for the merchant corresponding to the merchant browsing record, determine the user's repurchase situation corresponding to the situation. The reward value is a value greater than zero; in the case that the historical user has not placed an order for the merchant corresponding to the merchant's browsing record, it is determined that the reward value corresponding to the user's repurchase is a value not greater than zero.

在一种可能的实施方式中,所述奖励值确定子模块包括:In a possible implementation, the reward value determination submodule includes:

下单时间差确定单元,用于在所述历史用户针对该次商家浏览记录对应的商家下单的情况下,确定该历史用户本次下单时间与上次下单时间的下单时间差;an order time difference determining unit, configured to determine the order time difference between the time the historical user places an order this time and the time when the previous order is placed when the historical user places an order for the merchant corresponding to the merchant browsing record;

奖励值确定单元,用于根据所述下单时间差以及预设的奖励值函数,确定用户复购情况对应的奖励值,该奖励值与所述下单时间差呈负相关。The reward value determination unit is configured to determine the reward value corresponding to the user's repurchase situation according to the order time difference and a preset reward value function, and the reward value is negatively correlated with the order time difference.

在一种可能的实施方式中,所述奖励值确定单元包括:In a possible implementation, the reward value determination unit includes:

奖励值确定子单元,用于按照以下公式确定用户复购情况对应的奖励值:The reward value determination subunit is used to determine the reward value corresponding to the user's repurchase situation according to the following formula:

Figure BDA0002191553640000151
Figure BDA0002191553640000151

其中,r表示用户复购情况对应的奖励值,C表示调权系数,T表示所述下单时间差。Among them, r represents the reward value corresponding to the user's repurchase situation, C represents the weight adjustment coefficient, and T represents the time difference between the order placement.

在一种可能的实施方式中,所述用户特征包括以下至少一者:用户的消费偏好、用户的地理位置、用户画像以及行为特征。In a possible implementation manner, the user characteristics include at least one of the following: the user's consumption preference, the user's geographic location, the user's portrait, and behavior characteristics.

在一种可能的实施方式中,所述商家特征包括以下至少一者:商家ID、品类ID、配送时长、销量、满减额度以及客单价。In a possible implementation manner, the merchant characteristics include at least one of the following: merchant ID, category ID, delivery time, sales volume, full discount limit, and customer unit price.

基于同一发明构思,本申请另一实施例提供一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现如本申请上述任一实施例所述的方法中的步骤。Based on the same inventive concept, another embodiment of the present application provides a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, implements the steps in the method described in any of the foregoing embodiments of the present application .

基于同一发明构思,本申请另一实施例提供一种电子设备,包括存储器、处理器及存储在存储器上并可在处理器上运行的计算机程序,所述处理器执行时实现本申请上述任一实施例所述的方法中的步骤。Based on the same inventive concept, another embodiment of the present application provides an electronic device, including a memory, a processor, and a computer program stored in the memory and running on the processor, the processor implements any of the above-mentioned applications when executed Steps in the methods described in the Examples.

对于装置实施例而言,由于其与方法实施例基本相似,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。As for the apparatus embodiment, since it is basically similar to the method embodiment, the description is relatively simple, and reference may be made to the partial description of the method embodiment for related parts.

本说明书中的各个实施例均采用递进的方式描述,每个实施例重点说明的都是与其他实施例的不同之处,各个实施例之间相同相似的部分互相参见即可。The various embodiments in this specification are described in a progressive manner, and each embodiment focuses on the differences from other embodiments, and the same and similar parts between the various embodiments may be referred to each other.

本领域内的技术人员应明白,本申请实施例的实施例可提供为方法、装置、或计算机程序产品。因此,本申请实施例可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本申请实施例可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。Those skilled in the art should understand that the embodiments of the embodiments of the present application may be provided as methods, apparatuses, or computer program products. Accordingly, the embodiments of the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, embodiments of the present application may take the form of a computer program product implemented on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein.

本申请实施例是参照根据本申请实施例的方法、终端设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理终端设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理终端设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The embodiments of the present application are described with reference to the flowcharts and/or block diagrams of the methods, terminal devices (systems), and computer program products according to the embodiments of the present application. It will be understood that each flow and/or block in the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general purpose computer, special purpose computer, embedded processor or other programmable data processing terminal equipment to produce a machine that causes the instructions to be executed by the processor of the computer or other programmable data processing terminal equipment Means are created for implementing the functions specified in the flow or flows of the flowcharts and/or the blocks or blocks of the block diagrams.

这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理终端设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer readable memory capable of directing a computer or other programmable data processing terminal equipment to operate in a particular manner, such that the instructions stored in the computer readable memory result in an article of manufacture comprising instruction means, the The instruction means implement the functions specified in the flow or flow of the flowcharts and/or the block or blocks of the block diagrams.

这些计算机程序指令也可装载到计算机或其他可编程数据处理终端设备上,使得在计算机或其他可编程终端设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程终端设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded on a computer or other programmable data processing terminal equipment, so that a series of operational steps are performed on the computer or other programmable terminal equipment to produce a computer-implemented process, thereby executing on the computer or other programmable terminal equipment The instructions executed on the above provide steps for implementing the functions specified in the flowchart or blocks and/or the block or blocks of the block diagrams.

尽管已描述了本申请实施例的优选实施例,但本领域内的技术人员一旦得知了基本创造性概念,则可对这些实施例做出另外的变更和修改。所以,所附权利要求意欲解释为包括优选实施例以及落入本申请实施例范围的所有变更和修改。Although the preferred embodiments of the embodiments of the present application have been described, those skilled in the art may make additional changes and modifications to these embodiments once the basic inventive concepts are known. Therefore, the appended claims are intended to be construed to include the preferred embodiments as well as all changes and modifications that fall within the scope of the embodiments of the present application.

最后,还需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者终端设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者终端设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者终端设备中还存在另外的相同要素。Finally, it should also be noted that in this document, relational terms such as first and second are used only to distinguish one entity or operation from another, and do not necessarily require or imply these entities or that there is any such actual relationship or sequence between operations. Moreover, the terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion, such that a process, method, article or terminal device comprising a list of elements includes not only those elements, but also a non-exclusive list of elements. other elements, or also include elements inherent to such a process, method, article or terminal equipment. Without further limitation, an element defined by the phrase "comprises a..." does not preclude the presence of additional identical elements in the process, method, article or terminal device comprising said element.

以上对本申请所提供的一种推荐商家的方法、装置、电子设备及可读存储介质,进行了详细介绍,本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想;同时,对于本领域的一般技术人员,依据本申请的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本申请的限制。A method, device, electronic device, and readable storage medium for recommending a merchant provided by the present application have been described in detail above. The principles and implementations of the present application are described with specific examples. The description is only used to help understand the method of the present application and its core idea; meanwhile, for those of ordinary skill in the art, according to the idea of the present application, there will be changes in the specific embodiments and application scope. In summary, the above , the contents of this specification should not be construed as limiting the application.

Claims (11)

1. A method of recommending merchants, comprising:
obtaining user characteristics of a target user and merchant characteristics of a plurality of candidate merchants respectively;
inputting the user characteristics and the merchant characteristics into a repurchase rate prediction model to obtain the repurchase rate of the target user for each candidate merchant in the candidate merchants;
recommending a target merchant to the target user according to the repurchase rate of the target user for each candidate merchant in the candidate merchants, wherein the target merchant is at least one of the candidate merchants.
2. The method of claim 1, further comprising constructing a repurchase rate prediction model.
3. The method of claim 2, wherein the constructing a repurchase rate prediction model comprises:
aiming at the historical users who have placed an order, acquiring a merchant browsing record sequence of the historical users in a preset time period from the order placing time;
aiming at each merchant browsing record in the merchant browsing record sequence, establishing a training sample corresponding to the merchant browsing record, wherein the training sample comprises: the merchant characteristics and the user characteristics corresponding to the merchant browsing records, the reward value corresponding to the user re-purchase condition and the merchant characteristics and the user characteristics corresponding to the next merchant browsing records are recorded;
constructing a training sample set according to training samples corresponding to multiple merchant browsing records;
and training a preset reinforcement learning model based on the training sample set to obtain the repurchase rate prediction model.
4. The method of claim 3, further comprising:
for each merchant browsing record in the sequence of merchant browsing records:
under the condition that the historical user places an order for the merchant corresponding to the browsing record of the merchant, determining that the reward value corresponding to the user re-purchasing condition is a value larger than zero;
and under the condition that the historical user does not place an order for the merchant corresponding to the browsing record of the merchant, determining that the reward value corresponding to the user re-purchasing condition is a value which is not more than zero.
5. The method according to claim 4, wherein in the case that the historical user places an order for the merchant corresponding to the merchant browsing record, determining the reward value corresponding to the user who purchases again comprises:
under the condition that the historical user places an order for a merchant corresponding to the merchant browsing record, determining an order placing time difference between the order placing time of the historical user at this time and the order placing time of the historical user at the last time;
and determining an incentive value corresponding to the user re-purchasing condition according to the order time difference and a preset incentive value function, wherein the incentive value is negatively related to the order time difference.
6. The method according to claim 5, wherein determining the reward value corresponding to the user re-purchase condition according to the ordering time difference and a preset reward value function comprises:
determining the reward value corresponding to the user re-purchase condition according to the following formula:
Figure FDA0002191553630000021
wherein r represents an award value corresponding to the user re-purchase condition, C represents a weight adjusting coefficient, and T represents the ordering time difference.
7. The method of any of claims 1-6, wherein the user characteristics include at least one of: a consumption preference of the user, a geographic location of the user, a user representation, and a behavioral characteristic.
8. The method of any of claims 1-6, wherein the merchant characteristics include at least one of: merchant ID, item class ID, delivery duration, sales volume, full reduction, and customer unit price.
9. An apparatus for recommending merchants, the apparatus comprising:
the characteristic obtaining module is used for obtaining the user characteristic of the target user and the merchant characteristics of the candidate merchants;
the purchase-resuming rate obtaining module is used for inputting the user characteristics and the merchant characteristics into a purchase-resuming rate prediction model to obtain the purchase-resuming rate of the target user for each candidate merchant in the candidate merchants;
and the target merchant recommending module is used for recommending a target merchant to the target user according to the repurchase rate of the target user for each candidate merchant in the candidate merchants, wherein the target merchant is at least one of the candidate merchants.
10. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the method according to any one of claims 1 to 8.
11. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor when executed implements the steps of the method according to any of claims 1 to 8.
CN201910833752.9A 2019-09-04 2019-09-04 Method, apparatus, electronic device, and readable storage medium for recommending merchants Pending CN110689402A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910833752.9A CN110689402A (en) 2019-09-04 2019-09-04 Method, apparatus, electronic device, and readable storage medium for recommending merchants

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910833752.9A CN110689402A (en) 2019-09-04 2019-09-04 Method, apparatus, electronic device, and readable storage medium for recommending merchants

Publications (1)

Publication Number Publication Date
CN110689402A true CN110689402A (en) 2020-01-14

Family

ID=69107833

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910833752.9A Pending CN110689402A (en) 2019-09-04 2019-09-04 Method, apparatus, electronic device, and readable storage medium for recommending merchants

Country Status (1)

Country Link
CN (1) CN110689402A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111275493A (en) * 2020-02-10 2020-06-12 拉扎斯网络科技(上海)有限公司 Method, device, server and non-volatile storage medium for processing list data
CN111311332A (en) * 2020-02-28 2020-06-19 北京互金新融科技有限公司 User data processing method and device
CN111523922A (en) * 2020-04-01 2020-08-11 北京三快在线科技有限公司 Information pushing method and system, computer device and cloud server system
CN111582991A (en) * 2020-05-13 2020-08-25 中国银行股份有限公司 Product information recommendation method and device
CN112948693A (en) * 2021-03-30 2021-06-11 北京三快在线科技有限公司 Information pushing method and device, storage medium and electronic equipment
CN115170242A (en) * 2022-07-18 2022-10-11 拉扎斯网络科技(上海)有限公司 Information processing method and system
CN115204518A (en) * 2022-08-09 2022-10-18 阿里巴巴(中国)有限公司 Supply chain information processing method, device, equipment and storage medium
CN115311008A (en) * 2022-07-29 2022-11-08 北京深演智能科技股份有限公司 Method and device for determining valid sample and electronic equipment

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120296701A1 (en) * 2008-07-14 2012-11-22 Wahrheit, Llc System and method for generating recommendations
CN106157650A (en) * 2016-07-11 2016-11-23 东南大学 A kind of through street traffic efficiency ameliorative way controlled based on intensified learning variable speed-limit
CN106802553A (en) * 2017-01-13 2017-06-06 清华大学 A kind of railway locomotive operation control system hybrid tasks scheduling method based on intensified learning
CN107194492A (en) * 2017-04-13 2017-09-22 南京邮电大学 The optimization method that a kind of businessman based on position social networks is recommended
CN107451840A (en) * 2016-05-31 2017-12-08 百度在线网络技术(北京)有限公司 A kind of Transaction Information method for pushing and device
CN107742245A (en) * 2017-10-31 2018-02-27 北京小度信息科技有限公司 A merchant information recommendation method, device and equipment
CN107944913A (en) * 2017-11-21 2018-04-20 重庆邮电大学 High potential user's purchase intention Forecasting Methodology based on big data user behavior analysis
US20180374138A1 (en) * 2017-06-23 2018-12-27 Vufind Inc. Leveraging delayed and partial reward in deep reinforcement learning artificial intelligence systems to provide purchase recommendations
CN109118337A (en) * 2018-08-28 2019-01-01 深圳市烽焌信息科技有限公司 Consume the recommended method and Related product of businessman
CN110135871A (en) * 2018-02-02 2019-08-16 北京京东尚科信息技术有限公司 Calculate the method and apparatus that user purchases the phase again
CN110197415A (en) * 2019-04-23 2019-09-03 北京三快在线科技有限公司 A kind of recommended method, device, electronic equipment and readable storage medium storing program for executing

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120296701A1 (en) * 2008-07-14 2012-11-22 Wahrheit, Llc System and method for generating recommendations
CN107451840A (en) * 2016-05-31 2017-12-08 百度在线网络技术(北京)有限公司 A kind of Transaction Information method for pushing and device
CN106157650A (en) * 2016-07-11 2016-11-23 东南大学 A kind of through street traffic efficiency ameliorative way controlled based on intensified learning variable speed-limit
CN106802553A (en) * 2017-01-13 2017-06-06 清华大学 A kind of railway locomotive operation control system hybrid tasks scheduling method based on intensified learning
CN107194492A (en) * 2017-04-13 2017-09-22 南京邮电大学 The optimization method that a kind of businessman based on position social networks is recommended
US20180374138A1 (en) * 2017-06-23 2018-12-27 Vufind Inc. Leveraging delayed and partial reward in deep reinforcement learning artificial intelligence systems to provide purchase recommendations
CN107742245A (en) * 2017-10-31 2018-02-27 北京小度信息科技有限公司 A merchant information recommendation method, device and equipment
CN107944913A (en) * 2017-11-21 2018-04-20 重庆邮电大学 High potential user's purchase intention Forecasting Methodology based on big data user behavior analysis
CN110135871A (en) * 2018-02-02 2019-08-16 北京京东尚科信息技术有限公司 Calculate the method and apparatus that user purchases the phase again
CN109118337A (en) * 2018-08-28 2019-01-01 深圳市烽焌信息科技有限公司 Consume the recommended method and Related product of businessman
CN110197415A (en) * 2019-04-23 2019-09-03 北京三快在线科技有限公司 A kind of recommended method, device, electronic equipment and readable storage medium storing program for executing

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
YYMWATER: "强化学习系列一——基于深度强化学习的新闻推荐模型DRN", pages 1 - 5, Retrieved from the Internet <URL:https://zhuanlan.zhihu.com/p/58280384> *
慕课网: "慕课网-个性化推荐系统实战入门", pages 1 - 7, Retrieved from the Internet <URL:https://www.kukuxiaai.com/blog/2019-07/%E6%85%95%E8%AF%BE%E7%BD%91-%E4%B8%AA%E6%80%A7%E5%8C%96%E6%8E%A8%E8%8D%90%E7%B3%BB%E7%BB%9F%E5%AE%9E%E6%88%98%E5%85%A5%E9%97%A8/> *
段瑾: "强化学习在美团"猜你喜欢"的实践", pages 1 - 12, Retrieved from the Internet <URL:https://tech.meituan.com/2018/11/15/reinforcement-learning-in-mt-recommend-system.html> *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111275493A (en) * 2020-02-10 2020-06-12 拉扎斯网络科技(上海)有限公司 Method, device, server and non-volatile storage medium for processing list data
CN111275493B (en) * 2020-02-10 2023-08-22 拉扎斯网络科技(上海)有限公司 List data processing method, device, server and non-volatile storage medium
CN111311332A (en) * 2020-02-28 2020-06-19 北京互金新融科技有限公司 User data processing method and device
CN111523922A (en) * 2020-04-01 2020-08-11 北京三快在线科技有限公司 Information pushing method and system, computer device and cloud server system
CN111523922B (en) * 2020-04-01 2023-08-29 北京三快在线科技有限公司 Information pushing method, system, computer device and cloud server system
CN111582991A (en) * 2020-05-13 2020-08-25 中国银行股份有限公司 Product information recommendation method and device
CN111582991B (en) * 2020-05-13 2023-09-01 中国银行股份有限公司 Product information recommendation method and device
CN112948693A (en) * 2021-03-30 2021-06-11 北京三快在线科技有限公司 Information pushing method and device, storage medium and electronic equipment
CN115170242A (en) * 2022-07-18 2022-10-11 拉扎斯网络科技(上海)有限公司 Information processing method and system
CN115311008A (en) * 2022-07-29 2022-11-08 北京深演智能科技股份有限公司 Method and device for determining valid sample and electronic equipment
CN115204518A (en) * 2022-08-09 2022-10-18 阿里巴巴(中国)有限公司 Supply chain information processing method, device, equipment and storage medium

Similar Documents

Publication Publication Date Title
CN110689402A (en) Method, apparatus, electronic device, and readable storage medium for recommending merchants
US10762288B2 (en) Adaptive modification of content presented in electronic forms
US9589025B2 (en) Correlated information recommendation
US10409821B2 (en) Search result ranking using machine learning
US20140289239A1 (en) Recommendation tuning using interest correlation
EP2704080A1 (en) Recommendation systems and methods
US20160140627A1 (en) Generating high quality leads for marketing campaigns
WO2019072128A1 (en) Object recognition method and system thereof
CA3070612A1 (en) Click rate estimation
CN102541972A (en) Matching advertisers and users based on their respective intents
US10135933B2 (en) Apparatus and method for generating dynamic similarity audiences
CN106062743A (en) Systems and methods for keyword suggestion
CN110111167A (en) A kind of method and apparatus of determining recommended
TW201528181A (en) Systems and methods for search results targeting
CN107103028A (en) A kind of information processing method and device
CN114549125B (en) Item recommendation method and device, electronic device and computer-readable storage medium
US10387934B1 (en) Method medium and system for category prediction for a changed shopping mission
US8700625B1 (en) Identifying alternative products
JP2016062358A (en) Extractor, and method and program for extraction
CN114331581A (en) Intelligent recommendation method, system and device for article information during user browsing
KR102557299B1 (en) A method of creating the right marketing strategy for your users
CN102799593B (en) Personalized search ranking method and system
JP2017016523A (en) Providing device, providing method, and providing program
JP6327950B2 (en) Predicted value calculation device, predicted value calculation method, and predicted value calculation program
US12354135B2 (en) Personalized ranking of promotional items

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200114