CN106777067A - Information recommendation method and system - Google Patents
Information recommendation method and system Download PDFInfo
- Publication number
- CN106777067A CN106777067A CN201611140338.2A CN201611140338A CN106777067A CN 106777067 A CN106777067 A CN 106777067A CN 201611140338 A CN201611140338 A CN 201611140338A CN 106777067 A CN106777067 A CN 106777067A
- Authority
- CN
- China
- Prior art keywords
- scene
- user behavior
- recommendation
- data
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/583—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Library & Information Science (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
技术领域technical field
本发明涉及信息推荐领域,特别是涉及信息推荐方法及系统。The invention relates to the field of information recommendation, in particular to an information recommendation method and system.
背景技术Background technique
社会覆盖面极广、在线活跃用户数量庞大的流式信息服务称为公众性流式信息服务,公众性流式信息服务的典型代表为各类媒体直播服务。公众性流式信息服务的推荐面临以下问题:1)必须在内容的在线周期内推荐,从而避免将下线内容推荐给用户;2)必须将内容推荐给合适的用户个体,从而避免对其他用户群体造成干扰。Streaming information services with a wide social coverage and a large number of active online users are called public streaming information services, and the typical representatives of public streaming information services are various media live broadcast services. The recommendation of public streaming information services faces the following problems: 1) It must be recommended within the online cycle of the content, so as to avoid recommending offline content to users; 2) It must be recommended to appropriate individual users, so as to avoid other users Groups interfere.
可见,对于公众性流式信息服务的推荐,需要能够同时保证时效性和兼顾不同用户个体的个性化需求的新技术。It can be seen that for the recommendation of public streaming information services, a new technology that can ensure timeliness and take into account the personalized needs of different individual users is needed.
发明内容Contents of the invention
鉴于以上所述现有技术的缺点,本发明的目的在于提供信息推荐方法及系统,用于解决现有技术中公众性流式信息服务的推荐无法保证时效性,以及无法兼顾不同用户个体的个性化需求等问题。In view of the above-mentioned shortcomings of the prior art, the purpose of the present invention is to provide an information recommendation method and system, which is used to solve the problem that the recommendation of public streaming information services in the prior art cannot guarantee timeliness and cannot take into account the personalities of different individual users needs and other issues.
为实现上述目的及其他相关目的,本发明提供一种信息推荐方法,包括:按照预设时间间隔切割读入的多个采样数据流以得到多个采样数据切片;定义一个或多个用户行为场景,所述每个用户行为场景包括至少一个和/或至少一类用户行为事件;分别从每个所述采样数据切片的数据中甄选出符合所述用户行为场景的部分以组成场景快照;将所述场景快照作为场景模拟模型的输入,从而得到所述场景模拟模型输出的场景画像;将所述场景画像与预设推荐结果进行匹配,若匹配成功,则将所述预设推荐结果确定为推荐信息。In order to achieve the above purpose and other related purposes, the present invention provides an information recommendation method, including: cutting multiple sampled data streams read in according to preset time intervals to obtain multiple sampled data slices; defining one or more user behavior scenarios , each user behavior scenario includes at least one and/or at least one type of user behavior event; respectively select a part that conforms to the user behavior scenario from the data of each sampled data slice to form a scene snapshot; The scene snapshot is used as the input of the scene simulation model, thereby obtaining the scene portrait output by the scene simulation model; the scene portrait is matched with the preset recommendation result, and if the matching is successful, the preset recommendation result is determined as a recommendation information.
于本发明一实施例中,当定义了一个或多个用户行为场景时,所述一个或多个用户行为场景构成行为模式,所述行为模式中的某个用户行为场景的场景画像是由所述某个用户行为场景的场景快照和前一个用户行为场景的场景画像共同作为所述场景模拟模型的输入,并经所述场景模拟模型的输出得到的,所述方法还包括:将最终得到的场景画像与所述预设推荐结果进行匹配,若匹配成功,则将所述预设推荐结果确定为推荐信息。In an embodiment of the present invention, when one or more user behavior scenarios are defined, the one or more user behavior scenarios constitute a behavior pattern, and the scene portrait of a certain user behavior scenario in the behavior pattern is determined by the The scene snapshot of a certain user behavior scene and the scene portrait of the previous user behavior scene are jointly used as the input of the scene simulation model, and are obtained through the output of the scene simulation model, and the method also includes: the final obtained The scene portrait is matched with the preset recommendation result, and if the matching is successful, the preset recommendation result is determined as recommendation information.
于本发明一实施例中,所述场景模拟模型包括多个,每个用于针对一种特征信息进行场景模拟。In an embodiment of the present invention, the scene simulation model includes a plurality, each of which is used to perform scene simulation for a kind of feature information.
于本发明一实施例中,所述场景模拟模型包括:有监督学习模型、和/或无监督学习模型。In an embodiment of the present invention, the scene simulation model includes: a supervised learning model, and/or an unsupervised learning model.
于本发明一实施例中,所述无监督学习模型包括:深度学习模型。In an embodiment of the present invention, the unsupervised learning model includes: a deep learning model.
于本发明一实施例中,所述深度学习模型是根据反向传播算法建立的。In an embodiment of the present invention, the deep learning model is established according to the backpropagation algorithm.
于本发明一实施例中,每个所述采样数据流包括一类用户行为事件数据,所述多个采样数据切片包括与所述多个采样数据流分别一一对应的多个所述用户行为事件的部分数据。In an embodiment of the present invention, each of the sampled data streams includes a type of user behavior event data, and the plurality of sampled data slices include a plurality of user behavior events corresponding to the plurality of sampled data streams respectively. Partial data of the event.
于本发明一实施例中,每类所述用户行为事件数据包括多个所述用户行为事件数据,每个所述用户行为事件数据包括:时间戳、用于标识产生相应的行为事件的对象的用户标识、及相应的行为事件的采样数据。In an embodiment of the present invention, each type of user behavior event data includes a plurality of user behavior event data, and each user behavior event data includes: a time stamp, an ID for identifying the object that generated the corresponding behavior event Sampling data of user identification and corresponding behavior events.
于本发明一实施例中,在将所述场景快照输入所述场景模拟模型之前,所述方法还包括:将所述场景快照包含的各个用户行为事件数据按照时间顺序排序,并依序输入所述场景模拟模型。In an embodiment of the present invention, before inputting the scene snapshot into the scene simulation model, the method further includes: sorting the user behavior event data included in the scene snapshot in chronological order, and inputting the scene snapshot in sequence The scenario simulation model described above.
于本发明一实施例中,所述预设推荐结果包括多个,所述匹配包括:分别计算所述场景画像与每个所述预设推荐结果的相关度,将所述相关度最大的预设推荐结果确定为推荐信息。In an embodiment of the present invention, the preset recommendation results include multiple ones, and the matching includes: respectively calculating the correlation between the scene portrait and each of the preset recommendation results, and selecting the preset recommendation result with the greatest correlation Let the recommendation result be determined as the recommendation information.
为实现上述目的及其他相关目的,本发明提供一种信息推荐系统,包括:数据切片模块,用于按照预设时间间隔切割读入的多个采样数据流以得到多个采样数据切片;场景定义模块,用于定义一个或多个用户行为场景,每个用户行为场景包括至少一个和/或至少一类用户行为事件;快照甄选模块,用于分别从每个所述采样数据切片的数据中甄选出符合所述用户行为场景的部分以组成场景快照;场景模拟模块,用于将所述场景快照作为场景模拟模型的输入,从而得到所述场景模拟模型输出的场景画像;匹配模块,用于将所述场景画像与预设推荐结果进行匹配,若匹配成功,则将所述预设推荐结果确定为推荐信息。In order to achieve the above purpose and other related purposes, the present invention provides an information recommendation system, including: a data slicing module, which is used to cut and read multiple sampled data streams according to preset time intervals to obtain multiple sampled data slices; scene definition A module for defining one or more user behavior scenarios, each user behavior scenario including at least one and/or at least one type of user behavior event; a snapshot selection module for selecting from the data of each sampled data slice respectively Parts that meet the user behavior scene to form a scene snapshot; the scene simulation module is used to use the scene snapshot as the input of the scene simulation model, so as to obtain the scene portrait output by the scene simulation model; the matching module is used to The scene portrait is matched with a preset recommendation result, and if the match is successful, the preset recommendation result is determined as recommendation information.
于本发明一实施例中,当定义了一个或多个用户行为场景时,所述一个或多个用户行为场景构成行为模式,所述行为模式中的某个用户行为场景的场景画像是由所述某个用户行为场景的场景快照和前一个用户行为场景的场景画像共同作为所述场景模拟模型的输入,并经所述场景模拟模型的输出得到的,所述匹配模块还用于:将最终得到的场景画像与所述预设推荐结果进行匹配,若匹配成功,则将所述预设推荐结果确定为推荐信息。In an embodiment of the present invention, when one or more user behavior scenarios are defined, the one or more user behavior scenarios constitute a behavior pattern, and the scene portrait of a certain user behavior scenario in the behavior pattern is determined by the The scene snapshot of a certain user behavior scene and the scene portrait of the previous user behavior scene are used as the input of the scene simulation model, and are obtained through the output of the scene simulation model. The matching module is also used for: the final The obtained scene portrait is matched with the preset recommendation result, and if the matching is successful, the preset recommendation result is determined as recommendation information.
于本发明一实施例中,所述场景模拟模型包括多个,每个用于针对一种特征信息进行场景模拟。In an embodiment of the present invention, the scene simulation model includes a plurality, each of which is used to perform scene simulation for a kind of feature information.
于本发明一实施例中,所述场景模拟模型包括:有监督学习模型、和/或无监督学习模型。In an embodiment of the present invention, the scene simulation model includes: a supervised learning model, and/or an unsupervised learning model.
于本发明一实施例中,所述无监督学习模型包括:深度学习模型。In an embodiment of the present invention, the unsupervised learning model includes: a deep learning model.
于本发明一实施例中,所述深度学习模型是根据反向传播算法建立的。In an embodiment of the present invention, the deep learning model is established according to the backpropagation algorithm.
于本发明一实施例中,每个所述采样数据流包括一类用户行为事件数据,所述多个采样数据切片包括与所述多个采样数据流分别一一对应的多个所述用户行为事件的部分数据。In an embodiment of the present invention, each of the sampled data streams includes a type of user behavior event data, and the plurality of sampled data slices include a plurality of user behavior events corresponding to the plurality of sampled data streams respectively. Partial data of the event.
于本发明一实施例中,每类所述用户行为事件数据包括多个所述用户行为事件数据,每个所述用户行为事件数据包括:时间戳、用于标识产生相应的行为事件的对象的用户标识、及相应的行为事件的采样数据。In an embodiment of the present invention, each type of user behavior event data includes a plurality of user behavior event data, and each user behavior event data includes: a time stamp, an ID for identifying the object that generated the corresponding behavior event Sampling data of user identification and corresponding behavior events.
于本发明一实施例中,所述系统还包括:排序模块,用于在将所述场景快照输入所述场景模拟模型之前,将所述场景快照包含的各个用户行为事件数据按照时间顺序排序,并依序输入所述场景模拟模型。In an embodiment of the present invention, the system further includes: a sorting module, configured to sort the user behavior event data contained in the scene snapshot in chronological order before inputting the scene snapshot into the scene simulation model, And sequentially input the scene simulation model.
于本发明一实施例中,所述预设推荐结果包括多个,所述匹配包括:分别计算所述场景画像与每个所述预设推荐结果的相关度,将所述相关度最大的预设推荐结果确定为推荐信息。In an embodiment of the present invention, the preset recommendation results include multiple ones, and the matching includes: respectively calculating the correlation between the scene portrait and each of the preset recommendation results, and selecting the preset recommendation result with the greatest correlation Let the recommendation result be determined as the recommendation information.
如上所述,本发明的信息推荐方法及系统,实现了基于用户行为数据即时读入采样数据、即时个性化推荐的技术方案,具有以下有益效果:As mentioned above, the information recommendation method and system of the present invention realizes the technical solution of real-time reading of sampled data and real-time personalized recommendation based on user behavior data, and has the following beneficial effects:
1)推荐结果更新的时间周期短,以分钟或秒或更小的时间单位计算;1) The time period for updating the recommendation results is short, calculated in minutes or seconds or smaller time units;
2)读入采样数据后,采样数据对推荐结果的影响在几分钟/几秒钟之后显现出来,能够形成一种“行为采样—推荐反馈”的交互式推荐;2) After the sampling data is read in, the impact of the sampling data on the recommendation results will appear in a few minutes/seconds, and an interactive recommendation of "behavior sampling-recommendation feedback" can be formed;
3)推荐结果基于用户行为数据采样,是用户个体行为历史演进的概率结果,具有个性化特质。3) The recommendation result is based on user behavior data sampling, which is the probability result of the historical evolution of individual user behavior and has personalized characteristics.
附图说明Description of drawings
图1显示为本发明一实施例的信息推荐方法流程图。FIG. 1 is a flowchart of an information recommendation method according to an embodiment of the present invention.
图2显示为本发明一实施例的切割采样数据流以生成采样数据切片的示意图。FIG. 2 is a schematic diagram of cutting a sampled data stream to generate sampled data slices according to an embodiment of the present invention.
图3显示为本发明一实施例的A类行为采样数据流的内部组成示意图。FIG. 3 is a schematic diagram showing the internal composition of a type A behavior sampling data stream according to an embodiment of the present invention.
图4显示为本发明一实施例的定义的用户行为场景的示意图。Fig. 4 is a schematic diagram of a defined user behavior scenario according to an embodiment of the present invention.
图5A~5B显示为本发明一实施例的快照甄选过程示意图。5A-5B are schematic diagrams showing a snapshot selection process according to an embodiment of the present invention.
图6A~6B显示为本发明一实施例的场景模拟过程示意图。6A-6B are schematic diagrams of a scene simulation process according to an embodiment of the present invention.
图7显示为本发明一实施例的多个用户行为场景串联形成行为模式的模拟过程示意图。FIG. 7 is a schematic diagram of a simulation process in which multiple user behavior scenarios are connected in series to form a behavior pattern according to an embodiment of the present invention.
图8显示为本发明一实施例的选择最终推荐结果的过程示意图。FIG. 8 is a schematic diagram of a process of selecting a final recommendation result according to an embodiment of the present invention.
图9显示为本发明一实施例的信息推荐系统模块图。FIG. 9 is a block diagram of an information recommendation system according to an embodiment of the present invention.
具体实施方式detailed description
以下通过特定的具体实例说明本发明的实施方式,本领域技术人员可由本说明书所揭露的内容轻易地了解本发明的其他优点与功效。本发明还可以通过另外不同的具体实施方式加以实施或应用,本说明书中的各项细节也可以基于不同观点与应用,在没有背离本发明的精神下进行各种修饰或改变。需说明的是,在不冲突的情况下,以下实施例及实施例中的特征可以相互组合。Embodiments of the present invention are described below through specific examples, and those skilled in the art can easily understand other advantages and effects of the present invention from the content disclosed in this specification. The present invention can also be implemented or applied through other different specific implementation modes, and various modifications or changes can be made to the details in this specification based on different viewpoints and applications without departing from the spirit of the present invention. It should be noted that, in the case of no conflict, the following embodiments and features in the embodiments can be combined with each other.
需要说明的是,以下实施例中所提供的图示仅以示意方式说明本发明的基本构想,遂图式中仅显示与本发明中有关的组件而非按照实际实施时的组件数目、形状及尺寸绘制,其实际实施时各组件的型态、数量及比例可为一种随意的改变,且其组件布局型态也可能更为复杂。It should be noted that the diagrams provided in the following embodiments are only schematically illustrating the basic ideas of the present invention, and only the components related to the present invention are shown in the diagrams rather than the number, shape and shape of the components in actual implementation. Dimensional drawing, the type, quantity and proportion of each component can be changed arbitrarily during actual implementation, and the component layout type may also be more complicated.
随着大数据时代的到来,通过挖掘用户的历史行为来预测用户未来的行为,建立起用户和内容的关系,才能提高信息推荐的准确性。请参阅图1,本发明提供一种信息推荐方法,在保证时效性的基础上,兼顾不同用户的个性化需求,实现了流式数据读入的动态交互式个性化推荐,具体包括以下步骤:With the advent of the era of big data, the accuracy of information recommendation can only be improved by mining the historical behavior of users to predict the future behavior of users and establishing the relationship between users and content. Please refer to Fig. 1, the present invention provides an information recommendation method, on the basis of ensuring timeliness, taking into account the personalized needs of different users, and realizing the dynamic interactive personalized recommendation of streaming data reading, specifically including the following steps:
步骤S101:按照预设时间间隔切割读入的多个采样数据流以得到多个采样数据切片,图2显示了一次切割的过程,切割的时间点记录为时间点1,每个所述采样数据流包括一类用户行为事件数据,每个切割出的采样数据切片包括对应类别的用户行为事件数据中与预设时间间隔对应的部分。如图3所示,每个采样数据流是由多个用户行为事件数据组成的,其中,每个用户行为事件数据包括:记录该数据产生时间的时间戳、标识该数据产生的来源对象的用户标识,以及该数据主体部分。Step S101: Cut the read-in multiple sampling data streams according to preset time intervals to obtain multiple sampling data slices. Figure 2 shows a cutting process, and the cutting time point is recorded as time point 1, and each sampling data The stream includes a type of user behavior event data, and each sliced sampled data slice includes a part corresponding to a preset time interval in the corresponding type of user behavior event data. As shown in Figure 3, each sampled data stream is composed of multiple user behavior event data, where each user behavior event data includes: a timestamp recording the time when the data was generated, and the user identifying the source object of the data generation identification, and the data subject section.
需要说明的是,在每个采样数据流内,各个用户行为事件数据可以是无序的,在以预设时间间隔切割时,在预设时间间隔范围内的时间戳所对应的用户行为事件数据被切分出来,从而生成该类用户行为事件的采样数据切片。在多次切割后,按照切割时间的先后顺序将各批采样数据切片进行排序。It should be noted that in each sampled data stream, each user behavior event data may be out of order. When cutting at a preset time interval, the user behavior event data corresponding to the time stamp within the preset time interval range are segmented to generate sample data slices of this type of user behavior events. After multiple cuts, the batches of sampling data slices are sorted in the order of cutting time.
例如,用户行为事件为:观众观看某频道,对应的采样数据流为:所有在线观众观看该频道的数据,这些数据将实时地以数据流的形式获取,对应的采样数据切片为:在时间轴上对这些数据进行多次切片处理后,获得的一组彼此之间依切割时间排序的切片。For example, the user behavior event is: viewers watch a certain channel, and the corresponding sampling data stream is: the data of all online viewers watching this channel, these data will be obtained in the form of data stream in real time, and the corresponding sampling data slice is: in the time axis After performing multiple slice processing on these data, a set of slices sorted by cutting time are obtained.
步骤S102:定义一个或多个用户行为场景,如图4所示,每个定义的用户行为场景可以包括一个、多个、一类、多类用户行为事件。Step S102: Define one or more user behavior scenarios. As shown in FIG. 4, each defined user behavior scenario may include one, multiple, one type, or multiple types of user behavior events.
例如:A类用户行为事件为:观众观看某频道,B类用户行为事件为:观众切换到某频道,C类用户行为事件为:频道播放某节目,当定义的场景a包括A、C两类,则场景a为:观众观看某频道某节目,当定义的场景b包括B、C两类,则场景b为:观众切换到某频道某节目。For example: Type A user behavior event is: the audience watches a certain channel, type B user behavior event is: the audience switches to a certain channel, and type C user behavior event is: the channel plays a certain program, when the defined scene a includes two types of A and C , then scene a is: the audience watches a certain program on a certain channel, and when the defined scene b includes two types of B and C, then scene b is: the audience switches to a certain program on a certain channel.
步骤S103:对于每次切割,分别从每个所述采样数据切片的数据中甄选出符合所述用户行为场景的部分以组成与每个切割时间点对应的场景快照。Step S103: For each cut, select a part that conforms to the user behavior scene from the data of each sampled data slice to form a scene snapshot corresponding to each cut time point.
例如,对于图5A,A类用户行为事件采样数据切片为:所有在线观众在不同时间段观看某频道的数据,在时间轴上对这些数据进行多次切片处理后,获得的一组彼此之间依时间排序的切片。C类用户行为事件采样数据切片为:所有直播频道在某段时间内播放节目的节目单数据,在时间轴上对这些数据进行3次切片处理后,获得的一组彼此之间依切割时间排序的切片。甄选出的场景a快照1为:时间点1,观众观看某频道某节目的采样数据组成的快照,甄选出的场景a快照2为:时间点2,观众观看某频道某节目的采样数据组成的快照,甄选出的场景a快照3为:时间点3,观众观看某频道某节目的采样数据组成的快照。对于图5B,B类用户行为事件采样数据切片为:所有在线观众在不同时间段切换到某频道的数据,在时间轴上对这些数据进行3次切片处理后,获得的一组彼此之间依时间排序的切片。C类用户行为事件采样数据切片为:所有直播频道在某段时间内播放节目的节目单数据,在时间轴上对这些数据进行多次切片处理后,获得的一组彼此之间依时间排序的切片。甄选出的场景b快照1为:时间点1,某频道某节目观众切换进来的采样数据组成的快照,甄选出的场景b快照2为:时间点2,某频道某节目观众切换进来的采样数据组成的快照,甄选出的场景b快照3为:时间点3,某频道某节目观众切换进来的采样数据组成的快照。For example, for Figure 5A, the sampled data slice of type A user behavior events is: the data of all online viewers watching a certain channel in different time periods, and after performing multiple slice processing on these data on the time axis, a group of Slices sorted by time. Type C user behavior event sampling data slices are: program list data of all live channels broadcasting programs within a certain period of time, after slicing these data three times on the time axis, the obtained groups are sorted according to the cutting time slices. The selected scene a snapshot 1 is: at time point 1, the audience watches a snapshot composed of sampled data of a program on a certain channel, and the selected scene a snapshot 2 is: at time point 2, the audience watches a snapshot composed of sampled data of a certain program on a certain channel Snapshot, the selected scene a snapshot 3 is: at time point 3, a snapshot composed of sampling data of a program watched by a certain channel. For Figure 5B, the sampled data slice of B-type user behavior events is: the data of all online viewers switching to a certain channel in different time periods, and after processing these data slices three times on the time axis, a set of data obtained according to each other is obtained. Time-sorted slices. Type C user behavior event sampling data slices are: the program list data of all live channels broadcasting programs within a certain period of time. After performing multiple slice processing on these data on the time axis, a set of time-ordered data is obtained. slice. The selected scene b snapshot 1 is: time point 1, a snapshot composed of sampled data switched in by viewers of a certain channel and a certain program, and the selected scene b snapshot 2 is: time point 2, the sampled data of a certain channel and certain program viewers switched in The selected snapshot 3 of scene b is: at time point 3, a snapshot composed of sampled data switched in by viewers of a certain program on a certain channel.
步骤S104:按照切割时间顺序,分别将各所述场景快照作为场景模拟模型的输入,从而得到所述场景模拟模型输出的各场景画像。需要说明的是,场景模拟模型是定制模型,可以是一个,也可以是多个,每个模型用于针对一种特征信息进行场景模拟。定制模型的生成根据画像的需要,由选择的各类深度学习算法(如:反向传播算法)和/或非监督学习算法等组合而成。Step S104: According to the order of cutting time, each of the scene snapshots is used as the input of the scene simulation model, so as to obtain the scene portraits output by the scene simulation model. It should be noted that the scene simulation model is a customized model, and there may be one or more than one model, and each model is used to perform scene simulation for a kind of feature information. The generation of the custom model is based on the needs of the portrait, and is composed of various selected deep learning algorithms (such as: backpropagation algorithm) and/or unsupervised learning algorithms.
例如,对于图6A,模拟模型针对场景a的快照中蕴含的特定特征,模拟该特征的概率画像。具体的,场景a画像1为:针对场景a快照1中蕴含的特定特征模拟出的该特征的概率画像,即时间点1及其邻近时域,观众们可能观看某频道某节目的概率预测,场景a画像1与场景a快照1进行比对并调整参数,使得二者的差异缩小;场景a画像2为:针对场景a快照2中蕴含的特定特征模拟出的该特征的概率画像,即时间点2及其邻近时域,观众们可能观看某频道某节目的概率预测,场景a画像2与场景a快照2进行比对并调整参数,使得二者的差异缩小;场景a画像3为:针对场景a快照3中蕴含的特定特征模拟出的该特征的概率画像,即时间点3及其邻近时域,观众们可能观看某频道某节目的概率预测,场景a画像3与场景a快照3进行比对并调整参数,使得二者的差异缩小。以此类推,场景a画像n为:针对场景a快照n中蕴含的特定特征模拟出的该特征的概率画像,即时间点n及其邻近时域,观众们可能观看某频道某节目的概率预测,场景a画像n与场景a快照n进行比对并调整参数,使得二者的差异缩小。For example, for FIG. 6A , the simulation model simulates the probability profile of a specific feature contained in the snapshot of scene a. Specifically, the scene a portrait 1 is: the probability portrait of the specific feature contained in the scene a snapshot 1 simulated, that is, the probability prediction that the audience may watch a certain program on a certain channel at time point 1 and its adjacent time domain, The scene a portrait 1 is compared with the scene a snapshot 1 and the parameters are adjusted to reduce the difference between the two; the scene a portrait 2 is: the probability portrait of the feature simulated for the specific feature contained in the scene a snapshot 2, that is, the time Point 2 and its adjacent time domain, the probability prediction that the audience may watch a certain program on a certain channel, compare the scene a portrait 2 with the scene a snapshot 2 and adjust the parameters, so that the difference between the two is reduced; the scene a portrait 3 is: for The probability profile of the feature simulated by the specific features contained in scene a snapshot 3, that is, time point 3 and its adjacent time domain, the probability prediction that the audience may watch a certain program on a certain channel, scene a portrait 3 and scene a snapshot 3 Compare and adjust the parameters to reduce the difference between the two. By analogy, the scene a portrait n is: the probability portrait of the feature simulated for the specific feature contained in the scene a snapshot n, that is, the probability prediction that the audience may watch a certain program on a certain channel at time point n and its adjacent time domain , the scene a portrait n is compared with the scene a snapshot n and the parameters are adjusted so that the difference between the two is reduced.
又例如,对于图6B,模拟模型对场景b的快照中蕴含的特定特征,模拟该特征的概率画像。具体的,场景b画像1为:针对场景b快照1中蕴含的特定特征模拟出的该特征的概率画像,即时间点1及其邻近时域,观众们可能切换到某频道某节目的概率预测,场景b画像1与场景b快照1进行比对并调整参数,使得二者的差异缩小;场景b画像2为:针对场景b快照2中蕴含的特定特征模拟出的该特征的概率画像,即时间点2及其邻近时域,观众们可能切换到某频道某节目的概率预测,场景b画像1与场景b快照1进行比对并调整参数,使得二者的差异缩小;场景b画像3为:针对场景b快照3中蕴含的特定特征模拟出的该特征的概率画像,即时间点3及其邻近时域,观众们可能切换到某频道某节目的概率预测,场景b画像1与场景b快照1进行比对并调整参数,使得二者的差异缩小。以此类推,场景b画像n为:针对场景b快照n中蕴含的特定特征模拟出的该特征的概率画像,即时间点n及其邻近时域,观众们可能观看某频道某节目的概率预测,场景b画像n与场景b快照n进行比对并调整参数,使得二者的差异缩小。For another example, for FIG. 6B , the simulation model simulates the probability profile of a specific feature contained in the snapshot of scene b. Specifically, the scene b portrait 1 is: the probability portrait of the specific feature contained in the scene b snapshot 1 simulated, that is, the probability prediction that the audience may switch to a certain channel and a certain program at time point 1 and its adjacent time domain , the scene b portrait 1 is compared with the scene b snapshot 1 and the parameters are adjusted to reduce the difference between the two; the scene b portrait 2 is: the probability portrait of the feature simulated for the specific feature contained in the scene b snapshot 2, that is At time point 2 and its adjacent time domain, the probability prediction that the audience may switch to a certain program on a certain channel, the scene b portrait 1 is compared with the scene b snapshot 1 and the parameters are adjusted to reduce the difference between the two; the scene b portrait 3 is : The probability portrait of the feature simulated for the specific feature contained in the snapshot 3 of scene b, that is, the probability prediction that the audience may switch to a certain channel and a certain program at time point 3 and its adjacent time domain, scene b portrait 1 and scene b Snapshot 1 is compared and parameters are adjusted to reduce the difference between the two. By analogy, the scene b profile n is: the probability profile of the feature simulated for the specific feature contained in the scene b snapshot n, that is, the probability prediction that the audience may watch a certain program on a certain channel at time point n and its adjacent time domain , the scene b portrait n is compared with the scene b snapshot n and the parameters are adjusted so that the difference between the two is reduced.
需要说明的是,在另一实施例中,将每个场景快照输入相应的场景模拟进行模型之前,根据用户行为事件的时间戳,先将场景快照中包含的各个用户行为事件数据按照时间先后顺序排序,再依序输入相应的场景模拟模型中。It should be noted that, in another embodiment, before each scene snapshot is input into the corresponding scene simulation model, according to the timestamp of the user behavior event, the user behavior event data contained in the scene snapshot are firstly sorted in chronological order sorted, and then input into the corresponding scene simulation model in sequence.
特别的,当定义了一个或多个用户行为场景时,所述一个或多个用户行为场景构成行为模式,具体来说,单独一个用户行为场景可以构成行为模式,多个用户行为场景组合起来(例如以串联的形式)也可以构成行为模式。图7显示了多个用户行为场景串联构成行为模式的情况,其中,所述行为模式中的某个用户行为场景的场景画像是由所述某个用户行为场景的场景快照和前一个用户行为场景的场景画像共同作为所述场景模拟模型的输入,并经所述场景模拟模型的输出得到的。In particular, when one or more user behavior scenarios are defined, the one or more user behavior scenarios constitute a behavior pattern, specifically, a single user behavior scenario may constitute a behavior pattern, and multiple user behavior scenarios are combined ( For example in series) can also constitute behavioral patterns. Figure 7 shows a situation in which multiple user behavior scenarios are connected in series to form a behavior pattern, wherein the scene portrait of a certain user behavior scenario in the behavior pattern is composed of the scene snapshot of the certain user behavior scenario and the previous user behavior scenario The scene portraits are used as the input of the scene simulation model and obtained through the output of the scene simulation model.
例如,同时考虑2个因素:1)在时间点1及其邻近时域,观众们可能观看某频道某节目的概率,来自场景a画像1;2)在时间点1,观众切换进某频道某节目的实际采样数据,来自场景b快照1。模拟:时间点1及其邻近时域,观众们可能切换到某频道某节目的概率预测。For example, two factors are considered at the same time: 1) at time point 1 and its adjacent time domain, the probability that the audience may watch a certain program on a certain channel comes from scene a portrait 1; 2) at time point 1, the audience switches to a certain channel and certain program Actual sample data for the show, from scene b snapshot 1. Simulation: time point 1 and its adjacent time domain, the probability prediction that viewers may switch to a certain program on a certain channel.
步骤S105:将所述场景画像与预设推荐结果进行匹配,若匹配成功,则将所述预设推荐结果确定为推荐信息。可选的,所述预设推荐结果可以包括多个,所述匹配具体包括:分别计算每个场景画像与每个所述预设推荐结果的相关度,将相关度最大的预设推荐结果确定为最终的推荐信息。Step S105: Match the scene portrait with a preset recommendation result, and if the matching is successful, determine the preset recommendation result as recommendation information. Optionally, the preset recommendation results may include multiple ones, and the matching specifically includes: separately calculating the correlation between each scene portrait and each of the preset recommendation results, and determining the preset recommendation result with the greatest correlation For the final recommendation information.
例如图8所示,由所有的场景模拟所构成的行为模式模拟输出的场景b画像作为了最终的推荐结果,那么,就可以根据某时间点及其邻近时域中观众们可能切换到某频道某节目的概率,来向观众做频道推荐。For example, as shown in Figure 8, the scene b portrait output by the behavior pattern simulation composed of all the scene simulations is used as the final recommendation result, then, the audience can switch to a certain channel according to a certain time point and its adjacent time domain The probability of a program is used to recommend channels to viewers.
请参阅图9,与上述方法实施例原理相似的是,本发明提供一种信息推荐系统900,作为一种软件实现,可以搭载于具有输入、输出、数据处理功能的电子设备上予以执行。由于前述实施例中的各个技术特征可以应用于本系统实施例,因而不再重复赘述。Please refer to FIG. 9 , similar to the principles of the above-mentioned method embodiments, the present invention provides an information recommendation system 900, which is implemented as a software and can be carried on an electronic device with input, output, and data processing functions for execution. Since each technical feature in the preceding embodiments can be applied to this system embodiment, it is not repeated here.
系统900包括:数据切片模块901、场景定义模块902、快照甄选模块903、场景模拟模块904、及匹配模块905。具体的:数据切片模块901按照预设时间间隔切割读入的多个采样数据流以得到多个采样数据切片;场景定义模块902定义一个或多个用户行为场景,每个用户行为场景包括至少一个和/或至少一类用户行为事件;快照甄选模块903分别从每个所述采样数据切片的数据中甄选出符合所述用户行为场景的部分以组成场景快照;场景模拟模块904将所述场景快照作为场景模拟模型的输入,从而得到所述场景模拟模型输出的场景画像;匹配模块905将所述场景画像与预设推荐结果进行匹配,若匹配成功,则将所述预设推荐结果确定为推荐信息。The system 900 includes: a data slicing module 901 , a scene definition module 902 , a snapshot selection module 903 , a scene simulation module 904 , and a matching module 905 . Specifically: the data slicing module 901 cuts the read-in multiple sampling data streams according to preset time intervals to obtain multiple sampling data slices; the scene definition module 902 defines one or more user behavior scenarios, each user behavior scenario includes at least one And/or at least one type of user behavior event; the snapshot selection module 903 selects the part that meets the user behavior scene from the data of each sampled data slice to form a scene snapshot; the scene simulation module 904 takes the scene snapshot As the input of the scene simulation model, the scene portrait output by the scene simulation model is obtained; the matching module 905 matches the scene portrait with the preset recommendation result, and if the matching is successful, the preset recommendation result is determined as the recommendation information.
在一实施例中,所述系统还包括:排序模块,用于在将所述场景快照输入所述场景模拟模型之前,将所述场景快照包含的各个用户行为事件数据按照时间顺序排序,并依序输入所述场景模拟模型。In an embodiment, the system further includes: a sorting module, configured to sort the user behavior event data included in the scene snapshot in chronological order before inputting the scene snapshot into the scene simulation model, and sort input the scenario simulation model in sequence.
综上所述,本发明有效克服了现有技术中的种种缺点而具高度产业利用价值。To sum up, the present invention effectively overcomes various shortcomings in the prior art and has high industrial application value.
上述实施例仅例示性说明本发明的原理及其功效,而非用于限制本发明。任何熟悉此技术的人士皆可在不违背本发明的精神及范畴下,对上述实施例进行修饰或改变。因此,举凡所属技术领域中具有通常知识者在未脱离本发明所揭示的精神与技术思想下所完成的一切等效修饰或改变,仍应由本发明的权利要求所涵盖。The above-mentioned embodiments only illustrate the principles and effects of the present invention, but are not intended to limit the present invention. Anyone skilled in the art can modify or change the above-mentioned embodiments without departing from the spirit and scope of the present invention. Therefore, all equivalent modifications or changes made by those skilled in the art without departing from the spirit and technical ideas disclosed in the present invention should still be covered by the claims of the present invention.
Claims (20)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN2016110308323 | 2016-11-16 | ||
| CN201611030832 | 2016-11-16 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN106777067A true CN106777067A (en) | 2017-05-31 |
Family
ID=58876062
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201611140338.2A Pending CN106777067A (en) | 2016-11-16 | 2016-12-12 | Information recommendation method and system |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN106777067A (en) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109165347A (en) * | 2018-08-20 | 2019-01-08 | 腾讯科技(深圳)有限公司 | Data push method and device, storage medium and electronic device |
| CN109670106A (en) * | 2018-12-06 | 2019-04-23 | 百度在线网络技术(北京)有限公司 | Things recommended method and device based on scene |
| CN111797309A (en) * | 2020-06-19 | 2020-10-20 | 一汽奔腾轿车有限公司 | Vehicle-mounted intelligent recommendation device and method based on travel mode |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102841922A (en) * | 2012-07-04 | 2012-12-26 | 北京国双科技有限公司 | Data collection method and data collection device |
| US20130325858A1 (en) * | 2012-03-07 | 2013-12-05 | Ut-Battelle, Llc | Personalized professional content recommendation |
| CN104036002A (en) * | 2014-06-16 | 2014-09-10 | 深圳市英威诺科技有限公司 | Technical method for intelligently recommending data |
| CN104731861A (en) * | 2015-02-05 | 2015-06-24 | 腾讯科技(深圳)有限公司 | Method and device for pushing multimedia data |
-
2016
- 2016-12-12 CN CN201611140338.2A patent/CN106777067A/en active Pending
Patent Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20130325858A1 (en) * | 2012-03-07 | 2013-12-05 | Ut-Battelle, Llc | Personalized professional content recommendation |
| CN102841922A (en) * | 2012-07-04 | 2012-12-26 | 北京国双科技有限公司 | Data collection method and data collection device |
| CN104036002A (en) * | 2014-06-16 | 2014-09-10 | 深圳市英威诺科技有限公司 | Technical method for intelligently recommending data |
| CN104731861A (en) * | 2015-02-05 | 2015-06-24 | 腾讯科技(深圳)有限公司 | Method and device for pushing multimedia data |
Cited By (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN109165347A (en) * | 2018-08-20 | 2019-01-08 | 腾讯科技(深圳)有限公司 | Data push method and device, storage medium and electronic device |
| CN109165347B (en) * | 2018-08-20 | 2021-03-26 | 腾讯科技(深圳)有限公司 | Data pushing method and device, storage medium and electronic device |
| CN109670106A (en) * | 2018-12-06 | 2019-04-23 | 百度在线网络技术(北京)有限公司 | Things recommended method and device based on scene |
| CN111797309A (en) * | 2020-06-19 | 2020-10-20 | 一汽奔腾轿车有限公司 | Vehicle-mounted intelligent recommendation device and method based on travel mode |
| CN111797309B (en) * | 2020-06-19 | 2024-04-16 | 一汽奔腾轿车有限公司 | Vehicle-mounted intelligent recommendation device and method based on travel mode |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9912974B2 (en) | Shot structure of online video as a predictor of success | |
| CN103377242B (en) | User behavior analysis method, analyzing and predicting method and television program push system | |
| CN110012302A (en) | A kind of network direct broadcasting monitoring method and device, data processing method | |
| US10911821B2 (en) | Utilizing combined outputs of a plurality of recurrent neural networks to generate media consumption predictions | |
| US11064233B2 (en) | Providing service recommendation information on the basis of a device use history | |
| CN106339507B (en) | Streaming Media information push method and device | |
| CN109493888B (en) | Cartoon dubbing method and device, computer-readable storage medium and electronic equipment | |
| EP4297030B1 (en) | Polling questions for a conference call discussion | |
| CN112511854A (en) | Live video highlight generation method, device, medium and equipment | |
| Anwar et al. | Impact of the impairment in 360-degree videos on users VR involvement and machine learning-based QoE predictions | |
| CN109151615B (en) | Video processing method, computer device, and computer storage medium | |
| CN106777067A (en) | Information recommendation method and system | |
| Yang et al. | Personalized video recommendations for shared accounts | |
| CN104935967B (en) | The interest recognition methods of video terminal user a kind of and device | |
| CN110874609B (en) | User clustering method, storage medium, device and system based on user behaviors | |
| CN105763947A (en) | Method for extracting features and interests of smart television users | |
| CN115203577A (en) | Object recommendation method, object recommendation model training method and device | |
| CN115146107A (en) | Video information recommendation method and device, electronic equipment and storage medium | |
| CN113411644B (en) | Sample data processing method, device, server and storage medium | |
| CN118378130B (en) | Video matrix platform data management method and system based on cloud computing | |
| CN119364042B (en) | Video playback method and device | |
| US20250193490A1 (en) | Asynchronous updates for media item access history embeddings | |
| US20240236155A9 (en) | Systems, methods, and computer program products for providing simulator augmented content selection | |
| Li et al. | A study of synchronization deviation between vision and haptic in multi-sensorial extended reality | |
| Chen | The Dynamics of Public Attention to Climate Change in a Hybrid Media System |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170531 |
|
| RJ01 | Rejection of invention patent application after publication |