CN111159559A - Method for constructing recommendation engine according to user requirements and user behaviors - Google Patents
Method for constructing recommendation engine according to user requirements and user behaviors Download PDFInfo
- Publication number
- CN111159559A CN111159559A CN201911409520.7A CN201911409520A CN111159559A CN 111159559 A CN111159559 A CN 111159559A CN 201911409520 A CN201911409520 A CN 201911409520A CN 111159559 A CN111159559 A CN 111159559A
- Authority
- CN
- China
- Prior art keywords
- user
- information
- source
- house
- recommended
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/901—Indexing; Data structures therefor; Storage structures
- G06F16/9027—Trees
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9537—Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9538—Presentation of query results
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The application discloses a method for constructing a recommendation engine according to user requirements and user behaviors, which relates to the technical field of data statistics, and a specific implementation mode of the method comprises the following steps: collecting user information and house source information; the user information comprises user demand information and user behavior information, the user demand information comprises the demand for the house price, the distance to a working place or a subway station and a business circle, and the user behavior information comprises the information of webpage browsing of the user and the renting time of the user; a feedback interface is preset to determine user demand information; obtaining a first recall data source and a second recall data source according to the house source information to generate a first recommended data source, performing feature engineering through a Spark calculation frame to obtain a second recommended data source, and training to obtain a ranking model to generate the first recommended house source; generating a second recommended house source according to the house owner demand; and constructing a recommendation engine. The feedback interface is preset in the implementation mode, so that the accuracy of the user requirement is improved, and the accuracy of the sequencing model is improved.
Description
Technical Field
The application relates to the technical field of data statistics, in particular to a method for constructing a recommendation engine according to user requirements and user behaviors.
Background
With the development of the internet, the interests of users are more and more extensive, and with the change of the environment and living standard of users, the demands of users are also changing. At present, the LBS (Location Based Services) recommendation engine recall and sorting method has the problems in the applicability of the long-rented apartment field:
1. the current long-rent apartment collection user information is generally only according to the online record of the user, and the channel is single, so that the user information is omitted, and the sample information base is lost;
2. the recall mode industry is integrally recall by inverted indexes and user behaviors, and does not consider the properties of bulk objects such as high-quality plot prices and the like of high-quality public houses;
3. the general recommendation rough arrangement stage in the rough arrangement layer is not optimized for a high-value sparse target scene, a long-rented apartment scene needs multilayer rough arrangement, LBS and value matching strategies need to be carried out for matching of strong intentions of clients and key attributes of target objects, and matching strategies need to be carried out for secondary requirements of users and secondary house states and district characteristics of houses;
4. the ranking layer industry generally bases on the user online behavior to do the learning ToRank model ranking of pair-wise or list-wise. However, in a long-rented apartment, the conditions of on-line feedback of users and the like need to be considered, the sequencing model is more complex, multi-level sample weight depiction needs to be performed on the basis of the sequencing model, and meanwhile, the scenes such as areas, house states and the like need to be deeply subdivided on the basis of the high sparsity of the samples and the large-volume attribute of transactions, so that the recommendation quality is improved;
5. in the interference adjustment layer industry, accurate advertisement insertion is generally performed based on a bidding ranking mode, modeling is not performed based on a specific LBS mode of a long rental apartment, and the recommendation effect is not ideal.
Disclosure of Invention
In view of this, an embodiment of the present invention provides a method for constructing a recommendation engine according to user requirements and user behaviors, including the steps of:
collecting user information and house source information;
the user information comprises user demand information and user behavior information, the user demand information comprises the requirements of a user on room price, a distance workplace or a subway station and a business circle, and the user behavior information comprises user browsing webpage information and user renting time; presetting a feedback interface of the user demand information, wherein the feedback interface displays at least one keyword serving as a node of a corresponding tree structure; all levels of nodes of the tree structure are keywords of the user demand information; when a search instruction aiming at the keyword currently displayed by the feedback interface by the user is acquired, determining a target keyword corresponding to the search instruction as user demand information;
the house source information is recalled according to the user demand information, and the first recall data source is obtained through filtering and formatting of a data collector;
the house source information is divided into real-time user behavior information and historical user behavior information according to the user behavior information; the real-time user behavior information is examined and checked, and the real-time user behavior information enters an online database in real time according to a distributed stream processing open source framework; generating a log from the historical user behavior information, and inputting the log into an offline data warehouse in batches through a Spark frame calculation engine; generating a second recall data source for the online database and the offline data repository;
carrying out price matching and section feature matching on the first recall data source and the second recall data source according to LBS service to obtain corresponding matching degrees, and sequentially sequencing from high to low according to the matching degrees to generate a first recommended data source, wherein the first recommended data source comprises an online data source and an offline data source;
performing feature engineering processing on the first recommended data source through a Spark distributed framework, and grading to obtain a second recommended data source;
training the second recommended data source to obtain a ranking model;
inputting the house source information into the sequencing model to obtain the grading sequencing condition of the house source information;
setting a scoring threshold value, wherein the house source with the score of the house source information larger than the scoring threshold value is a first recommended house source;
meanwhile, generating a second recommended house source according to the house owner demand;
and constructing a recommendation engine according to the first recommended house source and the second recommended house source.
Preferably, the price of the first recommended source matches the price of the second source.
Preferably, the user information is collected by means of online, offline and electrical pinning.
Preferably, the first recalled data source is recalled by way of a search.
Preferably, the feature engineering process includes sequentially performing aggregation, refining, screening, and completion on the first recommended data source to obtain the second recommended data source.
Preferably, the inputting the room source information into the ranking model to obtain the ranking condition of the scores of the room source information includes:
and carrying out weight depiction on the house source information, and carrying out deep division on the scenes such as regions, house states and the like according to the sparsity and the transaction attribute of the house source information to obtain the grading and sequencing conditions of the house source information.
Preferably, the recommendation engine further comprises a hotspot source.
Compared with the prior art, the method for constructing the recommendation engine according to the user requirements and the user behaviors, provided by the invention, has the following beneficial effects that:
1. according to the method for constructing the recommendation engine according to the user requirements and the user behaviors, the first recommendation data source is subjected to feature engineering processing through the Spark distributed framework to obtain the second recommendation data source, and the response time of the obtained result is favorably shortened.
2. According to the method for constructing the recommendation engine according to the user requirements and the user behaviors, the feedback interface of the preset user requirement information reduces the probability of wrong selection and selection omission of the user due to numerous keywords corresponding to the user requirements.
3. The method for constructing the recommendation engine according to the user requirements and the user behaviors, provided by the invention, has rich information acquisition channels, including user online APP records, offline watching and communication and electric marketing modes; a large amount of user information is collected, an integral sample model is generated, and the bulk object attributes such as high-quality parcel prices of high-quality long-rent apartments are covered.
4. According to the method for constructing the recommendation engine according to the user requirements and the user behaviors, the personalized recommendation mode is optimized, and apartments with high recommendation matching degree are extracted from the sample model library through comprehensive measurement and calculation of the user requirements and behaviors.
5. According to the method for constructing the recommendation engine according to the user requirements and the user behaviors, the recommendation sequencing model is optimized, a new matching strategy is formulated for the conditions of online and offline watching, signing and the like, the sequencing model is subjected to multilevel sample weight portrayal, and the complexity of the existing sequencing model is reduced; meanwhile, scenes such as regions, house states and the like are deeply subdivided according to the high sparsity of samples and the large-amount attribute of transaction, and the quality of the recommended apartment is improved.
6. The method for constructing the recommendation engine according to the user requirements and the user behaviors optimizes an intervention adjustment layer, models based on a specific LBS (location based service) mode of the long-rented apartment, and designs the special recommendation engine of the long-rented apartment with bulk commodities with sparsely distributed landmass.
The technical solution of the present invention is further described in detail by the accompanying drawings and embodiments.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the application and together with the description serve to explain the application and not to limit the application. In the drawings:
FIG. 1 is a schematic flow chart of a method of constructing a recommendation engine based on user needs and user behavior in accordance with the present invention;
FIG. 2 is a schematic flow chart of another method for constructing a recommendation engine according to user requirements and user behaviors.
Detailed Description
The technical solution in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. It should be noted that the described embodiments are merely some embodiments, rather than all embodiments, of the invention and are merely illustrative in nature and in no way intended to limit the invention, its application, or uses. The protection scope of the present application shall be subject to the definitions of the appended claims.
Example 1:
fig. 1 is a schematic flowchart of a method for building a recommendation engine according to user requirements and user behaviors according to the present invention, and as shown in fig. 1, the method for building a recommendation engine according to user requirements and user behaviors provided by this embodiment includes the steps of:
step 101, collecting user information and house source information;
the user information comprises user demand information and user behavior information, the user demand information comprises the demand of a user on the room price, the distance work place or the subway station and the business circle, and the user behavior information comprises the information of browsing web pages of the user and the renting time of the user; presetting a feedback interface of user demand information, wherein the feedback interface displays at least one keyword serving as a node of a corresponding tree structure; all levels of nodes of the tree structure are keywords of user demand information; when a search instruction aiming at the keyword currently displayed by the feedback interface by the user is acquired, determining a target keyword corresponding to the search instruction as user demand information;
it can be understood that the keywords of the feedback interface are equivalent to the first keywords, and the keywords matched with the user search belong to the second keywords; when the second keywords are not collected, the first keywords can be adopted for continuing, and the feedback interface of the preset user requirement information reduces the probability of wrong selection and selection omission of the user due to numerous keywords of the keywords corresponding to the user requirement.
In step 101, user information may be collected by online, offline, and electrical pinning. It can be understood that, in the first online acquisition mode, according to browsing and searching records of a user on a webpage and an APP, user intention information is acquired; the second is an offline acquisition mode, which acquires the intention information of the user through the carrying, signing and communication of a specially-assigned person; the third is a mode of electric marketing, which is used for determining the intention of the user through communication with the user by telephone; the method is characterized by enriching an information acquisition channel, acquiring a large amount of user information, generating an integral sample model, and covering the attributes of a large number of targets such as the price of a high-quality land parcel owned by a high-quality long-rent apartment.
Step 102, recalling house source information according to user demand information, and filtering and formatting through a data collector Logstash to obtain a first recall data source;
the house source information is divided into real-time user behavior information and historical user behavior information according to the user behavior information; the real-time user behavior information is examined and checked, and the information enters an online database in real time according to a distributed stream processing open source framework (Flink); generating a log from the historical user behavior information, and inputting the log into an offline data warehouse HIVE in batches through a Spark frame calculation engine; generating a second recall data source by the online database and the offline data warehouse HIVE;
in step 102, optionally, user requirement feedback is collected through user filling or inquiry, and the search mode is recalled, so that the technology can adopt an ELK framework to ensure the real-time performance of the index establishment and query function executing MySQL query and the expandability of the recommendation engine. The ELK framework is Elasticisearch, Logstash, and Kibana is a set of solution for real-time data collection, storage, indexing, retrieval, statistical analysis and visualization. MySQL is a relational database management system.
In step 102, optionally, the real-time behavior can capture the current intention of the user through the recall of the historical behavior and the real-time behavior respectively. Wherein, the Flink executes any stream data program in a data parallel and pipeline mode, and the pipeline runtime system of the Flink can execute batch processing and stream processing programs. Spark is a Hadoop MapReduce-like universal parallel framework sourced by UC Berkeley AMP lab (AMP laboratories, burkeley, university, ca). HIVE is a data warehouse tool based on Hadoop, is used for data extraction, transformation and loading, and is a mechanism capable of storing, inquiring and analyzing large-scale data stored in Hadoop.
103, performing price matching and section feature matching on the first recall data source and the second recall data source according to LBS service to obtain corresponding matching degrees, and sequentially sequencing from high to low according to the matching degrees to generate a first recommended data source, wherein the first recommended data source comprises an online data source and an offline data source;
in step 103, the first recall data source and the second recall data source are sorted according to the LBS service price matching strategy and the segment characteristics matching strategy to generate a first recommended data source.
And further, an LBS and value matching strategy is carried out for matching the strong intention of the client and the key attributes of the target objects, and a matching strategy is carried out for the secondary requirements of the user and the characteristics of the secondary house state and the section of the house, so that a rough sequencing result is obtained. Optimizing an individualized recommendation mode, comprehensively measuring and calculating the requirements and behaviors of the users, and extracting high-quality apartments with high recommendation matching degree from a sample model library. Meanwhile, a recommended sequencing model can be optimized, a new matching strategy is formulated for the conditions of online and offline watching, signing and the like, multi-level sample weight depiction is carried out on the sequencing model, and the complexity of the existing sequencing model is reduced; meanwhile, scenes such as regions, house states and the like are deeply subdivided according to the high sparsity of samples and the large-amount attribute of transaction, and the quality of the recommended apartment is improved.
In the above steps, it can be understood that the user information correspondingly generates a first recall data source, a second recall data source and a third recall data source through the user demand information, the user behavior information and the user portrait information house source information, and processes the first recall data source, the second recall data source and the third recall data source to generate a first recommended data source; some user information with relevancy can be recalled from massive user information and roughly sorted, the subsequent fine sorting can be carried out, the workload of work is reduced, and the response time of the obtained result is favorably prolonged.
104, performing feature engineering processing on the first recommended data source through a Spark distributed framework, and grading to obtain a second recommended data source;
it is to be understood that since the first recommended data source includes an online data source and an offline data source; the Spark distributed framework comprises an offline calculation module and an online calculation module, the online data source of the first recommended data source is processed by the Spark distributed framework and comprises the online calculation module, and the offline data source of the first recommended data source is processed by the Spark distributed framework and comprises the offline calculation module, so that the response time of the obtained result can be shortened.
105, training a second recommended data source to obtain a ranking model;
step 106, inputting the house source information into a ranking model to obtain the scoring ranking condition of the house source information;
in step 106, the house source information is further subjected to weight portrayal, and scenes such as areas, house states and the like are deeply divided according to the sparsity and the transaction attributes of the house source information to obtain the ranking score of the house source information.
Step 107, setting a score threshold value, wherein the house source with the score of the house source information larger than the score threshold value is a first recommended house source;
in steps 104-107, it is understood that the feature engineering process includes aggregating, refining, screening, and complementing the first recommended data source to obtain a second recommended data source. And further training a corresponding sorting model after the first recommended data source is collected and processed through characteristic engineering measurement and calculation of a Spark frame, performing weight portrayal on sample data, performing deep division on scenes such as regions, house states and the like according to the high sparsity of data information and the large-amount attribute of transaction, giving accurate sorting scores of house source information, and finally recommending high-quality long-rented apartment resources to users.
Step 108, generating a second recommended house source according to the demands of the homeowners;
and step 109, constructing a recommendation engine according to the first recommended house source and the second recommended house source. Optionally, the recommendation engine further includes a hotspot source. The hot house source can be a house source with a rental rate of more than 90%, and preferably, the hot house source can be a house source with a rental rate of more than 90% near a subway station, a commercial district or a city center.
In steps 108-109, it can be understood that after modeling based on LBS specific to the long rental apartment, the system inserts some specific house sources into specific positions in the recommendation list according to customized requirements of the company such as the out-of-house tilt policy, the operation activity policy, the advertisement promotion, etc., and recommends more selection spaces for the user; and if the recommendation result is insufficient, the system automatically recommends the completion of the related hot spot competitive house resources. And optimizing an intervention adjustment layer, modeling based on a specific LBS mode of the long-renting apartment, and designing a characteristic recommendation engine of the long-renting apartment with large commodities with sparsely distributed plots.
The price of the first recommended house source is matched with the price of the second house source, and it can be understood that the first recommended house source belongs to a user demand house source, the second recommended house source belongs to a house source which is expected to be rented and recommended by a long-renting apartment, the price of the first recommended house source is matched with the price of the second house source, and the price matching can improve the probability that the user selects the second recommended house source, so that the income of the long-renting apartment is facilitated.
Example 2:
fig. 2 is a schematic flow chart of another method for building a recommendation engine according to user requirements and user behaviors according to the present invention, and as shown in fig. 2, the present embodiment provides another method for building a recommendation engine according to user requirements and user behaviors, which includes the steps of:
step 201, user information acquisition:
the information collection method mainly includes three methods: the method comprises the steps that a first online acquisition mode is adopted, and user intention information is obtained according to browsing search records of a user on a webpage and an APP; the second is an offline acquisition mode, which acquires the intention information of the user through the carrying, signing and communication of a specially-assigned person; the third is a mode of electric marketing, which is used for determining the intention of the user through communication with the user by telephone; constructing a sample model of the information acquired by all channels, storing the sample model in a database and facilitating retrieval; enriching collected information channels, including on-line APP recording, off-line communication with watching and selling modes of a user; a large amount of user information is collected, an integral sample model is generated, and the attributes of large objects such as high-quality parcel prices of high-quality long-rent apartments are covered.
Step 202, recall:
after a user searches for information of a long rental apartment near 'XX prefecture', firstly, user requirements are acquired through a sample database: presetting a feedback interface of the user demand information, wherein the feedback interface displays at least one keyword serving as a node of a corresponding tree structure; all levels of nodes of the tree structure are keywords of the user demand information; when a search instruction aiming at the keyword currently displayed by the feedback interface by the user is acquired, determining a target keyword corresponding to the search instruction as user demand information; recalling according to the requirements of users: the system selects data information of a long rental apartment near the XX first house from a MySQL house source library, transmits the data to a data collector Logstash for filtering and formatting, and then stores the data in an ES database;
further, collected data information is collected and carefully selected, then multi-layer rough sequencing is carried out on the long-rent apartment scenes, LBS and value matching strategies are carried out aiming at the matching of the strong intentions of the clients and the key attributes of the target objects, and matching strategies are carried out on the secondary requirements of the users and the secondary house state and the district characteristics of the house, so that a rough sequencing result is obtained; a new matching strategy is formulated for the conditions of on-line and off-line watching, signing and the like, and a multi-level sample weight depiction is carried out on the sequencing model, so that the complexity of the existing sequencing model is reduced; meanwhile, scenes such as regions, house states and the like are deeply subdivided according to the high sparsity of samples and the large-amount attribute of transaction, and the quality of the recommended apartment is improved.
It can be understood that the keywords of the feedback interface are equivalent to the first keywords, and the keywords matched with the user search belong to the second keywords; when the second keywords are not collected, the first keywords can be adopted for continuing, and the feedback interface of the preset user requirement information reduces the probability of wrong selection and selection omission of the user due to numerous keywords of the keywords corresponding to the user requirement.
Step 203, sorting:
for recalled house source data rough ordering information, after comprehensive collection of historical behaviors, demand preference information and house state plot and circle images of a user are processed through characteristic engineering measurement and calculation of a Spark distributed framework, a corresponding training and constructed ordering model is respectively carried out on online data and offline data, weight depiction is carried out on sample data, scenes such as regions, house states and the like are deeply divided according to the high sparsity of the data information and the large-volume attribute of transaction, accurate ordering scoring of the house source information is given, and high-quality long-rental apartment resources are finally recommended to the user;
step 204, intervention and adjustment:
after modeling based on the specific LBS of the long rental apartment, the system inserts some specific house sources into specific positions in a recommendation list according to customized requirements of a company such as a house-leaving inclined policy, an operation activity policy, advertisement promotion and the like, and recommends more selection spaces for users; and if the recommendation result is insufficient, the system automatically recommends the completion of the related hot spot competitive house resources. And optimizing an intervention adjustment layer, modeling based on a specific LBS mode of the long-renting apartment, and designing a characteristic recommendation engine of the long-renting apartment with large commodities with sparsely distributed plots.
It can be understood that the recall method of the present invention may be based on any one of the user requirement information or the behavior information, or may be based on both of the above two methods. The embodiment only shows the recall according to the requirements of the user, and the embodiment does not make specific requirements on the recall mode and can be set according to actual conditions.
Example 3:
with continued reference to fig. 2, the present embodiment provides a method for building a recommendation engine according to user requirements and user behaviors, which includes the steps of:
step 301, user information acquisition:
the information collection method mainly includes three methods: the method comprises the steps that a first online acquisition mode is adopted, and user intention information is obtained according to browsing search records of a user on a webpage and an APP; the second is an offline acquisition mode, which acquires the intention information of the user through the carrying, signing and communication of a specially-assigned person; the third is a mode of electric marketing, which is used for determining the intention of the user through communication with the user by telephone; constructing a sample model of the information acquired by all channels, storing the sample model in a database and facilitating retrieval; enriching collected information channels, including on-line APP recording, off-line communication with watching and selling modes of a user; a large amount of user information is collected, an integral sample model is generated, and the attributes of large objects such as high-quality parcel prices of high-quality long-rent apartments are covered.
Step 302, recall:
after a user searches for information of a long rental apartment near 'XX prefecture', user behavior information is acquired through a sample database: recalling according to the user line information: the real-time behavior of the user is subjected to data cleaning and enters a database in real time through a Flink processing framework; historical behavior logs are generated for each behavior of the user and collected, and the collected historical behavior logs are input into a HIVE offline data warehouse in batches through a Spark calculation engine; synchronously storing the warehousing data into a Redis real-time database;
collecting and carefully selecting the acquired data information, then carrying out multilayer rough sequencing on the long-rent apartment scenes, carrying out LBS and value matching strategies aiming at the matching of the strong intentions of the clients and the key attributes of the target objects, and carrying out matching strategies on the secondary requirements of the users, the secondary house state and the district characteristics of the house to obtain a rough sequencing result; a new matching strategy is formulated for the conditions of on-line and off-line watching, signing and the like, and a multi-level sample weight depiction is carried out on the sequencing model, so that the complexity of the existing sequencing model is reduced; meanwhile, scenes such as regions, house states and the like are deeply subdivided according to the high sparsity of samples and the large-amount attribute of transaction, and the quality of the recommended apartment is improved.
Step 303, sorting:
for recalled house source data rough sorting information, after comprehensively acquiring historical behaviors, demand preference information and house state plot and business circle images of a user and performing characteristic engineering measurement and calculation processing on a Spark distributed framework, respectively performing corresponding training on online data and offline data to construct a sorting model, performing weight portrayal on sample data, performing deep division on scenes such as regions, house states and the like according to the high sparsity of data information and the large-volume attribute of transaction, giving accurate sorting scoring of house source information, and finally recommending high-quality long-rental apartment resources to the user;
step 304, intervention and adjustment:
after modeling based on the specific LBS of the long rental apartment, the system inserts some specific house sources into specific positions in a recommendation list according to customized requirements of a company such as a house-leaving inclined policy, an operation activity policy, advertisement promotion and the like, and recommends more selection spaces for users; and if the recommendation result is insufficient, the system automatically recommends the completion of the related hot spot competitive house resources. And optimizing an intervention adjustment layer, modeling based on a specific LBS mode of the long-renting apartment, and designing a characteristic recommendation engine of the long-renting apartment with large commodities with sparsely distributed plots.
It can be understood that the recall method of the present invention may be based on any one of the user requirement information or the behavior information, or may be based on both of the above two methods. The embodiment only shows that the recall is performed according to the user behavior, and the embodiment does not make specific requirements on the recall mode and can be set according to actual conditions.
According to the embodiments, the application has the following beneficial effects:
1. according to the method for constructing the recommendation engine according to the user requirements and the user behaviors, the first recommendation data source is subjected to feature engineering processing through the Spark distributed framework to obtain the second recommendation data source, and the response time of the obtained result is favorably shortened.
2. According to the method for constructing the recommendation engine according to the user requirements and the user behaviors, the feedback interface of the preset user requirement information reduces the probability of wrong selection and selection omission of the user due to numerous keywords corresponding to the user requirements.
3. The method for constructing the recommendation engine according to the user requirements and the user behaviors, provided by the invention, has rich information acquisition channels, including user online APP records, offline watching and communication and electric marketing modes; a large amount of user information is collected, an integral sample model is generated, and the bulk object attributes such as high-quality parcel prices of high-quality long-rent apartments are covered.
4. According to the method for constructing the recommendation engine according to the user requirements and the user behaviors, the personalized recommendation mode is optimized, and apartments with high recommendation matching degree are extracted from the sample model library through comprehensive measurement and calculation of the user requirements and behaviors.
5. According to the method for constructing the recommendation engine according to the user requirements and the user behaviors, the recommendation sequencing model is optimized, a new matching strategy is formulated for the conditions of online and offline watching, signing and the like, the sequencing model is subjected to multilevel sample weight portrayal, and the complexity of the existing sequencing model is reduced; meanwhile, scenes such as regions, house states and the like are deeply subdivided according to the high sparsity of samples and the large-amount attribute of transaction, and the quality of the recommended apartment is improved.
6. The method for constructing the recommendation engine according to the user requirements and the user behaviors optimizes an intervention adjustment layer, models based on a specific LBS (location based service) mode of the long-rented apartment, and designs the special recommendation engine of the long-rented apartment with bulk commodities with sparsely distributed landmass.
While the present invention has been described in detail with reference to the drawings and examples, it is to be understood that the foregoing examples are for illustrative purposes only and are not intended to limit the scope of the present invention. Although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that various changes may be made and equivalents may be substituted for elements thereof. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention. The scope of the invention is defined by the appended claims.
Claims (7)
1. A method for constructing a recommendation engine according to user requirements and user behaviors, comprising the steps of:
collecting user information and house source information;
the user information comprises user demand information and user behavior information, the user demand information comprises the requirements of a user on room price, a distance workplace or a subway station and a business circle, and the user behavior information comprises user browsing webpage information and user renting time; presetting a feedback interface of the user demand information, wherein the feedback interface displays at least one keyword serving as a node of a corresponding tree structure; all levels of nodes of the tree structure are keywords of the user demand information; when a search instruction aiming at the keyword currently displayed by the feedback interface by the user is acquired, determining a target keyword corresponding to the search instruction as user demand information;
the house source information is recalled according to the user demand information, and the first recall data source is obtained through filtering and formatting of a data collector;
the house source information is divided into real-time user behavior information and historical user behavior information according to the user behavior information; the real-time user behavior information is examined and checked, and the real-time user behavior information enters an online database in real time according to a distributed stream processing open source framework; generating a log from the historical user behavior information, and inputting the log into an offline data warehouse in batches through a Spark frame calculation engine; generating a second recall data source for the online database and the offline data repository;
carrying out price matching and section feature matching on the first recall data source and the second recall data source according to LBS service to obtain corresponding matching degrees, and sequentially sequencing from high to low according to the matching degrees to generate a first recommended data source, wherein the first recommended data source comprises an online data source and an offline data source;
performing feature engineering processing on the first recommended data source through a Spark distributed framework, and grading to obtain a second recommended data source;
training the second recommended data source to obtain a ranking model;
inputting the house source information into the sequencing model to obtain the grading sequencing condition of the house source information;
setting a scoring threshold value, wherein the house source with the score of the house source information larger than the scoring threshold value is a first recommended house source;
meanwhile, generating a second recommended house source according to the house owner demand;
and constructing a recommendation engine according to the first recommended house source and the second recommended house source.
2. The method of claim 1, wherein the price of the first recommended source matches the price of the second source.
3. The method of claim 1, wherein the user information is collected online, offline, and telemarketing.
4. The method of claim 1, wherein the first recalled data source is recalled by searching.
5. The method of claim 1, wherein the feature engineering process comprises subjecting the first recommended data source to aggregation, refinement, screening, and completion in sequence to obtain the second recommended data source.
6. The method of claim 1, wherein the step of inputting the house source information into the ranking model to obtain the ranking score of the house source information comprises:
and carrying out weight depiction on the house source information, and carrying out deep division on the scenes such as regions, house states and the like according to the sparsity and the transaction attribute of the house source information to obtain the grading and sequencing conditions of the house source information.
7. The method of claim 1, wherein the recommendation engine further comprises a hotspot source.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201911409520.7A CN111159559A (en) | 2019-12-31 | 2019-12-31 | Method for constructing recommendation engine according to user requirements and user behaviors |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201911409520.7A CN111159559A (en) | 2019-12-31 | 2019-12-31 | Method for constructing recommendation engine according to user requirements and user behaviors |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN111159559A true CN111159559A (en) | 2020-05-15 |
Family
ID=70559900
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201911409520.7A Withdrawn CN111159559A (en) | 2019-12-31 | 2019-12-31 | Method for constructing recommendation engine according to user requirements and user behaviors |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN111159559A (en) |
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN111915329A (en) * | 2020-07-30 | 2020-11-10 | 上海数策软件股份有限公司 | Personalized recommendation method and system based on after-sale scenes in automobile industry |
| CN112487310A (en) * | 2020-10-23 | 2021-03-12 | 贝壳技术有限公司 | Cell recommendation method and system |
| CN116719911A (en) * | 2023-08-10 | 2023-09-08 | 成都不烦智能科技有限责任公司 | Automatic flow generation method, device, equipment and storage medium |
| CN118606558A (en) * | 2024-06-19 | 2024-09-06 | 北京寓享科技有限公司 | Apartment rental matching recommendation method and system based on big data |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102279851A (en) * | 2010-06-12 | 2011-12-14 | 阿里巴巴集团控股有限公司 | Intelligent navigation method, device and system |
| US20130124234A1 (en) * | 2011-11-10 | 2013-05-16 | Stubhub, Inc. | Intelligent seat recommendation |
| CN105512118A (en) * | 2014-09-22 | 2016-04-20 | 珠海金山办公软件有限公司 | User demand feedback method and device |
| CN108376341A (en) * | 2018-01-10 | 2018-08-07 | 链家网(北京)科技有限公司 | A kind of localization method and system of source of houses client |
| CN109377329A (en) * | 2018-12-25 | 2019-02-22 | 北京时光荏苒科技有限公司 | A kind of source of houses recommended method, device, storage medium and electronic equipment |
-
2019
- 2019-12-31 CN CN201911409520.7A patent/CN111159559A/en not_active Withdrawn
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN102279851A (en) * | 2010-06-12 | 2011-12-14 | 阿里巴巴集团控股有限公司 | Intelligent navigation method, device and system |
| US20130124234A1 (en) * | 2011-11-10 | 2013-05-16 | Stubhub, Inc. | Intelligent seat recommendation |
| CN105512118A (en) * | 2014-09-22 | 2016-04-20 | 珠海金山办公软件有限公司 | User demand feedback method and device |
| CN108376341A (en) * | 2018-01-10 | 2018-08-07 | 链家网(北京)科技有限公司 | A kind of localization method and system of source of houses client |
| CN109377329A (en) * | 2018-12-25 | 2019-02-22 | 北京时光荏苒科技有限公司 | A kind of source of houses recommended method, device, storage medium and electronic equipment |
Non-Patent Citations (2)
| Title |
|---|
| 张海华 * |
| 段志强;蔡为;余晓敏;秦昆;黄长青;郭涛;徐源泉;: "面向规划管理的众包数据接入与存储处理方法" * |
Cited By (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN111915329A (en) * | 2020-07-30 | 2020-11-10 | 上海数策软件股份有限公司 | Personalized recommendation method and system based on after-sale scenes in automobile industry |
| CN112487310A (en) * | 2020-10-23 | 2021-03-12 | 贝壳技术有限公司 | Cell recommendation method and system |
| CN112487310B (en) * | 2020-10-23 | 2023-02-17 | 贝壳技术有限公司 | Cell recommendation method and system |
| CN116719911A (en) * | 2023-08-10 | 2023-09-08 | 成都不烦智能科技有限责任公司 | Automatic flow generation method, device, equipment and storage medium |
| CN116719911B (en) * | 2023-08-10 | 2023-10-31 | 成都不烦智能科技有限责任公司 | Automatic flow generation method, device, equipment and storage medium |
| CN118606558A (en) * | 2024-06-19 | 2024-09-06 | 北京寓享科技有限公司 | Apartment rental matching recommendation method and system based on big data |
| CN118606558B (en) * | 2024-06-19 | 2024-12-24 | 北京寓享科技有限公司 | Apartment rental matching recommendation method and system based on big data |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN110442790B (en) | Method, device, server and storage medium for recommending multimedia data | |
| CN112632405B (en) | Recommendation method, recommendation device, recommendation equipment and storage medium | |
| US20170140038A1 (en) | Method and system for hybrid information query | |
| CN112232909A (en) | Business opportunity mining method based on enterprise portrait | |
| CN111159561A (en) | Method for constructing recommendation engine according to user behaviors and user portrait | |
| CN111143689A (en) | Method for constructing recommendation engine according to user requirements and user portrait | |
| CN111159559A (en) | Method for constructing recommendation engine according to user requirements and user behaviors | |
| CN114764479A (en) | Personalized news recommendation method based on user behaviors in news scene | |
| CN106557558A (en) | A kind of data analysing method and device | |
| CN111161034A (en) | Method for constructing recommendation engine based on LBS (location based service) renting scene | |
| CN110968801A (en) | Real estate product searching method, storage medium and electronic device | |
| CN118747235B (en) | A library book push method and system based on behavior analysis | |
| CN115115407A (en) | Intelligent recommendation method and system for urban integrated intelligent service platform | |
| CN114547453A (en) | A general TOP-N intelligent recommendation system and method | |
| CN112669113A (en) | Product recommendation method and device, storage medium and electronic device | |
| CN104679827A (en) | Big data-based public information association method and mining engine | |
| CN110795613B (en) | Commodity searching method, device and system and electronic equipment | |
| CN108647729B (en) | User portrait acquisition method | |
| CN116049543A (en) | Hybrid recommendation method, system and storage medium for comprehensive energy efficiency service business | |
| CN118861207A (en) | User behavior analysis method, device and electronic device based on multi-channel data | |
| CN119003840A (en) | Government service recommendation method and system based on user behavior analysis | |
| CN119130586A (en) | An e-commerce customer service method and system based on AI | |
| CN114880588A (en) | News popularity prediction method based on knowledge graph | |
| CN118260484A (en) | Recommendation problem acquisition method and device, electronic equipment and medium | |
| Abdallah et al. | A data collection quality model for big data systems |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| WW01 | Invention patent application withdrawn after publication | ||
| WW01 | Invention patent application withdrawn after publication |
Application publication date: 20200515 |