[go: up one dir, main page]

WO2018165960A1 - Système et procédé de traitement de données de commerce électronique basé sur une valeur de hachage - Google Patents

Système et procédé de traitement de données de commerce électronique basé sur une valeur de hachage Download PDF

Info

Publication number
WO2018165960A1
WO2018165960A1 PCT/CN2017/077007 CN2017077007W WO2018165960A1 WO 2018165960 A1 WO2018165960 A1 WO 2018165960A1 CN 2017077007 W CN2017077007 W CN 2017077007W WO 2018165960 A1 WO2018165960 A1 WO 2018165960A1
Authority
WO
WIPO (PCT)
Prior art keywords
hash
commerce
commerce data
hash value
association list
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2017/077007
Other languages
English (en)
Chinese (zh)
Inventor
李楚斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Xiustyle Brand Communication Co Ltd
Original Assignee
Shenzhen Xiustyle Brand Communication Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Xiustyle Brand Communication Co Ltd filed Critical Shenzhen Xiustyle Brand Communication Co Ltd
Priority to PCT/CN2017/077007 priority Critical patent/WO2018165960A1/fr
Publication of WO2018165960A1 publication Critical patent/WO2018165960A1/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor

Definitions

  • the present invention relates to the field of data processing technologies, and in particular, to a hash value based e-commerce data processing system and method.
  • Data mining is the process of mining effective knowledge from large amounts of data stored in databases, data warehouses, or other information repositories. Data mining extracts implicit, valuable, and understandable information from large amounts of data to guide people's activities. Data mining techniques mainly include association rules, classification rules, cluster analysis and sequence patterns.
  • mobile terminals for example, mobile terminals such as mobile phones, tablet computers, etc.
  • mobile terminals for example, mobile terminals such as mobile phones, tablet computers, etc.
  • the e-commerce data thus resulting in redundancy of e-commerce data on the e-commerce platform e-commerce platform and reducing the access speed of the e-commerce platform website.
  • the main object of the present invention is to provide a hash value based e-commerce data processing system and method, aiming at solving the technical problem of cleaning redundant e-commerce data on an e-commerce platform.
  • the present invention provides a hash value-based e-commerce data processing system, which runs in an e-commerce platform, and the e-commerce platform is communicably connected to a terminal through a network, and the system includes:
  • a calculation module configured to calculate a hash value of each group of e-commerce data on the e-commerce platform
  • a search module configured to find the same hash value from the calculated hash value, and group the same hash value into a group, and the same hash value in each group and the same hash in the group Storage path of e-commerce data corresponding to the Greek value The path is saved in a hash association list;
  • a deleting module configured to select a storage path in each hash association list, and delete the e-commerce data corresponding to the unselected storage path in the hash association list;
  • a modification module configured to modify, in each hash association list, an identifier corresponding to the deleted e-commerce data
  • a receiving module configured to receive a file query instruction sent by the terminal
  • a determining module configured to determine whether a storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list
  • a sending module configured to: when the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list, the e-commerce corresponding to the selected storage path in the corresponding hash association list The data is sent to the terminal.
  • the hash association list includes four fields, and each of the four fields is a data name, a hash value, a storage path, and an identifier.
  • the calculation module calculates a hash value of each group of e-commerce data on the e-commerce platform as follows: traverses e-commerce data on the e-commerce platform, and calculates each group of e-commerce by using a hash algorithm The hash value of the data.
  • the calculation module calculates a hash value of each set of e-commerce data on the e-commerce platform by using a hash algorithm, where the hash algorithm is a SHA256 algorithm.
  • the determining module determines whether the storage path in the file query instruction has a corresponding hash association list as follows: parsing the file query instruction and obtaining a storage path of the e-commerce data to be queried, The storage path of the e-commerce data to be queried is compared with each hash association list in the e-commerce platform. If a hash association list in the e-commerce platform includes a storage path of the e-commerce data to be queried, The hash association list is a hash association list corresponding to the storage path of the e-commerce data to be queried.
  • the present invention further provides a method for processing an e-commerce data based on a hash value, which is applied to an e-commerce platform, wherein the e-commerce platform is communicatively connected to the terminal through a network, and the method includes the following steps:
  • selecting a storage path in each hash association list, and deleting the hash association list is not selected E-commerce data corresponding to the storage path;
  • the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list
  • the e-commerce data corresponding to the selected storage path in the corresponding hash association list is sent to the terminal.
  • the hash association list includes four fields, and each of the four fields is a data name, a hash value, a storage path, and an identifier.
  • the step of calculating a hash value of each group of e-commerce data on the e-commerce platform comprises the steps of: traversing e-commerce data on an e-commerce platform, and calculating each group of e-commerce by using a hash algorithm The hash value of the data.
  • the hash value of each group of e-commerce data on the e-commerce platform is calculated by a hash algorithm, where the hash algorithm is a SHA256 algorithm.
  • the step of determining whether the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list comprises the following steps: parsing the file query instruction and obtaining the electricity to be queried a storage path of the quotient data, comparing the storage path of the e-commerce data to be queried with each hash association list in the e-commerce platform, if there is a hash association list in the e-commerce platform, the s
  • the storage association path of the quotient data, the hash association list is a hash association list corresponding to the storage path of the quotient data to be queried.
  • the present invention adopts the above technical solution, and brings the technical effects as follows:
  • the present invention uses the hash value comparison method to find the same e-commerce data, and deletes repeated e-commerce data, thereby accelerating the access speed of the e-commerce platform. , reducing the redundancy of the e-commerce platform.
  • FIG. 1 is a schematic diagram of an application environment of a hash value-based e-commerce data processing system according to the present invention
  • FIG. 2 is a block diagram showing a preferred embodiment of a hash value based e-commerce data processing system of the present invention
  • 3 is a flow chart of a preferred embodiment of a hash value based e-commerce data processing method of the present invention
  • FIG. 4 is a schematic diagram of a hash association list of the present invention.
  • FIG. 1 is a schematic diagram of an application environment of a hash value-based e-commerce data processing system according to the present invention.
  • the hash value based e-commerce data processing system 30 of the present invention operates on the e-commerce platform 3.
  • the e-commerce platform 3 is communicatively coupled to a plurality of terminals 1 via a network 2.
  • the e-commerce platform 3 stores various types of e-commerce data (e.g., user name, product text information, merchandise type information, courier information, warehousing information, merchandise pictures, shopping records, etc.).
  • the e-commerce data stored by the e-commerce platform 3 is classified into multiple sets of data according to the time division and location, for example, the data generated every second is classified into a group of data.
  • the e-commerce platform 3 is a cluster composed of a plurality of servers.
  • the network 2 may be a wired communication network or a wireless communication network.
  • the network is preferably a wireless communication network, including but not limited to a GSM network, a GPRS network, a CDMA network, a TD-S CDMA network, a WiMAX network, a TD-LTE network, an FDD-LTE network, and the like. .
  • the terminal 1 may be, but not limited to, any other suitable portable touch screen electronic device such as a smart phone, a tablet computer, a personal digital assistant (PDA), a personal computer, an electronic signboard, and the like.
  • the terminal 1 is configured to access the e-commerce platform 3 through the network 2, and obtain e-commerce data on the e-commerce platform 3.
  • the terminal 1 is further configured to generate e-commerce data, and send the generated e-commerce data to the e-commerce platform 3 for storage.
  • FIG. 2 there is shown a block diagram of a preferred embodiment of a hash value based e-commerce data processing system of the present invention.
  • the hash value based e-commerce data processing system 30 is applied to the e-commerce platform 3.
  • the e-commerce platform 3 includes, but is not limited to, a hash value based e-commerce data processing system 30, a storage unit 31, a processing unit 32, and a communication unit 33.
  • the storage unit 31 may be a read only storage unit ROM, an electrically erasable storage unit EEPRO M, flash memory unit FLASH or solid hard disk.
  • the processing unit 32 may be a central processing unit (CPU), a microcontroller (MCU), a data processing chip, or an information processing unit having a data processing function.
  • CPU central processing unit
  • MCU microcontroller
  • data processing chip or an information processing unit having a data processing function.
  • the communication unit 33 is a communication interface with a remote communication function, for example, supports GSM, GPR.
  • the communication interface of wireless communication technologies such as S, WCDMA, CDMA, TD-SCDMA, WiMAX, TD-LTE, FDD-LTE, and the wired communication interface supporting USB and network cable.
  • the data processing system of the electricity supplier based on a hash value 30 include, but are not limited to, the calculation module 312, a searching module 313, a deleting module 314, modification module 315, a receiving module 316, a determining module 317 and a sending module 318.
  • module refers to a series of computer program instruction segments that can be executed by the processing unit 32 of the e-commerce platform 3 and that can perform fixed functions, which are stored in the storage unit of the e-commerce platform 3. 31.
  • the calculation module 312 is configured to calculate a hash value of each group of e-commerce data on the e-commerce platform 3. Specifically, the calculation module 312 traverses the e-commerce data on the e-commerce platform 3 and calculates a hash value of each set of e-commerce data through a hash algorithm.
  • the hash algorithm may be, but is not limited to, the SHA256 algorithm.
  • the searching module 313 is configured to search for the same hash value from the calculated hash values, and group the same hash values into a group, and the same hash value in each group is the same as the same in the group.
  • the storage path of the e-commerce data corresponding to the hash value is stored in a hash association list.
  • the hash association list includes a plurality of identical hash values and a storage path of each hash value corresponding to the e-commerce data. It should be noted that the hash values in the hash association list are the same.
  • the search module 313 obtains multiple sets of identical hash values and generates a hash association list for each set of identical hash values.
  • the hash association list includes at least four fields, and each of the four fields is a name of each group of data (representing the name of the e-commerce data), and a hash value (a representation of the e-commerce data).
  • the value of the storage path (representing the storage path of the e-commerce data on the e-commerce platform 3) and the identification (representing whether the e-commerce data is retained in the e-commerce platform 3, where Y is reserved and N is deleted).
  • the deleting module 314 is configured to select one storage path in each hash association list, and delete the e-commerce data corresponding to the unselected storage paths in the hash association list. In this embodiment, the deleting module 3 14 selects the storage path T1 and deletes the e-commerce data corresponding to the remaining storage paths T2 to T9 in the e-commerce platform 3.
  • the modification module 315 is configured to modify the identifier corresponding to the deleted e-commerce data in each hash association list. Specifically, as shown in FIG. 4, the modification module 315 modifies the identifier corresponding to T2 to T9 in the hash association list from Y to N.
  • the receiving module 316 is configured to receive a file query instruction sent by the terminal 1. Generally, the terminal 1 queries the e-commerce data on the e-commerce platform 3, and the file query instruction includes a storage path of the e-commerce data to be queried.
  • the determining module 317 is configured to determine whether a storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list.
  • the determining module 317 parses the file query instruction and obtains a storage path of the e-commerce data to be queried, and compares the storage path of the e-commerce data to be queried with each hash association list in the e-commerce platform 3. If there is a hash association list in the e-commerce platform 3 that includes the storage path of the e-commerce data to be queried, the hash association list is a hash association list corresponding to the storage path of the e-commerce data to be queried.
  • the sending module 318 is configured to send the e-commerce data corresponding to the selected storage path in the corresponding hash association list to the terminal 1. Specifically, as shown in FIG. 4, if the storage path of the e-commerce data to be queried is T2, the sending module 318 sends the e-commerce data A corresponding to the storage path T1 to the terminal 1.
  • FIG. 3 there is shown a flow chart of a preferred embodiment of the hash value based e-commerce data processing method of the present invention.
  • the hash value-based e-commerce data processing method is applied to the e-commerce platform 3, and the method includes the following steps:
  • Step S12 The calculation module 312 calculates a hash value of each group of e-commerce data on the e-commerce platform 3. Specifically, the calculation module 312 traverses the e-commerce data on the e-commerce platform 3, and calculates a hash value of each set of e-commerce data through a hash algorithm.
  • the hash algorithm may be, but is not limited to, the SHA256 algorithm.
  • Step S13 The searching module 313 searches for the same hash value from the calculated hash values, and groups the same hash values into a group, and sets the same hash value in each group and the group.
  • the storage path of the e-commerce data corresponding to the same hash value is stored in a hash association list.
  • the hash association list includes a plurality of identical hash values and a storage path of each hash value corresponding to the e-commerce data. It should be noted that the hash values in the hash association list are the same.
  • the search module 313 obtains multiple sets of identical hash values and generates a hash association list for each set of identical hash values.
  • the hash association list includes at least four fields, and each of the four fields is a name of each group (representing the name of the e-commerce data).
  • a hash value (representing a hash value of the e-commerce data), a storage path (representing a storage path of the e-commerce data on the e-commerce platform 3), and a logo (indicating whether the e-commerce data is retained in the e-commerce platform 3, wherein Y means reserved, N means delete).
  • Y means reserved, N means delete.
  • the letter B in FIG. 4 merely refers to the hash value, and does not mean that the hash value is B
  • the letters T1 to T9 in FIG. 4 are merely referring to the storage path, and do not indicate T1 to T9 are storage paths.
  • Step S14 The deleting module 314 selects one storage path in each hash association list, and deletes the e-commerce data corresponding to the unselected storage paths in the hash association list.
  • the deletion module 314 selects the storage path T1, and deletes the e-commerce data corresponding to the remaining storage paths ⁇ 2 to ⁇ 9 in the e-commerce platform 3.
  • Step S15 The modification module 315 modifies the identifier corresponding to the deleted e-commerce data in each hash association list. Specifically, as shown in FIG. 4, the modification module 315 modifies the identifier corresponding to ⁇ 2 to ⁇ 9 in the hash association list from ⁇ to ⁇ .
  • Step S16 The receiving module 316 receives the file query instruction sent by the terminal 1. Generally, the terminal 1 queries the e-commerce data on the e-commerce platform 3, and the query instruction includes a storage path of the e-commerce data to be queried.
  • Step S17 The determining module 317 determines whether the storage path of the e-commerce data to be queried in the file query instruction has a corresponding hash association list.
  • the determining module 317 parses the file query instruction and obtains a storage path of the e-commerce data to be queried, and compares the storage path of the e-commerce data to be queried with each hash association list in the e-commerce platform 3. If there is a hash association list in the e-commerce platform 3 that includes the storage path of the e-commerce data to be queried, the hash association list is a hash association list corresponding to the storage path of the e-commerce data to be queried.
  • step S18 If the storage path in the file query instruction has a corresponding hash association list, the process proceeds to step S18. Otherwise, if the storage path in the file query instruction does not have a corresponding hash association list, The e-commerce data in the e-commerce platform 3 is traversed directly through the storage path in the file query command (omitted in FIG. 3), and then the flow is ended.
  • Step S18 The sending module 318 sends the e-commerce data corresponding to the selected storage path in the corresponding hash association list to the terminal 1. Specifically, as shown in FIG. 4, if the storage path of the e-commerce data to be queried is T2, the sending module 318 sends the e-commerce data ⁇ corresponding to the storage path T1 to the terminal 1. Industrial applicability
  • the present invention uses the hash value comparison method to find the same e-commerce data, and deletes the repeated e-commerce data, accelerates the access speed of the e-commerce platform, and reduces the redundancy of the e-commerce platform.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

L'invention concerne un système et un procédé de traitement de données de commerce électronique basé sur une valeur de hachage, le procédé consistant : à calculer la valeur de hachage de chaque groupe de données de commerce électronique au moyen d'un algorithme de hachage; à rechercher la même valeur de hachage, et à la mémoriser avec le chemin de mémorisation de données de commerce électronique qui lui correspondent dans une liste d'associations de hachage; à modifier l'identifiant correspondant à des données de commerce électronique supprimées dans chaque liste d'associations de hachage; à recevoir une instruction d'interrogation de fichier envoyée par un terminal; et, lorsque le chemin de mémorisation de données de commerce électronique à interroger dans l'instruction d'interrogation de fichier a une liste d'associations de hachage correspondante, à envoyer des données de commerce électronique correspondant au chemin de mémorisation sélectionné dans la liste d'associations de hachage correspondante au terminal. La mise en œuvre de la solution ci-décrite peut augmenter la vitesse d'accès à une plateforme de commerce électronique.
PCT/CN2017/077007 2017-03-17 2017-03-17 Système et procédé de traitement de données de commerce électronique basé sur une valeur de hachage Ceased WO2018165960A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/077007 WO2018165960A1 (fr) 2017-03-17 2017-03-17 Système et procédé de traitement de données de commerce électronique basé sur une valeur de hachage

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2017/077007 WO2018165960A1 (fr) 2017-03-17 2017-03-17 Système et procédé de traitement de données de commerce électronique basé sur une valeur de hachage

Publications (1)

Publication Number Publication Date
WO2018165960A1 true WO2018165960A1 (fr) 2018-09-20

Family

ID=63521739

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/077007 Ceased WO2018165960A1 (fr) 2017-03-17 2017-03-17 Système et procédé de traitement de données de commerce électronique basé sur une valeur de hachage

Country Status (1)

Country Link
WO (1) WO2018165960A1 (fr)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101702179A (zh) * 2009-12-01 2010-05-05 百度在线网络技术(北京)有限公司 在数据挖掘中去除重复的方法和设备
CN104199863A (zh) * 2014-08-15 2014-12-10 小米科技有限责任公司 存储设备上的文件的查找方法、装置及路由器
CN106599260A (zh) * 2016-12-21 2017-04-26 深圳市易特科信息技术有限公司 医疗信息化重复医疗文件删除系统及方法

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101702179A (zh) * 2009-12-01 2010-05-05 百度在线网络技术(北京)有限公司 在数据挖掘中去除重复的方法和设备
CN104199863A (zh) * 2014-08-15 2014-12-10 小米科技有限责任公司 存储设备上的文件的查找方法、装置及路由器
CN106599260A (zh) * 2016-12-21 2017-04-26 深圳市易特科信息技术有限公司 医疗信息化重复医疗文件删除系统及方法

Similar Documents

Publication Publication Date Title
JP5721627B2 (ja) データベースストレージスペースを節約するための方法及びシステム
TWI611305B (zh) 識別特徵群體的方法及裝置和搜索方法及裝置
TWI522942B (zh) 用戶收藏夾資料的處理方法與裝置、用戶收藏夾資料的搜尋方法與裝置、及用戶收藏夾系統
US8150813B2 (en) Using relationships in candidate discovery
CN103177075B (zh) 基于知识的实体检测和消歧
CN108804516B (zh) 相似用户查找装置、方法及计算机可读存储介质
CN111242709B (zh) 一种消息推送方法及其装置、设备、存储介质
US8352496B2 (en) Entity name matching
US9720946B2 (en) Efficient storage of related sparse data in a search index
US9405840B2 (en) Using social signals to rank search results
CN114328632A (zh) 基于位图的用户数据分析方法、装置及计算机设备
KR20140139623A (ko) 제품 피드 유사성을 사용하는 스팸 상인 발견
CN104536984A (zh) 一种外包数据库中的空间文本Top-k查询的验证方法及系统
WO2018165968A1 (fr) Système et procédé d'évaluation et de recommandation de commerçant basées sur une recherche de pages web
CN111651749A (zh) 基于密码找回账号的方法、装置、计算机设备及存储介质
CN105187439A (zh) 钓鱼网站检测方法及装置
TWI639093B (zh) Object set and processing method and device thereof
CN109241360B (zh) 组合字符串的匹配方法及装置和电子设备
CN103455637A (zh) 一种车牌检索的方法和装置
CN104750692B (zh) 一种信息处理方法、信息检索方法及其对应的装置
WO2020056976A1 (fr) Procédé, dispositif et programme de classement optimisé de résultats de recherche, et support de stockage lisible par ordinateur
TWI547888B (zh) A method of recording user information and a search method and a server
CN109101630B (zh) 一种应用程序搜索结果的生成方法、装置及设备
WO2018165960A1 (fr) Système et procédé de traitement de données de commerce électronique basé sur une valeur de hachage
JP2020525949A (ja) メディア検索方法及び装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17900618

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17900618

Country of ref document: EP

Kind code of ref document: A1