[go: up one dir, main page]

WO2017000761A1 - Procédé et appareil d'extraction d'informations caractéristiques de dispositif terminal - Google Patents

Procédé et appareil d'extraction d'informations caractéristiques de dispositif terminal Download PDF

Info

Publication number
WO2017000761A1
WO2017000761A1 PCT/CN2016/085592 CN2016085592W WO2017000761A1 WO 2017000761 A1 WO2017000761 A1 WO 2017000761A1 CN 2016085592 W CN2016085592 W CN 2016085592W WO 2017000761 A1 WO2017000761 A1 WO 2017000761A1
Authority
WO
WIPO (PCT)
Prior art keywords
feature information
terminal device
string
feature
library file
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2016/085592
Other languages
English (en)
Chinese (zh)
Inventor
李燕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
ZTE Corp
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp filed Critical ZTE Corp
Publication of WO2017000761A1 publication Critical patent/WO2017000761A1/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/08Configuration management of networks or network elements
    • H04L41/085Retrieval of network configuration; Tracking network configuration history

Definitions

  • This document relates to, but is not limited to, the field of communication technologies, and in particular, to a method and an apparatus for extracting feature information of a terminal device.
  • the characteristic information of the terminal devices in the communication network including: the information of the terminal device and the operating system, on the one hand, the customer information can be better understood, and the network service can be optimized; on the other hand, the collection can be The market share of each type of terminal equipment and operating system is better for product promotion.
  • the technology for extracting information of the terminal device and the operating system in the related art is not perfect, and the information of the terminal device and the operating system cannot be accurately extracted.
  • the embodiment of the invention provides a method and a device for extracting feature information of a terminal device, which can accurately extract terminal device information and better optimize network services.
  • An embodiment of the present invention provides a method for extracting feature information of a terminal device, including:
  • the feature information of the terminal device includes feature information respectively obtained from multiple dimensions
  • the plurality of dimensions include at least a terminal device system, a terminal device manufacturer, and a terminal device model One of them.
  • the request message includes: a hypertext transfer protocol HTTP message.
  • the searching, by the UA signature file, the first feature information corresponding to the first UA string includes:
  • the extracting method further includes:
  • the first feature information having the longest length is selected as the feature information of the first terminal device.
  • An embodiment of the present invention further provides an apparatus for extracting feature information of a terminal device, including:
  • a building module configured to: according to the feature information of the terminal device and the user agent UA string corresponding to each feature information, construct a UA feature library file that identifies a mapping relationship between the feature information and the UA string;
  • the acquiring module is configured to collect the request packet sent by the first terminal device, and obtain the first UA string of the first terminal device from the request packet;
  • a determining module configured to search, according to the first UA string, the first feature information corresponding to the first UA string from the UA feature library file, and to find the first UA string corresponding to the first UA string
  • the first feature information is used as feature information of the first terminal device.
  • the feature information of the terminal device includes feature information respectively obtained from multiple dimensions
  • the plurality of dimensions includes at least one of a terminal device system, a terminal device manufacturer, and a terminal device model.
  • the collection module is configured to:
  • HTTP Hypertext Transfer Protocol
  • the determining module is configured to:
  • the extracting device further includes:
  • the information determining module is configured to select the first feature information having the longest length as the feature information of the first terminal device.
  • the technical solution provided by the embodiment of the present invention includes: constructing a UA feature library file that defines a mapping relationship between the feature information of the terminal device and the UA string; and the UA character to be analyzed according to the first terminal device.
  • the string searches for the matching first feature information from the UA signature file to obtain the feature information of the first terminal device.
  • the method of the embodiment of the present invention obtains the feature information of the terminal device by parsing the UA string, and implements the feature of the terminal device. Accurate extraction of information.
  • FIG. 1 is a flowchart showing a method for extracting feature information of a terminal device according to an embodiment of the present invention
  • FIG. 2 is a schematic diagram showing the structure of a goto table of an AC algorithm
  • FIG. 3 is a schematic structural diagram of another goto table of an AC algorithm according to an embodiment of the present invention.
  • FIG. 4 is a flowchart of constructing an AC feature library file in a method for extracting feature information of a terminal device according to an embodiment of the present invention
  • FIG. 5 is a flowchart of extracting feature information in a method for extracting feature information of a terminal device according to an embodiment of the present invention
  • FIG. 6 is a block diagram showing the structure of an apparatus for extracting feature information of a terminal device according to an embodiment of the present invention.
  • the embodiment of the present invention provides a method and a device for extracting feature information of a terminal device, in which the operator cannot accurately obtain the feature information of the terminal device, and firstly constructs the feature information of the terminal device and the user agent (UA,
  • the UA signature library file of the mapping relationship between the strings of the user agent, and the first feature information of the first terminal device is obtained by searching the first feature information from the UA signature database file according to the UA string to be analyzed of the first terminal device.
  • Feature information; the extraction method obtains the feature information of the terminal device by parsing the UA string, and effectively solves the requirement of the operator to acquire the feature information of the terminal device.
  • an embodiment of the present invention provides a method for extracting feature information of a terminal device, including:
  • Step 11 Construct, according to the feature information of the terminal device and the user agent (UA, User Agent) string corresponding to each feature information, a UA feature library file that has a mapping relationship between the feature information and the UA string;
  • UA User Agent
  • the UA feature library file that constructs the mapping relationship between the feature information and the UA string can be constructed by using the feature library in the related art; the method for constructing the feature library file is a common technical means for those skilled in the art. I will not repeat them here.
  • Step 12 Acquire a request packet sent by the first terminal device, and obtain a first UA string of the first terminal device from the request packet.
  • Step 13 Search for the first feature information corresponding to the first UA string from the UA signature file according to the first UA string, and use the first feature information corresponding to the first UA string as the first terminal device. Characteristic information.
  • User Agent is a part of the HTTP protocol and belongs to one of the components of the header domain. Its information includes hardware platform, system software, application software, and user personal preferences.
  • User-Agent includes: AppStore. /2.0iOS/7.1.2model/iPhone3, 1build/11D257(4;dt:27), through UA, can accurately extract the information of the terminal device.
  • the parameter types included in the UA include: terminal system platform, terminal product model, user application, and so on. There may be only one or more types; for example, AppStore/2.0 is user application information, and iOS/7.1.2 is a terminal system platform. And version information, iPhone3 is user terminal information, and the like.
  • the feature information of the terminal device is information that can identify the terminal device, such as a serial number, a name, and the like of the feature information.
  • each feature information has a corresponding UA string, which is a feature string indicating its feature information.
  • the UA feature library file stores the UA string and the corresponding feature information in a one-to-one correspondence, which facilitates the call recognition of the subsequent steps.
  • the request message (which may be a data domain message) of the collected first terminal device is obtained in step 12, and the first UA string included in the request message is obtained, so that, according to the first UA string,
  • the first feature information is obtained from the UA signature file, and the first feature information is the feature information of the first terminal device.
  • the feature information of the terminal device includes feature information respectively obtained from multiple dimensions
  • the plurality of dimensions includes at least one of a terminal device system, a terminal device manufacturer, and a terminal device model.
  • acquiring the feature information from one dimension is also an optional embodiment of the present invention.
  • the terminal device system generally includes the version number of the terminal system
  • the terminal device manufacturer generally includes the name of the terminal device manufacturer
  • the terminal device model generally includes the model number of the terminal device.
  • step 12 includes:
  • Step 121 Acquire a Hypertext Transfer Protocol (HTTP) packet sent by the first terminal device, and obtain a first UA string of the first terminal device from the HTTP packet.
  • HTTP Hypertext Transfer Protocol
  • the UA is a part of the HTTP protocol, and the embodiment of the present invention only collects the HTTP packet sent by the first terminal device, and the HTTP packet (for example, the request for the HTTP message (REQ) message)
  • the UA string is obtained by the first UA string, and the UA string includes system software, hardware platform, system software, application software, and/or user personal preference of the first terminal device.
  • the embodiment of the present invention obtains feature information according to the UA string in HTTP, which is more accurate than the related technology.
  • step 13 includes:
  • Step 131 Apply a multi-pattern matching (AC) algorithm from the first UA string to the UA feature.
  • the first feature information corresponding to the first UA string is searched in the library file.
  • AC multi-pattern matching
  • the lightweight multi-mode matching engine based on the AC algorithm identifies the UA string and obtains the recognition result of the related device information (ie, the first feature information).
  • the AC algorithm is a classic multi-pattern matching algorithm consisting of three parts, a goto table, a fail table, and an output table; a goto table, a fail table, and an output table are conventional tables included in the AC algorithm itself. It can be guaranteed that for a given text and pattern set P ⁇ p1, p2,...pm ⁇ of length n, within the O(n) time complexity, all target patterns in the text are found, and the scale of the pattern set m has nothing to do.
  • the mode string is the first feature information corresponding to the first UA string in the embodiment of the present invention:
  • the method of the embodiment of the present invention further includes:
  • Step 14 If the first UA string corresponds to two or more first feature information and each of the first feature information belongs to the same dimension, obtain the length of each first feature information;
  • Step 15 Select the first feature information with the longest length as the feature information of the first terminal device.
  • the step 14 and the step 15 are to prevent the same UA string from being corresponding to the plurality of first feature information.
  • the feature information in which the information length is the longest is selected as the feature information of the first terminal.
  • the matching feature information obtained according to the first UA string is iphone3 and iphone3, 1build respectively; then iphone3, 1build is selected as the feature information of the first terminal device.
  • selecting the first feature information with the longest length as the feature information of the first terminal device is considered based on the principle that the longer the length, the higher the accuracy, and only an optional embodiment of the present invention;
  • the shortest first feature information is also applicable to the feature information of the first terminal device in a certain application scenario, that is, it should also belong to the protection scope of the embodiment of the present invention, and other pre-set methods are also applicable.
  • the process of constructing a UA signature file in the embodiment of the present invention is as follows:
  • the file covers the feature information of the terminal device from three dimensions, the system, the terminal vendor, and the terminal device model.
  • the steps are as follows:
  • the feature information of the terminal device such as system, vendor, and brand is separately recorded in the form of a table;
  • the table mainly includes information such as TOKEN_ID (serial number), NAME (recognition result), and PATTERN data (UA string);
  • Step 402 Construct a system table, and obtain and fill data of TOKEN_ID, NAME, and PATTERN of each record in the table;
  • Step 403 Construct a vendor table, and obtain and fill data of TOKEN_ID, NAME, and PATTERN of each record in the table;
  • Step 404 Construct a brand table, and obtain and fill data of TOKEN_ID, NAME, and PATTERN of each record in the table;
  • Step 405 Generate a UA feature library file according to the system table, the vendor table, and the brand table, and the process ends.
  • the process of extracting feature information of a terminal device in the embodiment of the present invention is as follows:
  • the file covers the feature information of the terminal device from three dimensions, the system, the terminal device, and the terminal device model.
  • the device feature library covers the terminal device in the UA string.
  • Step 501 Load the UA feature library file generated by the build.
  • step 502 the HTTP packet is collected, and the first UA string is obtained from the HTTP message (for example, the request message (REQ) of the HTTP packet.
  • the HTTP packet is one of the request packets.
  • Step 503 Search for the first feature information corresponding to the first UA string from the UA feature library file by using an AC algorithm according to the first UA string.
  • step 504 the length of the first feature information corresponding to the first UA string is calculated by traversing; in the method of the embodiment of the present invention, the ID and the length of the feature information having the longest length can be recorded for each of the first feature information.
  • Step 505 The first feature information having the longest length is selected as the feature information of the first terminal device.
  • Step 506 Return the version number of the device system of the terminal device according to the feature information of the first terminal device.
  • Step 507 Return the terminal device model of the terminal device according to the feature information of the first terminal device, For example, brand.
  • Step 508 Return to the terminal device manufacturer of the terminal device according to the feature information of the first terminal device. End the process.
  • the foregoing extraction method may be implemented by a server or an collection and analysis device in an operator.
  • the embodiment of the invention further provides a computer storage medium, wherein the computer storage medium stores computer executable instructions, and the computer executable instructions are used to execute the method for extracting feature information of the terminal device.
  • the embodiment of the present invention further provides an apparatus for extracting feature information of a terminal device, including:
  • the structuring module 61 is configured to construct a UA feature library file that is mapped to the UA string according to the feature information of the terminal device and the user agent (UA) string corresponding to each feature information.
  • the acquiring module 62 is configured to collect the request packet sent by the first terminal device, and obtain the first UA string of the first terminal device from the request packet.
  • the determining module 63 is configured to search for the first feature information corresponding to the first UA string from the UA signature file according to the first UA string, and use the first feature information corresponding to the first UA string as the first Characteristic information of a terminal device.
  • the feature information of the terminal device includes feature information respectively obtained from multiple dimensions
  • the plurality of dimensions includes at least one of a terminal device system, a terminal device manufacturer, and a terminal device model.
  • the collecting module 62 is configured to:
  • HTTP Hypertext Transfer Protocol
  • the determining module 63 is configured to:
  • the first feature information corresponding to the first UA string is searched from the UA feature library file by using a multi-pattern matching (AC) algorithm according to the first UA string.
  • AC multi-pattern matching
  • the extracting device further includes:
  • Obtaining a module configured to acquire a length of the first feature information if the first UA string corresponds to two or more first feature information and each of the first feature information belongs to the same dimension;
  • the information determining module is configured to select the first feature information having the longest length as the feature information of the first terminal device.
  • the device for extracting feature information of the terminal device provided by the foregoing embodiment of the present invention is a device for applying the method for extracting feature information of the terminal device, and all embodiments of the foregoing extraction method are applicable to the extracting device. Both can achieve the same or similar benefits.
  • each module/unit in the foregoing embodiment may be implemented in the form of hardware, for example, by implementing an integrated circuit to implement its corresponding function, or may be implemented in the form of a software function module, for example, being executed by a processor and stored in a memory. Programs/instructions to implement their respective functions.
  • the invention is not limited to any specific form of combination of hardware and software.
  • the above technical solution realizes accurate extraction of feature information of the terminal device.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

L'invention concerne un procédé et un appareil d'extraction d'informations caractéristiques d'un dispositif terminal. Le procédé comprend les étapes consistant à : d'après des informations caractéristiques d'un dispositif terminal et une chaîne d'agents utilisateur (UA) correspondant à chaque élément des informations caractéristiques, créer un fichier bibliothèque de caractéristiques d'UA identifiant une relation de mappage entre les informations caractéristiques et la chaîne d'UA ; collecter un paquet de demande transmis par un premier dispositif terminal, et acquérir une première chaîne d'UA du premier dispositif terminal à partir du paquet de demande ; et, d'après la première chaîne d'UA, rechercher de premières informations caractéristiques correspondant à la première chaîne d'UA dans le fichier bibliothèque de caractéristiques d'UA, et définir les premières informations caractéristiques trouvées, correspondant à la première chaîne d'UA, en tant que les informations caractéristiques du premier dispositif terminal. Le procédé décrit dans un mode de réalisation de la présente invention exécute une extraction précise d'informations caractéristiques à partir du dispositif terminal.
PCT/CN2016/085592 2015-07-02 2016-06-13 Procédé et appareil d'extraction d'informations caractéristiques de dispositif terminal Ceased WO2017000761A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510379950.4 2015-07-02
CN201510379950.4A CN106330520A (zh) 2015-07-02 2015-07-02 一种终端设备的特征信息的提取方法及装置

Publications (1)

Publication Number Publication Date
WO2017000761A1 true WO2017000761A1 (fr) 2017-01-05

Family

ID=57607871

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2016/085592 Ceased WO2017000761A1 (fr) 2015-07-02 2016-06-13 Procédé et appareil d'extraction d'informations caractéristiques de dispositif terminal

Country Status (2)

Country Link
CN (1) CN106330520A (fr)
WO (1) WO2017000761A1 (fr)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109660507A (zh) * 2018-10-16 2019-04-19 深圳壹账通智能科技有限公司 与用户端通信的方法、装置、设备及可读存储介质
CN113507471A (zh) * 2021-07-12 2021-10-15 深圳市共进电子股份有限公司 获取终端系统类型的方法、装置、路由器及存储介质
CN114143385A (zh) * 2021-11-24 2022-03-04 广东电网有限责任公司 一种网络流量数据的识别方法、装置、设备和介质

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2018214094A1 (fr) * 2017-05-25 2018-11-29 Microsoft Technology Licensing, Llc Analyseur
CN107368532A (zh) * 2017-06-14 2017-11-21 上海斐讯数据通信技术有限公司 一种用户代理字段信息处理方法和系统
CN107562835B (zh) * 2017-08-23 2020-03-27 Oppo广东移动通信有限公司 文件查找方法、装置、移动终端及计算机可读存储介质
CN109905293B (zh) * 2019-03-12 2021-06-08 北京奇虎科技有限公司 一种终端设备识别方法、系统及存储介质
CN109905292B (zh) * 2019-03-12 2021-08-10 北京奇虎科技有限公司 一种终端设备识别方法、系统及存储介质
CN109951354B (zh) * 2019-03-12 2021-08-10 北京奇虎科技有限公司 一种终端设备识别方法、系统及存储介质

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110124319A1 (en) * 2009-11-25 2011-05-26 Nokia Corporation Method and apparatus for ensuring transport of user agent information
CN102722585A (zh) * 2012-06-08 2012-10-10 亿赞普(北京)科技有限公司 浏览器类型识别方法、装置及系统
CN102932775A (zh) * 2012-11-16 2013-02-13 广州市通联技术发展有限公司 一种利用imei与ua结合进行终端识别的方法及装置
CN103974232A (zh) * 2013-01-24 2014-08-06 中国电信股份有限公司 WiFi用户终端识别方法及系统

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110124319A1 (en) * 2009-11-25 2011-05-26 Nokia Corporation Method and apparatus for ensuring transport of user agent information
CN102722585A (zh) * 2012-06-08 2012-10-10 亿赞普(北京)科技有限公司 浏览器类型识别方法、装置及系统
CN102932775A (zh) * 2012-11-16 2013-02-13 广州市通联技术发展有限公司 一种利用imei与ua结合进行终端识别的方法及装置
CN103974232A (zh) * 2013-01-24 2014-08-06 中国电信股份有限公司 WiFi用户终端识别方法及系统

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109660507A (zh) * 2018-10-16 2019-04-19 深圳壹账通智能科技有限公司 与用户端通信的方法、装置、设备及可读存储介质
CN109660507B (zh) * 2018-10-16 2022-05-17 深圳壹账通智能科技有限公司 与用户端通信的方法、装置、设备及可读存储介质
CN113507471A (zh) * 2021-07-12 2021-10-15 深圳市共进电子股份有限公司 获取终端系统类型的方法、装置、路由器及存储介质
CN114143385A (zh) * 2021-11-24 2022-03-04 广东电网有限责任公司 一种网络流量数据的识别方法、装置、设备和介质
CN114143385B (zh) * 2021-11-24 2024-01-05 广东电网有限责任公司 一种网络流量数据的识别方法、装置、设备和介质

Also Published As

Publication number Publication date
CN106330520A (zh) 2017-01-11

Similar Documents

Publication Publication Date Title
WO2017000761A1 (fr) Procédé et appareil d'extraction d'informations caractéristiques de dispositif terminal
CN109951435B (zh) 一种设备标识提供方法及装置和风险控制方法及装置
JP6626211B2 (ja) ショートリンクを処理する方法及び装置並びにショートリンクサーバ
CN102880501B (zh) 应用推荐的实现方法、装置和系统
CN107169094B (zh) 信息聚合方法及装置
CN114328632A (zh) 基于位图的用户数据分析方法、装置及计算机设备
CN105589782B (zh) 基于浏览器的用户行为采集方法
WO2016107523A1 (fr) Procédé et appareil d'analyse de chemin d'accès pour site web
US9491223B2 (en) Techniques for determining a mobile application download attribution
CN102968454B (zh) 一种用于获取推广对象搜索结果的方法和设备
US10540325B2 (en) Method and device for identifying junk picture files
CN104199863A (zh) 存储设备上的文件的查找方法、装置及路由器
WO2016184163A1 (fr) Procédé et dispositif de génération de règles dpi
US10666720B2 (en) Maintenance of persistent master identifier for clusters of user identifiers across a plurality of devices
US20160019266A1 (en) Query generating method and query generating device
CN111209325A (zh) 业务系统接口识别方法、装置及存储介质
CN108093026A (zh) 多租户请求的处理方法及装置
CN106202440B (zh) 数据处理方法、装置及设备
CN110851334A (zh) 流量统计方法、电子设备、系统及介质
CN111159131A (zh) 性能优化方法、装置、设备及计算机可读存储介质
US20160277477A1 (en) Information processing apparatus, terminal device, information processing method, and non-transitory computer readable recording medium
CN104660581A (zh) 一种为业务策略确定目标用户的方法、装置及系统
CN103617214B (zh) 一种cim模型路径查询方法及装置
US10445213B2 (en) Non-transitory computer-readable storage medium, evaluation method, and evaluation device
CN106651408B (zh) 一种数据分析方法及装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16817125

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16817125

Country of ref document: EP

Kind code of ref document: A1