CN115357729A - Method and device for constructing securities relation map and electronic equipment - Google Patents
Method and device for constructing securities relation map and electronic equipment Download PDFInfo
- Publication number
- CN115357729A CN115357729A CN202211045889.6A CN202211045889A CN115357729A CN 115357729 A CN115357729 A CN 115357729A CN 202211045889 A CN202211045889 A CN 202211045889A CN 115357729 A CN115357729 A CN 115357729A
- Authority
- CN
- China
- Prior art keywords
- securities
- security
- specified
- data
- graph
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/02—Knowledge representation; Symbolic representation
- G06N5/022—Knowledge engineering; Knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/06—Asset management; Financial planning or analysis
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- Finance (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Development Economics (AREA)
- Accounting & Taxation (AREA)
- Computing Systems (AREA)
- Economics (AREA)
- Mathematical Physics (AREA)
- Evolutionary Computation (AREA)
- Entrepreneurship & Innovation (AREA)
- Artificial Intelligence (AREA)
- Game Theory and Decision Science (AREA)
- Human Resources & Organizations (AREA)
- Operations Research (AREA)
- Software Systems (AREA)
- Marketing (AREA)
- Strategic Management (AREA)
- Technology Law (AREA)
- General Business, Economics & Management (AREA)
- Life Sciences & Earth Sciences (AREA)
- Animal Behavior & Ethology (AREA)
- Databases & Information Systems (AREA)
- Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)
Abstract
Description
技术领域technical field
本发明涉及知识图谱技术领域,特别是涉及一种证券关系图谱的构建方法、装置及电子设备。The present invention relates to the technical field of knowledge graphs, in particular to a method, device and electronic equipment for constructing a securities relationship graph.
背景技术Background technique
目前,通过构造证券关系图谱,可以辅助分析证券与证券之间的价格的影响方式、风险的传导过程和投资中持仓组合的约束条件的设置。At present, by constructing a securities relationship map, it is possible to assist in the analysis of the price influence mode between securities, the risk transmission process, and the setting of constraints on position portfolios in investment.
而相关技术中,基于证券所属行业构建的证券关系图谱无法表征不同行业的股票的关联关系,通过将新闻或舆论同时提及的证券相连所构建的证券关系图谱所涉及的证券数目太少,以及基于证券所在公司上下游供应链关系的方式构建的证券关系图谱,仅能表征处于一条供应链内的公司的证券的关联关系,等等。In related technologies, the securities relationship graph constructed based on the industry to which the securities belong cannot represent the correlation of stocks in different industries, and the number of securities involved in the securities relationship graph constructed by connecting securities mentioned in news or public opinion at the same time is too small, and The securities relationship graph constructed based on the relationship between the upstream and downstream supply chains of the companies where the securities are located can only represent the relationship between the securities of the companies in a supply chain, and so on.
可见,相关技术所构建的证券关系图谱的丰富度和连通性不理想,导致所构建的证券关系图谱不利于为投资者的投资决策和风险防范提供参考。因此,如何提高证券关系图谱的丰富度和连通性,从而更好地构建证券关系图谱是亟待解决的问题。It can be seen that the richness and connectivity of the securities relationship graph constructed by related technologies are not ideal, resulting in the construction of the securities relationship graph is not conducive to providing reference for investors' investment decisions and risk prevention. Therefore, how to improve the richness and connectivity of the securities relationship graph, so as to better construct the securities relationship graph is an urgent problem to be solved.
发明内容Contents of the invention
本发明实施例的目的在于提供一种证券关系图谱的构建方法、装置及电子设备,以实现提高证券关系图谱的丰富度和连通性,从而更好地构建证券关系图谱。具体技术方案如下:The purpose of the embodiments of the present invention is to provide a method, device, and electronic device for constructing a securities relationship graph, so as to improve the richness and connectivity of the securities relationship graph, so as to better construct the securities relationship graph. The specific technical scheme is as follows:
第一方面,本发明实施例提供了一种证券关系图谱的构建方法,所述方法包括:In the first aspect, an embodiment of the present invention provides a method for constructing a securities relationship graph, the method comprising:
针对待构建证券关系图谱的多只证券,获取每一所述证券的证券数据;其中,所述证券数据包括,所述证券的发行方的财务报表、所述证券的技术指标数据和/或所述证券的日内交易数据;Obtaining the securities data of each of the securities for which the securities relationship graph is to be constructed; wherein the securities data includes the financial statement of the issuer of the securities, the technical index data of the securities and/or all Intraday transaction data for the securities mentioned;
针对每一所述证券,从该证券的证券数据中,确定该证券的多个指定类别下的因子数据,并基于该证券的每一指定类别下的因子数据,确定该证券的每一指定类别下的证券向量;其中,每一指定类别为用于进行证券描述的一个类别,每一指定类别下的因子数据为该指定类别下的证券描述数据;for each of said securities, determining factor data for a plurality of specified categories of the security from the security data for the security, and determining each specified category of the security based on the factor data for each specified category of the security The securities vector under ; wherein, each specified category is a category used for securities description, and the factor data under each specified category is the security description data under the specified category;
针对每一指定类别,基于各只证券的该指定类别下的证券向量,确定在该指定类别下的各只证券之间的指定关系数据;其中,所述指定关系数据为用于表征关联关系的数据;For each specified category, based on the securities vectors of each security under the specified category, determine the specified relationship data between the securities under the specified category; wherein, the specified relationship data is used to represent the association relationship data;
以各只证券作为图谱顶点,基于在每一指定类别下各只证券之间的指定关系数据,构建以每一指定类别分别作为一个图谱维度的证券关系图谱。Taking each security as a graph vertex, and based on the specified relationship data between securities under each specified category, construct a security relationship graph with each specified category as a graph dimension.
可选地,所述因子数据包含至少一个因子值;Optionally, said factor data comprises at least one factor value;
所述基于该证券的每一指定类别下的因子数据,确定该证券的每一指定类别下的证券向量,包括:The determining of the security vectors under each specified category of the security based on the factor data under each specified category of the security includes:
针对每一指定类别,以该证券在该指定类别下的因子数据所包含的每一因子值分别作为一个向量维度,生成该证券在该指定类别下的证券向量。For each specified category, each factor value contained in the factor data of the security under the specified category is used as a vector dimension to generate a security vector of the security under the specified category.
可选地,所述针对每一指定类别,基于各只证券的该指定类别下的证券向量,确定在该指定类别下的各只证券之间的指定关系数据,包括:Optionally, for each specified category, based on the securities vectors of each security under the specified category, determining the specified relationship data between the securities under the specified category includes:
针对每一指定类别,计算每两只证券的该指定类别下的证券向量的向量相似度,得到该指定类别下每两只证券之间的指定关系数据。For each specified category, calculate the vector similarity of the securities vectors under the specified category for every two securities, and obtain the specified relationship data between every two securities under the specified category.
可选地,所述以各只证券作为图谱顶点,基于在每一指定类别下各只证券之间的指定关系数据,构建以每一指定类别分别作为一个图谱维度的证券关系图谱,包括:Optionally, using each security as a graph vertex, and based on the specified relationship data between securities under each specified category, constructing a securities relationship graph with each specified category as a graph dimension, including:
针对每一指定类别,基于每两只证券的该指定类别下的证券向量的向量相似度,确定在该指定类别下每一证券对应的关联证券;其中,每一证券对应的关联证券为:与该证券的向量相似度大于预定阈值的证券,或者,按照向量相似度排序靠后排序靠前的、与该证券对应的指定数量个证券;For each specified category, based on the vector similarity of the securities vectors under the specified category for each two securities, determine the related securities corresponding to each security under the specified category; wherein, the related securities corresponding to each security are: Securities whose vector similarity of the security is greater than a predetermined threshold, or a specified number of securities corresponding to the security that are sorted according to the vector similarity;
以各只证券作为图谱顶点,且以每一指定类别对应的连接线作为该指定类别的图谱边,生成以每一指定类别分别作为一个图谱维度的证券关系图谱;Using each security as a graph vertex, and using the connecting line corresponding to each specified category as a graph edge of the specified category, generate a securities relationship graph with each specified category as a graph dimension;
其中,每一指定类别对应的连接线为该指定类别下每一证券的图谱顶点与所对应的关联证券的图谱顶点的连接线。Wherein, the connection line corresponding to each specified category is the connection line between the graph vertex of each security under the specified category and the graph vertex of the corresponding related securities.
可选地,每一证券对应的指定数量的确定方式包括:Optionally, the method for determining the specified quantity corresponding to each security includes:
针对每一指定类别,计算在该指定类别下每一证券对应的关联证券的数量分别为1到M时,该指定类别作为一个图谱维度的证券关系图谱的M个信息熵;其中,M为各只证券的总只数;For each specified category, when the number of related securities corresponding to each security under the specified category is 1 to M, the specified category is used as the M information entropy of the securities relationship graph of a graph dimension; where M is the the total number of securities;
从所得到的各信息熵中选取数值最小的极小值点,作为目标信息熵;Select the minimum value point with the smallest value from the obtained information entropy as the target information entropy;
确定所述目标信息熵所对应的关联证券的数量,作为在该指定类别下每一证券对应的指定数量。Determine the quantity of associated securities corresponding to the target information entropy as the specified quantity corresponding to each security under the specified category.
可选地,所述针对待构建证券关系图谱的多只证券,获取每一所述证券的证券数据,包括:Optionally, the acquisition of securities data of each of the securities for the plurality of securities to be constructed in the securities relationship graph includes:
每当进入预定的构建周期时,针对待构建证券关系图谱的多只证券,获取当前的构建周期内每一所述证券的证券数据;其中,所述构建周期的周期时长基于所述证券数据的更新周期的时长所确定。Whenever a predetermined construction cycle is entered, the securities data of each of the securities in the current construction cycle is obtained for multiple securities of the securities relationship graph to be constructed; wherein, the duration of the construction cycle is based on the securities data Determined by the length of the update cycle.
可选地,所述方法还包括:Optionally, the method also includes:
利用社区挖掘算法,以及所述证券关系图谱中的每一图谱维度下的各只证券的指定关系数据,从不同的图谱维度,将各只证券划分至不同的簇当中。Using the community mining algorithm and the specified relationship data of each security under each graph dimension in the securities relationship graph, the securities are divided into different clusters from different graph dimensions.
第二方面,本发明实施例提供了一种证券关系图谱的构建装置,所述装置包括:In a second aspect, an embodiment of the present invention provides a device for constructing a securities relationship graph, the device comprising:
获取模块,用于针对待构建证券关系图谱的多只证券,获取每一所述证券的证券数据;其中,所述证券数据包括,所述证券的发行方的财务报表、所述证券的技术指标数据以及所述证券的日内交易数据;The obtaining module is used to obtain the securities data of each of the securities for the plurality of securities to be constructed in the securities relationship graph; wherein, the securities data includes the financial statements of the issuer of the securities and the technical indicators of the securities data and intraday transaction data for said securities;
向量确定模块,用于针对每一所述证券,从该证券的证券数据中,确定该证券的多个指定类别下的因子数据,并基于该证券的每一指定类别下的因子数据,确定该证券的每一指定类别下的证券向量;其中,每一指定类别为用于进行证券描述的一个类别,每一指定类别下的因子数据为该指定类别下的证券描述数据;The vector determination module is configured to, for each of the securities, determine the factor data under multiple specified categories of the security from the security data of the security, and determine the factor data under each specified category of the security based on the factor data of the security. Securities vectors under each specified category of securities; wherein, each specified category is a category used for securities description, and the factor data under each specified category is the securities description data under the specified category;
关系确定模块,用于针对每一指定类别,基于各只证券的该指定类别下的证券向量,确定在该指定类别下的各只证券之间的指定关系数据;其中,所述指定关系数据为用于表征关联关系的数据;The relationship determination module is used to determine, for each specified category, specified relationship data between securities of the specified category based on the securities vectors of each security under the specified category; wherein, the specified relationship data is Data used to characterize the relationship;
构建模块,用于以各只证券作为图谱顶点,基于在每一指定类别下各只证券之间的指定关系数据,构建以每一指定类别分别作为一个图谱维度的证券关系图谱。The construction module is used to use each security as a graph vertex, and based on the specified relationship data between securities under each specified category, construct a securities relationship graph with each specified category as a graph dimension.
第三方面,本发明实施例提供了一种电子设备,包括处理器、通信接口、存储器和通信总线,其中,处理器,通信接口,存储器通过通信总线完成相互间的通信;In a third aspect, an embodiment of the present invention provides an electronic device, including a processor, a communication interface, a memory, and a communication bus, wherein the processor, the communication interface, and the memory complete communication with each other through the communication bus;
存储器,用于存放计算机程序;memory for storing computer programs;
处理器,用于执行存储器上所存放的程序时,实现上述证券关系图谱的构建方法的步骤。The processor is used to implement the steps of the above method for constructing the securities relationship graph when executing the program stored in the memory.
第四方面,本发明实施例提供了一种计算机可读存储介质,所述计算机可读存储介质内存储有计算机程序,所述计算机程序被处理器执行时实现上述证券关系图谱的构建方法的步骤。In a fourth aspect, an embodiment of the present invention provides a computer-readable storage medium, wherein a computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, the steps of the above-mentioned method for constructing a securities relationship graph are implemented .
本发明实施例有益效果:Beneficial effects of the embodiments of the present invention:
本发明实施例提供的证券关系图谱的构建方法,针对待构建证券关系图谱的多只证券,获取每一证券的证券数据;针对每一所述证券,从该证券的证券数据中,确定该证券的多个指定类别下的因子数据,并基于该证券的每一指定类别下的因子数据,确定该证券的每一指定类别下的证券向量;其中,每一指定类别为用于进行证券描述的一个类别,每一指定类别下的因子数据为该指定类别下的证券描述数据;针对每一指定类别,基于各只证券的该指定类别下的证券向量,确定在该指定类别下的各只证券之间的指定关系数据;其中,指定关系数据为用于表征关联关系的数据;以各只证券作为图谱顶点,基于在每一指定类别下各只证券之间的指定关系数据,构建以每一指定类别分别作为一个图谱维度的证券关系图谱。本方案中,针对多只证券,根据所获取的证券数据确定每一只证券多个指定类别下的因子数据,从多个维度构建证券关系图谱,提高了证券关系图谱的丰富度,同时利用因子数据将各只证券进行向量化处理,得到证券向量,并基于所得到的证券向量确定每一类别下各只证券之间的指定关系数据,从而利用所确定的指定关系数据构建证券关系图谱,提高了各只证券之间的连通性。因此,通过本方案可以更好地构建证券关系图谱。In the method for constructing a securities relationship graph provided in an embodiment of the present invention, the securities data of each security is obtained for multiple securities to be constructed in the securities relationship graph; The factor data under multiple specified categories of the security, and based on the factor data under each specified category of the security, determine the security vector under each specified category of the security; wherein, each specified category is used for securities description A category, the factor data under each specified category is the securities description data under the specified category; for each specified category, based on the securities vectors under the specified category of each security, determine each security under the specified category Among them, the specified relationship data is the data used to represent the relationship; each security is used as the graph vertex, and based on the specified relationship data between each security under each specified category, each A securities relationship graph that specifies categories as a graph dimension. In this scheme, for multiple securities, the factor data under multiple specified categories of each security is determined according to the acquired securities data, and the securities relationship map is constructed from multiple dimensions, which improves the richness of the securities relationship map. At the same time, the factor data is used to The data is vectorized for each security to obtain the security vector, and based on the obtained security vector, the specified relationship data between the securities in each category is determined, so that the specified relationship data can be used to construct the security relationship map, and improve Connectivity between securities. Therefore, through this scheme, the securities relationship graph can be better constructed.
当然,实施本发明的任一产品或方法并不一定需要同时达到以上所述的所有优点。Of course, implementing any product or method of the present invention does not necessarily need to achieve all the above-mentioned advantages at the same time.
附图说明Description of drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,还可以根据这些附图获得其他的实施例。In order to more clearly illustrate the technical solutions in the embodiments of the present invention or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are only These are some embodiments of the present invention, and those skilled in the art can also obtain other embodiments according to these drawings.
图1为本发明实施例所提供的一种证券关系图谱的构建方法的流程图;Fig. 1 is a flowchart of a method for constructing a securities relationship graph provided by an embodiment of the present invention;
图2为本发明实施例所提供的一种证券关系图谱的构建方法的另一流程图;FIG. 2 is another flowchart of a method for constructing a securities relationship graph provided by an embodiment of the present invention;
图3为本发明实施例所提供的一种证券关系图谱的构建方法的另一流程图;FIG. 3 is another flowchart of a method for constructing a securities relationship graph provided by an embodiment of the present invention;
图4为实现本发明实施例所提供的证券关系图谱方法的程序模块的结构示意图;FIG. 4 is a schematic structural diagram of a program module for realizing the securities relationship graph method provided by the embodiment of the present invention;
图5为本发明实施例所提供的一种证券关系图谱的构建装置的结构示意图;FIG. 5 is a schematic structural diagram of a device for constructing a securities relationship graph provided by an embodiment of the present invention;
图6为本发明实施例所提供的电子设备的结构示意图。FIG. 6 is a schematic structural diagram of an electronic device provided by an embodiment of the present invention.
具体实施方式Detailed ways
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员基于本申请所获得的所有其他实施例,都属于本发明保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art based on the present application belong to the protection scope of the present invention.
图谱可以从本质上揭示顶点与顶点之间的关联关系,例如蛋白质交互关系图谱,化学分子式图谱等,但目前能反映不同证券,例如股票、基金、债券等,之间关联关系的图谱鲜有存在,若能构建反应不同证券之间关联关系,并通过可视化技术进行展现,可以用于辅助投资者投资决策和风险防范。Graphs can essentially reveal the relationship between vertices, such as protein interaction graphs, chemical molecular formula graphs, etc., but currently there are few graphs that can reflect the relationship between different securities, such as stocks, funds, bonds, etc. , if it can be constructed to reflect the relationship between different securities and displayed through visualization technology, it can be used to assist investors in investment decision-making and risk prevention.
证券关系图谱目前主要有三种构建方式,第一,基于证券所属行业构建的证券关系图谱,该方法将同一个行业内的证券相连,不同行业的证券没有边相连;第二,基于新闻或舆论数据构建的证券关系图谱,该方法将新闻或舆论中同时提到的证券连边;第三,基于证券所在公司上下游供应链关系构建的证券关系图谱,该方法将有供应链关系的证券连边。但是上述三种方式均存在明显的缺陷。There are currently three main construction methods for securities relationship graphs. First, based on the securities relationship graph constructed by the industry to which the securities belong, this method connects securities in the same industry, and securities in different industries are not connected by edges; second, based on news or public opinion data Constructed securities relationship graph, this method connects securities mentioned in news or public opinion at the same time; third, based on the securities relationship graph constructed based on the upstream and downstream supply chain relationship of the company where the securities are located, this method connects securities with supply chain relationships . However, there are obvious defects in the above three methods.
第一,根据证券所属行业构建的证券关系图谱无法反映不同行业间股票的关联关系,而且证券所处行业更新频率极低,一旦确定行业后,除非公司主营业务变化,否则基本不会变更行业,因此所构建的证券关系图谱相对固定,无法动态反映不同证券间的关系;第二,基于新闻或舆论数据构建的证券关系图谱,其中的证券只出现在新闻或舆论中,涉及的证券均为热点证券,数目较少,此外,还需要爬虫技术爬取不同来源的新闻数据,存在潜在的法律合规问题,而且需要自然语言理解技术提取证券名称或者证券代码,技术门槛高;第三,基于证券所在公司上下游供应链关系构建的证券关系图谱,由于获取每家上市公司的供应链关系比较困难,且每条供应链关系为一个树形图,最优情况下,所构建的证券关系图谱也仅是一个树形图,边的数目和顶点的数目为同一数量级,并且供应链中的公司并不都是上市公司,并不都发行证券,不同供应链下的证券都也不能形成一个树形图,例如,医疗业证券和畜牧业证券,因此导致所构建的证券关系图谱连通度低,此外公司的供应链相对稳定,变化频率低,因此基于证券所在公司上下游供应链关系构建的证券关系图谱无法动态地反应证券间的关联关系。First, the securities relationship graph constructed according to the industry to which the securities belong cannot reflect the relationship between stocks in different industries, and the update frequency of the industry in which the securities are located is extremely low. Once the industry is determined, unless the company's main business changes, the industry will basically not change , so the constructed securities relationship graph is relatively fixed and cannot dynamically reflect the relationship between different securities; secondly, the securities relationship graph constructed based on news or public opinion data, the securities in it only appear in the news or public opinion, and the securities involved are all The number of hot securities is small. In addition, crawler technology is required to crawl news data from different sources, which has potential legal compliance issues, and natural language understanding technology is required to extract securities names or securities codes, which has a high technical threshold; third, based on The securities relationship map constructed by the upstream and downstream supply chain relationship of the company where the securities are located. Since it is difficult to obtain the supply chain relationship of each listed company, and each supply chain relationship is a tree diagram, in the optimal case, the constructed securities relationship map It is only a tree diagram, the number of edges and the number of vertices are of the same order of magnitude, and not all companies in the supply chain are listed companies, not all issue securities, and securities under different supply chains cannot form a tree For example, medical industry securities and animal husbandry securities, so the connectivity of the constructed securities relationship graph is low. In addition, the supply chain of the company is relatively stable and the frequency of change is low. Therefore, the securities constructed based on the upstream and downstream supply chain relationship of the company where the securities are The relationship graph cannot dynamically reflect the relationship between securities.
可见,相关技术所构建的证券关系图谱连通度均较低,不利于对证券的社区结构进行分析,且相关技术所构建的证券关系图谱的维度为1,相关技术无法构建含义更丰富的多维度的证券关系图谱。因此,利用相关技术所构建的证券关系图谱不利于为投资者的投资决策和风险防范提供参考。It can be seen that the connectivity of the securities relationship graph constructed by related technologies is low, which is not conducive to the analysis of the community structure of securities, and the dimension of the securities relationship graph constructed by related technologies is 1, and related technologies cannot construct multi-dimensional more meaningful Graph of securities relationships. Therefore, the securities relationship graph constructed by using related technologies is not conducive to providing reference for investors' investment decisions and risk prevention.
为了解决上述问题,提高证券关系图谱的丰富度和连通性,从而更好地构建证券关系图谱,本发明实施例提供了一种证券关系图谱的构建方法、装置及电子设备,该方法可以应用于电子设备中,该电子设备可以为计算机、服务器等。该方法包括:In order to solve the above problems, improve the richness and connectivity of the securities relationship graph, so as to better construct the securities relationship graph, the embodiment of the present invention provides a construction method, device and electronic equipment of the securities relationship graph, which can be applied to In an electronic device, the electronic device may be a computer, a server, or the like. The method includes:
针对待构建证券关系图谱的多只证券,获取每一证券的证券数据;其中,证券数据包括,证券的发行方的财务报表、证券的技术指标数据以及证券的日内交易数据;Obtain the securities data of each of the securities for which the securities relationship graph is to be constructed; wherein, the securities data includes the financial statements of the issuer of the securities, the technical index data of the securities, and the intraday transaction data of the securities;
针对每一证券,从该证券的证券数据中,确定该证券的多个指定类别下的因子数据,并基于该证券的每一指定类别下的因子数据,确定该证券的每一指定类别下的证券向量;其中,每一指定类别为用于进行证券描述的一个类别,每一指定类别下的因子数据为该指定类别下的证券描述数据;For each security, from the security data of the security, determine the factor data under a plurality of specified categories of the security, and determine the factor data under each specified category of the security based on the factor data under each specified category of the security Securities vector; wherein, each specified category is a category used for securities description, and the factor data under each specified category is the security description data under the specified category;
针对每一指定类别,基于各只证券的该指定类别下的证券向量,确定在该指定类别下的各只证券之间的指定关系数据;其中,指定关系数据为用于表征关联关系的数据;For each specified category, based on the securities vectors of each security under the specified category, determine the specified relationship data between the securities under the specified category; wherein the specified relationship data is data used to represent the relationship;
以各只证券作为图谱顶点,基于在每一指定类别下各只证券之间的指定关系数据,构建以每一指定类别分别作为一个图谱维度的证券关系图谱。Taking each security as a graph vertex, and based on the specified relationship data between securities under each specified category, construct a security relationship graph with each specified category as a graph dimension.
本方案中,针对多只证券,根据所获取的证券数据确定每一只证券多个指定类别下的因子数据,从多个维度构建证券关系图谱,提高了证券关系图谱的丰富度,同时利用因子数据将各只证券进行向量化处理,得到证券向量,并基于所得到的证券向量确定每一类别下各只证券之间的指定关系数据,从而利用所确定的指定关系数据构建证券关系图谱,提高了各只证券之间的连通性。因此,通过本方案可以更好地构建证券关系图谱。In this scheme, for multiple securities, the factor data under multiple specified categories of each security is determined according to the acquired securities data, and the securities relationship map is constructed from multiple dimensions, which improves the richness of the securities relationship map. At the same time, the factor data is used to The data is vectorized for each security to obtain the security vector, and based on the obtained security vector, the specified relationship data between the securities in each category is determined, so that the specified relationship data can be used to construct the security relationship map, and improve Connectivity between securities. Therefore, through this scheme, the securities relationship graph can be better constructed.
下面结合附图,对本发明实施例所提供的一种证券关系图谱的构建方法进行介绍,如图1所示,该方法可以包括以下步骤:A method for constructing a securities relationship map provided by an embodiment of the present invention is introduced below in conjunction with the accompanying drawings. As shown in FIG. 1, the method may include the following steps:
S101,针对待构建证券关系图谱的多只证券,获取每一证券的证券数据;其中,证券数据包括,证券的发行方的财务报表、证券的技术指标数据和/或证券的日内交易数据;S101. Obtain the securities data of each of the securities for which the securities relationship graph is to be constructed; wherein, the securities data includes the financial statement of the issuer of the securities, the technical index data of the securities and/or the intraday transaction data of the securities;
其中,证券可以为股票、基金、债券等。待构建证券关系图谱的多只证券可以为市场上任意的多只证券。证券的发行方一般为上市公司,上市公司的财务报表是公开发布的,技术指标数据是指一切通过数学公式计算得出的证券价格的数据集合,日内交易数据是表征日内交易的数据,日内交易指持仓时间短,不留过夜持仓的交易方式,技术指标数据和日内交易数据均是互联网中公开的数据;因此,证券的发行方的财务报表、证券的技术指标数据以及证券的日内交易数据都可以在互联网中获取,也就是说,可以针对待构建证券关系图谱的多只证券,从互联网中直接获取每一证券的证券数据,所获取的证券数据可以为其中的至少一种。Wherein, the securities may be stocks, funds, bonds, and the like. The multiple securities to be constructed in the securities relationship graph can be any multiple securities in the market. The issuer of the securities is generally a listed company. The financial statements of the listed company are publicly released. The technical index data refers to the data collection of all securities prices calculated by mathematical formulas. The intraday transaction data is the data that represents the intraday transaction. Refers to the trading method of holding positions for a short time without leaving overnight positions. The technical index data and intraday transaction data are all public data on the Internet; therefore, the financial statements of the securities issuer, the technical index data of securities, and the intraday transaction data of securities are all It can be obtained from the Internet, that is to say, the securities data of each security can be directly obtained from the Internet for multiple securities whose securities relationship graph is to be constructed, and the obtained securities data can be at least one of them.
S102,针对每一证券,从该证券的证券数据中,确定该证券的多个指定类别下的因子数据,并基于该证券的每一指定类别下的因子数据,确定该证券的每一指定类别下的证券向量;其中,每一指定类别为用于进行证券描述的一个类别,每一指定类别下的因子数据为该指定类别下的证券描述数据;S102, for each security, from the security data of the security, determine the factor data under multiple specified categories of the security, and determine each specified category of the security based on the factor data under each specified category of the security The securities vector under ; wherein, each specified category is a category used for securities description, and the factor data under each specified category is the security description data under the specified category;
上述指定类别可以是根据经验及需求预先设定的类别,同时设定针对每一指定类别下所要获取的因子数据,因子数据可以是证券数据中指定的数据,或者对证券数据中指定的数据进行特定计算后得到的数据。当需要建立证券关系图谱时,可以针对每一特定类别,从证券数据获取指定的数据作为该指定类别下的因子数据,或者,先从证券数据获取指定的数据,并将对该指定的数据进行特定计算后得到数据作为该定类别下的因子数据。The above specified categories can be preset categories based on experience and needs, and at the same time set the factor data to be obtained under each specified category. The factor data can be the data specified in the securities data, or the data specified in the securities data. Data obtained after certain calculations. When it is necessary to establish a securities relationship graph, for each specific category, the specified data can be obtained from the securities data as the factor data under the specified category, or the specified data can be obtained from the securities data first, and the specified data will be processed The data obtained after specific calculations are used as factor data under the specified category.
以证券数据为证券的技术指标数据为例,可以将指定类别设定为以下类别:Taking the security data as the technical indicator data of the security as an example, the specified category can be set to the following categories:
动量类、反转类、技术复合类,动量类下的因子数据可以包括:长期动量、6月动量、威廉指标等;反转类下的因子数据可以包括:1个月股价反转、3个月股价反转、6个月股价反转等;技术复合类的因子数据可以包括:价量背离、价格线性回归系数、成交量方差、最高价长度、最高点距离、股价相对强度等。Momentum category, reversal category, technology composite category, factor data under momentum category can include: long-term momentum, June momentum, Williams index, etc.; factor data under reversal category can include: 1-month stock price reversal, 3 Monthly stock price reversal, 6-month stock price reversal, etc.; technical composite factor data can include: price-volume divergence, price linear regression coefficient, trading volume variance, length of highest price, distance from highest point, relative strength of stock price, etc.
针对每一证券,可以对该证券的每一指定类别下的因子数据进行向量化处理,得到该证券的每一指定类别下的证券向量。For each security, the factor data under each specified category of the security may be vectorized to obtain a security vector under each specified category of the security.
在一种实现方式中,上述因子数据包含至少一个因子值;In one implementation, the aforementioned factor data includes at least one factor value;
例如,当证券数据为证券的发行方的财务报表时,则可以将指定类别设定为以下类别:For example, when the securities data is the financial statement of the issuer of the securities, the specified category can be set to the following categories:
a.规模类,该类别下的因子数据包含的因子值可以为:总资产、营运资本、财务费用、股权自由现金流、折旧与摊销、营业总收入、净营运资本和营业成本等中的至少一个;a. Scale category, the factor values included in the factor data under this category can be: total assets, working capital, financial expenses, free cash flow of equity, depreciation and amortization, total operating income, net working capital and operating costs, etc. at least one;
b.盈利类,该类别下的因子数据包含的因子值可以为:权益税前回报率、净资产收益率、总资产收益率、净利率、毛利率、总利润率、营业利润率和营业利润占比等中的至少一个;b. Profit category, the factor values included in the factor data under this category can be: return on equity before tax, return on net assets, return on total assets, net profit rate, gross profit rate, total profit rate, operating profit rate and operating profit at least one of proportion etc.;
c.资本结构类,该类别下的因子数据包含的因子值可以为:资产负债率、固定资产比例、权益负债比率、流动资产比率、总负债比、股东权益比、运营资本占比和留存收益占比等中的至少一个;c. Capital structure category, the factor data included in this category can include: asset-liability ratio, fixed asset ratio, equity-liability ratio, current asset ratio, total debt ratio, shareholder equity ratio, operating capital ratio, and retained earnings at least one of proportion etc.;
d.估值类,该类别下的因子数据包含的因子值可以为:市盈率、市净率、市现率、市销率、账面市值比、每股净资产、扣非市盈率和销售率等中的至少一个;d. Valuation category, the factor values included in the factor data under this category can be: price-earnings ratio, price-to-book ratio, price-to-current ratio, price-to-sales ratio, book-to-market value ratio, net assets per share, non-price-earnings ratio and sales ratio, etc. at least one of
e.质量类,该类别下的因子数据包含的因子值可以为:总资产现金回收率、应收应付比、净利润波动、利息保障倍数、销售期间费用率、留存盈余比率、净利润波动率和销售现金比等中的至少一个;e. Quality category. The factor values included in the factor data under this category can be: total asset cash recovery rate, receivable-payable ratio, net profit fluctuation, interest coverage ratio, sales period expense ratio, retained earnings ratio, and net profit volatility and at least one of the cash-to-sales ratio;
f.成长类,该类别下的因子数据包含的因子值可以为:净资产收益率季度同比增速、净利润季度同比增速、营业利润增长率、主营收入增长率、总资产增长率、总利润增长率、股东权益增长率和每股净资产增长率等中的至少一个;f. Growth category, the factor values included in the factor data under this category can be: quarterly year-on-year growth rate of return on net assets, quarterly year-on-year growth rate of net profit, operating profit growth rate, main operating income growth rate, total asset growth rate, At least one of the growth rate of total profit, growth rate of shareholders' equity and growth rate of net assets per share;
g.经营类,该类别下的因子数据包含的因子值可以为:应收账款周转率、固定资产周转率、流动资产周转率、存货周转率、应付账款周转率、股东权益周转率和总资产周转率等中的至少一个;g. Operation category, the factor values included in the factor data under this category can be: accounts receivable turnover ratio, fixed asset turnover ratio, current asset turnover ratio, inventory turnover ratio, accounts payable turnover ratio, shareholder equity turnover ratio and At least one of the total asset turnover ratio, etc.;
h.股东相关类,该类别下的因子数据包含的因子值可以为:高管持股占比、员工持股计划占比、机构持股变动、股东数量变化率、流通股比例、机构持股占总股本比例、第一大股东持股比例和管理层持股数量等中的至少一个;h. Shareholder-related category, the factor data included in this category can include factor values: proportion of executives’ shareholding, proportion of employee stock ownership plan, change in institutional shareholding, change rate of number of shareholders, proportion of tradable shares, institutional shareholding At least one of the proportion of the total share capital, the proportion of the largest shareholder and the number of shares held by the management;
i.偿债能力类,该类别下的因子数据包含的因子值可以为:流动比率、速动比率、现金比率、现金流量比率、长期资本负债率、现金流量与负债比率、超速动比率和长期债务与营业收入比等中的至少一个。i. Debt solvency category, the factor data included in this category can include: current ratio, quick ratio, cash ratio, cash flow ratio, long-term debt ratio, cash flow and debt ratio, super quick ratio and long-term At least one of debt to operating income ratio etc.
当证券数据为证券的技术指标数据时,则可以将指定类别设定为以下类别:When the security data is the technical indicator data of the security, the specified category can be set to the following categories:
a.动量类,该类别下的因子数据包含的因子值可以为:长期动量、6月动量、威廉指标、季节性、随机指标、顺势指标(CCI,Commodity Channel Index)、动量线(MOM,Momentum)和移动平均线(MACD,Moving Average Convergence/Divergence)中的至少一个;a. Momentum category, the factor values contained in the factor data under this category can be: long-term momentum, June momentum, William index, seasonality, stochastic index, homeopathic index (CCI, Commodity Channel Index), momentum line (MOM, Momentum ) and at least one of moving average (MACD, Moving Average Convergence/Divergence);
b.反转类,该类别下的因子数据包含的因子值可以为:1个月股价反转、3个月股价反转、6个月股价反转、30日涨跌幅、90日涨跌幅和180天涨跌幅中的至少一个;b. Inversion category, the factor data contained in this category can contain factor values: 1-month stock price reversal, 3-month stock price reversal, 6-month stock price reversal, 30-day rise and fall, 90-day rise and fall At least one of the price increase and the 180-day increase and decrease;
c.技术复合类,该类别下的因子数据包含的因子值可以为:价量背离、容量比(一种衡量买卖力量的技术指标)、价格线性回归系数、成交量方差、最高价长度、最高点距离和股价相对强度中的至少一个。c. Technical composite category, the factor data contained in this category can include factor values: price-volume divergence, capacity ratio (a technical indicator to measure buying and selling power), price linear regression coefficient, volume variance, maximum price length, maximum At least one of point distance and stock price relative strength.
当证券数据为证券的日内交易数据时,则可以将指定类别设定为高频类,该类别下的因子数据包含的因子值可以为:高频波动、高频特质波动、高频特异度、高频系统波动、高频收益方差、高频收益偏度和高频收益峰值。When the securities data is the intraday trading data of securities, the specified category can be set as high-frequency category, and the factor values contained in the factor data under this category can be: high-frequency fluctuation, high-frequency idiosyncratic fluctuation, high-frequency specificity, High Frequency Systematic Volatility, High Frequency Return Variance, High Frequency Return Skewness, and High Frequency Return Peak.
以上所提及的因子值均可以从证券数据中获取,或者根据证券数据中获取的数据进行特定运算得到,该特定运算可以为现有技术中存在的运算,不属于本发明的发明点,在此不做赘述。上述各指定类别以及因子数据仅是示例性介绍,本发明实施例不对指定类别以及因子数据进行限定。The factor values mentioned above can be obtained from the securities data, or obtained by performing specific operations on the data obtained from the securities data. The specific operations can be operations in the prior art, which do not belong to the invention point of the present invention. I won't go into details here. The above specified categories and factor data are only exemplary introductions, and the embodiment of the present invention does not limit the specified categories and factor data.
上述基于该证券的每一指定类别下的因子数据,确定该证券的每一指定类别下的证券向量,可以包括:The aforementioned determination of the security vectors under each specified category of the security based on the factor data under each specified category of the security may include:
针对每一指定类别,以该证券在该指定类别下的因子数据所包含的每一因子值分别作为一个向量维度,生成该证券在该指定类别下的证券向量。For each specified category, each factor value contained in the factor data of the security under the specified category is used as a vector dimension to generate a security vector of the security under the specified category.
以指定类别为规模类为例,假设根据一证券的发行方的财务报表得到,规模类下的因子数据所包含的因子值分别为:总资产=A、运营资本=B、财务费用=C……,则可以生成该证券在规模类下的证券向量为【A,B,C……】。或者,还可以将该指定类别下的因子数据所包含的因子值进行归一化处理后,再根据归一化后的各因子值生成证券的证券向量,也是可以的。Taking the specified category as the scale category as an example, assuming that according to the financial statements of the issuer of a security, the factor values contained in the factor data under the scale category are: total assets = A, operating capital = B, financial expenses = C... ..., then the security vector of the security under the scale category can be generated as [A, B, C...]. Alternatively, it is also possible to normalize the factor values contained in the factor data under the specified category, and then generate the stock vector of the stock according to the normalized value of each factor.
该实现方式中,针对每一指定类别,以该证券在该指定类别下的因子数据所包含的每一因子值分别作为一个向量维度,生成该证券在该指定类别下的证券向量,通过证券向量更容易反映出各只证券之间的关联关系。In this implementation, for each specified category, each factor value contained in the factor data of the security under the specified category is used as a vector dimension to generate the security vector of the security under the specified category. It is easier to reflect the correlation between various securities.
S103,针对每一指定类别,基于各只证券的该指定类别下的证券向量,确定在该指定类别下的各只证券之间的指定关系数据;其中,指定关系数据为用于表征关联关系的数据;S103, for each specified category, based on the securities vectors of each security under the specified category, determine the specified relationship data between the securities under the specified category; wherein the specified relationship data is used to represent the relationship data;
上述指定关系数据可以为每两只证券的证券向量的向量相似度,此时,上述针对每一指定类别,基于各只证券的该指定类别下的证券向量,确定在该指定类别下的各只证券之间的指定关系数据,可以包括:The above-mentioned specified relationship data may be the vector similarity of the securities vectors of each two securities. Specified relationship data between securities, which may include:
针对每一指定类别,计算每两只证券的该指定类别下的证券向量的向量相似度,得到该指定类别下每两只证券之间的指定关系数据。For each specified category, calculate the vector similarity of the securities vectors under the specified category for every two securities, and obtain the specified relationship data between every two securities under the specified category.
其中,计算向量相似度的方式可以为计算归一化余弦相似度或者皮尔逊相关系数等方式。Wherein, the manner of calculating the vector similarity may be calculating the normalized cosine similarity or the Pearson correlation coefficient.
示例性的,当指定类别为规模类,在该指定类别下,证券1的证券向量为【A1,B1,C1……】,证券2的证券向量为【A2,B2,C2……】,则可以计算证券1和证券2的证券向量的向量相似度,作为证券1和证券2之间的指定关系数据。Exemplarily, when the specified category is the scale category, under the specified category, the security vector of security 1 is [A1, B1, C1...], and the security vector of security 2 is [A2, B2, C2...], then The vector similarity of the security vectors of the security 1 and the security 2 can be calculated as specified relationship data between the security 1 and the security 2.
本实现方式中,针对每一指定类别,通过计算每两只证券的该指定类别下的证券向量的向量相似度,得到该指定类别下每两只证券之间的指定关系数据,进一步地,可以根据指定关系数据构建证券关系图谱。In this implementation, for each specified category, by calculating the vector similarity of the securities vectors of each two securities under the specified category, the specified relationship data between each two securities under the specified category can be obtained. Further, it can be Build a securities relationship graph based on specified relationship data.
S104,以各只证券作为图谱顶点,基于在每一指定类别下各只证券之间的指定关系数据,构建以每一指定类别分别作为一个图谱维度的证券关系图谱。S104, taking each security as a map vertex, and constructing a securities relationship graph with each specified category as a graph dimension, based on specified relationship data between securities under each specified category.
可以理解的,图谱是由图谱顶点,以及两两相连的图谱顶点所组成的图谱边构成,本实施例中,可以以各只证券作为图谱顶点,并针对每两只证券,根据该两只证券之间的指定关系数据,确定该两只证券的图谱顶点是否相连。It can be understood that the graph is composed of graph vertices and graph edges composed of two connected graph vertices. In this embodiment, each security can be used as the graph vertex, and for each two securities, according to the The specified relationship data between, determine whether the graph vertices of the two securities are connected.
在一种实现方式中,上述以各只证券作为图谱顶点,基于在每一指定类别下各只证券之间的指定关系数据,构建以每一指定类别分别作为一个图谱维度的证券关系图谱,可以包括步骤A1-A2:In one implementation, the above-mentioned securities are used as graph vertices, and based on the specified relationship data between securities under each specified category, a securities relationship graph with each specified category as a graph dimension is constructed, which can be Include steps A1-A2:
步骤A1,针对每一指定类别,基于每两只证券的该指定类别下的证券向量的向量相似度,确定在该指定类别下每一证券对应的关联证券;其中,每一证券对应的关联证券为:与该证券的向量相似度大于预定阈值的证券,或者,按照向量相似度排序靠后排序靠前的、与该证券对应的指定数量个证券;Step A1, for each specified category, based on the vector similarity of the securities vectors under the specified category of every two securities, determine the related securities corresponding to each security under the specified category; wherein, the related securities corresponding to each security It is: the securities whose vector similarity with the security is greater than a predetermined threshold, or a specified number of securities corresponding to the security that are sorted according to the vector similarity;
本实现方式中,可以先设定预定阈值,该预定阈值可以根据经验和需求设定,再针对每一指定类别下的每一证券,确定与该证券的证券向量的向量相似度大于预定阈值的证券,作为关联证券;In this implementation, a predetermined threshold can be set first, and the predetermined threshold can be set according to experience and needs, and then for each security under each specified category, it is determined that the vector similarity with the security vector of the security is greater than the predetermined threshold Securities, as related securities;
或者,还可以预先设定指定数量,该指定数量也可以根据经验和需求设定,再针对每一指定类别下的每一证券,确定与该证券的证券向量的向量相似度最大的、或最小的指定数量个证券,作为关联证券。在该实现方式中,确定与该证券的证券向量的向量相似度最大的指定数量个证券,作为关联证券可以相对来说可以更好地反映各只证券之间的关联关系。Alternatively, the designated quantity can also be preset, and the designated quantity can also be set according to experience and needs, and then, for each security under each specified category, determine the vector similarity with the security vector of the security with the largest or minimum vector similarity. A specified number of securities, as affiliated securities. In this implementation, the specified number of securities with the highest vector similarity with the security vector of the security is determined, as related securities, which can relatively better reflect the relationship between the various securities.
在具体实现过程中,还可以针对每一指定类别,先生成该指定类别作为一个图谱维度的原始证券关系图谱,原始证券关系图谱中的各个顶点是全连接的,即每只证券都两两相连,同时将每两只证券的证券向量之间的向量相似度作为该两只证券的图谱边的权重,之后再针对每一图谱顶点,保留权重最大、或最小的指定数量个图谱边,得到该指定类别作为一个图谱维度的证券关系图谱。In the specific implementation process, for each specified category, it is also possible to first generate the specified category as an original securities relationship graph of a graph dimension, and each vertex in the original securities relationship graph is fully connected, that is, each security is connected in pairs , at the same time, the vector similarity between the securities vectors of each two securities is used as the weight of the graph edges of the two securities, and then for each graph vertex, the specified number of graph edges with the largest or smallest weight is reserved to obtain the A securities relationship graph that specifies categories as a graph dimension.
步骤A2,以各只证券作为图谱顶点,且以每一指定类别对应的连接线作为该指定类别的图谱边,生成以每一指定类别分别作为一个图谱维度的证券关系图谱;其中,每一指定类别对应的连接线为该指定类别下每一证券的图谱顶点与所对应的关联证券的图谱顶点的连接线。Step A2, using each security as a graph vertex, and using the connecting line corresponding to each specified category as a graph edge of the specified category, to generate a securities relationship graph with each specified category as a graph dimension; wherein, each specified The connection line corresponding to the category is the connection line between the graph vertex of each security under the specified category and the graph vertex of the corresponding related securities.
本实现方式中,针对每一指定类别下的每一证券,都在证券关系图谱中将该证券的图谱顶点与所对应的关联证券的图谱顶点相连,从而得到该只证券的图谱顶点与所对应的关联证券的图谱顶点的连接线;针对该指定类别,将该指定类别的各个连接线作为图谱边,从而得到该指定类别作为一个图谱维度的证券关系图谱。最终可以得到各个指定类别分别作为一个图谱维度的多维度的证券关系图谱。In this implementation, for each security under each specified category, the graph vertex of the security is connected with the graph vertex of the corresponding related securities in the securities relationship graph, so as to obtain the graph vertex of the security and the corresponding For the specified category, each connecting line of the specified category is used as a graph edge, so as to obtain a securities relationship graph with the specified category as a graph dimension. Finally, a multi-dimensional securities relationship graph can be obtained in which each specified category is used as a graph dimension.
多维度的证券关系图谱可以为每一图谱维度的证券关系图谱的集合,也就是每一图谱维度的证券关系图谱都可以作为一个独立的图谱,或者还可以为一张包含多种类型图谱边的图谱,也就是说,在多维度的证券关系图谱中,每两只证券之间的可能存在多条图谱边,每一图谱边用于表征不同的指定类别。A multi-dimensional securities relationship graph can be a collection of securities relationship graphs in each graph dimension, that is, a securities relationship graph in each graph dimension can be used as an independent graph, or it can also be a graph containing multiple types of graph edges. Graph, that is to say, in a multi-dimensional securities relationship graph, there may be multiple graph edges between every two securities, and each graph edge is used to represent a different specified category.
当生成证券关系图谱后,可以将该证券关系图谱存入图数据库中,实现对证券关系图谱的存储,客户端的终端设备也可以从图数据库中获取该证券关系图谱,并在终端设备中对该证券关系图谱进行可视化展示。After the securities relationship graph is generated, the securities relationship graph can be stored in the graph database to realize the storage of the securities graph. The terminal device of the client can also obtain the securities graph from the graph database and store the The securities relationship graph is visualized.
本实施例中,针对多只证券,根据所获取的证券数据确定每一只证券多个指定类别下的因子数据,从多个维度构建证券关系图谱,提高了证券关系图谱的丰富度,同时利用因子数据将各只证券进行向量化处理,得到证券向量,并基于所得到的证券向量确定每一类别下各只证券之间的指定关系数据,从而利用所确定的指定关系数据构建证券关系图谱,提高了各只证券之间的连通性。因此,通过本方案可以更好地构建证券关系图谱。In this embodiment, for multiple securities, according to the acquired securities data, the factor data under multiple specified categories of each security is determined, and the securities relationship map is constructed from multiple dimensions, which improves the richness of the securities relationship map. The factor data vectorizes each security to obtain a security vector, and based on the obtained security vector, determines the specified relationship data between each security under each category, so as to use the determined specified relationship data to construct a securities relationship graph. Improved connectivity between securities. Therefore, through this scheme, the securities relationship graph can be better constructed.
可选地,在本发明的另一实施例中,如图2所示,上述每一证券对应的指定数量的确定方式,可以包括步骤S201-步骤S203:Optionally, in another embodiment of the present invention, as shown in FIG. 2 , the method of determining the designated quantity corresponding to each of the above-mentioned securities may include steps S201-step S203:
S201,针对每一指定类别,计算在该指定类别下每一证券对应的关联证券的数量分别为1到M时,该指定类别作为一个图谱维度的证券关系图谱的M个信息熵;其中,M为各只证券的总只数;S201, for each specified category, calculate the M information entropies of the specified category as a securities relationship map of a map dimension when the number of related securities corresponding to each security under the specified category is 1 to M; wherein, M is the total number of each security;
也就是假定该指定类别下每一证券对应的关联证券的数量为1,2,3……M,计算每一种情况下该指定类别作为一个图谱维度的证券关系图谱的信息熵,其中,信息熵可以用于度量该指定类别作为一个图谱维度的证券关系图谱的不确定性,即该指定类别作为一个图谱维度的证券关系图谱的可靠性,例如,该指定类别下每一证券对应的关联证券的数量为n,则可以计算该指定类别下每一证券对应的关联证券的数量为n时,该指定类别作为一个图谱维度的证券关系图谱的信息熵。That is, assuming that the number of related securities corresponding to each security under the specified category is 1, 2, 3...M, calculate the information entropy of the specified category as a graph dimension of the securities relationship graph in each case, where information Entropy can be used to measure the uncertainty of the securities relationship graph of the specified category as a graph dimension, that is, the reliability of the securities relationship graph of the specified category as a graph dimension, for example, the associated securities corresponding to each security under the specified category The number of securities is n, and when the number of related securities corresponding to each security under the specified category is n, the information entropy of the specified category as a graph dimension of the securities relationship map can be calculated.
假设证券分别为证券1、证券2、证券3……证券M,则信息熵可以按照如下公式计算:Assuming that the securities are respectively securities 1, securities 2, securities 3 ... securities M, the information entropy can be calculated according to the following formula:
其中,H(n)为该指定类别下每一证券对应的关联证券的数量为n时,以该指定类别作为一个图谱维度的证券关系图谱的信息熵,n的取值范围为1到M;M为各只证券的总只数;为证券m的关联证券数量为n时,该n只关联证券的证券向量的向量相似度之和,该n只关联证券可以为与证券m的证券向量的向量相似度最大的n只证券,m的取值范围为1到M。Among them, H(n) is the information entropy of the securities relationship map with the specified category as a graph dimension when the number of related securities corresponding to each security under the specified category is n, and the value range of n is 1 to M; M is the total number of securities; When the number of related securities of security m is n, the sum of the vector similarities of the securities vectors of the n related securities, the n related securities can be the n securities with the largest vector similarity with the securities vector of security m, m The value range is from 1 to M.
S202,从所得到的各信息熵中选取数值最小的极小值点,作为目标信息熵;S202. Select the minimum value point with the smallest value from the obtained information entropy as the target information entropy;
可以理解的,选取最小的极小值点可以保证在排除无意义的最小值点的情况下,使得该指定类别作为一个图谱维度的证券关系图谱的不确定性尽可能小。It can be understood that selecting the smallest minimum value point can ensure that the uncertainty of the securities relationship graph of the specified category as a graph dimension is as small as possible under the condition of excluding meaningless minimum value points.
S203,确定目标信息熵所对应的关联证券的数量,作为在该指定类别下每一证券对应的指定数量。S203. Determine the quantity of associated securities corresponding to the target information entropy as the specified quantity corresponding to each security under the specified category.
可以理解的,针对每一指定类别,都可以通过计算信息熵的方式,确定该指定类别下每一证券对应的指定数量,使得最终生成的多个维度的证券关系图谱的不确定性更低。It can be understood that, for each specified category, the specified quantity corresponding to each security under the specified category can be determined by calculating information entropy, so that the uncertainty of the finally generated multi-dimensional security relationship map is lower.
本实施例中,针对每一指定类别,计算在该指定类别下每一证券对应的关联证券的数量分别为1到M时,该指定类别作为一个图谱维度的证券关系图谱的M个信息熵;从所得到的各信息熵中选取数值最小的极小值点,作为目标信息熵;确定目标信息熵所对应的关联证券的数量,作为在该指定类别下每一证券对应的指定数量。本方案通计算信息熵的方式确定指定数量,能够使得所构建的证券关系图谱的可靠性更高。In this embodiment, for each specified category, when the number of related securities corresponding to each security under the specified category is 1 to M respectively, the specified category is used as M information entropies of a securities relationship graph of a map dimension; Select the minimum value point with the smallest value from the obtained information entropy as the target information entropy; determine the number of related securities corresponding to the target information entropy as the specified quantity corresponding to each security under the specified category. This scheme determines the specified quantity by calculating the information entropy, which can make the constructed securities relationship map more reliable.
可选地,在本发明的另一实施例中,上述针对待构建证券关系图谱的多只证券,获取每一证券的证券数据,可以包括:Optionally, in another embodiment of the present invention, the acquisition of the securities data of each of the securities for the multiple securities to be constructed of the securities relationship graph may include:
每当进入预定的构建周期时,针对待构建证券关系图谱的多只证券,获取当前的构建周期内每一证券的证券数据;其中,构建周期的周期时长基于证券数据的更新周期的时长所确定。Whenever a predetermined construction cycle is entered, the securities data of each security in the current construction cycle is obtained for multiple securities of the securities relationship graph to be constructed; wherein, the duration of the construction cycle is determined based on the duration of the update cycle of the securities data .
本实施例中,可以周期性地更新证券关系图谱,也就是周期性地针对待构建证券关系图谱的多只证券,获取当前的构建周期内每一证券的证券数据,并执行后续的构建证券关系图谱的过程。上述构建周期可以可证券数据的更新周期保持一致,例如一个季度、一个月、一天、一小时等,这都是可以的。不同的指定类别也可以有不同的构建周期,例如,指定类别下的因子数据来源于证券的发行方的财务报表,则构建周期可以为一个季度;指定类别下的因子数据来源于证券的技术指标数据,则构建周期可以为一天;指定类别下的因子数据来源于证券的证券的日内交易数据,则构建周期可以为一小时。In this embodiment, the securities relationship graph can be periodically updated, that is, periodically for multiple securities to be constructed in the securities relationship graph, obtain the securities data of each security in the current construction cycle, and execute the subsequent construction of the securities relationship Mapping process. The above-mentioned construction cycle can be consistent with the update cycle of securities data, such as one quarter, one month, one day, one hour, etc., which are all possible. Different designated categories can also have different construction periods. For example, the factor data under the designated category comes from the financial statements of the issuer of the securities, and the construction period can be one quarter; the factor data under the designated category comes from the technical indicators of the securities data, the construction period can be one day; the factor data under the specified category comes from the intraday transaction data of securities, and the construction period can be one hour.
本实施例中,针对多只证券,根据所获取的证券数据确定每一只证券多个指定类别下的因子数据,从多个维度构建证券关系图谱,提高了证券关系图谱的丰富度,同时利用因子数据将各只证券进行向量化处理,得到证券向量,并基于所得到的证券向量构建证券关系图谱,提高了各只证券之间的连通性。因此,通过本方案可以更好地构建证券关系图谱。进一步的,通过每当进入预定的构建周期时,针对待构建证券关系图谱的多只证券,获取当前的构建周期内每一证券的证券数据,可以动态地更新证券关系图谱,从而可以动态的反映不同证券之间的关联关系。In this embodiment, for multiple securities, according to the acquired securities data, the factor data under multiple specified categories of each security is determined, and the securities relationship map is constructed from multiple dimensions, which improves the richness of the securities relationship map. The factor data vectorizes each security to obtain a security vector, and builds a security relationship graph based on the obtained security vector, which improves the connectivity between various securities. Therefore, through this scheme, the securities relationship graph can be better constructed. Further, by obtaining the securities data of each security in the current construction cycle for multiple securities to be constructed whenever a predetermined construction period is entered, the securities relationship diagram can be dynamically updated, thereby dynamically reflecting Relationship between different securities.
可选地,在本发明的另一实施例中,该方法还包括:Optionally, in another embodiment of the present invention, the method further includes:
利用社区挖掘算法,以及证券关系图谱中的每一图谱维度下的各只证券的指定关系数据,从不同的图谱维度,将各只证券划分至不同的簇当中。Using the community mining algorithm and the specified relationship data of each security under each graph dimension in the securities relationship graph, each security is divided into different clusters from different graph dimensions.
其中,社区挖掘是指将实体划分为不同的类型,即不同的簇,使得同一簇内的图谱边尽量地多,不同簇之间的图谱边尽可能地少。社区挖掘算法可以采用UEOC(基于集成网络重叠社区挖掘算法)算法、FNCA(一种快速的社区挖掘算法)算法等,本发明实施例不做具体限定。Among them, community mining refers to dividing entities into different types, that is, different clusters, so that there are as many graph edges in the same cluster as possible, and as few graph edges as possible between different clusters. The community mining algorithm may adopt UEOC (community mining algorithm based on integrated network overlap) algorithm, FNCA (a fast community mining algorithm) algorithm, etc., which are not specifically limited in the embodiment of the present invention.
在实际应用过程中,可以利用社区挖掘算法处理每一图谱维度下的证券关系图谱,由于社区挖掘算法一般要使用到各个图谱边的权重,因此,当指定关系数据为每一图谱维度下每两只证券的证券向量的向量相似度时,则可以将每两只证券的证券向量的向量相似度作为该两只证券作为图谱顶点的图谱边的权重,进一步的,将每一图谱维度下的各个图谱边的权重带入上述社区挖掘算法的计算过程中,则可以得到将该图谱维度下的各只证券划分至不同的簇的计算结果。In the actual application process, the community mining algorithm can be used to process the securities relationship graph under each graph dimension. Since the community mining algorithm generally uses the weight of each graph edge, when the specified relationship data is every two graphs in each graph dimension When the vector similarity of the securities vectors of only two securities is used, the vector similarity of the securities vectors of each two securities can be used as the weight of the two securities as the graph edge of the graph vertex, and further, each graph under each graph dimension The weight of the graph edge is brought into the calculation process of the above-mentioned community mining algorithm, and then the calculation results of dividing each security under the graph dimension into different clusters can be obtained.
本实施例中,通过利用社区挖掘算法,以及证券关系图谱中的每一图谱维度下的各只证券的指定关系数据,从不同的图谱维度,将各只证券划分至不同的簇当中,可以更加直观地反映出不同证券之间的关联关系,从而更好地为投资者的投资决策和风险防范提供参考。In this embodiment, by using the community mining algorithm and the specified relational data of each stock under each map dimension in the stock relation map, each stock is divided into different clusters from different map dimensions, which can be more Intuitively reflect the relationship between different securities, so as to better provide reference for investors' investment decisions and risk prevention.
为了方便理解,下面结合附图,对本发明实施例所提供的证券关系图谱的构建方法进行进一步介绍。For the convenience of understanding, the method for constructing the securities relationship graph provided by the embodiment of the present invention will be further introduced below in conjunction with the accompanying drawings.
如图3所示,指定类别共有N种,指定类别1、指定类别2……指定类别N,针对指定类别1,可先构建指定类别1的多因子向量数据,即指定类别1下的各证券的证券向量;根据各证券的证券向量计算每两只证券的证券向量之间的向量相似度;将每两只证券的证券向量之间的向量相似度作为该两只证券的图谱边的权重,从而构建出全连接的原始证券关系图谱;根据信息熵极小值确定图谱边的数目n,针对每一证券保留权重最大的n条图谱边,得到指定类别1的证券关系图谱,即指定类别1作为一个图谱维度的证券关系图谱。As shown in Figure 3, there are N types of specified categories, specified category 1, specified category 2 ... specified category N, for specified category 1, the multi-factor vector data of specified category 1 can be constructed first, that is, each security under specified category 1 The stock vectors of each stock; calculate the vector similarity between the stock vectors of each two stocks according to the stock vectors of each stock; use the vector similarity between the stock vectors of each two stocks as the weight of the graph edge of the two stocks, In this way, a fully connected original securities relationship graph is constructed; the number n of graph edges is determined according to the minimum value of information entropy, and the n graph edges with the largest weight are reserved for each security to obtain a securities relationship graph of the specified category 1, that is, the specified category 1 A securities relationship graph as a graph dimension.
其他各指定类别的证券关系图谱的构建方式相同。当各指定类别的证券关系图谱均构建后,叠加所有指定类别的证券关系图谱生成多维度的证券关系图谱。之后,可以进行图谱存储及可视化展示,也就是,将该证券关系图谱存入图数据库中,并将该证券关系图谱在客户端中展示;同时,进行证券间社区挖掘及分析,即利用社区挖掘算法对每一图谱维度下的各只证券的指定关系数据,从不同的图谱维度,将各只证券划分至不同的簇当中。The construction method of the securities relationship graph of other specified categories is the same. After the securities relationship graphs of each specified category are constructed, the securities relationship graphs of all specified categories are superimposed to generate a multi-dimensional securities relationship graph. After that, graph storage and visual display can be carried out, that is, the securities relationship graph is stored in the graph database, and the securities relationship graph is displayed on the client; at the same time, community mining and analysis among securities can be carried out, that is, using community mining For the specified relationship data of each security under each map dimension, the algorithm divides each stock into different clusters from different map dimensions.
在一种实现方式中,实现本发明实施例所提供的方法的计算机程序可以被划分为图4所示的多个功能模块:In an implementation manner, the computer program implementing the method provided by the embodiment of the present invention can be divided into multiple functional modules as shown in FIG. 4:
证券多因子数据计算模块,用于针对待构建证券关系图谱的多只证券,获取每一证券的证券数据,并从证券数据中获取及计算得到各只证券的多个指定类别下的因子数据,生成每一指定类别下各只证券的证券向量;The securities multi-factor data calculation module is used to obtain the securities data of each security for multiple securities to be constructed in the securities relationship graph, and obtain and calculate the factor data under multiple specified categories of each security from the securities data, Generate security vectors for each security under each specified category;
证券相似度计算模块,用于计算每一指定类别下,每两只证券的证券向量的向量相似度;The securities similarity calculation module is used to calculate the vector similarity of the securities vectors of each two securities under each specified category;
证券全连接图谱建立模块,用于构建并存储每一个指定类别下所有证券之间的全连接的证券关系图谱,其中,图谱边的权重为该图谱边的两只证券的证券向量的向量相似度;The securities full connection graph building module is used to construct and store a fully connected securities relationship graph between all securities under each specified category, wherein the weight of the graph edge is the vector similarity of the securities vectors of the two securities on the graph edge ;
证券边数目确立模块,用于计算每一个指定类别下各只证券的图谱边的指定数量,并针对每一个指定类别下每一证券保留指定数量个图谱边;A module for establishing the number of securities edges is used to calculate the specified number of graph edges of each security under each specified category, and reserve a specified number of graph edges for each security under each specified category;
证券多维度关系图谱叠加和生成模块,用于组合每一个指定类别的证券关系图谱至一个多维度的证券关系图谱中;The securities multi-dimensional relationship graph overlay and generation module is used to combine the securities relationship graph of each specified category into a multi-dimensional securities relationship graph;
证券关系图谱存储及可视化模块,用于将所构建的证券关系图谱存储到图数据库中,并利用图数据库展示方面的优势,可视化不同证券之间多维度的关联关系;The securities relationship map storage and visualization module is used to store the constructed securities relationship map in the graph database, and use the advantages of the graph database display to visualize the multi-dimensional relationship between different securities;
证券关系图社区挖掘和分析模块,用于利用社区挖掘算法,以及证券关系图谱中的每一图谱维度下的各只证券的指定关系数据,从不同的图谱维度,将各只证券划分至不同的簇当中,分析每个簇内部及不同簇之间的证券关系。The securities relationship map community mining and analysis module is used to use the community mining algorithm and the specified relationship data of each stock under each map dimension in the stock relation map to divide each stock into different map dimensions from different map dimensions Among the clusters, the securities relationship within each cluster and between different clusters is analyzed.
本实施例中,根据所获取的证券数据确定每一只证券多个指定类别下的因子数据,从多个维度构建证券关系图谱,提高了证券关系图谱的丰富度,同时利用因子数据将各只证券进行向量化处理,得到证券向量,并基于所得到的证券向量构建证券关系图谱,提高了各只证券之间的连通性。因此,通过本方案可以更好地构建证券关系图谱。In this embodiment, the factor data under multiple specified categories of each security is determined according to the acquired securities data, and the securities relationship map is constructed from multiple dimensions, which improves the richness of the securities relationship map. Stocks are vectorized to obtain stock vectors, and a stock relationship graph is constructed based on the obtained stock vectors, which improves the connectivity between each stock. Therefore, through this scheme, the securities relationship graph can be better constructed.
本发明实施例还提供了一种证券关系图谱的构建装置,如图5所示,该装置包括:The embodiment of the present invention also provides a device for constructing a securities relationship graph, as shown in Figure 5, the device includes:
获取模块510,用于针对待构建证券关系图谱的多只证券,获取每一所述证券的证券数据;其中,所述证券数据包括,所述证券的发行方的财务报表、所述证券的技术指标数据和/或所述证券的日内交易数据;An
向量确定模块520,用于针对每一所述证券,从该证券的证券数据中,确定该证券的多个指定类别下的因子数据,并基于该证券的每一指定类别下的因子数据,确定该证券的每一指定类别下的证券向量;其中,每一指定类别为用于进行证券描述的一个类别,每一指定类别下的因子数据为该指定类别下的证券描述数据;The
关系确定模块530,用于针对每一指定类别,基于各只证券的该指定类别下的证券向量,确定在该指定类别下的各只证券之间的指定关系数据;其中,所述指定关系数据为用于表征关联关系的数据;The
构建模块540,用于以各只证券作为图谱顶点,基于在每一指定类别下各只证券之间的指定关系数据,构建以每一指定类别分别作为一个图谱维度的证券关系图谱。The
可选地,所述因子数据包含至少一个因子值;Optionally, said factor data comprises at least one factor value;
所述向量确定模块,具体用于:The vector determination module is specifically used for:
针对每一指定类别,以该证券在该指定类别下的因子数据所包含的每一因子值分别作为一个向量维度,生成该证券在该指定类别下的证券向量。For each specified category, each factor value contained in the factor data of the security under the specified category is used as a vector dimension to generate a security vector of the security under the specified category.
可选地,所述关系确定模块,具体用于:Optionally, the relationship determination module is specifically configured to:
针对每一指定类别,计算每两只证券的该指定类别下的证券向量的向量相似度,得到该指定类别下每两只证券之间的指定关系数据。For each specified category, calculate the vector similarity of the securities vectors under the specified category for every two securities, and obtain the specified relationship data between every two securities under the specified category.
可选地,所述构建模块,包括:Optionally, the building blocks include:
关联证券确定子模块,用于针对每一指定类别,基于每两只证券的该指定类别下的证券向量的向量相似度,确定在该指定类别下每一证券对应的关联证券;其中,每一证券对应的关联证券为:与该证券的向量相似度大于预定阈值的证券,或者,按照向量相似度排序靠后排序靠前的、与该证券对应的指定数量个证券;The related securities determination sub-module is used to determine the related securities corresponding to each security under the specified category based on the vector similarity of the securities vectors under the specified category of every two securities for each specified category; wherein, each The associated securities corresponding to the securities are: securities whose vector similarity with the securities is greater than a predetermined threshold, or a specified number of securities corresponding to the securities that are sorted according to the similarity of the vectors;
生成子模块,用于以各只证券作为图谱顶点,且以每一指定类别对应的连接线作为该指定类别的图谱边,生成以每一指定类别分别作为一个图谱维度的证券关系图谱;其中,每一指定类别对应的连接线为该指定类别下每一证券的图谱顶点与所对应的关联证券的图谱顶点的连接线。Generate a sub-module, which is used to use each security as a graph vertex, and use the connecting line corresponding to each specified category as a graph edge of the specified category to generate a securities relationship graph with each specified category as a graph dimension; wherein, The connection line corresponding to each specified category is the connection line between the graph vertex of each security under the specified category and the graph vertex of the corresponding related securities.
可选地,所述装置还包括:Optionally, the device also includes:
计算模块,用于针对每一指定类别,计算在该指定类别下每一证券对应的关联证券的数量分别为1到M时,该指定类别作为一个图谱维度的证券关系图谱的M个信息熵;其中,M为各只证券的总只数;A calculation module, for each specified category, when the number of related securities corresponding to each security under the specified category is 1 to M respectively, the specified category is used as M information entropies of a securities relationship map of a map dimension; Among them, M is the total number of securities;
选取模块,用于从所得到的各信息熵中选取数值最小的极小值点,作为目标信息熵;The selection module is used to select the minimum value point with the smallest value from the obtained information entropy as the target information entropy;
数量确定模块,用于确定所述目标信息熵所对应的关联证券的数量,作为在该指定类别下每一证券对应的指定数量。A quantity determining module, configured to determine the quantity of associated securities corresponding to the target information entropy as the specified quantity corresponding to each security under the specified category.
可选地,所述获取模块,具体用于:Optionally, the acquisition module is specifically used for:
每当进入预定的构建周期时,针对待构建证券关系图谱的多只证券,获取当前的构建周期内每一所述证券的证券数据;其中,所述构建周期的周期时长基于所述证券数据的更新周期的时长所确定。Whenever a predetermined construction cycle is entered, the securities data of each of the securities in the current construction cycle is obtained for multiple securities of the securities relationship graph to be constructed; wherein, the duration of the construction cycle is based on the securities data Determined by the length of the update cycle.
可选地,所述装置还包括:Optionally, the device also includes:
划分模块,用于利用社区挖掘算法,以及所述证券关系图谱中的每一图谱维度下的各只证券的指定关系数据,从不同的图谱维度,将各只证券划分至不同的簇当中。The division module is used to divide each security into different clusters from different graph dimensions by using the community mining algorithm and the specified relationship data of each security under each graph dimension in the securities relationship graph.
本发明实施例还提供了一种电子设备,如图6所示,包括处理器601、通信接口602、存储器603和通信总线604,其中,处理器601,通信接口602,存储器603通过通信总线604完成相互间的通信,The embodiment of the present invention also provides an electronic device, as shown in FIG. complete the mutual communication,
存储器603,用于存放计算机程序;
处理器601,用于执行存储器603上所存放的程序时,实现上述证券关系图谱的构建方法。The
上述电子设备提到的通信总线可以是外设部件互连标准(Peripheral ComponentInterconnect,PCI)总线或扩展工业标准结构(Extended Industry StandardArchitecture,EISA)总线等。该通信总线可以分为地址总线、数据总线、控制总线等。为便于表示,图中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。The communication bus mentioned in the above electronic device may be a Peripheral Component Interconnect (PCI) bus or an Extended Industry Standard Architecture (EISA) bus or the like. The communication bus can be divided into an address bus, a data bus, a control bus, and the like. For ease of representation, only one thick line is used in the figure, but it does not mean that there is only one bus or one type of bus.
通信接口用于上述电子设备与其他设备之间的通信。The communication interface is used for communication between the electronic device and other devices.
存储器可以包括随机存取存储器(Random Access Memory,RAM),也可以包括非易失性存储器(Non-Volatile Memory,NVM),例如至少一个磁盘存储器。可选的,存储器还可以是至少一个位于远离前述处理器的存储装置。The memory may include a random access memory (Random Access Memory, RAM), and may also include a non-volatile memory (Non-Volatile Memory, NVM), such as at least one disk memory. Optionally, the memory may also be at least one storage device located far away from the aforementioned processor.
上述的处理器可以是通用处理器,包括中央处理器(Central Processing Unit,CPU)、网络处理器(Network Processor,NP)等;还可以是数字信号处理器(Digital SignalProcessor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。Above-mentioned processor can be general-purpose processor, comprises central processing unit (Central Processing Unit, CPU), network processor (Network Processor, NP) etc.; It can also be Digital Signal Processor (Digital Signal Processor, DSP), ASIC (Application Specific Integrated Circuit, ASIC), Field-Programmable Gate Array (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components.
在本发明提供的又一实施例中,还提供了一种计算机可读存储介质,该计算机可读存储介质内存储有计算机程序,所述计算机程序被处理器执行时实现上述证券关系图谱的构建方法的步骤。In yet another embodiment provided by the present invention, a computer-readable storage medium is also provided, and a computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, the construction of the above-mentioned securities relationship graph is realized. method steps.
在本发明提供的又一实施例中,还提供了一种包含指令的计算机程序产品,当其在计算机上运行时,使得计算机执行上述实施例中证券关系图谱的构建方法。In yet another embodiment provided by the present invention, there is also provided a computer program product containing instructions, which, when run on a computer, causes the computer to execute the method for constructing a securities relationship graph in the above embodiment.
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。所述计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行所述计算机程序指令时,全部或部分地产生按照本发明实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,所述计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。所述计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。所述可用介质可以是磁性介质,(例如,软盘、硬盘、磁带)、光介质(例如,DVD)、或者半导体介质(例如固态硬盘Solid State Disk(SSD))等。In the above embodiments, all or part of them may be implemented by software, hardware, firmware or any combination thereof. When implemented using software, it may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on the computer, all or part of the processes or functions according to the embodiments of the present invention will be generated. The computer can be a general purpose computer, a special purpose computer, a computer network, or other programmable devices. The computer instructions may be stored in or transmitted from one computer-readable storage medium to another computer-readable storage medium, for example, the computer instructions may be transmitted from a website, computer, server or data center Transmission to another website site, computer, server, or data center by wired (eg, coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (eg, infrared, wireless, microwave, etc.). The computer-readable storage medium may be any available medium that can be accessed by a computer, or a data storage device such as a server or a data center integrated with one or more available media. The available medium may be a magnetic medium (for example, a floppy disk, a hard disk, or a magnetic tape), an optical medium (for example, DVD), or a semiconductor medium (for example, a Solid State Disk (SSD)).
需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。It should be noted that in this article, relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply that there is a relationship between these entities or operations. any such actual relationship or order exists between them. Furthermore, the term "comprises", "comprises" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article, or apparatus comprising a set of elements includes not only those elements, but also includes elements not expressly listed. other elements of or also include elements inherent in such a process, method, article, or apparatus. Without further limitations, an element defined by the phrase "comprising a ..." does not exclude the presence of additional identical elements in the process, method, article or apparatus comprising said element.
本说明书中的各个实施例均采用相关的方式描述,各个实施例之间相同相似的部分互相参见即可,每个实施例重点说明的都是与其他实施例的不同之处。Each embodiment in this specification is described in a related manner, the same and similar parts of each embodiment can be referred to each other, and each embodiment focuses on the differences from other embodiments.
以上所述仅为本发明的较佳实施例,并非用于限定本发明的保护范围。凡在本发明的精神和原则之内所作的任何修改、等同替换、改进等,均包含在本发明的保护范围内。The above descriptions are only preferred embodiments of the present invention, and are not intended to limit the protection scope of the present invention. Any modification, equivalent replacement, improvement, etc. made within the spirit and principles of the present invention are included in the protection scope of the present invention.
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202211045889.6A CN115357729B (en) | 2022-08-30 | 2022-08-30 | Method, device and electronic device for constructing securities relationship map |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202211045889.6A CN115357729B (en) | 2022-08-30 | 2022-08-30 | Method, device and electronic device for constructing securities relationship map |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN115357729A true CN115357729A (en) | 2022-11-18 |
| CN115357729B CN115357729B (en) | 2025-08-22 |
Family
ID=84004199
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202211045889.6A Active CN115357729B (en) | 2022-08-30 | 2022-08-30 | Method, device and electronic device for constructing securities relationship map |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN115357729B (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN115982229A (en) * | 2022-12-08 | 2023-04-18 | 中信建投证券股份有限公司 | Security processing method and device, electronic equipment and storage medium |
Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103236013A (en) * | 2013-05-08 | 2013-08-07 | 南京大学 | Stock market data analysis method based on key stock set identification |
| US20180082183A1 (en) * | 2011-02-22 | 2018-03-22 | Thomson Reuters Global Resources | Machine learning-based relationship association and related discovery and search engines |
| CN111782701A (en) * | 2020-05-29 | 2020-10-16 | 上海羽时互联网金融信息服务有限公司 | Method and device for selecting cattle stocks and method and device for screening cross-industry stock data |
| CN112634039A (en) * | 2020-12-25 | 2021-04-09 | 中国工商银行股份有限公司 | Bond associated transaction identification method and device |
| CN112862617A (en) * | 2019-11-27 | 2021-05-28 | 泰康保险集团股份有限公司 | Data processing method, system, storage medium and electronic device |
| CN114065069A (en) * | 2021-09-27 | 2022-02-18 | 杭州摸象大数据科技有限公司 | Financial event map construction method and device, computer equipment and storage medium |
| CN114677228A (en) * | 2022-03-21 | 2022-06-28 | 灯塔财经信息有限公司 | A security data mining method based on knowledge graph |
-
2022
- 2022-08-30 CN CN202211045889.6A patent/CN115357729B/en active Active
Patent Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20180082183A1 (en) * | 2011-02-22 | 2018-03-22 | Thomson Reuters Global Resources | Machine learning-based relationship association and related discovery and search engines |
| CN103236013A (en) * | 2013-05-08 | 2013-08-07 | 南京大学 | Stock market data analysis method based on key stock set identification |
| CN112862617A (en) * | 2019-11-27 | 2021-05-28 | 泰康保险集团股份有限公司 | Data processing method, system, storage medium and electronic device |
| CN111782701A (en) * | 2020-05-29 | 2020-10-16 | 上海羽时互联网金融信息服务有限公司 | Method and device for selecting cattle stocks and method and device for screening cross-industry stock data |
| CN112634039A (en) * | 2020-12-25 | 2021-04-09 | 中国工商银行股份有限公司 | Bond associated transaction identification method and device |
| CN114065069A (en) * | 2021-09-27 | 2022-02-18 | 杭州摸象大数据科技有限公司 | Financial event map construction method and device, computer equipment and storage medium |
| CN114677228A (en) * | 2022-03-21 | 2022-06-28 | 灯塔财经信息有限公司 | A security data mining method based on knowledge graph |
Non-Patent Citations (1)
| Title |
|---|
| 刘政昊等: "知识关联视角下金融证券知识图谱构建与相关股票发现", 数据分析与知识发现, no. 2, 11 December 2021 (2021-12-11), pages 3 - 5 * |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN115982229A (en) * | 2022-12-08 | 2023-04-18 | 中信建投证券股份有限公司 | Security processing method and device, electronic equipment and storage medium |
Also Published As
| Publication number | Publication date |
|---|---|
| CN115357729B (en) | 2025-08-22 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US8788394B1 (en) | Electronic information and analysis system | |
| US8374937B2 (en) | Non-capitalization weighted indexing system, method and computer program product | |
| US12235798B2 (en) | Data conversion and distribution systems | |
| US20120095892A1 (en) | Price indexing | |
| AU2017221899A1 (en) | Using accounting data based indexing to create a portfolio of financial objects | |
| CN101501720A (en) | Using accounting data based indexing to create a portfolio of assets | |
| Turcas et al. | Practical aspects of portfolio selection and optimisation on the capital market | |
| Hearn | Size and liquidity effects in African frontier equity markets | |
| Suhonen | Direct lending returns | |
| CN115357729A (en) | Method and device for constructing securities relation map and electronic equipment | |
| CA3081254C (en) | Data conversion and distribution systems | |
| Doan | Volatility and Risk Assessment of Blockchain Cryptocurrencies Using GARCH Modeling: An Analytical Study on Dogecoin, Polygon, and Solana | |
| KR102692812B1 (en) | Provision of stock investment strategies for Real Estate Investment Trusts (REITs) based on fluctuations in real estate prices | |
| Ang | Options | |
| Chen et al. | Mixed copula model with stochastic correlation for CDO pricing | |
| García et al. | Risk-cost frontier and collateral valuation in securities settlement systems for extreme market events | |
| Ramaube | The Profitability of Technical Analysis During Volatile Periods in the South African Financial Markets | |
| Jarrow et al. | Valuing Commercial Real Estate and its Real Options |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |