[go: up one dir, main page]

CN111754199A - A business ontology-driven enterprise credit relationship graph coarsening method - Google Patents

A business ontology-driven enterprise credit relationship graph coarsening method Download PDF

Info

Publication number
CN111754199A
CN111754199A CN202010604556.7A CN202010604556A CN111754199A CN 111754199 A CN111754199 A CN 111754199A CN 202010604556 A CN202010604556 A CN 202010604556A CN 111754199 A CN111754199 A CN 111754199A
Authority
CN
China
Prior art keywords
enterprise credit
enterprise
relationship graph
business
graph
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010604556.7A
Other languages
Chinese (zh)
Other versions
CN111754199B (en
Inventor
曹鸿强
赵鹏
冷巍
王俊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
3golden Beijing Information Technology Co ltd
Original Assignee
3golden Beijing Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 3golden Beijing Information Technology Co ltd filed Critical 3golden Beijing Information Technology Co ltd
Priority to CN202010604556.7A priority Critical patent/CN111754199B/en
Publication of CN111754199A publication Critical patent/CN111754199A/en
Application granted granted Critical
Publication of CN111754199B publication Critical patent/CN111754199B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/103Workflow collaboration or project management
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • G06F16/9024Graphs; Linked lists
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/90335Query processing
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Strategic Management (AREA)
  • Databases & Information Systems (AREA)
  • Human Resources & Organizations (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Data Mining & Analysis (AREA)
  • Entrepreneurship & Innovation (AREA)
  • General Business, Economics & Management (AREA)
  • General Engineering & Computer Science (AREA)
  • Marketing (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Educational Administration (AREA)
  • Primary Health Care (AREA)
  • Development Economics (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention belongs to the technical field of graph data processing, and particularly relates to a business ontology driven enterprise credit relationship graph coarsening method. The invention provides a novel business ontology driven enterprise credit relational graph coarsening method, which provides a multi-field business ontology model for public credit evaluation in the process of coarsening an enterprise credit relational graph, is based on the multi-field business ontology model, is assisted by enterprise data logical relationship information, adopts an Odgca algorithm, has a relatively great advantage in the aspect of the efficiency of coarsening the enterprise credit relational graph, avoids the blindness of the current majority of graph coarsening methods to a certain extent, and can generate a coarsening graph meeting the requirements of users more quickly.

Description

一种业务本体驱动的企业信用关系图粗化方法A business ontology-driven enterprise credit relationship graph coarsening method

技术领域technical field

本发明属于图数据处理技术领域,特别涉及一种业务本体驱动的企业信用关系图粗化方法。The invention belongs to the technical field of graph data processing, and particularly relates to a business ontology-driven enterprise credit relationship graph coarsening method.

背景技术Background technique

公共信用本质上是由公共生活领域中特定主体(如政府、企业、民间组织、有影响力的个人等)以履行不完全契约或隐形契约方式而提供的一种特殊的公共产品,产品的形式表现为上述特定主体在履行职责过程中产生或掌握的、可用于识别商事主体、事业单位、社会组织和公民个人(信息主体)基本信用状况的数据和资料。其中,企业是公共生活领域中最活跃的主要主体。以社会公共信用数据为基础,使用企业信用图谱模型,为构建以信用为基础的新型监管机制提供技术工具,为政府保障民生和发展经济政务决策提供数据支撑,具有重要意义。Public credit is essentially a special public product provided by specific subjects in the field of public life (such as the government, enterprises, civil organizations, influential individuals, etc.) in the form of fulfilling incomplete contracts or invisible contracts. Data and materials that can be used to identify the basic credit status of commercial entities, public institutions, social organizations and individual citizens (information subjects), which are generated or mastered by the above-mentioned specific subjects in the course of performing their duties. Among them, enterprises are the most active main subjects in the field of public life. Based on social public credit data, using the enterprise credit map model, it is of great significance to provide technical tools for the construction of a new credit-based supervision mechanism, and to provide data support for the government to ensure people's livelihood and develop economic and government affairs decisions.

企业信用图谱是对专家经验的一种固化,能够支持知识与工具的积累与复用,具有强大的关系表达能力,能够从企业与工商、税务、社保、法院、消防等实体的关系,投资关系、抵押关系、关联企业、招投标关系中挖掘信用信息,能够支持复杂风险模式挖掘,具有更好的可解释性和展示效果。然后,企业信用图谱规模较大,仅企业实体节点就超过2亿个,关系规模更大。因此,需要使用图粗化算法,在不损失或者尽量小的损失信息的前提下,实现大规模图向较小规模图转换,是实现大规模企业信用图谱处理和使用的重要问题。The corporate credit map is a solidification of expert experience, which can support the accumulation and reuse of knowledge and tools, and has a strong ability to express relationships. , mortgage relationship, affiliated enterprises, and bidding relationship mining credit information, which can support complex risk pattern mining, with better interpretability and display effect. Then, the scale of the enterprise credit graph is large, with more than 200 million enterprise entity nodes alone, and the scale of the relationship is even larger. Therefore, it is necessary to use a graph coarsening algorithm to realize the conversion of large-scale graphs to smaller-scale graphs without losing or as little loss of information as possible, which is an important issue to realize the processing and use of large-scale enterprise credit graphs.

目前的图粗化方法多是针对通过互联网数据爬取并自动构建的图,通过图节点和关系的分析,辅以算法来识别可以裁剪或者合并的节点和关系,图粗化过程没有考虑到用户业务需求,导致图粗化方法可能与用户业务需求存在一定偏差;也没有充分利用具体应用场景下的业务本体知识来减少图粗化算法的复杂性,在一定程度上影响了图粗化算法的效率。The current graph coarsening methods are mostly aimed at graphs crawled and automatically constructed through Internet data. Through the analysis of graph nodes and relationships, supplemented by algorithms to identify nodes and relationships that can be cut or merged, the graph coarsening process does not take users into account. Due to business requirements, the graph coarsening method may have a certain deviation from the user's business needs; the knowledge of business ontology in specific application scenarios is not fully utilized to reduce the complexity of the graph coarsening algorithm, which affects the graph coarsening algorithm to a certain extent. efficiency.

发明内容SUMMARY OF THE INVENTION

为了解决现有技术中存在的问题,本发明提供一种新的业务本体驱动的企业信用关系图粗化方法。In order to solve the problems existing in the prior art, the present invention provides a new business ontology-driven enterprise credit relationship graph coarsening method.

本发明具体技术方案如下:The specific technical scheme of the present invention is as follows:

本发明提供一种业务本体驱动的企业信用关系图粗化方法,包括如下步骤:The present invention provides a method for coarsening an enterprise credit relationship graph driven by a business ontology, comprising the following steps:

S1:接收输入的业务本体集合OS:{os1,os2,...,osn}、企业信用关系图g以及关注条件condition,并将业务本体集合OS、企业信用关系图g以及关注条件condition输入业务本体模型中;S1: Receive the input business ontology set OS: {os1, os2, ..., osn}, the enterprise credit relationship graph g and the concern condition condition, and input the business ontology set OS, the enterprise credit relation graph g and the concern condition condition into the business in the ontology model;

S2:解析关注条件,并根据解析后关注条件在企业信用关系图中生成虚拟节点,并采用Odgca算法将企业信用关系图粗化,得到粗化后的企业信用关系图g';S2: Analyze the attention conditions, and generate virtual nodes in the enterprise credit relationship graph according to the parsed attention conditions, and use the Odgca algorithm to coarsen the enterprise credit relationship graph to obtain the coarsened enterprise credit relationship graph g';

S3:基于粗化后的企业信用关系图进行业务服务。S3: Provide business services based on the roughened enterprise credit relationship graph.

本发明的有益效果如下:The beneficial effects of the present invention are as follows:

本发明提供一种新的业务本体驱动的企业信用关系图粗化方法,该业务本体驱动的企业信用关系图粗化方法在企业信用关系图粗化过程中,提出了公共信用评价的多领域业务本体模型,并基于多领域业务本体模型,辅以企业数据逻辑关系信息,采用Odgca算法,在企业信用关系图粗化的效率方面具有比较大的优势,一定程度上避免了目前大多数图粗化方法的盲目性,能够更快的生成满足用户需求的粗化图。The present invention provides a new business ontology-driven enterprise credit relationship graph coarsening method, and the business ontology-driven enterprise credit relationship graph coarsening method proposes a multi-domain business of public credit evaluation in the process of enterprise credit relationship graph coarsening Ontology model, based on multi-domain business ontology model, supplemented by enterprise data logic relationship information, using Odgca algorithm, has a relatively large advantage in the efficiency of enterprise credit relationship graph coarsening, and avoids most of the current graph coarsening to a certain extent. The blindness of the method can generate a coarse map that meets the user's needs more quickly.

附图说明Description of drawings

图1为本发明业务本体驱动的企业信用关系图粗化方法的流程图;Fig. 1 is the flow chart of the coarsening method of enterprise credit relation graph driven by business ontology of the present invention;

图2为步骤S2中Odgca算法的流程图;Fig. 2 is the flow chart of Odgca algorithm in step S2;

图3为步骤S24的流程图;Fig. 3 is the flow chart of step S24;

图4为本发明企业信用评价业务本体模型的结构示意图;4 is a schematic structural diagram of an enterprise credit evaluation business ontology model of the present invention;

图5为本发明企业信用评价区域本体模型的结构示意图;5 is a schematic structural diagram of an enterprise credit evaluation area ontology model of the present invention;

图6为本发明企业信用评价行业本体模型的结构示意图;6 is a schematic structural diagram of the enterprise credit evaluation industry ontology model of the present invention;

图7为本发明企业信用评价领域本体模型的结构示意图。FIG. 7 is a schematic structural diagram of an ontology model in the field of enterprise credit evaluation according to the present invention.

具体实施方式Detailed ways

下面结合附图和以下实施例对本发明作进一步详细说明。The present invention will be described in further detail below in conjunction with the accompanying drawings and the following examples.

本发明提供一种业务本体驱动的企业信用关系图粗化方法,如图1所示,包括如下步骤:The present invention provides a business ontology-driven enterprise credit relationship graph coarsening method, as shown in FIG. 1 , comprising the following steps:

S1:接收输入的业务本体集合OS:{os1,os2,...,osn}、企业信用关系图g以及关注条件condition,并将业务本体集合OS、企业信用关系图g以及关注条件condition输入业务本体模型中;S1: Receive the input business ontology set OS: {os1, os2, ..., osn}, the enterprise credit relationship graph g and the concern condition condition, and input the business ontology set OS, the enterprise credit relation graph g and the concern condition condition into the business in the ontology model;

企业信用评价领域中,用户需要输入业务关注点。业务关注点是图粗化的输入条件之一,作为图粗化算法的指导信息。In the field of enterprise credit evaluation, users need to input business concerns. The business concern is one of the input conditions for graph coarsening, and serves as the guiding information for the graph coarsening algorithm.

S2:解析关注条件,并根据解析后关注条件在企业信用关系图中生成虚拟节点,并采用Odgca算法将企业信用关系图粗化,得到粗化后的企业信用关系图g';S2: Analyze the attention conditions, and generate virtual nodes in the enterprise credit relationship graph according to the parsed attention conditions, and use the Odgca algorithm to coarsen the enterprise credit relationship graph to obtain the coarsened enterprise credit relationship graph g';

解析关注条件,根据关注条件生成粗化图中的虚拟节点,使用Odgca算法进行图粗化,对企业信用图中的节点进行遍历和聚集操作,实现企业信用关系图粗化。Analyze the attention conditions, generate virtual nodes in the coarsening graph according to the attention conditions, use the Odgca algorithm to coarsen the graph, perform traversal and aggregation operations on the nodes in the enterprise credit graph, and realize the coarsening of the enterprise credit relationship graph.

S3:基于粗化后的企业信用关系图进行业务服务。S3: Provide business services based on the roughened enterprise credit relationship graph.

得到了粗化图之后,即可基于粗化图进行企业信用评价业务的各种操作,包括各种查询和统计,由于粗化图粒度较大、规模较小,可以大幅度的缩短图查询和统计的时间开销。After obtaining the coarsened graph, various operations of the enterprise credit evaluation business can be performed based on the coarsened graph, including various queries and statistics. Due to the large granularity and small scale of the coarsened graph, graph queries and graphs can be greatly shortened. Statistical time overhead.

本发明提供一种新的业务本体驱动的企业信用关系图粗化方法,该业务本体驱动的企业信用关系图粗化方法在企业信用关系图粗化过程中,提出了公共信用评价的多领域业务本体模型,并基于多领域业务本体模型,辅以企业数据逻辑关系信息,采用Odgca算法,在企业信用关系图粗化的效率方面具有比较大的优势,一定程度上避免了目前大多数图粗化方法的盲目性,能够更快的生成满足用户需求的粗化图。The present invention provides a new business ontology-driven enterprise credit relationship graph coarsening method, and the business ontology-driven enterprise credit relationship graph coarsening method proposes a multi-domain business of public credit evaluation in the process of enterprise credit relationship graph coarsening Ontology model, based on multi-domain business ontology model, supplemented by enterprise data logic relationship information, using Odgca algorithm, has a relatively large advantage in the efficiency of enterprise credit relationship graph coarsening, and avoids most of the current graph coarsening to a certain extent. The blindness of the method can generate a coarse map that meets the user's needs more quickly.

不同的应用领域具有不同的业务本体模型。本发明面向的是企业信用评价领域,首先应针对该领域构建企业信用评价的业务本体模型;业务本体模型描述了企业信用评价领域的客观存在实体以及实体之间的逻辑关系,反映了企业信用评价业务需求。Different application domains have different business ontology models. The present invention is aimed at the field of enterprise credit evaluation, and firstly, a business ontology model of enterprise credit evaluation should be constructed for this field; the business ontology model describes the objective existence entities in the field of enterprise credit evaluation and the logical relationship between entities, and reflects the enterprise credit evaluation. Business needs.

如图2所示,步骤S2的Odgca算法(业务本体驱动的图粗化算法)包括如下步骤:As shown in Figure 2, the Odgca algorithm (a business ontology-driven graph coarsening algorithm) in step S2 includes the following steps:

S21:根据关注条件遍历业务本体集合OS:{os1,os2,...,osn};此步骤中关注条件可能为多个业务本体的交集S21: Traverse the business ontology set OS:{os1,os2,...,osn} according to the concern condition; the concern condition in this step may be the intersection of multiple business ontology

S22:遍历企业信用关系图中的节点,并在关注条件为多个业务本体的交集的情况下,生成虚拟的业务本体,同时关注多个业务本体;S22: Traverse the nodes in the enterprise credit relationship graph, and generate a virtual business ontology when the concern condition is the intersection of multiple business ontology, and pay attention to multiple business ontology at the same time;

S23:初始化粗化后的企业信用关系图g'及其节点,在初始化阶段,粗化后的企业信用关系图g'仅包括根节点和节点,属性及边均为空;S23: Initialize the roughened enterprise credit relationship graph g' and its nodes. In the initialization stage, the roughened enterprise credit relationship graph g' only includes root nodes and nodes, and both attributes and edges are empty;

S24:对企业信用关系图中的节点进行聚集操作,得到粗化后的企业信用关系图g';S24: Perform an aggregation operation on the nodes in the enterprise credit relationship graph to obtain a coarsened enterprise credit relationship graph g';

如图3所示,优选的,步骤S24具体包括如下部分:As shown in Figure 3, preferably, step S24 specifically includes the following parts:

S241:判断企业信用关系图中每个节点的类型;S241: Determine the type of each node in the enterprise credit relationship graph;

S242:当节点的类型为公司时,将公司节点的连接的所有属性边收缩至公司节点,实现公司属性节点及边的粗化;S242: When the type of the node is a company, shrink all the attribute edges of the connection of the company node to the company node, so as to realize the coarsening of the company attribute node and edge;

S243:判断企业信用关系图的节点与虚拟的业务本体实例节点是否有边连接,如果是,则在粗化图中增加该企业信用关系图的节点;S243: Determine whether the node of the enterprise credit relationship graph is connected with the virtual business ontology instance node, and if so, add the node of the enterprise credit relationship graph in the coarsening graph;

优选的,步骤S243中在粗化图中增加该企业信用关系图的节点的步骤包括将该节点的边从企业信用关系图g拷贝到粗化图g'以及将该节点从企业信用关系图g拷贝到粗化图g'。Preferably, the step of adding a node of the enterprise credit relationship graph in the roughened graph in step S243 includes copying the edge of the node from the enterprise credit relationship graph g to the roughened graph g' and adding the node from the enterprise credit relationship graph g Copy to coarsened image g'.

本实施例对Odgca算法进行了进一步的限定,采用该Odgca算法算法进行图粗化,对企业信用图中的节点进行遍历和聚集操作,实现企业信用关系图粗化。This embodiment further defines the Odgca algorithm, and the Odgca algorithm is used to coarsen the graph, traverse and aggregate the nodes in the enterprise credit graph, and realize the coarsening of the enterprise credit graph.

本实施例中步骤S1中的业务本体模型包括企业信用评价业务本体模型以及与企业信用评价业务本体模型相关联的多个子业务本体模型,所述企业信用评价业务本体模型由若干个类别及与各类别之间的关系构建。In this embodiment, the business ontology model in step S1 includes an enterprise credit evaluation business ontology model and a plurality of sub-business ontology models associated with the enterprise credit evaluation business ontology model. The enterprise credit evaluation business ontology model consists of several categories and various Relationship building between categories.

所述类别包括行业类别、地区类别、区域类别、企业类别、个人类别和事件类别,其中区域类别包括全国子类、省子类、市子类、区县子类、园区子类、商圈子类和楼宇子类,行业按照国家统计局标准分为20个子类,领域按照政府企业监管职能分为14个子类,企业类别分为大规模企业子类、中小规模企业子类和无业务企业子类。The categories include industry categories, regional categories, regional categories, enterprise categories, personal categories, and event categories, where regional categories include national subcategories, provincial subcategories, city subcategories, district/county subcategories, park subcategories, and business circles Class and building sub-category, the industry is divided into 20 sub-categories according to the standard of the National Bureau of Statistics, the field is divided into 14 sub-categories according to the government enterprise supervision function, the enterprise category is divided into large-scale enterprise sub-category, small and medium-sized enterprise sub-category and non-business enterprise sub-category kind.

企业信用评价业务本体模型与各类别之间的关系包括:企业位于某个地区、企业属于某个领域、个人在企业中担任某种角色以及个人/企业/领域发生了某事件。The relationship between the enterprise credit evaluation business ontology model and various categories includes: the enterprise is located in a certain area, the enterprise belongs to a certain field, the individual plays a certain role in the enterprise, and a certain event occurs in the individual/enterprise/field.

如图4所示,本实施例中企业信用评价业务本体模型优选6个类别,分别为行业、地区、区域、企业、个人和事件;As shown in Figure 4, in this embodiment, the enterprise credit evaluation business ontology model preferably has six categories, namely industry, region, region, enterprise, individual and event;

再如图5-图7所示,每个类别构建的对应的子业务本体模型,包括但不限于企业信用评价区域本体模型、企业信用评价行业本体模型以及企业信用评价领域本体模型。As shown in Figures 5-7, the corresponding sub-business ontology models constructed for each category include but are not limited to the corporate credit evaluation regional ontology model, the corporate credit evaluation industry ontology model, and the corporate credit evaluation domain ontology model.

本实施例中步骤1中企业信用关系图是一个全量数据,包括企业本身数据和企业外部关系图。企业信用关系图是图粗化算法要处理的目标。In this embodiment, the enterprise credit relationship diagram in step 1 is a full amount of data, including the enterprise's own data and the enterprise external relationship diagram. The enterprise credit relationship graph is the target of the graph coarsening algorithm.

本实施例中步骤S2中企业信用关系图中包括有多个节点,在这些节点中搜寻与关注条件无关的节点,并将与关注条件无关的节点生成为虚拟节点。生产虚拟节点后使用Odgca算法进行图粗化,对企业信用图中的节点进行遍历和聚集操作,实现企业信用关系图粗化。In this embodiment, the enterprise credit relationship graph in step S2 includes a plurality of nodes, and the nodes irrelevant to the condition of interest are searched for in these nodes, and the nodes irrelevant to the condition of interest are generated as virtual nodes. After the virtual nodes are produced, the Odgca algorithm is used to coarsen the graph, and the nodes in the enterprise credit graph are traversed and aggregated to realize the coarsening of the enterprise credit graph.

本实施例中步骤S2中在关注条件针对的是至少两个不同的本体时,不同本体的关注条件进行交叉集合,根据交叉集合后的关注条件在企业信用关系图中生成虚拟节点。In step S2 of this embodiment, when the attention conditions target at least two different ontologies, the attention conditions of different ontologies are cross-aggregated, and virtual nodes are generated in the enterprise credit relationship graph according to the cross-aggregated attention conditions.

本发明的图粗化方法,引入了企业信用评价领域的业务本体模型。根据企业信用评价业务模型中的不同本体的关注点,以及不同业务本体关注点的交叉集合,实现业务本体牵引的快速图节点及关系的粗化。由于原始的企业信用关系图中包括所有的公司节点,然而在不同的企业信用评价场景中,用户的关注点是不同的,比如有的用户关注不同行政区域的企业情况,有的用户关注不同行业的企业情况。在特定关注点条件下,原始图中的节点存在大量的冗余,因此本发明根据不同业务场景的关注条件驱动图粗化算法,最终提高企业信用图谱的查询效率。The graph coarsening method of the present invention introduces a business ontology model in the field of enterprise credit evaluation. According to the concerns of different ontologies in the business model of enterprise credit evaluation and the intersection of different business ontology concerns, the rapid graph nodes and relationships that are drawn by business ontology can be coarsened. Since the original enterprise credit relationship graph includes all company nodes, in different enterprise credit evaluation scenarios, users' concerns are different. For example, some users pay attention to the situation of enterprises in different administrative regions, and some users pay attention to different industries. business situation. Under the condition of a specific concern point, the nodes in the original graph have a lot of redundancy, so the present invention drives the graph coarsening algorithm according to the concern condition of different business scenarios, and finally improves the query efficiency of the enterprise credit graph.

本实施例中步骤S3中得到了粗化图后,可基于粗化图进行企业信用评价业务的各种操作,包括各种查询和统计。In this embodiment, after the coarsened map is obtained in step S3, various operations of the enterprise credit evaluation service can be performed based on the coarsened map, including various queries and statistics.

本说明书中描述的主题的实施方式和功能性操作可以在以下中实施:有形实施的计算机软件,计算机硬件,包括本说明书中公开的结构及其结构等同体,或者上述中的一者以上的组合。本说明书中描述的主题的实施方式可以被实施为一个或多个计算机程序,即,一个或多个有形非暂时性程序载体上编码的计算机程序指令的一个或多个模块,用以被数据处理设备执行或者控制数据处理设备的操作。The implementations and functional operations of the subject matter described in this specification can be implemented in tangible embodied computer software, computer hardware, including the structures disclosed in this specification and their structural equivalents, or in combinations of one or more of the foregoing . Embodiments of the subject matter described in this specification can be implemented as one or more computer programs, ie, one or more modules of computer program instructions encoded on one or more tangible non-transitory program carriers, for processing by data The device performs or controls the operation of the data processing device.

术语“数据处理设备”包含所有种类的用于处理数据的设备、装置以及机器,作为实例,包括计算机或者服务器。The term "data processing apparatus" includes all kinds of apparatus, apparatus and machines for processing data, including, by way of example, computers or servers.

计算机程序(还可以被称为或者描述为程序、软件、软件应用、模块、软件模块、脚本或者代码)可以以任意形式的编程语言而被写出,包括编译语言或者解释语言或者声明性语言或过程式语言,并且计算机程序可以以任意形式展开,包括作为独立程序或者作为模块、组件、子程序或者适于在计算环境中使用的其他单元。计算机程序可以但不必须对应于文件系统中的文件。程序可以被存储在保存其他程序或者数据的文件的一部分中,例如,存储在如下中的一个或多个脚本:在标记语言文档中;在专用于相关程序的单个文件中;或者在多个协同文件中,例如,存储一个或多个模块、子程序或者代码部分的文件。计算机程序可以被展开为执行在一个计算机或者多个计算机上,所述计算机位于一处,或者分布至多个场所并且通过通信网络而互相连接。A computer program (which may also be called or described as a program, software, software application, module, software module, script, or code) may be written in any form of programming language, including compiled or interpreted or declarative or A procedural language, and a computer program may be developed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program may, but need not, correspond to a file in a file system. A program may be stored in a portion of a file that holds other programs or data, for example, one or more scripts stored in a markup language document; in a single file dedicated to the associated program; or in multiple collaborative In a file, for example, a file that stores one or more modules, subroutines, or portions of code. A computer program can be developed to be executed on one computer or on multiple computers, which are located at one site, or distributed over multiple sites and interconnected by a communication network.

在本说明书中描述的处理和逻辑流程可以由一个或多个计算机执行,该计算机通过运算输入数据并且生成输出而执行一个或多个的计算机程序,以运行函数。处理和逻辑流程还可以由专用逻辑电路。The processes and logic flows described in this specification can be performed by one or more computers executing one or more computer programs to perform functions by operating on input data and generating output. The processes and logic flows can also be performed by special purpose logic circuits.

适于实行计算机程序的计算机包括个人计算机或者服务器。Computers suitable for the execution of a computer program include personal computers or servers.

适于存储计算机程序指令和数据的计算机可读介质包括所有形式的非易失存储器、介质和存储器装置,作为实例,包括:半导体存储器装置,例如,内置硬盘或者可移动磁盘。Computer readable media suitable for storing computer program instructions and data include all forms of nonvolatile memory, media, and memory devices, including, by way of example, semiconductor memory devices, such as internal hard disks or removable disks.

为了发送与用户的交互,本说明书中描述的主题的实施方式可以被实施在计算机上,该计算机具有:显示装置,例如,CRT(阴极射线管)或者LCD(液晶显示器)监控器,用于向用户显示信息;以及键盘和例如鼠标或者追踪球这样的定位装置,用户利用它们可以将输入发送到计算机。其他种类的装置也可以用于发送与用户的交互;例如,提供给用户的反馈可以是任意形式的传感反馈,例如,视觉反馈等;以及来自用户的输入可以以任意形式接收到,包括键盘输入等。另外,计算机可以通过将文档发送至由用户使用的装置并且接收来自该装置的文档而与用户交互;例如,通过响应于接收到的来自网络浏览器的请求,而将网页发送到用户的客户端装置上的网络浏览器。To transmit interactions with a user, implementations of the subject matter described in this specification can be implemented on a computer having a display device, such as a CRT (Cathode Ray Tube) or LCD (Liquid Crystal Display) monitor, for displaying The user displays information; and a keyboard and pointing device, such as a mouse or trackball, with which the user can send input to the computer. Other kinds of devices may also be used to transmit interactions with the user; for example, feedback provided to the user may be any form of sensory feedback, such as visual feedback, etc.; and input from the user may be received in any form, including keyboards input etc. Additionally, a computer can interact with a user by sending documents to and receiving documents from a device used by the user; for example, by sending web pages to the user's client in response to a request received from a web browser web browser on the device.

本说明书中描述的主题的实施方式可以在计算系统中实施,该计算系统包括例如数据服务器这样的后端组件,或者包括例如应用服务器这样的中间组件,或者包括例如客户端计算机这样的前端组件,该客户端计算机具有图形用户界面或者网络浏览器,用户可以通过图形用户界面或者网络浏览器而与本说明书中描述的主题的实施进行交互,或者该Embodiments of the subject matter described in this specification may be implemented in a computing system including back-end components such as data servers, or intermediate components such as application servers, or front-end components such as client computers, The client computer has a graphical user interface or web browser through which a user can interact with implementations of the subject matter described in this specification, or the

计算机系统包括一个或多个这种后端组件、中间组件或者前端组件的任意组合。系统中的组件可以通过例如通信网络的任意形式或介质的数字数据通信而互相连接。通信网络的实例包括局域网络(“LAN”)和广域网络(“WAN”),例如,因特网。__计算系统可以包括客户端和服务器。客户端和服务器通常彼此远离,并且通常通过通信网络而交互。客户端与服务器之间的关系利用在各自的计算机上运行并且具有彼此之间的客户端-服务器关系的计算机程序而产生。A computer system includes any combination of one or more of such back-end components, intermediate components, or front-end components. The components in the system may be interconnected by any form or medium of digital data communication, such as a communication network. Examples of communication networks include local area networks ("LAN") and wide area networks ("WAN"), eg, the Internet. __ A computing system may include clients and servers. Clients and servers are usually remote from each other and usually interact through a communication network. The relationship between client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.

虽然本说明书包含很多具体的实施细节,但是这些不应当被解释为对任何发明的范围或者对可以要求保护的内容的范围的限制,而是作为可以使特定发明的特定实施方式具体化的特征的说明。在独立的实施方式的语境中的本说明书中描述的特定特征还可以与单个实施方式组合地实施。相反地,在单个实施方式的语境中描述的各种特征还可以独立地在多个实施方式中实施,或者在任何合适的子组合中实施。此外,虽然以上可以将特征描述为组合作用并且甚至最初这样要求,但是来自要求的组合的一个或多个特征在一些情况下可以从该组合去掉,并且要求的组合可以转向子组合或者子组合的变形。While this specification contains many specific implementation details, these should not be construed as limitations on the scope of any invention or on what may be claimed, but as features that may embody particular embodiments of particular inventions illustrate. Certain features that are described in this specification in the context of separate implementations can also be implemented in combination in a single implementation. Conversely, various features that are described in the context of a single implementation can also be implemented in multiple implementations independently or in any suitable subcombination. Furthermore, although features may be described above as acting in combination and even initially claimed as such, one or more features from a claimed combination may in some cases be removed from the combination and the claimed combination may turn into a sub-combination or sub-combination deformed.

相似地,虽然以特定顺序在附图中描述了操作,但是不应当理解为:为了实现期望的结果,要求这样的操作以示出的特定顺序或者以顺序次序而执行,或者所有图示的操作都被执行。在特定情况下,多任务处理和并行处理可以是有利的。此外,上述实施方式中的各种系统模块和组件的分离不应当理解为在所有实施方式中要求这样的分离,并且应当理解程序组件和系统可以通常被一体化在单个软件产品中或者打包至多个软件产品中。Similarly, although operations are depicted in the figures in a particular order, this should not be understood as requiring that such operations be performed in the particular order shown or in a sequential order, or that all illustrated operations are required to achieve desirable results are executed. In certain situations, multitasking and parallel processing may be advantageous. Furthermore, the separation of various system modules and components in the above-described embodiments should not be construed as requiring such separation in all embodiments, and it should be understood that program components and systems may generally be integrated in a single software product or packaged into multiple in software products.

已经描述了主题的特定实施方式。其他实施方式在以下权利要求的范围内。例如,在权利要求中记载的活动可以以不同的顺序执行并且仍旧实现期望的结果。作为一个实例,为了实现期望的结果,附图中描述的处理不必须要求示出的特定顺序或者顺序次序。在特定实现中,多任务处理和并行处理可以是有优势的。Specific implementations of the subject matter have been described. Other implementations are within the scope of the following claims. For example, the activities recited in the claims can be performed in a different order and still achieve desirable results. As one example, the processes depicted in the figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In certain implementations, multitasking and parallel processing may be advantageous.

Claims (10)

1.一种业务本体驱动的企业信用关系图粗化方法,其特征在于,包括如下步骤:1. an enterprise credit relationship graph coarsening method driven by business ontology, is characterized in that, comprises the steps: S1:接收输入的业务本体集合OS:{os1,os2,...,osn}、企业信用关系图g以及关注条件condition,并将业务本体集合OS、企业信用关系图g以及关注条件condition输入业务本体模型中;S1: Receive the input business ontology set OS: {os1, os2, ..., osn}, the enterprise credit relationship graph g and the concern condition condition, and input the business ontology set OS, the enterprise credit relation graph g and the concern condition condition into the business in the ontology model; S2:解析关注条件,并根据解析后关注条件在企业信用关系图中生成虚拟节点,并采用Odgca算法将企业信用关系图粗化,得到粗化后的企业信用关系图g';S2: Analyze the attention conditions, and generate virtual nodes in the enterprise credit relationship graph according to the parsed attention conditions, and use the Odgca algorithm to coarsen the enterprise credit relationship graph to obtain the coarsened enterprise credit relationship graph g'; S3:基于粗化后的企业信用关系图进行业务服务。S3: Provide business services based on the roughened enterprise credit relationship graph. 2.根据权利要求1所述的业务本体驱动的企业信用关系图粗化方法,其特征在于,步骤S2的Odgca算法包括如下步骤:2. the enterprise credit relationship graph coarsening method driven by business ontology according to claim 1, is characterized in that, the Odgca algorithm of step S2 comprises the following steps: S21:根据关注条件遍历业务本体集合OS:{os1,os2,...,osn};S21: Traverse the business ontology set OS:{os1,os2,...,osn} according to the concern condition; S22:遍历企业信用关系图中的节点,并在关注条件为多个业务本体的交集的情况下,生成虚拟的业务本体,同时关注多个业务本体;S22: Traverse the nodes in the enterprise credit relationship graph, and generate a virtual business ontology when the concern condition is the intersection of multiple business ontology, and pay attention to multiple business ontology at the same time; S23:初始化粗化后的企业信用关系图g'及其节点,在初始化阶段,粗化后的企业信用关系图g'仅包括根节点和节点,属性及边均为空;S23: Initialize the roughened enterprise credit relationship graph g' and its nodes. In the initialization stage, the roughened enterprise credit relationship graph g' only includes root nodes and nodes, and both attributes and edges are empty; S24:对企业信用关系图中的节点进行聚集操作,得到粗化后的企业信用关系图g';S24: Perform an aggregation operation on the nodes in the enterprise credit relationship graph to obtain a coarsened enterprise credit relationship graph g'; 优选的,步骤S24具体包括如下部分:Preferably, step S24 specifically includes the following parts: S241:判断企业信用关系图中每个节点的类型;S241: Determine the type of each node in the enterprise credit relationship graph; S242:当节点的类型为公司时,将公司节点的连接的所有属性边收缩至公司节点,实现公司属性节点及边的粗化;S242: When the type of the node is a company, shrink all the attribute edges of the connection of the company node to the company node, so as to realize the coarsening of the company attribute node and edge; S243:判断企业信用关系图的节点与虚拟的业务本体实例节点是否有边连接,如果是,则在粗化图中增加该企业信用关系图的节点;S243: Determine whether the node of the enterprise credit relationship graph is connected with the virtual business ontology instance node, and if so, add the node of the enterprise credit relationship graph in the coarsening graph; 优选的,步骤S243中在粗化图中增加该企业信用关系图的节点的步骤包括将该节点的边从企业信用关系图g拷贝到粗化图g'以及将该节点从企业信用关系图g拷贝到粗化图g'。Preferably, the step of adding a node of the enterprise credit relationship graph in the roughened graph in step S243 includes copying the edge of the node from the enterprise credit relationship graph g to the roughened graph g' and adding the node from the enterprise credit relationship graph g Copy to coarsened image g'. 3.根据权利要求1所述的业务本体驱动的企业信用关系图粗化方法,其特征在于,步骤S1中的业务本体模型包括企业信用评价业务本体模型以及与企业信用评价业务本体模型相关联的多个子业务本体模型,所述企业信用评价业务本体模型由若干个类别及与各类别之间的关系构建。3. The business ontology-driven enterprise credit relationship graph coarsening method according to claim 1, wherein the business ontology model in step S1 comprises an enterprise credit evaluation business ontology model and a business ontology model associated with the enterprise credit evaluation business ontology model. A plurality of sub-business ontology models, the enterprise credit evaluation business ontology model is constructed by several categories and the relationship with each category. 4.根据权利要求3所述的业务本体驱动的企业信用关系图粗化方法,其特征在于,所述类别包括行业类别、地区类别、区域类别、企业类别、个人类别和事件类别,其中区域类别包括全国子类、省子类、市子类、区县子类、园区子类、商圈子类和楼宇子类,行业按照国家统计局标准分为20个子类,领域按照政府企业监管职能分为14个子类,企业类别分为大规模企业子类、中小规模企业子类和无业务企业子类。4. The business ontology-driven enterprise credit relationship graph coarsening method according to claim 3, wherein the categories include industry categories, regional categories, regional categories, enterprise categories, personal categories and event categories, wherein the regional categories Including national sub-category, provincial sub-category, city sub-category, district/county sub-category, park sub-category, business circle sub-category and building sub-category, the industry is divided into 20 sub-categories according to the standards of the National Bureau of Statistics, and the fields are divided according to the government and enterprise supervision functions There are 14 sub-categories, and the enterprise category is divided into large-scale enterprise sub-category, small and medium-sized enterprise sub-category and non-business enterprise sub-category. 5.根据权利要求3所述的业务本体驱动的企业信用关系图粗化方法,其特征在于,企业信用评价业务本体模型与各类别之间的关系包括:企业位于某个地区、企业属于某个领域、个人在企业中担任某种角色以及个人/企业/领域发生了某事件。5. The business ontology-driven enterprise credit relationship graph coarsening method according to claim 3, wherein the relationship between the enterprise credit evaluation business ontology model and each category comprises: the enterprise is located in a certain area, the enterprise belongs to a certain The domain, the individual's role in the business, and the occurrence of an event by the individual/business/domain. 6.根据权利要求1所述的业务本体驱动的企业信用关系图粗化方法,其特征在于,步骤1中企业信用关系图是一个全量数据,包括企业本身数据和企业外部关系图。6 . The business ontology-driven enterprise credit relationship graph coarsening method according to claim 1 , wherein the enterprise credit relationship graph in step 1 is a full amount of data, including enterprise data and an enterprise external relationship graph. 7 . 7.根据权利要求1所述的业务本体驱动的企业信用关系图粗化方法,其特征在于,步骤S2中企业信用关系图中包括有多个节点,在这些节点中搜寻与关注条件无关的节点,并将与关注条件无关的节点生成为虚拟节点。7. The enterprise credit relationship graph coarsening method driven by business ontology according to claim 1, is characterized in that, in step S2, the enterprise credit relationship graph includes a plurality of nodes, and searches for nodes irrelevant to the conditions of interest in these nodes , and generate nodes irrelevant to the attention condition as virtual nodes. 8.根据权利要求6所述的业务本体驱动的企业信用关系图粗化方法,其特征在于,步骤S2中在关注条件针对的是至少两个不同的本体时,不同本体的关注条件进行交叉集合,根据交叉集合后的关注条件在企业信用关系图中生成虚拟节点。8. The business ontology-driven enterprise credit relationship graph coarsening method according to claim 6, characterized in that, in step S2, when the attention conditions are aimed at at least two different ontologies, the attention conditions of different ontologies are cross-collected , and generate virtual nodes in the enterprise credit relationship graph according to the attention conditions after the cross collection. 9.根据权利要求1所述的业务本体驱动的企业信用关系图粗化方法,其特征在于,步骤S3中得到了粗化图后,可基于粗化图进行企业信用评价业务的各种操作,包括各种查询和统计。9. The enterprise credit relationship graph upscaling method driven by business ontology according to claim 1, is characterized in that, after the upscaling graph is obtained in step S3, various operations of enterprise credit evaluation business can be performed based on the upscaling graph, Including various queries and statistics. 10.一种业务本体驱动的企业信用关系图粗化系统,所述系统包括至少一个处理器;10. A business ontology-driven enterprise credit relationship graph coarsening system, the system comprising at least one processor; 以及存储器,其存储有指令,当通过至少一个处理器来执行该指令时,实施按照权利要求1-9任一项所述的方法。and a memory storing instructions which, when executed by at least one processor, implement the method according to any of claims 1-9.
CN202010604556.7A 2020-06-29 2020-06-29 Business ontology driven enterprise credit relationship graph coarsening method Active CN111754199B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010604556.7A CN111754199B (en) 2020-06-29 2020-06-29 Business ontology driven enterprise credit relationship graph coarsening method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010604556.7A CN111754199B (en) 2020-06-29 2020-06-29 Business ontology driven enterprise credit relationship graph coarsening method

Publications (2)

Publication Number Publication Date
CN111754199A true CN111754199A (en) 2020-10-09
CN111754199B CN111754199B (en) 2022-02-22

Family

ID=72677955

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010604556.7A Active CN111754199B (en) 2020-06-29 2020-06-29 Business ontology driven enterprise credit relationship graph coarsening method

Country Status (1)

Country Link
CN (1) CN111754199B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101324937A (en) * 2007-06-15 2008-12-17 国际商业机器公司 System and method for roughening picture
CN103942308A (en) * 2014-04-18 2014-07-23 中国科学院信息工程研究所 Method and device for detecting large-scale social network communities
US20140351289A1 (en) * 2013-05-22 2014-11-27 International Business Machines Corporation Simplification of large networks and graphs
US9355166B2 (en) * 2013-01-31 2016-05-31 Hewlett Packard Enterprise Development Lp Clustering signifiers in a semantics graph
CN105786943A (en) * 2015-12-08 2016-07-20 北京师范大学 Method and device for generating digital object
CN110489599A (en) * 2019-07-08 2019-11-22 深圳壹账通智能科技有限公司 Business connection map construction method, apparatus, computer equipment and storage medium
CN111178507A (en) * 2019-12-26 2020-05-19 集奥聚合(北京)人工智能科技有限公司 Atlas convolution neural network data processing method and device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101324937A (en) * 2007-06-15 2008-12-17 国际商业机器公司 System and method for roughening picture
US9355166B2 (en) * 2013-01-31 2016-05-31 Hewlett Packard Enterprise Development Lp Clustering signifiers in a semantics graph
US20140351289A1 (en) * 2013-05-22 2014-11-27 International Business Machines Corporation Simplification of large networks and graphs
CN103942308A (en) * 2014-04-18 2014-07-23 中国科学院信息工程研究所 Method and device for detecting large-scale social network communities
CN105786943A (en) * 2015-12-08 2016-07-20 北京师范大学 Method and device for generating digital object
CN110489599A (en) * 2019-07-08 2019-11-22 深圳壹账通智能科技有限公司 Business connection map construction method, apparatus, computer equipment and storage medium
CN111178507A (en) * 2019-12-26 2020-05-19 集奥聚合(北京)人工智能科技有限公司 Atlas convolution neural network data processing method and device

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
KARYPIS, G等: "A Fast And High Quality Multilevel Scheme For Partitioning Irregular Graphs", 《SIAM JOURNAL ON SCIENTIFIC COMPUTING》 *
张少华: "OWL 本体知识库面向对象表示及应用", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *
贾建伟,陈崚: "基于影响力的大规模社会网络快速粗化方法", 《计算机应用研究》 *

Also Published As

Publication number Publication date
CN111754199B (en) 2022-02-22

Similar Documents

Publication Publication Date Title
US20220156615A1 (en) System, apparatus, and method to identify intelligence using a data processing platform
US10313466B2 (en) System, apparatus, and method to identify intelligence using a data processing platform
CN110766289A (en) A method, device and electronic device for dynamic adjustment of risk control rules
CN104346371A (en) Business integration system report driven by in-memory database
CN113989018A (en) Risk management method, risk management device, electronic equipment and medium
US11741379B2 (en) Automated resolution of over and under-specification in a knowledge graph
Duan et al. Rough K-modes Clustering Algorithm Based on Entropy.
CN118377936A (en) Event association analysis method and device based on graph rules integrated into computing model
Ting et al. Data sketching for real time analytics: Theory and practice
Kumar et al. Knowledge discovery from data mining techniques
Anderson et al. Architectural Implications of Social Media Analytics in Support of Crisis Informatics Research.
Zhu et al. Modeling Domains as Distributions with Uncertainty for Cross-Domain Recommendation
van der Goes Scaling enterprise recommender systems for decentralization
Bai Variable incremental adaptive learning model based on knowledge graph and its application in online learning system
CN111754199A (en) A business ontology-driven enterprise credit relationship graph coarsening method
Li et al. Alarm reduction and root cause inference based on association mining in communication network
EP4595357A1 (en) Integration of event and aggregated summary reports from the privacy sandbox attribution reporting
CN117668128A (en) Directed relation graph management method, device, equipment and storage medium
US20230177368A1 (en) Integrated ai planners and rl agents through ai planning annotation in rl
Bozdogan et al. Data mining for supporting it management
CN115017275A (en) A Conversation Recommendation Method and System Based on Graph Neural Network and Knowledge Graph
Zhu et al. Research on system of data mining technology based on computer
Zhang et al. An approach for exploring spatial associations in multi-layer networks based on convergent and divergent flow structures
Xu et al. Research on performance optimization and visualization tool of Hadoop
US20220156323A1 (en) Field of technology

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
PP01 Preservation of patent right

Effective date of registration: 20251128

Granted publication date: 20220222