[go: up one dir, main page]

CN103699542A - Construction Method of Standard Ontology Library of Natural Gas and Pipeline Technology - Google Patents

Construction Method of Standard Ontology Library of Natural Gas and Pipeline Technology Download PDF

Info

Publication number
CN103699542A
CN103699542A CN201210366895.1A CN201210366895A CN103699542A CN 103699542 A CN103699542 A CN 103699542A CN 201210366895 A CN201210366895 A CN 201210366895A CN 103699542 A CN103699542 A CN 103699542A
Authority
CN
China
Prior art keywords
concept
ontology
ontology library
standard
pipeline
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210366895.1A
Other languages
Chinese (zh)
Inventor
刘冰
姚学军
李云杰
张欣
税碧垣
刘艳双
郑娟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Petrochina Co Ltd
Original Assignee
Petrochina Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Petrochina Co Ltd filed Critical Petrochina Co Ltd
Priority to CN201210366895.1A priority Critical patent/CN103699542A/en
Publication of CN103699542A publication Critical patent/CN103699542A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Artificial Intelligence (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a natural gas and pipeline technical standard ontology library construction method, and relates to the technical field of digital data processing devices and pipeline systems. The method comprises (1) determining the domain and range of the body; (2) collecting and analyzing domain information; (3) determining a concept; in the determination of concepts, synonyms are supplemented; (4) establishing a body frame; (5) ontology custom integration, including the reference of existing ontologies, and the integration of new ontologies; (6) determining a concept logic relationship; when the concept logic relationship is determined, the existing ontology is combined; (7) establishing a complete body surface; (8) confirming and evaluating; (9) evolution; participating in (3) determination of concepts and (6) determination of concept logical relations after evolution; (10) and finishing the establishment of the ontology. The ontology base established by the invention can realize efficient standard information retrieval from 'basic field information' to 'important technical indexes'.

Description

天然气与管道技术标准本体库构建方法Construction Method of Standard Ontology Library of Natural Gas and Pipeline Technology

技术领域 technical field

本发明是一种天然气与管道技术标准本体库构建方法,涉及数字数据处理装置和管道系统技术领域。The invention relates to a method for constructing a natural gas and pipeline technology standard ontology library, and relates to the technical fields of digital data processing devices and pipeline systems.

背景技术 Background technique

随着信息技术与网络技术的发展,信息共享系统已经在各个行业广泛应用,大大提高了各行业工作劳动效率与便捷性,天然气与管道行业在信息化技术应用方面一直走在行业前列,随着业务、技术的进一步发展,仅仅提供文献级别的检索、浏览等功能已经不能满足用户的需求,必须进行更深一步的挖掘与服务,以满足用户需求。目前常用的标准检索方式为“基本字段信息”检索,一般仅能通过对标准名称、主题词进行检索进而实现全文检索,不能实现对技术指标的精确定位与检索、不同标准中同一技术指标的对比。概括起来,传统检索方式对技术标准的使用效果有以下几方面的局限性。With the development of information technology and network technology, information sharing systems have been widely used in various industries, greatly improving the labor efficiency and convenience of various industries. The natural gas and pipeline industries have been at the forefront of the industry in the application of information technology. With the With the further development of business and technology, it is no longer enough to provide functions such as document-level retrieval and browsing to meet the needs of users, and further excavation and services must be carried out to meet user needs. At present, the commonly used standard retrieval method is "basic field information" retrieval, which generally can only realize full-text retrieval by searching standard names and subject terms, and cannot realize precise positioning and retrieval of technical indicators, and comparison of the same technical indicators in different standards . To sum up, traditional retrieval methods have limitations in the following aspects in terms of the effect of using technical standards.

(1)不能对技术标准内容进行精确检索(1) It is impossible to accurately retrieve the content of technical standards

传统数据库检索方式是通过分类、标题、摘要及叙词等手段对标准文献进行题录数据加工,来实现对技术标准与技术法规的检索。但是技术指标一般会分散在不同技术标准与技术法规中,传统的检索方式只能通过题录数据库检索到相关标准,逐一阅读原文技术指标的内容。但是这样的方法很浪费时间,并且难以保障查全率。The traditional database retrieval method is to process the bibliographic data of standard documents by means of classification, title, abstract and descriptors to realize the retrieval of technical standards and technical regulations. However, technical indicators are generally scattered in different technical standards and technical regulations. The traditional search method can only retrieve relevant standards through bibliographic databases, and read the content of the original technical indicators one by one. But such a method is time-consuming, and it is difficult to guarantee the recall rate.

(2)不能同时检索到不同标准的技术指标,并实现不同标准的同一技术指标的对比。(2) The technical indicators of different standards cannot be retrieved at the same time, and the comparison of the same technical indicators of different standards can be realized.

在检索过程中,经常会出现同一产品的技术指标同时存在国际标准、国家标准、行业标准、地方标准和企业标准等不同的标准中,用户经常需要对不同标准中的相同产品的技术指标进行对比研究,这是传统检索方式所不能满足的。During the retrieval process, it often happens that the technical indicators of the same product exist in different standards such as international standards, national standards, industry standards, local standards and enterprise standards, and users often need to compare the technical indicators of the same product in different standards research, which cannot be satisfied by traditional retrieval methods.

标准内容提取与展示系统是标准信息检索的最新发展方向。这种新型的检索方式通过对标准技术指标的系统提取和有效组织,能够实现从“基本字段信息”到“重要技术指标”的高效的标准信息检索。对于负责油气管道工程建设的工程项目管理人员、实施人员,可以实现利用关键指标控制管道设计和施工建设;对于油气管道运行操作人员、管理人员,可以实现查询、对比操作参数、方法;对于科研人员,可以实现国内外标准关键指标差异分析、判断技术差异,分析体系内各标准间的协调性。Standard content extraction and display system is the latest development direction of standard information retrieval. This new type of retrieval method can realize efficient standard information retrieval from "basic field information" to "important technical indicators" through the systematic extraction and effective organization of standard technical indicators. For project managers and implementers in charge of oil and gas pipeline construction, key indicators can be used to control pipeline design and construction; for oil and gas pipeline operators and managers, query and comparison of operating parameters and methods can be realized; for scientific research personnel , which can realize the difference analysis of key indicators of domestic and foreign standards, judge the technical differences, and analyze the coordination among the standards in the system.

CN102591878A公开了一种技术标准内容提取与展示系统的建立方法,《石油规划设计》2011年第22卷第6期“天然气与管道标准信息管理系统开发方案研究”公开了一种天然气与管道标准信息管理系统的开发方案,但该系统仅能进行标准全文检索,无法实现标准内容的提取与展示,也并未公开建立天然气与管道标准本体库的构建方法,其技术并不完善。CN102591878A discloses a method for establishing a technical standard content extraction and display system, and "Petroleum Planning and Design", Volume 22, No. 6, 2011, "Research on the Development Scheme of Natural Gas and Pipeline Standard Information Management System" discloses a natural gas and pipeline standard information The development plan of the management system, but the system can only search the full text of the standard, and cannot extract and display the content of the standard. It has not disclosed the construction method of the natural gas and pipeline standard ontology library, and its technology is not perfect.

标准内容提取与展示技术是一种新的标准检索技术,目前国外未见以此技术开发的商业数据库。在国内,只有中国标准化研究院将标准内容提取与展示技术初步应用在食品、农产品的国家标准、行业标准中,并建设了相应的提取与展示系统平台,实现对标准内容指标的提取与展示。目前国内外未见到天然气与管道技术标准内容提取与展示系统的建立方法。Standard content extraction and display technology is a new standard retrieval technology, and there is no commercial database developed by this technology in foreign countries. In China, only the China National Institute of Standardization has initially applied standard content extraction and display technology to national standards and industry standards for food and agricultural products, and built a corresponding extraction and display system platform to realize the extraction and display of standard content indicators. At present, there is no method of establishing a content extraction and display system for natural gas and pipeline technical standards at home and abroad.

油气管道标准本体库就是标准化对象,能够涵盖天然气与管道标准中出现的所有有效检索对象,并可通过本体库界定不同本体对象的位置以及他们之间的所属关系,可以对标准内容指标的检索起到支撑作用。The oil and gas pipeline standard ontology library is a standardized object, which can cover all effective retrieval objects in natural gas and pipeline standards, and can define the positions of different ontology objects and their affiliation relationships through the ontology library, which can play an important role in the retrieval of standard content indicators. to support.

要实现天然气与管道标准技术内容提取与展示就必须对技术标准中的标准对象进行归纳并明确各对象间的关系,进而形成统一的检索规则以及能够实现精确定位的有效检索点集合。此外随着天然气与管道技术标准对业务的支撑作用越来越强以及标准是适用范围越来越广,建立统一、规范、完整的标准化对象即本体库的成为用户的迫切需求,并且将在天然气与管道领域信息共享和集成过程中起到重要的作用。然而目前尚没有现成的本体库可用,且经检索也没有提取标准对象从而建立本体库的有效的方法。To realize the extraction and display of the technical content of natural gas and pipeline standards, it is necessary to summarize the standard objects in the technical standards and clarify the relationship between each object, and then form a unified retrieval rule and an effective collection of retrieval points that can achieve precise positioning. In addition, as natural gas and pipeline technical standards become more and more powerful in supporting business and the scope of application of standards becomes wider and wider, it is an urgent need for users to establish a unified, standardized, and complete standardized object, that is, an ontology library, and will be used in natural gas It plays an important role in the process of information sharing and integration with the pipeline field. However, there is no ready-made ontology database available at present, and there is no effective method to extract standard objects to build ontology database after retrieval.

因此对天然气与管道标准进行分解和核心标准对象的提取进而构建本体库是实现标准内容提取与展示的基础。Therefore, decomposing the natural gas and pipeline standards and extracting the core standard objects to build ontology database is the basis for standard content extraction and display.

发明内容 Contents of the invention

本发明的目的是发明一种实现从“基本字段信息”到“重要技术指标”的高效的标准信息检索、简单易懂可行、能有效的分解、提取天然气与管道标准对象、构建统一、完整的天然气与管道技术标准本体库构建方法。The purpose of the present invention is to invent an efficient standard information retrieval from "basic field information" to "important technical indicators", which is easy to understand and feasible, can effectively decompose and extract natural gas and pipeline standard objects, and build a unified and complete Construction method of natural gas and pipeline technology standard ontology library.

本发明是天然气与管道技术标准内容提取与展示系统中本体库的构建方法,天然气与管道技术标准内容提取与展示系统的建立方法如图1所示,主要包括以下步骤:The present invention is a construction method of an ontology library in a natural gas and pipeline technical standard content extraction and display system. The establishment method of the natural gas and pipeline technical standard content extraction and display system is shown in Figure 1, and mainly includes the following steps:

(1)使用光学字符识别(OCR)工具对天然气与管道标准文献全文进行数字化加工,使标准文献数字化;(1) Use optical character recognition (OCR) tools to digitize the full text of natural gas and pipeline standard documents to digitize the standard documents;

(2)建立天然气与管道标准本体库、体例库、题录数据库;(2) Establish natural gas and pipeline standard ontology database, style database, bibliographic database;

(3)建立天然气与管道标准内容数据库;(3) Establish a content database of natural gas and pipeline standards;

(4)开发天然气与管道标准内容提取与展示系统平台,用于标准内容提取、展示与对比。(4) Develop a natural gas and pipeline standard content extraction and display system platform for standard content extraction, display and comparison.

具体建立步骤为:The specific establishment steps are:

(1)按照标准文献数字化规范,对确定的天然气与管道标准进行全文数字化后,同时包括对文献内容识别与质量审校,使标准可编辑,满足提取与展示需求;(1) According to the digitization specification of standard documents, after digitizing the full text of the determined natural gas and pipeline standards, it also includes identification of document content and quality review, so that the standards can be edited to meet the needs of extraction and display;

(2)建立天然气与管道标准本体库、天然气与管道标准题录数据库、天然气与管道标准体例库,这三个数据库的建立为并列过程,互不干扰;(2) Establish the natural gas and pipeline standard ontology database, the natural gas and pipeline standard bibliographic database, and the natural gas and pipeline standard style database. The establishment of these three databases is a parallel process without interfering with each other;

①建立天然气与管道标准本体库:对天然气与管道标准主题概念进行分析归纳,针对主体类别的概念内涵,根据对专业知识的查询结果和标准文献中枚举的标准化对象之间的从属关系进行本体概念分析,明确标准中发生的概念体系及其层次关系,建立天然气与管道标准本体库;①Establish the natural gas and pipeline standard ontology library: analyze and summarize the subject concepts of natural gas and pipeline standards, aiming at the concept connotation of the main category, according to the query results of professional knowledge and the affiliation relationship between the standardized objects enumerated in the standard documents, the ontology is carried out Conceptual analysis, clarifying the concept system and its hierarchical relationship in the standard, and establishing the natural gas and pipeline standard ontology library;

②建立天然气与管道体例库:对天然气与管道标准文献进行分类,归纳标准的结构化信息;按照相同结构的标准文献开展标准文献体例分析,抽象出其中的核心概念及其特征描述术语,建立体例库;②Establishment of natural gas and pipeline style library: classify natural gas and pipeline standard documents, summarize the structured information of standards; carry out standard document style analysis according to standard documents with the same structure, abstract the core concepts and their characteristic description terms, and establish style library;

③天然气与管道标准题录数据库:针对天然气与管道技术标准进行标准技术指标分析归纳、对技术指标体系进行术语学与概念关系研究,进行技术指标概念的规范化及体系构建与标引,进行标准文献技术指标标引,进行体例元素的分类与标示,建立标准技术指标数据库,建立量与单位等辅助数据库;以标准文本中的中文标准名称、英文标准名称、标准号、标准类型、技术领域技术方向、采用关系、代替关系、被代替关系、引用文献、标准状态、立项日期、发布日期、实施日期、确认日期、重要程度分级、归口单位、起草单位、摘要、中文主题词、英文主题词、译文、备注、正文等信息为基础,建立标准题录数据库;③Natural gas and pipeline standard bibliography database: analyze and summarize standard technical indicators for natural gas and pipeline technical standards, conduct terminology and conceptual relationship research on technical indicator systems, standardize technical indicator concepts, system construction and indexing, and standard literature Indexing of technical indicators, classification and labeling of style elements, establishment of a database of standard technical indicators, establishment of auxiliary databases such as quantities and units; use the Chinese standard name, English standard name, standard number, standard type, and technical direction of the technical field in the standard text , Adoption relationship, substitution relationship, superseded relationship, cited documents, standard status, project approval date, release date, implementation date, confirmation date, importance level, focal unit, drafting unit, abstract, Chinese subject terms, English subject terms, translation , Remarks, text and other information as the basis to establish a standard bibliographic database;

(3)建立天然气与管道标准内容数据库(3) Establish a natural gas and pipeline standard content database

原有的标准体系表同步骤(2)所建立本体库、题录数据库、体例库三个数据库,形成天然气与管道标准内容数据库;The original standard system table is the same as the three databases of ontology database, bibliography database and style database established in step (2), forming a natural gas and pipeline standard content database;

(4)开发天然气与管道标准内容提取与展示系统平台,该系统除一般检索系统功能外,如基本检索功能、管理功能、在线反馈功能、帮助功能等功能,而且还应具有标准内容指标检索功能、标准指标加工功能。(4) Develop a natural gas and pipeline standard content extraction and display system platform. In addition to general retrieval system functions, the system should also have standard content index retrieval functions such as basic retrieval functions, management functions, online feedback functions, and help functions. , Standard index processing function.

所述天然气与管道标准本体库构建流程如图2所示,为:The construction process of the natural gas and pipeline standard ontology library is shown in Figure 2, which is:

(1)确定本体的领域与范围;(1) Determine the domain and scope of the ontology;

(2)领域信息的收集和分析;(2) Collection and analysis of field information;

(3)概念的确定;概念的确定中,要补充同义词;(3) Determination of concepts; in the determination of concepts, synonyms should be added;

(4)建立本体框架;(4) Establish an ontology framework;

(5)本体自定义集成,包括现有本体的引用,以及新本体的集成;(5) Ontology custom integration, including references to existing ontologies, and integration of new ontologies;

(6)确定概念逻辑关系;确定概念逻辑关系时,要结合现有本体;(6) Determine the conceptual logical relationship; when determining the conceptual logical relationship, it is necessary to combine the existing ontology;

(7)建立完整的本体表;(7) Establish a complete ontology table;

(8)确认与评价;(8) Confirmation and evaluation;

(9)进化;进化后参与(3)概念的确定和(6)确定概念逻辑关系;(9) Evolution; Participate in the determination of (3) concepts and (6) determine the logical relationship of concepts after evolution;

(10)完成本体建立。(10) Complete ontology establishment.

所述天然气与管道标准本体库构建流程具体为:The construction process of the natural gas and pipeline standard ontology library is as follows:

(1)确定本体库的领域与范围:要明确构建的本体库将覆盖的专业领域、本体的目的、作用以及应用对象;(1) Determining the field and scope of the ontology library: it is necessary to clarify the professional fields to be covered by the ontology library to be built, the purpose, function and application objects of the ontology;

(2)领域信息的收集和分析:通过收集石油天然气管道领域信息充分了解该领域知识;信息来源包括专家、书籍、标准、网络以及其它的本体;(2) Collection and analysis of field information: fully understand the field knowledge by collecting information in the field of oil and gas pipelines; information sources include experts, books, standards, networks and other ontologies;

(3)概念的确定:在充分了解天然气与管道领域知识之后,确定该领域中概念和概念之间的关系,用精确的术语表达出来,经领域专家的确认,作为领域本体的核心概念集。基本应该满足的要求有:(3) Determination of concepts: After fully understanding the domain knowledge of natural gas and pipelines, determine the concepts and the relationship between concepts in this domain, express them in precise terms, and confirm them with domain experts, and use them as the core concept set of domain ontology. The basic requirements that should be met are:

①确定的概念及关系一定是领域相关的;领域的边界往往是模糊的,需根据实际需求确定边界包含的概念;① The defined concepts and relationships must be domain-related; the boundaries of domains are often fuzzy, and the concepts included in the boundaries need to be determined according to actual needs;

②采用的术语要精确,含义应具有唯一性;② The terms used should be precise and have unique meanings;

③对每个术语有相应的自然语言描述和同义词补充;③ There are corresponding natural language descriptions and synonyms for each term;

(4)建立本体库框架;对于步骤(3)中整理的领域中大量的概念,要按照一定的逻辑规则把它们进行分组,形成不同的小专业领域,在同一小工作领域的概念,其相关性应该比较强;另外,对其中的每一个概念的重要性要进行评估,选出关键性术语,摒弃那些不必要或者超出领域范围的概念,尽可能准确而精简的表达出领域的知识;(4) Establish an ontology library framework; for a large number of concepts in the field organized in step (3), they should be grouped according to certain logical rules to form different small professional fields. In addition, the importance of each concept should be evaluated, key terms should be selected, unnecessary or beyond the scope of the field should be discarded, and the knowledge of the field should be expressed as accurately and concisely as possible;

(5)本体库自定义集成;在创建本体库可以自定义,也可以是领域中现存的本体库的重用;重用本体库时,需要注意查看元本体库,选择和自己概念模型中的语义和实现一致的术语定义;其中涉及的关键技术是本体的映射;针对每个集成的本体库,应确定其元本体库、术语集、形式化的本体库描述、以及集成在自己本体库中的位置等属性;(5) Custom integration of ontology library; when creating an ontology library, it can be customized or reuse an existing ontology library in the field; when reusing an ontology library, you need to pay attention to viewing the meta-ontology library, selecting and semantics in your own conceptual model To achieve a consistent definition of terms; the key technology involved is ontology mapping; for each integrated ontology library, its meta-ontology library, term set, formal ontology library description, and the location integrated in its own ontology library should be determined and other attributes;

(6)确定概念逻辑关系;主要以专业知识的与科学分类为基础,根据分类学中的主题法和分类法,确定概念的逻辑关系;(6) Determine the logical relationship of concepts; mainly based on professional knowledge and scientific classification, according to the subject method and taxonomy in taxonomy, determine the logical relationship of concepts;

(7)建立完整的本体库;将天然气与管道标准本体库与标准文献有效检索点结合,从而形成一个领域知识的框架体系,得到领域本体库的框架结构;(7) Establish a complete ontology library; combine the natural gas and pipeline standard ontology library with the effective retrieval points of standard documents to form a framework system of domain knowledge and obtain the framework structure of the domain ontology library;

建立天然气与管道本体库时,本体划分应遵循以下基本规则:a)各子项的外延之和应等于母项的外延;b)划分的各子项,其外延宜相互排斥;c)每次划分应按同一原则进行;d)划分应按层次逐级、由高到低、由简到繁进行,宜结合天然气与管道主营业务粗细结合;e)应持续更新补充;When establishing the natural gas and pipeline ontology database, the ontology division should follow the following basic rules: a) The sum of the extensions of each subitem should be equal to the extension of the parent item; b) The extensions of the divided subitems should be mutually exclusive; c) Each time The division should be carried out according to the same principle; d) The division should be carried out step by step, from high to low, from simple to complex, and should be combined with the main business of natural gas and pipeline; e) It should be continuously updated and supplemented;

类目的划分与设置应突出主营业务,将内容相关性较大的类目,应尽量临近设置;对于一些无专属的类,且具有普遍指导意义的综合性基础标准可根据内容分别单独设置类;上一层次类目的技术要求下层类目都要满足;The division and setting of categories should highlight the main business, and the categories with relatively high content should be set as close as possible; for some non-exclusive categories, comprehensive basic standards with general guiding significance can be set separately according to the content category; the technical requirements of the category at the upper level must be met at the category at the lower level;

(8)确认与评价:本体库应具有正确性、一致性、可扩展性和有效性;(8) Confirmation and evaluation: Ontology database should have correctness, consistency, scalability and validity;

(9)进化:在使用过程中需要对本体库不断更新,本体库进化的方式可以是集成新的本体库或定义新的概念和关系;(9) Evolution: The ontology library needs to be continuously updated during use, and the evolution of the ontology library can be by integrating new ontology libraries or defining new concepts and relationships;

(10)完成本体库建立。(10) Complete the establishment of ontology database.

本发明的有益效果:Beneficial effects of the present invention:

本发明为天然气与管道标准本体库的构建方法取得了以下有益效果:The invention achieves the following beneficial effects for the construction method of the natural gas and pipeline standard ontology library:

(1)本发明简单易懂可行,可以有效的分解、提取天然气与管道标准对象,构建统一、完整的本体库;(1) The invention is simple, easy to understand and feasible, and can effectively decompose and extract natural gas and pipeline standard objects, and build a unified and complete ontology library;

(2)本发明构建的本体库应用于天然气与管道标准内容提取与展示系统可以作为有效检索点的集合实现技术标准内容的精确定位和检索,实现从“基本字段信息”到“重要技术指标”的高效的标准信息检索;(2) The ontology library constructed by the present invention is applied to the natural gas and pipeline standard content extraction and display system, which can be used as a collection of effective retrieval points to realize accurate positioning and retrieval of technical standard content, and realize the transformation from "basic field information" to "important technical indicators" efficient standard information retrieval;

(3)本发明提取的本体精确、唯一、科学,可作为术语数据库一部分,对天然气与管道领域的信息共享与交流有重要作用。(3) The ontology extracted by the present invention is accurate, unique and scientific, and can be used as a part of the terminology database, which plays an important role in information sharing and communication in the fields of natural gas and pipelines.

附图说明 Description of drawings

图1天然气与管道标准内容提取与展示系统建立流程图Figure 1 Flowchart of establishing the content extraction and display system for natural gas and pipeline standards

图2本体库构建流程图Figure 2 Ontology library construction flow chart

具体实施方式Detailed ways

实施例.本例是一实验方法,其流程如图2所示。Embodiment. This example is an experimental method, and its flow chart is shown in Figure 2.

本例主要包括以下步骤:This example mainly includes the following steps:

(1)确定本体的领域与范围;(1) Determine the domain and scope of the ontology;

(2)领域信息的收集和分析;(2) Collection and analysis of domain information;

(3)概念的确定;概念的确定中,要补充同义词;(3) Determination of concepts; in the determination of concepts, synonyms should be added;

(4)建立本体框架;(4) Establish an ontology framework;

(5)本体自定义集成,包括现有本体的引用,以及新本体的集成;(5) Ontology custom integration, including references to existing ontologies, and integration of new ontologies;

(6)确定概念逻辑关系;确定概念逻辑关系时,要结合现有本体;(6) Determine the conceptual logical relationship; when determining the conceptual logical relationship, it is necessary to combine the existing ontology;

(7)建立完整的本体表;(7) Establish a complete ontology table;

(8)确认与评价;(8) Confirmation and evaluation;

(9)进化;进化后参与(3)概念的确定和(6)确定概念逻辑关系;(9) Evolution; Participate in the determination of (3) concepts and (6) determine the logical relationship of concepts after evolution;

(10)完成本体建立。(10) Complete ontology establishment.

本例的体系表如下表:The system table of this example is as follows:

表1本体表样例Table 1 Sample Ontology Table

Figure BDA00002202375200061
Figure BDA00002202375200061

Figure BDA00002202375200071
Figure BDA00002202375200071

本例经试用,建立的天然气与管道本体提取技术经试用可以有效的提取天然气与管道领域的本体,通过系统的提取工作,形成天然气与管道技术领域知识本体体系,应用于天然气与管道标准内容提取与展示系统,能够实现对标准内容中技术指标的精确定位与检索;在检索结果中直接显示所需要的标准检索内容或技术指标,而不需要用户对文献通篇阅读,查找需要信息,从而提高了检索效率;能够实现技术指标相关的标准体检索;在检索标准时,可以通过上位登录,在检索到特定标准技术指标时,也可以检索到其他相关标准。In this example, the natural gas and pipeline ontology extraction technology established can effectively extract the ontology of natural gas and pipeline field after trial. Through the systematic extraction work, a knowledge ontology system in the field of natural gas and pipeline technology is formed, which is applied to the content extraction of natural gas and pipeline standards The system and display system can realize precise positioning and retrieval of technical indicators in standard content; directly display the required standard retrieval content or technical indicators in the search results, without requiring users to read the entire document to find the required information, thereby improving It improves the retrieval efficiency; it can realize the retrieval of standards related to technical indicators; when searching for standards, you can log in through the host, and when you retrieve specific standard technical indicators, you can also retrieve other related standards.

Claims (2)

1. rock gas and a pipe technology standard body base construction method, is characterized in that flow process is:
(1) determine field and the scope of body;
(2) Collection and analysis of realm information;
(3) concept determines; In the determining of concept, supplement synonym;
(4) set up body frame;
(5) body is self-defined integrated, comprises quoting of existing body, and new body is integrated;
(6) determine concept logic relation; Determine when concept logic is related to, be in conjunction with existing body;
(7) set up this complete body surface;
(8) confirm and evaluate;
(9) evolve; After evolving, participate in determining with (6) of (3) concept and determine concept logic relation;
(10) completing body sets up.
2. rock gas according to claim 1 and pipe technology standard body base construction method, is characterized in that building flow process and be specially:
(1) determine field and the scope of ontology library: the ontology library that clearly build is by the professional domain covering, object, effect and the application of body;
(2) Collection and analysis of realm information: fully understand this domain knowledge by collecting oil and gas pipeline realm information; Information source comprises expert, books, standard, network and other body;
(3) determining of concept: after fully understanding rock gas and pipeline domain knowledge, determine the relation between concept and concept in this field, express with accurate term, through domain expert's confirmation, as the Core Set of Concepts of domain body.The requirement that substantially should meet has:
1. definite concept and relation must be domain-specifics; The border in field is fuzzy often, need determine according to the actual requirements the concept that border comprises;
2. the term adopting is wanted accurately, and implication should have uniqueness;
3. to each term, there are corresponding natural language description and synonym to supplement;
(4) set up ontology library framework; For a large amount of concept in the field arranging in step (3), they are divided into groups according to certain logic rules, form different particular specialty fields, in the concept of same little career field, its correlativity should be more intense; In addition, to the importance of each concept wherein, to assess, select key term, abandon that those are unnecessary or exceed the concept of territory, the knowledge in the field that gives expression to of as far as possible accurately simplifying;
(5) ontology library is self-defined integrated; Create ontology library can be self-defined, can be also reusing of ontology library existing in field; While reusing ontology library, should be noted that and check meta-ontology storehouse, select with semanteme in own conceptual model with realize consistent term definition; The gordian technique wherein relating to is the mapping of body; For each integrated ontology library, should determine its meta-ontology storehouse, terminology, the description of formal ontology library and be integrated in the attributes such as position in own ontology library;
(6) determine concept logic relation; Main take professional knowledge with scientific classification as basis, according to the subject indexing method in taxonomy and classification, determine the logical relation of concept;
(7) set up complete ontology library; Rock gas and piping standards ontology library and normative document are effectively retrieved to a combination, thereby form the frame system of a domain knowledge, obtain the framed structure of field ontology library;
While setting up rock gas and pipeline body storehouse, body is divided should follow following primitive rule: a) the extension sum of each subitem should equal the extension of female; B) each subitem of dividing, its extension should be repelled mutually; C) each division should be undertaken by identity principle; D) divide should by level step by step, from high to low, go from the simple to the complex and carry out, should be combined with pipeline main business thickness in conjunction with rock gas; E) answer continuous updating to supplement;
The division of classification should be given prominence to main business with setting, by the larger classification of content relevance, should close on setting as far as possible; For some, without exclusive class, and the comprehensive basic standard with general directive significance can arrange separately respectively class according to content; The technical requirement lower floor classification of last layer time classification all will meet;
(8) confirm and evaluate: ontology library should have correctness, consistance, extensibility and validity;
(9) evolve: in use need ontology library to constantly update, the mode that ontology library is evolved can be integrated new ontology library or define new concept and relation;
(10) completing ontology library sets up.
CN201210366895.1A 2012-09-28 2012-09-28 Construction Method of Standard Ontology Library of Natural Gas and Pipeline Technology Pending CN103699542A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210366895.1A CN103699542A (en) 2012-09-28 2012-09-28 Construction Method of Standard Ontology Library of Natural Gas and Pipeline Technology

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210366895.1A CN103699542A (en) 2012-09-28 2012-09-28 Construction Method of Standard Ontology Library of Natural Gas and Pipeline Technology

Publications (1)

Publication Number Publication Date
CN103699542A true CN103699542A (en) 2014-04-02

Family

ID=50361073

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210366895.1A Pending CN103699542A (en) 2012-09-28 2012-09-28 Construction Method of Standard Ontology Library of Natural Gas and Pipeline Technology

Country Status (1)

Country Link
CN (1) CN103699542A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104850664A (en) * 2015-06-09 2015-08-19 北京理工大学 PDM based mechanical processing domain ontology constructing method
CN111709237A (en) * 2020-06-04 2020-09-25 中国地质大学(北京) A Logical Structure Tree Construction Method Based on Expert Knowledge of Geoscience Branches
CN112560471A (en) * 2019-09-26 2021-03-26 北京国双科技有限公司 Method and system for acquiring related words of professional words
CN113377926A (en) * 2021-06-28 2021-09-10 中国标准化研究院 Construction method of registration meta-model of quality information ontology evolution

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080140684A1 (en) * 2006-06-09 2008-06-12 O'reilly Daniel F Xavier Systems and methods for information categorization
CN102096868A (en) * 2011-02-25 2011-06-15 上海建科建设监理咨询有限公司 Ontology-based building domain knowledge query method

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080140684A1 (en) * 2006-06-09 2008-06-12 O'reilly Daniel F Xavier Systems and methods for information categorization
CN102096868A (en) * 2011-02-25 2011-06-15 上海建科建设监理咨询有限公司 Ontology-based building domain knowledge query method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
彭勃: "领域本体构建方法研究", 《电脑知识与技术》 *
赵明华等: "天然气与管道标准信息管理系统开发方案研究", 《石油规划设计》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104850664A (en) * 2015-06-09 2015-08-19 北京理工大学 PDM based mechanical processing domain ontology constructing method
CN112560471A (en) * 2019-09-26 2021-03-26 北京国双科技有限公司 Method and system for acquiring related words of professional words
CN111709237A (en) * 2020-06-04 2020-09-25 中国地质大学(北京) A Logical Structure Tree Construction Method Based on Expert Knowledge of Geoscience Branches
CN113377926A (en) * 2021-06-28 2021-09-10 中国标准化研究院 Construction method of registration meta-model of quality information ontology evolution
CN113377926B (en) * 2021-06-28 2022-11-25 中国标准化研究院 Construction method of registration meta-model of quality information ontology evolution

Similar Documents

Publication Publication Date Title
CN110874414B (en) Policy interpretation method based on data joint service
US10176261B2 (en) Keyword presenting system and method based on semantic depth structure
Aria et al. openalexR: An R-Tool for Collecting Bibliometric Data from OpenAlex.
CN115470339A (en) Intelligent Matching Algorithm for Technical Diagnosis Experts Based on Science and Technology Big Data Knowledge Graph
CN110297872A (en) A kind of building, querying method and the system of sciemtifec and technical sphere knowledge mapping
CN108446368A (en) A kind of construction method and equipment of Packaging Industry big data knowledge mapping
CN112258061B (en) Intelligent risk analysis early warning system and early warning method for whole process of project
Feng et al. Patent text mining and informetric-based patent technology morphological analysis: an empirical study
Van Hooland et al. Evaluating the success of vocabulary reconciliation for cultural heritage collections
CN103049532A (en) Knowledge base engine construction and query method based on emergency management of emergencies
CN104636424A (en) Method for building literature review framework based on atlas analysis
Tang et al. Software architecture documentation: The road ahead
CN115757810A (en) A method for constructing knowledge graph standard ontology
CN114817573A (en) Knowledge management platform of knowledge graph
Qu et al. Patent research in the field of library and information science: Less useful or difficult to explore?
Silvennoinen et al. A semantic web approach to land use regulations in urban planning: The OntoZoning ontology of zones, land uses and programmes for Singapore
Peterlin et al. Automated content analysis: The review of the big data systemic discourse in tourism and hospitality
CN103699542A (en) Construction Method of Standard Ontology Library of Natural Gas and Pipeline Technology
CN115937881A (en) A method for automatic identification of content in standard tables for knowledge graph construction
CN101937433A (en) Real-time searching method of product
Boella et al. Eunomos, a legal document and knowledge management system to build legal services
Gu Integration and optimization of ancient literature information resources based on big data technology
Wang et al. A survey on services provision and distribution of official and commercial intellectual property platforms
CN108205564B (en) Knowledge system construction method and system
Pal et al. Fetching automatic authority data in ILS from Wikidata via OpenRefine

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20140402

RJ01 Rejection of invention patent application after publication