WO2018150211A1 - Method for handling documents on an ontology base - Google Patents

Info

Publication number
WO2018150211A1
Authority
WO
WIPO (PCT)
Prior art keywords
document
data
documents
sdsys
ontology
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/IB2017/000144
Other languages
English (en)
Inventor
András CSIBA
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual
Priority to PCT/IB2017/000144
Publication of WO2018150211A1
Anticipated expiration
Ceased

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/10 Office automation; Time management
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/93 Document management systems
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10 Services
    • G06Q50/18 Legal services

Definitions

  • the subject of the invention is a method for handling documents on an ontology base, which allows the implementation of safe document handling, as a result of which the unchanged condition of the data content of a given document can be guaranteed.
  • an important feature of IT systems is that the document and the data are separated, and in certain cases the terms are misinterpreted, so that data, i.e. a compilation of electronic data, is regarded as a document. This is inconvenient, because the data tells what is in the document, and the document tells what should be done with the data.
  • a known solution for handling documents containing data or information is the so-called data store or data warehouse.
  • the Espacenet database containing patent data is a good example of such a database, where the various patent documents are identified according to various aspects, e.g. by key words such as applicant, inventor, priority, publication number, time and identifiers of content.
  • the search is made according to documents, input data and key words.
  • the document is usually stored in PDF format; it is not necessary and it is not possible to verify its authenticity.
  • Another widely used method for managing data and documents is the so-called data fishing. Google search is an example of this, and this is how the various search engines produce a database from the data found on the Internet.
  • For instance, Google generates key words for itself, then sorts these key words, and uses them for searching in the various web sites. These key words are regularly updated and refreshed. This is how the various hit lists are provided. Its principle is based on so-called "big data" handling, where an unorganised set of data exists, such as the Internet, and a search engine starts to index the changes in that unorganised set of data. By this action it produces a kind of order from the unorganised set of data. Then a decision has to be made about what is important, for which a higher intelligence is necessary: software or further filtering, e.g. business intelligence.
  • the Hungarian patent application P0501053 makes known an internet based intelligent information collection, processing and display procedure, which is supported by Hungarian linguistic tools, and which is controlled by an ontology that can be tailored to the needs.
  • the invention described here is an intelligent information collection, processing and display procedure, which can be tailored to the needs, and can be used in the areas of Hungarian language with the help of Hungarian linguistic tools, which monitors the contents of the internet pages written in the Hungarian language, and collects the relevant documents for the user, and with the help of its special text mining algorithms it prepares sequences according to relevance, makes classification according to content (clustering), highlights the key words and the relevant texts (extracting), stores them in own database (archiving) and displays them for the user.
  • the invention is built partly on freely available Hungarian linguistic tools, e.g. morphological analyser, and partly on special individual text mining tools. It sorts the collected documents for the user according to relevance. It is usable for storing the knowledge of companies or individuals, as well as for using in searching and in text mining.
  • the invention stores the knowledge in a simple ontology, and links the terms to a synonym storage. It builds up the queries with the help of the ontology, using the terms and synonyms found in the ontology, and it forwards them to the known large search engines. It collects the results, processes them with text mining algorithms, stores them in database and makes them available for the user.
  • the invention described in this application is useful for companies as well as for individuals. Many companies make efforts to make sure that the articles published about them or about their products are positive in attitude, and that they are able to react in time if such articles are negative. The kind of information published about the competitors, including their products and actions, is also important for these companies. This tool continuously monitors the Internet and informs the user immediately about the relevant pieces of information, thus releasing the company from the task of browsing the web for published articles.
  • the invention is useful for individuals also, because only the ontology is to be uploaded with the interesting, important terms and items about which the user intends to obtain information regularly, and the user will get all relevant pieces of information with the help of intelligent tools.
  • the Chinese patent document CN 102314519 (A) makes known a procedure for searching information, which is based on an analysis of ontological model of the contents in the public domains.
  • the solution described here prepares a search procedure for words and word groups interpreted in a natural language environment, and it primarily serves stock searches.
  • the patent document EP 1300778 makes known a data warehouse, in which multi dimensional data cubes are formed during the described procedure, which allows the assignment of multi dimensional identification system to a given document. As a result of this process a document grouping or clustering is made, which establishes the possibility of searching later on.
  • the patent document AU2013370424 makes known a patent document system and procedure, which is applicable for creating, editing, storage of knowledge located in the specifications of the documents, as well as for retrieval of such knowledge.
  • ontological features are assigned to the documents, and the documents are sorted accordingly.
  • the ontological properties are extracted from the document.
  • the patent document TW591519 makes known a system and procedure built on automatic ontology base.
  • the ontological properties are extracted from the document, and a mathematical model, the linear dependency model is used for the sorting, and clusters are created in this way.
  • the distances among the various properties are determined in the multi dimensional space.
  • a multi dimensional space is created, which is rotated according to the given search aspects, and views and cross sections are produced from this spatial model according to the given search aspects.
  • Periodic data polling takes place, for instance, in "big data" handling, but it is not known what happens in the interim periods. It could happen that the data content changes, and the document is forwarded with changed or erroneous data content. Such a document cannot be considered authentic from the point of view of either its content or its legal standing. In the case of an electronic document, the change of content cannot be tracked without authentication.
  • the aim and requirement was to realize the safe document handling, as a result of which the unchanged status of the data content of a given document can be guaranteed.
  • the recognition and milestone constituting the basis of the procedure according to the invention was the result of an organic problem finding effort done during the creation of the solution according to the invention.
  • process control principles e.g. sampling, measurement, feedback
  • the method is convenient in many respects, but there are lots of higher level data movements, where the compliance with these principles is problematic. Examples that can be mentioned include the economy in general, company management, particularly the subsystems of accounting and logistics.
  • the state administration and all the subsystems of the legal technique systems belong here. The problem was recognised by the professional circles in the subsystems of logistics and then in the subsystems of accounting.
  • Legislation was adopted for the accounting subsystems, introducing the accounting principle that bookkeeping, i.e. the recording of data, may be made only from original documents.
  • the logistics subsystem e.g. ordering, order acknowledgement, delivery, stocking, as part of the accounting system, inherits this legal obligation, but the IT systems could not ensure that.
  • the EDI system emerged for solving this problem, which created a safe data channel at least at this area, providing not a full solution, but a situation close to the solution.
  • the EDI solution is appropriate in a narrow environment, it is able to establish document type data connection between two IT systems, which is still used today.
  • the invention is a method for handling documents on an ontology base, which allows the implementation of safe document handling, as a result of which the unchanged condition of the data content of a given document can be guaranteed. It is characterised in that the SDSys Gateway server belonging to the SDSys system receives all the documents coming from the document issuers, which constitute the collector in terms of the utility, then the server forwards the documents to the appropriate subcontractor according to the ordered services, and if all invited subcontractors meet their obligations, the SDSys Gateway server forwards the documents to the system of the addressee. During the process the subcontractor service providers perform the tasks that require special business logic, which can be the conversion of a document, the creation of new documents, the interpretation of a document, the initiation of control events, archiving, etc. New subcontractor logic can be implemented in the procedure only as required by the subcontractors, and if anybody finds a data and document handling task that can be resolved and interpreted (in business terms), which can be accomplished on the basis of the given
  • an ontological server is used for the application of the method, which ontological server is an outstanding subsystem, that provides basic service for interpreting any document, and in the given case it is connected to a service, to an optical character recognition system, which optical character recognition system is capable of determining the relevant data in the document according to its business logic, which can be tax code, name, address, etc.
  • the document types of the ontological procedure can be extended, but accepting the ontological description of a new document is the responsibility of the operator of the document ontological database, and a document can be included in the ontological database only if it is the descendant of a document type recorded earlier in the document ontology and does not contradict its inherent logic.
  • a new language version of a document type described earlier in the ontology can be placed in the ontology database without creating a new document type.
  • the invoice ontology determines the place of tax code, name, address, etc. in the invoice type document, as well as the mode of interpreting the same data, e.g. identification, name, address, etc. of buyer and supplier in case of invoice.
  • the document type is contract
  • the document type determines the mode of interpreting the relevant data recognised in the document, which shall be or may be placed as document meta data into the xml data belonging to the SDSys system.
  • the document evaluating systems can determine the mode of automatically creating xml data containing meta data based on the ontology data according to the document type, which can be read from the ontology data base interpreted by the document processing system in an automatically processable mode according to the method of the invention.
  • meta data structure processed according to a document ontology can be interpreted in any language different from the original language, related to which language supplementary information can be found in the ontology database created according to the invention.
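The rules above for extending the ontology and for deriving XML metadata from it can be illustrated with a minimal Python sketch. Everything named here (`DocumentType`, `Ontology`, `to_metadata_xml`, the field names, the role names and the XML layout) is a hypothetical assumption for illustration only; the source does not specify a schema, a data model or an API.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class DocumentType:
    """One entry of a hypothetical document ontology."""
    name: str
    parent: Optional["DocumentType"] = None
    # field name -> how that field is to be interpreted (assumed vocabulary)
    roles: Dict[str, str] = field(default_factory=dict)
    # language code -> {field name -> localized label}
    labels: Dict[str, Dict[str, str]] = field(default_factory=dict)

    def resolved_roles(self) -> Dict[str, str]:
        """Roles inherited from all ancestors, followed by this type's own roles."""
        inherited = self.parent.resolved_roles() if self.parent else {}
        return {**inherited, **self.roles}


class Ontology:
    def __init__(self, root: DocumentType) -> None:
        self.types = {root.name: root}

    def register(self, doc_type: DocumentType) -> None:
        """Accept a type only if it descends from a recorded type and
        does not redefine a role already fixed by an ancestor."""
        parent = doc_type.parent
        if parent is None or self.types.get(parent.name) is not parent:
            raise ValueError(f"{doc_type.name}: parent type is not in the ontology")
        inherited = parent.resolved_roles()
        for name, role in doc_type.roles.items():
            if name in inherited and inherited[name] != role:
                raise ValueError(f"{doc_type.name}: field {name!r} contradicts an ancestor")
        self.types[doc_type.name] = doc_type


def to_metadata_xml(doc_type: DocumentType, values: Dict[str, str], lang: str = "en") -> str:
    """Build an XML metadata record for the recognised values of one document."""
    root = ET.Element("document", {"type": doc_type.name, "lang": lang})
    labels = doc_type.labels.get(lang, {})
    for name, role in doc_type.resolved_roles().items():
        if name in values:
            attrs = {"name": name, "role": role, "label": labels.get(name, name)}
            ET.SubElement(root, "field", attrs).text = values[name]
    return ET.tostring(root, encoding="unicode")


base = DocumentType("document", roles={"issue_date": "date"})
ontology = Ontology(base)
invoice = DocumentType(
    "invoice",
    parent=base,
    roles={"supplier_tax_code": "tax_code", "buyer_name": "name"},
)
ontology.register(invoice)

# A new language version is added to the existing type; no new type is created.
invoice.labels["de"] = {"buyer_name": "Kundenname"}

xml_text = to_metadata_xml(
    invoice, {"buyer_name": "Example Ltd.", "issue_date": "2017-02-15"}, lang="de"
)
```

In this sketch a type without a recorded parent, or one that changes the meaning of an inherited field, is rejected with a `ValueError`, which is one possible reading of "does not contradict its inherent logic".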
  • the mailing service is the emitter side of the utility for the use of the method, which is capable of finding the recipient software based on the individual identifier of the addressee, forwards the SDSys document there, and opens a gate to the mailbox maintained for the recipient, as a result of which two document releasing technologies are available, and the recipient IT system can choose freely between these two technologies.
  • the document recipient systems are used for the application of the method, among which the SDSys emitter has an address, which is a recipient IT system, and has a sub-address, which assigns further routes within the own system of the recipient, which in the given case is visible at the recipient having the "Bookkeeper" identifier, while the "Buyer" receives an electronic invoice, and the "Web shop operator" receives orders.
  • the emitters and recipients are selected examples only; in reality they can be numerous and of many different kinds. Furthermore, the "right" and "left" sides are interchangeable, and in this way the automatic management system is established.
  • a system is designed and built up which routes the documents through the SDSys system, meaning that the document logistics changes entirely, because the paper based equivalent of the document is processed only once, within the SDSys system. As a result, the document can subsequently flow as an automatically processable electronic document, while it retains a property that existed in the case of the paper based document, namely that it can be displayed. In other words, not only the data flows during the method; it is also possible to attach the authenticated electronic copies of the document in an inseparable manner.
  • the IT subsystems perform the tasks that can be made automatically on the basis of the data, while in the case of tasks requiring human decision, the systems are capable of showing the authenticated electronic copy of the original document on the screen at the appropriate subsystem, and it can be seen with the application of the method that the processing subsystems disappear from the company environment, and the document can be transferred to the utility after executing the method once.
  • Fig. 1 shows a possible embodiment of a complete SDSys system implementing the procedure according to the invention.
  • Fig. 2 shows the general architecture of automatic control systems used so far.
  • Fig. 3 shows a system, which flows the documents through the SDSys system with the procedure according to the invention.
  • Fig. 1 shows a possible embodiment of a complete SDSys system which implements the procedure according to the invention.
  • the document issuers are shown at the left of the figure; from an IT point of view they constitute the collector of the utility.
  • the SDSys GS Gateway server receives all documents, then it forwards the documents to the appropriate subcontractor according to the ordered services. If all addressed subcontractors complete their own tasks, then the SDSys GS Gateway server forwards the document to the system of the addressee.
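The gateway behaviour described above (receive, pass the document through every ordered service, forward to the addressee only when all of them have completed) could be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the `Gateway` and `Document` classes, the `convert` and `archive` handlers and the in-memory `outbox` are all hypothetical names introduced here.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Document:
    addressee: str
    content: str
    ordered_services: List[str] = field(default_factory=list)
    completed_services: List[str] = field(default_factory=list)


# A subcontractor takes a document and hands back the processed document.
Subcontractor = Callable[[Document], Document]


class Gateway:
    """Hypothetical stand-in for the gateway server's routing logic."""

    def __init__(self) -> None:
        self.subcontractors: Dict[str, Subcontractor] = {}
        self.outbox: Dict[str, List[Document]] = {}  # addressee -> delivered documents
        self.held: List[Document] = []               # documents that could not be completed

    def connect(self, service: str, handler: Subcontractor) -> None:
        """Attach a subcontractor that takes documents out and puts them back."""
        self.subcontractors[service] = handler

    def receive(self, doc: Document) -> bool:
        """Run all ordered services, then forward; hold the document otherwise."""
        missing = [s for s in doc.ordered_services if s not in self.subcontractors]
        if missing:
            self.held.append(doc)
            return False
        for service in doc.ordered_services:
            doc = self.subcontractors[service](doc)
            doc.completed_services.append(service)
        self.outbox.setdefault(doc.addressee, []).append(doc)
        return True


def convert(doc: Document) -> Document:
    # Placeholder conversion: upper-case the content.
    doc.content = doc.content.upper()
    return doc


def archive(doc: Document) -> Document:
    # Placeholder archiving: a real service would store a copy somewhere.
    return doc


gateway = Gateway()
gateway.connect("convert", convert)
gateway.connect("archive", archive)

delivered = gateway.receive(Document("KONY", "invoice 42", ["convert", "archive"]))
not_delivered = gateway.receive(Document("VEV", "order 7", ["sign"]))
```

A document that orders no services passes straight to the addressee in this sketch, and a document ordering an unconnected service (here `"sign"`) is held rather than forwarded.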
  • FIG. 1 shows the parts of the system which can be a possible implementation of the procedure according to the invention, including the issuer of electronic invoice of EVSZ buyer, AVA administration of "A" company, WMR WEB shop orders, as well as the documents of the total electronic archive of BVTA "B" company. The services ordered from the document issuers, in the given case the creation of ESZLE E-invoice, the SZAD accounting documents, the MAD order data, and the MAJD document orders assigned for archiving, are sent in electronic form to an SDSys GS Gateway server.
  • the SDSys GS Gateway server receives all the documents, and it contains an ADSZR authentication, which performs the encryption of document and the service, and the tasks of system administrations, then it forwards the documents to the ASZ subcontractor service provider according to the ordered services.
  • the SDSys GS Gateway server sends a portion of the SZD1 accounting documents to the OCR service provider, which produces raw data and sends them to the OCR optical characters recognition OSZE ontology server through the HOCR network route with conversion according to the ontology characterising the raw data document produced by the OCR.
  • the OSZE ontology server contains or it can reach the ODA ontology database, which is the business logic that interprets the document and the meta data based on the ontology.
  • the SDSys GS Gateway server sends a portion of the SZD2 accounting documents, as well as the documents handed over for AD archiving directly, and the ESZLA e-invoices through the ESZ electronic invoice preparing service to the ARC archiving subcontractor service provider for the purpose of ESZA e-invoice archiving.
  • the ESZ electronic invoice preparing service sends the data to the EA electronic signing subcontractor service provider for the purpose of EDA e-invoice digital signing.
  • the data are forwarded directly from the SDSys GS Gateway server to the EA electronic signing subcontractor service provider. If all service providers completed the given task, then they can send it to the SDSys GS Gateway server, which forwards the POD documents ready for mailing electronically to the addressees of the PSA mailing services, through the service provider performing SDSys basic service, i.e.
  • the document recipient systems can be found at the right side of the figure.
  • the SDSys emitter has a VEV, KONY, WIT address, which is the identifier of the recipient IT system, as well as an AVP, BVP, CVP sub-address, so that the recipient would be able to determine further routes within its own system. This can be seen at the recipient having KONY accounting identifier.
  • the VEV buyer receives electronic invoice, while the operator of WIT WEB shop receives orders.
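The address and sub-address scheme can be sketched in a few lines of Python. The identifiers VEV, KONY, WIT, AVP, BVP and CVP come from the text; the `SYSTEM/SUB` string format, the internal queue names and the fallback to a default inbox are assumptions made only for this illustration.

```python
from typing import Dict, Optional, Tuple


def parse_address(address: str) -> Tuple[str, Optional[str]]:
    """Split an address of the assumed form 'SYSTEM' or 'SYSTEM/SUB'."""
    system, _, sub = address.partition("/")
    if not system:
        raise ValueError(f"missing recipient system in {address!r}")
    return system, sub or None


class RecipientSystem:
    """A recipient IT system that chooses further routes within itself."""

    def __init__(self, identifier: str, default_route: str = "inbox") -> None:
        self.identifier = identifier
        self.default_route = default_route
        self.sub_routes: Dict[str, str] = {}  # sub-address -> internal destination

    def resolve(self, sub: Optional[str]) -> str:
        # Unknown or absent sub-addresses fall back to the default route.
        if sub is None:
            return self.default_route
        return self.sub_routes.get(sub, self.default_route)


def route(systems: Dict[str, RecipientSystem], address: str) -> Tuple[str, str]:
    """Return (recipient system, internal destination) for an address."""
    system_id, sub = parse_address(address)
    system = systems[system_id]  # raises KeyError for an unknown recipient system
    return system_id, system.resolve(sub)


kony = RecipientSystem("KONY")
# Hypothetical internal destinations for the three sub-addresses.
kony.sub_routes.update({"AVP": "queue-a", "BVP": "queue-b", "CVP": "queue-c"})

systems = {"KONY": kony, "VEV": RecipientSystem("VEV"), "WIT": RecipientSystem("WIT")}

print(route(systems, "KONY/BVP"))
print(route(systems, "VEV"))
```

The emitter only needs the first part of the address to find the recipient system; what a sub-address means is decided entirely inside that system, which is why the lookup table lives on `RecipientSystem`.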
  • the issuers and the recipients are mentioned only as examples, in reality they can be numerous and of many different kinds. Right side and the left side are interchangeable, thus an automatic control system is created.
  • the ASZ subcontractor service providers perform the tasks that require special business logic, which could be conversion of the documents, creation of new documents, interpretation of document, initiating control events, archiving, etc.
  • Inclusion of new subcontractor logic depends on how the subcontractors can realize their ideas. If anybody finds a task that can be resolved and interpreted in business terms, which can be accomplished on the basis of the document, then the user can build a connector to the SDSys system for taking out and putting back documents. This activity can be regarded as the essential idea behind the SDSys utility service.
  • the document issuer is connected to the SDSys system at a single point with its document, then the service deployed there (basis) will convert the document or create document.
  • the OSZE ontology server is an outstanding subsystem, which provides basic service for the interpretation of any document.
  • OSZE ontology server corresponding to the present interpretation, is connected to a service, i.e. to the OCR optical character recognition server.
  • the OSZE is an independent entity in its role, so it could form a part of any other subsystem.
  • the OCR optical character recognition system is capable of determining interpretable relevant data in the document according to its own business logic. This could be tax identification code, name, address, etc. If the document is an invoice, then the invoice ontology will tell the location of tax code, address, etc., in the invoice type document, and will also tell how to interpret the same data, if the document type is a contract.
  • the obligator and the beneficiary are the buyer and the supplier in case of an invoice, while the contracting parties are the obligator and beneficiary in case of a contract.
  • the OSZR is capable not only of subsequent analysis of the document, but it can also produce live templates for creating new documents, or it may generate automatic documents according to the needs of further service providers.
  • the mailing service is an emitter, i.e. the releasing side of KOS utility. It is capable of finding the recipient software based on the individual identifier of the addressee, and of forwarding the SDSys document there, and of opening a gate to the mailbox kept for the recipient.
  • Fig. 2 shows the general architecture of the automatic control systems used previously. For the case of the previously used procedures
  • Fig. 2 shows the company environment VAL and the associated IT subsystem according to Fig. 2 S as well as the financial department PE, sales ERT, accounting SZV, controlling KON, and the procurement subsystems BESZ, and the processing FED as part of the given subsystems.
  • the figure shows the schematic diagram of the company document flow system used so far, which indicates that wherever document flow, i.e. document sending and reception, takes place, the FED processing always appears there as part of the given subsystems.
  • the UT remittance documents are sent to BA bank through the FED processing, or KIV extract documents are received, from the VEV buyer the SZR invoice complaint document, the MR order document, and through the SZF contract processing the TER production documents are received, and to the VEV buyer the KIN receivables document, the IGSZ acknowledgement, disposition, verification of performance, bill of lading and invoice documents are sent, while from the SZAL supplier subsystem, the KIN receivables document, the MR order documentation are sent, and SZR invoice complaint document, the IGSZ acknowledgement, disposition, verification of performance, bill of lading and invoice documents are received, and furthermore, from the KON controlling subsystem documents are sent to ESZ cooperating organisations, as well as to the MT management and owners, from the SZV accounting subsystem statement document is sent to the NGM ministry and tax return document is sent to NAV bureau, while certificates and list of errors are received from the NAV bureau.
  • the FED processing of document appear within the company system, even when
  • Fig. 3 shows a system, which flows the document through the SDSys system by the procedure according to the invention.
  • the mode of designing and implementing a system which routes the documents through the SDSys system is introduced in Fig. 3. It means that the document logistics is changed entirely, because the paper based equivalent of the document is processed only once, and this is done within the SDSys system.
  • the document can flow as an automatically processable electronic document, while it keeps the property that existed in the case of the paper based document, so its display is also possible.
  • the authenticated electronic copies of the document can be attached in an inseparable manner.
  • Fig. 3 shows a preferred application of the procedure according to the invention, where the IT subsystems accomplish the tasks that can be done automatically based on the data, wherein the systems using proper subsystems are able to show the authenticated electronic copy of the original document on the display in case of tasks requiring human decision, therefore, it is possible to see by the application of the procedure, that the processing subsystems disappear from the company environment (FED), and after running them once, the document can be transferred to the utility (KOS).
  • FED company environment
  • KOS utility
  • the task is performed by bookkeeping bureaus.
  • the forwarding of paper based documents to the bookkeeper is made in a campaign like manner, primarily according to the reporting schedules.
  • the work management of the bookkeeping bureaus is influenced by the large amount of documents coming in simultaneously, which is expected to be processed within a short period of time. The work cannot be scheduled properly, and this can sometimes lead to unnecessary capacity.
  • the bookkeeper may store the documents in his own office, thus the company has no immediate access to the documents that could be important to the business, and alternatively, the bookkeeper may return the documents right after the bookkeeping operations, but in this case the bookkeeper has no access to the original documents in the reporting period.
  • the task regarding the recording of data is minimum, as ensured by OCR and the ontology.
  • the electronic copy of the document is available immediately any time on the screen, so the tasks related to the search and data sorting in the files are practically eliminated.
  • the list of problems is the same as the list shown in Point 1, except that the data processing in this industry has an entirely different role.
  • the SDSys is capable of ensuring to the relevant IT system, that the processed document would be capable of controlling the loading machines directly, or in case of manual processing, the system would be capable of preparing the takeout compiling lists without preliminary processing.
  • the subsystems of larger companies process the paper based documents as subsystems.
  • a further task is the operation of the company logistics system of documents (internal mailing traffic).
  • the logistical feature of the document is that a document is part of a number of subsystems, which run linearly or in parallel.
  • the document pre-processing can be centralized with the help of the ontology, and it can be outsourced to external contractors, which significantly increases the effectiveness.
  • the SDSys document flow system is capable of performing the document logistical tasks with the help of the addressing and sub addressing system of the document, and it forwards the document to the division, where it has to be processed, or where it performs its control function.
  • SDSys system and the document processing according to the ontology can ensure that the different data structures are interconnected efficiently without interfaces.
  • the state administration subsystems require a considerable amount of data from the companies, which must match their own internal analysing and controlling systems. It means that a data releasing subsystem has to be prepared for each program, which establishes a link to the state administration subsystems. As soon as the SDSys link to the state administration system exists, every program has the possibility of maintaining the expected relationship with the subsystems. There is no additional development task, because the necessary procedure is the same as that of the system which is capable of maintaining contact with the database of the bookkeeper or any other partner.
  • the legal subsystems of law offices, district attorneys, courts and state administration are obliged to keep huge quantities of paper based documents.
  • the document flow system of SDSys is a secure and closed system, which is capable of verifying the authenticity of electronic document copies, which are regarded as private or public documents that can be used as evidence according to the civil lawsuit legislation.
  • the system is ready in technical terms, it is only the required change of attitude which presents a limitation.
  • the "Document utility" is a general and universal service, which is suitable for processing or servicing any type of document.
  • the essence of the invention can be described in a single sentence as follows: the secure forwarding of any document from the sender to the recipient, along the route of which supplementary services can be assigned to the document, while the document handling and its route can be evaluated within the expiry period. Because of the latter property, and subject to a further development of the legal background, it may correspond to the category of private document that can be used as evidence without electronic signature or time stamp.
  • the document utility provides a procedure, that can ensure the safe and traceable flow of documents, while the document could go through processing, changes and other procedures according to the ordered services.
  • the task of the utility is to analyse the document flow, and to ensure that the document is forwarded to the assigned service provider, to receive the modified or new document from the service providers, stores the path of the document, then to forward it to the addressees. If no service has been ordered by the sender, nor by the recipient, then the basic service of the utility is enabled, which ensures the reception of the document.
  • data could be a date, a value, a description, while the document is able to determine whether the data refers to delivery, payment deadline or issue of invoice, or the value is the quantity or price, the description is the name of the author or article or the description of the service.
  • the document utility is a compilation of services.
  • the respective services (IT services in practice) perform their activities independently.
  • the utility has three layers.
  • the documents will be processed by the service assigned to the service provider, if the sender or the recipient has a valid contract for this.
  • During development of the utility, a processing service with the functionality expected by the external service provider shall be created for each external service provider.
  • the task of the service is to forward the documents to the external service provider in their pre-processing condition, and to place the results in the utility database after the service has been completed. The external service provider is expected to provide the necessary data to the utility.
  • the services assigned to the service providers can be designed only jointly with the service providers.
  • post service: the utility's own basic service, which makes sure that the documents are delivered to the recipient in a verified manner.
  • the ontology-based electronic invoicing is a mode of document interpretation that has not been explored before, and which could lead to programmable automatic document processing. It is indispensable for the solution that all documents in the ICT system assume a data structure that can be handled by information technology.
  • the digitization of written documents and the digital conversion of sound and image recordings are not enough for extracting the data, because the result only gives the processing systems an opportunity to handle the data of documents in a unified structure and in an orderly fashion.
  • the image information has to be recognised first (text interpretation, shape recognition), then the data-like content has to be found in it (dates, partner data, values, etc.).
  • Data shall be assigned their interpretations (date of delivery or payment deadline, location of delivery or premises of company, value in sum or value in the given currency, etc.).
  • the novelty of the invention is that the place of the document in the ontological structure is not searched for afterwards; instead, the document is created according to a determined element of an already existing ontology data structure.
  • a tree structure exists (not created yet), in which all documents could be found on the leaves.
  • the documents listed at the nodes of the tree structure have the common property of sharing identical data and interpretations, which are inherited by all descendant documents further down the structure.
  • a document prototype is an element positioned at a terminal point of the structure. Each element of the structure is characterised only by the data description and interpretation corresponding to its own identity; the rest of the descriptive data are inherited from the predecessors.
  • the created document description system defines the e-ontological map of the documents.
  • the ontology database is a separate database, which can be used by more utilities also.
  • the ontology database is realized from elemental level through an organic development.
  • the task of the utilities is to regularly update the descriptions and publish them on the internet. This is important, because the document processing systems do not need to store the entire ontology map (which is probably very large); it is enough to poll the description of the given document prototype, on the basis of which the document in the system can be interpreted automatically, independently of the language in which the original document was created.
  • the document is identified in the entire database individually.
  • One of the services provided by the utility is the creation of a document description interface characterising the given utility from the ontology database.
  • An example could be a document utility that has chosen an XML data structure: it prepares the XML for the processing system, which only has to be filled with data, and it allows the development environment to automatically produce an interface that ensures easy access to uploading or reading the XML assigned to the document.
  • the tasks of the utility include the creation of the empty XML, taking into account all predecessors of the meta data description identified by an individual identifier, or of an interface that can be interpreted by the translating program. The service does not have to generate the tools on the basis of the last element of the tree structure. Any intermediate element can be regarded as the last element of the descriptive database. The result in such a case is that the data fields describing the meta data developed further on the given document ontology cannot be seen in the processing.
  • a language module is to be created in the ontological database, which makes sure that the Field Text included in the description can be displayed in any language.
  • when a new document prototype is created (which is actually the emergence of a new individual document identifier with the associated elements), it should be possible, immediately or at any time later, to assign an arbitrary series of characters to any Field Text in the ontology database.
  • the prototype can be prepared in any language, and the language version is to be prepared when it is necessary for the processing.
  • the solution makes sure that the form of a document can be produced with the same program in any language, and that the meta data of the document are accessible in any language existing in the ontology.
  • a document may be produced with a form in the Hungarian language, and if it has assigned English Field Text data, then it can be read with form in the English language.
  • the ontology database and the utility constitute a unit in such a way that the ontology database is an operable unit without the utility, and an IT utility can be created without the ontology database. Acting as a single unit, the two systems realize the most efficient document utility. With the use of the ontology database it is possible to prepare an application that is capable of creating an input form for any document prototype. By filling in the prepared form, the user creates an electronic document, which can be sent directly to the document utility.
  • the ontology database can be used by any service, if the meta data needs to be interpreted.
  • the external service provider receives the ontology identifier assigned to the document. Based on this, it does not search randomly among the data obtained with OCR, but tries to find the specific data content assigned to the given fields in the ontology. The processing is considered successful when all the data described in the ontology are found. The degree of success may be expressed as a proportion based on the found and not found data.
  • the meta data are returned by the service in the order corresponding to its own business logic.
  • the service assigned to the external service provider places the returned data into the meta data fields listed by the ontology.
  • a significant advantage of the method is that the external service provider is not expected to prepare an independent procedure for each document type, because the search for meta data is not built on the type of document, but on the rules given by the ontology. The efficacy of the search and the safety of the found data fields increase significantly.
  • the document evaluation searches for meta data built on the document type given in the provided service.
  • A document having an ontological identifier can also be sent to an external service provider that has established the possibility of this type of processing.
  • the service has to consider the ontology identifier, or another document property if no such identifier exists, to determine the mode of processing requested from the external service provider for the given document (invoice, bank report, contract, etc.). If the document has an ontological identifier, then the service converts the processed meta data to data fields according to the ontology. If the document has no ontological identifier, then the service performs a data conversion according to the public business logic of the external service provider and the document utility.
  • the method according to the invention allows the implementation of safe document handling, as a result of which the unchanged condition of the content of a given document can be guaranteed.
  • ODA - Ontology database (document and meta data interpreting business logic based on ontology)
  • KOS - SDSys (SDSys utility, and the services ordered in it)
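
The inheritance of field descriptions along the ontology tree, the language-dependent Field Text and the proportional success measure described in the list above can be illustrated with a short Python sketch. It is not taken from the application: the class `OntologyNode`, the helpers `empty_document` and `success_ratio`, and all field names are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class OntologyNode:
    """One element of the document ontology tree (hypothetical model)."""
    node_id: str
    own_fields: List[str]
    parent: Optional["OntologyNode"] = None
    # field name -> {language code -> Field Text}
    field_text: Dict[str, Dict[str, str]] = field(default_factory=dict)

    def all_fields(self) -> List[str]:
        """Fields inherited from the predecessors first, then the node's own."""
        inherited = self.parent.all_fields() if self.parent else []
        return inherited + [f for f in self.own_fields if f not in inherited]

    def label(self, field_name: str, lang: str) -> str:
        """Field Text from this node or the nearest predecessor.

        Falls back to the raw field name when no text exists for `lang`.
        """
        node: Optional[OntologyNode] = self
        while node is not None:
            text = node.field_text.get(field_name, {}).get(lang)
            if text is not None:
                return text
            node = node.parent
        return field_name


def empty_document(node: OntologyNode) -> Dict[str, Optional[str]]:
    """Empty record for a prototype or for any intermediate element."""
    return {name: None for name in node.all_fields()}


def success_ratio(node: OntologyNode, found: Dict[str, Optional[str]]) -> float:
    """Share of the ontology fields for which a value was found."""
    fields = node.all_fields()
    if not fields:
        return 1.0
    hits = sum(1 for name in fields if found.get(name) is not None)
    return hits / len(fields)


# Assumed example tree: a generic document with an invoice prototype below it.
document = OntologyNode(
    "document",
    ["issue_date"],
    field_text={"issue_date": {"en": "Date of issue", "hu": "Kiállítás dátuma"}},
)
invoice = OntologyNode("invoice", ["payment_deadline", "total"], parent=document)

print(invoice.all_fields())
print(invoice.label("issue_date", "hu"))
print(success_ratio(invoice, {"issue_date": "2017-02-20", "total": "100"}))
```

Calling `empty_document(document)` instead of `empty_document(invoice)` corresponds to treating an intermediate element as the last element: the invoice-specific fields are then simply not visible to the processing system.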

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • General Physics & Mathematics (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Marketing (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Technology Law (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • General Engineering & Computer Science (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The subject of the invention is a method for handling documents on the basis of an ontology, which allows the implementation of safe document handling, as a result of which the unchanged condition of the data content of a given document can be guaranteed. In the course of the method, the gateway server (SDSys GS) belonging to the SDSys system receives all documents coming from the document issuers, acting as the collector in terms of the utility, and then forwards the documents to the appropriate subcontractor according to the ordered services. If all invited subcontractors fulfil their obligations, then the SDSys gateway server (SDSys GS) forwards the documents to the system of the recipient. During the process, the subcontractor service providers perform the tasks that require special business logic.
PCT/IB2017/000144 2017-02-20 2017-02-20 Method for handling documents on the basis of an ontology Ceased WO2018150211A1 (fr)
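
The flow described in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the implementation of the application: `GatewayServer`, `Document`, `register_service`, `order` and `submit` are hypothetical names, and a subcontractor is modelled as a plain function that returns the processed document.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Document:
    sender: str
    recipient: str
    payload: Dict[str, str]
    path: List[str] = field(default_factory=list)  # every hop is recorded here


# A subcontractor service takes a document and returns the processed document.
Service = Callable[[Document], Document]


class GatewayServer:
    """Hypothetical stand-in for the SDSys gateway server (SDSys GS)."""

    def __init__(self) -> None:
        self.services: Dict[str, Service] = {}
        self.orders: Dict[str, List[str]] = {}       # party -> ordered services
        self.inboxes: Dict[str, List[Document]] = {}  # recipient -> documents

    def register_service(self, name: str, service: Service) -> None:
        self.services[name] = service

    def order(self, party: str, service_name: str) -> None:
        self.orders.setdefault(party, []).append(service_name)

    def submit(self, doc: Document) -> Document:
        """Receive a document, run the ordered services, then deliver it."""
        doc.path.append("gateway:received")
        ordered: List[str] = []
        for party in (doc.sender, doc.recipient):
            for name in self.orders.get(party, []):
                if name not in ordered:
                    ordered.append(name)
        for name in ordered:
            # An exception raised here means the subcontractor did not fulfil
            # its obligation, so the document is not delivered.
            processed = self.services[name](doc)
            processed.path = doc.path + [f"service:{name}"]
            doc = processed
        doc.path.append("gateway:delivered")
        self.inboxes.setdefault(doc.recipient, []).append(doc)
        return doc


def extract_metadata(doc: Document) -> Document:
    """Assumed subcontractor: adds one meta data field to the payload."""
    doc.payload["total"] = "100"
    return doc


gateway = GatewayServer()
gateway.register_service("metadata", extract_metadata)
gateway.order("supplier", "metadata")
delivered = gateway.submit(Document("supplier", "customer", {"type": "invoice"}))
print(delivered.path)
```

When neither party has ordered a service, `submit` only records reception and delivery, which corresponds to the basic service of the utility.
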

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/IB2017/000144 WO2018150211A1 (fr) 2017-02-20 2017-02-20 Method for handling documents on the basis of an ontology

Publications (1)

Publication Number Publication Date
WO2018150211A1 (fr) 2018-08-23

Family

ID=60421816

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2017/000144 Ceased WO2018150211A1 (fr) 2017-02-20 2017-02-20 Method for handling documents on the basis of an ontology

Country Status (1)

Country Link
WO (1) WO2018150211A1 (fr)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1577730A1 (fr) * 2004-03-17 2005-09-21 Sap Ag Method, system and software for verifying certain conditions in electronic documents
WO2009061917A1 (fr) * 2007-11-06 2009-05-14 Copanion, Inc. Systems and methods for automatically organizing electronic jobs by automatically classifying electronic documents using extracted image and text features and using a machine learning recognition subsystem
US20160042124A1 (en) * 2014-08-08 2016-02-11 Practice Fusion, Inc. Electronic health records data management systems and methods

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17801771

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17801771

Country of ref document: EP

Kind code of ref document: A1