[go: up one dir, main page]

WO2013100415A1 - Système de gestion de la qualité de données distribuées et procédé associé - Google Patents

Système de gestion de la qualité de données distribuées et procédé associé Download PDF

Info

Publication number
WO2013100415A1
WO2013100415A1 PCT/KR2012/010247 KR2012010247W WO2013100415A1 WO 2013100415 A1 WO2013100415 A1 WO 2013100415A1 KR 2012010247 W KR2012010247 W KR 2012010247W WO 2013100415 A1 WO2013100415 A1 WO 2013100415A1
Authority
WO
WIPO (PCT)
Prior art keywords
user
data
function
target data
heterogeneity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/KR2012/010247
Other languages
English (en)
Korean (ko)
Inventor
국윤규
이준
박민우
최기석
김재수
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Korea Institute of Science and Technology Information KISTI
Original Assignee
Korea Institute of Science and Technology Information KISTI
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Korea Institute of Science and Technology Information KISTI filed Critical Korea Institute of Science and Technology Information KISTI
Publication of WO2013100415A1 publication Critical patent/WO2013100415A1/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F15/00Digital computers in general; Data processing equipment in general
    • G06F15/16Combinations of two or more digital computers each having at least an arithmetic unit, a program unit and a register, e.g. for a simultaneous processing of several programs
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/56Provisioning of proxy services
    • H04L67/565Conversion or adaptation of application format or content

Definitions

  • the present invention relates to a distributed data interoperability method, and more particularly, by performing data transformation and purification by dynamically generating and utilizing a business rule necessary for data transformation and purification based on user definition for interoperability of distributed data.
  • the present invention relates to a distributed data quality management system and method for improving the quality of data.
  • the data management in the information system is very important because it supports business operation and decision-making through data analysis and processing, and especially, it must be managed so that invalid data is not utilized for work.
  • the present invention has been made in view of the above circumstances, and an object of the present invention is to generate and store a custom rule including at least one of a user-defined function and a built-in function in which the master operating apparatus is variably set.
  • the master operating device provides a distributed data quality management system and method for performing data value adjustment on the collected target data for one or more of the slave operating devices through the application of the stored user-defined rule.
  • Business rules for conversion and refining It is to perform data transformation and purification by dynamically generating and utilizing based on user definition.
  • Still another object of the present invention is to generate and store a custom rule including at least one of a variable and a user-defined function that is variablely set, and reference from one or more other operating devices that operate each legacy system. Collecting the target data for linkage with the data, and providing an operating device and an operation method for adjusting the data value for the collected target data through the application of the stored user-defined rules, necessary for data conversion and purification It is to perform data transformation and purification by dynamically creating and utilizing business rules based on user definition.
  • the slave operating device having a database for operating its own legacy system, and provides the target data stored in the database in accordance with the data connection request ; And collecting the target data for linkage with reference data by transmitting the data linkage request to one or more slave operating devices according to a specific process, and selecting at least one of a user-defined function and a built-in function that are variably set. It characterized in that it comprises a master operating device for performing a data value adjustment for the collected target data through the application of a user-defined rule including.
  • the master operating device sets the user defined function according to a user request through a user interface (UI), and performs error verification through grammar verification and execution verification of the set user defined function.
  • UI user interface
  • the user-defined function of which error verification is completed is set as the user-defined rule and stored.
  • the master operating device characterized in that for setting the user-defined function in conjunction with the built-in function, in response to a user request through the user interface.
  • the master operating device by comparing the target data and the reference data through a predetermined heterogeneity determination criterion to determine the validity of the target data, if the determination result is not valid, the user It is characterized by adjusting a data value for the target data by applying a definition rule.
  • the validity of the object data is determined through the heterogeneity determination criteria including at least one of structural heterogeneity, expression heterogeneity, relationship heterogeneity, and semantic heterogeneity.
  • the master operating apparatus performs at least one of data type conversion, range limitation, expression mismatch adjustment, logic error adjustment, and meaningless adjustment through the application of the user-defined rule to the collected target data. And adjusting the data value for the data.
  • a conversion rule setting unit for generating and storing a user-defined rule including at least one of a variable and a user-defined function is set variable;
  • a data collector for collecting target data for linkage with reference data from one or more other operating apparatuses for operating each legacy system;
  • a conversion processor configured to apply the stored user definition rule to adjust data values of the collected target data.
  • the conversion rule setting unit sets the user defined function according to a user request through a user interface (UI), and performs error verification through grammar verification and execution verification of the set user defined function.
  • UI user interface
  • the user-defined function of which error verification is completed is set as the user-defined rule and stored.
  • the conversion rule setting unit according to the user request through the user interface, characterized in that for setting the user-defined function linked to the built-in function.
  • the conversion processor determines the validity of the target data by performing comparison between the target data and the reference data through a predetermined heterogeneity determination criterion, and when the determination result is not valid, the user definition.
  • the data value for the target data is adjusted by applying a rule.
  • the conversion processing unit is characterized in that it determines the validity of the target data through the heterogeneity determination criteria including at least one of structural heterogeneity, expression heterogeneity, relationship heterogeneity, and semantic heterogeneity.
  • the conversion processing unit performs at least one of data type conversion, range limitation, expression mismatch adjustment, logic error adjustment, and meaningless adjustment through the application of the user-defined rule to the collected target data.
  • the data value adjustment is performed.
  • a distributed data quality management method for achieving the above object is provided:
  • the method includes a custom rule including at least one of a user-defined function and a built-in function in which the master operating apparatus is variably set.
  • a conversion rule setting step of generating and storing a message;
  • a data request step of requesting, by the master operating device, target data for linkage with reference data, to a slave operating device operating each legacy system according to a specific process process;
  • a target data providing step of providing, by the slave operating apparatus, the target data stored in its own database according to the request;
  • a data conversion processing step of performing, by the master operating device, data value adjustment on the collected target data to one or more of the slave operating devices through the application of the stored user defined rule.
  • the conversion rule setting step includes: a function setting step of setting the user defined function according to a user request through a user interface (UI); An error verification step of performing error verification through grammar verification and execution verification of the set user defined function; And a function storage step of setting and storing the user-defined function whose error verification is completed as the user-defined rule.
  • UI user interface
  • An error verification step of performing error verification through grammar verification and execution verification of the set user defined function
  • a function storage step of setting and storing the user-defined function whose error verification is completed as the user-defined rule.
  • the function setting step characterized in that for setting the user-defined function in conjunction with the built-in function, in response to a user request through the user interface.
  • the data conversion processing step the validity determination step of determining the validity of the target data by performing a comparison between the target data and the reference data through a predefined heterogeneity determination criteria; And a data adjustment step of adjusting a data value for the target data by applying the user definition rule when the determination result is not valid.
  • the validity determining step is characterized by determining the validity of the target data through the heterogeneity determination criteria including at least one of structural heterogeneity, expression heterogeneity, relationship heterogeneity, and semantic heterogeneity.
  • the data adjustment step includes at least one of data type conversion, range limitation, expression mismatch adjustment, logic error adjustment, and meaningless adjustment through the application of the user-defined rule to the collected target data. And adjusting the data value for the data.
  • the conversion rule setting step includes: a function setting step of setting the user defined function according to a user request through a user interface (UI); An error verification step of performing error verification through grammar verification and execution verification of the set user defined function; And a function storage step of setting and storing the user-defined function whose error verification is completed as the user-defined rule.
  • UI user interface
  • An error verification step of performing error verification through grammar verification and execution verification of the set user defined function
  • a function storage step of setting and storing the user-defined function whose error verification is completed as the user-defined rule.
  • the function setting step characterized in that for setting the user-defined function in conjunction with the built-in function, in response to a user request through the user interface.
  • the conversion processing step the validity determination step of determining the validity of the target data by performing a comparison between the target data and the reference data through a predetermined heterogeneity determination criteria; And a data adjustment step of adjusting a data value for the target data by applying the user definition rule when the determination result is not valid.
  • the validity determining step is characterized by determining the validity of the target data through the heterogeneity determination criteria including at least one of structural heterogeneity, expression heterogeneity, relationship heterogeneity, and semantic heterogeneity.
  • the data adjustment step includes at least one of data type conversion, range limitation, expression mismatch adjustment, logic error adjustment, and meaningless adjustment through the application of the user-defined rule to the collected target data. And adjusting the data value for the data.
  • a computer-readable recording medium comprising: a conversion rule setting step of generating and storing a user-defined rule including at least one of a variable and a user-defined function; A data collection step of collecting target data for linkage with reference data from one or more other operating apparatuses that operate each legacy system; And instructions for executing a conversion processing step of performing data value adjustment on the collected target data by applying the stored user defined rule.
  • the conversion rule setting step includes: a function setting step of setting the user defined function according to a user request through a user interface (UI); An error verification step of performing error verification through grammar verification and execution verification of the set user defined function; And a command for executing a function storing step of setting and storing the user-defined function whose error verification is completed as the user-defined rule.
  • UI user interface
  • An error verification step of performing error verification through grammar verification and execution verification of the set user defined function
  • the function setting step characterized in that for setting the user-defined function in conjunction with the built-in function, in response to a user request through the user interface.
  • the conversion processing step the validity determination step of determining the validity of the target data by performing a comparison between the target data and the reference data through a predetermined heterogeneity determination criteria; And a command for executing a data adjusting step of adjusting a data value for the target data by applying the user-defined rule when the determination result is not valid.
  • the validity determining step is characterized by determining the validity of the target data through the heterogeneity determination criteria including at least one of structural heterogeneity, expression heterogeneity, relationship heterogeneity, and semantic heterogeneity.
  • the data adjustment step includes at least one of data type conversion, range limitation, expression mismatch adjustment, logic error adjustment, and meaningless adjustment through the application of the user-defined rule to the collected target data. And adjusting the data value for the data.
  • the distributed data quality management system and method according to the present invention by dynamically generating and utilizing a business rule necessary for data conversion and purification for user-based basis for the interoperability of distributed data, it is widely used for data conversion and purification Use it to improve the quality of data and further minimize changes in legacy information systems.
  • FIG. 1 is a schematic configuration diagram of a distributed data quality management system according to an embodiment of the present invention.
  • FIG. 2 is a schematic configuration diagram of an operation apparatus according to an embodiment of the present invention.
  • FIG 3 is a view for explaining a user interface according to an embodiment of the present invention.
  • FIG. 4 is a view for explaining a user-defined function according to an embodiment of the present invention.
  • FIG. 5 is a diagram for explaining an example of data value adjustment according to an embodiment of the present invention.
  • FIG. 6 is a schematic flowchart illustrating an operating method of a distributed data quality management system according to an embodiment of the present invention.
  • FIG. 7 is a schematic flowchart illustrating a method of operating an operating apparatus according to an embodiment of the present invention.
  • FIG. 1 is a schematic structural diagram of a distributed data quality management system according to an embodiment of the present invention.
  • the system collects target data for linkage with reference data to one or more slave operating apparatuses 200 operating each legacy system and performs data value adjustment.
  • Device 100 and a slave device 200 for providing target data according to a data linkage request from the master operating device 100.
  • the master operating device 100 and the slave operating device 200 refers to a server operating each legacy system on the same domain or different domains, a process that requires distributed data collection It has the same configuration that the agent (Agent) for processing, but may be divided into a master operating device 100 or a slave operating device 200 according to the process processing subject.
  • the operating device for collecting distributed data in accordance with the process processing through the agent as a master operating device 100, and at the same time provides the target data itself stored in accordance with the request of the master operating device 200 It is assumed that the operating device to be a slave operating device 200 will be described.
  • the master operating apparatus 100 generates and stores a user-defined rule for adjusting data values.
  • the master operation apparatus 100 sets the user definition function according to a user request through a user interface (UI). That is, the master operation apparatus 100 provides a user interface as shown in FIG. 3, and sets a user defined function according to a user request through the provided user interface.
  • UI user interface
  • the user-defined function as shown in FIG. 4, for example, a function type for distinguishing a business rule, a function name, a parameter included in the function, a data type for returning a function execution result value, and a description of the function.
  • Function usage and function contents which are algorithms that are actually performed in the function, may be included, and the function contents may be written in JavaScript and executed by a JavaScript execution engine, without compiling or re-executing for use in a data conversion system.
  • the user-defined function may be set to work with the built-in function, in response to a user request through the provided user interface.
  • the built-in function refers to a function provided by the system itself, and may include, for example, a date / time function, a math / triangle function, a statistical function, a logical function, and the like.
  • the master operation apparatus 100 performs error verification on the user defined function set through the user interface. In other words, the master operation apparatus 100 performs a grammatical verification of the user-defined function, converts a variable to be used in, for example, Javascript, and verifies the error of the function through testing therefor.
  • the master operating apparatus 100 sets and stores the user defined function in which error verification is completed, as the user defined rule. That is, the master operating apparatus 100 stores a user-defined function that has completed error verification through grammar verification and execution verification as a user-defined rule. Then, the stored user-defined rule is called when a data conversion and purification is required. To be applied.
  • the master operating device 100 requests and collects target data for linkage with reference data to one or more slave operating devices 200.
  • the master operating apparatus 100 transmits a data linkage request to one or more slave operating apparatuses 200 operating each legacy system according to a specific process processing according to an agent driving, thereby linking with reference data. Collect the target data for.
  • the slave operation apparatus 100 supports the process processing operation of the master operation apparatus 100 by extracting and providing self-stored target data when a data linkage request is received from the master operation apparatus 100. .
  • the master operation apparatus 100 performs data value adjustment on the collected target data through the application of the stored user defined rule.
  • the master operating device 100 determines the validity of the target data collected from at least one slave operating device 200. That is, the master operation apparatus 100 compares the target data with the reference data based on predefined heterogeneity determination criteria such as structural heterogeneity, expression heterogeneity, relationship heterogeneity, and semantic heterogeneity, thereby validating the target data. To judge. Further, when the determination result is not valid, the master operation apparatus 100 applies the user definition rule to adjust the data value for the target data. That is, when it is determined that the validity of the target data is not valid, the master operating apparatus 100 calls a stored user defined rule, for example, data type conversion, range limitation, expression mismatch adjustment, logic error adjustment, and meaningless adjustment.
  • a stored user defined rule for example, data type conversion, range limitation, expression mismatch adjustment, logic error adjustment, and meaningless adjustment.
  • the target data is converted into valid data for linking with the reference data.
  • the data 'Korea' of the operating device 1 and the data 'KOR' of the operating device 2 are linked, the data 'Korea' of the operating device 1 and the data 'KOR' of the operating device 2 Although different by natural language processing, they are semantically the same.
  • the conversion and purification is carried out to 'KOR', the 'KOR' of the operating device 2 'Is converted to' Korea 'and refined when linked to the operating device 1.
  • the master operation apparatus 100 includes a conversion rule setting unit 110 for generating and storing a user-defined rule, and a data collecting unit 120 for collecting target data for linkage with reference data. , And a conversion processing unit 130 for applying a user-defined rule to perform data value adjustment for the collected target data.
  • the conversion rule setting unit 110 sets the user defined function according to a user request through a user interface (UI).
  • UI user interface
  • the conversion rule setting unit 110 provides a user interface as shown in FIG. 3, and sets a user defined function according to a user request through the provided user interface.
  • a user-defined function as shown in FIG. 4, for example, a function type for distinguishing a business rule, a function name, a parameter included in the function, a data type for returning a function execution result value, and a description of the function.
  • Function usage and function contents which are algorithms that are actually performed in the function, may be included, and the function contents may be written in JavaScript and executed by a JavaScript execution engine, without compiling or re-executing for use in a data conversion system. Make it available during execution.
  • the user-defined function may be set to work with the built-in function, in response to a user request through the provided user interface.
  • the built-in function refers to a function provided by the system itself, and may include, for example, a date / time function, a math / triangle function, a statistical function, a logical function, and the like.
  • the conversion rule setting unit 110 performs error verification on the user defined function set through the user interface.
  • the conversion rule setting unit 110 performs a grammatical verification of the user-defined function, converts a variable to be used in, for example, Javascript, and verifies the error of the function by testing the same. .
  • the conversion rule setting unit 110 sets and stores the user defined function in which error verification is completed as the user defined rule.
  • the conversion rule setting unit 110 stores a user-defined function that has completed error verification through grammar verification and execution verification as a user-defined rule, and then stores the user-defined rule at the time when data conversion and purification are required. To be called and applied.
  • the data collector 120 requests and collects target data for linkage with reference data to one or more slave operating apparatuses 200.
  • the data collection unit 120 transfers a data connection request to one or more slave operating apparatuses 200 operating each legacy system according to a specific process processing according to an agent driving, thereby linking with reference data. Collect the target data for.
  • the conversion processor 130 determines the validity of the target data collected from at least one slave operating apparatus 200.
  • the conversion processor 130 performs a comparison between the target data and the reference data based on predefined heterogeneity determination criteria such as structural heterogeneity, expression heterogeneity, relationship heterogeneity, and semantic heterogeneity. Determine the validity.
  • the conversion processor 130 adjusts the data value for the target data by applying the user definition rule.
  • the conversion processing unit 130 calls a stored custom rule, for example, data type conversion, range limitation, expression mismatch adjustment, logic error adjustment, and meaningless adjustment.
  • a stored custom rule for example, data type conversion, range limitation, expression mismatch adjustment, logic error adjustment, and meaningless adjustment.
  • the target data is converted into valid data for linking with the reference data. For example, as shown in FIG. 5, when the data 'Korea' of the operating device 1 and the data 'KOR' of the operating device 2 are linked, the data 'Korea' of the operating device 1 and the data 'KOR' of the operating device 2 Although different by natural language processing, they are semantically the same.
  • the distributed data quality management system by dynamically generating and utilizing a business rule required for data conversion and purification for user interaction based on the user definition for distributed data interoperability, It can be used extensively to improve the quality of data and further minimize changes in legacy information systems.
  • the master operating apparatus 100 generates and stores a user-defined rule for adjusting data values (S110-S120).
  • the master operation apparatus 100 sets the user definition function according to a user request through a user interface (UI). That is, the master operation apparatus 100 provides a user interface as shown in FIG. 3, and sets a user defined function according to a user request through the provided user interface.
  • UI user interface
  • the user-defined function as shown in FIG. 4, for example, a function type for distinguishing a business rule, a function name, a parameter included in the function, a data type for returning a function execution result value, and a description of the function.
  • Function usage and function contents which are algorithms that are actually performed in the function, may be included, and the function contents may be written in JavaScript and executed by a JavaScript execution engine, without compiling or re-executing for use in a data conversion system.
  • the user-defined function may be set to work with the built-in function, in response to a user request through the provided user interface.
  • the built-in function refers to a function provided by the system itself, and may include, for example, a date / time function, a math / triangle function, a statistical function, a logical function, and the like.
  • the master operation apparatus 100 performs error verification on the user defined function set through the user interface. In other words, the master operation apparatus 100 performs a grammatical verification of the user-defined function, converts a variable to be used in, for example, Javascript, and verifies the error of the function through testing therefor.
  • the master operating apparatus 100 sets and stores the user defined function in which error verification is completed, as the user defined rule. That is, the master operating apparatus 100 stores a user-defined function that has completed error verification through grammar verification and execution verification as a user-defined rule. Then, the stored user-defined rule is called when a data conversion and purification is required. To be applied.
  • the master operating apparatus 100 requests and collects target data for linkage with reference data to one or more slave operating apparatuses 200 (S130-S160).
  • the master operating apparatus 100 transmits a data linkage request to one or more slave operating apparatuses 200 operating each legacy system according to a specific process processing according to the agent driving, thereby linking with the reference data. Collect the target data for.
  • the slave operation apparatus 100 supports the process processing operation of the master operation apparatus 100 by extracting and providing self-stored target data when a data linkage request is received from the master operation apparatus 100. .
  • the master operating apparatus 100 performs data value adjustment on the collected target data through the application of the stored user defined rules (S170-S190).
  • the master operating device 100 determines the validity of the target data collected from one or more of the slave operating device 200. That is, the master operation apparatus 100 compares the target data with the reference data based on predefined heterogeneity determination criteria such as structural heterogeneity, expression heterogeneity, relationship heterogeneity, and semantic heterogeneity, thereby validating the target data. To judge. Further, when the determination result is not valid, the master operation apparatus 100 applies the user definition rule to adjust the data value for the target data. That is, when it is determined that the validity of the target data is not valid, the master operating apparatus 100 calls a stored user defined rule, for example, data type conversion, range limitation, expression mismatch adjustment, logic error adjustment, and meaningless adjustment.
  • a stored user defined rule for example, data type conversion, range limitation, expression mismatch adjustment, logic error adjustment, and meaningless adjustment.
  • the target data is converted into valid data for linking with the reference data.
  • the data 'Korea' of the operating device 1 and the data 'KOR' of the operating device 2 are linked, the data 'Korea' of the operating device 1 and the data 'KOR' of the operating device 2 Although different by natural language processing, they are semantically the same.
  • the conversion and purification is carried out to 'KOR', the 'KOR' of the operating device 2 'Is converted to' Korea 'and refined when linked to the operating device 1.
  • the user defined function is set according to a user request through a user interface (UI) (S210-S220).
  • UI user interface
  • the conversion rule setting unit 110 provides a user interface as shown in FIG. 3, and sets a user defined function according to a user request through the provided user interface.
  • a user-defined function as shown in FIG. 4, for example, a function type for distinguishing a business rule, a function name, a parameter included in the function, a data type for returning a function execution result value, and a description of the function.
  • Function usage and function contents which are algorithms that are actually performed in the function, may be included, and the function contents may be written in JavaScript and executed by a JavaScript execution engine, without compiling or re-executing for use in a data conversion system. Make it available during execution.
  • the user-defined function may be set to work with the built-in function, in response to a user request through the provided user interface.
  • the built-in function refers to a function provided by the system itself, and may include, for example, a date / time function, a math / triangle function, a statistical function, a logical function, and the like.
  • the conversion rule setting unit 110 performs a grammatical verification of the user-defined function, and converts a variable so that it can be used in, for example, JavaScript, and verifies the error of the function through testing therefor. .
  • the conversion rule setting unit 110 stores a user-defined function that has completed error verification through grammar verification and execution verification as a user-defined rule, and then stores the user-defined rule at the time when data conversion and purification are required. To be called and applied.
  • the target data for linkage with the reference data is collected and collected at one or more slave operating apparatuses 200 (S250).
  • the data collection unit 120 transfers a data connection request to one or more slave operating apparatuses 200 operating each legacy system according to a specific process processing according to the agent driving, thereby linking with the reference data. Collect the target data for.
  • the conversion processor 130 compares the target data with the reference data based on a predetermined heterogeneity determination criterion, for example, structural heterogeneity, expression heterogeneity, relationship heterogeneity, and semantic heterogeneity. Determine the validity.
  • a predetermined heterogeneity determination criterion for example, structural heterogeneity, expression heterogeneity, relationship heterogeneity, and semantic heterogeneity. Determine the validity.
  • the data value for the target data is adjusted by applying the user-defined rule (S270-S290).
  • the conversion processing unit 130 calls a stored user defined rule, for example, data type conversion, range limitation, expression mismatch adjustment, logic error adjustment, and meaningless adjustment.
  • a stored user defined rule for example, data type conversion, range limitation, expression mismatch adjustment, logic error adjustment, and meaningless adjustment.
  • the target data is converted into valid data for linking with the reference data. For example, as shown in FIG. 5, when the data 'Korea' of the operating device 1 and the data 'KOR' of the operating device 2 are linked, the data 'Korea' of the operating device 1 and the data 'KOR' of the operating device 2 Although different by natural language processing, they are semantically the same.
  • the distributed data quality management method by dynamically generating and utilizing a business rule necessary for data conversion and purification for user interaction based on the user definition for distributed data interoperability, It can be used extensively to improve the quality of data and further minimize changes in legacy information systems.
  • the steps of the method or algorithm described in connection with the embodiments presented herein may be embodied in the form of program instructions that may be executed by various computer means and recorded on a computer readable medium.
  • the computer readable medium may include program instructions, data files, data structures, etc. alone or in combination.
  • Program instructions recorded on the media may be those specially designed and constructed for the purposes of the present invention, or they may be of the kind well-known and available to those having skill in the computer software arts.
  • Examples of computer readable recording media include magnetic media such as hard disks, floppy disks and magnetic tape, optical media such as CD-ROMs, DVDs, and magnetic disks such as floppy disks.
  • Magneto-optical media and hardware devices specifically configured to store and execute program instructions, such as ROM, RAM, flash memory, and the like.
  • program instructions include not only machine code generated by a compiler, but also high-level language code that can be executed by a computer using an interpreter or the like.
  • the hardware device described above may be configured to operate as one or more software modules to perform the operations of the present invention, and vice versa.
  • the distributed data quality management system and method thereof overcomes the limitations of the existing technology in that it dynamically generates and utilizes business rules necessary for data transformation and purification for user interaction based on interoperability of distributed data.
  • the present invention there is not only the use of the related technology but also the possibility of marketing or sales of the applied device as well as the degree to which it can be clearly realized in reality.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Hardware Design (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
PCT/KR2012/010247 2011-12-28 2012-11-29 Système de gestion de la qualité de données distribuées et procédé associé Ceased WO2013100415A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR10-2011-0144472 2011-12-28
KR1020110144472A KR20130076062A (ko) 2011-12-28 2011-12-28 분산 데이터 품질 관리 시스템 및 그 방법

Publications (1)

Publication Number Publication Date
WO2013100415A1 true WO2013100415A1 (fr) 2013-07-04

Family

ID=48697812

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2012/010247 Ceased WO2013100415A1 (fr) 2011-12-28 2012-11-29 Système de gestion de la qualité de données distribuées et procédé associé

Country Status (2)

Country Link
KR (1) KR20130076062A (fr)
WO (1) WO2013100415A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110908984A (zh) * 2019-10-31 2020-03-24 北京浪潮数据技术有限公司 一种数据处理方法、装置、设备及存储介质

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102016405B1 (ko) * 2017-12-07 2019-09-02 넷마블 주식회사 데이터 처리 시스템 및 방법
KR102249350B1 (ko) * 2019-08-23 2021-05-07 넷마블 주식회사 데이터 처리 시스템 및 방법

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5680551A (en) * 1993-10-21 1997-10-21 Sybase, Inc. Electronic messaging method of and system for heterogeneous connectivity and universal and generic interfacing for distributed applications and processes residing in wide variety of computing platforms and communication transport facilities
KR100684680B1 (ko) * 1998-11-18 2007-02-22 사가 소프트웨어, 인크. 확장가능한 분산된 기업용 애플리케이션 통합 시스템
KR100728612B1 (ko) * 2004-07-29 2007-06-14 (주)위세아이텍 데이터이행에 있어서의 데이터 품질관리 방법

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5680551A (en) * 1993-10-21 1997-10-21 Sybase, Inc. Electronic messaging method of and system for heterogeneous connectivity and universal and generic interfacing for distributed applications and processes residing in wide variety of computing platforms and communication transport facilities
KR100684680B1 (ko) * 1998-11-18 2007-02-22 사가 소프트웨어, 인크. 확장가능한 분산된 기업용 애플리케이션 통합 시스템
KR100728612B1 (ko) * 2004-07-29 2007-06-14 (주)위세아이텍 데이터이행에 있어서의 데이터 품질관리 방법

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
JIN-YOUNG, KIM ET AL.: "Design and Implementation of Semantic Type Classification Techniques for Reusing Data in Legacy Systems", THE JOURNAL OF KOREAN INSTITUTE OF INFORMATION TECHNOLOGY, vol. 8, no. 7, July 2010 (2010-07-01), pages 29 - 38 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110908984A (zh) * 2019-10-31 2020-03-24 北京浪潮数据技术有限公司 一种数据处理方法、装置、设备及存储介质
CN110908984B (zh) * 2019-10-31 2022-06-10 北京浪潮数据技术有限公司 一种数据处理方法、装置、设备及存储介质

Also Published As

Publication number Publication date
KR20130076062A (ko) 2013-07-08

Similar Documents

Publication Publication Date Title
US10642606B2 (en) Re-use of code
CN112559095A (zh) 目标业务的执行方法、系统、服务器及存储介质
WO2020215680A1 (fr) Procédé et appareil permettant de générer automatiquement une catégorie pojo, support d'informations et dispositif d'ordinateur
WO2011065660A4 (fr) Système de simulation de calcul et son procédé
WO2013100415A1 (fr) Système de gestion de la qualité de données distribuées et procédé associé
WO2015060493A1 (fr) Appareil et procédé permettant d'actualiser de manière incrémentielle des données de navigation tout en conduisant
CN116303540A (zh) Sql信息生成方法、装置、电子设备及可读存储介质
WO2018035791A1 (fr) Procédé et système de commande en boucle fermée de la vitesse de rotation d'une machine d'enroulement horizontal
WO2018027342A1 (fr) Procédé et système d'application de synonyme dans une recherche de mégadonnées
WO2018027576A1 (fr) Procédé et système de collecte de durée de fonctionnement dans des statistiques dans l'internet des objets
WO2018027466A1 (fr) Procédé et système de mémorisation de données volumineuses dans un système distribué
WO2018035788A1 (fr) Procédé et système d'application pour une commande en boucle ouverte dans une machine d'enroulement vertical
WO2018027463A1 (fr) Procédé et système d'application d'analyse de mots-clés dans des mégadonnées
WO2018027460A1 (fr) Procédé et système de comparaison d'algorithmes
WO2018023483A1 (fr) Procédé et système de mise en œuvre de recherche en temps réel de différentes langues dans des mégadonnées
WO2018027343A1 (fr) Procédé et système de mise en œuvre d'une recherche vocale
WO2018027464A1 (fr) Procédé et système de mise en oeuvre pour différentes parties de discours pendant une recherche dans des mégadonnées
WO2018027341A1 (fr) Procédé et système de recherche par mots-clés en fonction de leur catégorie dans des mégadonnées
WO2018023484A1 (fr) Procédé et système de mise en œuvre d'une recherche de différentes parties de discours dans des données volumineuses
WO2018023482A1 (fr) Procédé et système pour mettre en œuvre une recherche vocale
WO2018023481A1 (fr) Procédé et système d'application d'un synonyme dans une recherche de mégadonnées
WO2018027462A1 (fr) Procédé et système de mise en oeuvre de recherche et de comparaison
WO2018027465A1 (fr) Procédé et système permettant une sauvegarde en temps réel sur la base de mégadonnées
WO2018027459A1 (fr) Procédé et système de classification et de comparaison d'une application dans des mégadonnées
WO2018027469A1 (fr) Procédé et système d'application de mot-clé dans une mémoire de mégadonnées

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12861867

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 12861867

Country of ref document: EP

Kind code of ref document: A1