US20170154385A1 - System and method for automatic validation - Google Patents

Info

Publication number
US20170154385A1
Authority
US
United States
Prior art keywords
electronic document
dataset
transaction
data
created
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US15/361,934
Inventor
Noam Guzman
Isaac SAFT
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Vatbox Ltd
Original Assignee
Vatbox Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Vatbox Ltd filed Critical Vatbox Ltd
Priority to US15/361,934 priority Critical patent/US20170154385A1/en
Priority to PCT/US2016/066845 priority patent/WO2017142615A1/en
Priority to GB1812979.1A priority patent/GB2562669A/en
Priority to US15/379,971 priority patent/US20170161315A1/en
Priority to DE112016006434.9T priority patent/DE112016006434T5/en
Priority to PCT/US2016/067716 priority patent/WO2017142618A1/en
Priority to CN201680083049.5A priority patent/CN108713198A/en
Priority to EP16890887.9A priority patent/EP3417383A4/en
Priority to US15/384,726 priority patent/US20170169292A1/en
Priority to PCT/US2016/068536 priority patent/WO2017142624A1/en
Priority to US15/390,068 priority patent/US20170169518A1/en
Priority to CN201680085018.3A priority patent/CN109313765A/en
Priority to PCT/US2016/068714 priority patent/WO2017160372A1/en
Priority to US15/391,369 priority patent/US20170169519A1/en
Priority to EP16894794.3A priority patent/EP3430584A4/en
Priority to PCT/US2017/012120 priority patent/WO2017131932A1/en
Priority to US15/398,108 priority patent/US20170185832A1/en
Assigned to VATBOX, LTD. reassignment VATBOX, LTD. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: GUZMAN, NOAM, SAFT, Isaac
Priority to EP17767105.4A priority patent/EP3430540A4/en
Priority to US15/415,286 priority patent/US20170193608A1/en
Priority to CN201780027071.2A priority patent/CN109219809A/en
Priority to PCT/US2017/014874 priority patent/WO2017160403A1/en
Priority to US15/416,563 priority patent/US20170193609A1/en
Priority to PCT/US2017/015087 priority patent/WO2017200603A1/en
Priority to EP17799796.2A priority patent/EP3458971A4/en
Priority to PCT/US2017/032854 priority patent/WO2017201012A1/en
Priority to PCT/US2017/032855 priority patent/WO2017201013A1/en
Priority to DE112017002533.8T priority patent/DE112017002533T5/en
Priority to CN201780030660.6A priority patent/CN109154949A/en
Priority to US15/596,489 priority patent/US20170323395A1/en
Priority to DE112017002535.4T priority patent/DE112017002535T5/en
Priority to US15/596,492 priority patent/US20170323006A1/en
Priority to GB1818559.5A priority patent/GB2565014A/en
Priority to US15/597,825 priority patent/US20170323157A1/en
Priority to GB1818560.3A priority patent/GB2565476A/en
Priority to PCT/US2017/033106 priority patent/WO2017201163A1/en
Priority to DE112017002569.9T priority patent/DE112017002569T5/en
Priority to US15/599,096 priority patent/US20170323106A1/en
Priority to GB1818561.1A priority patent/GB2565684B/en
Priority to PCT/US2017/033338 priority patent/WO2017201292A1/en
Publication of US20170154385A1 publication Critical patent/US20170154385A1/en
Priority to EP17849248.4A priority patent/EP3491554A4/en
Priority to PCT/US2017/043644 priority patent/WO2018048512A1/en
Priority to CN201780058542.6A priority patent/CN109791548A/en
Priority to US15/658,832 priority patent/US20180011846A1/en
Priority to PCT/US2017/045342 priority patent/WO2018027057A1/en
Priority to US15/668,416 priority patent/US20180018312A1/en
Priority to CN201780060466.2A priority patent/CN109791560A/en
Priority to US15/668,402 priority patent/US10509811B2/en
Priority to PCT/US2017/045338 priority patent/WO2018027054A1/en
Priority to GB1902676.4A priority patent/GB2568416A/en
Priority to PCT/US2017/045333 priority patent/WO2018027051A1/en
Priority to GB1902675.6A priority patent/GB2571636A/en
Priority to US15/668,426 priority patent/US20180046663A1/en
Priority to EP17837704.0A priority patent/EP3494495A4/en
Priority to DE112017003926.6T priority patent/DE112017003926T5/en
Priority to CN201780060456.9A priority patent/CN109791537A/en
Priority to US15/669,236 priority patent/US11138372B2/en
Priority to CN201780060455.4A priority patent/CN109791641A/en
Priority to PCT/US2017/045491 priority patent/WO2018027130A1/en
Priority to PCT/US2017/045554 priority patent/WO2018027158A1/en
Priority to US15/669,510 priority patent/US20180025225A1/en
Priority to US15/669,253 priority patent/US10387561B2/en
Priority to CN201780060462.4A priority patent/CN109791540A/en
Priority to EP17837762.8A priority patent/EP3494530A4/en
Priority to PCT/US2017/045488 priority patent/WO2018031402A1/en
Priority to EP17837779.2A priority patent/EP3494531A4/en
Priority to CN201780058567.6A priority patent/CN109791643A/en
Priority to US15/669,224 priority patent/US20180025224A1/en
Priority to EP17837759.4A priority patent/EP3494496A4/en
Priority to PCT/US2017/045497 priority patent/WO2018027133A1/en
Priority to EP17840055.2A priority patent/EP3497589A4/en
Priority to US15/674,165 priority patent/US20180025438A1/en
Priority to PCT/US2017/046317 priority patent/WO2018034941A1/en
Priority to US15/724,958 priority patent/US20180096435A1/en
Priority to PCT/US2017/055135 priority patent/WO2018067698A1/en
Priority to EP17859117.8A priority patent/EP3523771A4/en
Priority to CN201780071509.7A priority patent/CN110023970A/en
Priority to US15/782,905 priority patent/US10558880B2/en
Priority to CN201780070059.XA priority patent/CN109983489A/en
Priority to PCT/US2017/056448 priority patent/WO2018071737A1/en
Priority to EP17861011.9A priority patent/EP3526758A4/en
Priority to PCT/US2017/064191 priority patent/WO2018098507A1/en
Priority to US16/376,854 priority patent/US20190236126A1/en
Assigned to SILICON VALLEY BANK reassignment SILICON VALLEY BANK INTELLECTUAL PROPERTY SECURITY AGREEMENT Assignors: VATBOX LTD

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/12Accounting
    • G06Q40/123Tax preparation or submission
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/248Presentation of query results
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/5846Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using extracted text
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/93Document management systems
    • G06F17/30011
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/174Form filling; Merging
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/186Templates
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/22Character recognition characterised by the type of writing
    • G06V30/224Character recognition characterised by the type of writing of printed characters having additional code marks or containing code marks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/412Layout analysis of documents structured with printed lines or input boxes, e.g. business forms or tables
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/416Extracting the logical structure, e.g. chapters, sections or page numbers; Identifying elements of the document, e.g. authors
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/40Document-oriented image-based pattern recognition
    • G06V30/41Analysis of document content
    • G06V30/418Document matching, e.g. of document images

Definitions

  • the present disclosure relates generally to data analysis, and more particularly to validation of data.
  • VATs value-added taxes
  • Certain embodiments disclosed herein include a method for validating a transaction represented by an electronic document.
  • the method comprises: identifying, in the electronic document, at least one key field and at least one value; creating, based on the electronic document, a dataset, wherein the created dataset includes the at least one key field and the at least one value; analyzing the created dataset to determine at least one transaction parameter; creating a template for the transaction, wherein the template is a structured dataset including the determined at least one transaction parameter; and applying, in real-time, at least one rule to the created template to determine whether the transaction is validated.
  • Certain embodiments disclosed herein also include a non-transitory computer readable medium having stored thereon instructions for causing a processing circuitry to perform a process, the process comprising: identifying, in the electronic document, at least one key field and at least one value; creating, based on the electronic document, a dataset, wherein the created dataset includes the at least one key field and the at least one value; analyzing the created dataset to determine at least one transaction parameter; creating a template for the transaction, wherein the template is a structured dataset including the determined at least one transaction parameter; and applying, in real-time, at least one rule to the created template to determine whether the transaction is validated.
  • Certain embodiments disclosed herein also include a system for validating a transaction represented by an electronic document.
  • the system comprises: a processing circuitry; and a memory, the memory containing instructions that, when executed by the processing circuitry, configure the system to: identify, in the electronic document, at least one key field and at least one value; create, based on the electronic document, a dataset, wherein the created dataset includes the at least one key field and the at least one value; analyze the created dataset to determine at least one transaction parameter; create a template for the transaction, wherein the template is a structured dataset including the determined at least one transaction parameter; and apply, in real-time, at least one rule to the created template to determine whether the transaction is validated.
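  • The claimed steps (identify key fields and values, create a dataset, determine transaction parameters, create a template, apply rules) can be sketched roughly as follows. This is a minimal illustration only, not the disclosed implementation: the key field names, the `Template` fields, the "key: value" text layout, and the two rules are all assumptions chosen for the example.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical key fields; the disclosure only says key fields are identified.
KEY_FIELDS = ("merchant", "date", "total", "currency")


def identify_key_fields(text: str) -> Dict[str, str]:
    """Create a dataset of key fields and values from 'key: value' lines."""
    dataset: Dict[str, str] = {}
    for line in text.splitlines():
        match = re.match(r"\s*([A-Za-z ]+?)\s*:\s*(.+?)\s*$", line)
        if match and match.group(1).lower() in KEY_FIELDS:
            dataset[match.group(1).lower()] = match.group(2)
    return dataset


@dataclass(frozen=True)
class Template:
    """Structured dataset holding the determined transaction parameters."""
    merchant: str
    date: str
    total: float
    currency: str


def create_template(dataset: Dict[str, str]) -> Template:
    return Template(
        merchant=dataset["merchant"],
        date=dataset["date"],
        total=float(dataset["total"]),
        currency=dataset["currency"],
    )


Rule = Callable[[Template], bool]


def validate(template: Template, rules: List[Rule]) -> bool:
    """The transaction is validated only if every rule is satisfied."""
    return all(rule(template) for rule in rules)


# Assumed document text, e.g. as produced by OCR of an invoice image.
document_text = """Merchant: Mosden Ltd
Date: 2005-12-12
Total: 250.00
Currency: EUR
Logo: (image)"""

dataset = identify_key_fields(document_text)
template = create_template(dataset)
rules: List[Rule] = [
    lambda t: t.total > 0,                  # assumed rule
    lambda t: t.currency in {"EUR", "GBP"}  # assumed rule
]
is_validated = validate(template, rules)
```

  • In this sketch the "Logo" line is ignored because it is not among the assumed key fields, and the rules operate only on the structured template rather than on the raw document text.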
  • FIG. 1 is a network diagram utilized to describe the various disclosed embodiments.
  • FIG. 2 is a schematic diagram of a validation system according to an embodiment.
  • FIG. 3 is a flowchart illustrating a method for validating data according to an embodiment.
  • FIG. 4 is a flowchart illustrating a method for creating a dataset based on at least one electronic document according to an embodiment.
  • FIG. 5 is a flowchart illustrating a method for verifying an entity of a transaction according to an embodiment.
  • FIG. 6 is a flowchart illustrating a method for validating a transaction based on a first dataset and a second dataset according to an embodiment.
  • the various disclosed embodiments include a method and system for automatically validating transactions.
  • a dataset is created based on at least one electronic document.
  • a consumer enterprise indicated in the dataset is verified.
  • the dataset is analyzed to determine if a transaction indicated in the dataset is eligible for validation and, if so, a template of transaction attributes is created.
  • At least one rule is applied to the created template to determine if requirements for validation are met.
  • a notification indicating whether the transaction has been validated may be generated.
  • FIG. 1 shows an example network diagram 100 utilized to describe the various disclosed embodiments.
  • A validation system 120, a consumer enterprise system 130, a plurality of databases 140-1 through 140-N (hereinafter referred to individually as a database 140 and collectively as databases 140, merely for simplicity purposes), and a merchant enterprise system 150 are communicatively connected via a network 110.
  • the network 110 may be, but is not limited to, a wireless, cellular or wired network, a local area network (LAN), a wide area network (WAN), a metro area network (MAN), the Internet, the worldwide web (WWW), similar networks, and any combination thereof.
  • the consumer enterprise system 130 is associated with a consumer entity, and may store data related to purchases made by the consumer entity as well as data related to the consumer entity itself.
  • the consumer entity may be, but is not limited to, a business that purchases goods or services in a business-to-business (B2B) transaction.
  • the merchant enterprise system 150 is associated with a merchant entity, and may store data related to sales made by the merchant entity as well as data related to the merchant entity itself.
  • the merchant entity may be, but is not limited to, a business that sold goods or services in a B2B transaction.
  • Each of the consumer enterprise system 130 and the merchant enterprise system 150 may be, but is not limited to, a server, a database, an enterprise resource planning system, a customer relationship management system, or any other system storing relevant data.
  • the data stored by each of the consumer enterprise system 130 and the merchant enterprise system 150 may include, but is not limited to, electronic documents (e.g., an image file showing, for example, a scan of an invoice, a text file, a spreadsheet file, etc.).
  • Data included in the electronic document may be structured, semi-structured, unstructured, or a combination thereof.
  • the structured or semi-structured data may be in a format that is not recognized by the validation system 120 and, therefore, may be treated like unstructured data.
  • the databases 140 store at least validation data utilized for validating transactions.
  • the data may include, but is not limited to, requirements for validating data, data related to transactions, and the like.
  • the databases 140 may store regulatory requirements related to B2B transactions (e.g., requirements for validating reverse charge mechanism (RCM) transactions).
  • the requirements may be stored in the form of, for example, rules.
  • the validation system 120 is configured to create datasets based on electronic documents including data at least partially lacking a known structure (e.g., unstructured data, semi-structured data, or structured data having an unknown structure). To this end, the validation system 120 may be further configured to utilize optical character recognition (OCR) or other image processing to determine data in the electronic document.
  • The validation system 120 is configured to analyze the created datasets to identify transaction parameters of the transactions indicated in the electronic documents. In some embodiments, the validation system 120 may be configured to verify an entity indicated in the created datasets. In another embodiment, the validation system 120 may be configured to determine whether the created datasets are eligible for validation based on, e.g., whether each dataset meets at least one predetermined constraint.
  • the validation system 120 is configured to create a template based on the created datasets.
  • the template is a structured dataset including the identified transaction parameters.
  • the validation system 120 is configured to apply, in real-time, at least one rule to the created template to determine if requirements for validation are met by the transaction.
  • the validation system 120 may be further configured to compare information from a first electronic document with information from a second electronic document to determine whether the information in the first electronic document is accurate.
  • information from an invoice for a business transaction and information from a waybill associated with an item sold during the business transaction may be compared.
  • the validation system 120 may be further configured to utilize optical character recognition (OCR) to determine at least a portion of the information to be compared.
  • determining whether the data is validated may include determining whether the first electronic document is complete.
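  • A comparison between a first and a second electronic document, together with a completeness check on the first, might look like the sketch below. The field names, the case-insensitive matching, and the list of required fields are assumptions made for illustration; the disclosure does not specify how the comparison is carried out.

```python
from typing import Dict, List


def compare_documents(
    first: Dict[str, str], second: Dict[str, str], fields: List[str]
) -> Dict[str, bool]:
    """For each field present in both documents, report whether values agree."""
    result: Dict[str, bool] = {}
    for field in fields:
        if field in first and field in second:
            result[field] = (
                first[field].strip().lower() == second[field].strip().lower()
            )
    return result


def is_complete(document: Dict[str, str], required: List[str]) -> bool:
    """A document is treated as complete if every required field is non-empty."""
    return all(document.get(field, "").strip() for field in required)


# Assumed data extracted from an invoice and from the associated waybill.
invoice = {"item": "Steel bolts", "quantity": "500", "merchant": "Mosden Ltd"}
waybill = {"item": "steel bolts", "quantity": "450"}

agreement = compare_documents(invoice, waybill, ["item", "quantity", "merchant"])
complete = is_complete(invoice, ["item", "quantity", "merchant"])
```

  • Here the item descriptions agree and the quantities do not, so the invoice information would not be considered accurate under these assumed checks even though the invoice itself is complete.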
  • the validation system 120 may be configured to generate a notification based on the validation determination.
  • The validation system 120 may be further configured to send the notification to, for example, a user device 160 communicatively connected to the network 110.
  • the user device (UD) 160 may be, but is not limited to, a personal computer, a laptop, a tablet computer, a smartphone, a wearable computing device, or any other device capable of receiving and displaying notifications.
  • the user device 160 may be, for example, a user device of a business engaged in a transaction that wishes to know whether the transaction is validated.
  • The embodiments described herein above with respect to FIG. 1 are described with respect to one consumer enterprise system 130 and one merchant enterprise system 150 merely for simplicity purposes and without limitation on the disclosed embodiments. Multiple consumer enterprise systems, multiple merchant enterprise systems, or both, may be equally utilized without departing from the scope of the disclosure.
  • FIG. 2 is an example schematic diagram of the validation system 120 according to an embodiment.
  • The validation system 120 includes a processing circuitry 210 coupled to a memory 215, a storage 220, and a network interface 240.
  • The validation system 120 may include an optical character recognition (OCR) processor 230.
  • The components of the validation system 120 may be communicatively connected via a bus 250.
  • the processing circuitry 210 may be realized as one or more hardware logic components and circuits.
  • Illustrative types of hardware logic components include field programmable gate arrays (FPGAs), application-specific integrated circuits (ASICs), application-specific standard products (ASSPs), system-on-a-chip systems (SOCs), general-purpose microprocessors, microcontrollers, digital signal processors (DSPs), and the like, or any other hardware logic components that can perform calculations or other manipulations of information.
  • the memory 215 may be volatile (e.g., RAM, etc.), non-volatile (e.g., ROM, flash memory, etc.), or a combination thereof.
  • Computer readable instructions to implement one or more embodiments disclosed herein may be stored in the storage 220.
  • the memory 215 is configured to store software.
  • Software shall be construed broadly to mean any type of instructions, whether referred to as software, firmware, middleware, microcode, hardware description language, or otherwise. Instructions may include code (e.g., in source code format, binary code format, executable code format, or any other suitable format of code).
  • The instructions, when executed by the processing circuitry 210, cause the processing circuitry 210 to perform the various processes described herein. Specifically, the instructions, when executed, cause the processing circuitry 210 to validate transactions represented by electronic documents, as discussed herein.
  • the storage 220 may be magnetic storage, optical storage, and the like, and may be realized, for example, as flash memory or other memory technology, CD-ROM, Digital Versatile Disks (DVDs), or any other medium which can be used to store the desired information.
  • the OCR processor 230 may include, but is not limited to, a feature and/or pattern recognition unit (RU) 235 configured to identify patterns, features, or both, in unstructured data sets. Specifically, in an embodiment, the OCR processor 230 is configured to identify at least characters in the unstructured data. The identified characters may be utilized to create a validation dataset including data required for validation of a transaction.
  • The network interface 240 allows the validation system 120 to communicate with the consumer enterprise system 130, the databases 140, the merchant enterprise system 150, or a combination thereof, for the purpose of, for example, collecting metadata, retrieving data, and the like. Additionally, the network interface 240 allows the validation system 120 to communicate with the user device 160 in order to send notifications regarding validation of data.
  • FIG. 3 is an example flowchart 300 illustrating a method for validating data according to an embodiment.
  • The method may be performed by a validation system (e.g., the validation system 120).
  • the validation may be validation of a transaction (e.g., a transaction involving the reverse charge mechanism) between a consumer entity (e.g., a buyer business) and a merchant entity (e.g., a seller business).
  • a dataset is created based on at least one electronic document including information related to a transaction.
  • Each of the at least one electronic document may include, but is not limited to, unstructured data, semi-structured data, structured data with structure that is unanticipated or unannounced, or a combination thereof.
  • S 310 may further include analyzing the electronic document using optical character recognition (OCR) to determine data in the electronic document, identifying key fields in the data, identifying values in the data, or a combination thereof.
  • analyzing the dataset may include, but is not limited to, determining transaction parameters such as, but not limited to, at least one entity identifier (e.g., a consumer enterprise identifier, a merchant enterprise identifier, or both), information related to the transaction (e.g., a date, a time, a price, a type of good or service sold, etc.), or both.
  • analyzing the dataset may also include identifying the transaction based on the dataset.
  • any entity indicated in the created dataset may be verified.
  • the entity to be verified may be a merchant enterprise of the transaction.
  • the verification is based on the determined at least one entity identifier.
  • S 330 may include searching in at least one database based on the at least one identifier. Verifying entities of transactions based on datasets created using electronic documents is described further herein below with respect to FIG. 5.
  • S 340 may further include determining at least one constraint based on the created dataset. In a further embodiment, determining the at least one constraint may include searching in at least one database based on the created dataset (e.g., using a location of the merchant enterprise indicated in the created dataset). In yet a further embodiment, S 340 may also include analyzing at least one reporting requirements electronic document (e.g., a VAT reclaim form) to determine the at least one constraint. The analysis may further include performing OCR or other image processing on each reporting requirements electronic document.
  • When it is determined that the transaction is not eligible for validation, replacement information may be retrieved from at least one data source and included in the created dataset. In a further embodiment, upon retrieving the replacement information, execution continues with S 350. In another embodiment, upon retrieving the replacement information, it is determined whether the dataset with the replacement information is eligible for validation and, if so, execution continues with S 350; otherwise, execution terminates.
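  • The eligibility check with a single retry after retrieving replacement information (the second of the two variants described above) could be sketched as follows. The constraint functions, the `KNOWN_LOCATIONS` lookup table standing in for an external data source, and the field names are all hypothetical.

```python
from typing import Callable, Dict, List, Optional

Dataset = Dict[str, str]
Constraint = Callable[[Dataset], bool]


def is_eligible(dataset: Dataset, constraints: List[Constraint]) -> bool:
    """A dataset is eligible only if it meets every constraint."""
    return all(constraint(dataset) for constraint in constraints)


def check_eligibility(
    dataset: Dataset,
    constraints: List[Constraint],
    fetch_replacement: Callable[[Dataset], Dataset],
) -> Optional[Dataset]:
    """Return the dataset to pass on to template creation, or None to terminate."""
    if is_eligible(dataset, constraints):
        return dataset
    # Not eligible: merge in replacement information and re-check once.
    patched = {**dataset, **fetch_replacement(dataset)}
    return patched if is_eligible(patched, constraints) else None


# Hypothetical data source keyed by merchant name.
KNOWN_LOCATIONS = {"Mosden Ltd": {"merchant_country": "DE"}}


def fetch_replacement(dataset: Dataset) -> Dataset:
    return KNOWN_LOCATIONS.get(dataset.get("merchant", ""), {})


# Assumed constraints: a merchant country and a date must be present.
constraints: List[Constraint] = [
    lambda d: bool(d.get("merchant_country")),
    lambda d: bool(d.get("date")),
]

ready = check_eligibility(
    {"merchant": "Mosden Ltd", "date": "2005-12-12"}, constraints, fetch_replacement
)
rejected = check_eligibility({"merchant": "Unknown Co"}, constraints, fetch_replacement)
```

  • In this sketch the first dataset becomes eligible once the missing country is filled in, while the second cannot be repaired and execution would terminate.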
  • a template of the transaction is created.
  • the template may be, but is not limited to, a data structure including a plurality of fields.
  • the fields may include the identified transaction parameters.
  • the fields may be predefined.
  • Creating templates from electronic documents allows for faster processing due to the structured nature of the created templates. For example, query and manipulation operations may be performed more efficiently on structured datasets than on datasets lacking such structure. Further, by organizing information from electronic documents into structured datasets, the amount of storage required for saving information contained in electronic documents may be significantly reduced. Electronic documents are often images that require more storage space than datasets containing the same information. For example, datasets representing data from 100,000 image electronic documents can be saved as data records in a text file. The size of such a text file would be significantly less than the size of the 100,000 images.
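  • To illustrate why structured records are convenient to store and query, the sketch below saves two templates as rows of a CSV text buffer and then filters them by amount. The field list and record values are invented for the example, and CSV is only one possible text format; the disclosure does not name one.

```python
import csv
import io

# Hypothetical predefined template fields.
FIELDS = ["invoice_number", "merchant", "date", "total"]

templates = [
    {"invoice_number": "A-001", "merchant": "Mosden Ltd",
     "date": "2005-12-12", "total": "250.00"},
    {"invoice_number": "A-002", "merchant": "Acme GmbH",
     "date": "2005-12-13", "total": "990.50"},
]

# Save the templates as data records in a text file (in memory here).
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=FIELDS)
writer.writeheader()
writer.writerows(templates)
text_size_bytes = len(buffer.getvalue().encode("utf-8"))

# Query the structured records: invoice numbers with a total above 500.
buffer.seek(0)
large_invoices = [
    row["invoice_number"]
    for row in csv.DictReader(buffer)
    if float(row["total"]) > 500
]
```

  • Each record occupies a few dozen bytes of text, and the query is a simple field comparison that needs no image processing.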
  • a notification may be generated.
  • the notification may indicate the results of the application of the at least one rule (i.e., whether the transaction is validated).
  • FIG. 4 is an example flowchart S 310 illustrating a method for creating a dataset based on at least one electronic document according to an embodiment.
  • The key fields may include, but are not limited to, a merchant's name and address, a date, a currency, a good or service sold, a transaction identifier, an invoice number, and so on.
  • An electronic document may include unnecessary details that would not be considered to be key values. As an example, a logo of the merchant may not be required and, thus, is not a key value.
  • a list of key fields may be predefined, and pieces of data that may match the key fields are extracted. A cleaning process is then performed to ensure that the information is accurately presented. For example, if the OCR yields data presented as “1211212005”, the cleaning process converts this data to Dec. 12, 2005. As another example, if a name is presented as “Mo$den”, it is changed to “Mosden”.
  • the cleaning process may be performed using external information resources, such as dictionaries, calendars, and the like.
  • S 430 results in a complete set of the predefined key fields and their respective values.
  • a structured dataset is generated.
  • the generated dataset includes the identified key fields and values.
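  • The cleaning step described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the disclosed implementation: the function names, the symbol substitution table, and the assumption that “1211212005” arises from an MM/DD/YYYY date whose two slashes were read as the digit 1 are all hypothetical.

```python
import re
from datetime import date
from typing import Dict, Optional

# Hypothetical table of symbols that OCR may confuse with letters inside a name.
SYMBOL_TO_LETTER = {"$": "s", "0": "o", "1": "l", "@": "a"}


def clean_name(raw: str) -> str:
    """Replace non-letter characters inside a name with their likely letters."""
    return "".join(
        ch if ch.isalpha() or ch.isspace() else SYMBOL_TO_LETTER.get(ch, ch)
        for ch in raw
    )


def clean_date(raw: str) -> Optional[date]:
    """Recover an MM/DD/YYYY date, assuming each slash may have been read as '1'."""
    match = re.fullmatch(r"(\d{2})[/1](\d{2})[/1](\d{4})", raw)
    if match is None:
        return None
    month, day, year = (int(group) for group in match.groups())
    try:
        return date(year, month, day)  # the calendar check rejects impossible dates
    except ValueError:
        return None


def build_dataset(extracted: Dict[str, str]) -> Dict[str, object]:
    """Assemble a structured dataset of key fields and cleaned values."""
    return {
        "merchant_name": clean_name(extracted["merchant_name"]),
        "date": clean_date(extracted["date"]),
    }
```

  • For instance, build_dataset({"merchant_name": "Mo$den", "date": "1211212005"}) yields a merchant name of "Mosden" and the date Dec. 12, 2005. A dictionary lookup could replace the substitution table where an external resource is available.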
  • FIG. 5 is an example flowchart S 330 illustrating a method for verifying an entity of a transaction based on a dataset created using an electronic document according to an embodiment.
  • the entity to be verified may be a merchant enterprise of a transaction.
  • the verification may be utilized to check that the merchant enterprise is registered for reverse charge mechanism transactions (e.g., if the merchant enterprise is VAT registered in a particular country, if the merchant enterprise is European Union certified, etc.). If the merchant enterprise is not registered for reverse charge transactions, then any reverse charge mechanism transaction including the merchant enterprise will be determined to not be valid regardless of other requirements for validation.
  • At S 510 at least one entity identifier in the dataset is determined.
  • Each entity identifier indicates a respective entity.
  • the at least one entity identifier may be, but is not limited to, a name of the entity, a location of the entity (e.g., an address), or both.
  • the at least one entity is any entity required in order to identify the transaction (e.g., a buying entity, a selling entity, a shipping entity, etc.).
  • the at least one entity may be predetermined.
  • the at least one entity may include a consumer enterprise system and a merchant enterprise system.
  • At S 520 at least one database is searched using the at least one entity identifier.
  • the at least one database may include, but is not limited to, databases storing entity identifiers of registered entities.
  • S 520 may include determining the at least one database to be searched based on a type of the transaction to be validated.
  • the entity is verified if the entity identifier was found in any of the at least one database.
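  • The entity verification of FIG. 5 can be sketched as a lookup of normalized identifiers in registries selected by transaction type. The registry contents, the transaction type key, and the function names below are hypothetical placeholders; an actual deployment would presumably query external databases of registered entities.

```python
from typing import Dict, Iterable, List, Set

# Hypothetical in-memory registries keyed by transaction type.
# Entries are stored already normalized (lowercase, single spaces).
REGISTRIES_BY_TYPE: Dict[str, List[Set[str]]] = {
    "reverse_charge": [
        {"mosden ltd, 1 main st, london", "acme gmbh, berlin"},
        {"example bv, amsterdam"},
    ],
}


def normalize(identifier: str) -> str:
    """Lowercase the identifier and collapse runs of whitespace."""
    return " ".join(identifier.lower().split())


def verify_entity(identifiers: Iterable[str], transaction_type: str) -> bool:
    """Return True if any identifier is found in any registry for the type."""
    wanted = {normalize(i) for i in identifiers}
    registries = REGISTRIES_BY_TYPE.get(transaction_type, [])
    return any(wanted & registry for registry in registries)
```

  • Under these assumptions, an entity is treated as verified as soon as one of its identifiers appears in one of the selected registries, which mirrors the "found in any of the at least one database" condition.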
  • FIG. 6 is an example flowchart 600 illustrating a method for validating a transaction based on a first dataset and a second dataset according to an embodiment.
  • the method includes validating based on a first electronic document including a scanned image of an invoice for a transaction and a second electronic document including a scanned image of a waybill related to the transaction.
  • the data from the second electronic document may be utilized to verify that the data from the first electronic document is accurate.
  • a first dataset is created based on a first electronic document related to a transaction.
  • the first electronic document may include, but is not limited to, unstructured data, semi-structured data, structured data with structure that is unanticipated or unannounced, or a combination thereof.
  • S 610 may further include analyzing the first electronic document using optical character recognition (OCR), identifying key fields in the data, identifying values in the data, or a combination thereof.
  • an entity indicated in the first electronic document may be verified.
  • the entity to be verified may be a consumer enterprise of the transaction.
  • the verification is based on the determined at least one identifier.
  • S 620 may include searching in at least one database based on the at least one identifier. Verifying entities of transactions based on datasets created using electronic documents is described further herein above with respect to FIG. 5 .
  • a second electronic document is obtained.
  • S 630 may include searching, based on the first dataset, in at least one database for an electronic document that is related to the first electronic document. The searching may be further based on metadata of the second electronic document, data in the second electronic document, or both.
  • the second electronic document may be related to the first electronic document if the metadata and data of the second electronic document match the first dataset above a predetermined threshold.
  • At S 650 , it is determined whether the first dataset is eligible for validation and, if so, execution continues with S 660 ; otherwise, execution terminates.
  • S 650 includes comparing the first dataset to the second dataset.
  • the first dataset is determined to be accurate if the first dataset matches the second dataset above a predetermined threshold.
  • S 650 may include analyzing the first dataset to determine at least one transaction parameter, where the comparison is based on the at least one transaction parameter.
  • S 650 may further include determining whether the first dataset meets at least one predetermined constraint.
  • a dataset may be eligible for validation if, e.g., the dataset meets the at least one predetermined constraint.
  • the at least one predetermined constraint may include, but is not limited to, requirements on types of information needed for validation, accuracy requirements, or a combination thereof. For example, if an electronic document does not include a country for the consumer enterprise in a transaction or a price of the transaction, validation may not be possible. Determining whether the transaction is eligible for validation may reduce use of computing resources by only validating datasets meeting minimum requirements.
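  • The comparison of the first dataset (e.g., from an invoice) against the second dataset (e.g., from a waybill) can be sketched as a match ratio over shared transaction parameters. The field names, the exact-match comparison after normalization, and the inclusive threshold test are assumptions made for illustration; the disclosure does not specify how the match is scored.

```python
from typing import Dict, Sequence


def match_ratio(first: Dict[str, object], second: Dict[str, object],
                parameters: Sequence[str]) -> float:
    """Fraction of the parameters present in both datasets whose values agree."""
    compared = [p for p in parameters if p in first and p in second]
    if not compared:
        return 0.0
    matches = sum(
        1 for p in compared
        if str(first[p]).strip().lower() == str(second[p]).strip().lower()
    )
    return matches / len(compared)


def is_accurate(first: Dict[str, object], second: Dict[str, object],
                parameters: Sequence[str], threshold: float = 0.8) -> bool:
    """Treat the first dataset as accurate if the match ratio reaches the threshold."""
    return match_ratio(first, second, parameters) >= threshold
```

  • A fuzzy string comparison could replace the exact match where OCR noise is expected in both documents.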
  • a template of the transaction is created.
  • the template may be, but is not limited to, a structured dataset including the identified transaction parameters.
  • At S 670 at least one rule is applied, in real-time, to the created template to determine if requirements for validation are met.
  • S 670 includes retrieving the at least one rule from at least one database (e.g., a database of a regulatory authority that establishes requirements for reverse charge mechanism transactions or other transactions). Applying the at least one rule in real-time allows for automatic application of current rules for transaction validation, thereby increasing accuracy.
  • the phrase “at least one of” followed by a listing of items means that any of the listed items can be utilized individually, or any combination of two or more of the listed items can be utilized. For example, if a system is described as including “at least one of A, B, and C,” the system can include A alone; B alone; C alone; A and B in combination; B and C in combination; A and C in combination; or A, B, and C in combination.
  • a non-transitory computer readable medium is any computer readable medium except for a transitory propagating signal.

Abstract

A system and method for validating a transaction represented by an electronic document. The method includes: identifying, in the electronic document, at least one key field and at least one value; creating, based on the electronic document, a dataset, wherein the created dataset includes the at least one key field and the at least one value; analyzing the created dataset to determine at least one transaction parameter; creating a template for the transaction, wherein the template is a structured dataset including the determined at least one transaction parameter; and applying, in real-time, at least one rule to the created template to determine whether the transaction is validated.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • This application claims the benefit of U.S. Provisional Application No. 62/260,553 filed on Nov. 29, 2015, and of U.S. Provisional Application No. 62/261,355 filed on Dec. 1, 2015, the contents of which are hereby incorporated by reference.
  • TECHNICAL FIELD
  • The present disclosure relates generally to data analysis, and more particularly to validation of data.
  • BACKGROUND
  • As businesses increasingly rely on technology to manage data related to operations, suitable systems for properly managing and validating data have become crucial to success. Particularly for large businesses, the amount of data utilized daily by businesses can be overwhelming. Accordingly, manual review and validation of such data is impractical, at best. In addition to normal sales data, businesses in countries where value-added taxes (VATs) are applied collect and utilize even more data, thereby raising additional potential points of failure.
  • The challenges facing customers seeking a refund and, in particular, seeking VAT refunds, may result in customers becoming discouraged and failing to follow through on obtaining their refunds. This issue is further compounded when the customer is an employee of an enterprise because the customer is not directly benefiting from the refund. Moreover, employees may submit irrelevant or duplicate information that is unnecessary for seeking refunds. Filtering through such unnecessary information may be time-consuming, costly, and subject to a large degree of human error.
  • Moreover, business-to-business (B2B) transactions typically raise further tax issues, especially when VATs are involved. Previously, businesses engaging in B2B transactions with businesses in other countries needed to seek separate registrations and payments for VATs. To address these challenges, the reverse charge mechanism (RCM) was developed by the European Union. The RCM allows a first party of a transaction to charge the other party only for the actual cost of the transaction, i.e., excluding VAT charges. However, to utilize the RCM, businesses must collect and report various data. Manual collection and use of the data is resource-intensive and error-prone. Further, validation of RCM data is inefficient.
  • Additionally, many existing solutions for validating based on data face challenges in obtaining the data required for validation. Specifically, existing solutions typically require either structured data or data that otherwise adheres to particular format requirements (e.g., a required size of a scanned image, markings indicating validation-related information, etc.) in order to identify and utilize the data. Such existing solutions face challenges when appropriately formatted data cannot be obtained. In particular, such existing solutions may be unable to complete validation or may return inaccurate results (e.g., false positives and false negatives) when required data cannot be properly identified. Such issues may occur when, for example, data is unstructured or semi-structured, or when data is at least partially structured but in an unrecognizable format so as to effectively render the data unstructured for analysis purposes. These issues are compounded when multiple sets of data requirements (e.g., requirements for VAT refunds and for RCM validation) must be met. Manual checking to account for these issues is inefficient and introduces new potential sources of error.
  • It would therefore be advantageous to provide a solution that would overcome the deficiencies of the prior art.
  • SUMMARY
  • A summary of several example embodiments of the disclosure follows. This summary is provided for the convenience of the reader to provide a basic understanding of such embodiments and does not wholly define the breadth of the disclosure. This summary is not an extensive overview of all contemplated embodiments, and is intended to neither identify key or critical elements of all embodiments nor to delineate the scope of any or all aspects. Its sole purpose is to present some concepts of one or more embodiments in a simplified form as a prelude to the more detailed description that is presented later. For convenience, the term “some embodiments” may be used herein to refer to a single embodiment or multiple embodiments of the disclosure.
  • Certain embodiments disclosed herein include a method for validating a transaction represented by an electronic document. The method comprises: identifying, in the electronic document, at least one key field and at least one value; creating, based on the electronic document, a dataset, wherein the created dataset includes the at least one key field and the at least one value; analyzing the created dataset to determine at least one transaction parameter; creating a template for the transaction, wherein the template is a structured dataset including the determined at least one transaction parameter; and applying, in real-time, at least one rule to the created template to determine whether the transaction is validated.
  • Certain embodiments disclosed herein also include a non-transitory computer readable medium having stored thereon instructions for causing a processing circuitry to perform a process, the process comprising: identifying, in the electronic document, at least one key field and at least one value; creating, based on the electronic document, a dataset, wherein the created dataset includes the at least one key field and the at least one value; analyzing the created dataset to determine at least one transaction parameter; creating a template for the transaction, wherein the template is a structured dataset including the determined at least one transaction parameter; and applying, in real-time, at least one rule to the created template to determine whether the transaction is validated.
  • Certain embodiments disclosed herein also include a system for validating a transaction represented by an electronic document. The system comprises: a processing circuitry; and a memory, the memory containing instructions that, when executed by the processing circuitry, configure the system to: identify, in the electronic document, at least one key field and at least one value; create, based on the electronic document, a dataset, wherein the created dataset includes the at least one key field and the at least one value; analyze the created dataset to determine at least one transaction parameter; create a template for the transaction, wherein the template is a structured dataset including the determined at least one transaction parameter; and apply, in real-time, at least one rule to the created template to determine whether the transaction is validated.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The subject matter disclosed herein is particularly pointed out and distinctly claimed in the claims at the conclusion of the specification. The foregoing and other objects, features, and advantages of the disclosed embodiments will be apparent from the following detailed description taken in conjunction with the accompanying drawings.
  • FIG. 1 is a network diagram utilized to describe the various disclosed embodiments.
  • FIG. 2 is a schematic diagram of a validation system according to an embodiment.
  • FIG. 3 is a flowchart illustrating a method for validating data according to an embodiment.
  • FIG. 4 is a flowchart illustrating a method for creating a dataset based on at least one electronic document according to an embodiment.
  • FIG. 5 is a flowchart illustrating a method for verifying an entity of a transaction according to an embodiment.
  • FIG. 6 is a flowchart illustrating a method for validating a transaction based on a first dataset and a second dataset according to an embodiment.
  • DETAILED DESCRIPTION
  • It is important to note that the embodiments disclosed herein are only examples of the many advantageous uses of the innovative teachings herein. In general, statements made in the specification of the present application do not necessarily limit any of the various claimed embodiments. Moreover, some statements may apply to some inventive features but not to others. In general, unless otherwise indicated, singular elements may be in plural and vice versa with no loss of generality. In the drawings, like numerals refer to like parts through several views.
  • The various disclosed embodiments include a method and system for automatically validating transactions. In an embodiment, a dataset is created based on at least one electronic document. In an optional embodiment, a consumer enterprise indicated in the dataset is verified. The dataset is analyzed to determine if a transaction indicated in the dataset is eligible for validation and, if so, a template of transaction attributes is created. At least one rule is applied to the created template to determine if requirements for validation are met. A notification indicating whether the transaction has been validated may be generated.
  • FIG. 1 shows an example network diagram 100 utilized to describe the various disclosed embodiments. In the example network diagram 100, a validation system 120, a consumer enterprise system 130, a plurality of databases 140-1 through 140-N (hereinafter referred to individually as a database 140 and collectively as databases 140, merely for simplicity purposes), and a merchant enterprise system 150, are communicatively connected via a network 110. The network 110 may be, but is not limited to, a wireless, cellular or wired network, a local area network (LAN), a wide area network (WAN), a metro area network (MAN), the Internet, the worldwide web (WWW), similar networks, and any combination thereof.
  • The consumer enterprise system 130 is associated with a consumer entity, and may store data related to purchases made by the consumer entity as well as data related to the consumer entity itself. The consumer entity may be, but is not limited to, a business that purchases goods or services in a business-to-business (B2B) transaction. The merchant enterprise system 150 is associated with a merchant entity, and may store data related to sales made by the merchant entity as well as data related to the merchant entity itself. The merchant entity may be, but is not limited to, a business that sold goods or services in a B2B transaction. Each of the consumer enterprise system 130 and the merchant enterprise system 150 may be, but is not limited to, a server, a database, an enterprise resource planning system, a customer relationship management system, or any other system storing relevant data.
  • The data stored by each of the consumer enterprise system 130 and the merchant enterprise system 150 may include, but is not limited to, electronic documents (e.g., an image file showing, for example, a scan of an invoice, a text file, a spreadsheet file, etc.). Data included in the electronic document may be structured, semi-structured, unstructured, or a combination thereof. The structured or semi-structured data may be in a format that is not recognized by the validation system 120 and, therefore, may be treated like unstructured data.
  • The databases 140 store at least validation data utilized for validating transactions. The data may include, but is not limited to, requirements for validating data, data related to transactions, and the like. As a non-limiting example, the databases 140 may store regulatory requirements related to B2B transactions (e.g., requirements for validating reverse charge mechanism (RCM) transactions). The requirements may be stored in the form of, for example, rules.
  • In an embodiment, the validation system 120 is configured to create datasets based on electronic documents including data at least partially lacking a known structure (e.g., unstructured data, semi-structured data, or structured data having an unknown structure). To this end, the validation system 120 may be further configured to utilize optical character recognition (OCR) or other image processing to determine data in the electronic document.
  • In an embodiment, the validation system 120 is configured to analyze the created datasets to identify transaction parameters of the transactions indicated in the electronic documents. In some embodiments, the validation system 120 may be configured to verify an entity indicated in the created datasets. In another embodiment, the validation system 120 may be configured to determine whether the created datasets are eligible for validation based on, e.g., whether each dataset meets at least one predetermined constraint.
  • In an embodiment, the validation system 120 is configured to create a template based on the created datasets. The template is a structured dataset including the identified transaction parameters. In a further embodiment, the validation system 120 is configured to apply, in real-time, at least one rule to the created template to determine if requirements for validation are met by the transaction.
  • In another example implementation, the validation system 120 may be further configured to compare information from a first electronic document with information from a second electronic document to determine whether the information in the first electronic document is accurate. As a non-limiting example, information from an invoice for a business transaction and information from a waybill associated with an item sold during the business transaction may be compared. The validation system 120 may be further configured to utilize optical character recognition (OCR) to determine at least a portion of the information to be compared. In yet another example implementation, determining whether the data is validated may include determining whether the first electronic document is complete.
  • The validation system 120 may be configured to generate a notification based on the validation determination. The validation system 120 may be further configured to send the notification to, for example, a user device 160 communicatively connected to the network 110. The user device (UD) 160 may be, but is not limited to, a personal computer, a laptop, a tablet computer, a smartphone, a wearable computing device, or any other device capable of receiving and displaying notifications. The user device 160 may be, for example, a user device of a business engaged in a transaction that wishes to know whether the transaction is validated.
  • It should be noted that the embodiments described herein above with respect to FIG. 1 are described with respect to one consumer enterprise system 130 and one merchant enterprise system 150 merely for simplicity purposes and without limitation on the disclosed embodiments. Multiple consumer enterprise systems, multiple merchant enterprise systems, or both, may be equally utilized without departing from the scope of the disclosure.
  • FIG. 2 is an example schematic diagram of the validation system 120 according to an embodiment. The validation system 120 includes a processing circuitry 210 coupled to a memory 215, a storage 220, and a network interface 240. In an optional embodiment, the validation system 120 may include an optical character recognition (OCR) processor 230. In another embodiment, the components of the validation system 120 may be communicatively connected via a bus 250.
  • The processing circuitry 210 may be realized as one or more hardware logic components and circuits. For example, and without limitation, illustrative types of hardware logic components that can be used include field programmable gate arrays (FPGAs), application-specific integrated circuits (ASICs), Application-specific standard products (ASSPs), system-on-a-chip systems (SOCs), general-purpose microprocessors, microcontrollers, digital signal processors (DSPs), and the like, or any other hardware logic components that can perform calculations or other manipulations of information.
  • The memory 215 may be volatile (e.g., RAM, etc.), non-volatile (e.g., ROM, flash memory, etc.), or a combination thereof. In one configuration, computer readable instructions to implement one or more embodiments disclosed herein may be stored in the storage 220.
  • In another embodiment, the memory 215 is configured to store software. Software shall be construed broadly to mean any type of instructions, whether referred to as software, firmware, middleware, microcode, hardware description language, or otherwise. Instructions may include code (e.g., in source code format, binary code format, executable code format, or any other suitable format of code). The instructions, when executed by the processing circuitry 210, cause the processing circuitry 210 to perform the various processes described herein. Specifically, the instructions, when executed, cause the processing circuitry 210 to validate transactions based on electronic documents, as discussed herein.
  • The storage 220 may be magnetic storage, optical storage, and the like, and may be realized, for example, as flash memory or other memory technology, CD-ROM, Digital Versatile Disks (DVDs), or any other medium which can be used to store the desired information.
  • The OCR processor 230 may include, but is not limited to, a feature and/or pattern recognition unit (RU) 235 configured to identify patterns, features, or both, in unstructured data sets. Specifically, in an embodiment, the OCR processor 230 is configured to identify at least characters in the unstructured data. The identified characters may be utilized to create a validation dataset including data required for validation of a transaction.
  • The network interface 240 allows the validation system 120 to communicate with the consumer enterprise system 130, the databases 140, the merchant enterprise system 150, or a combination thereof, for the purpose of, for example, collecting metadata, retrieving data, and the like. Additionally, the network interface 240 allows the validation system 120 to communicate with the user device 160 in order to send notifications regarding validation of data.
  • It should be understood that the embodiments described herein are not limited to the specific architecture illustrated in FIG. 2, and other architectures may be equally used without departing from the scope of the disclosed embodiments.
  • FIG. 3 is an example flowchart 300 illustrating a method for validating data according to an embodiment. In an embodiment, the method may be performed by a validation system (e.g., the validation system 120). In a further embodiment, the validation may be validation of a transaction (e.g., a transaction involving the reverse charge mechanism) between a consumer entity (e.g., a buyer business) and a merchant entity (e.g., a seller business).
  • At S310, a dataset is created based on at least one electronic document including information related to a transaction. Each of the at least one electronic document may include, but is not limited to, unstructured data, semi-structured data, structured data with structure that is unanticipated or unannounced, or a combination thereof. In an embodiment, S310 may further include analyzing the electronic document using optical character recognition (OCR) to determine data in the electronic document, identifying key fields in the data, identifying values in the data, or a combination thereof. Creating datasets based on electronic documents is described further herein below with respect to FIG. 4.
  • At S320, the created dataset is analyzed. In an embodiment, analyzing the dataset may include, but is not limited to, determining transaction parameters such as, but not limited to, at least one entity identifier (e.g., a consumer enterprise identifier, a merchant enterprise identifier, or both), information related to the transaction (e.g., a date, a time, a price, a type of good or service sold, etc.), or both. In a further embodiment, analyzing the dataset may also include identifying the transaction based on the dataset.
  • At optional S330, any entity indicated in the created dataset may be verified. In an example, the entity to be verified may be a merchant enterprise of the transaction. In an embodiment, the verification is based on the determined at least one entity identifier. In a further embodiment, S330 may include searching in at least one database based on the at least one identifier. Verifying entities of transactions based on datasets created using electronic documents is described further herein below with respect to FIG. 5.
  • At optional S340, it is determined, based on the analysis, whether the created dataset is eligible for validation and, if so, execution continues with S350; otherwise, execution terminates. In an embodiment, S340 may include determining whether the created dataset meets at least one predetermined constraint. A dataset may be eligible for validation if, e.g., the dataset meets the at least one predetermined constraint. The at least one predetermined constraint may include, but is not limited to, requirements on types of information needed for validation, accuracy requirements, or a combination thereof. For example, if an electronic document does not include a country for the merchant enterprise in a transaction or a price of the transaction, validation may not be possible. Determining whether the transaction is eligible for validation may reduce use of computing resources by only validating datasets meeting minimum requirements.
  • In another embodiment, S340 may further include determining at least one constraint based on the created dataset. In a further embodiment, determining the at least one constraint may include searching in at least one database based on the created dataset (e.g., using a location of the merchant enterprise indicated in the created dataset). In yet a further embodiment, S340 may also include analyzing at least one reporting requirements electronic document (e.g., a VAT reclaim form) to determine the at least one constraint. The analysis may further include performing OCR or other image processing on each reporting requirements electronic document.
  • In another embodiment, when it is determined that the transaction is not eligible for validation, replacement information may be retrieved from at least one data source and included in the created dataset. In a further embodiment, upon retrieving the replacement information, execution continues with S350. In another embodiment, upon retrieving the replacement information, it is determined whether the dataset with the replacement information is eligible for validation and, if so, execution continues with S350; otherwise, execution terminates.
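  • The eligibility check of S340, together with the optional retrieval of replacement information, can be sketched as below. The required field names and the dictionary standing in for the replacement data source are hypothetical; the sketch follows the embodiment in which eligibility is rechecked after the replacement information is added.

```python
from typing import Dict, List, Optional, Sequence

# Hypothetical minimum information needed before validation is attempted.
REQUIRED_FIELDS = ("merchant_country", "price")


def missing_fields(dataset: Dict[str, object],
                   required: Sequence[str] = REQUIRED_FIELDS) -> List[str]:
    """List the required fields that are absent or empty in the dataset."""
    return [f for f in required if dataset.get(f) in (None, "")]


def ensure_eligible(dataset: Dict[str, object],
                    replacement_source: Dict[str, object]) -> Optional[Dict[str, object]]:
    """Fill gaps from a replacement source, then recheck eligibility.

    Returns the completed dataset, or None if it is still not eligible.
    """
    completed = dict(dataset)  # leave the caller's dataset unchanged
    for field in missing_fields(completed):
        if field in replacement_source:
            completed[field] = replacement_source[field]
    return completed if not missing_fields(completed) else None
```

  • Returning None corresponds to execution terminating; a returned dataset corresponds to execution continuing with S350.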
  • At S350, a template of the transaction is created. The template may be, but is not limited to, a data structure including a plurality of fields. The fields may include the identified transaction parameters. The fields may be predefined.
  • Creating templates from electronic documents allows for faster processing due to the structured nature of the created templates. For example, query and manipulation operations may be performed more efficiently on structured datasets than on datasets lacking such structure. Further, by organizing information from electronic documents into structured datasets, the amount of storage required for saving information contained in electronic documents may be significantly reduced. Electronic documents are often images that require more storage space than datasets containing the same information. For example, datasets representing data from 100,000 image electronic documents can be saved as data records in a text file. A size of such a text file would be significantly less than the size of the 100,000 images.
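  • As a non-limiting illustration, a template having predefined fields may be sketched as a simple data structure. The class name `TransactionTemplate`, its field names, and the helper `create_template` are assumptions made for this sketch; the disclosure does not enumerate the fields.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class TransactionTemplate:
    """Structured template with predefined fields (illustrative names)."""
    merchant_name: Optional[str] = None
    merchant_country: Optional[str] = None
    date: Optional[str] = None
    currency: Optional[str] = None
    price: Optional[float] = None


def create_template(parameters: dict) -> TransactionTemplate:
    """Copy identified transaction parameters into the predefined fields.

    Parameters that do not correspond to a predefined field are ignored.
    """
    known = {f.name for f in fields(TransactionTemplate)}
    return TransactionTemplate(
        **{k: v for k, v in parameters.items() if k in known}
    )
```

  • Because every template exposes the same fields, subsequent query and comparison operations can address values by name rather than searching the contents of an image.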
  • At S360, at least one rule is applied, in real-time, to the created template to determine if requirements for validation are met. In an embodiment, S360 includes retrieving the at least one rule from at least one database (e.g., a database of a regulatory authority that establishes requirements for reverse charge mechanism transactions or other transactions). In a further embodiment, the at least one rule may be retrieved based on at least a portion of the transaction parameters. Applying the at least one rule in real-time allows for automatic application of current rules for transaction validation, thereby increasing accuracy.
  • At optional S370, a notification may be generated. The notification may indicate the results of the application of the at least one rule (i.e., whether the transaction is validated).
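  • As a non-limiting illustration, S360 and S370 may be sketched as retrieving rules at the time of validation and reporting the result. The in-memory `RULES_DB`, the example rules, the key `merchant_vat_id`, and the choice to treat a transaction with no applicable rules as not validated are all assumptions made for this sketch.

```python
from typing import Callable, Dict, List

Rule = Callable[[dict], bool]

# Stand-in for a rules database keyed by merchant country (hypothetical rules).
RULES_DB: Dict[str, List[Rule]] = {
    "DE": [
        lambda t: (t.get("price") or 0) > 0,
        lambda t: bool(t.get("merchant_vat_id")),
    ],
}


def retrieve_rules(template: dict) -> List[Rule]:
    """Look up rules when validation runs, so the current rule set is used."""
    return RULES_DB.get(template.get("merchant_country"), [])


def validate(template: dict) -> dict:
    """Apply every retrieved rule and return a simple notification."""
    rules = retrieve_rules(template)
    # Assumption: with no applicable rules, the transaction is not validated.
    validated = bool(rules) and all(rule(template) for rule in rules)
    return {"validated": validated, "rules_applied": len(rules)}
```

  • Retrieving the rules inside `validate`, rather than caching them in advance, reflects the stated aim of applying rules that are current when the transaction is validated.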
  • FIG. 4 is an example flowchart S310 illustrating a method for creating a dataset based on at least one electronic document according to an embodiment.
  • At S410, the at least one electronic document is obtained. Obtaining each electronic document may include, but is not limited to, receiving the electronic document (e.g., receiving a scanned image) or retrieving the electronic document (e.g., retrieving the electronic document from a consumer enterprise system, a merchant enterprise system, or a database).
  • At S420, the electronic document is analyzed. The analysis may include, but is not limited to, using optical character recognition (OCR) to determine characters in the electronic document.
  • At S430, based on the analysis, key fields and values in the electronic document are identified. The key fields may include, but are not limited to, merchant's name and address, date, currency, good or service sold, a transaction identifier, an invoice number, and so on. An electronic document may include unnecessary details that would not be considered to be key values. As an example, a logo of the merchant may not be required and, thus, is not a key value. In an embodiment, a list of key fields may be predefined, and pieces of data that may match the key fields are extracted. Then, a cleaning process is performed to ensure that the information is accurately presented. For example, if the OCR would result in data presented as “1211212005”, the cleaning process will convert this data to Dec. 12, 2005. As another example, if a name is presented as “Mo$den”, this will be changed to “Mosden”. The cleaning process may be performed using external information resources, such as dictionaries, calendars, and the like.
  • In a further embodiment, it is checked if the extracted pieces of data are complete. For example, if the merchant name can be identified but its address is missing, then the key field for the merchant address is incomplete. An attempt to complete the missing key field values is performed. This attempt may include querying external systems and databases, correlation with previously analyzed invoices, or a combination thereof. Examples of external systems and databases may include business directories, Universal Product Code (UPC) databases, parcel delivery and tracking systems, and so on. In an embodiment, S430 results in a complete set of the predefined key fields and their respective values.
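  • As a non-limiting illustration, the cleaning process may be sketched for the two examples given above. The sketch assumes that “1211212005” arises from a date written as 12/12/2005 in which OCR read each slash as the digit 1, and that the date is in month/day/year order; the substitution table `SYMBOL_TO_LETTER` is likewise hypothetical.

```python
import re
from datetime import date
from typing import Optional

# Hypothetical substitutions for symbols that OCR may confuse with letters.
SYMBOL_TO_LETTER = {"$": "s", "@": "a"}


def clean_date(raw: str) -> Optional[date]:
    """Interpret a 10-digit string as MM/DD/YYYY with both slashes read as 1.

    Returns None if the string does not have that shape or if the resulting
    month and day do not form a calendar date.
    """
    m = re.fullmatch(r"(\d{2})1(\d{2})1(\d{4})", raw)
    if not m:
        return None
    month, day, year = (int(g) for g in m.groups())
    try:
        return date(year, month, day)
    except ValueError:
        return None


def clean_name(raw: str) -> str:
    """Replace symbols inside a name with the letters they resemble."""
    return "".join(SYMBOL_TO_LETTER.get(ch, ch) for ch in raw)
```

  • Here `clean_date("1211212005")` yields the date Dec. 12, 2005 and `clean_name("Mo$den")` yields “Mosden”. The calendar check performed by `date` plays the role of an external calendar resource in this sketch.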
  • At S440, a structured dataset is generated. The generated dataset includes the identified key fields and values.
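  • As a non-limiting illustration, S430 and S440 may be sketched as extracting predefined key fields from text already produced by OCR. The patterns in `KEY_FIELD_PATTERNS` and the function `create_dataset` are assumptions made for this sketch; the OCR step itself is outside the sketch and its output is passed in as a string.

```python
import re
from typing import Dict

# Hypothetical predefined list of key fields with patterns used to find them.
KEY_FIELD_PATTERNS = {
    "invoice_number": re.compile(
        r"invoice\s*(?:no\.?|number|#)\s*[:\-]?\s*(\S+)", re.IGNORECASE
    ),
    "date": re.compile(r"date\s*[:\-]?\s*(\d{4}-\d{2}-\d{2})", re.IGNORECASE),
    "currency": re.compile(r"\b(EUR|USD|GBP)\b"),
}


def create_dataset(ocr_text: str) -> Dict[str, str]:
    """Build a structured dataset of key fields and values from OCR text.

    Text that matches no key field (for example, a logo caption) is ignored.
    """
    dataset = {}
    for field, pattern in KEY_FIELD_PATTERNS.items():
        match = pattern.search(ocr_text)
        if match:
            dataset[field] = match.group(1)
    return dataset
```

  • Key fields that are not found are simply absent from the resulting dataset, which allows a later completeness check to identify them.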
  • FIG. 5 is an example flowchart S330 illustrating a method for verifying an entity of a transaction based on a dataset created using an electronic document according to an embodiment. In an example, the entity to be verified may be a merchant enterprise of a transaction. In such an example, the verification may be utilized to check that the merchant enterprise is registered for reverse charge mechanism transactions (e.g., if the merchant enterprise is VAT registered in a particular country, if the merchant enterprise is European Union certified, etc.). If the merchant enterprise is not registered for reverse charge transactions, then any reverse charge mechanism transaction including the merchant enterprise will be determined to not be valid regardless of other requirements for validation.
  • At S510, at least one entity identifier in the dataset is determined. Each entity identifier indicates a respective entity. The at least one entity identifier may be, but is not limited to, a name of the entity, a location of the entity (e.g., an address), or both. In an embodiment, the at least one entity is any entity required in order to identify the transaction (e.g., a buying entity, a selling entity, a shipping entity, etc.). The at least one entity may be predetermined. As a non-limiting example, for an RCM transaction, the at least one entity may include a consumer enterprise system and a merchant enterprise system.
  • At S520, at least one database is searched using the at least one entity identifier. The at least one database may include, but is not limited to, databases storing entity identifiers of registered entities. In an embodiment, S520 may include determining the at least one database to be searched based on a type of the transaction to be validated.
  • At S530, based on the search, it is determined whether the entity is verified. In an embodiment, the entity is verified if the entity identifier was found in any of the at least one database.
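  • As a non-limiting illustration, S510 through S530 may be sketched as a lookup of normalized entity identifiers in registries selected by transaction type. The in-memory `REGISTRIES`, the transaction type key `"rcm"`, the entity names, and the normalization are assumptions made for this sketch.

```python
from typing import Dict, Iterable, List, Set

# Stand-ins for databases of registered entities, selected by transaction
# type (hypothetical contents).
REGISTRIES: Dict[str, List[Set[str]]] = {
    "rcm": [{"mosden gmbh", "acme ltd"}, {"globex sa"}],
}


def normalize(identifier: str) -> str:
    """Lower-case the identifier and collapse runs of whitespace."""
    return " ".join(identifier.lower().split())


def verify_entity(identifiers: Iterable[str], transaction_type: str) -> bool:
    """Return True if any identifier is found in any selected database."""
    databases = REGISTRIES.get(transaction_type, [])
    wanted = {normalize(i) for i in identifiers}
    return any(wanted & db for db in databases)
```

  • The entity is treated as verified when an identifier is found in any one of the searched databases, matching the embodiment described for S530.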
  • FIG. 6 is an example flowchart 600 illustrating a method for validating a transaction based on a first dataset and a second dataset according to an embodiment. In an example implementation, the method includes validating based on a first electronic document including a scanned image of an invoice for a transaction and a second electronic document including a scanned image of a waybill related to the transaction. The data from the second electronic document may be utilized to verify that the data from the first electronic document is accurate.
  • At S610, a first dataset is created based on a first electronic document related to a transaction. The first electronic document may include, but is not limited to, unstructured data, semi-structured data, structured data with structure that is unanticipated or unannounced, or a combination thereof. In an embodiment, S610 may further include analyzing the first electronic document using optical character recognition (OCR), identifying key fields in the data, identifying values in the data, or a combination thereof. Creating datasets based on electronic documents is described further herein above with respect to FIG. 4.
  • At optional S620, an entity indicated in the first electronic document may be verified. In an example, the entity to be verified may be a consumer enterprise of the transaction. In an embodiment, the verification is based on the determined at least one identifier. In a further embodiment, S620 may include searching in at least one database based on the at least one identifier. Verifying entities of transactions based on datasets created using electronic documents is described further herein above with respect to FIG. 5.
  • At S630, based on the first electronic document, a second electronic document is obtained. In an embodiment, S630 may include searching, based on the first dataset, in at least one database for an electronic document that is related to the first electronic document. The searching may be further based on metadata of the second electronic document, data in the second electronic document, or both. In a further embodiment, the second electronic document may be related to the first electronic document if the metadata and data of the second electronic document match the first dataset above a predetermined threshold.
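  • As a non-limiting illustration, the search of S630 may be sketched as scoring candidate documents against the first dataset and accepting the best candidate only if its score is above a threshold. The score (fraction of first-dataset fields matched), the candidate layout with `metadata` and `data` entries, and the default threshold of 0.75 are assumptions made for this sketch.

```python
from typing import List, Optional


def match_score(first_dataset: dict, candidate: dict) -> float:
    """Fraction of first-dataset fields whose values also appear in the
    candidate's combined metadata and data."""
    if not first_dataset:
        return 0.0
    combined = {**candidate.get("metadata", {}), **candidate.get("data", {})}
    hits = sum(1 for k, v in first_dataset.items() if combined.get(k) == v)
    return hits / len(first_dataset)


def find_related(
    first_dataset: dict, candidates: List[dict], threshold: float = 0.75
) -> Optional[dict]:
    """Return the best-matching candidate if its score is above the threshold."""
    best = max(
        candidates, key=lambda c: match_score(first_dataset, c), default=None
    )
    if best is not None and match_score(first_dataset, best) > threshold:
        return best
    return None
```

  • For an invoice dataset, a waybill whose metadata and data repeat the transaction identifier, merchant name, date, and price would be returned, while a document sharing only the merchant name would not.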
  • At S640, a second dataset is created based on the obtained second electronic document. In an embodiment, S640 may further include analyzing the second electronic document using optical character recognition (OCR), identifying key fields in the data, identifying values in the data, or a combination thereof. Creating datasets based on electronic documents is described further herein above with respect to FIG. 4.
  • At optional S650, it is determined whether the first dataset is eligible for validation and, if so, execution continues with S660; otherwise, execution terminates. In an embodiment, S650 includes comparing the first dataset to the second dataset. In a further embodiment, the first dataset is determined to be accurate if the first dataset matches the second dataset above a predetermined threshold. In yet a further embodiment, S650 may include analyzing the first dataset to determine at least one transaction parameter, where the comparison is based on the at least one transaction parameter.
  • In an embodiment, S650 may further include determining whether the first dataset meets at least one predetermined constraint. A dataset may be eligible for validation if, e.g., the dataset meets the at least one predetermined constraint. The at least one predetermined constraint may include, but is not limited to, requirements on types of information needed for validation, accuracy requirements, or a combination thereof. For example, if an electronic document does not include a country for the consumer enterprise in a transaction or a price of the transaction, validation may not be possible. Determining whether the transaction is eligible for validation may reduce use of computing resources by only validating datasets meeting minimum requirements.
  • In another embodiment, when it is determined that the transaction is not eligible for validation, replacement information may be retrieved from at least one data source and included in the first dataset. In a further embodiment, upon retrieving the replacement information, execution continues with S660. In another embodiment, upon retrieving the replacement information, it is determined whether the first dataset with the replacement information is eligible for validation and, if so, execution continues with S660; otherwise, execution terminates.
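  • As a non-limiting illustration, the comparison of S650 may be sketched as comparing the first dataset to the second dataset on selected transaction parameters. The numeric tolerance, the case-insensitive text comparison, the default threshold of 0.8, and the decision to skip parameters absent from either dataset are assumptions made for this sketch.

```python
from typing import Iterable


def datasets_match(
    first: dict,
    second: dict,
    parameters: Iterable[str],
    threshold: float = 0.8,
    tolerance: float = 0.01,
) -> bool:
    """Return True if the share of matching parameters is above the threshold.

    Only parameters present in both datasets are compared.
    """
    compared = 0
    matched = 0
    for name in parameters:
        if name not in first or name not in second:
            continue
        compared += 1
        a, b = first[name], second[name]
        if isinstance(a, (int, float)) and isinstance(b, (int, float)):
            # Numeric values match when they differ by at most the tolerance.
            matched += abs(a - b) <= tolerance
        else:
            # Other values are compared as trimmed, lower-cased text.
            matched += str(a).strip().lower() == str(b).strip().lower()
    return compared > 0 and matched / compared > threshold
```

  • Under these assumptions, an invoice dataset and a waybill dataset that agree on price, currency, merchant name, date, and quantity are treated as matching, so the first dataset would be determined to be accurate.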
  • At S660, a template of the transaction is created. The template may be, but is not limited to, a structured dataset including the identified transaction parameters.
  • At S670, at least one rule is applied, in real-time, to the created template to determine if requirements for validation are met. In an embodiment, S670 includes retrieving the at least one rule from at least one database (e.g., a database of a regulatory authority that establishes requirements for reverse charge mechanism transactions or other transactions). Applying the at least one rule in real-time allows for automatic application of current rules for transaction validation, thereby increasing accuracy.
  • At optional S680, a notification may be generated. The notification may indicate the results of the application of the at least one rule (i.e., whether the transaction is validated).
  • As used herein, the phrase “at least one of” followed by a listing of items means that any of the listed items can be utilized individually, or any combination of two or more of the listed items can be utilized. For example, if a system is described as including “at least one of A, B, and C,” the system can include A alone; B alone; C alone; A and B in combination; B and C in combination; A and C in combination; or A, B, and C in combination.
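  • As a non-limiting illustration, the combinations covered by the phrase “at least one of A, B, and C” may be enumerated as every non-empty subset of the listed items. The function name `at_least_one_of` is an assumption made for this sketch.

```python
from itertools import combinations
from typing import List, Sequence, Tuple


def at_least_one_of(items: Sequence[str]) -> List[Tuple[str, ...]]:
    """Return every non-empty combination of the listed items."""
    return [
        combo
        for size in range(1, len(items) + 1)
        for combo in combinations(items, size)
    ]
```

  • For the three items A, B, and C, the function returns seven combinations, corresponding to the seven alternatives listed above.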
  • The various embodiments disclosed herein can be implemented as hardware, firmware, software, or any combination thereof. Moreover, the software is preferably implemented as an application program tangibly embodied on a program storage unit or computer readable medium consisting of parts, or of certain devices and/or a combination of devices. The application program may be uploaded to, and executed by, a machine comprising any suitable architecture. Preferably, the machine is implemented on a computer platform having hardware such as one or more central processing units (“CPUs”), a memory, and input/output interfaces. The computer platform may also include an operating system and microinstruction code. The various processes and functions described herein may be either part of the microinstruction code or part of the application program, or any combination thereof, which may be executed by a CPU, whether or not such a computer or processor is explicitly shown. In addition, various other peripheral units may be connected to the computer platform such as an additional data storage unit and a printing unit. Furthermore, a non-transitory computer readable medium is any computer readable medium except for a transitory propagating signal.
  • All examples and conditional language recited herein are intended for pedagogical purposes to aid the reader in understanding the principles of the disclosed embodiment and the concepts contributed by the inventor to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions. Moreover, all statements herein reciting principles, aspects, and embodiments of the disclosed embodiments, as well as specific examples thereof, are intended to encompass both structural and functional equivalents thereof. Additionally, it is intended that such equivalents include both currently known equivalents as well as equivalents developed in the future, i.e., any elements developed that perform the same function, regardless of structure.

Claims (21)

What is claimed is:
1. A method for validating a transaction represented by an electronic document, comprising:
identifying, in the electronic document, at least one key field and at least one value;
creating, based on the electronic document, a dataset, wherein the created dataset includes the at least one key field and the at least one value;
analyzing the created dataset to determine at least one transaction parameter;
creating a template for the transaction, wherein the template is a structured dataset including the determined at least one transaction parameter; and
applying, in real-time, at least one rule to the created template to determine whether the transaction is validated.
2. The method of claim 1, further comprising:
determining, based on the created dataset, at least one entity identifier; and
verifying an entity of the transaction based on the determined at least one entity identifier.
3. The method of claim 2, wherein the electronic document is a first electronic document, wherein the created dataset is a first dataset, further comprising:
obtaining, based on the first electronic document, a second electronic document;
creating, based on the obtained second electronic document, a second dataset; and
comparing the first dataset to the second dataset, wherein the verification is based further on the comparison.
4. The method of claim 2, wherein the transaction is a reverse charge mechanism transaction, wherein the entity is a merchant enterprise entity, wherein verifying the entity further comprises:
determining, based on the at least one entity identifier, whether the merchant enterprise entity is registered for reverse charge mechanism transactions.
5. The method of claim 1, further comprising:
determining, based on at least one predetermined constraint, whether the created dataset is eligible for validation.
6. The method of claim 1, further comprising:
generating a notification, wherein the notification indicates whether the transaction is validated.
7. The method of claim 1, wherein identifying the at least one key field and the at least one value further comprises:
analyzing the electronic document to determine data in the electronic document; and
extracting, based on a predetermined list of key fields, at least a portion of the determined data, wherein the at least a portion of the determined data matches at least one key field of the predetermined list of key fields.
8. The method of claim 7, wherein analyzing the electronic document further comprises:
performing optical character recognition on the electronic document.
9. The method of claim 7, further comprising:
performing a cleaning process on the extracted at least a portion of the determined data.
10. The method of claim 7, further comprising:
checking if each piece of data of the extracted at least a portion of the determined data is completed; and
for each piece of data that is not completed, performing at least one of: querying at least one external source, and correlating the determined data with data of at least one previously analyzed electronic document.
11. A non-transitory computer readable medium having stored thereon instructions for causing a processing circuitry to perform a process, the process comprising:
identifying, in the electronic document, at least one key field and at least one value;
creating, based on the electronic document, a dataset, wherein the created dataset includes the at least one key field and the at least one value;
analyzing the created dataset to determine at least one transaction parameter;
creating a template for the transaction, wherein the template is a structured dataset including the determined at least one transaction parameter; and
applying, in real-time, at least one rule to the created template to determine whether the transaction is validated.
12. A system for validating a transaction represented by an electronic document, comprising:
a processing circuitry; and
a memory, the memory containing instructions that, when executed by the processing circuitry, configure the system to:
identify, in the electronic document, at least one key field and at least one value;
create, based on the electronic document, a dataset, wherein the created dataset includes the at least one key field and the at least one value;
analyze the created dataset to determine at least one transaction parameter;
create a template for the transaction, wherein the template is a structured dataset including the determined at least one transaction parameter; and
apply, in real-time, at least one rule to the created template to determine whether the transaction is validated.
13. The system of claim 12, wherein the system is further configured to:
determine, based on the created dataset, at least one entity identifier; and
verify an entity of the transaction based on the determined at least one entity identifier.
14. The system of claim 13, wherein the electronic document is a first electronic document, wherein the created dataset is a first dataset, wherein the system is further configured to:
obtain, based on the first electronic document, a second electronic document;
create, based on the obtained second electronic document, a second dataset; and
compare the first dataset to the second dataset, wherein the verification is based further on the comparison.
15. The system of claim 13, wherein the transaction is a reverse charge mechanism transaction, wherein the entity is a merchant enterprise entity, wherein the system is further configured to:
determine, based on the at least one entity identifier, whether the merchant enterprise entity is registered for reverse charge mechanism transactions.
16. The system of claim 12, wherein the system is further configured to:
determine, based on at least one predetermined constraint, whether the created dataset is eligible for validation.
17. The system of claim 12, wherein the system is further configured to:
generate a notification, wherein the notification indicates whether the transaction is validated.
18. The system of claim 12, wherein the system is further configured to:
analyze the electronic document to determine data in the electronic document; and
extract, based on a predetermined list of key fields, at least a portion of the determined data, wherein the at least a portion of the determined data matches at least one key field of the predetermined list of key fields.
19. The system of claim 18, wherein the system is further configured to:
perform optical character recognition on the electronic document.
20. The system of claim 18, wherein the system is further configured to:
perform a cleaning process on the extracted at least a portion of the determined data.
21. The system of claim 18, wherein the system is further configured to:
check if each piece of data of the extracted at least a portion of the determined data is completed; and
for each piece of data that is not completed, perform at least one of: querying at least one external source, and correlating the determined data with data of at least one previously analyzed electronic document.
US15/361,934 2015-02-04 2016-11-28 System and method for automatic validation Abandoned US20170154385A1 (en)

Priority Applications (82)

Application Number Priority Date Filing Date Title
US15/361,934 US20170154385A1 (en) 2015-11-29 2016-11-28 System and method for automatic validation
PCT/US2016/066845 WO2017142615A1 (en) 2016-02-15 2016-12-15 System and method for maintaining data integrity
GB1812979.1A GB2562669A (en) 2016-02-15 2016-12-15 System and method for maintaining data integrity
US15/379,971 US20170161315A1 (en) 2015-11-29 2016-12-15 System and method for maintaining data integrity
DE112016006434.9T DE112016006434T5 (en) 2016-02-15 2016-12-15 SYSTEM AND METHOD FOR MAINTAINING DATA INTEGRITY
PCT/US2016/067716 WO2017142618A1 (en) 2016-02-15 2016-12-20 Automatic verification of requests based on electronic documents
CN201680083049.5A CN108713198A (en) 2016-02-15 2016-12-20 Automatic checking request based on electronic document
EP16890887.9A EP3417383A4 (en) 2016-02-15 2016-12-20 AUTOMATIC VERIFICATION OF REQUESTS BASED ON ELECTRONIC DOCUMENTS
US15/384,726 US20170169292A1 (en) 2015-11-29 2016-12-20 System and method for automatically verifying requests based on electronic documents
PCT/US2016/068536 WO2017142624A1 (en) 2016-02-15 2016-12-23 System and method for automatically tagging electronic documents
US15/390,068 US20170169518A1 (en) 2015-11-29 2016-12-23 System and method for automatically tagging electronic documents
CN201680085018.3A CN109313765A (en) 2016-03-13 2016-12-27 The System and method for of automatic verifying transaction is carried out based on electronic document
PCT/US2016/068714 WO2017160372A1 (en) 2016-03-13 2016-12-27 System and method for automatically verifying transactions based on electronic documents
US15/391,369 US20170169519A1 (en) 2015-11-29 2016-12-27 System and method for automatically verifying transactions based on electronic documents
EP16894794.3A EP3430584A4 (en) 2016-03-13 2016-12-27 System and method for automatically verifying transactions based on electronic documents
PCT/US2017/012120 WO2017131932A1 (en) 2016-01-27 2017-01-04 System and method for verifying extraction of multiple document images from an electronic document
US15/398,108 US20170185832A1 (en) 2015-02-04 2017-01-04 System and method for verifying extraction of multiple document images from an electronic document
EP17767105.4A EP3430540A4 (en) 2016-03-13 2017-01-25 System and method for automatically generating reporting data based on electronic documents
US15/415,286 US20170193608A1 (en) 2015-11-29 2017-01-25 System and method for automatically generating reporting data based on electronic documents
CN201780027071.2A CN109219809A (en) 2016-03-13 2017-01-25 The method and system for automatically generating data reporting based on electronic document
PCT/US2017/014874 WO2017160403A1 (en) 2016-03-13 2017-01-25 System and method for automatically generating reporting data based on electronic documents
US15/416,563 US20170193609A1 (en) 2015-11-29 2017-01-26 System and method for automatically monitoring requests indicated in electronic documents
PCT/US2017/015087 WO2017200603A1 (en) 2016-05-18 2017-01-26 System and method for automatically monitoring requests indicated in electronic documents
EP17799796.2A EP3458971A4 (en) 2016-05-18 2017-01-26 System and method for automatically monitoring requests indicated in electronic documents
PCT/US2017/032854 WO2017201012A1 (en) 2016-05-18 2017-05-16 Providing analytics in real-time based on unstructured electronic documents
PCT/US2017/032855 WO2017201013A1 (en) 2016-05-18 2017-05-16 System and method for creating historical records based on unstructured electronic documents
DE112017002533.8T DE112017002533T5 (en) 2016-05-18 2017-05-16 System and method for generating historical data records on unstructured electronic documents
CN201780030660.6A CN109154949A (en) 2016-05-18 2017-05-16 Analysis is provided in real time based on non-structured electronic document
US15/596,489 US20170323395A1 (en) 2015-11-29 2017-05-16 System and method for creating historical records based on unstructured electronic documents
DE112017002535.4T DE112017002535T5 (en) 2016-05-18 2017-05-16 Provide real-time analysis based on unstructured electronic documents
US15/596,492 US20170323006A1 (en) 2015-11-29 2017-05-16 System and method for providing analytics in real-time based on unstructured electronic documents
GB1818559.5A GB2565014A (en) 2016-05-18 2017-05-16 Providing analytics in real-time based on unstructured electronic documents
US15/597,825 US20170323157A1 (en) 2015-11-29 2017-05-17 System and method for determining an entity status based on unstructured electronic documents
GB1818560.3A GB2565476A (en) 2016-05-18 2017-05-17 System and method for determining an entity status based on unstructured electronic documents
PCT/US2017/033106 WO2017201163A1 (en) 2016-05-18 2017-05-17 System and method for determining an entity status based on unstructured electronic documents
DE112017002569.9T DE112017002569T5 (en) 2016-05-18 2017-05-17 System and method for determining the status of a unit based on unstructured electronic documents
US15/599,096 US20170323106A1 (en) 2015-11-29 2017-05-18 System and method for encrypting data in electronic documents
GB1818561.1A GB2565684B (en) 2016-05-18 2017-05-18 System and method for creating historical records based on unstructured electronic documents
PCT/US2017/033338 WO2017201292A1 (en) 2016-05-19 2017-05-18 System and method for encrypting data in electronic documents
EP17849248.4A EP3491554A4 (en) 2016-07-31 2017-07-25 Matching transaction electronic documents to evidencing electronic
PCT/US2017/043644 WO2018048512A1 (en) 2016-07-31 2017-07-25 Matching transaction electronic documents to evidencing electronic
CN201780058542.6A CN109791548A (en) 2016-07-31 2017-07-25 Match trading electronic document and proof electronic document
US15/658,832 US20180011846A1 (en) 2015-11-29 2017-07-25 System and method for matching transaction electronic documents to evidencing electronic documents
PCT/US2017/045342 WO2018027057A1 (en) 2016-08-05 2017-08-03 System and method for completing electronic documents
US15/668,416 US20180018312A1 (en) 2015-11-29 2017-08-03 System and method for monitoring electronic documents
CN201780060466.2A CN109791560A (en) 2016-08-05 2017-08-03 The system and method for monitoring electronic document
US15/668,402 US10509811B2 (en) 2015-11-29 2017-08-03 System and method for improved analysis of travel-indicating unstructured electronic documents
PCT/US2017/045338 WO2018027054A1 (en) 2016-08-05 2017-08-03 Sytem and method for monitoring electronic documents
GB1902676.4A GB2568416A (en) 2016-08-05 2017-08-03 System and method for monitoring electronic documents
PCT/US2017/045333 WO2018027051A1 (en) 2016-08-05 2017-08-03 System and method for improved analysis of travel-indicating unstructured electronic documents
GB1902675.6A GB2571636A (en) 2016-08-05 2017-08-03 System and method for improved analysis of travel-indicating unstructured electronic documents
US15/668,426 US20180046663A1 (en) 2015-11-29 2017-08-03 System and method for completing electronic documents
EP17837704.0A EP3494495A4 (en) 2016-08-05 2017-08-03 System and method for completing electronic documents
DE112017003926.6T DE112017003926T5 (en) 2016-08-05 2017-08-03 System and method for monitoring electronic documents
CN201780060456.9A CN109791537A (en) 2016-08-05 2017-08-03 Electronic document is supplemented into complete system and method
US15/669,236 US11138372B2 (en) 2015-11-29 2017-08-04 System and method for reporting based on electronic documents
CN201780060455.4A CN109791641A (en) 2016-08-05 2017-08-04 Obtain the system and method for lacking the repeating transmission of electronic document of necessary data
PCT/US2017/045491 WO2018027130A1 (en) 2016-08-05 2017-08-04 System and method for reporting based on electronic documents
PCT/US2017/045554 WO2018027158A1 (en) 2016-08-05 2017-08-04 System and method for generating consolidated data for electronic documents
US15/669,510 US20180025225A1 (en) 2015-11-29 2017-08-04 System and method for generating consolidated data for electronic documents
US15/669,253 US10387561B2 (en) 2015-11-29 2017-08-04 System and method for obtaining reissues of electronic documents lacking required data
CN201780060462.4A CN109791540A (en) 2016-08-05 2017-08-04 System and method for reporting based on electronic documents
EP17837762.8A EP3494530A4 (en) 2016-08-05 2017-08-04 Obtaining reissues of electronic documents lacking required data
PCT/US2017/045488 WO2018031402A1 (en) 2016-08-07 2017-08-04 System and method for identifying unclaimed electronic documents
EP17837779.2A EP3494531A4 (en) 2016-08-05 2017-08-04 System and method for generating consolidated data for electronic documents
CN201780058567.6A CN109791643A (en) 2016-08-05 2017-08-04 System and method for generating the merging data of electronic document
US15/669,224 US20180025224A1 (en) 2015-11-29 2017-08-04 System and method for identifying unclaimed electronic documents
EP17837759.4A EP3494496A4 (en) 2016-08-05 2017-08-04 System and method for reporting based on electronic documents
PCT/US2017/045497 WO2018027133A1 (en) 2016-08-05 2017-08-04 Obtaining reissues of electronic documents lacking required data
EP17840055.2A EP3497589A4 (en) 2016-08-07 2017-08-04 System and method for identifying unclaimed electronic documents
US15/674,165 US20180025438A1 (en) 2015-11-29 2017-08-10 System and method for generating analytics based on electronic documents
PCT/US2017/046317 WO2018034941A1 (en) 2016-08-14 2017-08-10 System and method for generating analytics based on electronic documents
US15/724,958 US20180096435A1 (en) 2015-11-29 2017-10-04 System and method for verifying unstructured enterprise resource planning data
PCT/US2017/055135 WO2018067698A1 (en) 2016-10-09 2017-10-04 System and method for verifying unstructured enterprise resource planning data
EP17859117.8A EP3523771A4 (en) 2016-10-09 2017-10-04 System and method for verifying unstructured enterprise resource planning data
CN201780071509.7A CN110023970A (en) 2016-10-09 2017-10-04 System and method for verifying non-structured Enterprise Resources Plan data
US15/782,905 US10558880B2 (en) 2015-11-29 2017-10-13 System and method for finding evidencing electronic documents based on unstructured data
CN201780070059.XA CN109983489A (en) 2016-10-16 2017-10-13 Electronic document is proved based on non-structured data search
PCT/US2017/056448 WO2018071737A1 (en) 2016-10-16 2017-10-13 Finding evidencing electronic documents based on unstructured data
EP17861011.9A EP3526758A4 (en) 2016-10-16 2017-10-13 Finding evidencing electronic documents based on unstructured data
PCT/US2017/064191 WO2018098507A1 (en) 2016-11-28 2017-12-01 System and method for automatic creation of regulatory reports
US16/376,854 US20190236126A1 (en) 2015-11-29 2019-04-05 System and method for automatic creation of regulatory reports

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201562260553P 2015-11-29 2015-11-29
US201562261355P 2015-12-01 2015-12-01
US15/361,934 US20170154385A1 (en) 2015-11-29 2016-11-28 System and method for automatic validation

Related Parent Applications (2)

Application Number Relation Publication Number Priority Date Filing Date Title
US15/013,284 Continuation-In-Part US10621676B2 (en) 2015-02-04 2016-02-02 System and methods for extracting document images from images featuring multiple documents
US15/361,934 Continuation-In-Part US20170154385A1 (en) 2015-02-04 2016-11-28 System and method for automatic validation

Related Child Applications (24)

Application Number Relation Publication Number Priority Date Filing Date Title
US15/361,934 Continuation-In-Part US20170154385A1 (en) 2015-02-04 2016-11-28 System and method for automatic validation
US15/379,971 Continuation-In-Part US20170161315A1 (en) 2015-11-29 2016-12-15 System and method for maintaining data integrity
US15/384,726 Continuation-In-Part US20170169292A1 (en) 2015-11-29 2016-12-20 System and method for automatically verifying requests based on electronic documents
US15/390,068 Continuation-In-Part US20170169518A1 (en) 2015-11-29 2016-12-23 System and method for automatically tagging electronic documents
US15/391,369 Continuation-In-Part US20170169519A1 (en) 2015-11-29 2016-12-27 System and method for automatically verifying transactions based on electronic documents
US15/398,108 Continuation-In-Part US20170185832A1 (en) 2015-02-04 2017-01-04 System and method for verifying extraction of multiple document images from an electronic document
US15/415,286 Continuation-In-Part US20170193608A1 (en) 2015-11-29 2017-01-25 System and method for automatically generating reporting data based on electronic documents
US15/416,563 Continuation-In-Part US20170193609A1 (en) 2015-11-29 2017-01-26 System and method for automatically monitoring requests indicated in electronic documents
US15/596,492 Continuation-In-Part US20170323006A1 (en) 2015-11-29 2017-05-16 System and method for providing analytics in real-time based on unstructured electronic documents
US15/596,489 Continuation-In-Part US20170323395A1 (en) 2015-11-29 2017-05-16 System and method for creating historical records based on unstructured electronic documents
US15/597,825 Continuation-In-Part US20170323157A1 (en) 2015-11-29 2017-05-17 System and method for determining an entity status based on unstructured electronic documents
US15/599,096 Continuation-In-Part US20170323106A1 (en) 2015-11-29 2017-05-18 System and method for encrypting data in electronic documents
US15/658,832 Continuation-In-Part US20180011846A1 (en) 2015-11-29 2017-07-25 System and method for matching transaction electronic documents to evidencing electronic documents
US15/668,426 Continuation-In-Part US20180046663A1 (en) 2015-11-29 2017-08-03 System and method for completing electronic documents
US15/668,402 Continuation-In-Part US10509811B2 (en) 2015-11-29 2017-08-03 System and method for improved analysis of travel-indicating unstructured electronic documents
US15/668,416 Continuation-In-Part US20180018312A1 (en) 2015-11-29 2017-08-03 System and method for monitoring electronic documents
US15/669,253 Continuation-In-Part US10387561B2 (en) 2015-11-29 2017-08-04 System and method for obtaining reissues of electronic documents lacking required data
US15/669,236 Continuation-In-Part US11138372B2 (en) 2015-11-29 2017-08-04 System and method for reporting based on electronic documents
US15/669,224 Continuation-In-Part US20180025224A1 (en) 2015-11-29 2017-08-04 System and method for identifying unclaimed electronic documents
US15/669,510 Continuation-In-Part US20180025225A1 (en) 2015-11-29 2017-08-04 System and method for generating consolidated data for electronic documents
US15/674,165 Continuation-In-Part US20180025438A1 (en) 2015-11-29 2017-08-10 System and method for generating analytics based on electronic documents
US15/724,958 Continuation-In-Part US20180096435A1 (en) 2015-11-29 2017-10-04 System and method for verifying unstructured enterprise resource planning data
US15/782,905 Continuation-In-Part US10558880B2 (en) 2015-11-29 2017-10-13 System and method for finding evidencing electronic documents based on unstructured data
PCT/US2017/064191 Continuation-In-Part WO2018098507A1 (en) 2015-11-29 2017-12-01 System and method for automatic creation of regulatory reports

Publications (1)

Publication Number Publication Date
US20170154385A1 (en) 2017-06-01

Family

ID=58763676

Family Applications (5)

Application Number Status Publication Number Priority Date Filing Date Title
US15/361,934 Abandoned US20170154385A1 (en) 2015-02-04 2016-11-28 System and method for automatic validation
US15/363,048 Active US10235723B2 (en) 2015-11-29 2016-11-29 System and method for automatic generation of reports based on electronic documents
US16/211,960 Active US10614527B2 (en) 2015-11-29 2018-12-06 System and method for automatic generation of reports based on electronic documents
US16/232,534 Active US10546351B2 (en) 2015-11-29 2018-12-26 System and method for automatic generation of reports based on electronic documents
US16/232,526 Active US10614528B2 (en) 2015-11-29 2018-12-26 System and method for automatic generation of reports based on electronic documents

Family Applications After (4)

Application Number Status Publication Number Priority Date Filing Date Title
US15/363,048 Active US10235723B2 (en) 2015-11-29 2016-11-29 System and method for automatic generation of reports based on electronic documents
US16/211,960 Active US10614527B2 (en) 2015-11-29 2018-12-06 System and method for automatic generation of reports based on electronic documents
US16/232,534 Active US10546351B2 (en) 2015-11-29 2018-12-26 System and method for automatic generation of reports based on electronic documents
US16/232,526 Active US10614528B2 (en) 2015-11-29 2018-12-26 System and method for automatic generation of reports based on electronic documents

Country Status (5)

Country Link
US (5) US20170154385A1 (en)
EP (1) EP3380958A4 (en)
DE (1) DE112016005443T5 (en)
GB (1) GB2560476A (en)
WO (2) WO2017091825A1 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110046335A (en) * 2019-04-03 2019-07-23 江苏科技大学 A kind of rapid generation of profile measurement report
CN110209883A (en) * 2019-06-06 2019-09-06 王慧斌 A kind of method and device of user's history data validity judgement
CN110764952A (en) * 2019-09-30 2020-02-07 深圳市金溢科技股份有限公司 Readable storage medium, front-mounted ETC electronic tag and anti-dismounting method thereof
US20200118122A1 (en) * 2018-10-15 2020-04-16 Vatbox, Ltd. Techniques for completing missing and obscured transaction data items
CN111159110A (en) * 2019-12-03 2020-05-15 深圳市智微智能软件开发有限公司 Data establishing method and system
CN112434507A (en) * 2020-12-10 2021-03-02 四川长虹电器股份有限公司 Electronic bill generating method based on bill element configuration
CN113177851A (en) * 2021-04-15 2021-07-27 国任财产保险股份有限公司 Method and device for storing certificate of online insurance transaction, electronic equipment and storage medium
CN114048308A (en) * 2021-11-03 2022-02-15 中国司法大数据研究院有限公司 Method and device for generating category retrieval report
WO2022173512A1 (en) * 2021-02-12 2022-08-18 Tekion Corp Document parser and generator

Families Citing this family (14)

Publication number Priority date Publication date Assignee Title
US11817993B2 (en) * 2015-01-27 2023-11-14 Dell Products L.P. System for decomposing events and unstructured data
EP3494534A4 (en) * 2016-08-02 2020-01-22 Hexanika Systems and methods for the collection, summary and processing of data
CN107798534A (en) * 2017-11-24 2018-03-13 珠海市魅族科技有限公司 A kind of information recording method and device, terminal and readable storage medium storing program for executing
CN108132925A (en) * 2017-12-15 2018-06-08 中盈优创资讯科技有限公司 A kind of report file generation method and device
US10970578B2 (en) * 2019-02-07 2021-04-06 Johnson Controls Fire Protection LP System and method for extracting information from a non-planar surface
CN110222107B (en) * 2019-05-20 2023-08-22 平安科技(深圳)有限公司 Data transmission method and related equipment
WO2021081464A1 (en) * 2019-10-24 2021-04-29 Nickl Ralph Systems and methods for identifying compliance-related information associated with data breach events
CA3157986A1 (en) 2019-10-24 2021-04-29 Canopy Software Inc. Systems and methods for identifying compliance-related information associated with data breach events
BR122022003477A2 (en) * 2020-05-08 2022-03-29 Bold Limited Systems and methods for creating enhanced documents for seamless automated analysis
US12373480B2 (en) 2020-05-08 2025-07-29 Bold Limited Systems and methods for creating enhanced documents for perfect automated parsing
CN112420151A (en) * 2020-12-07 2021-02-26 医惠科技有限公司 Method, system, equipment and medium for structured analysis after ultrasonic report
CN113157938B (en) * 2021-03-25 2022-05-17 支付宝(杭州)信息技术有限公司 Method and device for jointly processing multiple knowledge graphs for protecting privacy data
CN116795789B (en) * 2023-08-24 2024-04-19 卓望信息技术(北京)有限公司 Method and device for automatically generating patent search report
CN117252176B (en) * 2023-09-18 2024-09-20 广东企企通科技有限公司 Intelligent survey report generation method, device, equipment and medium

Citations (4)

Publication number Priority date Publication date Assignee Title
US7299408B1 (en) * 2002-04-01 2007-11-20 Fannie Mae Electronic document validation
US20140079294A1 (en) * 2009-02-10 2014-03-20 Kofax, Inc. Systems, methods and computer program products for determining document validity
US20140129400A1 (en) * 2012-11-07 2014-05-08 Syncada Llc Electronic payment processing system
US20170147540A1 (en) * 2015-11-24 2017-05-25 Bank Of America Corporation Transforming unstructured documents

Family Cites Families (47)

Publication number Priority date Publication date Assignee Title
US5606609A (en) * 1994-09-19 1997-02-25 Scientific-Atlanta Electronic document verification system and method
US7215773B1 (en) * 1998-10-14 2007-05-08 Certicom.Corp. Key validation scheme
US20030115080A1 (en) * 2001-10-23 2003-06-19 Kasra Kasravi System and method for managing contracts using text mining
US7693760B1 (en) * 2002-03-26 2010-04-06 Oracle International Corporation Method and apparatus for providing a tax service that is configurable for local jurisdictions
US6862488B2 (en) * 2002-07-05 2005-03-01 Validation Commerce, Llc Automated validation processing and workflow management
US20040049473A1 (en) * 2002-09-05 2004-03-11 David John Gower Information analytics systems and methods
US7827079B2 (en) * 2003-06-30 2010-11-02 Ebay Inc. Method and system for assessing and reporting VAT charges for network-based marketplace services
US7937319B2 (en) * 2005-03-21 2011-05-03 Oversight Technologies, Inc. Methods and systems for compliance monitoring knowledge base
WO2005089526A2 (en) * 2004-03-19 2005-09-29 Oversight Technologies, Inc. Methods and systems for transaction compliance monitoring
US20060036465A1 (en) * 2004-08-13 2006-02-16 O'donnell Lee F Online interactive interface and automated processing for loan origination and underwriting
US8739059B2 (en) 2005-05-16 2014-05-27 Xcira, Inc. System for generating inspection reports for inspected items
US20060271379A1 (en) * 2005-05-26 2006-11-30 Jason Resnick Intellectual property analysis and report generating system and method
US8024339B2 (en) 2005-10-12 2011-09-20 Business Objects Software Ltd. Apparatus and method for generating reports with masked confidential data
US20070112599A1 (en) * 2005-10-26 2007-05-17 Peiya Liu Method and system for generating and validating clinical reports with built-in automated measurement and decision support
US7698188B2 (en) * 2005-11-03 2010-04-13 Beta-Rubicon Technologies, Llc Electronic enterprise capital marketplace and monitoring apparatus and method
US8069147B2 (en) * 2005-11-10 2011-11-29 Computer Associates Think, Inc. System and method for delivering results of a search query in an information management system
WO2007117334A2 (en) * 2006-01-03 2007-10-18 Kyos Systems Inc. Document analysis system for integration of paper records into a searchable electronic database
US7475062B2 (en) * 2006-02-28 2009-01-06 Business Objects Software Ltd. Apparatus and method for selecting a subset of report templates based on specified criteria
US20070294230A1 (en) * 2006-05-31 2007-12-20 Joshua Sinel Dynamic content analysis of collected online discussions
US7899837B2 (en) * 2006-09-29 2011-03-01 Business Objects Software Ltd. Apparatus and method for generating queries and reports
US20080126155A1 (en) * 2006-11-27 2008-05-29 Sap Ag Method and apparatus for enterprise operation assessment
US20090228777A1 (en) * 2007-08-17 2009-09-10 Accupatent, Inc. System and Method for Search
US20090070338A1 (en) 2007-09-07 2009-03-12 Bowe Bell + Howell Company Centralized production management for measuring mail industry processing performance
US20090112743A1 (en) * 2007-10-31 2009-04-30 Mullins Christine M System and method for reporting according to eu vat related legal requirements
US20100017316A1 (en) * 2007-11-05 2010-01-21 American Express Travel Related Services Company, Inc. Automated expense report
US8762253B2 (en) * 2008-08-07 2014-06-24 Segun Ogundipe Method and system for pooling, securitizing, and trading global dividend and interest tax reclaim assets
US8719308B2 (en) * 2009-02-16 2014-05-06 Business Objects, S.A. Method and system to process unstructured data
TWM378172U (en) * 2009-11-30 2010-04-11 Zhe-Wei Chen Scooter that can be easily swung and speeded up
US8743712B1 (en) * 2010-04-12 2014-06-03 Symantec Corporation Systems and methods for aggregating data for resources in a target group of resources
JP2012164730A (en) * 2011-02-04 2012-08-30 Renesas Electronics Corp Semiconductor device
GB201116864D0 (en) 2011-09-29 2011-11-09 Shl Group Ltd Requirements characterisation
US20130211975A1 (en) 2011-11-26 2013-08-15 Kevin James Masterson Portable electronic kit for payroll voucher, time tracking, and production administration in the entertainment industry
US20130262328A1 (en) * 2012-03-30 2013-10-03 CSRSI, Inc. System and method for automated data breach compliance
US20140046892A1 (en) * 2012-08-10 2014-02-13 Xurmo Technologies Pvt. Ltd. Method and system for visualizing information extracted from big data
US20140095270A1 (en) 2012-09-28 2014-04-03 StreamLink LLC Method and system for managing grants
US20140156500A1 (en) * 2012-12-03 2014-06-05 Experian Information Solutions, Inc. Systems and methods for providing a customizable credit report
US20140195555A1 (en) 2013-01-09 2014-07-10 Wisdomain Inc. Automatic report generation in a networked computing environment
US20170228821A1 (en) 2013-02-25 2017-08-10 Descant, Inc. System and method for self-aggregating, standardizing, sharing and validating credit data between businesses and creditors
US10636100B2 (en) * 2013-02-27 2020-04-28 Vatbox, Ltd. System and method for prediction of value added tax reclaim success
US9922102B2 (en) * 2013-07-31 2018-03-20 Splunk Inc. Templates for defining fields in machine data
US20150046366A1 (en) * 2013-08-09 2015-02-12 Lingua Next Technologies Pvt. Ltd Method And System For Batch Generation Of Reports
US20150127534A1 (en) * 2013-11-04 2015-05-07 Bank Of America Corporation Electronic refund redemption
WO2015120086A1 (en) 2014-02-04 2015-08-13 Shoobx, Inc. Computer-guided corporate governance with document generation and execution
US10340038B2 (en) * 2014-05-13 2019-07-02 Nant Holdings Ip, Llc Healthcare transaction validation via blockchain, systems and methods
US10162848B2 (en) * 2014-06-11 2018-12-25 Ureveal, Inc. Methods and apparatus for harmonization of data stored in multiple databases using concept-based analysis
US20160012030A1 (en) * 2014-07-08 2016-01-14 Tuyen Tran Data form generation and gathering
US9965679B2 (en) * 2014-11-05 2018-05-08 Accenture Global Services Limited Capturing specific information based on field information associated with a document class

Cited By (12)

Publication number Priority date Publication date Assignee Title
US20200118122A1 (en) * 2018-10-15 2020-04-16 Vatbox, Ltd. Techniques for completing missing and obscured transaction data items
CN110046335A (en) * 2019-04-03 2019-07-23 江苏科技大学 A kind of rapid generation of profile measurement report
CN110046335B (en) * 2019-04-03 2023-05-19 江苏科技大学 A Rapid Generation Method of Shape Inspection Report
CN110209883A (en) * 2019-06-06 2019-09-06 王慧斌 A kind of method and device of user's history data validity judgement
CN110764952A (en) * 2019-09-30 2020-02-07 深圳市金溢科技股份有限公司 Readable storage medium, front-mounted ETC electronic tag and anti-dismounting method thereof
CN111159110A (en) * 2019-12-03 2020-05-15 深圳市智微智能软件开发有限公司 Data establishing method and system
CN112434507A (en) * 2020-12-10 2021-03-02 四川长虹电器股份有限公司 Electronic bill generating method based on bill element configuration
WO2022173512A1 (en) * 2021-02-12 2022-08-18 Tekion Corp Document parser and generator
US11803700B2 (en) 2021-02-12 2023-10-31 Tekion Corp Document parser and generator
US12197849B2 (en) 2021-02-12 2025-01-14 Tekion Corp Document parser and generator
CN113177851A (en) * 2021-04-15 2021-07-27 国任财产保险股份有限公司 Method and device for storing certificate of online insurance transaction, electronic equipment and storage medium
CN114048308A (en) * 2021-11-03 2022-02-15 中国司法大数据研究院有限公司 Method and device for generating category retrieval report

Also Published As

Publication number Publication date
US20190130494A1 (en) 2019-05-02
US10614528B2 (en) 2020-04-07
US20190130495A1 (en) 2019-05-02
WO2017091825A1 (en) 2017-06-01
US10546351B2 (en) 2020-01-28
US20190108594A1 (en) 2019-04-11
EP3380958A1 (en) 2018-10-03
US10614527B2 (en) 2020-04-07
DE112016005443T5 (en) 2018-08-16
GB2560476A (en) 2018-09-12
WO2017091829A1 (en) 2017-06-01
GB201809981D0 (en) 2018-08-01
US20170154027A1 (en) 2017-06-01
US10235723B2 (en) 2019-03-19
EP3380958A4 (en) 2019-05-01

Similar Documents

Publication Publication Date Title
US10546351B2 (en) System and method for automatic generation of reports based on electronic documents
US11062132B2 (en) System and method for identification of missing data elements in electronic documents
US20170323006A1 (en) System and method for providing analytics in real-time based on unstructured electronic documents
US20170193608A1 (en) System and method for automatically generating reporting data based on electronic documents
US20180018312A1 (en) System and method for monitoring electronic documents
US20170169292A1 (en) System and method for automatically verifying requests based on electronic documents
US20180011846A1 (en) System and method for matching transaction electronic documents to evidencing electronic documents
US11138372B2 (en) System and method for reporting based on electronic documents
US20180046663A1 (en) System and method for completing electronic documents
US20170323157A1 (en) System and method for determining an entity status based on unstructured electronic documents
EP3494495A1 (en) System and method for completing electronic documents
US20170169518A1 (en) System and method for automatically tagging electronic documents
EP3430540A1 (en) System and method for automatically generating reporting data based on electronic documents
US20170161315A1 (en) System and method for maintaining data integrity
US20170169519A1 (en) System and method for automatically verifying transactions based on electronic documents
EP3494496A1 (en) System and method for reporting based on electronic documents
WO2017201012A1 (en) Providing analytics in real-time based on unstructured electronic documents
US10558880B2 (en) System and method for finding evidencing electronic documents based on unstructured data
US10387561B2 (en) System and method for obtaining reissues of electronic documents lacking required data
US20180096435A1 (en) System and method for verifying unstructured enterprise resource planning data
WO2017142615A1 (en) System and method for maintaining data integrity
WO2018027133A1 (en) Obtaining reissues of electronic documents lacking required data
EP3417383A1 (en) Automatic verification of requests based on electronic documents
WO2018071737A1 (en) Finding evidencing electronic documents based on unstructured data
WO2018067698A1 (en) System and method for verifying unstructured enterprise resource planning data

Legal Events

Date Code Title Description
AS Assignment

Owner name: VATBOX, LTD., ISRAEL

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:GUZMAN, NOAM;SAFT, ISAAC;REEL/FRAME:040947/0605

Effective date: 20161214

STPP Information on status: patent application and granting procedure in general

Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER

STPP Information on status: patent application and granting procedure in general

Free format text: FINAL REJECTION MAILED

STPP Information on status: patent application and granting procedure in general

Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION

AS Assignment

Owner name: SILICON VALLEY BANK, MASSACHUSETTS

Free format text: INTELLECTUAL PROPERTY SECURITY AGREEMENT;ASSIGNOR:VATBOX LTD;REEL/FRAME:051187/0764

Effective date: 20191204

STPP Information on status: patent application and granting procedure in general

Free format text: NON FINAL ACTION MAILED

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION