WO2000060531A2 - Extracting user data from a scanned image of a pre-printed form - Google Patents
Extracting user data from a scanned image of a pre-printed form Download PDFInfo
- Publication number
- WO2000060531A2 WO2000060531A2 PCT/US2000/009199 US0009199W WO0060531A2 WO 2000060531 A2 WO2000060531 A2 WO 2000060531A2 US 0009199 W US0009199 W US 0009199W WO 0060531 A2 WO0060531 A2 WO 0060531A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- pixels
- data
- shared
- printed
- user data
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/14—Image acquisition
- G06V30/148—Segmentation of character regions
- G06V30/155—Removing patterns interfering with the pattern to be recognised, such as ruled lines or underlines
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Definitions
- This invention pertains to extracting information from a scanned version of a pre-printed form. More specifically, the invention pertains to a fast and convenient technique for processing scanned image data to obtain information placed on a form document and distinguish that information from the form document itself.
- Pre-printed forms are commonly used for many applications. For some applications, such as medical claim forms, the forms are used in such great numbers that computerized "reading" of the forms is not only desirable but essential.
- Form documents are typically pre-printed sheets of paper that have blanks or open areas on them where information is to be supplied by the person completing the form. This information, referred herein as "user data" may be entered by hand or a printing device such as a printer or typewriter.
- Other examples of commonly used forms include shipping documents, purchase orders, insurance records, and so forth.
- the form may have been pre-printed with instructions, boxes to be filled in, and other markings. Removable of the pre-printed markings before attempting to "read" the user data, for example using optical character recognition (OCR) systems, is highly desirable.
- OCR optical character recognition
- pre-printed form is well defined, for example in a computer data file, or if a clean, blank pre-printed form is available for scanning, one can take steps to "subtract" the pre-printed indicia from a completed form, leaving only the user data for a subsequent interpretation.
- This approach would work fine, theoretically, but in practice there are often conflicts between the pre-printed markings and the user data. This occurs wherever the user data overlaps the pre-printed markings.
- Digital data corresponding to these common areas as referred to later as "shared pixels.” Shared pixels can arise when a line or box on the form intersects the data to be extracted, for example as illustrated in Figure 1 (further described below).
- U.S. Patent No. 5,694,494 entitled "electronic retrieval of information from form documents” describes a method for retrieving user data from a scanned version of a completed document. That method includes the steps of obtaining a first image of the document having information printed thereon in its blank format before other information has been added to it by the user. A second image of the document is obtained after information has been added to it by the user. The two images are aligned, and for each pixel in the first image which corresponds to information on the second document, those pixels are deleted from the second image to create an image which corresponds to subtraction of the first image from the second image. Finally, a step is performed to electronically restore the information added by the user which was deleted during the subtraction operation. See abstract. While the described "subtraction" step is not difficult, restoring missing data is quite challenging. The methods described in the '494 patent for restoring missing user data require extensive analysis and calculations, and therefore can be expected to be reasonably slow in operation.
- a completed form is optically scanned or otherwise processed so as to form digital data representative of the completed form.
- the digital data is created or further processed if necessary to reduce it to binary pixel data, in which each pixel is assigned a single binary value, for example 0 indicating white and 1 indicating black.
- binary pixel data in which each pixel is assigned a single binary value, for example 0 indicating white and 1 indicating black.
- Various threshold methods are known for reducing gray scale data to form binary data.
- the resulting data may conveniently stored in the form of a matrix or array of bits, each bit corresponding to one pixel of composite image.
- the present invention calls for identifying all common or "shared pixels" in the composite image.
- the number of shared pixels attached in this manner preferably varies in response to the point size of the characters to be extracted and, hopefully, later recognized.
- the specific pattern or geometric arrangement of the attached pixels also can be varied, again preferably in response to the nature of the characters to be extracted.
- the method calls for removing from the composite image all pixels that are part of the pre-printed form except the attached shared pixels.
- This "form fracturing" process has the effect of preserving most if not all of the data pixels that would otherwise be removed as being part of the pre-printed form.
- the technique does not always preserve all of the user data pixels, but it does provide at least a useful approximation of the user data pixels. By useful approximation we mean data that will lend itself to optical character recognition with an acceptable error rate.
- the scanned composite image can be efficiently stored in a computer memory and processed using one bit to represent each pixel of the image. This can be done for an entire form or some portion of it.
- an 8" x 11" form translates to an image of approximately 2400 by 3300 pixels for a total of 7.9 megabits— a modest amount of memory by contemporary standards.
- a given computer implemented a 24-bit word size, it would take only 100- word-long array to represent one horizontal line or scan line of an 8-inch-wide form at 300 dpi. Approximately 150 lines would adequately cover the length of the form.
- the image data can conveniently be described and implemented as a matrix.
- the present invention is directed to a computer-implemented method for extracting user data from a composite image where the composite image is presented as digital data defining a matrix of pixels and the method comprises the following steps: receiving a matrix F of form pixel data defining the pre-printed form; receiving a second matrix X of composite image pixel data defining the composite image; determining an intersection of the F and X of matrixes to form another matrix
- the shared pixels subtracting the shared pixel matrix S from the composite image matrix X to form a fourth matrix Y defining non-shared user data pixels.
- the shared pixel matrix S is compared to the non-shared user data matrix Y so as to identify all shared pixels that are non-diagonally adjacent to at least one non-shared user data pixel, and the result is another matrix Z defining attached pixels.
- the Z matrix can be amended to include more than one attached pixel for each shared pixel that is non-diagonally adjacent to a non-shared user data pixel.
- the method calls for subtracting the pre-printed form matrix F, except for the attached pixels Z, from the composite image pixels X to recover at least a useful approximation of the user data pixels.
- the present invention can be implemented on virtually any digital computer. Specific implementation can be effected by any programmer of ordinary skill in the art in view of the present description.
- the result of the described invention is that, for example when zone descriptions and user data overlap, a larger portion of the user data character(s) remains intact, thus providing the OCR a more concrete feature extraction (character) to work with.
- This technology is not only helpful with OCR, but it can also make the filled in form more easily readable to humans.
- the technique just summarized requires that the blank form (or data defining the blank form, matrix F) is available to carry out the described process.
- Another aspect of the invention is directed to recovering user data without requiring that the pre-printed form be available or defined a priori.
- This technique called “virtual dropout,” employs a special new type of form that can be removed from a composite image by appropriate processing, leaving only the user-entered data for subsequent recognition.
- the form is printed using "speckles" as defined shortly in lieu of solid lines.
- speckles that make up the form are easily removed, while the user data, which generally comprises contiguous markings, remains. Speckles are most effective for forming straight lines such as constraint boxes. Speckles are defined as small clumps or blocks of pixels having limited size, for example of maximum of 36 pixels (6x6) at 300 dpi.
- Speckles should be spaced apart in a range of approximately 6 to 10 pixels. This makes it easy to isolate and remove them, while a series of such speckles still does its intended job of forming a line or constraint box visible to the user completing the form. The box just appears lighter than if it were formed of a solid line. Speckles preferably are defined by "speckle size" which is the number of non-diagonally contiguous pixels, in other words, the number of contiguous pixels that are neighboring on horizontal or vertical sides. Examples are illustrated later in the drawings. Software can be employed to readily find and remove speckles of predetermined size (or less).
- one aspect of the invention is a printed form or sheet having a constraint box printed thereon, the constraint box being formed of a series of speckles of predetermined maximum size and spaced apart no more than a predetermined maximum spacing such that the constraint box is apparent to the naked eye.
- Another aspect of the invention is directed to a method of recognizing such speckles for removal from a completed form image, thereby leaving the user data for subsequent recognition.
- another "virtual dropout" technique calls for forming the pre-printed features of a form, such as instructions or constraint boxes, using a shaded, thin line.
- a shaded line is one having a gray scale level less than a predetermined maximum, e.g. , 25 or 50 percent. (The line need not be formed solely of black ink; color images also have a gray scale equivalent value.)
- Fig. 2 illustrates a portion of a completed form in which user data overlaps a pre-printed zone description on the form.
- Fig. 3 is an enlarged view of a selected region of Figure 2 illustrating form pixels, shared pixels, and non-shared user data pixels.
- Figs. 7A-7G illustrate various "speckle" configurations. Detailed Description of Preferred Embodiment
- Figure 1 illustrates a portion of a completed form in which the user data overlaps a line, for example, a portion of a constraint box, that was pre-printed on the form.
- individual squares for example, squares 10 and 12
- User data which may have been entered using a typewriter or the like, comprises the letters "FRE" indicated in the drawing by reference numbers 14 ("F"), 16 ("R") and 18 ("E").
- F the letters "FRE” indicated in the drawing by reference numbers 14 ("F"), 16 (“R”) and 18 (“E”).
- each character has a size of approximately 8 pixels wide and 12 pixels high although this can vary substantially. For example, at a resolution of 300 dots per inch (dpi), there would be a little over four pixels per point, where a point as 1/72 of a inch.
- dpi dots per inch
- a horizontal bar 20 represents a pre-printed portion of the form, such as a constraint box, that extends horizontally through the letters FRE and is two pixels high.
- F a pre-printed portion of the form
- shared pixels there are a total of nine pixels in which the letter and the form overlap, referred to in this specification and the appended claims as "shared pixels.”
- shared pixels are 22, 24, 26, 28 and five more pixels extending laterally to the right of pixel 26. OCR is impeded by the presence of shared pixels as mentioned above. Below, we describe a process for recovering the user data portion of the data.
- Figure 2 illustrates an example where user data, here the word "MOSER” is overlapping information that was pre-printed on a form, viz. a zone description "Patient Name.” While the user data is illustrated here as being typed, it could also be entered by hand. The present invention is applicable independent of the means used to enter user data on the pre-printed form.
- Figure 3 is an enlarged view of a portion of Figure 2, showing where the letter M (user data) overlaps the zone description. A relatively small number of pixels are shown here (as small boxes) to simplify the illustration.
- pixels 404 a portion of the letter M which does not overlap the letter I is illustrated as pixels 404.
- Shared pixels i.e. , pixels that are part of both the letter M and the letter I are indicated with superimposed diagonal lines forming an X such as pixel 406.
- pixel 408 is a shared pixel
- pixel 410 is a non-overlapping or non-shared user data pixel.
- the method of the invention calls for "attaching" to the non-shared data pixels all shared pixels that are non-diagonally adjacent to at least one non-shared data pixel.
- Figure 4B which shows the same pixel area as Figure 4A, pixels 410 and 412 are non-shared data pixels.
- FIG. 5B Graphic image data, or pixel data, can conveniently be stored and processed on a digital computer in matrix form.
- the process described above can be summarized as follows. First, receiving a matrix F of form-pixel data defining the pre-printed form. Second, receiving a second matrix X of composite image-pixel data defining the composite image, i.e. , a completed form. Third, determining an intersection of the F and X matrixes to form another matrix 5 defining shared pixels. Next, subtracting the shared-pixel matrix S from the composite matrix X to form a fourth matrix Y defining non-shared user-data pixels.
- the shared pixel matrix S is then compared to the non-shared user-data matrix Y so as to identify all shared pixels that are non-diagonally adjacent to at least one non-shared user-data pixel, and the result is another matrix Z defining attached pixels.
- the result is another matrix Z defining attached pixels.
- FIG. 4 and 5 we show attaching only a single shared pixel to an appropriate data pixel. However, multiple pixels can be attached in response to a single user data contacting (non-diagonally adjacent) a shared pixel. For example, two, three or more pixels can be attached to such a data pixel, thereby infiltrating into the shared pixel area, subject to the limitation that only shared pixels can be attached.
- 702 is a (non-shared) data pixel.
- One option would be to attach a single, shared pixel as described above.
- shaded pixel 704 would be attached to pixel 702.
- the policy might be to attach a pair of pixels, e.g. , 704, 706, to an adjacent data-pixel 702. But, 706 would be included only if it were, in fact, a shared pixel.
- Figures 7B, 7C, 7D and 7E show other examples of multiple-pixel speckles that could be attached to individual user- data pixels within the scope of the invention.
- FIG. 6A user data occupies a contiguous series of pixel 602 while the form data, which may be part of a constraint box or zone description, consists of pixels generally indicated as 604. Shared pixels are shown with a superimposed "x" as before.
- Figure 6B illustrates the process in which shared pixels are attached to non-diagonally adjacent data pixels.
- speckles of limited size are easily removed by software.
- the preferred maximum speckle size is about 36 pixels (6x6) or less at 300 dpi.
- the space between pixels should be approximately six to ten pixels. If the resolution is less than 300, the number of pixels should be scaled. So, for example, at 200 dpi the number of pixels per block or speckle is two-thirds of 36, which is 24.
- a typical pencil or other writing implement might have a line width of one-half to one millimeter.
- the line width of hand-print data entered on a form would be approximately 6 to 12 dots or pixels. Even a narrow line of six pixels' width will be contiguous over a length of many pixels in forming the typical alpha numeric character. So, even if speckles of size 6x6 or smaller are removed, it would not affect the user data.
- Figure 8A illustrates just one example of forming a constraint box (or in this case, just the upper-left corner of a constraint box) for virtual dropout.
- the constraint box is not formed of a continuous line. Rather, it is formed of a series of speckles, for example speckles 800 and 810.
- This constraint box is illustrated as being about 30 pixels wide and 40 pixels tall.
- the "line thickness" forming the box is five pixels, indicated at 812. This dimension is not critical and, certainly, a wider line width could be used.
- Each individual speckle has overall size of 25 pixels arranged in a 5x5 square, as illustrated. There is also a spacing between pixels of seven pixels, as shown at 814. The individual speckles are not contiguous, however.
- speckle 820 only alternate pixels are black, or "on," thus forming a checkerboard pattern. Since there are a total of nine black pixels out of 25 total pixels, each speckle will have an apparent gray-scale value of 9/25, which equals 36 percent. This configuration would result in a relatively faint constraint box, yet enough to guide the user in locating data within the box. The constraint box is easily removed through software processing. It may be observed that the speckles shown here for illustration have an effective speckle size of just one. An alternative speckle configuration is shown at 830. Here, a total of 15 out of 25 pixels are black, corresponding to 60 percent gray scale, while the speckle size is four. Next, we illustrated these principles in use.
- Figure 8B shows the constraint box of Figure 8A with a hand-print numeral 6 entered on the form as user data.
- the numeral 6 overlaps the constraint box at several places, and if the constraint box were formed of solid black, it would leave non-shared user-data pixels illustrated as fragments 840, 842 and 846. Those fragments might be difficult for a human reader or a character recognizer to correctly determine the data. Since the constraint box, however, is formed of speckles of limited size, it can be completely removed and, thus, the user data completely restored as indicated in Figure 8C. Many pre-printed forms are produced on offset or lithographic printing machines that have relatively high resolution, for example on the order of 2400 dpi.
- the typical 8 bit (monochrome) document scanner cannot and does not actually resolve individual pixels in the form. Rather, it determines the relative lightness or darkness of a pixel area (defined by the resolution of the scanner) and assigns it a value, for example between 0 and 255 in the case of an 8 bit scanner.
- a thresholding apparatus can be used to drive a gray scale pixel to black or white, depending in part on neighboring pixels. For example, if a light gray region, e.g., 25 percent gray scale, is surrounded by white, the gray scale pixels will be driven to black. On the other hand, gray pixels that are adjacent to a true black will be driven to white. The effect of all of this is that the scanner automatically "drops out" gray pixels, especially near black. This can be used to advantage to drop out the pre-printed aspects of a form.
- speckles will tend to be dropped out, as described, and are particularly apt to be driven to white where they are adjacent to true black, such as user data. Isolated speckles, i.e., those surrounded by white, may tend to be driven toward black by the scanner, but they are easily removed because of their small speckle size as described previously.
Landscapes
- Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Image Processing (AREA)
- Editing Of Facsimile Originals (AREA)
Abstract
Description
Claims
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AU40773/00A AU4077300A (en) | 1999-04-07 | 2000-04-07 | Extracting user data from a scanned image of a pre-printed form |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US12823299P | 1999-04-07 | 1999-04-07 | |
| US12822899P | 1999-04-07 | 1999-04-07 | |
| US60/128,228 | 1999-04-07 | ||
| US60/128,232 | 1999-04-07 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2000060531A2 true WO2000060531A2 (en) | 2000-10-12 |
| WO2000060531A3 WO2000060531A3 (en) | 2001-04-05 |
Family
ID=26826407
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2000/009199 WO2000060531A2 (en) | 1999-04-07 | 2000-04-07 | Extracting user data from a scanned image of a pre-printed form |
Country Status (3)
| Country | Link |
|---|---|
| US (1) | US6330357B1 (en) |
| AU (1) | AU4077300A (en) |
| WO (1) | WO2000060531A2 (en) |
Families Citing this family (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6728426B1 (en) * | 1999-08-23 | 2004-04-27 | International Business Machines Corporation | Compression of form images in gray-level |
| JP4133370B2 (en) * | 2003-01-28 | 2008-08-13 | 富士フイルム株式会社 | Electronic paper reader |
| US8326653B2 (en) * | 2003-03-04 | 2012-12-04 | Nuance Communications, Inc. | Method and apparatus for analyzing patient medical records |
| US20050177396A1 (en) * | 2004-01-14 | 2005-08-11 | Meir Gottlieb | Method and apparatus for performing concurrent patient coding for hospitals |
| US7487438B1 (en) * | 2005-03-08 | 2009-02-03 | Pegasus Imaging Corporation | Method and apparatus for recognizing a digitized form, extracting information from a filled-in form, and generating a corrected filled-in form |
| JP5677139B2 (en) * | 2011-03-07 | 2015-02-25 | 三菱電機株式会社 | Form character recognition device |
| US9111140B2 (en) * | 2012-01-10 | 2015-08-18 | Dst Technologies, Inc. | Identification and separation of form and feature elements from handwritten and other user supplied elements |
| US9019570B1 (en) | 2013-11-27 | 2015-04-28 | Mcgraw-Hill School Education Holdings Llc | Systems and methods for computationally distinguishing handwritten pencil marks from preprinted marks in a scanned document |
Family Cites Families (25)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4326258A (en) | 1980-01-31 | 1982-04-20 | Ncr Canada Ltd - Ncr Canada Ltee | Method and apparatus for reducing the gray scale resolution of a digitized image |
| US4741045A (en) | 1983-09-23 | 1988-04-26 | Dest Corporation | Optical character isolation system, apparatus and method |
| US4577235A (en) | 1984-08-20 | 1986-03-18 | The Mead Corporation | Text/continuous tone image decision processor |
| JPS62279483A (en) | 1986-05-28 | 1987-12-04 | Fujitsu Ltd | Contamination eliminating circuit for printing character information |
| US4817179A (en) | 1986-12-29 | 1989-03-28 | Scan-Optics, Inc. | Digital image enhancement methods and apparatus |
| US5038381A (en) | 1988-07-11 | 1991-08-06 | New Dest Corporation | Image/text filtering system and method |
| US5657395A (en) | 1989-03-02 | 1997-08-12 | Minolta Camera Kabushiki Kaisha | Image processing device providing improved image data processing by converting original image into a plurality of pixels and analyzing density data of each pixel |
| US5181255A (en) * | 1990-12-13 | 1993-01-19 | Xerox Corporation | Segmentation of handwriting and machine printed text |
| US5272764A (en) | 1989-12-08 | 1993-12-21 | Xerox Corporation | Detection of highlighted regions |
| US5579407A (en) | 1992-04-21 | 1996-11-26 | Murez; James D. | Optical character classification |
| US5680479A (en) | 1992-04-24 | 1997-10-21 | Canon Kabushiki Kaisha | Method and apparatus for character recognition |
| US5321768A (en) | 1992-09-22 | 1994-06-14 | The Research Foundation, State University Of New York At Buffalo | System for recognizing handwritten character strings containing overlapping and/or broken characters |
| US6005976A (en) | 1993-02-25 | 1999-12-21 | Fujitsu Limited | Image extraction system for extracting patterns such as characters, graphics and symbols from image having frame formed by straight line portions |
| JP3442466B2 (en) | 1993-04-12 | 2003-09-02 | 株式会社リコー | Image processing apparatus and image processing method |
| US5488491A (en) | 1993-07-06 | 1996-01-30 | Eastman Kodak Company | Image processing apparatus and method for eliminating background density in a designated image area |
| JP3547025B2 (en) | 1994-08-23 | 2004-07-28 | 株式会社リコー | Information extraction method |
| JP3345224B2 (en) | 1995-03-06 | 2002-11-18 | 富士通株式会社 | Pattern extraction device, pattern re-recognition table creation device, and pattern recognition device |
| FR2737930B1 (en) * | 1995-08-18 | 1997-10-31 | Itesoft | METHOD AND SYSTEM FOR RECOGNIZING MANUSCRIPTED CHARACTERS |
| JPH09168083A (en) | 1995-12-15 | 1997-06-24 | Murata Mach Ltd | Image processing unit |
| JP3830998B2 (en) | 1995-12-28 | 2006-10-11 | 株式会社東芝 | Ruled line removal method and character recognition apparatus using the same |
| US5883973A (en) | 1996-02-20 | 1999-03-16 | Seiko Epson Corporation | Method and apparatus for processing a document by segmentation into text and image areas |
| US6157738A (en) * | 1996-06-17 | 2000-12-05 | Canon Kabushiki Kaisha | System for extracting attached text |
| FR2756952B1 (en) * | 1996-12-06 | 1999-06-25 | Itesoft | MANUSCRIPT CHARACTERS RECOGNITION SYSTEM |
| JP3189870B2 (en) | 1996-12-24 | 2001-07-16 | シャープ株式会社 | Image processing device |
| US6023534A (en) * | 1997-08-04 | 2000-02-08 | Xerox Corporation | Method of extracting image data from an area generated with a halftone pattern |
-
2000
- 2000-04-07 WO PCT/US2000/009199 patent/WO2000060531A2/en active Application Filing
- 2000-04-07 US US09/544,802 patent/US6330357B1/en not_active Expired - Lifetime
- 2000-04-07 AU AU40773/00A patent/AU4077300A/en not_active Abandoned
Also Published As
| Publication number | Publication date |
|---|---|
| US6330357B1 (en) | 2001-12-11 |
| AU4077300A (en) | 2000-10-23 |
| WO2000060531A3 (en) | 2001-04-05 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP0677817B1 (en) | Page segmentation and character recognition system | |
| CA1160347A (en) | Method for recognizing a machine encoded character | |
| EP0677818B1 (en) | Image pre-processor for character recognition system | |
| US5778103A (en) | OCR image pre-processor | |
| US6351559B1 (en) | User-enclosed region extraction from scanned document images | |
| EP0677812A1 (en) | Document storage and retrieval system | |
| EP0677816B1 (en) | Programmable function keys for a networked personal imaging computer system | |
| JPS6159568A (en) | document processing device | |
| US5974177A (en) | Apparatus and method of network distribution of record data using transmittal symbols hand entered on a transmittal sheet | |
| CN1964422A (en) | Watermarked information embedding apparatus and method, and computer program | |
| US6330357B1 (en) | Extracting user data from a scanned image of a pre-printed form | |
| EP2580060B1 (en) | High resolution digital embossing | |
| EP0896294B1 (en) | Method for document rendering and character extraction | |
| US5228095A (en) | Apparatus for recognizing printed characters | |
| US20020015524A1 (en) | Image processing device, program product and system | |
| EP1310940A1 (en) | Color display device and method | |
| JPH1027214A (en) | Method and apparatus for separating contact characters in an optical character recognition computer | |
| US7221795B2 (en) | Document processing method, recording medium having recorded thereon document processing program, document processing program, document processing apparatus, and character-input document | |
| US7142733B1 (en) | Document processing method, recording medium recording document processing program and document processing device | |
| US7889925B2 (en) | Method, system and computer program for encoding and decoding a pixelized target graphic symbol | |
| EP0692768A2 (en) | Full text storage and retrieval in image at OCR and code speed | |
| JP3954247B2 (en) | Document input method, recording medium recording document input program, and document input device | |
| JP2025091589A (en) | Image processing device, image processing system and program | |
| JP3081510B2 (en) | Braille recognition system | |
| JP3281786B2 (en) | Braille copying machine |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AK | Designated states |
Kind code of ref document: A2 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A2 Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
| DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
| AK | Designated states |
Kind code of ref document: A3 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW |
|
| AL | Designated countries for regional patents |
Kind code of ref document: A3 Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
|
| REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
| 122 | Ep: pct application non-entry in european phase | ||
| NENP | Non-entry into the national phase |
Ref country code: JP |