CN109359652A

CN109359652A - A fast and automatic method for extracting rectangular scans from digital photos

Info

Publication number: CN109359652A
Application number: CN201811403640.1A
Authority: CN
Inventors: 黄晓平; 曹如军; 周全书
Original assignee: Zhejiang Sci Tech University ZSTU
Current assignee: Zhejiang Sci Tech University ZSTU
Priority date: 2018-11-23
Filing date: 2018-11-23
Publication date: 2019-02-19

Abstract

The invention relates to a method for quickly and automatically extracting a rectangular scan from a digital photo, comprising the following steps: 1) data downsampling and grayscale transformation; 2) image noise filtering and edge detection; 3) contour tracking to detect the largest contour Line; 4) Contour simplification and convex hull corner detection; 5) Region clipping and perspective distortion correction. The beneficial effects of the invention are as follows: the method of the present invention for quickly and automatically extracting a rectangular scan piece from an ordinary digital photo has no special requirements for the photographing environment and photographing conditions, no manual intervention is required in the processing process, anti-noise, fast calculation speed, high efficiency, and The processed digital archives occupy a small space, which is convenient for preservation and information exchange. The system with the present invention as the core can be used to quickly and automatically process photos of ordinary digital cameras, or scanned images of devices such as scanners, and can also be directly installed in smart devices such as mobile phones to process documents in real time.

Description

A method of the fast automatic extraction rectangular scanning part from digital photograph

Technical field

The present invention relates to a kind of digital image processing techniques field, especially a kind of extraction fast automatic from digital photograph The method of rectangular scanning part.

Background technique

It is protected compared to by all kinds of data (e.g., paper document, invoice, card, leaflet etc.) digitlization after scanner scanning It deposits, using ordinary digital camera (containing mobile phone camera etc.) acquisition (digitlization) these information, for achieving, transmitting or share also more Convenient and efficient.But with generic digital camera (shooting) acquisition archive data there are several main problems: one, target zone with (imaging) acquisition range is inconsistent, two, influence vulnerable to ambient lighting, three, data collected there are deformation, four, the figure of acquisition As data volume (Pixel Dimensions are million or ten million magnitude, and compressed file is usually number MB in a manner of jpeg) greatly, it is not easy to It saves or handles.

In view of data usually has rectangular outer frame, the prior art is usually to utilize for the method for scanned copy hough transform Hough transform detects straight line, recycles the both horizontally and vertically feature of straight line, detects rectangle.

Edge pixel is extracted from input picture；Straight line is extracted from edge pixel using Hough transformation, reselection is by a plurality of The rectangular area that four straight lines substantially orthogonal two-by-two are constituted in straight line.Its main deficiency is not consider the imaging of general camera Condition, method efficiency are lower, it is difficult to handle rotation and perspective deformation of data object etc..

Existing perspective distortion corrects technology, the whole of the main square boundary according to flat image or part, plan view Wire, text row, column etc. as in can indicate the information of perspective distortion feature, detect and determine perspective distortion parameter, and are rectified Just.Nearly horizontal, vertical straight line is extracted from the file and picture of input, determines going out for less parallel line group using voting mechanism Point, and then calculate perspective transform relationship.Its main deficiency is that straight-line detection efficiency is lower, and to local deformation or non-perspective Deformation is not applicable.

Summary of the invention

The shortcomings that the invention solves the above-mentioned prior arts provides a kind of efficient process generic digital camera in natural environment Under the conditions of, (scanning) archive of taking pictures has the automatic processing method of the document information of rectangular outer frame.The present invention can reduce all kinds of Light environment influence on RT, it is applied widely；It replaces the straight-line detection in conventional method to solve with Corner Detection to wait Select rectangular area；Inactive area automatic cutting reduces data-storing amount.

The technical scheme adopted by the invention to solve the technical problem: this extraction rectangle fast automatic from digital photograph is swept The method for retouching part, comprising the following steps: 1) data are down-sampled and greyscale transformation；2) image filter is made an uproar and edge detection；3) isopleth Tracing detection largest contours line；4) profile simplification and convex closure Corner Detection；5) region is cut out and perspective distortion is corrected.

Preferably, the data are down-sampled by the way of scaled down.

Preferably, the contour tracing eight neighborhood pixel-based, since outermost edge starting point, by counterclockwise Abutment points in its neighborhood are successively added to tracked marginal point and concentrated by direction, and iteration proceeds to not new marginal point and adds Until entering, if the edge length tracked be greater than whole image perimeter half, stop search other marginal informations, i.e., with This is as the candidate rectangle contour line tracked.

Preferably, convex closure Corner Detection uses the scanning method of 2D plane point set.

Preferably, region cut out and perspective distortion correct the step of include selecting region vertex first, calculate correspond to The vertex of the same name of rectangle calculates transformation matrix according to the corresponding relationship on candidate region vertex and rectangle vertex, so as to by institute The candidate region of selection is mapped to corresponding rectangular area, and the perspective distortion in original image is corrected in mapping process.

Invention has the advantages that: the side for extracting rectangular scanning part fast automatic from general digital photo of the invention Method, to photo environment, shooting condition without particular/special requirement, treatment process is not necessarily to manual intervention, antinoise (robust), calculating speed Fastly, high-efficient, and treated that digital archive part occupies little space, convenient for saving and information exchange.Using the present invention as core System can be used for the scan image of the fast automatic processing equipment such as ordinary digital camera photo or scanner, can also directly pacify Loaded on smart machines such as mobile phones, document information is handled in real time.

Detailed description of the invention

Fig. 1 is method process flow diagram of the invention；

Fig. 2 is polyline RDP (Ramer-Douglas-Peucker) algorithm schematic diagram；

Fig. 3 is that region is cut out and rectification of distortion schematic diagram；

Fig. 4 is the paper document (digital photograph) that regular handset is shot under low-light environment；

Fig. 5 is image gray-scale transformation and drafting results；

Fig. 6 be Edge extraction and refine after result；

Fig. 7 is that paper document (digital photograph) extracts and correct result.

Specific embodiment

The present invention will be further explained below with reference to the attached drawings:

Embodiment:

The method that the present invention is used to fast and automatically extract rectangular scanning file from digital photograph is suitable for general digital Camera or mobile phone etc., the image acquired under the conditions of all kinds of natural environments (illumination).Key step includes 1) data it is down-sampled and Greyscale transformation；2) image filter is made an uproar and edge detection；3) contour tracing detects largest contours line；4) profile simplification and convex closure angle point Detection；5) region is cut out and perspective distortion is corrected.Fig. 1 provides the key step and process flow of this method:

1) data are down-sampled and greyscale transformation:

It is rich in color since digital photograph (digital picture) data volume is big (sampled point is more, typically larger than four mega pixels) (usually 24 three colour imagings), need to pre-process data, to promote calculated performance.The present invention is down-sampled, uses The mode of scaled down, that is, keep the length-width ratio of image.Scaling ruler is typically chosen according to the size of input picture Between 1/6~1/2 (input picture is bigger, and scale bar s is smaller).The methods of bilinearity, cubic convolution can be used in sampling process；It examines Consider computational efficiency, also can use nearest point methods (nearest neighbor) and complete down-sampled process, it may be assumed that

X_dst=s X_src

Wherein, s is zoom factor,(image space), X_srcIndicate source pixel, X_dstIndicate object pixel.

Greyscale transformation is then by image by tri- color space transformation of RGB to brightness space Y:

Y←0.299·R+0.587·G+0.114·B

Wherein, RGB is three color components of image pixel, red (red), green (green), blue (blue).

In view of image capture environment (image-forming condition) is complicated (illumination and reflection environment etc.), to promote the method for the present invention Extensive adaptability carries out gray scale to image and draws high, with the robustness of method for improving and the accuracy of processing result.Greyscale transformation is adopted With linear transformation, the actual tonal range of image is mapped to standard grayscale range [0.0,1.0] or [0,255].

Transformation for mula is as follows: Y_dst=α * Y_src+ β,

Wherein, the ÷ of α=255 (max (Y_src)–min(Y_src)), β=0- α * min (Y_src)) (note: if tonal range is 1.0) 255 in formula are then revised as by [0.0,1.0],

Y_srcIndicate source pixel gray value, Y_dstIndicate the gray value of object pixel.

2) image filter is made an uproar and edge detection:

The main purpose that image filter is made an uproar is to improve the accuracy of edge detection.It is flat to filter adoptable all kinds of common images Sliding algorithm, such as mean filter, median filtering.It is briefly explained by taking 5 × 5 Gaussian kernels (σ=1) as an example below.According to Gaussian kernel letter Number can calculate convolution mask (integer value):

Convolution is carried out to the image after greyscale transformation with this template, can preferably be retained while filtering out high-frequency noise Edge feature.

The normal information such as the first derivative based on image, second dervative or gradient of edge detection.The present invention uses Canny operator Edge is detected, considers that image passes through greyscale transformation (stretching), lower threshold value is respectively set to the 3/8 of tonal range thereon, and 6/8 (to 8 Position gray level image, value 96,192).

3) contour tracing and largest contours line drawing:

The marginal information detected in previous step contains noise, and connectivity is relatively poor (discrete short edge), Before carrying out contour tracing, refine.Its processing step is first to carry out morphological dilations, connects discontinuous neighbour Edge fit edge, then carry out etching operation, edge is refined (using 3 × 3 square structure elements in the present invention, it is possible to use its His structural element).

Frontier tracing eight neighborhood pixel-based, since outermost edge starting point, successively by (side counterclockwise in its neighborhood To) abutment points be added to tracked marginal point and concentrate, iteration proceeds to until not new marginal point is added.If institute with The edge length (as unit of pixel) of track be greater than whole image perimeter (that is, 2 times of image it is wide high and) half, then stop searching Other marginal informations of rope, that is, in this, as the candidate rectangle contour line for tracking (detection).

4) profile is simplified and convex closure angle point calculates:

After extracting initial profile boundary, regularization or simplification are needed, to indicate true profile.Ramer- Douglas-Peucker algorithm or Douglas-Pecuker algorithm (RDP/DP), are common polyline reduction algorithm, mistake Journey is with the distance of given point to line (threshold value, this method in be set as the 1~4% of broken line length), and recurrence simplification is apart from threshold Point (Fig. 2) in value.

Profile angle point set after simplified after calculating its convex closure, selects four points closest to right angle as candidate regions The rectangle angle point in domain.By taking the scanning method of 2D plane point set as an example, illustrate convex closure calculating process: (right side) point makees starting point under selection most (that is, y-coordinate smallest point；There is y-coordinate minimum value if there is multiple points, then select the point wherein with maximum x coordinate)；It penetrates Line (direction (1,0) or x-axis positive) rotates counterclockwise around starting point, select on the smallest ray of scan angle point (if there is It is multiple, then select the point nearest apart from starting point) as next convex closure vertex；Using the point being newly added as rotation center, will scan Vector replaces the vector between the point and former point；Above step is repeated until returning to starting point.

(note: the time complexity of above-mentioned algorithm is O (n²), more efficient O (n log n) algorithm can also be used such as, Graham, Chan algorithm etc.)

If above-mentioned convex closure is more than four vertex, the angle point wherein closest to 90 ° is selected, four as region to be transformed The vertex of the same name of side shape (rectangle).

5) region is cut out and perspective distortion is corrected:

After selecting region vertex, need to calculate the vertex of the same name of corresponding rectangle.It sorts first to region vertex, it is specific to arrange Program process are as follows: select most lower-left point (or, point with min coordinates and (x+y)) as starting point P₀, others point is then by the inverse time Needle (or clockwise) direction sequencing is P₁, P₂, P₃.Corresponding rectangle apex coordinate be respectively (counter clockwise direction): (0,0), (w, 0), (w, h), (0, h), in which:

W=max (| P₀P₁|,|P₂P₃|)

H=max (| P₀P₃|,|P₂P₁|)

Wherein, | | indicate line segment length (if up time needle sort, the adjustment of rectangle vertex correspondence).

According to the corresponding relationship on candidate region vertex and rectangle vertex, transformation relation (3 × 3 matrix) can be calculated:

It is denoted as: A_3×3X_3×4=U_3×4, then, and A X X^T=U X^T, that is, have, A=U X^T(X X^T)^-。

After calculating transformation matrix, selected candidate region can be mapped to corresponding rectangular area, in original image Perspective distortion, be able in mapping process correct (Fig. 3).(note: including rotation, ratio, translation in above-mentioned transformation matrix, And mistake such as cuts, has an X-rayed at the transformation)

The fast automatic method for extracting rectangular scanning part in slave general digital photo of the invention, to photo environment, shooting Condition is not necessarily to manual intervention without particular/special requirement, treatment process, and antinoise (robust), calculating speed is fast, high-efficient, and treated Digital archive part occupies little space, convenient for preservation and information exchange.Using the present invention as the system of core, it can be used for fast automatic place The scan image for managing the equipment such as ordinary digital camera photo or scanner, can also be directly mounted at the smart machines such as mobile phone, real When handle document information.

It is illustrated below using automatically extracting rectangular scanning part from the photo that certain mobile phone is shot as embodiment.Photograph taking In indoor environment, illumination is insufficient (whole image is obviously partially dark).Image size is 2592 × 1944 pixels, and 24 RGB tri- are colored, With the storage of jpeg format, occupied space is 1.3 Mbytes.Target area is rectangle paper document, occupies entire imaging region (number Word photo) major part, relative to image direction (coordinate system), there are rotation and perspective distortions.Data are illustrated such as Fig. 4 institute Show.

Data are carried out down-sampled rear (scale bar 1/4) using method of the invention, candidate image size is 648 × 496 Pixel is converted to 8 gray level images, wherein gray scale maximum value be 122, minimum value be 7 (brightness is obviously relatively low), need to its into Row greyscale transformation, to promote the accuracy of edge detection.That is, calculating the coefficient in greyscale transformation formula, Y_dst=α * Y_src+ β, α=2.21739, β=- 15.52174.Greyscale transformation result is as shown in Figure 5.

Gauss filter is carried out to gray level image to make an uproar, and using Canny operator detection image edge, then carries out utilizing morphological operator After carrying out refined processing (that is, first expanded with 3 × 3 square structure elements, corrode again) to extracted edge, for extracting equivalence Line (Fig. 6).

After selecting largest contours line (outermost profile), it is simplified using RDP algorithm, then calculate its convex closure, and selected most Close to the angle point of four convex closure vertex as candidate region at right angle, respective coordinates are (clockwise): (49,123), (104, 604), (484,543), (393,77)；Map that target rectangle region: (0,0), (0,384), (484,384), (484, 0).After (equal proportion amplification) calculates transformation matrix (comprising translation, rotation and perspective etc.), the candidate region application of original image is become It changes, automatic shearing cuts down and merge mapping (rotation and perspective correction) to target (rectangle) region.Image after processing only includes body paper matter File region, size are 1936 × 1536 pixels, about the 69% of original image size；Jpeg format (relevant parameter and original image It is identical) file stores the space occupied about 0.7Mb, than the memory space that original image saves about 45%, and treated image (rectangle Paper document) horizontal or vertical direction it is more neat, meet the visual custom (Fig. 7) of people.

In addition to the implementation, the present invention can also have other embodiments.It is all to use equivalent substitution or equivalent transformation shape At technical solution, fall within the scope of protection required by the present invention.

Claims

1. A method for quickly and automatically extracting a rectangular scan piece from a digital photo, comprising the following steps: 1) data downsampling and grayscale transformation; 2) image noise filtering and edge detection; 3) contour tracking detection maximum contour line ; 4) Contour simplification and convex hull corner detection; 5) Region clipping and perspective deformation correction.

2 . The method for quickly and automatically extracting rectangular scans from digital photos according to claim 1 , wherein the data downsampling adopts a proportional reduction method. 3 .

3. The method for quickly and automatically extracting a rectangular scan piece from a digital photo according to claim 1, wherein the contour tracing is based on eight neighborhoods of pixels, starting from the starting point of the outermost edge, and pressing counterclockwise The direction adds the adjacent points in its neighborhood to the tracked edge point set in turn, and iterates until no new edge points are added. If the tracked edge length is greater than half of the perimeter of the entire image, stop searching for other edge information. That is, it is taken as the tracked candidate rectangle contour line.

4. The method for quickly and automatically extracting a rectangular scan piece from a digital photo according to claim 1, wherein the convex hull corner point detection adopts a scanning method of a 2D plane point set.

5. the method for quickly and automatically extracting a rectangular scan piece from a digital photo according to claim 1, is characterized in that: the step of region clipping and perspective deformation correction comprises, at first select the region vertex, calculate the vertex of the same name corresponding to the rectangle, according to The corresponding relationship between the vertices of the candidate area and the vertices of the rectangle is calculated, and the transformation matrix is calculated, so that the selected candidate area can be mapped to the corresponding rectangular area, and the perspective deformation in the original image can be corrected during the mapping process.