CN107330930A

CN107330930A - Depth of 3 D picture information extracting method

Info

Publication number: CN107330930A
Application number: CN201710502470.1A
Authority: CN
Inventors: 邓浩; 于荣; 陈树强; 余金清
Original assignee: Jinjiang Tide Photoelectric Technology Co Ltd; University of Electronic Science and Technology of China
Current assignee: Jinjiang Tide Photoelectric Technology Co Ltd; University of Electronic Science and Technology of China
Priority date: 2017-06-27
Filing date: 2017-06-27
Publication date: 2017-11-07
Anticipated expiration: 2037-06-27
Also published as: CN107330930B

Abstract

The present application discloses a method for extracting the depth of a three-dimensional object, which solves the problem that the algorithm is too complicated in the process of extracting the depth information of the current 3D image. The method for extracting the depth of a three-dimensional object obtains the depth of the pixel by calculating the similarity of adjacent pixels in each element image in the two-dimensional plane image. This extraction method is a fast and accurate method for depth extraction of 3D objects, considering integrated imaging (II) and synthetic aperture integral imaging, by assuming that a 3D object is a surface composed of many aspects, a depth extraction method is developed based on the Patch‑Match algorithm Mathematical frame. The present application also discloses a device using the method for extracting the depth of a three-dimensional object.

Description

3D Image Depth Information Extraction Method

技术领域technical field

本发明涉及光学信息工程领域的信息提取技术，特别涉及三维图像深度信息提取的技术。The invention relates to an information extraction technology in the field of optical information engineering, in particular to a technology for extracting depth information of a three-dimensional image.

背景技术Background technique

作为下一代显示技术，近年来三维(3D)成像技术的飞速发展。综合成像(II)(英文为：Integrated imaging(II))涉及其高分辨率和全视差。它与传统的图像处理技术(如超分辨率和图像匹配)兼容。为了实现3D成像和显示。综合成像(II)需要来自3D对象(称为元素图像)的不同视角，这些对象通常由通用综合成像(II)系统中的小透镜阵列拾取。由于使用标准2D图像，可以使用具有小透镜阵列或廉价成像器阵列的单个便宜的相机来构建多尺度3D成像系统。现有技术已经实现了许多研究成果，包括3D显示和自动目标识别。As a next-generation display technology, three-dimensional (3D) imaging technology has developed rapidly in recent years. Integrated imaging (II) (English: Integrated imaging (II)) involves its high resolution and full parallax. It is compatible with traditional image processing techniques such as super-resolution and image matching. In order to realize 3D imaging and display. Integrated imaging (II) requires different perspectives from 3D objects (called elemental images), which are usually picked up by small lens arrays in general integrated imaging (II) systems. Due to the use of standard 2D images, multiscale 3D imaging systems can be constructed using a single inexpensive camera with an array of small lenses or an array of inexpensive imagers. Existing technologies have achieved many research results, including 3D display and automatic object recognition.

深度提取被称为综合成像(II)的最重要的问题之一。许多研究者已经注意到综合成像(II)的深度提取。然而，现有技术已经提出的方法存在的缺点是低分辨率元素图像或复杂的算法。Depth extraction is known as one of the most important problems in integrated imaging (II). Many researchers have paid attention to depth extraction for integrated imaging (II). However, the methods that have been proposed in the prior art suffer from low-resolution elemental images or complex algorithms.

发明内容Contents of the invention

根据本申请的一个方面，提出一种三维图像深度信息的提取方法，解决目前三维图像深度信息提取过程中算法过于复杂的问题。该提取方法是一种快速准确的三维物体深度提取方法，考虑了综合成像(II)和合成孔径积分成像(英文为：synthetic apertureintegral imaging)，通过假设三维对象是由许多方面构成的表面，基于Patch-Match算法开发了深度提取的数学框架。According to one aspect of the present application, a method for extracting depth information of a three-dimensional image is proposed, which solves the problem that the algorithm is too complicated in the current process of extracting depth information of a three-dimensional image. This extraction method is a fast and accurate method for extracting the depth of three-dimensional objects. It considers integrated imaging (II) and synthetic aperture integral imaging (English: synthetic aperture integral imaging). By assuming that three-dimensional objects are surfaces composed of many aspects, based on Patch -Match algorithm develops a mathematical framework for deep extraction.

所述三维对象深度提取方法，通过计算二维平面图像中每个元素图像中相邻像素的相似度，进而得到所述像素的深度。The method for extracting the depth of a three-dimensional object obtains the depth of the pixel by calculating the similarity of adjacent pixels in each element image in the two-dimensional plane image.

优选地，在计算二维平面图像中每个元素图像中像素的相似度时，假设相邻像素在同一平面上，并用多个小平面对表面进行建模。Preferably, when calculating the similarity of pixels in each element image in a two-dimensional plane image, it is assumed that adjacent pixels are on the same plane, and multiple small planes are used to model the surface.

优选地，所述计算二维平面图像中每个元素图像中相邻像素的相似度，采用相邻像素传播和随机优化的多循环算法。Preferably, the calculation of the similarity of adjacent pixels in each element image in the two-dimensional plane image adopts a multi-cycle algorithm of adjacent pixel propagation and random optimization.

优选地，包括将横向像素初始化为随机平面和迭代计算相邻像素的相似度的步骤。Preferably, it includes the steps of initializing the horizontal pixels to a random plane and iteratively calculating the similarity of adjacent pixels.

进一步优选地，所述将横向像素初始化为随机平面，包括步骤：Further preferably, the initialization of the horizontal pixels to a random plane includes the steps of:

将横向像素初始化为随机平面；Initialize horizontal pixels to a random plane;

每个像素的初始深度设定为随机值，通过每个像素的表面的法线矢量被设置为随机单位向量。The initial depth of each pixel is set to a random value, and the normal vector of the surface through each pixel is set to a random unit vector.

进一步优选地，所述将横向像素初始化为随机平面，包括如下过程：Further preferably, said initializing the horizontal pixels as random planes includes the following process:

通过所述横向像素的深度坐标的平面，由式(5)表示，The plane passing through the depth coordinates of the horizontal pixels is represented by formula (5),

z＝f₁▽p_x+f₂▽p_y+f₃ 式(5)z＝f ₁ ▽p _x +f ₂ ▽p _y +f ₃ formula (5)

其中，z为所述横向像素的深度坐标，和p_x和p_y为随机平面，f ₁、f₂和f ₃分别如式(6-1)、式(6-2)和式(6-3)所示，Wherein, z is the depth coordinate of the horizontal pixel, and p _x and p _y are random planes, and f ₁ , f ₂ and f ₃ are as in formula (6-1), formula (6-2) and formula (6- 3) as shown,

f₁＝-n₁/n₃ 式(6-1)f ₁ =-n ₁ /n ₃ formula (6-1)

f₂＝-n₂/n₃式(6-2)f ₂ =-n ₂ /n ₃ formula (6-2)

f₃＝(n₁·x₀+n₂·y₀+n₃·z₀)/n₃ 式(6-3)f ₃ =(n ₁ ·x ₀ +n ₂ ·y ₀ +n ₃ ·z ₀ )/n ₃ Formula (6-3)

式(6-1)、式(6-2)和式(6-3)中，n₁、n₂和n₃是标量、是由如式(7)所示的数值向量表示最小聚合成本所在的所有可能平面，x₀和y₀分别为初始化的所述横向像素的坐标数值，z₀为初始化的所述横向像素的初始深度值，In formula (6-1), formula (6-2) and formula (6-3), n ₁ , n ₂ and n ₃ are scalars, which are numerical vectors shown in formula (7) Indicates all possible planes where the minimum aggregation cost is located, x ₀ and y ₀ are the coordinate values of the initialized horizontal pixels respectively, z ₀ is the initial depth value of the initialized horizontal pixels,

式(7)中m由式(8)提供，In formula (7), m is provided by formula (8),

式(8)中，w用于实现自适应加权，w由式(9)提供；E表示相似性计算因素，E由式(10)提供；▽表示梯度值，W_p表示集中在p的一个方形窗口，In formula (8), w is used to implement adaptive weighting, and w is provided by formula (9); E represents the similarity calculation factor, and E is provided by formula (10); ▽ represents the gradient value, and W _p represents a square window,

式(9)中，||I_p-I_q||表示两相邻像素p和q间的距离，p为横向像素，q为与p在同一平面内的相邻像素，In formula (9), ||I _p -I _q || represents the distance between two adjacent pixels p and q, p is a horizontal pixel, q is an adjacent pixel in the same plane as p,

E＝α||I_i-I_j||+(1-α)||▽I_i-▽I_j|| 式(10)E＝α||I _i -I _j ||+(1-α)||▽I _i -▽I _j || Formula (10)

式(10)中，I是元素图像中像素的强度，下标i，j是元素图像的索引，I_i，I_j分别表示第i，第j个元素图像中的相应像素的强度，I_i和I_j将投射到相同的空间点，I_i和I_j的坐标由式(11)计算得到，||I_i-I_j||为RGB空间中的I_i和I_j的颜色的曼哈顿距离，▽I_i和▽I_j是像素的灰度值梯度，||▽I_i-▽I_j||表示在I_i和I_j计算的灰度梯度的绝对差，α是没有单位的权重因子，用于平衡颜色和渐变项的影响；In formula (10), I is the intensity of the pixel in the element image, the subscripts i and j are the index of the element image, I _i , I _j represent the intensity of the corresponding pixel in the i-th and j-th element image respectively, I _i and I _j will be projected to the same space point, the coordinates of I _i and I _j are calculated by formula (11), ||I _i -I _j || is the Manhattan distance of the colors of I _i and I _j in RGB space , ▽I _i and ▽I _j are the gray value gradient of the pixel, ||▽I _i -▽I _j || represents the absolute difference of the gray gradient calculated at I _i and I _j , α is the weight factor without unit , used to balance the influence of color and gradient items;

式(11)中，u_i是每个元素图像中对应于坐标为y和z的点的像素的局部坐标。In Equation (11), u _i are the local coordinates of the pixel corresponding to the point with coordinates y and z in each element image.

作为一个具体的实施方式，所述方法在使用综合成像(II)系统的计算机上执行。As a specific embodiment, the method is executed on a computer using an integrated imaging (II) system.

进一步优选地，所述迭代计算相邻像素的相似度，包括步骤：Further preferably, the iterative calculation of the similarity of adjacent pixels comprises the steps of:

a、初始化一个随机平面内的一个横向像素并计算其深度坐标和向量值，计算其聚合成本，将此聚合成本作为参考聚合成本；a. Initialize a horizontal pixel in a random plane and calculate its depth coordinate and vector value, calculate its aggregation cost, and use this aggregation cost as a reference aggregation cost;

b、计算与步骤a中横向像素在同一平面内的任意一相邻像素的聚合成本；b. Calculate the aggregation cost of any adjacent pixel in the same plane as the horizontal pixel in step a;

c、比较步骤a中参考聚合成本与步骤b中相邻像素的聚合成本；c. comparing the reference aggregation cost in step a with the aggregation cost of adjacent pixels in step b;

d.将步骤c中比较小的聚合成本对应像素作为新的参考值；d. Use the pixel corresponding to the relatively small aggregation cost in step c as a new reference value;

e.将步骤d中新的参考值对应像素设为与该对比参考值的对应像素左上相邻；e. Set the pixel corresponding to the new reference value in step d to be adjacent to the upper left of the corresponding pixel of the comparison reference value;

f.设定条件：步骤d中新的参考值对应深度值在最大允许范围内；f. Setting conditions: the depth value corresponding to the new reference value in step d is within the maximum allowable range;

g.如果步骤f条件成立，则循环执行步骤a到步骤f；g. If the condition of step f is established, execute step a to step f in a loop;

l.步骤f条件不成立，将最后一次循环步骤e中作为图像最左边像素；l. The condition of step f is not established, and the last loop step e is used as the leftmost pixel of the image;

m.在步骤l的基础上，图像右下进行下降偶次迭代；m. On the basis of step l, the lower right of the image is performed to descend even iterations;

n.根据步骤m的迭代次数计算每个像素的计算次数。n. Calculate the number of calculations for each pixel according to the number of iterations in step m.

进一步优选地，所述迭代计算相邻像素的相似度，包括空间传播和平面精修的步骤；Further preferably, the iterative calculation of the similarity of adjacent pixels includes the steps of spatial propagation and plane refinement;

所述空间传播的步骤中，相邻像素点设定为在同一个平面上，首先由式(8)评估不同情况的成本m，In the step of spatial propagation, adjacent pixel points are set to be on the same plane, and the cost m of different situations is first evaluated by formula (8),

式(8)中，p表示当前像素，f_p是其对应的平面的向量，q是p的相邻像素，在p(x₀，y₀)下分别用f_p和f_q计算，以评估这两种情况的成本；检查条件如式(12)所示，In formula (8), p represents the current pixel, f _p is the vector of its corresponding plane, and q is the adjacent pixel of p, which is calculated by f _p and f _q respectively under p(x ₀ , y ₀ ) to evaluate The cost of these two cases; the check condition is shown in formula (12),

m(x₀，y₀，f_p')<m(x₀，y₀，f_p)；式(12)m(x ₀ , y ₀ , f _p ')<m(x ₀ , y ₀ , f _p ); formula (12)

式(12)中的和分别由式(8)得到；and in formula (12) are respectively obtained by formula (8);

如果式(12)所示的表达式成立，则f_q被接受为p的新向量，即f_p＝f_q；If the expression shown in formula (12) is established, then f _q is accepted as a new vector of p, i.e. f _p =f _q ;

在奇数迭代中，q是左边和上边界；In odd iterations, q is the left and upper bounds;

在偶数迭代中，q是右边界和下边界；In even iterations, q is the right and bottom bounds;

所述平面精修的步骤中，将f_p转换为法向量n_p，两个参数▽z和▽n被定义为分别限制z₀和n的最大允许变化，z₀'计算为z₀'＝z₀+▽z，其中▽z位于[-▽z^max，▽z^max]，并且n'＝u(n+▽n)，u()表示计算单位向量，▽n位于[-▽n^max，▽n^max]；In the step of plane refinement, f _p is converted into normal vector n _p , two parameters ▽z and ▽n are defined to limit the maximum allowable variation of z ₀ and n respectively, and z ₀ ' is calculated as z ₀ '= z ₀ +▽z, where ▽z is located at [-▽z ^max , ▽z ^max ], and n'=u(n+▽n), u() represents the calculation unit vector, ▽n is located at [-▽n ^max , ▽ ^nmax ];

最后，通过p和n'得到一个新的f_p'，如果m(x₀，y₀，f_p')<m(x₀，y₀，f_p)，则f_p＝f_p'；Finally, get a new f _p ' through p and n', if m(x ₀ , y ₀ , f _p ')<m(x ₀ , y ₀ , f _p ), then f _p =f _p ';

所述平面精修的步骤中，从设置▽z^max＝maxdisp/2开始，其中maxdisp是允许的最大视差，▽n^max＝1，每次细化后，参数将更新为▽z^max＝▽z^max/2、▽n^max＝▽n^max/2；直到▽z^max<resolution/2，得到最小化的分辨率；对于奇数迭代，从图像的左侧开始，向右下进行偶数迭代；In the step of plane refinement, start from setting ▽z ^max =maxdisp/2, where maxdisp is the maximum allowable parallax, ▽n ^max =1, after each refinement, the parameter will be updated to ▽z ^max =▽z ^max /2, ▽n ^max ＝▽n ^max /2; until ▽z ^max <resolution/2, the minimized resolution is obtained; for odd iterations, start from the left side of the image and proceed to even iterations down to the right;

迭代后获得相邻像素的相似度，进而得到所述三维对象的深度。After iteration, the similarity of adjacent pixels is obtained, and then the depth of the three-dimensional object is obtained.

进一步优选地，z₀以固定值初始化为所有像素，并且在迭代之前添加精修步骤。It is further preferred that z ₀ is initialized to a fixed value for all pixels, and a refinement step is added before the iteration.

根据本申请的一个方面，提供一种由二维图片获得三维图像信息的装置，解决目前3D图像深度信息提取过程中算法过于复杂的问题。该由二维图片获得三维图像信息的装置是一种快速准确的三维物体深度提取装置，考虑了综合成像(II)和合成孔径积分成像(英文为：synthetic aperture integral imaging)，通过假设3D对象是由许多方面构成的表面，基于Patch-Match算法开发了深度提取的数学框架。According to one aspect of the present application, a device for obtaining 3D image information from a 2D image is provided, which solves the problem that algorithms are too complicated in the current 3D image depth information extraction process. The device for obtaining three-dimensional image information from two-dimensional pictures is a fast and accurate three-dimensional object depth extraction device, which considers comprehensive imaging (II) and synthetic aperture integral imaging (English: synthetic aperture integral imaging), by assuming that the 3D object is A surface composed of many facets, a mathematical framework for depth extraction was developed based on the Patch-Match algorithm.

所述由二维图片获得三维图像信息的装置包括图片采集单元，图像存储单元和图像处理单元；The device for obtaining three-dimensional image information from two-dimensional pictures includes a picture acquisition unit, an image storage unit and an image processing unit;

所述图片采集单元和图像存储单元电连接，所述图像存储单元和图像处理单元电连接；The picture acquisition unit is electrically connected to the image storage unit, and the image storage unit is electrically connected to the image processing unit;

所述图像处理单元采用上述所述三维对象深度提取方法中的至少一种，获得二维图片中对象的深度信息，建立三维图像。The image processing unit adopts at least one of the above-mentioned 3D object depth extraction methods to obtain the depth information of the object in the 2D picture and create a 3D image.

本发明所述技术方案的有益效果包括但不限于：The beneficial effects of the technical solution of the present invention include but are not limited to:

(1)本申请提出了一种三维对象深度提取的新计算方法，该方法计算每个元素图像中像素的相似度，同时将其投影到可能的深度。(1) This application proposes a new computational method for depth extraction of 3D objects, which computes the similarity of pixels in each element image while projecting them to possible depths.

(2)本申请提供的三维图像深度信息的提取方法，考虑了表面的连续性，以提高分辨率。(2) The method for extracting the depth information of the 3D image provided by this application takes into account the continuity of the surface to improve the resolution.

(3)本申请提供的三维图像深度信息的提取方法，用于综合成像(II)系统中，使用补丁匹配方法，大大减少计算量，加快计算速度，可以使应用了本发明算法的设备更加普遍化。(3) The method for extracting the depth information of the three-dimensional image provided by the application is used in the comprehensive imaging (II) system, and the patch matching method is used to greatly reduce the amount of calculation and speed up the calculation, which can make the equipment applied to the algorithm of the present invention more common change.

附图说明Description of drawings

图1是本发明3D图像深度信息提取方法原理示意图；其中图1(a)是综合成像(II)的拾取部分的示意图；图1(b)是综合成像(II)的投影部分示意图。Fig. 1 is a schematic diagram of the principle of the 3D image depth information extraction method of the present invention; wherein Fig. 1 (a) is a schematic diagram of the picking part of the comprehensive imaging (II); Fig. 1 (b) is a schematic diagram of the projection part of the comprehensive imaging (II).

图2是在体表面和自由空间体素的综合成像(II)系统中光传播图。Figure 2 is a diagram of light propagation in an integrated imaging (II) system of body surface and free space voxels.

图3是图像信息提取的3D对象，其中图3(a)是采用综合成像(II)进行成像的对象；图3(b)是采用合成孔径积分成像进行成像的对象。Fig. 3 is a 3D object extracted from image information, wherein Fig. 3(a) is an object imaged by comprehensive imaging (II); Fig. 3(b) is an object imaged by synthetic aperture integral imaging.

图4是经过数学软件技术，采用本申请方法成像结果与采用现有技术方法成像结果的对比；其中图4(a)为采用微针的深度信息提取方法得到的结果；图4(b)为采用折反射全向提取方法得到的结果；图4(c)为采用本发明所述方法得到的综合成像(II)结果；图4(d)为采用本发明所述方法得到的合成孔径积分成像结果。Fig. 4 is a comparison between the imaging result of the application method and the imaging result of the prior art method through mathematical software technology; wherein Fig. 4 (a) is the result obtained by using the microneedle depth information extraction method; Fig. 4 (b) is Adopt the result that the catadioptric omnidirectional extraction method obtains; Fig. 4 (c) is the comprehensive imaging (II) result that adopts the method of the present invention to obtain; Fig. 4 (d) is the synthetic aperture integral imaging that adopts the method of the present invention to obtain result.

图5是降低白噪音影响后的结果对比；其中图5(a)是降低白噪音影响前的综合成像(II)结果；图5(b)是降低白噪音影响后的综合成像(II)结果；图5(c)是降低白噪音影响前的合成孔径积分成像结果；图5(d))是降低白噪音影响后的合成孔径积分成像结果。Figure 5 is a comparison of the results after reducing the influence of white noise; among them, Figure 5(a) is the result of comprehensive imaging (II) before reducing the influence of white noise; Figure 5(b) is the result of comprehensive imaging (II) after reducing the influence of white noise ; Figure 5(c) is the synthetic aperture integral imaging result before reducing the influence of white noise; Figure 5(d)) is the synthetic aperture integral imaging result after reducing the influence of white noise.

图6是采用本发明3D图像深度信息提取方法的背面投影图；其中图6(a)是图3(a)采用本发明3D图像深度信息提取方法的背面投影图；图6(b)是图3(b)采用本发明3D图像深度信息提取方法的背面投影图。Fig. 6 is the rear projection diagram adopting the 3D image depth information extraction method of the present invention; Wherein Fig. 6 (a) is Fig. 3 (a) adopts the rear projection diagram of the 3D image depth information extraction method of the present invention; Fig. 6 (b) is a diagram 3(b) The back projection diagram of the 3D image depth information extraction method of the present invention.

具体实施方式detailed description

下面结合实施例，详细描述本发明的技术方案。The technical solution of the present invention will be described in detail below in conjunction with the embodiments.

本发明提出了一种用于综合成像(II)系统中3D对象深度提取的新计算方法，计算每个元素图像中像素的相似度，同时将其投影到可能的深度。还考虑到表面的连续性，以提高分辨率。并使用补丁匹配方法加快计算速度。The present invention proposes a new calculation method for depth extraction of 3D objects in an integrated imaging (II) system, calculating the similarity of pixels in each element image while projecting them to possible depths. Surface continuity is also taken into account for improved resolution. And use the patch matching method to speed up the computation.

图1(a)和图1(b)为综合成像(II)的原理示意图。其中图1(a)为综合成像(II)的拾取部分的示意图，图1(b)为综合成像(II)的投影部分示意图。Figure 1(a) and Figure 1(b) are schematic diagrams of the principle of integrated imaging (II). 1(a) is a schematic diagram of the pick-up part of the comprehensive imaging (II), and FIG. 1(b) is a schematic diagram of the projection part of the comprehensive imaging (II).

参照图1(a)，字母A表示三维(3D)物体。Z(大写字母)表示物体和小透镜阵列之间的距离，g表示透镜阵列与像面之间的距离。来自3D物体的光线的强度和方向由小透镜记录在不同的位置。图1(a)中也显示了不同的元素图像，见右侧从上到下的三个图像。Referring to Figure 1(a), the letter A denotes a three-dimensional (3D) object. Z (capital letter) represents the distance between the object and the lenslet array, and g represents the distance between the lens array and the image plane. The intensity and direction of the light rays from the 3D object are recorded at different positions by the lenslets. Different element images are also shown in Figure 1(a), see the three images from top to bottom on the right.

在如图1(b)所示的投影部分中，每个元素图像通过相应的针孔投影到目标空间。在图1(b)中，z(小写字母)表示投影距离，g表示图像与针孔之间的距离。并且这些投影图像在重构平面中被放大了z/g的因子。最后，将这些放大的图像重叠并累积在输出平面的相应像素上。In the projection part as shown in Fig. 1(b), each element image is projected to the target space through the corresponding pinhole. In Figure 1(b), z (lowercase letter) represents the projection distance, and g represents the distance between the image and the pinhole. And these projected images are magnified by a factor of z/g in the reconstruction plane. Finally, these upscaled images are overlaid and accumulated on the corresponding pixels of the output plane.

在图1(a)和图1(b)示出的系统中，投影距离z由实验者设置，因此，当投影距离z与其空间深度Z不匹配，投影图像是模糊的像素。因此，如果已知不同投影距离中的每个像素的模糊度，就可以计算相应的投影距离。In the systems shown in Fig. 1(a) and Fig. 1(b), the projection distance z is set by the experimenter, so when the projection distance z does not match its spatial depth Z, the projected images are blurred pixels. Therefore, if the blurriness of each pixel in different projection distances is known, the corresponding projection distances can be calculated.

图2是在体表面和自由空间体素的综合成像(II)系统中光传播图。图2显示的是本发明的方法的2D结构，显示的是3D空间中的y-z平面，成像对象被显示为左边的一个面。小透镜阵列在y轴上。而z轴是深度方向。每个小透镜的坐标标记为S_i。成像对象平面被标记为u。u到小透镜阵列的距离是g。如图2所示，当将元素图像投影到z₀平面时，得到的结果图像中的(y₀，z₀)对于其相应像素的高相似性将是清晰的。作为比较，当投影到z₁平面时，(y₁，z₁)是模糊的，因为(y₁，z₁)处和(y₀，z₀)的像素来自对象u上不同部分，如图2中不同的颜色所示。对应于投影的每个像素的局部坐标点可以通过式(1)得到：Figure 2 is a diagram of light propagation in an integrated imaging (II) system of body surface and free space voxels. Fig. 2 shows the 2D structure of the method of the present invention, showing the yz plane in 3D space, and the imaging object is shown as a plane on the left. The lenslet array is on the y-axis. And the z axis is the depth direction. The coordinates of each lenslet are labeled S _i . The imaged object plane is labeled u. The distance from u to the lenslet array is g. As shown in Fig. 2, when the elemental image is projected onto the z ₀ plane, the high similarity of (y ₀ , z ₀ ) to its corresponding pixel in the resulting image will be clear. As a comparison, when projected onto the z ₁ plane, (y ₁ , z ₁ ) is blurred because the pixels at (y ₁ , z ₁ ) and (y ₀ , z ₀ ) come from different parts of the object u, as shown in 2 shown in different colors. The local coordinate point corresponding to each pixel of the projection can be obtained by formula (1):

其中u_i是每个元素图像中对应于点(y，z)的像素的局部坐标。g是图像平面和小透镜阵列之间的距离。s_i是小透镜的坐标，即小透镜的指数。利用该等式，可以通过等式(1)来估计投影到相同点的像素的相似度。where u _i are the local coordinates of the pixel corresponding to point (y, z) in each element image. g is the distance between the image plane and the lenslet array. s _i are the coordinates of the lenslet, that is, the index of the lenslet. Using this equation, the similarity of pixels projected to the same point can be estimated by equation (1).

E＝α||I_i-I_j||+(1-α)||▽I_i-▽I_j|| 式(2)E＝α||I _i -I _j ||+(1-α)||▽I _i -▽I _j || Formula (2)

在这个方程中，E是相似性的评估因子。E越小像素越相似。I是元素图像中像素的强度。下标i，j是元素图像的索引。I_i，I_j分别表示第i，第j个元素图像中的相应像素。I_i和I_j将投射到相同的空间点，它们的坐标由等式(1)计算得到。||I_i-I_j||计算RGB空间中的I_i和I_j的颜色的L1距离(即曼哈顿距离)。▽I是像素的灰度值梯度；||▽Ii-▽Ij||表示在I_i和I_j计算的灰度梯度的绝对差。α是没有单位的权重因子，是用户定义的参数，用于平衡颜色和渐变项的影响。从该等式可以计算投影到相同空间位置的像素之间的相似度。In this equation, E is the evaluation factor of similarity. The smaller E is, the more similar the pixels are. I is the intensity of the pixel in the element image. The subscripts i, j are the indices of the element image. I _i , I _j represent corresponding pixels in the i-th and j-th element images respectively. I _i and I _j will project to the same spatial point, and their coordinates are calculated by equation (1). ||I _i -I _j || Calculate the L1 distance (ie Manhattan distance) of the colors of I _i and I _j in the RGB space. ▽I is the gray value gradient of the pixel; ||▽Ii-▽Ij|| represents the absolute difference of the gray gradient calculated at I _i and I _j . α is a weight factor without units and is a user-defined parameter to balance the influence of color and gradient terms. From this equation the similarity between pixels projected to the same spatial location can be calculated.

因此，在3D模式中，可以通过找到Z来提取具有横向点(x，y)的成像对象表面的深度，使得E(x，y，z)在Z＝[Z min，Z max]的范围内最小化。这个假设在数学上的表达如式(3)所示：Therefore, in 3D mode, the depth of the imaged object surface with a lateral point (x, y) can be extracted by finding Z such that E(x, y, z) is in the range Z = [Z min, Z max] minimize. The mathematical expression of this assumption is shown in formula (3):

可以通过检查所有可能的z来找到答案。然而，利用这种方式发现的可能的z是离散的，受到分辨率的限制。这种方式忽略了表面的连续性，并且在计算上是密集的，要获得亚分辨效果(sub-resolution effect)，需要更多的表面信息。本发明考虑了物体表面的连续性。相邻的像素假设在同一平面上，所以本发明用很多小平面对表面进行建模。The answer can be found by examining all possible z. However, the possible z's discovered in this way are discrete and limited by resolution. This approach ignores the continuity of the surface and is computationally intensive, requiring more surface information to obtain sub-resolution effects. The present invention takes into account the continuity of the surface of the object. Adjacent pixels are assumed to be on the same plane, so the present invention models the surface with many facets.

通过(x₀，y₀，z₀)的表面可以表示为式(4)：The surface passing through (x ₀ , y ₀ , z ₀ ) can be expressed as formula (4):

n₁x+n₂y+n₃z＝n₁x₀+n₂y₀+n₃z₀ 式(4)n ₁ x+n ₂ y+n ₃ z＝n ₁ x ₀ +n ₂ y ₀ +n ₃ z ₀ Formula (4)

n(n₁，n₂，n₃)是法向量。在本发明，横向像素被称为p(p_x，p_y)，z是要求的深度坐标，所以可以转换式(4)，得到式(5)和(6)：n(n ₁ , n ₂ , n ₃ ) is a normal vector. In the present invention, the horizontal pixel is called p(p _x , p _y ), and z is the required depth coordinate, so formula (4) can be transformed to obtain formulas (5) and (6):

f₁＝-n₁/n₃，f₂＝-n₂/n₃，f₃＝(n₁·x₀+n₂·y₀+n₃·z₀)/n₃ 式(6)f ₁ =-n ₁ /n ₃ , f ₂ =-n ₂ /n ₃ , f ₃ =(n ₁ x ₀ +n ₂ y ₀ +n ₃ z ₀ )/n ₃ Formula (6)

因此，找到z的问题变为找到f，而向量f是所有可能的平面中的最小聚合匹配成本之一，可以表达为式(7)：Therefore, the problem of finding z becomes finding f, and the vector f is one of the minimum aggregated matching costs in all possible planes, which can be expressed as Equation (7):

其中F表示大小无穷大的所有向量的集合。根据向量f匹配p(p_x，p_y)的聚合成本m通过式(8)和式(9)计算得到：where F represents the set of all vectors of infinite size. The aggregation cost m of matching p(p _x , p _y ) according to the vector f is calculated by formula (8) and formula (9):

式(9)中，w_p表示以p(p_x，p_y)为中心的平方窗口；w用于实现自适应权值立体匹配，可以克服边缘增殖问题；γ是用户定义的参数；I_p表示图像p的像素强度；I_q表示图像q的像素强度。附近的像素的E也用相同的向量f计算，揭示它们在同一平面中。所有向量F的集合是无限的标签空间，不能采用通常的做法，仅简单地检查所有可能的标签。In formula (9), w _p represents a square window centered on p(p _x , p _y ); w is used to realize adaptive weight stereo matching, which can overcome the problem of edge proliferation; γ is a user-defined parameter; I _p Indicates the pixel intensity of image p; I _q indicates the pixel intensity of image q. The E of nearby pixels is also computed with the same vector f, revealing that they are in the same plane. The set of all vectors F is an infinite label space, and the usual practice of simply checking all possible labels cannot be adopted.

本发明提出的3D图像深度信息提取方法，该方法基于Patch-Match，其基本思想是大多数相邻像素应该在同一平面。根据该假设，本发明开发了包含相邻像素传播和随机优化的多循环算法。The 3D image depth information extraction method proposed by the present invention is based on Patch-Match, and its basic idea is that most adjacent pixels should be on the same plane. Based on this assumption, the present invention develops a multi-loop algorithm involving neighbor pixel propagation and stochastic optimization.

基于上述分析，本发明提出的3D图像深度信息提取方法包括以下步骤：Based on the above analysis, the 3D image depth information extraction method proposed by the present invention comprises the following steps:

步骤1：初始化Step 1: Initialize

将横向像素p(x₀，y₀)初始化为随机平面；Initialize the horizontal pixel p(x ₀ , y ₀ ) as a random plane;

平面可以通过点和法向量来确定。每个像素的z₀由随机值初始化，并且通过该像素的表面的法线矢量被设置为随机单位向量n(n₁，n₂，n₃)。矢量f可以由正态n和点p(x₀，y₀，z₀)导出。The plane can be determined by point and normal vector. Each pixel's z ₀ is initialized with a random value, and the normal vector of the surface through that pixel is set to a random unit vector n(n ₁ , n ₂ , n ₃ ). The vector f can be derived from the normal n and the point p(x ₀ , y ₀ , z ₀ ).

该横向像素的深度坐标为z(x₀，y₀，z₀)，随机平面表示为p(p_x，p_y)，通过z的平面可表示为：The depth coordinate of the horizontal pixel is z(x ₀ , y ₀ , z ₀ ), the random plane is expressed as p(p _x , p _y ), and the plane passing through z can be expressed as:

z＝f₁▽p_x+f₂▽p_y+f₃ z=f ₁ ▽p _x +f ₂ ▽p _y +f ₃

其中参见式(6)，f₁＝-n₁/n₃f₂＝-n₂/n₃，f₃＝(n₁·x₀+n₂·y₀+n₃·z₀)/n₃，n是标量，是数值向量表示最小聚合成本所在的所有可能平面， Wherein refer to formula (6), f ₁ =-n ₁ /n ₃ f ₂ =-n ₂ /n ₃ , f ₃ =(n ₁ ·x ₀ +n ₂ ·y ₀ +n ₃ ·z ₀ )/n ₃ , n is a scalar and a numeric vector Denotes all possible planes where the minimum aggregate cost lies,

该步骤中初始化的横向像素为p(x₀，y₀)，其对应的深度值为z0，其聚合成本m为：The horizontal pixel initialized in this step is p(x ₀ , y ₀ ), its corresponding depth value is z0, and its aggregation cost m is:

其中，||I_p-I_q||表示两相邻像素p和q间的距离，p为横向像素，q为与p在同一平面内的相邻像素，w用于实现自适应加权，E表示相同的计算因素，▽表示梯度值，W_p表示一个方形窗口集中在p(px，p_y)。in, ||I _p -I _q || indicates the distance between two adjacent pixels p and q, p is a horizontal pixel, q is an adjacent pixel in the same plane as p, w is used to realize adaptive weighting, E means the same The calculation factor of , ▽ represents the gradient value, W _p represents a square window centered on p(px, p _y ).

相同计算因素E可表示为：The same calculation factor E can be expressed as:

E＝α||I_i-I_j||+(1-α)||▽I_i-▽I_j E＝α||I _i -I _j ||+(1-α)||▽I _i -▽I _j

其中，I是元素图像中像素的强度。下标i，j是元素图像的索引。I_i，I_j分别表示第i，第j个元素图像中的相应像素的强度。I_i和I_j将投射到相同的空间点，它们的坐标由等式(1)计算得到。即对应于投影的每个像素的局部坐标点为：where I is the intensity of the pixel in the element image. The subscripts i, j are the indices of the element image. I _i , I _j represent the intensity of the corresponding pixel in the i-th and j-th element images respectively. I _i and I _j will project to the same spatial point, and their coordinates are calculated by equation (1). That is, the local coordinate points corresponding to each pixel of the projection are:

其中u_i是每个元素图像中对应于点(y，z)的像素的局部坐标。g是图像平面和小透镜阵列之间的距离(参见图1)。s_i是小透镜的坐标，即小透镜的指数。I_i和I_j可利用该公式求得。where u _i are the local coordinates of the pixel corresponding to point (y, z) in each element image. g is the distance between the image plane and the lenslet array (see Figure 1). s _i are the coordinates of the lenslet, that is, the index of the lenslet. I _i and I _j can be obtained using this formula.

||I_i-I_j||计算RGB空间中的I_i和I_j的颜色的L1距离(即曼哈顿距离)。▽I是像素的灰度值梯度；||▽I_i-▽I_j||表示在I_i和I_j计算的灰度梯度的绝对差。α是没有单位的权重因子，是用户定义的参数，用于平衡颜色和渐变项的影响。通过该等式可以计算投影到相同空间位置的像素之间的相似度。||I _i -I _j || Calculate the L1 distance (ie Manhattan distance) of the colors of I _i and I _j in the RGB space. ▽I is the gray value gradient of the pixel; ||▽I _i -▽I _j || represents the absolute difference of the gray gradient calculated at I _i and I _j . α is a weight factor without units and is a user-defined parameter to balance the influence of color and gradient terms. The similarity between pixels projected to the same spatial location can be calculated by this equation.

由于有了许多预测值，在这种随机初始化之后，该区域的至少一个带有平面的区域的像素接近正确值。经过一个该平面传递到其他像素的传播步骤，一个好的预测值足以使算法运作起来。Thanks to many predicted values, after this random initialization, the pixels of at least one region with a plane in the region are close to the correct value. After a propagation step where the plane is passed to other pixels, a good prediction is enough for the algorithm to work.

步骤2：迭代Step 2: Iterate

在迭代中，每个像素运行两个阶段：首先是空间传播，其次是平面精修。In iterations, two stages are run per pixel: first spatial propagation, followed by planar refinement.

(2-1)空间传播(2-1) Spatial propagation

相邻点被假设为与通常想到在同一个平面上。这是传播的关键点。p表示当前像素，f_p是其对应的平面的向量。q是p的相邻像素。式(8)在p(x₀，y₀)下分别用f_p和f_q计算，以评估这两种情况的成本。检查条件m(x₀，y₀，f_p')<m(x₀，y₀，f_p)。Adjacent points are assumed to be on the same plane as is usually thought of. This is the key point of communication. p represents the current pixel, and f _p is the vector of its corresponding plane. q is the neighbor pixel of p. Equation (8) is calculated with f _p and f _q under p(x ₀ , y ₀ ), respectively, to evaluate the cost of these two cases. Check the condition m(x ₀ , y ₀ , f _p ')<m(x ₀ , y ₀ , f _p ).

式(12)中的m(x₀，y₀，f_p')和m(x₀，y₀，f_p)分别由式(8)得到；m(x ₀ , y ₀ , f _p ') and m(x ₀ , y ₀ , f _p ) in formula (12) are respectively obtained from formula (8);

如果该表达式成立，则f_q被接受为p的新向量，即f_p＝f_q。在奇数迭代中，q是左边和上边界，在偶数迭代中，q是右边界和下边界。If this expression holds, then f _q is accepted as a new vector of p, ie f _p =f _q . In odd iterations, q is the left and upper bounds, and in even iterations, q is the right and lower bounds.

(2-2)平面精修(2-2) Plane finishing

平面精修的目标是在像素p处改进平面的参数。为了进一步降低式(6)中的Z来提取具有横向点的成像对象表面的深度。The goal of plane refinement is to improve the parameters of the plane at pixel p. In order to further reduce Z in formula (6) to extract the depth of the imaging object surface with lateral points.

将f_p转换为法向量n_p。两个参数▽z和▽n被定义为分别限制z₀和n的最大允许变化。z₀'计算为z₀'＝z₀+▽z，其中▽z位于[-▽z^max，▽z^max]。并且n'＝u(n+▽n)，u()表示计算单位向量，▽n位于[-▽n^max，▽n^max]。最后，通过p和n'得到一个新的f_p'。如果m(x₀，y₀，f_p')<m(x₀，y₀，f_p)，则f_p＝f_p'。Convert f _p to normal vector n _p . Two parameters ▽z and ▽n are defined to limit the maximum allowable variation of z ₀ and n, respectively. z ₀ ' is calculated as z ₀ '=z ₀ +▽z, where ▽z is located at [-▽z ^max ,▽z ^max ]. And n'=u(n+▽n), u() represents a calculation unit vector, and ▽n is located at [-▽n ^max , ▽n ^max ]. Finally, pass p and n' to get a new f _p '. If m(x ₀ , y ₀ , f _p ')<m(x ₀ , y ₀ , f _p ), then f _p =f _p '.

该方法从设置▽z^max＝maxdisp/2开始，其中maxdisp是允许的最大视差，▽n^max＝1。每次细化后，参数将更新为▽z^max＝▽z^max/2，▽n^max＝▽n^max/2，从而减少搜索范围。我们再次回到特殊传播，直到▽z^max<resolution/2，其中分辨率如文献[DaneshPanah M,JavidiB.“Profilometry and optical slicing by passive three-dimensional imaging[J]”.Optics letters,2009,34(7):1105-1107]所示最小化。对于奇数迭代，从图像的左侧开始，向右下进行偶数迭代。最后的结果是在迭代后获得的。The method starts by setting ▽z ^max =maxdisp/2, where maxdisp is the maximum allowed disparity and ▽n ^max =1. After each refinement, the parameters will be updated as ▽z ^max = ▽z ^max /2, ▽n ^max = ▽n ^max /2, thus reducing the search range. We return to the special propagation again until ▽z ^max <resolution/2, where the resolution is as in the literature [DaneshPanah M, JavidiB. "Profilometry and optical slicing by passive three-dimensional imaging[J]". Optics letters, 2009, 34( 7):1105-1107] as shown in the minimization. For odd iterations, start from the left of the image and work down to the right for even iterations. The final result is obtained after iterations.

为了验证这种方法的实用性，提出了两种II型实验。并且还进行检查所有可能的z的常规方法进行比较。首先，在计算机集成成像系统中使用拖拉机的模型作为3D对象。该物体如图3(a)所示。结果具有20×28个元素图像，每个元素图像具有100×100像素。小透镜的焦距为1.5mm。拖拉机的深度为α设定为0.5，γ设定为5.通过计算，最小深度分辨率为0.005mm。To test the utility of this approach, two type II experiments are presented. And also do the normal way of checking all possible z for comparison. First, a model of the tractor was used as a 3D object in a computer-integrated imaging system. The object is shown in Figure 3(a). The result has 20x28 element images with 100x100 pixels each. The focal length of the small lens is 1.5mm. The depth of the tractor is α is set to 0.5, and γ is set to 5. By calculation, the minimum depth resolution is 0.005mm.

而在合成孔径积分成像中，元素图像如图3(b)所示。3D对象包括一个构建块，一个娃娃和一个玩具大象，分别位于53-57厘米，89-93厘米和131～136厘米。系统包含6×6透视图。并且图像被捕获在具有5mm间距的规则网格上。相机的焦距为16mm。α设定为0.5，γ设定为5.通过计算，最小深度分辨率为2mm。In synthetic aperture integral imaging, the elemental image is shown in Fig. 3(b). The 3D objects included a building block, a doll, and a toy elephant, located at 53-57 cm, 89-93 cm, and 131-136 cm, respectively. The system contains a 6×6 perspective. And the images are captured on a regular grid with 5mm spacing. The focal length of the camera is 16mm. α is set to 0.5, and γ is set to 5. By calculation, the minimum depth resolution is 2mm.

该算法在Matlab中计算。通过本文提出的方法接收的结果与常用方法的结果的比较如图所示。在图4(b)和(d)中，底部的水平条纹是背景的折叠。The algorithm is calculated in Matlab. The comparison of the results received by the method proposed in this paper with the results of commonly used methods is shown in the figure. In Figure 4(b) and (d), the horizontal stripes at the bottom are folds of the background.

从这些结果可以看出，这两种方法都在对象部分中表现良好。但是通过常用方法得到的结果如图4(a)和4(b)所示，边肥育相当明显。在本文提出的方法中，如图4(c)和4(d)所示，结果的空白部分充满了白噪声。对这些区域的深度变化不敏感，因为难以区分差异。所以这部分的深度依然是它初始化的价值。对象内部存在一些不准确的一点，需要通过一些更多的迭代来消除这一点。As can be seen from these results, both methods perform well in the object part. However, the results obtained by common methods are shown in Fig. 4(a) and 4(b), and the edge fattening is quite obvious. In the method proposed in this paper, as shown in Fig. 4(c) and 4(d), the blank part of the result is filled with white noise. Insensitive to depth changes in these regions, as differences are difficult to discern. So the depth of this part is still its initialized value. There is a bit of inaccuracy inside the object that needs to be removed with a few more iterations.

为了减少白噪声的影响，z₀以固定值初始化为所有像素，并且在迭代之前添加精修步骤。得到如图5所示的更好的结果，白噪声被很好地消除。在图5(a)中，特别是在物体边缘处精确提取拖拉机的深度。但是图5(b)中的物体边缘仍然存在一些白噪声，这可能受到实验条件的限制。To reduce the influence of white noise, z ₀ is initialized to all pixels with a fixed value, and a refinement step is added before iterations. A better result is obtained as shown in Fig. 5, the white noise is well removed. In Fig. 5(a), the depth of the tractor is precisely extracted especially at the edge of the object. But there are still some white noises at the edge of the object in Fig. 5(b), which may be limited by the experimental conditions.

虽然考虑了表面的连续性，但从这个结果来看，深度不断变化并不直观。这可能是由于这种几何模型的较粗糙的分辨率造成的。Although the continuity of the surface is considered, it is not intuitive from this result that the depth is constantly changing. This may be due to the coarser resolution of this geometric model.

为了进一步验证所提出的方法，每个像素被投影到由所提出的方法计算的深度，如图6所示。该步骤中使用的深度是高斯平滑的并且过滤出背景。由于算法尚未优化，计算时间难以评估该方法的计算效率。因此，可以通过核心因子的计算时间来评估m，不需要计算所有z空间；计算资源支付给子分辨率计算。在模拟光场中，maxdisp设置为10mm，分辨率为0.005mm，在该算法中计算了12次迭代。在每次迭代中，在每个像素(f_p，f_q，f_p'，其中q是两个相邻像素)中计算m次4次。与计算每个可能位置的常用方法的10/0.005＝2000倍相比，该方法中每个像素的12×4＝48次计算的m为近40倍。从这个角度来看，提出的算法可以有效地减少计算量。To further verify the proposed method, each pixel is projected to the depth calculated by the proposed method, as shown in Fig. 6. The depth used in this step is Gaussian smoothed and the background filtered out. Since the algorithm has not been optimized, the calculation time makes it difficult to evaluate the computational efficiency of this method. Thus, m can be evaluated by the computation time of the kernel factor, and not all z spaces need to be computed; computing resources are paid for sub-resolution computations. In the simulated light field, with maxdisp set to 10 mm and a resolution of 0.005 mm, 12 iterations were calculated in the algorithm. In each iteration, m is computed 4 times in each pixel (f _p , f _q , f _p' , where q are two adjacent pixels). 12 x 4 = 48 calculations per pixel in this method is nearly 40 times m compared to 10/0.005 = 2000 times the usual method of calculating each possible position. From this point of view, the proposed algorithm can effectively reduce the amount of computation.

以上所述，仅是本发明的几种具体实施方式，并非对本发明做任何形式的限制，虽然本发明以较佳实施例揭示如上，然而并非用以限制本发明，任何熟悉本专业的技术人员，在不脱离本申请技术方案的范围内，利用上述揭示的技术内容做出些许的变动或修饰均等同于等效实施案例，均属于技术方案范围内。The above are only several specific implementations of the present invention, and do not limit the present invention in any form. Although the present invention is disclosed as above with preferred embodiments, it is not intended to limit the present invention. Anyone familiar with this field , without departing from the scope of the technical solution of the present application, any changes or modifications made using the technical content disclosed above are equivalent to equivalent implementation cases, and all belong to the scope of the technical solution.

Claims

1. A three-dimensional object depth extraction method is characterized in that the depth of a pixel is obtained by calculating the similarity of adjacent pixels in each element image in a two-dimensional plane image.

2. The method of extracting a depth of a three-dimensional object according to claim 1, wherein in calculating the similarity of pixels in each elemental image in the two-dimensional plane image, it is assumed that adjacent pixels are on the same plane, and the surface is modeled with a plurality of small planes.

3. The method for extracting the depth of the three-dimensional object according to claim 1, wherein the calculating the similarity of the adjacent pixels in each elemental image in the two-dimensional plane image adopts a multi-cycle algorithm of adjacent pixel propagation and random optimization.

4. The method of claim 1, comprising the steps of initializing a horizontal pixel to a random plane and iteratively calculating the similarity of neighboring pixels.

5. The method of claim 4, wherein the initializing of the horizontal pixels to a random plane comprises:

initializing a horizontal pixel to a random plane;

the initial depth of each pixel is set to a random value, and the normal vector passing through the surface of each pixel is set to a random unit vector.

6. The method of claim 4, wherein the initializing the horizontal pixels to a random plane comprises:

a plane passing through the depth coordinate of the lateral pixel, represented by equation (5),

z＝f₁▽p_x+f₂▽p_y+f₃formula (5)

Wherein z is the depth coordinate of the lateral pixel, and p_xAnd p_yIs a random plane, f₁、f₂And f₃Respectively represented by formula (6-1), formula (6-2) and formula (6-3),

f₁＝-n₁/n₃formula (6-1)

f₂＝-n₂/n₃Formula (6-2)

f₃＝(n₁·x₀+n₂·y₀+n₃·z₀)/n₃Formula (6-3)

In the formulae (6-1), (6-2) and (6-3), n₁、n₂And n₃Is a scalar quantity, is a numerical vector represented by the formula (7)All possible planes, x, representing the minimum aggregation cost₀And y₀Respectively initialized coordinate values of said transverse pixels, z₀For the initialized initial depth values of the lateral pixels,

formula (7) wherein m is provided by formula (8),

in the formula (8), W is used to realize adaptive weighting, W is provided by the formula (9), E represents a similarity calculation factor, E is provided by the formula (10), ▽ represents a gradient value, W_pA square window centered on p is shown,

in the formula (9), | | I_p-I_qAnd | | l represents the distance between two adjacent pixels p and q, p being a transverse pixel, q being an adjacent pixel in the same plane as p,

E＝α||I_i-I_j||+(1-α)||▽I_i-▽I_j| | formula (10)

In the formula (10), I is the intensity of a pixel in an element image, subscripts I, j are indices of the element image, I_i，I_jRespectively representing the intensity, I, of the corresponding pixel in the ith and jth elemental images_iAnd I_jWill project to the same spatial point, I_iAnd I_jIs calculated by the formula (11) | | I_i-I_j| is in RGB spaceI of (A)_iAnd I_jManhattan distance of color of (1), ▽ I_iAnd ▽ I_jIs the gray value gradient of the pixel, | | ▽ I_i-▽I_jI is expressed in_iAnd I_jThe absolute difference of the calculated gray gradients, α is a weighting factor without units for balancing the influence of color and gradient terms;

in the formula (11), u_iIs the local coordinate of the pixel in each elemental image corresponding to the point with coordinates y and z.

7. The method of claim 4, wherein the method of three-dimensional object depth extraction is performed on a computer using an Integrated Imaging (II) system, and the iterative computation of the similarity of neighboring pixels comprises the steps of:

a. initializing a horizontal pixel in a random plane, calculating a depth coordinate and a vector value of the horizontal pixel, calculating the aggregation cost of the horizontal pixel, and taking the aggregation cost as a reference aggregation cost;

b. calculating the aggregation cost of any adjacent pixel in the same plane with the horizontal pixel in the step a;

c. comparing the reference aggregation cost in the step a with the aggregation cost of the adjacent pixels in the step b;

d. taking the pixel corresponding to the smaller aggregation cost in the step c as a new reference value;

e. setting the pixel corresponding to the new reference value in the step d as the upper left adjacent to the pixel corresponding to the comparison reference value;

f. setting conditions: d, the corresponding depth value of the new reference value in the step d is in the maximum allowable range;

g. if the condition of the step f is satisfied, circularly executing the step a to the step f;

step f, if the condition is not satisfied, taking the last cycle of step e as the leftmost pixel of the image;

m, on the basis of the step l, performing descending even iteration on the right lower part of the image;

n, calculating the calculation times of each pixel according to the iteration times of the step m.

8. The method of claim 4, wherein the iterative computation of the similarity of neighboring pixels comprises steps of spatial propagation and planar refinement;

in the step of spatial propagation, adjacent pixels are set to be on the same plane, the cost m of different situations is evaluated by the formula (8),

in the formula (8), p represents the current pixel, f_pIs the vector of its corresponding plane, q is the neighboring pixel of p, under p, f_pAnd f_qCalculating to evaluate the cost of both cases; the examination conditions are as shown in the formula (12),

m(x₀，y₀，f_p')<m(x₀，y₀，f_p) (ii) a Formula (12)

M (x) in formula (12)₀，y₀，f_p') and m (x)₀，y₀，f_p) Respectively obtained by formula (8);

if the expression shown in equation (12) holds, f_qNew vector accepted as p, i.e. f_p＝f_q；

In odd iterations, q is the left and upper bounds;

in even iterations, q is the right and lower bounds;

in the step of plane finishing, f is_pConversion to normal vector n_pTwo parameters ▽ z and ▽ n are defined to limit z, respectively₀And the maximum allowable variation of n, z₀' calculation as z₀'＝z₀+ ▽ z, wherein ▽ z is located at [ - ▽ z^max，▽z^max]And n' ═ u (n + ▽ n), where u denotes the calculation unit vector, and ▽ n is located at [ - ▽ n^max，▽n^max]；

Finally, a new f is obtained by p and n_p', if m (x)₀，y₀，f_p')<m(x₀，y₀，f_p) Then f is_p＝f_p'；

In the step of plane finishing, from setting ▽ z^maxMaxdisp/2 starts, where maxdisp is the maximum allowed disparity, ▽ n^maxAfter each refinement, the parameter will be updated to ▽ z as 1^max＝▽z^max/2、▽n^max＝▽n^max/2 up to ▽ z^max<resolution/2, obtaining the minimized resolution; for odd iterations, starting from the left side of the image, performing even iterations towards the right and downwards;

and after iteration, obtaining the similarity of adjacent pixels so as to further obtain the depth of the three-dimensional object.

9. The method of claim 8, wherein z is₀All pixels are initialized with fixed values and a refinement step is added before the iteration.

10. The device for obtaining three-dimensional image information from a two-dimensional picture is characterized by comprising a picture acquisition unit, an image storage unit and an image processing unit;

the image acquisition unit is electrically connected with the image storage unit, and the image storage unit is electrically connected with the image processing unit;

the image processing unit adopts the three-dimensional object depth extraction method of any one of claims 1 to 9 to obtain the depth information of the object in the two-dimensional picture and establish a three-dimensional image.