WO2015188666A1

WO2015188666A1 - Three-dimensional video filtering method and device

Info

Publication number: WO2015188666A1
Application number: PCT/CN2015/077707
Authority: WO
Inventors: 朱策; 王昕�; 郑建铧; 张玉花
Original assignee: Huawei Technologies Co Ltd
Current assignee: Huawei Technologies Co Ltd
Priority date: 2014-06-13
Filing date: 2015-04-28
Publication date: 2015-12-17
Anticipated expiration: 2016-12-13
Also published as: CN104010180B; CN104010180A

Abstract

Provided are a three-dimensional video filtering method and device, the three-dimensional video filtering method comprising: projecting the pixels in the plane of an image into three-dimensional space; according to the coordinate values of the pixels to be filtered and the reference pixels in the three-dimensional space, calculating the spatial proximity of the pixels to be filtered and the reference pixels in the three-dimensional space; according to the texture pixel values of the pixels to be filtered and the reference pixels, calculating texture pixel value similarity of the pixels to be filtered and the reference pixels; according to the texture pixel values of the pixels to be filtered, the reference pixels, and the pixels in the same position in the previous frame of the image, calculating motion characteristics consistency of the pixels to be filtered and the reference pixels; calculating the weight of the pixels to be filtered; determining filtering weight according to the proximity, texture pixel value similarity and motion characteristics consistency, and conducting weighted averaging on the pixel values of the reference pixels to obtain a filtering result of the pixels to be filtered, thus improving three-dimensional video filtering accuracy.

Description

Three-dimensional video filtering method and device

Technical field

本发明实施例涉及图像处理技术，尤其涉及一种三维视频滤波方法和装置。Embodiments of the present invention relate to image processing technologies, and in particular, to a three-dimensional video filtering method and apparatus.

Background technique

随着视觉多媒体技术的不断发展，三维视频以其独特景深效果逐步走进人们的生活，并应用于教育、军事、娱乐及医疗等多个领域。目前的三维视频根据视频内容主要分为两类：纯彩色三维视频和基于深度的三维视频。纯彩色三维视频将多路彩色视频直接呈现给用户，其视点位置和视差固定，给人们的观看带来了一定的局限性。相比于纯彩色三维视频，由于深度图的引入，基于深度的三维视频能够通过基于深度图像的绘制技术合成任意视点的虚拟图像，人们可以根据个人喜好选择视点和调节视差，进而更好的享受三维视频带来的乐趣。这一自由灵活的特点使基于深度的三维视频成为目前被广泛接受的三维视频格式。With the continuous development of visual multimedia technology, 3D video gradually enters people's lives with its unique depth of field effect, and is applied in many fields such as education, military, entertainment and medical. The current 3D video is mainly divided into two categories according to the video content: pure color 3D video and depth based 3D video. Pure color 3D video directly presents multi-channel color video to users, and its viewpoint position and parallax are fixed, which brings certain limitations to people's viewing. Compared with pure color 3D video, due to the introduction of depth map, depth-based 3D video can synthesize virtual images of arbitrary viewpoints by depth image-based rendering technology. People can select viewpoints and adjust parallax according to personal preference, so as to enjoy better. 3D video brings fun. This free and flexible feature makes depth-based 3D video the currently accepted 3D video format.

基于深度的三维视频内容由纹理图序列和深度图序列组成，纹理图直观的呈现了物体表面的纹理特征，深度图反映了物体与相机之间的距离。利用上述视频内容和基于深度图像的绘制技术可以合成指定的虚拟视点纹理图像。然而，深度图和纹理图在获取、编码和传输等过程中会引入大量噪声。在视点合成阶段，深度图和纹理图中的噪声会分别引起合成图像的几何失真和纹理失真，进而严重影响人们的视觉体验。而滤波技术可以有效的去除这些噪声，有效提升三维视频质量。The depth-based 3D video content consists of a sequence of texture maps and a sequence of depth maps that visually represent the texture features of the surface of the object, and the depth map reflects the distance between the object and the camera. The specified virtual view texture image can be synthesized using the above video content and depth image based rendering techniques. However, depth maps and texture maps introduce a lot of noise during acquisition, encoding, and transmission. In the view synthesis stage, the noise in the depth map and texture map will cause geometric distortion and texture distortion of the composite image, which will seriously affect people's visual experience. The filtering technology can effectively remove these noises and effectively improve the quality of 3D video.

现有技术中，针对纹理图的去噪方法主要是双边滤波器，利用待滤波像素周围像素作为参考，并对其进行加权平均的方法，得到滤波结果。在权值计算的过程中主要参考了图像中像素位置邻近性和像素值的相似性。该滤波方法认为：两个像素点在图像平面的距离越接近，相关性越强；两个像素点的像素值越相似，相关性越强。In the prior art, the denoising method for the texture map is mainly a bilateral filter, and the filtering result is obtained by using the pixels around the pixel to be filtered as a reference and weighting and averaging the same. In the process of weight calculation, the similarity between pixel positional proximity and pixel value in the image is mainly referred to. The filtering method considers that the closer the distance between two pixel points in the image plane is, the stronger the correlation is; the more similar the pixel values of the two pixel points are, the stronger the correlation is.

图1为现有技术的双边滤波器的计算邻近性的示意图。现有技术的问题是，由于图像中的像素点是真实三维空间中的点在二维图像平面的再现，然而双边滤波器在考虑像素点邻近性时并没有从真实三维场景出发，其计算结果并不准确，如图1所示，其中A’、B’、C’为真实场景中的三个点，经过相机采集其在图像平面中的位置为A、B、C，A与C在平面上的距离与B与C在平面上的距离相等。如果对C进行滤波，A、B为参考像素。根据图1中所示，可以明显看出，在三维空间中，B与C的邻近性更强，而双边滤波器则认为A与C和B与C的邻近性一致，因此滤波结果准确性不高。1 is a schematic diagram of the computational proximity of a prior art bilateral filter. The problem of the prior art is that since the pixel points in the image are the reproduction of the points in the real three-dimensional space in the two-dimensional image plane, The bilateral filter does not start from the real three-dimensional scene when considering the pixel proximity. The calculation result is not accurate, as shown in Figure 1, where A', B', C' are three points in the real scene. The position in the image plane is A, B, C collected by the camera, and the distance between A and C on the plane is equal to the distance between B and C on the plane. If C is filtered, A and B are reference pixels. As shown in Fig. 1, it can be clearly seen that in three-dimensional space, the proximity of B and C is stronger, while the bilateral filter considers that the proximity of A and C and B is consistent with C, so the accuracy of the filtering result is not high.

发明内容Summary of the invention

本发明实施例提供一种三维视频滤波方法和装置，以克服现有技术中滤波结果准确性不高的问题。The embodiment of the invention provides a three-dimensional video filtering method and device to overcome the problem that the filtering result in the prior art is not high.

第一方面，本发明实施例提供一种三维视频滤波方法，包括：In a first aspect, an embodiment of the present invention provides a three-dimensional video filtering method, including:

将图像平面中的像素投影到三维空间；所述像素包括待滤波像素和参考像素集合；Projecting pixels in the image plane into a three-dimensional space; the pixels comprising a pixel to be filtered and a reference pixel set;

根据所述待滤波像素和所述参考像素集合内的参考像素在所述三维空间的坐标值，计算所述待滤波像素和所述参考像素在所述三维空间内的空间邻近性；其中，所述参考像素集合与所述待滤波像素在同一帧图像中；Calculating spatial proximity of the pixel to be filtered and the reference pixel in the three-dimensional space according to coordinate values of the pixel to be filtered and the reference pixel in the reference pixel set in the three-dimensional space; The reference pixel set is in the same frame image as the pixel to be filtered;

根据所述待滤波像素和所述参考像素集合内的参考像素的纹理像素值，计算所述待滤波像素和所述参考像素的纹理像素值相似性；Calculating texture pixel value similarity of the pixel to be filtered and the reference pixel according to the texel value of the pixel to be filtered and the reference pixel in the reference pixel set;

根据所述待滤波像素、所述参考像素集合内的参考像素和所述待滤波像素所在帧的前一帧图像中相同位置的像素的纹理像素值，计算所述待滤波像素和所述参考像素的运动特征一致性；Calculating the pixel to be filtered and the reference pixel according to the texel value of the pixel to be filtered, the reference pixel in the reference pixel set, and the pixel in the same position in the previous frame image of the frame in which the pixel to be filtered is located Consistency of motion characteristics;

根据所述空间邻近性、纹理像素值相似性和运动特征一致性确定滤波的权值，分别对所述参考像素集合内的参考像素的像素值进行加权平均获得所述待滤波像素的滤波结果。Determining the weights of the filtering according to the spatial proximity, the texel value similarity, and the motion feature consistency, respectively performing weighted averaging on the pixel values of the reference pixels in the reference pixel set to obtain the filtering result of the pixel to be filtered.

结合第一方面，在第一方面的第一种实现方式中，对深度图像滤波时，根据所述空间邻近性、所述深度像素对应的纹理像素值相似性和所述运动特征一致性确定滤波的权值，分别对所述参考像素集合内的参考像素的深度像素值进行加权平均，获得所述深度图像待滤波像素的深度像素值的滤波结果；或，With reference to the first aspect, in a first implementation manner of the first aspect, when filtering the depth image, determining filtering according to the spatial proximity, the similarity of the texel values corresponding to the depth pixels, and the consistency of the motion features a weighted average of the depth pixel values of the reference pixels in the reference pixel set to obtain a filtering result of the depth pixel value of the depth image to be filtered pixel; or

对纹理图像滤波时，根据所述空间邻近性、所述纹理像素相似性和所述运动特征一致性确定滤波的权值，分别对所述参考像素集合内的参考像素的纹理像素值进行加权平均，获得所述纹理图像待滤波像素的纹理像素值的滤波结果。When filtering the texture image, determining the weight of the filtering according to the spatial proximity, the texture pixel similarity, and the motion feature consistency, respectively, for the reference pixels in the reference pixel set The texture pixel values are weighted averaged to obtain a filtering result of the texture pixel values of the texture image to be filtered.

结合第一方面、或第一方面的第一种实现方式，在第一方面的第二种实现方式中，所述将图像平面中的像素投影到三维空间，包括：In conjunction with the first aspect, or the first implementation of the first aspect, in the second implementation of the first aspect, the projecting the pixels in the image plane into the three-dimensional space includes:

利用三维视频提供的深度图像信息、视点位置信息和参考相机参数信息，将所述像素从图像平面投影到三维空间；所述深度图像信息包括所述像素的深度像素值。The pixels are projected from the image plane to the three-dimensional space using depth image information, viewpoint position information, and reference camera parameter information provided by the three-dimensional video; the depth image information includes depth pixel values of the pixels.

结合第一方面的第二种实现方式，在第一方面的第三种实现方式中，所述利用三维视频提供的深度图像信息、视点位置信息和参考相机参数信息，将像素从图像平面投影到三维空间，包括：With reference to the second implementation manner of the first aspect, in a third implementation manner of the first aspect, the using the depth image information, the viewpoint position information, and the reference camera parameter information provided by the three-dimensional video to project the pixel from the image plane to Three-dimensional space, including:

根据公式P＝R^-1(dA^-1p-t)计算所述像素投影到三维空间后的坐标值；Calculating coordinate values of the pixel after being projected into the three-dimensional space according to the formula P=R ^-1 (dA ^-1 pt);

其中，R和t为参考相机的旋转矩阵和平移矢量，A为参考相机参数矩阵，

为所述像素在所述图像平面中的坐标值，

为所述像素在所述三维空间中的坐标值，d为所述像素的深度像素值；f_x和f_y分别为水平和竖直方向的归一化焦距，r为径向畸变系数，(o_x,o_y)为所述图像平面上的基准点的坐标值；所述基准点为所述参考相机的光轴和所述图像平面的交点。Where R and t are the rotation matrix and translation vector of the reference camera, and A is the reference camera parameter matrix.

For the coordinate values of the pixels in the image plane,

a coordinate value of the pixel in the three-dimensional space, d is a depth pixel value of the pixel; f _x and f _y are normalized focal lengths in horizontal and vertical directions, respectively, and r is a radial distortion coefficient, ( o _x , o _y ) is a coordinate value of a reference point on the image plane; the reference point is an intersection of an optical axis of the reference camera and the image plane.

结合第一方面、或第一方面的第一-第三任一种实现方式，在第一方面的第四种实现方式中，所述空间邻近性通过三维空间中所述待滤波像素和所述参考像素的距离作为函数的输入值计算得到；所述函数的输出值随着输入值的减小而增大；With reference to the first aspect, or the first to the third implementation of the first aspect, in a fourth implementation manner of the first aspect, the spatial proximity is through the pixel to be filtered in the three-dimensional space and the The distance of the reference pixel is calculated as an input value of the function; the output value of the function increases as the input value decreases;

所述纹理像素值相似性通过所述待滤波像素和所述参考像素的纹理像素值的差值作为函数的输入值计算得到；所述函数的输出值随着输入值的减小而增大；The texture pixel value similarity is calculated by using a difference value of the texel value of the pixel to be filtered and the reference pixel as a function of an input value; an output value of the function increases as the input value decreases;

所述运动特征一致性通过计算所述待滤波像素和所述参考像素的运动特征是否一致得到，包括：The motion feature consistency is obtained by calculating whether the motion characteristics of the pixel to be filtered and the reference pixel are consistent, including:

当所述待滤波像素与前一帧中对应位置的像素的纹理像素值的差值，以及所述参考像素与前一帧中对应位置的像素的纹理像素值的差值，同时大于或小于预设的阈值时确定为所述待滤波像素和所述参考像素的运动状态一致；否则确定为所述待滤波像素和所述参考像素的运动状态不一致。When the difference between the texel value of the pixel to be filtered and the pixel at the corresponding position in the previous frame is And determining, by the difference between the reference pixel and the texel value of the pixel of the corresponding position in the previous frame, that the difference between the reference pixel and the pixel of the reference pixel is greater than or less than a preset threshold; The motion states of the pixel to be filtered and the reference pixel are inconsistent.

结合第一方面、或第一方面的第一-第四任一种实现方式，在第一方面的第五种实现方式中，所述根据所述空间邻近性、纹理像素值相似性和运动特征一致性确定滤波的权值，分别对所述参考像素集合内的参考像素的像素值进行加权平均获得所述待滤波像素的滤波结果，包括：With reference to the first aspect, or the first to fourth implementation manners of the first aspect, in the fifth implementation manner of the first aspect, the spatial proximity, texture pixel value similarity, and motion feature are The consistency determines the weight of the filtering, and performs weighted averaging on the pixel values of the reference pixels in the reference pixel set to obtain the filtering result of the pixel to be filtered, including:

根据公式(1)：

计算获得所述待滤波像素的深度像素值的滤波结果；或，According to formula (1):

Calculating a filtering result of obtaining a depth pixel value of the pixel to be filtered; or

根据公式(2)：

计算获得所述待滤波像素的纹理像素值的滤波结果；According to formula (2):

Calculating a filtering result of obtaining a texel value of the pixel to be filtered;

其中，

用于计算所述待滤波像素和所述参考像素的空间邻近性；among them,

And configured to calculate spatial proximity of the pixel to be filtered and the reference pixel;

f_T(T_p,T_q)＝f_T(||T_p-T_q||)用于计算所述待滤波像素和所述参考像素的纹理像素值相似性；f _T (T _p , T _q )=f _T (||T _p −T _q ||) is used for calculating texture pixel value similarity of the pixel to be filtered and the reference pixel;

用于计算所述待滤波像素和所述参考像素的运动特征一致性；

And calculating a motion feature consistency of the pixel to be filtered and the reference pixel;

其中，p为待滤波像素，q为参考像素，K为参考像素集合，D_p'为p滤波后的深度像素值，D_q为q的深度像素值，P、Q为p、q在三维空间中的坐标值，T_p、T_q为p、q的纹理像素值，T_p'、T_q'为p、q在前一帧相同位置的纹理像素值，T_p'为p滤波后的纹理像素值，th为预设的纹理像素差阈值。Where p is the pixel to be filtered, q is the reference pixel, K is the reference pixel set, D _p ' is the depth pixel value after p filtering, D _q is the depth pixel value of q, P, Q are p, q in three-dimensional space In the coordinate values, T _p , T _q are the texel values of p and q, T _{p '} and T _{q '} are the texel values of p and q at the same position in the previous frame, and T _p ' is the texture of the p-filtered texture. The pixel value, th is the preset texture pixel difference threshold.

第二方面，本发明实施例提供一种三维视频滤波方法，包括： In a second aspect, an embodiment of the present invention provides a three-dimensional video filtering method, including:

根据所述待滤波像素和所述参考像素集合内的参考像素在所述三维空间的坐标值，计算所述待滤波像素和所述参考像素在所述三维空间内的空间邻近性；其中，所述参考像素集合在与所述待滤波像素所在同一帧图像和相邻多帧图像中；Calculating spatial proximity of the pixel to be filtered and the reference pixel in the three-dimensional space according to coordinate values of the pixel to be filtered and the reference pixel in the reference pixel set in the three-dimensional space; The reference pixel set is in the same frame image and the adjacent multi-frame image as the pixel to be filtered;

根据所述待滤波像素和所述参考像素集合内的参考像素所在帧的时间间隔，计算所述待滤波像素和所述参考像素的时域邻近性；Calculating a time domain proximity of the pixel to be filtered and the reference pixel according to a time interval of a frame in which the pixel to be filtered and a reference pixel in the reference pixel set are located;

根据所述空间邻近性、纹理像素值相似性和时域邻近性确定滤波的权值，分别对所述参考像素集合内的参考像素的像素值进行加权平均获得所述待滤波像素的滤波结果。Determining the weights of the filtering according to the spatial proximity, the texel value similarity, and the time domain proximity, respectively performing weighted averaging on the pixel values of the reference pixels in the reference pixel set to obtain the filtering result of the pixel to be filtered.

结合第二方面，在第二方面的第一种实现方式中，对深度图像滤波时，根据所述空间邻近性、所述深度像素对应的纹理像素值相似性和所述时域邻近性确定滤波的权值，分别对所述参考像素集合内的参考像素的深度像素值进行加权平均，获得所述深度图像待滤波像素的深度像素值的滤波结果；或，With reference to the second aspect, in a first implementation manner of the second aspect, when filtering the depth image, determining filtering according to the spatial proximity, the texel value similarity corresponding to the depth pixel, and the time domain proximity a weighted average of the depth pixel values of the reference pixels in the reference pixel set to obtain a filtering result of the depth pixel value of the depth image to be filtered pixel; or

对纹理图像滤波时，根据所述空间邻近性、所述纹理像素相似性和所述时域邻近性确定滤波的权值，分别对所述参考像素集合内的参考像素的纹理像素值进行加权平均，获得所述纹理图像待滤波像素的纹理像素值的滤波结果。When filtering the texture image, determining the weight of the filtering according to the spatial proximity, the texture pixel similarity, and the time domain proximity, respectively performing weighted average on the texel values of the reference pixels in the reference pixel set And obtaining a filtering result of the texel value of the texture image to be filtered pixel.

结合第二方面、或第二方面的第一种实现方式，在第二方面的第二种实现方式中，所述将图像平面中的像素投影到三维空间，包括：With reference to the second aspect, or the first implementation of the second aspect, in the second implementation manner of the second aspect, the projecting the pixels in the image plane into the three-dimensional space includes:

结合第二方面的第二种实现方式，在第二方面的第三种实现方式中，所述利用三维视频提供的深度图像信息、视点位置信息和参考相机参数信息，将所述像素从图像平面投影到三维空间，包括：With reference to the second implementation manner of the second aspect, in a third implementation manner of the second aspect, the using the depth image information, the viewpoint location information, and the reference camera parameter information provided by the three-dimensional video, the pixel from the image plane Projected into 3D space, including:

为所述像素在所述图像平面中的坐标值，

For the coordinate values of the pixels in the image plane,

结合第二方面、或第二方面的第一-第三任一种实现方式，在第二方面的第四种实现方式中，所述空间邻近性通过三维空间中所述待滤波像素和所述参考像素的距离作为函数的输入值计算得到；所述函数的输出值随着输入值的减小而增大；With reference to the second aspect, or the first to the third implementation of the second aspect, in a fourth implementation manner of the second aspect, the spatial proximity is through the pixel to be filtered in the three-dimensional space and the The distance of the reference pixel is calculated as an input value of the function; the output value of the function increases as the input value decreases;

所述时域邻近性通过所述待滤波像素和所述参考像素所在帧的时间间隔作为函数的输入值计算得到；所述函数的输出值随着输入值的减小而增大。The time domain proximity is calculated by using a time interval of the pixel to be filtered and a frame in which the reference pixel is located as a function of an input value; an output value of the function increases as the input value decreases.

结合第二方面、或第二方面的第一-第四任一种实现方式，在第二方面的第五种实现方式中，所述根据所述空间邻近性、纹理像素值相似性和时域邻近性确定滤波的权值，分别对所述参考像素集合内的参考像素的像素值进行加权平均获得所述待滤波像素的滤波结果，包括：With reference to the second aspect, or the first to fourth implementation manners of the second aspect, in the fifth implementation manner of the second aspect, the spatial proximity, texture pixel value similarity, and time domain are And determining, by the weighting average, the weighted average of the pixel values of the reference pixels in the reference pixel set to obtain the filtering result of the pixel to be filtered, including:

根据公式(3)：

计算获得所述待滤波像素的深度像素值的滤波结果；或，According to formula (3):

根据公式(4)：

计算获得所述待滤波像素的纹理像素值的滤波结果； According to formula (4):

其中，

用于计算所述待滤波像素和所述参考像素的纹理像素值相似性；

Means for calculating texture pixel value similarity between the pixel to be filtered and the reference pixel;

f_tem(i,N)＝f_tem(||i-N||)用于计算所述待滤波像素和所述参考像素的时域邻近性；f _tem (i, N)=f _tem (||iN||) for calculating a time domain proximity of the pixel to be filtered and the reference pixel;

其中，N为待滤波像素所在帧的帧号，i为参考像素所在帧的帧号，i取值为[N-m，N+n]区间的整数，m、n分别为在待滤波像素所在帧之前、之后的参考帧个数，m、n为非负整数，p为待滤波像素，q_i为第i帧中的参考像素，K_i为第i帧中的参考像素集合，D_p'为p滤波后的深度像素值，

第i帧中q的深度像素值，P、Q_i为p、第i帧中q在三维空间中的坐标值，T_p、

分别为p、第i帧中q的纹理像素值，T_p'为p滤波后的纹理像素值。Where N is the frame number of the frame in which the pixel to be filtered is located, i is the frame number of the frame in which the reference pixel is located, i is an integer in the interval [Nm, N+n], and m and n are respectively before the frame in which the pixel to be filtered is located And the number of reference frames, m, n are non-negative integers, p is the pixel to be filtered, q _i is the reference pixel in the ith frame, K _i is the reference pixel set in the ith frame, and D _p ' is p Filtered depth pixel value,

The depth pixel value of q in the i-th frame, P, Q _i is p, the coordinate value of q in the three-dimensional space in the i-th frame, T _p ,

They are p, the texel value of q in the i-th frame, and T _p ' is the p-filtered texel value.

第三方面，本发明实施例提供一种三维视频滤波装置，包括：In a third aspect, an embodiment of the present invention provides a three-dimensional video filtering apparatus, including:

投影模块，用于将图像平面中的像素投影到三维空间；所述像素包括待滤波像素和参考像素集合；a projection module, configured to project pixels in an image plane into a three-dimensional space; the pixels include a pixel to be filtered and a reference pixel set;

计算模块，用于根据所述待滤波像素和所述参考像素集合内的参考像素在所述三维空间的坐标值，计算所述待滤波像素和所述参考像素在所述三维空间内的空间邻近性；其中，所述参考像素集合与所述待滤波像素在同一帧图像中；a calculation module, configured to calculate spatial proximity of the pixel to be filtered and the reference pixel in the three-dimensional space according to coordinate values of the reference pixel in the to-be-filtered pixel and the reference pixel set in the three-dimensional space And wherein the reference pixel set is in the same frame image as the pixel to be filtered;

所述计算模块，还用于根据所述待滤波像素和所述参考像素集合内的参考像素的纹理像素值，计算所述待滤波像素和所述参考像素的纹理像素值相似性；The calculating module is further configured to calculate, according to the texel value of the pixel to be filtered and the reference pixel in the reference pixel set, a texture pixel value similarity between the pixel to be filtered and the reference pixel;

所述计算模块，还用于根据所述待滤波像素、所述参考像素集合内的参考像素和所述待滤波像素所在帧的前一帧图像中相同位置的像素的纹理像素值，计算所述待滤波像素和所述参考像素的运动特征一致性；The calculating module is further configured to calculate, according to the texel value of the pixel to be filtered, the reference pixel in the reference pixel set, and the pixel of the same position in the previous frame image of the frame where the pixel to be filtered is located, Consistency of motion characteristics of the pixel to be filtered and the reference pixel;

滤波模块，用于根据所述空间邻近性、纹理像素值相似性和运动特征一致性确定滤波的权值，分别对所述参考像素集合内的参考像素的像素值进行加权平均获得所述待滤波像素的滤波结果。a filtering module, configured to determine a weight of the filtering according to the spatial proximity, the texel value similarity, and the motion feature consistency, and respectively perform pixel values of the reference pixels in the reference pixel set A weighted average obtains a filtering result of the pixel to be filtered.

结合第三方面，在第三方面的第一种实现方式中，所述滤波模块，具体用于：With reference to the third aspect, in a first implementation manner of the third aspect, the filtering module is specifically configured to:

对深度图像滤波时，根据所述空间邻近性、所述深度像素对应的纹理像素值相似性和所述运动特征一致性确定滤波的权值，分别对所述参考像素集合内的参考像素的深度像素值进行加权平均，获得所述深度图像待滤波像素的深度像素值的滤波结果；或，When filtering the depth image, determining the weight of the filtering according to the spatial proximity, the texel value similarity corresponding to the depth pixel, and the motion feature consistency, respectively, the depth of the reference pixel in the reference pixel set Performing a weighted average of the pixel values to obtain a filtering result of the depth pixel value of the pixel to be filtered of the depth image; or

对纹理图像滤波时，根据所述空间邻近性、所述纹理像素相似性和所述运动特征一致性确定滤波的权值，分别对所述参考像素集合内的参考像素的纹理像素值进行加权平均，获得所述纹理图像待滤波像素的纹理像素值的滤波结果。When filtering the texture image, determining the weight of the filtering according to the spatial proximity, the texture pixel similarity, and the motion feature consistency, respectively performing weighted average on the texel values of the reference pixels in the reference pixel set And obtaining a filtering result of the texel value of the texture image to be filtered pixel.

结合第三方面、或第三方面的第一种实现方式，在第三方面的第二种实现方式中，所述投影模块，具体用于：With reference to the third aspect, or the first implementation manner of the third aspect, in the second implementation manner of the third aspect, the

结合第三方面的第二种实现方式，在第三方面的第三种实现方式中，所述投影模块，具体用于：With the second implementation of the third aspect, in a third implementation manner of the third aspect, the projection module is specifically configured to:

为所述像素在所述图像平面中的坐标值，

For the coordinate values of the pixels in the image plane,

结合第三方面、或第三方面的第一-第三任一种实现方式，在第三方面的第四种实现方式中，所述空间邻近性通过三维空间中所述待滤波像素和所述参考像素的距离作为函数的输入值计算得到；所述函数的输出值随着输入值的减小而增大；With reference to the third aspect, or the first to the third implementation of the third aspect, in a fourth implementation manner of the third aspect, the spatial proximity is through the pixel to be filtered in the three-dimensional space and the The distance of the reference pixel is calculated as an input value of the function; the output value of the function increases as the input value decreases;

当所述待滤波像素与前一帧中对应位置的像素的纹理像素值的差值，以及所述参考像素与前一帧中对应位置的像素的纹理像素值的差值，同时大于或小于预设的阈值时确定为所述待滤波像素和所述参考像素的运动状态一致；否则确定为所述待滤波像素和所述参考像素的运动状态不一致。a difference between a texel value of the pixel to be filtered and a pixel at a corresponding position in the previous frame, and a difference between the reference pixel and a texel value of the pixel at a corresponding position in the previous frame, which are simultaneously larger or smaller than The threshold value is determined to be consistent with the motion state of the pixel to be filtered and the reference pixel; otherwise, it is determined that the motion state of the pixel to be filtered and the reference pixel are inconsistent.

结合第三方面、或第三方面的第一-第四任一种实现方式，在第三方面的第五种实现方式中，所述滤波模块，具体用于：With reference to the third aspect, or the first to the fourth implementation manner of the third aspect, in the fifth implementation manner of the third aspect, the filtering module is specifically configured to:

根据公式(1)：

根据公式(2)：

其中，

第四方面，本发明实施例提供一种三维视频滤波装置，包括：In a fourth aspect, an embodiment of the present invention provides a three-dimensional video filtering apparatus, including:

计算模块，用于根据所述待滤波像素和所述参考像素集合内的参考像素在所述三维空间的坐标值，计算所述待滤波像素和所述参考像素在所述三维空间内的空间邻近性；其中，所述参考像素集合在与所述待滤波像素所在同一帧图像和相邻多帧图像中；a calculation module, configured to calculate spatial proximity of the pixel to be filtered and the reference pixel in the three-dimensional space according to coordinate values of the reference pixel in the to-be-filtered pixel and the reference pixel set in the three-dimensional space And the reference pixel set is in the same frame image and the adjacent multi-frame image as the pixel to be filtered;

所述计算模块，还用于根据所述待滤波像素和所述参考像素集合内的参考像素所在帧的时间间隔，计算所述待滤波像素和所述参考像素的时域邻近性；The calculating module is further configured to calculate a time domain proximity of the pixel to be filtered and the reference pixel according to a time interval of a frame in which the reference pixel in the pixel to be filtered and the reference pixel in the reference pixel set are located;

滤波模块，用于根据所述空间邻近性、纹理像素值相似性和时域邻近性确定滤波的权值，分别对所述参考像素集合内的参考像素的像素值进行加权平均获得所述待滤波像素的滤波结果。And a filtering module, configured to determine a weight of the filtering according to the spatial proximity, the texel value similarity, and the time domain proximity, and perform weighted averaging on the pixel values of the reference pixels in the reference pixel set respectively to obtain the to-be-filtered The filtering result of the pixel.

结合第四方面，在第四方面的第一种实现方式中，所述滤波模块，具体用于：With reference to the fourth aspect, in a first implementation manner of the fourth aspect, the filtering module is specifically configured to:

对深度图像滤波时，根据所述空间邻近性、所述深度像素对应的纹理像素值相似性和所述时域邻近性确定滤波的权值，分别对所述参考像素集合内的参考像素的深度像素值进行加权平均，获得所述深度图像待滤波像素的深度像素值的滤波结果；或，When filtering the depth image, determining the weight of the filtering according to the spatial proximity, the texel value similarity corresponding to the depth pixel, and the time domain proximity, respectively, the depth of the reference pixel in the reference pixel set Performing a weighted average of the pixel values to obtain a filtering result of the depth pixel value of the pixel to be filtered of the depth image; or

对纹理图像滤波时，根据所述空间邻近性、所述纹理像素相似性和所述时域邻近性确定滤波的权值，分别对所述参考像素集合内的参考像素的纹理像素值进行加权平均，获得所述纹理图像待滤波像素的纹理像素值的滤波结果。 When filtering the texture image, determining the weight of the filtering according to the spatial proximity, the texture pixel similarity, and the time domain proximity, respectively performing weighted average on the texel values of the reference pixels in the reference pixel set And obtaining a filtering result of the texel value of the texture image to be filtered pixel.

结合第四方面、或第四方面的第一种实现方式，在第四方面的第二种实现方式中，所述投影模块，具体用于：With reference to the fourth aspect, or the first implementation manner of the fourth aspect, in the second implementation manner of the fourth aspect, the

结合第四方面的第二种实现方式，在第四方面的第三种实现方式中，所述投影模块，具体用于：With reference to the second implementation manner of the fourth aspect, in a third implementation manner of the fourth aspect, the

为所述像素在所述图像平面中的坐标值，

For the coordinate values of the pixels in the image plane,

结合第四方面、或第四方面的第一-第三任一种实现方式，在第四方面的第四种实现方式中，所述空间邻近性通过三维空间中所述待滤波像素和所述参考像素的距离作为函数的输入值计算得到；所述函数的输出值随着输入值的减小而增大；With reference to the fourth aspect, or the first to the third implementation of the fourth aspect, in a fourth implementation manner of the fourth aspect, the spatial proximity is through the pixel to be filtered in the three-dimensional space and the The distance of the reference pixel is calculated as an input value of the function; the output value of the function increases as the input value decreases;

结合第四方面、或第四方面的第一-第四任一种实现方式，在第四方面的第五种实现方式中，所述滤波模块，具体用于： With reference to the fourth aspect, or the first to the fourth implementation manner of the fourth aspect, in the fifth implementation manner of the fourth aspect, the filtering module is specifically configured to:

根据公式(3)：

根据公式(4)：

计算获得所述待滤波像素的纹理像素值的滤波结果；According to formula (4):

其中，

本发明实施例三维视频滤波方法和装置，利用待滤波像素和参考像素在真实三维空间中的关系，计算所述待滤波像素和所述参考像素在所述三维空间内的空间邻近性、纹理像素值相似性、运动特征一致性和时域邻近性；根据空间邻近性、纹理像素值相似性和运动特征一致性确定滤波的权值，或根据空间邻近性、纹理像素值相似性和时域邻近性确定滤波的权值，分别对所述参考像素集合内的参考像素的像素值进行加权平均获得所述待滤波像素的滤波结果，计算权值时综合考虑了空间邻近性、纹理像素值相似性、运动特征一致性和时域邻近性，由于计算空间邻近性时利用的是真实三维空间中的位置，而且由于三维视频由一系列采集自不同时刻的图像组成，不同帧之间的像素点也存在相关性，因此权值考虑时域邻近性滤波后帧与帧之间的连续性较强，并且增加考虑了像素点之间的运动特征一致性，提高了滤波结果准确性，解决了现有技术中滤波结果准确性不高的问题。The three-dimensional video filtering method and apparatus of the embodiment of the present invention calculates the spatial proximity and texture pixels of the pixel to be filtered and the reference pixel in the three-dimensional space by using the relationship between the pixel to be filtered and the reference pixel in the real three-dimensional space. Value similarity, motion feature consistency, and time domain proximity; determine the weight of the filter based on spatial proximity, texture pixel value similarity, and motion feature consistency, or based on spatial proximity, texture pixel value similarity, and time domain proximity Sex determination of the weight of the filter, respectively The weighted average of the pixel values of the reference pixels in the reference pixel set is used to obtain the filtering result of the pixel to be filtered, and the spatial proximity, texture pixel value similarity, motion feature consistency and time domain proximity are comprehensively considered when calculating the weight. Since the space proximity is used to calculate the position in the real three-dimensional space, and since the three-dimensional video is composed of a series of images collected from different moments, the pixels between different frames also have correlation, so the weight considers the time domain. The continuity between the frame and the frame after the proximity filtering is strong, and the motion feature consistency between the pixel points is considered, the accuracy of the filtering result is improved, and the problem that the filtering result is not high in the prior art is solved. .

DRAWINGS

为了更清楚地说明本发明实施例或现有技术中的技术方案，下面将对实施例或现有技术描述中所需要使用的附图作一简单地介绍，显而易见地，下面描述中的附图是本发明的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动性的前提下，还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, a brief description of the drawings used in the embodiments or the prior art description will be briefly described below. Obviously, the drawings in the following description It is a certain embodiment of the present invention, and other drawings can be obtained from those skilled in the art without any inventive labor.

图1为现有技术的双边滤波器的计算邻近性的示意图；1 is a schematic diagram of computational proximity of a prior art bilateral filter;

图2为本发明三维视频滤波方法实施例一的流程图；2 is a flowchart of Embodiment 1 of a three-dimensional video filtering method according to the present invention;

图3为本发明方法实施例一的像素投影示意图；3 is a schematic diagram of pixel projection according to Embodiment 1 of the method of the present invention;

图4为本发明三维视频滤波方法实施例二的流程图；4 is a flowchart of Embodiment 2 of a three-dimensional video filtering method according to the present invention;

图5为本发明方法实施例二的参考像素选择示意图；FIG. 5 is a schematic diagram of reference pixel selection according to Embodiment 2 of the method of the present invention; FIG.

图6为本发明三维视频滤波装置实施例的结构示意图；6 is a schematic structural diagram of an embodiment of a three-dimensional video filtering device according to the present invention;

图7为本发明三维视频滤波设备实施例的结构示意图。FIG. 7 is a schematic structural diagram of an embodiment of a three-dimensional video filtering device according to the present invention.

detailed description

为使本发明实施例的目的、技术方案和优点更加清楚，下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例是本发明一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described in conjunction with the drawings in the embodiments of the present invention. It is a partial embodiment of the invention, and not all of the embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.

图2为本发明三维视频滤波方法实施例一的流程图，图3为本发明方法实施例一的像素投影示意图。如图2所示，本实施例的方法可以包括：FIG. 2 is a flowchart of Embodiment 1 of a method for filtering a three-dimensional video according to the present invention, and FIG. 3 is a schematic diagram of pixel projection according to Embodiment 1 of the method of the present invention. As shown in FIG. 2, the method in this embodiment may include:

步骤201、将图像平面中的像素投影到三维空间；所述像素包括待滤波像素和参考像素集合。 Step 201: Project a pixel in an image plane to a three-dimensional space; the pixel includes a pixel to be filtered and a reference pixel set.

可选地，将图像平面中的像素投影到三维空间，包括：Optionally, projecting pixels in the image plane into the three-dimensional space includes:

利用三维视频提供的深度图像信息、视点位置信息和参考相机参数信息，将像素从图像平面投影到三维空间；所述深度图像信息包括所述像素的深度像素值。The pixels are projected from the image plane to the three-dimensional space using depth image information, viewpoint position information, and reference camera parameter information provided by the three-dimensional video; the depth image information includes depth pixel values of the pixels.

可选地，所述利用三维视频提供的深度图像信息、视点位置信息和参考相机参数信息，将像素从图像平面投影到三维空间，包括：Optionally, the projecting the pixels from the image plane to the three-dimensional space by using the depth image information, the viewpoint position information and the reference camera parameter information provided by the three-dimensional video comprises:

为所述像素在所述图像平面中的坐标值，

For the coordinate values of the pixels in the image plane,

如图3所示，u v坐标所在的平面为图像平面，三维空间中的像素位置用世界坐标系中的坐标表示，p为图像平面中的像素，所述像素在所述图像平面中的坐标值为

应用三维投影技术，将像素投影到世界坐标系中的P点，所述像素在所世界坐标系中的坐标值为

该坐标值可以通过公式P＝R^-1(dA^-1p-t)计算得到，其中R和t为参考相机的旋转矩阵和平移矢量，d为像素的深度像素值，可以由三维视频提供的深度图信息得到，A为参考相机参数矩阵

f_x和f_y分别为水平和竖直方向的归一化焦距， r为径向畸变系数，(o_x,o_y)为所述图像平面上的基准点的坐标值；所述基准点为所述参考相机的光轴和所述图像平面的交点。As shown in FIG. 3, the plane where the uv coordinates are located is an image plane, the pixel positions in the three-dimensional space are represented by coordinates in the world coordinate system, p is a pixel in the image plane, and the coordinates of the pixels in the image plane are for

Applying 3D projection technology to project pixels to P points in the world coordinate system, the coordinates of the pixels in the world coordinate system

The coordinate value can be calculated by the formula P=R ^-1 (dA ^-1 pt), where R and t are the rotation matrix and translation vector of the reference camera, d is the depth pixel value of the pixel, and the depth map can be provided by the 3D video. Information is obtained, A is the reference camera parameter matrix

f _x and f _y are normalized focal lengths in horizontal and vertical directions, respectively, r is a radial distortion coefficient, and (o _x , o _y ) is a coordinate value of a reference point on the image plane; The intersection of the optical axis of the reference camera and the image plane.

步骤202、根据待滤波像素和参考像素集合内的参考像素在三维空间的坐标值，计算待滤波像素和参考像素在所述三维空间内的空间邻近性；其中，参考像素集合与待滤波像素在同一帧图像中。Step 202: Calculate spatial proximity of the pixel to be filtered and the reference pixel in the three-dimensional space according to coordinate values of the reference pixel in the pixel to be filtered and the reference pixel in the reference pixel set; wherein, the reference pixel set and the pixel to be filtered are In the same frame image.

可选地，所述空间邻近性通过三维空间中所述待滤波像素和所述参考像素的距离作为函数的输入值计算得到；所述函数的输出值随着输入值的减小而增大。Optionally, the spatial proximity is calculated by using a distance of the pixel to be filtered and the reference pixel in a three-dimensional space as a function of an input value; an output value of the function increases as the input value decreases.

具体地，两点之间的空间距离可以反映其空间邻近程度，距离越近相关性越强，则空间邻近性越大，即可以通过待滤波像素和参考像素集合内的参考像素在三维空间的坐标值计算空间距离，并将该空间距离作为输入值例如通过高斯函数计算出空间邻近性，计算空间邻近性的函数还可以是其他函数，但是需要保证该函数的输出值随着输入值的减小而增大；其中，本实施例中的参考像素集合与待滤波像素在同一帧中。Specifically, the spatial distance between two points may reflect the spatial proximity thereof. The closer the distance is, the stronger the correlation is, the larger the spatial proximity is, that is, the reference pixels in the pixel to be filtered and the reference pixel set in the three-dimensional space. The coordinate value calculates the spatial distance, and the spatial distance is used as the input value, for example, the spatial proximity is calculated by a Gaussian function. The function for calculating the spatial proximity may also be other functions, but it is necessary to ensure that the output value of the function decreases with the input value. Small and increasing; wherein the reference pixel set in this embodiment is in the same frame as the pixel to be filtered.

步骤203、根据待滤波像素和参考像素集合内的参考像素的纹理像素值，计算待滤波像素和参考像素的纹理像素值相似性。Step 203: Calculate texture pixel value similarity of the pixel to be filtered and the reference pixel according to the texel value of the reference pixel in the pixel to be filtered and the reference pixel set.

可选地，所述纹理像素值相似性通过所述待滤波像素和所述参考像素的纹理像素值的差值作为函数的输入值计算得到；所述函数的输出值随着输入值的减小而增大。Optionally, the texture pixel value similarity is calculated by using a difference between a texel value of the pixel to be filtered and the reference pixel as a function of an input value; and an output value of the function decreases with an input value. And increase.

具体地，两点之间纹理特征的差异程度反映了其相似程度，纹理越相似相关性越强，则纹理像素值相似性越大，即可以通过所述待滤波像素和所述参考像素的纹理像素值计算差值，并将该差值作为输入值例如通过高斯函数计算出纹理像素值相似性，计算纹理像素值相似性的函数还可以是其他函数，但是需要保证该函数的输出值随着输入值的减小而增大。Specifically, the degree of difference in the texture features between the two points reflects the degree of similarity. The more similar the texture is, the stronger the correlation is, the greater the similarity of the texel values, that is, the texture of the pixel to be filtered and the reference pixel. The pixel value is calculated as a difference value, and the difference value is used as an input value to calculate the similarity of the texel value, for example, by a Gaussian function. The function for calculating the similarity of the texel value may also be other functions, but it is necessary to ensure that the output value of the function follows The input value decreases as the value decreases.

步骤204、根据待滤波像素、参考像素集合内的参考像素和待滤波像素所在帧的前一帧图像中相同位置的像素的纹理像素值，计算待滤波像素和参考像素的运动特征一致性。Step 204: Calculate motion feature consistency of the pixel to be filtered and the reference pixel according to the texel value of the pixel in the same position in the pixel to be filtered, the reference pixel in the reference pixel set, and the previous frame image of the frame in which the pixel to be filtered is located.

可选地，所述运动特征一致性通过计算所述待滤波像素和所述参考像素的运动特征是否一致得到，包括：Optionally, the motion feature consistency is obtained by calculating whether the motion feature of the pixel to be filtered and the reference pixel are consistent, including:

当所述待滤波像素与前一帧中对应位置的像素的纹理像素值的差值，以及所述参考像素与前一帧中对应位置的像素的纹理像素值的差值，同时大于或小于预设的阈值时确定为所述待滤波像素和所述参考像素的运动特征一致；否则确定为所述待滤波像素和所述参考像素的运动特征不一致。a difference between a texel value of the pixel to be filtered and a pixel at a corresponding position in the previous frame, and a difference between the reference pixel and a texel value of a pixel at a corresponding position in the previous frame, and greater than Or less than a preset threshold, determining that the motion characteristics of the pixel to be filtered and the reference pixel are consistent; otherwise determining that the motion characteristics of the pixel to be filtered and the reference pixel are inconsistent.

具体地，两点之间相对运动的关系也反映了其运动相似性，运动越相似相关性越强。由于从三维视频序列中很难获取像素点的运动信息，因此本发明实施例用前后两帧的像素在图像平面中相同位置纹理像素的差值来判断所述像素是否运动，当差值大于某一预设的阈值时，认为该像素的运动特征为运动，反之，认为该像素的运动特征为无运动；进一步地，用前后两帧的待滤波像素和参考像素在图像平面中相同位置纹理像素的差值来判断所述待滤波像素和参考像素运动特征是否一致，当差值都大于或小于某一预设的阈值时，认为所述待滤波像素和参考像素的运动特征一致，反之，运动特征不一致。如果像素点的运动特征一致，认为有相关性，反之认为无相关性。Specifically, the relationship between the relative motions between the two points also reflects its similarity in motion, and the more similar the motion, the stronger the correlation. Since it is difficult to obtain the motion information of the pixel from the three-dimensional video sequence, the embodiment of the present invention determines whether the pixel moves by using the difference between the pixels of the two positions of the pixels in the image plane at the same position in the image plane, when the difference is greater than a certain value. When a preset threshold is used, the motion feature of the pixel is considered to be motion, and conversely, the motion feature of the pixel is considered to be motionless; further, the pixel to be filtered and the reference pixel of the two frames before and after are in the same position in the image plane. The difference is used to determine whether the motion characteristics of the pixel to be filtered and the reference pixel are consistent. When the difference is greater than or less than a certain threshold, the motion characteristics of the pixel to be filtered and the reference pixel are considered to be the same. Inconsistent features. If the motion characteristics of the pixels are consistent, it is considered to be relevant, and vice versa.

步骤205、根据空间邻近性、纹理像素值相似性和运动特征一致性确定滤波的权值，分别对参考像素集合内的参考像素的像素值进行加权平均获得待滤波像素的滤波结果。Step 205: Determine a weight of the filter according to spatial proximity, texture pixel value similarity, and motion feature consistency, and perform weighted average on the pixel values of the reference pixels in the reference pixel set to obtain a filtering result of the pixel to be filtered.

可选地，对深度图像滤波时，根据所述空间邻近性、所述深度像素对应的纹理像素值相似性和所述运动特征一致性确定滤波的权值，分别对所述参考像素集合内的参考像素的深度像素值进行加权平均，获得所述待深度图像待滤波像素的深度像素值的滤波结果；或，Optionally, when filtering the depth image, determining, according to the spatial proximity, the texel value similarity corresponding to the depth pixel, and the motion feature consistency, determining a weight of the filtering, respectively, in the reference pixel set Performing a weighted average of the depth pixel values of the reference pixels to obtain a filtering result of the depth pixel values of the pixels to be filtered by the depth image; or

对纹理图像滤波时，根据所述空间邻近性、所述纹理像素相似性和所述运动特征一致性确定滤波的权值，分别对所述参考像素集合内的参考像素的纹理像素值进行加权平均，获得所述待纹理图像待滤波像素的纹理像素值的滤波结果。When filtering the texture image, determining the weight of the filtering according to the spatial proximity, the texture pixel similarity, and the motion feature consistency, respectively performing weighted average on the texel values of the reference pixels in the reference pixel set And obtaining a filtering result of the texel value of the pixel to be filtered of the image to be textured.

可选地，根据空间邻近性、纹理像素值相似性和运动特征一致性确定滤波的权值，分别对参考像素集合内的参考像素的像素值进行加权平均获得待滤波像素的滤波结果，包括：Optionally, determining the weight of the filtering according to the spatial proximity, the texel value similarity, and the motion feature consistency, respectively performing weighted averaging on the pixel values of the reference pixels in the reference pixel set to obtain the filtering result of the pixel to be filtered, including:

根据公式(1)：

计算获得待滤波像素的深度像素值的滤波结果；或， According to formula (1):

根据公式(2)：

计算获得待滤波像素的纹理像素值的滤波结果；According to formula (2):

其中，

用于计算待滤波像素和参考像素的空间邻近性；among them,

Used to calculate spatial proximity of the pixel to be filtered and the reference pixel;

f_T(T_p,T_q)＝f_T ⁽||T_p-T_q||)用于计算待滤波像素和参考像素的纹理像素值相似性；f _T (T _p , T _q )=f _T ⁽ ||T _p −T _q ||) is used for calculating texture pixel value similarity of the pixel to be filtered and the reference pixel;

用于计算待滤波像素和参考像素的运动特征一致性；

Used to calculate motion feature consistency of the pixel to be filtered and the reference pixel;

具体地，可以根据公式

对参考像素集合中的参考像素进行加权平均，计算获得待滤波像素的深度像素值的滤波结果；根据公式

对参考像素集合中的参考像素进行加权平均，计算获得待滤波像素的纹理像素值的滤波结果；Specifically, according to the formula

Performing weighted averaging on the reference pixels in the reference pixel set, and calculating a filtering result of obtaining the depth pixel value of the pixel to be filtered; according to the formula

Performing a weighted average on the reference pixels in the reference pixel set, and calculating a filtering result of obtaining the texel value of the pixel to be filtered;

其中，

用于计算待滤波像素和参考像素的空间邻近性；该函数的输入值为所述待滤波像素和所述参考像素的空间距离；所述函数的输出值随着输入值的减小而增大；among them,

Means for calculating spatial proximity of the pixel to be filtered and the reference pixel; an input value of the function is a spatial distance between the pixel to be filtered and the reference pixel; an output value of the function increases as the input value decreases ;

f_T(T_p,T_q)＝f_T(||T_p-T_q||)用于计算待滤波像素和参考像素的纹理像素值相似性；该函数的输入值为所述待滤波像素和所述参考像素的纹理像素值的差值；所述函数的输出值随着输入值的减小而增大；f _T (T _p , T _q )=f _T (||T _p −T _q ||) is used for calculating the texel value similarity of the pixel to be filtered and the reference pixel; the input value of the function is the pixel to be filtered a difference from a texel value of the reference pixel; an output value of the function increases as the input value decreases;

用于计算待滤波像素和参考像素的运动特征一致性；即当所述待滤波像素与前一帧中对应位置的像素的纹理像素值的差值，以及所述参考像素与前一帧中对应位置的像素的纹理像素值的差值，同时大于或小于预设的阈值时确定为所述待滤波像素和所述参考像素的运动特征一致；否则确定为所述待滤波像素和所述参考像素的运动特征不一致。

For calculating the motion feature consistency of the pixel to be filtered and the reference pixel; that is, the difference between the texel value of the pixel to be filtered and the pixel at the corresponding position in the previous frame, and the reference pixel corresponding to the previous frame The difference between the texel values of the pixels of the location is greater than or less than the preset threshold, and is determined to be consistent with the motion features of the pixel to be filtered and the reference pixel; otherwise determined as the pixel to be filtered and the reference pixel The motion characteristics are inconsistent.

其中，p为待滤波像素，q为参考像素，K为参考像素集合，此集合通常取以待滤波像素为中心的正方形区域，大小为5*5或7*7，D_p'为p滤波后的深度像素值，D_q为q的深度像素值，P、Q为p、q在三维空间中的坐标值，T_p、T_q为p、q的纹理像素值，T_p'、T_q'为p、q在前一帧相同位置的纹理像素值，此处相同位置是指在图像平面中对应相同的位置，T_p'为p滤波后的纹理像素值，th为预设的纹理像素差阈值。th为判断像素点运动特征是否一致的门限值，可根据三维视频序列内容的不同选择，一般为6～20，当选择适当时，可以较好的区分运动物体边界，使滤波后物体边界更加明显。Where p is the pixel to be filtered, q is the reference pixel, and K is the reference pixel set. This set is usually taken as a square region centered on the pixel to be filtered, and the size is 5*5 or 7*7, and D _p ' is filtered by p Depth pixel value, D _q is the depth pixel value of q, P, Q are the coordinate values of p, q in three-dimensional space, T _p , T _q are texture pixel values of p, q, T _{p '} , T _{q '} The texture pixel values of p and q at the same position in the previous frame, where the same position refers to the same position in the image plane, T _p ' is the filtered tex value of p, and th is the preset texture pixel difference Threshold. Th is a threshold value for judging whether the motion characteristics of the pixel points are consistent, and may be selected according to different contents of the three-dimensional video sequence, generally 6 to 20. When the selection is appropriate, the boundary of the moving object can be better distinguished, so that the boundary of the filtered object is further improved. obvious.

需要说明的是，在本实施例中步骤202、步骤203、步骤204不分先后顺序。It should be noted that, in this embodiment, step 202, step 203, and step 204 are in no particular order.

本实施例，利用待滤波像素和参考像素在真实三维空间中的关系，计算所述待滤波像素和所述参考像素在所述三维空间内的空间邻近性、纹理像素值相似性、运动特征一致性；根据空间邻近性、纹理像素值相似性和运动特征一致性确定滤波的权值，分别对所述参考像素集合内的参考像素的像素值进行加权平均获得所述待滤波像素的滤波结果，计算权值时综合考虑了空间邻近性、纹理像素值相似性、运动特征一致性，由于计算空间邻近性时利用的是真实三维空间中的位置，并且增加考虑了像素点之间的运动特征一致性，提高了滤波结果准确性，解决了现有技术中滤波结果准确性不高的问题。In this embodiment, the spatial proximity, the texel value similarity, and the motion feature of the pixel to be filtered and the reference pixel in the three-dimensional space are calculated by using a relationship between the pixel to be filtered and the reference pixel in a real three-dimensional space. Determining, according to the spatial proximity, the texel value similarity and the motion feature consistency, determining the weight of the filtering, respectively performing weighted averaging on the pixel values of the reference pixels in the reference pixel set to obtain the filtering result of the pixel to be filtered, Space is considered when calculating weights Proximity, texture pixel value similarity, motion feature consistency, the position in the real three-dimensional space is utilized when calculating the spatial proximity, and the motion feature consistency between the pixel points is increased, and the accuracy of the filtering result is improved. The problem that the filtering result in the prior art is not high is solved.

图4为本发明三维视频滤波方法实施例二的流程图，图5为本发明方法实施例二的参考像素选择示意图，如图4所示，本实施例的方法可以包括：4 is a flowchart of a second embodiment of a three-dimensional video filtering method according to the present invention. FIG. 5 is a schematic diagram of reference pixel selection according to a second embodiment of the present invention. As shown in FIG. 4, the method in this embodiment may include:

步骤401、将图像平面中的像素投影到三维空间；所述像素包括待滤波像素和参考像素集合。Step 401: Project a pixel in an image plane to a three-dimensional space; the pixel includes a pixel to be filtered and a reference pixel set.

可选地，利用三维视频提供的深度图像信息、视点位置信息和参考相机参数信息，将像素从图像平面投影到三维空间，包括：Optionally, using the depth image information, the viewpoint location information, and the reference camera parameter information provided by the three-dimensional video to project the pixel from the image plane to the three-dimensional space, including:

为所述像素在所述图像平面中的坐标值，

For the coordinate values of the pixels in the image plane,

步骤402、根据所述待滤波像素和所述参考像素集合内的参考像素在所述三维空间的坐标值，计算所述待滤波像素和所述参考像素在所述三维空间内的空间邻近性；其中，所述参考像素集合在与所述待滤波像素所在同一帧图像和相邻多帧图像中。Step 402: Calculate spatial proximity of the pixel to be filtered and the reference pixel in the three-dimensional space according to coordinate values of the reference pixel in the to-be-filtered pixel and the reference pixel set in the three-dimensional space; The reference pixel set is in the same frame image and the adjacent multi-frame image as the pixel to be filtered.

可选地，所述空间邻近性通过三维空间中所述待滤波像素和所述参考像素的距离作为函数的输入值计算得到；所述函数的输出值随着输入值的减小而增大。 Optionally, the spatial proximity is calculated by using a distance of the pixel to be filtered and the reference pixel in a three-dimensional space as a function of an input value; an output value of the function increases as the input value decreases.

步骤403、根据所述待滤波像素和所述参考像素集合内的参考像素的纹理像素值，计算所述待滤波像素和所述参考像素的纹理像素值相似性。Step 403: Calculate texture pixel value similarity between the pixel to be filtered and the reference pixel according to the texel value of the pixel to be filtered and the reference pixel in the reference pixel set.

可选地，所述纹理像素值相似性通过所述待滤波像素和所述参考像素的纹理像素值的差值作为函数的输入值计算得到；所述函数的输出值随着输入值的减小而增大；Optionally, the texture pixel value similarity is calculated by using a difference between a texel value of the pixel to be filtered and the reference pixel as a function of an input value; and an output value of the function decreases with an input value. Increase

步骤401、步骤402、步骤403中的实现原理与实施例一中的类似，此处不再赘述。The implementation principles in the steps 401, 402, and 403 are similar to those in the first embodiment, and are not described here.

步骤404、根据所述待滤波像素和所述参考像素集合内的参考像素所在帧的时间间隔，计算所述待滤波像素和所述参考像素的时域邻近性。Step 404: Calculate time domain proximity of the pixel to be filtered and the reference pixel according to a time interval of a frame in which the pixel to be filtered and a reference pixel in the reference pixel set are located.

可选地，所述时域邻近性通过所述待滤波像素和所述参考像素所在帧的时间间隔作为函数的输入值计算得到；所述函数的输出值随着输入值的减小而增大。Optionally, the time domain proximity is calculated by using a time interval of the pixel to be filtered and a frame where the reference pixel is located as a function input value; an output value of the function increases as the input value decreases. .

具体地，由于不同帧之间的像素共同反映了一段时间内物体的状态，其存在一定的相关性，此种相关性也能用于三维视频中图像的滤波方法中。因此，本发明在上述实施例一的权重计算方法的基础上，将参考像素的选取范围从当前待滤波像素点所在帧拓展至其相邻帧(滤波参考帧)，以增加滤波后帧与帧之间的连续性。如图5所示，各滤波参考帧中，参考像素的选取范围与待滤波帧中参考像素的选取范围一致，其中，第N帧为当前待滤波像素所在帧，选择其前m帧和后n帧作为滤波参考帧，各帧参考像素窗口与第N帧参考像素窗口坐标和大小相同(这里指图像平面上的坐标和大小相同)，K_i为第i帧中的参考像素集合，i取值为[N-m，N+n]区间的整数，m、n为非负整数，当n＝0时表示只从待滤波帧之前已经编码或解码的帧获取。Specifically, since the pixels between different frames collectively reflect the state of the object in a period of time, there is a certain correlation, and the correlation can also be used in the filtering method of the image in the three-dimensional video. Therefore, the present invention extends the selection range of the reference pixel from the frame where the pixel to be filtered is located to its adjacent frame (filtered reference frame) to increase the filtered frame and frame, based on the weight calculation method of the first embodiment. Continuity between. As shown in FIG. 5, in each filtering reference frame, the selected range of the reference pixel is consistent with the selected range of the reference pixel in the frame to be filtered, wherein the Nth frame is the frame where the pixel to be filtered is currently located, and the previous m frame and the subsequent n are selected. The frame is used as the filtering reference frame, and the reference pixel window of each frame is the same as the coordinate and size of the reference pixel window of the Nth frame (here, the coordinates and the size on the image plane are the same), and K _i is the reference pixel set in the ith frame, and the value of i is For an integer in the interval [Nm, N+n], m and n are non-negative integers, and when n=0, it means that only the frame that has been encoded or decoded before the frame to be filtered is acquired.

两点之间在时域上的距离反映了其时域邻近程度，时域距离越近相关性越强，则时域邻近性越大，即可以通过所述待滤波像素和所述参考像素所在的帧计算时间间隔，并将该时间间隔作为输入值例如通过高斯函数计算出时域邻近性，计算时域邻近性的函数还可以是其他函数，但是需要保证该函数的输出值随着输入值的减小而增大。The distance between the two points in the time domain reflects the degree of proximity of the time domain. The closer the time domain distance is, the stronger the correlation is, the larger the time domain proximity is, that is, the pixel to be filtered and the reference pixel can be located. The frame calculates the time interval and uses the time interval as an input value to calculate the time domain proximity, for example, by a Gaussian function. The function for calculating the time domain proximity may also be other functions, but it is necessary to ensure that the output value of the function follows the input value. The decrease increases.

步骤405、根据所述空间邻近性、纹理像素值相似性和时域邻近性确定滤波的权值，分别对所述参考像素集合内的参考像素的像素值进行加权平均获得所述待滤波像素的滤波结果。Step 405: Determine weights of the filtering according to the spatial proximity, the texel value similarity, and the time domain proximity, and perform weighted averaging on the pixel values of the reference pixels in the reference pixel set respectively to obtain the pixels to be filtered. Filter the result.

可选地，对深度图像滤波时，根据所述空间邻近性、所述深度像素对应的纹理像素值相似性和所述时域邻近性确定滤波的权值，分别对所述参考像素集合内的参考像素的深度像素值进行加权平均，获得所述深度图像待滤波像素的深度像素值的滤波结果；或，Optionally, when filtering the depth image, according to the spatial proximity, the depth pixel corresponds to The texel value similarity and the time domain proximity determine the weight of the filtering, respectively performing weighted averaging on the depth pixel values of the reference pixels in the reference pixel set to obtain the depth pixel value of the depth image to be filtered pixel Filtered result; or,

可选地，根据所述空间邻近性、纹理像素值相似性和时域邻近性确定滤波的权值，分别对所述参考像素集合内的参考像素的像素值进行加权平均获得所述待滤波像素的滤波结果，包括：Optionally, determining the weight of the filtering according to the spatial proximity, the texel value similarity, and the time domain proximity, respectively performing weighted averaging on the pixel values of the reference pixels in the reference pixel set to obtain the to-be-filtered pixel The filtering results include:

根据公式(3)：

根据公式(4)：

其中，

用于计算所述待滤波像素和所述参考像素的空间邻近性；；among them,

具体地，可以根据公式

其中，

f_tem(i,N)＝f_tem(||i-N||)用于计算所述待滤波像素和所述参考像素的时域邻近性；该函数的输入值为所述待滤波像素和所述参考像素所帧的时间间隔；所述函数的输出值随着输入值的减小而增大；f _tem (i, N)=f _tem (||iN||) for calculating a time domain proximity of the pixel to be filtered and the reference pixel; an input value of the function is the pixel to be filtered and the The time interval of the frame of the reference pixel; the output value of the function increases as the input value decreases;

其中，N为待滤波像素所在帧的帧号，i为参考像素所在帧的帧号，m、n分别为在待滤波像素所在帧之前、之后的参考帧个数，通常m、n可以为1～3，因为随着时间间隔的增加，帧与帧之间的相关性会变得非常小，可以忽略，p为待滤波像素，q_i为第i帧中的参考像素，K_i为第i帧中的参考像素集合，此集合通常取以待滤波像素为中心的正方形区域，大小为5*5或7*7，D_p'为p滤波后的深度像素值，

分别为p、第i帧中q的纹理像素值，T_p'为p滤波后的纹理像素值。Where N is the frame number of the frame in which the pixel to be filtered is located, i is the frame number of the frame in which the reference pixel is located, and m and n are the number of reference frames before and after the frame in which the pixel to be filtered is located, usually m and n may be 1 ～3, because as the time interval increases, the correlation between frames and frames becomes very small, which can be ignored. p is the pixel to be filtered, q _i is the reference pixel in the ith frame, and K _i is the ith a set of reference pixels in the frame, which is usually taken as a square region centered on the pixel to be filtered, and has a size of 5*5 or 7*7, and _Dp ' is a depth pixel value after p filtering.

需要说明的是，在本实施例中步骤402、步骤403、步骤404不分先后顺序。It should be noted that in the embodiment, step 402, step 403, and step 404 are in no particular order.

本实施例，利用待滤波像素和参考像素在真实三维空间中的关系，计算所述待滤波像素和所述参考像素在所述三维空间内的空间邻近性、纹理像素值相似性、和时域邻近性；根据空间邻近性、纹理像素值相似性和时域邻近性确定滤波的权值，分别对所述参考像素集合内的参考像素的像素值进行加权平均获得所述待滤波像素的滤波结果，计算权值时综合考虑了空间邻近性、纹理像素值相似性和时域邻近性，由于计算空间邻近性时利用的是真实三维空间中的位置，而且由于三维视频由一系列采集自不同时刻的图像组成，不同帧之间的像素点也存在相关性，因此权值考虑时域邻近性滤波后帧与帧之间的连续性较强，提高了滤波结果准确性，解决了现有技术中滤波结果准确性不高的问题。In this embodiment, the spatial proximity of the pixel to be filtered and the reference pixel in the three-dimensional space, the similarity of the texture pixel value, and the time domain are calculated by using the relationship between the pixel to be filtered and the reference pixel in the real three-dimensional space. Proximity; determining weights of the filtering according to the spatial proximity, the texel value similarity, and the time domain proximity, respectively performing weighted averaging on the pixel values of the reference pixels in the reference pixel set to obtain the filtering result of the pixel to be filtered When calculating weights, spatial proximity, texel value similarity and time domain proximity are considered. Since the spatial proximity is used to calculate the position in the real three-dimensional space, and because the three-dimensional video is collected from different moments The image composition, the pixel points between different frames also have correlation, so the weight value considers the continuity between the frame and the frame after the time domain proximity filtering is strong, which improves the accuracy of the filtering result and solves the prior art. The result of the filtering result is not high.

图6为本发明三维视频滤波装置实施例的结构示意图，如图6所示，本实施例的三维视频滤波装置60可以包括：投影模块601、计算模块602和滤波模块603；FIG. 6 is a schematic structural diagram of an embodiment of a three-dimensional video filtering apparatus according to the present invention. As shown in FIG. 6, the three-dimensional video filtering apparatus 60 of the present embodiment may include: a projection module 601, a calculation module 602, and a filtering module 603;

其中，投影模块601，用于将图像平面中的像素投影到三维空间；所述像素包括待滤波像素和参考像素集合；The projection module 601 is configured to project pixels in the image plane into the three-dimensional space; the pixels include a pixel to be filtered and a reference pixel set;

计算模块602，用于根据所述待滤波像素和所述参考像素集合内的参考像素在所述三维空间的坐标值，计算所述待滤波像素和所述参考像素在所述三维空间内的空间邻近性；其中，所述参考像素集合与所述待滤波像素在同一帧图像中；The calculating module 602 is configured to calculate a space of the pixel to be filtered and the reference pixel in the three-dimensional space according to the coordinate value of the pixel to be filtered and the reference pixel in the reference pixel set in the three-dimensional space Proximity; wherein the reference pixel set is in the same frame image as the pixel to be filtered;

所述计算模块602，还用于根据所述待滤波像素和所述参考像素集合内的参考像素的纹理像素值，计算所述待滤波像素和所述参考像素的纹理像素值相似性； The calculating module 602 is further configured to calculate, according to the texel value of the pixel to be filtered and the reference pixel in the reference pixel set, a texture pixel value similarity between the pixel to be filtered and the reference pixel;

所述计算模块602，还用于根据所述待滤波像素、所述参考像素集合内的参考像素和所述待滤波像素所在帧的前一帧图像中相同位置的像素的纹理像素值，计算所述待滤波像素和所述参考像素的运动特征一致性；The calculating module 602 is further configured to calculate, according to the texel value of the pixel to be filtered, the reference pixel in the reference pixel set, and the pixel in the same position in the previous frame image of the frame where the pixel to be filtered is located, Having a consistency of motion characteristics of the filtered pixel and the reference pixel;

滤波模块603，用于根据所述空间邻近性、纹理像素值相似性和运动特征一致性确定滤波的权值，分别对所述参考像素集合内的参考像素的像素值进行加权平均获得所述待滤波像素的滤波结果。a filtering module 603, configured to determine a weight of the filter according to the spatial proximity, the texel value similarity, and the motion feature consistency, and perform weighted averaging on the pixel values of the reference pixels in the reference pixel set respectively to obtain the to-be-checked The filtered result of the filtered pixel.

可选地，所述滤波模块603，具体用于：Optionally, the filtering module 603 is specifically configured to:

可选地，所述投影模块601，具体用于：Optionally, the projection module 601 is specifically configured to:

为所述像素在所述图像平面中的坐标值，

为所述像素在所述三维空间中的坐标值，d为所述像素的深度像素值；f_x和f_y分别为水平和竖直方向的归一化焦距，r为径向畸变系数，(o_x,o_y)为所述图像平面上的基准点的坐标值；所述基准点为所述参考相机的光轴和所述图像平面的交点。 Where R and t are the rotation matrix and translation vector of the reference camera, and A is the reference camera parameter matrix.

For the coordinate values of the pixels in the image plane,

可选地，所述空间邻近性通过三维空间中所述待滤波像素和所述参考像素的距离作为函数的输入值计算得到；所述函数的输出值随着输入值的减小而增大；Optionally, the spatial proximity is calculated by using an input value of the distance between the pixel to be filtered and the reference pixel in a three-dimensional space as a function; an output value of the function increases as the input value decreases;

根据公式(1)：

根据公式(2)：

其中，

本实施例的装置，可以用于执行图2所示方法实施例的技术方案，其实现原理和技术效果类似，此处不再赘述。The device in this embodiment may be used to implement the technical solution of the method embodiment shown in FIG. 2, and the implementation principle and technical effects are similar, and details are not described herein again.

在本发明三维视频滤波装置实施例二中，本实施例的装置在图6所示装置结构的基础上，进一步地，本实施例的三维视频滤波装置60中的投影模块601，用于将图像平面中的像素投影到三维空间；所述像素包括待滤波像素和参考像素集合；In the second embodiment of the three-dimensional video filtering device of the present invention, the device of the present embodiment is based on the device structure shown in FIG. 6. Further, the projection module 601 in the three-dimensional video filtering device 60 of the present embodiment is used for the image. a pixel in a plane is projected into a three-dimensional space; the pixel includes a pixel to be filtered and a reference pixel set;

计算模块602，用于根据所述待滤波像素和所述参考像素集合内的参考像素在所述三维空间的坐标值，计算所述待滤波像素和所述参考像素在所述三维空间内的空间邻近性；其中，所述参考像素集合在与所述待滤波像素所在同一帧图像和相邻多帧图像中；The calculating module 602 is configured to calculate a space of the pixel to be filtered and the reference pixel in the three-dimensional space according to the coordinate value of the pixel to be filtered and the reference pixel in the reference pixel set in the three-dimensional space Proximity; wherein the reference pixel set is in the same frame image and adjacent multi-frame image as the pixel to be filtered;

所述计算模块602，还用于根据所述待滤波像素和所述参考像素集合内的参考像素的纹理像素值，计算所述待滤波像素和所述参考像素的纹理像素值相似性；The calculating module 602 is further configured to calculate, according to the texel value of the pixel to be filtered and the reference pixel in the reference pixel set, a texture pixel value similarity between the pixel to be filtered and the reference pixel;

所述计算模块602，还用于根据所述待滤波像素和所述参考像素集合内的参考像素所在帧的时间间隔，计算所述待滤波像素和所述参考像素的时域邻近性；The calculating module 602 is further configured to calculate a time domain proximity of the to-be-filtered pixel and the reference pixel according to a time interval between the to-be-filtered pixel and a frame in which the reference pixel in the reference pixel set is located;

滤波模块603，用于根据所述空间邻近性、纹理像素值相似性和时域邻近性确定滤波的权值，分别对所述参考像素集合内的参考像素的像素值进行加权平均获得所述待滤波像素的滤波结果。a filtering module 603, configured to determine a weight of the filtering according to the spatial proximity, the texel value similarity, and the time domain proximity, and perform weighted averaging on the pixel values of the reference pixels in the reference pixel set respectively to obtain the to-be-determined The filtered result of the filtered pixel.

对纹理图像滤波时，根据所述空间邻近性、所述纹理像素相似性和所述时域邻近性确定滤波的权值，分别对所述参考像素集合内的参考像素的纹理像素值进行加权平均，获得所述纹理图像待滤波像素的纹理像素值的滤波结果。When filtering the texture image, determining the weight of the filtering according to the spatial proximity, the texture pixel similarity, and the time domain proximity, respectively, the texture of the reference pixel in the reference pixel set The pixel values are weighted and averaged to obtain a filtering result of the texture pixel values of the texture image to be filtered.

为所述像素在所述图像平面中的坐标值，

For the coordinate values of the pixels in the image plane,

根据公式(3)：

计算获得所述待滤波像素的深度像素值的滤波结果；或， According to formula (3):

根据公式(4)：

其中，

本实施例的装置，可以用于执行图4所示方法实施例的技术方案，其实现原理和技术效果类似，此处不再赘述。The device in this embodiment may be used to implement the technical solution of the method embodiment shown in FIG. 4, and the implementation principle and technical effects are similar, and details are not described herein again.

图7为本发明三维视频滤波设备实施例的结构示意图。如图7所示，本实施例提供的三维视频滤波设备70包括处理器701和存储器702。其中存储器702用于存储执行指令，当三维视频滤波设备70运行时，处理器701与存储器702之间通信，处理器701调用存储器702中的执行指令，用于执行任一方法实施例所述的技术方案，其实现原理和技术效果类似，此处不再赘述。FIG. 7 is a schematic structural diagram of an embodiment of a three-dimensional video filtering device according to the present invention. As shown in FIG. 7, the three-dimensional video filtering device 70 provided in this embodiment includes a processor 701 and a memory 702. The memory 702 is configured to store execution instructions. When the three-dimensional video filtering device 70 is in operation, the processor 701 communicates with the memory 702, and the processor 701 calls an execution instruction in the memory 702 for executing the method described in any of the method embodiments. The technical solution has similar implementation principles and technical effects, and will not be described here.

在本申请所提供的几个实施例中，应该理解到，所揭露的设备和方法，可以通过其它的方式实现。例如，以上所描述的设备实施例仅仅是示意性的，例如，所述单元或模块的划分，仅仅为一种逻辑功能划分，实际实现时可以有另外的划分方式，例如多个单元或模块可以结合或者可以集成到另一个系统，或一些特征可以忽略，或不执行。另一点，所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口，设备或模块的间接耦合或通信连接，可以是电性，机械或其它的形式。In the several embodiments provided by the present application, it should be understood that the disclosed apparatus and method may be implemented in other manners. For example, the device embodiments described above are merely illustrative. For example, the division of the unit or module is only a logical function division, and may be implemented in actual implementation. There are additional ways of dividing, for example multiple units or modules may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the mutual coupling or direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, device or module, and may be electrical, mechanical or otherwise.

所述作为分离部件说明的模块可以是或者也可以不是物理上分开的，作为模块显示的部件可以是或者也可以不是物理模块，即可以位于一个地方，或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。The modules described as separate components may or may not be physically separated. The components displayed as modules may or may not be physical modules, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the embodiment.

本领域普通技术人员可以理解：实现上述各方法实施例的全部或部分步骤可以通过程序指令相关的硬件来完成。前述的程序可以存储于一计算机可读取存储介质中。该程序在执行时，执行包括上述各方法实施例的步骤；而前述的存储介质包括：ROM、RAM、磁碟或者光盘等各种可以存储程序代码的介质。One of ordinary skill in the art will appreciate that all or part of the steps to implement the various method embodiments described above may be accomplished by hardware associated with the program instructions. The aforementioned program can be stored in a computer readable storage medium. The program, when executed, performs the steps including the foregoing method embodiments; and the foregoing storage medium includes various media that can store program codes, such as a ROM, a RAM, a magnetic disk, or an optical disk.

最后应说明的是：以上各实施例仅用以说明本发明的技术方案，而非对其限制；尽管参照前述各实施例对本发明进行了详细的说明，本领域的普通技术人员应当理解：其依然可以对前述各实施例所记载的技术方案进行修改，或者对其中部分或者全部技术特征进行等同替换；而这些修改或者替换，并不使相应技术方案的本质脱离本发明各实施例技术方案的范围。 Finally, it should be noted that the above embodiments are merely illustrative of the technical solutions of the present invention, and are not intended to be limiting; although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art will understand that The technical solutions described in the foregoing embodiments may be modified, or some or all of the technical features may be equivalently replaced; and the modifications or substitutions do not deviate from the technical solutions of the embodiments of the present invention. range.

Claims

A three-dimensional video filtering method, comprising:

Projecting pixels in the image plane into a three-dimensional space; the pixels comprising a pixel to be filtered and a reference pixel set;

Calculating spatial proximity of the pixel to be filtered and the reference pixel in the three-dimensional space according to coordinate values of the pixel to be filtered and the reference pixel in the reference pixel set in the three-dimensional space; The reference pixel set is in the same frame image as the pixel to be filtered;

Calculating texture pixel value similarity of the pixel to be filtered and the reference pixel according to the texel value of the pixel to be filtered and the reference pixel in the reference pixel set;

Calculating the pixel to be filtered and the reference pixel according to the texel value of the pixel to be filtered, the reference pixel in the reference pixel set, and the pixel in the same position in the previous frame image of the frame in which the pixel to be filtered is located Consistency of motion characteristics;

Determining the weights of the filtering according to the spatial proximity, the texel value similarity, and the motion feature consistency, respectively performing weighted averaging on the pixel values of the reference pixels in the reference pixel set to obtain the filtering result of the pixel to be filtered.

The method according to claim 1, wherein when filtering the depth image, determining the weight of the filtering according to the spatial proximity, the similarity of the texel values corresponding to the depth pixels, and the consistency of the motion features, Performing a weighted average on the depth pixel values of the reference pixels in the reference pixel set to obtain a filtering result of the depth pixel value of the depth image to be filtered pixel; or

When filtering the texture image, determining the weight of the filtering according to the spatial proximity, the texture pixel similarity, and the motion feature consistency, respectively performing weighted average on the texel values of the reference pixels in the reference pixel set And obtaining a filtering result of the texel value of the texture image to be filtered pixel.

The method according to claim 1 or 2, wherein the projecting pixels in the image plane into the three-dimensional space comprises:

The pixels are projected from the image plane to the three-dimensional space using depth image information, viewpoint position information, and reference camera parameter information provided by the three-dimensional video; the depth image information includes depth pixel values of the pixels.

The method according to claim 3, wherein said using said depth image information, viewpoint position information and reference camera parameter information provided by the three-dimensional video, said pixel from said image plane Projected into 3D space, including:

Calculating coordinate values of the pixel after being projected into the three-dimensional space according to the formula P=R ^-1 (dA ^-1 pt);

Where R and t are the rotation matrix and translation vector of the reference camera, and A is the reference camera parameter matrix.

For the coordinate values of the pixels in the image plane,

The method according to any one of claims 1 to 4, wherein the spatial proximity is calculated by inputting a distance between the pixel to be filtered and the reference pixel in a three-dimensional space as a function; The output value increases as the input value decreases;

The texture pixel value similarity is calculated by using a difference value of the texel value of the pixel to be filtered and the reference pixel as a function of an input value; an output value of the function increases as the input value decreases;

The motion feature consistency is obtained by calculating whether the motion characteristics of the pixel to be filtered and the reference pixel are consistent, including:

a difference between a texel value of the pixel to be filtered and a pixel at a corresponding position in the previous frame, and a difference between the reference pixel and a texel value of the pixel at a corresponding position in the previous frame, which are simultaneously larger or smaller than The threshold value is determined to be consistent with the motion state of the pixel to be filtered and the reference pixel; otherwise, it is determined that the motion state of the pixel to be filtered and the reference pixel are inconsistent.

The method according to any one of claims 1 to 5, wherein the determining the weight of the filtering according to the spatial proximity, the texel value similarity and the motion feature consistency, respectively, the reference pixel set Performing a weighted average of the pixel values of the reference pixels within the obtained pixel to obtain the filtering result of the pixel to be filtered, including:

According to formula (1):

According to formula (2):

among them,

f _T (T _p , T _q )=f _T (||T _p −T _q ||) is used for calculating texture pixel value similarity of the pixel to be filtered and the reference pixel;

Where p is the pixel to be filtered, q is the reference pixel, K is the reference pixel set, D _p ' is the depth pixel value after p filtering, D _q is the depth pixel value of q, P, Q are p, q in three-dimensional space In the coordinate values, T _p , T _q are the texel values of p and q, T _{p '} and T _{q '} are the texel values of p and q at the same position in the previous frame, and T _p ' is the texture of the p-filtered texture. The pixel value, th is the preset texture pixel difference threshold.

A three-dimensional video filtering method, comprising:

Calculating spatial proximity of the pixel to be filtered and the reference pixel in the three-dimensional space according to coordinate values of the pixel to be filtered and the reference pixel in the reference pixel set in the three-dimensional space; The reference pixel set is in the same frame image and the adjacent multi-frame image as the pixel to be filtered;

Calculating a time domain proximity of the pixel to be filtered and the reference pixel according to a time interval of a frame in which the pixel to be filtered and a reference pixel in the reference pixel set are located;

Determining the weights of the filtering according to the spatial proximity, the texel value similarity, and the time domain proximity, respectively performing weighted averaging on the pixel values of the reference pixels in the reference pixel set to obtain the filtering result of the pixel to be filtered.

The method according to claim 7, wherein when filtering the depth image, determining the weight of the filtering according to the spatial proximity, the texel value similarity corresponding to the depth pixel, and the time domain proximity, Performing a weighted average on the depth pixel values of the reference pixels in the reference pixel set to obtain a filtering result of the depth pixel value of the depth image to be filtered pixel; or

When filtering the texture image, determining the weight of the filtering according to the spatial proximity, the texture pixel similarity, and the time domain proximity, respectively performing weighted average on the texel values of the reference pixels in the reference pixel set And obtaining a filtering result of the texel value of the texture image to be filtered pixel.

The method according to claim 7 or 8, wherein the projecting pixels in the image plane into the three-dimensional space comprises:

The method according to claim 9, wherein the projecting the pixel from the image plane to the three-dimensional space by using the depth image information, the viewpoint position information and the reference camera parameter information provided by the three-dimensional video comprises:

For the coordinate values of the pixels in the image plane,

Method according to any of claims 7-10, wherein said spatial proximity The property is calculated by inputting the distance between the pixel to be filtered and the reference pixel in a three-dimensional space as a function; the output value of the function increases as the input value decreases;

The time domain proximity is calculated by using a time interval of the pixel to be filtered and a frame in which the reference pixel is located as a function of an input value; an output value of the function increases as the input value decreases.

The method according to any one of claims 7 to 11, wherein the determining the weight of the filtering according to the spatial proximity, the texel value similarity and the time domain proximity, respectively, the reference pixel set Performing a weighted average of the pixel values of the reference pixels within the obtained pixel to obtain the filtering result of the pixel to be filtered, including:

According to formula (3):

According to formula (4):

among them,

f _tem (i, N)=f _tem (||iN||) for calculating a time domain proximity of the pixel to be filtered and the reference pixel;

Where N is the frame number of the frame in which the pixel to be filtered is located, i is the frame number of the frame in which the reference pixel is located, i is an integer in the interval [Nm, N+n], and m and n are respectively before the frame in which the pixel to be filtered is located And the number of reference frames, m, n are non-negative integers, p is the pixel to be filtered, q _i is the reference pixel in the ith frame, K _i is the reference pixel set in the ith frame, and D _p ' is p Filtered depth pixel value,

A three-dimensional video filtering device, comprising:

a projection module, configured to project pixels in an image plane into a three-dimensional space; the pixels include a pixel to be filtered and a reference pixel set;

a calculation module, configured to calculate spatial proximity of the pixel to be filtered and the reference pixel in the three-dimensional space according to coordinate values of the reference pixel in the to-be-filtered pixel and the reference pixel set in the three-dimensional space And wherein the reference pixel set is in the same frame image as the pixel to be filtered;

The calculating module is further configured to calculate, according to the texel value of the pixel to be filtered and the reference pixel in the reference pixel set, a texture pixel value similarity between the pixel to be filtered and the reference pixel;

The calculating module is further configured to calculate, according to the texel value of the pixel to be filtered, the reference pixel in the reference pixel set, and the pixel of the same position in the previous frame image of the frame where the pixel to be filtered is located, Consistency of motion characteristics of the pixel to be filtered and the reference pixel;

a filtering module, configured to determine a weight of the filtering according to the spatial proximity, the texel value similarity, and the motion feature consistency, and perform weighted averaging on the pixel values of the reference pixels in the reference pixel set respectively to obtain the to-be-filtered The filtering result of the pixel.

The device according to claim 13, wherein the filtering module is specifically configured to:

When filtering the depth image, determining the weight of the filtering according to the spatial proximity, the texel value similarity corresponding to the depth pixel, and the motion feature consistency, respectively, the depth of the reference pixel in the reference pixel set Performing a weighted average of the pixel values to obtain a filtering result of the depth pixel value of the pixel to be filtered of the depth image; or

When filtering the texture image, determining the weight of the filtering according to the spatial proximity, the texture pixel similarity, and the motion feature consistency, respectively performing weighted average on the texel values of the reference pixels in the reference pixel set Obtaining a filter of the texture pixel value of the texture image to be filtered pixel Wave results.

The device according to claim 13 or 14, wherein the projection module is specifically configured to:

The device according to claim 15, wherein the projection module is specifically configured to:

For the coordinate values of the pixels in the image plane,

The apparatus according to any one of claims 13 to 16, wherein the spatial proximity is calculated by inputting a distance between the pixel to be filtered and the reference pixel in a three-dimensional space as a function; the function The output value increases as the input value decreases;

The device according to any one of claims 13-17, wherein the filtering module is specifically configured to:

According to formula (1):

According to formula (2):

among them,

A three-dimensional video filtering device, comprising:

a calculation module, configured to calculate spatial proximity of the pixel to be filtered and the reference pixel in the three-dimensional space according to coordinate values of the reference pixel in the to-be-filtered pixel and the reference pixel set in the three-dimensional space And wherein the reference pixel set is in the same manner as the pixel to be filtered One frame of image and adjacent multi-frame image;

The calculating module is further configured to calculate a time domain proximity of the pixel to be filtered and the reference pixel according to a time interval of a frame in which the reference pixel in the pixel to be filtered and the reference pixel in the reference pixel set are located;

And a filtering module, configured to determine a weight of the filtering according to the spatial proximity, the texel value similarity, and the time domain proximity, and perform weighted averaging on the pixel values of the reference pixels in the reference pixel set respectively to obtain the to-be-filtered The filtering result of the pixel.

The device according to claim 19, wherein the filtering module is specifically configured to:

When filtering the depth image, determining the weight of the filtering according to the spatial proximity, the texel value similarity corresponding to the depth pixel, and the time domain proximity, respectively, the depth of the reference pixel in the reference pixel set Performing a weighted average of the pixel values to obtain a filtering result of the depth pixel value of the pixel to be filtered of the depth image; or

The device according to claim 19 or 20, wherein the projection module is specifically configured to:

The device according to claim 21, wherein the projection module is specifically configured to:

For the coordinate values of the pixels in the image plane,

The apparatus according to any one of claims 19-22, wherein the spatial proximity is calculated by inputting a distance of the pixel to be filtered and the reference pixel in a three-dimensional space as a function; The output value increases as the input value decreases;

The device according to any one of claims 19 to 23, wherein the filtering module is specifically configured to:

According to formula (3):

According to formula (4):

among them,