[go: up one dir, main page]

WO2015139206A1 - Methods for 3d video coding - Google Patents

Methods for 3d video coding Download PDF

Info

Publication number
WO2015139206A1
WO2015139206A1 PCT/CN2014/073617 CN2014073617W WO2015139206A1 WO 2015139206 A1 WO2015139206 A1 WO 2015139206A1 CN 2014073617 W CN2014073617 W CN 2014073617W WO 2015139206 A1 WO2015139206 A1 WO 2015139206A1
Authority
WO
WIPO (PCT)
Prior art keywords
size
mpi
level
mode
sub
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2014/073617
Other languages
French (fr)
Inventor
Han HUANG
Kai Zhang
Jicheng An
Xianguo Zhang
Jian-Liang Lin
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
MediaTek Singapore Pte Ltd
Original Assignee
MediaTek Singapore Pte Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by MediaTek Singapore Pte Ltd filed Critical MediaTek Singapore Pte Ltd
Priority to PCT/CN2014/073617 priority Critical patent/WO2015139206A1/en
Publication of WO2015139206A1 publication Critical patent/WO2015139206A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/597Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding specially adapted for multi-view video sequence encoding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/513Processing of motion vectors
    • H04N19/517Processing of motion vectors by encoding
    • H04N19/52Processing of motion vectors by encoding by predictive encoding

Definitions

  • the invention relates generally to video processing.
  • the present invention relates to methods for 3D video coding.
  • 3D video coding 3DVC is developed for encoding or decoding video data of multiple views simultaneously captured by several cameras. Since all cameras capture the same scene from different viewpoints, multi-view video data contains a large amount of inter-view redundancy.
  • additional tools such as inter-view motion parameter prediction (IvMC), inter-view disparity compensation (IvDC), view synthesis prediction (VSP), and advanced residual prediction (ARP) [1] have been integrated to conventional 3D-HEVC (High Efficiency Video Coding) , MV-HEVC or 3D-AVC (Advanced Video Coding) codec.
  • IvMC, IvDC, VSP and their variations are used to construct merging candidates in addition to the HEVC merging candidate list.
  • motion parameter inheritance MPI
  • texture merging candidate is used to construct additional merging candidate.
  • a disparity vector such as the well-known neighboring block disparity vector (NBDV) or the depth-oriented NBDV (DoNBDV) is derived.
  • NBDV neighboring block disparity vector
  • DoNBDV depth-oriented NBDV
  • IvMC inter-view motion parameter prediction
  • a reference block determined by the derived DV is fetched from the already coded picture in the reference view.
  • this reference block is coded using motion compensated prediction (MCP)
  • MCP motion compensated prediction
  • motion parameters either at sub-PU level or PU level, are added as an additional merging candidate of the current PU.
  • a depth block is fetched from a reference view depth image and used as an estimator for the depth information of the current PU.
  • the current processed PU is divided into multiple sub-PUs.
  • the size of sub-PU can be 8x4 or 4x8, and is determined by the size of PU or the depth values in the 4 corners of the current processed PU's corresponding depth block.
  • each sub-PU its corresponding depth sub-block is used to derive a new DV for disparity compensation.
  • the VSP method can also be used as an additional merging candidate in current 3D-HEVC.
  • Fig. 3 The basic concept of the MPI for depth coding in current 3D-HEVC is illustrated in Fig. 3. First, the collocated texture block is fetched from the corresponding texture picture.
  • the associated motion parameters are used as candidate motion parameters for the current depth block.
  • the MPI method is used as an additional merging candidate in current 3D-HEVC.
  • the PU partition modes are shown in Fig. 4.
  • the MPI is always performed at sub-PU level. That is, a PU is first divided into non- overlapping sub-PUs, and then MPI is applied on each sub-PU.
  • the sub-PU level MPI could still generate irregular 8x12/12x8 and bi-predicted 4x8/8x4 MCP blocks that are not in HEVC specification.
  • the DV used in IvMP and VSP is derived from neighboring coded blocks.
  • the assumption is that the DV of current block is the same with that of neighboring blocks. However, such assumption is not always true.
  • Fig. 1 is a diagram illustrating the concept of IvMC.
  • Fig. 2 is a diagram illustrating the concept of VSP.
  • Fig. 3 is a diagram illustrating the concept of MPI.
  • Fig. 4 is a diagram showing the PU partition modes in current 3D-HEVC.
  • PART_2Nx2N and use sub-PU level MPI when PartMode is equal to PART_2Nx2N. Therefore, when MPI is applied and PartMode is not PART_2Nx2N, current PU is not divided into sub-PUs.
  • PartMode is not equal to PART_2Nx2N.
  • the DV can be directly coded or differentially coded.
  • the proposed method is referred to as generalized IvMP (GIvMP).
  • VSP generalized VSP
  • an embodiment of the present invention can be a circuit integrated into a video compression chip or program codes integrated into video compression software to perform the processing described herein.
  • An embodiment of the present invention may also be program codes to be executed on a Digital Signal Processor (DSP) to perform the processing described herein.
  • DSP Digital Signal Processor
  • the invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or field programmable gate array (FPGA).
  • processors can be configured to perform particular tasks according to the invention, by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention.
  • the software code or firmware codes may be developed in different programming languages and different format or style.
  • the software code may also be compiled for different target platform.
  • different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Methods for motion parameter inheritance merging candidate in 3D video coding are disclosed. With the proposed method, the MCP block sizes in 3D-HEVC are aligned with HEVC specification and computational complexity of the merging candidate list construction is reduced.

Description

METHODS FOR 3D VIDEO CODING
FIELD OF INVENTION
The invention relates generally to video processing. In particular, the present invention relates to methods for 3D video coding.
BACKGROUND OF THE INVENTION
3D video coding (3DVC) is developed for encoding or decoding video data of multiple views simultaneously captured by several cameras. Since all cameras capture the same scene from different viewpoints, multi-view video data contains a large amount of inter-view redundancy. To exploit the inter-view redundancy, additional tools such as inter-view motion parameter prediction (IvMC), inter-view disparity compensation (IvDC), view synthesis prediction (VSP), and advanced residual prediction (ARP) [1] have been integrated to conventional 3D-HEVC (High Efficiency Video Coding) , MV-HEVC or 3D-AVC (Advanced Video Coding) codec. IvMC, IvDC, VSP and their variations are used to construct merging candidates in addition to the HEVC merging candidate list. To exploit the correlation between texture video and corresponding depth map, when coding the depth map in 3D-HEVC, motion parameter inheritance (MPI), also termed as texture merging candidate, is used to construct additional merging candidate.
For a current prediction unit (PU), a disparity vector (DV), such as the well-known neighboring block disparity vector (NBDV) or the depth-oriented NBDV (DoNBDV) is derived.
The basic concept of the inter-view motion parameter prediction (IvMC) in current 3D- HEVC is illustrated in Fig. 1.
First, a reference block determined by the derived DV is fetched from the already coded picture in the reference view.
Second, if this reference block is coded using motion compensated prediction (MCP), the associated motion parameters are used as candidate motion parameters for the current PU.
Finally, these motion parameters, either at sub-PU level or PU level, are added as an additional merging candidate of the current PU.
The basic concept of the VSP in current 3D-HEVC is illustrated in Fig. 2.
First, given the derived DV, a depth block is fetched from a reference view depth image and used as an estimator for the depth information of the current PU.
Second, the current processed PU is divided into multiple sub-PUs. The size of sub-PU can be 8x4 or 4x8, and is determined by the size of PU or the depth values in the 4 corners of the current processed PU's corresponding depth block.
Finally, for each sub-PU, its corresponding depth sub-block is used to derive a new DV for disparity compensation.
The VSP method can also be used as an additional merging candidate in current 3D-HEVC.
The basic concept of the MPI for depth coding in current 3D-HEVC is illustrated in Fig. 3. First, the collocated texture block is fetched from the corresponding texture picture.
Second, if the texture block is coded using MCP, the associated motion parameters are used as candidate motion parameters for the current depth block.
When coding the depth map, the MPI method is used as an additional merging candidate in current 3D-HEVC.
The PU partition modes are shown in Fig. 4.
When the PU partition mode PartMode is equal to PART_2Nx2N, IvMP is applied at sub- PU level. However, when the PU partition mode PartMode is not equal to PART_2Nx2N IvMP is applied at PU level. In this way, the irregular 8x12/12x8 MCP blocks are avoided.
The MPI is always performed at sub-PU level. That is, a PU is first divided into non- overlapping sub-PUs, and then MPI is applied on each sub-PU.
The sub-PU level MPI could still generate irregular 8x12/12x8 and bi-predicted 4x8/8x4 MCP blocks that are not in HEVC specification.
The DV used in IvMP and VSP is derived from neighboring coded blocks. The assumption is that the DV of current block is the same with that of neighboring blocks. However, such assumption is not always true.
SUMMARY OF THE INVENTION
In light of the previously described problems, the restriction methods for MPI are proposed and generalized IvMP and VSP are proposed.
Other aspects and features of the invention will become apparent to those with ordinary skill in the art upon review of the following descriptions of specific embodiments. BRIEF DESCRIPTION OF THE DRAWINGS
The invention can be more fully understood by reading the subsequent detailed description and examples with references made to the accompanying drawings, wherein:
Fig. 1 is a diagram illustrating the concept of IvMC.
Fig. 2 is a diagram illustrating the concept of VSP. Fig. 3 is a diagram illustrating the concept of MPI.
Fig. 4 is a diagram showing the PU partition modes in current 3D-HEVC.
DETAILED DESCRIPTION
The following description is of the best-contemplated mode of carrying out the invention.
This description is made for the purpose of illustrating the general principles of the invention and should not be taken in a limiting sense. The scope of the invention is best determined by reference to the appended claims.
Proposed method 1 :
It is proposed to use PU level MPI when the PU partition mode PartMode is not equal to
PART_2Nx2N, and use sub-PU level MPI when PartMode is equal to PART_2Nx2N. Therefore, when MPI is applied and PartMode is not PART_2Nx2N, current PU is not divided into sub-PUs.
Proposed method 2:
It is proposed to disable MPI, both PU and sub-PU level, when the PU partition mode
PartMode is not equal to PART_2Nx2N.
Proposed method 3 :
It is proposed to explicit signaling a DV for IvMP. The DV can be directly coded or differentially coded. In the following, the proposed method is referred to as generalized IvMP (GIvMP).
Proposed method 4:
It is proposed to explicit signaling a DV for VSP. The DV can be directly coded or differentially coded. In the following, the proposed method is referred to as generalized VSP (GVSP).
The proposed methods described above can be used in a video encoder as well as in a video decoder. Embodiments of the proposed method according to the present invention as described above may be implemented in various hardware, software codes, or a combination of both. For example, an embodiment of the present invention can be a circuit integrated into a video compression chip or program codes integrated into video compression software to perform the processing described herein. An embodiment of the present invention may also be program codes to be executed on a Digital Signal Processor (DSP) to perform the processing described herein. The invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or field programmable gate array (FPGA). These processors can be configured to perform particular tasks according to the invention, by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention. The software code or firmware codes may be developed in different programming languages and different format or style. The software code may also be compiled for different target platform. However, different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.
The invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described examples are to be considered in all respects only as illustrative and not restrictive. To the contrary, it is intended to cover various modifications and similar arrangements (as would be apparent to those skilled in the art). Therefore, the scope of the appended claims should be accorded the broadest interpretation so as to encompass all such modifications and similar arrangements.

Claims

I . A method of restriction for MPI merge candidate in 3D video coding, comprising determining whether to use MPI merge candidate according to a condition.
2. The method as claimed in claim 1, wherein use of MPI merge candidate depends on PU partition mode.
3. The method as claimed in claim 2, wherein sub-PU level MPI is used when the PU partition mode is equal to SIZE_2Nx2N and PU level MPI is used when the PU partition mode is not equal to SIZE_2Nx2N.
4. The method as claimed in claim 3, when the PU partition mode PartMode is equal to
SIZE_2NxN, SIZE_Nx2N, SIZE_2NxnU, SIZE_2NxnD, SIZE_nLx2N or SIZE_nRx2N, the
MPI merge candidate is applied at PU level instead of sub-PU level.
5. The method as claimed in claim 4, wherein sub-PU level MPI is disabled and PU level
MPI candidate is used when the PU partition mode PartMode is not equal to PART_2Nx2N.
6. The method as claimed in claim 2, wherein both PU level and sub-PU level MPI are disabled when the PU partition mode PartMode is not equal to SIZE_2Nx2N.
7. The method as claimed in claim 6, wherein both PU level and sub-PU level MPI are disabled when the PU partition mode PartMode is equal to SIZE_2NxN, SIZE_Nx2N,
SIZE_2NxnU, SIZE_2NxnD, SIZE_nLx2N or SIZE_nRx2N.
8. The method of explicit signaling a DV for IvMP, instead of deriving from neighboring blocks.
9. The method as claimed in claim 8, IvMP is added as an additional mode (GIvMP) for 3D video coding instead of added as a merging candidate; and the DV for IvMP is signaled by differential coding.
10. The method as claimed in claim 9, the DV in GIvMP mode is predicted by neighboring blocks and then the prediction error Disparity Vector Difference (DVD) is coded; at the decoder, DV is reconstructed by adding DVD to the predictor.
I I . The method as claimed in claim 9, the DV in GIvMP mode is coded at integer pixel accuracy instead of quarter pixel accuracy.
12. The method of explicit signaling a DV for VSP, instead of deriving from neighboring blocks.
13. The method as claimed in claim 12, wherein VSP is added as an additional mode (GVSP) for 3D video coding instead of added as a merging candidate; the DV for VSP is signaled by differential coding.
14. The method as claimed in claim 13, wherein the DV in GVSP mode is predicted by neighboring blocks and then the prediction error Disparity Vector Difference (DVD) is coded; at the decoder, DV is reconstructed by adding DVD to the predictor.
15. The method as claimed in claim 13, wherein the DV in GVSP mode is coded at integer pixel accuracy instead of quarter pixel accuracy.
PCT/CN2014/073617 2014-03-18 2014-03-18 Methods for 3d video coding Ceased WO2015139206A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2014/073617 WO2015139206A1 (en) 2014-03-18 2014-03-18 Methods for 3d video coding

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2014/073617 WO2015139206A1 (en) 2014-03-18 2014-03-18 Methods for 3d video coding

Publications (1)

Publication Number Publication Date
WO2015139206A1 true WO2015139206A1 (en) 2015-09-24

Family

ID=54143638

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/073617 Ceased WO2015139206A1 (en) 2014-03-18 2014-03-18 Methods for 3d video coding

Country Status (1)

Country Link
WO (1) WO2015139206A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101166271A (en) * 2006-10-16 2008-04-23 华为技术有限公司 A Viewpoint Difference Estimation/Compensation Method in Multi-viewpoint Video Coding
CN102801995A (en) * 2012-06-25 2012-11-28 北京大学深圳研究生院 Template-matching-based multi-view video motion and parallax vector prediction method
WO2014029086A1 (en) * 2012-08-22 2014-02-27 Mediatek Singapore Pte. Ltd. Methods to improve motion vector inheritance and inter-view motion prediction for depth map

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101166271A (en) * 2006-10-16 2008-04-23 华为技术有限公司 A Viewpoint Difference Estimation/Compensation Method in Multi-viewpoint Video Coding
CN102801995A (en) * 2012-06-25 2012-11-28 北京大学深圳研究生院 Template-matching-based multi-view video motion and parallax vector prediction method
WO2014029086A1 (en) * 2012-08-22 2014-02-27 Mediatek Singapore Pte. Ltd. Methods to improve motion vector inheritance and inter-view motion prediction for depth map

Similar Documents

Publication Publication Date Title
WO2015062002A1 (en) Methods for sub-pu level prediction
WO2015192781A1 (en) Method of sub-pu syntax signaling and illumination compensation for 3d and multi-view video coding
WO2015003383A1 (en) Methods for inter-view motion prediction
WO2016008157A1 (en) Methods for motion compensation using high order motion model
CA2920413C (en) Method of deriving default disparity vector in 3d and multiview video coding
WO2015109598A1 (en) Methods for motion parameter hole filling
US20150085932A1 (en) Method and apparatus of motion vector derivation for 3d video coding
WO2014166068A1 (en) Refinement of view synthesis prediction for 3-d video coding
WO2015192706A1 (en) Method of coding for depth based block partitioning mode in three-dimensional or multi-view video coding
WO2015006951A1 (en) Methods for fast encoder decision
WO2015007348A1 (en) Method and apparatus for encoding and decoding a texture block using depth based block partitioning
WO2015100710A1 (en) Existence of inter-view reference picture and availability of 3dvc coding tools
WO2015123806A1 (en) Methods for depth based block partitioning
WO2014005280A1 (en) Method and apparatus to improve and simplify inter-view motion vector prediction and disparity vector prediction
JP5986657B2 (en) Simplified depth-based block division method
WO2015100731A1 (en) Methods for determining the prediction partitions
EP2936815A1 (en) Method and apparatus of disparity vector derivation in 3d video coding
WO2015006922A1 (en) Methods for residual prediction
WO2014029086A1 (en) Methods to improve motion vector inheritance and inter-view motion prediction for depth map
CN105519120A (en) Method for sub-PU syntax signaling and brightness compensation in 3D and multi-view video coding
WO2015113245A1 (en) Methods for merging candidates list construction
WO2015139206A1 (en) Methods for 3d video coding
WO2014106327A1 (en) Method and apparatus for inter-view residual prediction in multiview video coding
WO2013159326A1 (en) Inter-view motion prediction in 3d video coding
WO2014023024A1 (en) Methods for disparity vector derivation

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14885877

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14885877

Country of ref document: EP

Kind code of ref document: A1