WO2016119666A1 - Procédé et appareil de codage/décodage d'image - Google Patents
Procédé et appareil de codage/décodage d'image Download PDFInfo
- Publication number
- WO2016119666A1 WO2016119666A1 PCT/CN2016/072039 CN2016072039W WO2016119666A1 WO 2016119666 A1 WO2016119666 A1 WO 2016119666A1 CN 2016072039 W CN2016072039 W CN 2016072039W WO 2016119666 A1 WO2016119666 A1 WO 2016119666A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- pixel
- predicted value
- value
- block
- representative
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/103—Selection of coding mode or of prediction mode
- H04N19/105—Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/132—Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/56—Motion estimation with initialisation of the vector search, e.g. estimating a good candidate to initiate a search
Definitions
- the present invention relates to the field of image processing, and in particular to an image encoding and decoding method and apparatus.
- HEVC High Efficiency Video Coding
- the natural form of a digital video signal of an image is a sequence of images.
- a frame of image is usually a rectangular area composed of several pixels, and a digital video signal is a sequence of video images composed of tens of frames to thousands of frames of images, sometimes simply referred to as a video sequence or sequence.
- Encoding a digital video signal encodes a frame by frame image. At any one time, the image of the frame being encoded is referred to as the current encoded image.
- decoding a compressed video stream of a digital video signal (referred to as a bitstream, also referred to as a bitstream) is decoding a stream of one frame by one image.
- the image of the frame being decoded is referred to as the current decoded image.
- the current encoded image or the currently decoded image is collectively referred to as the current image.
- coding block from a decoding perspective, that is, a decoding block, collectively referred to as a codec block
- CU Coding Unit
- encoding a video image sequence is to encode one CU for each coding unit of each frame image, that is, CU.
- the CU being coded is referred to as the current coded CU.
- decoding the code stream of a video image sequence is also decoding one CU for each CU of each frame image, and finally reconstructing the entire video image sequence.
- the CU being decoded is referred to as the currently decoded CU.
- the current coding CU or the current decoding CU is collectively referred to as the current CU.
- each CU in one frame of image can be different, some are 8x8, some are 64x64, and so on.
- LCUs Large Coding Units
- NxN pixels the same size and having NxN pixels
- each LCU is further divided into Multiple CUs of a tree structure that are not necessarily the same size. Therefore, the LCU is also referred to as a "Coding Tree Unit (CTU)".
- CTU Coding Tree Unit
- One of the LCUs is composed of three 32x32 pixel CUs and four 16x16 pixel CUs, so that seven tree-structured CUs form a CTU.
- the other LCU consists of two 32x32 pixel CUs, three 16x16 pixel CUs, and 20 8x8 pixel CUs.
- Such 25 CUs in a tree structure constitute another CTU.
- Encoding one frame of image is to sequentially encode one CU in one CTU.
- LCU is synonymous with CTU.
- the CU can also be further divided into sub-areas.
- the sub-area includes, but is not limited to, a prediction unit (Picture Unit, abbreviated as PU), a transform unit (Transform Unit, TU for short), and an asymmetric division (Asymmetric Multi-Processing, AMP for short).
- PU Picture Unit
- TU transform Unit
- AMP asymmetric Multi-Processing
- a color pixel usually consists of three components.
- the two most commonly used pixel color formats are the GBR color format consisting of a green component, a blue component, and a red component, and a YUV color consisting of a luma component and two chromaticity components.
- the format commonly known as the YUV color format, actually includes multiple color formats, such as the YCbCr color format. Therefore, when encoding a CU, a CU can be divided into three component planes (G plane, B plane, R plane or Y plane, U plane, V plane), and the three component planes are respectively coded; The three component bundles of one pixel are combined into one 3-tuple, and the CUs composed of these 3-tuples are encoded as a whole.
- the arrangement of the former pixel and its components is called the planar format of the image (and its CU), and the arrangement of the latter pixel and its components is called the stacked format of the image (and its CU). Format).
- the GBR color format and the YUV color format of the pixel are both 3-component representation formats of the pixel.
- the value of a pixel can also be represented by the index of the palette.
- the palette space stores the value or approximate value of the three components of the pixel that needs to be represented.
- the address of the palette is called the index of the pixel stored in this address.
- An index can represent one component of a pixel, and an index can also represent three components of a pixel.
- the palette can be one or more. In the case of multiple palettes, a complete index is actually composed of the palette number and the index of the numbered palette.
- the index representation format of a pixel is to represent this pixel with an index.
- the index representation format of a pixel is also referred to as an indexed color or a pseudo color representation format of a pixel in the prior art, or is often referred to directly as an indexed pixel or a pseudo pixel (pseudo pixel). ) or pixel index or index. Indexes are sometimes referred to as indices.
- the representation of a pixel in its index representation format is also referred to as indexing or indexing.
- CMYK presentation formats Other commonly used prior art pixel representation formats include CMYK presentation formats and grayscale representation formats.
- the YUV color format can be subdivided into several seed formats according to whether the chroma component is downsampled: a YUV 4:4:4 pixel color format consisting of 1 Y component, 1 U component, and 1 V component.
- the left and right adjacent pixels are composed of two Y components, one U component, and one V component in a YUV 4:2:2 pixel color format; four pixels arranged in a left and right adjacent position by 2x2 spatial position are composed of four pixels.
- YUV4: 2:0 pixel color format consisting of Y component, 1 U component, and 1 V component.
- a component is generally represented by a number of 8 to 16 bits.
- the YUV4:2:2 pixel color format and the YUV4:2:0 pixel color format are all downsampled for the YUV4:4:4 pixel color format.
- a pixel component is also referred to as a pixel sample or simply as a sample.
- the most basic element when encoding or decoding can be one pixel, one pixel component, or one pixel index (ie, index pixel).
- a pixel or a pixel component or an index pixel, which is the most basic element of encoding or decoding, is collectively referred to as a pixel sample, sometimes referred to as a pixel value, or simply as a sample.
- pixel sample In the present patent application, "pixel sample”, “pixel value”, “sample value”, “index pixel”, “pixel index” are synonyms, depending on the context, whether it is clear whether “pixel” or “one pixel” The “component” still means “index pixel” or both. If it is not clear from the context, then it means either of the three.
- a coded block or a decoded block is an area composed of a number of pixel values.
- the shape of the codec block may be a rectangle, a square, a parallelogram, a trapezoid, a polygon, a circle, an ellipse, a string, and the like.
- a rectangle also includes a rectangle whose width or height is one pixel value that degenerates into a line (ie, a line segment or a line shape).
- each codec block may have a different shape and size.
- some or all of the codec blocks may overlap each other, or all codec blocks may not overlap each other.
- a codec block may be composed of "pixels”, or may be composed of “components of pixels”, or may be composed of "index pixels”, or may be composed of a mixture of the three, or any of the three. Mixed composition.
- a codec block refers to an area in which encoding or decoding is performed in a frame image, including but not limited to at least one of the following: a maximum coding unit LCU, a coding tree unit CTU, and a coding unit CU. a sub-region of the CU, a prediction unit PU, a transform unit TU, a series of pixels, and a set of pixels.
- Video image compression techniques in the related art include: prediction modes (including but not limited to intra prediction and inter prediction) and copying modes (including but not limited to block copying, index copying, microblock copying, strip copying, string copying, and rectangles). Copy, point copy).
- prediction modes including but not limited to intra prediction and inter prediction
- copying modes including but not limited to block copying, index copying, microblock copying, strip copying, string copying, and rectangles. Copy, point copy).
- copy is the optimal matching pixel that the copy encoder searches for. Therefore, from the perspective of the encoder, the copy mode is also called the matching mode (including but not limited to block matching, index matching, microblock matching, bar matching, string matching, rectangle matching, point matching).
- An important feature of the prediction mode and the copy mode is to copy reconstructed pixel samples (including fully reconstructed pixels) from image regions (including directly adjacent and indirectly adjacent) outside the current codec block (referred to as the current block).
- Samples and/or different degrees of partially reconstructed pixel samples referred to as predicted values (also referred to as reference values) of pixel samples (referred to as current pixel samples) in the current encoding or decoding, which are to be predicted The value is assigned to the current pixel sample as the reconstructed pixel sample of the current pixel sample.
- the predicted value is obtained from the reconstructed pixel samples of the image region outside the current block, and has no direct relationship with the pixel samples of the current block, and thus the positions of the reconstructed pixels exist.
- the present invention provides an image encoding and decoding method and apparatus, to at least solve the problem that the positions of the reconstructed pixels obtained by obtaining the predicted values in the related art are far from the current block, and the correlation between each other is small, and the compression efficiency is small. Low problem.
- an image encoding method including:
- a predicted value of the pixel sample in the coding block where the predicted value is generated by: reconstructing a pixel sample value of the image region other than the coded block and a pixel value of the representative pixel set, The predicted value of the pixel sample;
- the information required to construct the representative set of pixels and the information required to construct the predicted values are encoded and written to the video stream.
- the coding block is an coding region of an image, including at least one of: a maximum coding unit LCU, a coding tree unit CTU, a coding unit CU, a sub-region of a CU, a prediction unit PU, a transformation unit TU, pixel string, pixel group, pixel microblock, pixel strip;
- the image area other than the coded block includes at least one of: an image area other than the variant of the coded block; an image area formed by the partially coded pixels.
- the manner in which the predicted value is generated includes one of the following:
- the predicted value is set to a pixel value of the representative pixel set or a pixel value variant of the representative pixel set.
- the predictive value of the pixel samples in the encoded block is constructed using at least one of the following predefined encoding parameters, including:
- a type of a pixel component of the coding block wherein one pixel component or two of the pixel components of the pixel component select a pixel value of the representative pixel set as a predicted value, the pixel component except the one pixel component Or other pixel components other than the two pixel components, and selecting a pixel value of the reconstructed pixel sample as a predicted value;
- the manner in which the predicted value is generated includes:
- the predicted value is set to a value of the optimal representative pixel sample, if The second predefined condition is not met, the pre- The measured value is set to the value of the reconstructed pixel sample.
- the distance is an error between pixels
- the first predefined condition is that the error is minimal
- the second predefined condition is that the error is less than a first predetermined threshold
- setting the predicted value to the optimal representative pixel sample or the reconstructed pixel sample further includes:
- a condition parameter constructing the predicted value is written in the video code stream, wherein a pixel sample value satisfying the condition parameter is used as the predicted value.
- writing a flag indicating the predicted value in the video code stream comprises at least one of the following:
- a flag bit for identifying a prediction value selected for the entire sequence into a parameter set of at least one of: a video parameter set VPS, a sequence parameter set SPS, and an image parameter set PPS;
- a flag bit for identifying a prediction value selected for the block layer is written to the maximum coding unit LCU and/or the coding unit CU.
- condition parameter constructing the predicted value is written in the video code stream, including at least one of the following:
- the predetermined condition parameters for the block layer are written to the maximum coding unit LCU layer and/or the coding unit CU layer.
- the position of the reconstructed pixel sample outside the encoded block in the image is divided into rows and/or columns adjacent to the encoded block;
- the predicted value is set to a value of the reconstructed pixel sample if the reconstructed pixel sample is located outside of a row and/or column adjacent to the encoded block.
- the method includes:
- the flag of the predicted value is directly obtainable from the video code stream or derived from existing parameters in the video code stream;
- the flag of the predicted value is used to indicate that one of the values of the representative pixel samples of the representative pixel set and the reconstructed pixel samples of the coded block are selected as the current encoding in the encoding process.
- the flag bit includes one of the following locations at the location of the video stream:
- determining the predicted value according to the number of pixels of the representative pixel set includes:
- the predicted value is set as a value of the reconstructed pixel sample as a predicted value.
- an image decoding method including:
- Parsing a code stream obtaining a decoding parameter of the decoding block, where the decoding parameter includes at least one of: a parameter required to construct a representative pixel set of the decoding block; and constructing a pixel sample value in the decoding block The parameters required to predict the value;
- the representative pixel set is constructed to determine a predicted value, wherein the predicted value is generated by constructing the predicted value according to the reconstructed pixel sample and the pixel value of the representative pixel set.
- the decoding block is one decoding region of an image, and includes at least one of: a maximum coding unit LCU, a coding tree unit CTU, a coding unit CU, a sub-region of a CU, a prediction unit PU, and a transformation unit.
- TU pixel string, pixel group, pixel microblock, pixel strip;
- the image area other than the decoded block includes at least one of: an image area other than the variant of the decoded block; an image area formed by the partially decoded pixel.
- the manner in which the predicted value is generated includes one of the following:
- the predicted value is set to a pixel value of the representative pixel set or a pixel value variant of the representative pixel set.
- constructing a predicted value of a pixel sample in the decoded block using at least one of the following predefined decoding parameters includes:
- a type of a pixel component of the decoding block wherein one pixel component or two of the pixel components of the pixel component select a pixel value of the representative pixel set as a predicted value, the pixel component except the one pixel component Or other pixel components other than the two pixel components, and selecting a pixel value of the reconstructed pixel sample as a predicted value;
- the manner in which the predicted value is generated includes:
- the predicted value is set to a value of the optimal representative pixel sample, if The second predefined condition is not met and the predicted value is set to the value of the reconstructed pixel sample.
- the method includes: the distance is an error between pixels, the first predefined condition is that the error is the smallest, and the second predefined condition is that the error is less than a first predetermined threshold.
- setting the predicted value to the optimal representative pixel sample or the reconstructed pixel sample includes:
- a condition parameter in the code stream is parsed, and the predicted value is set to a pixel sample that satisfies the condition parameter.
- parsing the flag bits in the code stream comprises at least one of the following:
- the parameter set for parsing the flag bit used to identify the prediction value selected for the sequence includes at least one of the following: a video parameter set VPS, a sequence parameter set SPS, and an image parameter set PPS;
- a maximum coding unit LCU layer and/or a coding unit CU layer in which the flag bits for identifying the prediction values selected for the block layer are located are parsed.
- parsing the predetermined condition parameters in the code stream comprises at least one of the following:
- Parsing a parameter set for the predetermined condition parameter of the sequence comprising at least one of the following: a video parameter set VPS, a sequence parameter set SPS, and an image parameter set PPS;
- a maximum coding unit LCU layer and/or a coding unit CU layer in which the predetermined condition parameter for the block layer is located is parsed.
- the method includes:
- the position of the reconstructed pixel samples outside the decoded block in the image is divided into rows and/or columns adjacent to the decoded block:
- the predicted value is set to a value of the reconstructed pixel sample if the reconstructed pixel sample is located outside of a row and/or column adjacent to the decoded block.
- the method includes:
- the flag of the predicted value is directly obtained from the code stream or obtained according to an existing parameter in the code stream; the flag bit of the predicted value indicates that the predicted value is set as the representative pixel.
- the value of the set representative pixel sample is also the value of the reconstructed pixel sample outside the decoded block.
- determining the predicted value according to the number of pixels of the representative pixel set comprises:
- the predicted value is set to a value of the reconstructed pixel sample.
- an image encoding apparatus including:
- a first determining module configured to construct a representative pixel set of the encoded block, and determine information required to construct the representative pixel set
- a second determining module configured to construct a predicted value of the pixel sample of the encoded block, where the predicted value is generated by: reconstructing a pixel sample according to an image region other than the encoded block and the representative pixel set a pixel value that constructs a predicted value of the pixel sample;
- An encoding module is arranged to encode information required to construct the representative set of pixels and information required to construct the predicted value, and write the code stream.
- an image decoding apparatus including:
- a parsing module configured to parse the code stream, to obtain a decoding parameter of the decoding block, where the decoding parameter includes at least one of the following parameters: constructing a parameter required for the representative pixel set of the decoding block, and constructing a pixel in the decoding block The parameters required for the predicted value of the sample;
- a third determining module configured to construct the representative pixel set, and determine a predicted value, where the predicted value is generated by: constructing the predicted value according to the reconstructed pixel sample and the pixel value of the representative pixel set .
- the representative pixel set of the coding block is constructed, the information required to construct the representative pixel set is determined, and the predicted value of the pixel sample in the coding block is constructed, and the predicted value is generated according to the coding block.
- Image area reconstruction Pixel samples and pixel values of the representative pixel set, constructing a predicted value of the pixel sample, encoding information required to construct the representative pixel set, and information required to construct the predicted value, and writing the video code stream to solve
- the distance between the positions of the reconstructed pixels for obtaining the predicted values and the current block is long, the correlation between each other is small, and the compression efficiency is low, and the predicted values of the image regions from outside the current block are improved. When the pixel samples are correlated, the compression efficiency is effectively improved.
- FIG. 1 is a flow chart of an image encoding method according to an embodiment of the present invention.
- FIG. 2 is a flowchart of an image decoding method according to an embodiment of the present invention.
- FIG. 3 is a block diagram 1 of an image encoding apparatus according to an embodiment of the present invention.
- FIG. 4 is a block diagram 2 of an image decoding apparatus according to an embodiment of the present invention.
- FIG. 5 is a schematic diagram of obtaining a current pixel prediction value according to a preferred embodiment of the present invention.
- FIG. 6 is a schematic flow chart of an encoding method according to a preferred embodiment of the present invention.
- FIG. 7 is a schematic flowchart diagram of a decoding method provided by a preferred embodiment of the present invention.
- FIG. 1 is a flowchart of an image coding method according to an embodiment of the present invention. As shown in FIG. 1, the process includes the following steps:
- Step S102 constructing a representative pixel set of the coding block, and determining information required to construct the representative pixel set;
- Step S104 constructing a predicted value of the pixel sample in the coded block, where the predicted value is generated by: constructing the pixel according to the reconstructed pixel sample value of the image region other than the coded block and the pixel value of the representative pixel set The predicted value of the sample;
- Step S106 encoding the information required to construct the representative pixel set and the information required to construct the predicted value, and writing the video code stream.
- constructing a representative pixel set of the coding block determining information required for constructing the representative pixel set, and constructing a predicted value of the pixel sample value in the coding block, where the predicted value is generated according to the coding block Constructing a reconstructed pixel sample of the image region and a pixel value of the representative pixel set, constructing a predicted value of the pixel sample, and configuring the representative pixel set
- the information and the information needed to construct the predicted value are encoded and written into the video bitstream, which solves the problem that the position of the reconstructed pixels obtaining the predicted value is far from the current block, and the correlation between the two is small.
- the problem of low compression efficiency improves the correlation between the predicted value of the image region from outside the current block and the pixel sample value, and effectively improves the compression efficiency.
- the coding block is an coding region of an image, and includes at least one of: a maximum coding unit LCU, a coding tree unit CTU, a coding unit CU, a sub-region of a CU, a prediction unit PU, a transform unit TU, and a pixel.
- a maximum coding unit LCU a coding tree unit CTU
- a coding unit CU a sub-region of a CU
- a prediction unit PU a transform unit TU
- a pixel String, pixel group, pixel microblock, pixel strip
- the image area outside the coded block includes at least one of: an image area other than the variant of the coded block; an image area formed by the partially coded pixels.
- the manner in which the predicted value is generated includes one of the following:
- the predicted value is set to a pixel value of the representative pixel set or a pixel value variant of the representative pixel set.
- constructing a predicted value of the pixel sample in the encoding block including:
- a type of a pixel component of the coding block wherein one pixel component of the pixel component or two pixel components select a pixel value of the representative pixel set as a predicted value, the pixel component except the one pixel component or the two pixel components a pixel value other than the pixel component, the pixel value of the reconstructed pixel sample is selected as a predicted value;
- the manner in which the predicted value is generated includes:
- the predicted value is set to the value of the optimal representative pixel sample, if the second is not satisfied A predefined condition that sets the predicted value to the value of the reconstructed pixel sample.
- the distance is an error between pixels
- the first predefined condition is that the error is the smallest
- the second predefined condition is that the error is less than the first predetermined threshold
- setting the predicted value to the optimal representative pixel sample or the reconstructed pixel sample includes:
- a condition parameter constructing the predicted value is written in the video code stream, wherein a pixel sample value satisfying the condition parameter is used as the predicted value.
- writing a flag indicating the predicted value in the video code stream includes at least one of the following:
- a flag bit for identifying a prediction value selected for the entire sequence into a parameter set of at least one of: a video parameter set VPS, a sequence parameter set SPS, and an image parameter set PPS;
- a flag bit for identifying a prediction value selected for the block layer is written to the maximum coding unit LCU and/or the coding unit CU.
- condition parameter for constructing the predicted value is written in the video code stream, including at least one of the following:
- the predetermined condition parameter for the block layer is written to the maximum coding unit LCU layer and/or the coding unit CU layer.
- the position of the reconstructed pixel sample outside the coded block in the image is divided into rows and/or columns adjacent to the coded block;
- the predicted value is set to the value of the reconstructed pixel sample.
- the flag of the predicted value is directly obtained from the video code stream or obtained according to an existing parameter in the video code stream;
- the flag of the predicted value is used to indicate that one of the value of the representative pixel sample value of the representative pixel set and the reconstructed pixel sample value other than the coded block is selected as the current coded pixel sample value in the encoding process.
- the flag bit includes one of the following locations in the video stream:
- determining the predicted value according to the number of pixels of the representative pixel set includes:
- the predicted value is set as the value of the reconstructed pixel sample as a predicted value.
- FIG. 2 is a flowchart of an image decoding method according to an embodiment of the present invention. As shown in FIG. 2, the process includes the following steps:
- Step S202 parsing the code stream, and acquiring a decoding parameter of the decoding block, where the decoding parameter includes at least one of the following parameters: a parameter required to construct a representative pixel set of the decoding block; and constructing a pixel sample value in the decoding block.
- Step S204 constructing the representative pixel set to determine a predicted value, wherein the predicted value is generated by constructing the predicted value according to the reconstructed pixel sample value and the pixel value of the representative pixel set.
- the code stream is parsed to obtain a decoding parameter of the decoding block, where the decoding parameter includes at least one of the following parameters: a parameter required to construct a representative pixel set of the decoding block; and a pixel sample in the decoding block is constructed.
- the correlation of pixel samples effectively improves the compression efficiency.
- the decoding block is a decoding region of an image, including at least one of: a maximum coding unit LCU, a coding tree unit CTU, a coding unit CU, a sub-region of a CU, a prediction unit PU, a transform unit TU, and a pixel.
- a maximum coding unit LCU a coding tree unit CTU
- a coding unit CU a sub-region of a CU
- a prediction unit PU a transform unit TU
- a pixel String, pixel group, pixel microblock, pixel strip
- the image area outside the decoded block includes at least one of: an image area other than the variant of the decoded block; an image area formed by the partially decoded pixel.
- the manner in which the predicted value is generated includes one of the following:
- the predicted value is set to a pixel value of the representative pixel set or a pixel value variant of the representative pixel set.
- constructing a predicted value of the pixel sample in the decoded block including:
- the type of the pixel component of the decoded block wherein one pixel component of the pixel component or two of the pixel components are selected
- the pixel value of the representative pixel set is used as a predicted value, and the pixel component selects a pixel value of the reconstructed pixel sample as a predicted value in addition to the one pixel component or other pixel components other than the two pixel components;
- the generated manner of the predicted value includes:
- the predicted value is set to the value of the optimal representative pixel sample, if the second is not satisfied A predefined condition that sets the predicted value to the value of the reconstructed pixel sample.
- the distance is an error between pixels
- the first predefined condition is that the error is the smallest
- the second predefined condition is that the error is less than the first predetermined threshold
- setting the predicted value to the optimal representative pixel sample or the reconstructed pixel sample includes:
- condition parameter in the code stream is parsed, and the predicted value is set to a pixel sample that satisfies the condition parameter.
- parsing the flag bit in the code stream includes at least one of the following:
- the parameter set for parsing the flag bit used to identify the prediction value selected for the sequence includes at least one of the following: a video parameter set VPS, a sequence parameter set SPS, and an image parameter set PPS;
- a maximum coding unit LCU layer and/or a coding unit CU layer in which the flag bits for identifying the prediction values selected for the block layer are located are parsed.
- parsing the predetermined condition parameter in the code stream includes at least one of the following:
- Parsing a parameter set for the predetermined condition parameter of the sequence comprising at least one of the following: a video parameter set VPS, a sequence parameter set SPS, and an image parameter set PPS;
- a maximum coding unit LCU layer and/or a coding unit CU layer in which the predetermined condition parameter for the block layer is located is parsed.
- the position of the reconstructed pixel sample outside the decoded block in the image is divided into rows and/or columns adjacent to the decoded block:
- the reconstructed pixel sample is located within a row and/or column adjacent to the decoded block, setting the predicted value to a value of a representative pixel sample of the representative pixel set;
- the predicted value is set to the value of the reconstructed pixel sample.
- At least one of the data units in the code stream is parsed to obtain a flag bit of the predicted value, and the data unit includes at least one of the following:
- the flag of the predicted value is directly obtained from the code stream or derived according to an existing parameter in the code stream; the flag bit of the predicted value indicates that the predicted value is set as a representative pixel sample of the representative pixel set The value is also the value of the reconstructed pixel sample outside of the decoded block.
- determining the predicted value according to the number of pixels of the representative pixel set includes:
- the predicted value is set to a value of the reconstructed pixel sample.
- module may implement a combination of software and/or hardware of a predetermined function.
- apparatus described in the following embodiments is preferably implemented in software, hardware, or a combination of software and hardware, is also possible and contemplated.
- FIG. 3 is a block diagram showing the structure of an image encoding apparatus according to an embodiment of the present invention. As shown in FIG. 3, the apparatus includes:
- a first determining module 32 configured to construct a representative pixel set of the encoded block, and determine information required to construct the representative pixel set
- the second determining module 34 is connected to the first determining module 32 and configured to construct a predicted value of the pixel sample of the encoded block, where the predicted value is generated by: reconstructing a pixel according to an image region other than the encoded block Constructing a predicted value of the pixel sample value and the pixel value of the representative pixel set;
- the encoding module 36 is coupled to the second determining module 34 and configured to encode the information required to construct the representative set of pixels and the information required to construct the predicted value, and write the code stream.
- the first determining module 32 constructs a representative pixel set of the coding block, determines information required to construct the representative pixel set
- the second determining module 34 constructs a predicted value of the pixel sample of the encoded block, and the generation of the predicted value
- the method includes: constructing a predicted value of the pixel sample according to the reconstructed pixel sample of the image region outside the coded block and the pixel value of the representative pixel set, and the encoding module 36 constructs the information required for the representative pixel set and
- the information required to construct the predicted value is encoded and written into the code stream, which solves the problem that the positions of the reconstructed pixels for obtaining the predicted value are far from the current block, the correlation between the two is small, and the compression efficiency is low.
- the problem is that the correlation between the predicted value of the image region from the current block and the pixel sample is improved, and the compression efficiency is effectively improved.
- FIG. 4 is a second structural block diagram of an image decoding apparatus according to an embodiment of the present invention. As shown in FIG. 4, the apparatus includes:
- the parsing module 42 is configured to parse the code stream, and obtain a decoding parameter of the decoding block, where the decoding parameter includes at least one of the following parameters: constructing a parameter required for the representative pixel set of the decoding block, and constructing a pixel sample in the decoding block The required parameters for the predicted value of the value;
- the third determining module 44 is connected to the parsing module 42 and configured to construct the representative pixel set to determine a predicted value, where the predicted value is generated according to the reconstructed pixel sample value and the pixel value of the representative pixel set, The predicted value.
- the parsing module 42 parses the code stream to obtain the decoding parameters of the decoding block, where the decoding parameter includes at least one of the following parameters: constructing parameters required for the representative pixel set of the decoding block, and constructing the pixels in the decoding block.
- the third determining module 44 constructs the representative pixel set to determine a predicted value, wherein the predicted value is generated according to the reconstructed pixel sample and the pixel value of the representative pixel set. Constructing the predicted value solves the problem that the position of the reconstructed pixels obtained by obtaining the predicted value is far from the current block, the correlation between each other is small, and the compression efficiency is low, and the problem from the current block is improved.
- the correlation between the predicted value of the image area and the pixel sample value effectively improves the compression efficiency.
- each of the above modules may be implemented by software or hardware.
- the foregoing may be implemented by, but not limited to, the foregoing modules are all located in the same processor; or, the modules are located in multiple In the processor.
- the main technical feature of the preferred embodiment of the present invention is that when encoding or decoding a current block, constructing a representative pixel set of the current block, reconstructing pixel samples from the image region outside the current block and representative pixels of the current block.
- the set of pixel values optionally in combination with other codec parameters, obtains predicted values for some or all of the current pixel samples according to predetermined rules.
- FIG. 5 is a schematic diagram of obtaining a current pixel prediction value according to a preferred embodiment of the present invention, constructing a representative pixel set of 8 representative pixels from a current block. For some or all of the pixels of the current block, the predicted values of some or all of the pixels of the current block are obtained from the reconstructed pixels of the image region outside the current block and the representative pixels of the current block.
- the pixel shown in FIG. 5 may be in a stacked format or a flat format. Therefore, the method of the preferred embodiment of the present invention can be applied to encoding or decoding pixels of a coded block or a decoded block of a stacked packet format, and can also be applied to a pixel sample of a plane of a coded block or a decoded block of a planar format. Encoding or decoding.
- the most basic unique technical features are: counting and analyzing the characteristics of the current coding block, constructing a representative pixel set of the current coding block, also called a color palette; searching for the current coding block a predicted value of the pixel that best matches; the manner in which the predicted value is generated includes, but is not limited to, a reconstructed pixel sample from an image region other than the current encoded block and a pixel value of a representative pixel set of the current encoded block, optionally in combination with other
- the encoding parameters are obtained according to a predetermined rule; the information required to construct the representative pixel set is written into the video compressed code stream, and the information required to obtain the predicted value is written into the video compressed code stream.
- the most basic characteristic feature is that when the compressed code stream data of the current decoding block is decoded, the representative pixel set constructing the current decoding block is parsed from the code stream data, which is also called The information required by the palette is parsed to obtain the information required to obtain the predicted value of the pixel of the current decoded block; the calculation of the predicted value
- the manner includes, but is not limited to, reconstructed pixel samples from image regions outside the current decoded block and pixel values of representative pixel sets of the current decoded block, optionally in combination with other decoding parameters, calculated according to predetermined rules.
- FIG. 6 is a schematic flowchart of an encoding method according to a preferred embodiment of the present invention. As shown in FIG. 6, the encoding method of the present invention includes but is not limited to the following steps:
- S602. Perform prediction or copy coding on the coding block, and generate at least a prediction value of a pixel sample of the coding block, where the prediction value is generated by, but not limited to, an image region other than the coding block.
- FIG. 7 is a schematic flowchart of a decoding method according to a preferred embodiment of the present invention. As shown in FIG. 3, the decoding method of the present invention includes but is not limited to the following steps:
- the predicted value is generated by, but not limited to, a reconstructed pixel sample from an image region other than the decoded block and a pixel value of the representative pixel set, optionally Combined with other decoding parameters, the calculation is obtained according to a predetermined rule.
- the position of the reconstructed pixel sample in the image region is typically specified by a displacement vector (also referred to as a motion vector or copy position, etc.) between the reconstructed pixel sample and the currently decoded pixel sample.
- a variant of the current coding block or current decoding block is a pixel group or pixel string or pixel micro-block and pixel strip to be encoded or to be decoded.
- the image area outside the coding block or the decoding block is an image area composed of pixels that have completed at least partial encoding or pixels that have completed at least partial decoding.
- the manner of generating the predicted value includes at least the following steps: selecting a reconstructed pixel sample or a variant thereof other than the codec block as a predicted value according to a predetermined rule, or selecting a pixel value of the representative pixel set or Its variant is used as a predictor.
- a codec block some or all of the current pixel samples adopt reconstructed pixel samples or variants thereof outside the codec block as prediction values, and some or all of the current pixel samples adopt the representative pixel set
- the pixel value or its variant is used as the predicted value.
- Such variants include, but are not limited to, filtering and transforming.
- Embodiment 2 of the manner in which the predicted value is generated in the preferred embodiment of the present invention
- the manner in which the predicted value is generated includes at least determining the predicted value based on at least part or all or a combination of the following parameters:
- a position of the reconstructed pixel sample outside the codec block in the current image is usually specified by a displacement vector (also called a motion vector or a copy position, etc.);
- a value of the reconstructed pixel sample outside the codec block is usually specified by a displacement vector (also referred to as a motion vector or a copy position, etc.);
- pixel component selecting according to the kind of the pixel component, one pixel component (such as Y component) or two pixel components (such as G component and R component) selecting a value representing the pixel sample as a predicted value, and other pixel components selecting the codec
- a displacement vector also called a motion vector or a copy position, etc.
- the manner in which the predicted value is generated includes at least the following steps:
- the representative pixel sample is called the optimal representative pixel sample
- the value of the optimal representative pixel sample is selected as a predicted value, otherwise, the reconstructed pixel is selected The value of the sample is used as the predicted value.
- the distance in Embodiment 3 is an error between pixels, which is less than a predetermined threshold. That is, for a current pixel sample of the codec block, the manner in which the predicted value is generated includes at least the following steps:
- the position of the reconstructed pixel sample outside the codec block in Embodiment 2 in the current image is divided into a number of rows and/or columns directly adjacent to the codec block (typically 1-5 lines and / or 1-5 columns) and the rest of the location:
- the value of the reconstructed pixel samples is selected as the predicted value.
- the flag bit described in Embodiment 2 appears directly or indirectly (implicitly derived form) in the following places in the code stream to determine that the current codec pixel value in the subsequent codec process is a value that selects a representative pixel value.
- the predicted value the value of the reconstructed pixel sample other than the codec block is selected as the predicted value:
- a video parameter set typically a grammatical element of a direct or implicit derivation of the video parameter set VPS;
- sequence parameter set usually a grammatical element of a direct or implicit derivation of the sequence parameter set SPS;
- a strip head usually a grammatical element of a direct or implicit derivation of a stripe header
- CTU header usually a grammatical element of a direct or implicit derivation of the CTU header
- CU header usually a grammatic element of a direct or implicit derivation of the CU header
- codec block header usually a syntax element of a direct or implicit derivation of the codec block header
- codec block before the codec block one or a group or a string of current codec pixel samples; usually one of the codec blocks or a set or a string of current codec pixel samples before the syntax element directly exists or hidden Contains derived syntax elements.
- the predicted value is determined according to the number of pixels of the representative pixel set of the codec block described in Embodiment 2:
- the encoder chooses to use one of the optimal representative pixel samples and the reconstructed pixel samples as the predicted value.
- the encoder sets a flag bit for identifying the selected prediction value, and writing the flag bit to the code stream using at least one of the following methods: (Method 1) is used to identify the selected sequence for the entire sequence. The flag of the predicted value is written into one or more parameter sets of the video parameter set VPS, the sequence parameter set SPS, and the image parameter set PPS; (method 2) the flag bits for identifying the predicted value used for the image are written. At least one of image parameter set PPS, slice (also called stripe) header information; (method 3) writing a flag bit for identifying a prediction value selected for the block layer to a maximum coding unit (or coding tree unit CTU) At least one of a layer, a coding unit CU layer.
- Method 1 is used to identify the selected sequence for the entire sequence.
- the flag of the predicted value is written into one or more parameter sets of the video parameter set VPS, the sequence parameter set SPS, and the image parameter set PPS
- method 2 the flag bits for identifying the predicted value used for the image are
- the decoder obtains the value of the flag bit by parsing the code stream, and determines whether the optimal representative pixel sample value or the reconstructed pixel sample value is used as the predicted value in the decoding process according to the value of the flag bit.
- the encoder does not write a flag bit for identifying the selected prediction value to the code stream, and the encoder defaults to using the prediction value candidate that satisfies the predetermined condition as the prediction value used in the encoding process.
- the decoder selects the predicted value candidate satisfying the condition as the predicted value used in the decoding process using the same predetermined condition as the encoder.
- the encoder does not write the flag bit for identifying the selected prediction value into the code stream, and the encoder uses the prediction value candidate that satisfies the predetermined condition as the prediction value used in the encoding process, and at the same time, the encoder will The parameter of the predetermined condition is written into the code stream, and the parameter of the predetermined condition is written into the code stream using at least one of the following methods: (Method 1) writing the parameter for the predetermined condition of the entire sequence to the video parameter set One or more parameter sets in the VPS, the sequence parameter set SPS, and the image parameter set PPS; (method 2) writing parameters for identifying the predetermined condition of the image to at least one of the image parameter set PPS and the slice header information (Method 3) The parameter for identifying the predetermined condition of the block layer is written into at least one of a maximum coding unit (or coding tree unit CTU) layer and a coding unit CU layer.
- the decoder obtains the parameters of the predetermined condition by pars
- the method according to the above embodiment can be implemented by means of software plus a necessary general hardware platform, and of course, by hardware, but in many cases, the former is A better implementation.
- the technical solution of the present invention which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk,
- the optical disc includes a number of instructions for causing a terminal device (which may be a cell phone, a computer, a server, or a network device, etc.) to perform the methods described in various embodiments of the present invention.
- Embodiments of the present invention also provide a storage medium.
- the foregoing storage medium may be configured to store program code for performing the following steps:
- the foregoing storage medium may include, but not limited to, a USB flash drive, a Read-Only Memory (ROM), a Random Access Memory (RAM), a mobile hard disk, and a magnetic memory.
- ROM Read-Only Memory
- RAM Random Access Memory
- a mobile hard disk e.g., a hard disk
- magnetic memory e.g., a hard disk
- the processor performs the methods and steps of the foregoing embodiments according to the stored program code in the storage medium.
- modules or steps of the present invention described above can be implemented by a general-purpose computing device that can be centralized on a single computing device or distributed across a network of multiple computing devices. Alternatively, they may be implemented by program code executable by the computing device such that they may be stored in the storage device by the computing device and, in some cases, may be different from the order herein.
- the steps shown or described are performed, or they are separately fabricated into individual integrated circuit modules, or a plurality of modules or steps thereof are fabricated as a single integrated circuit module.
- the invention is not limited to any specific combination of hardware and software.
- a representative pixel set of a coded block is constructed, and information required for constructing the representative pixel set is determined, and a predicted value of the pixel sample value in the coded block is constructed, where the predicted value is generated by: Constructing a predicted value of the pixel sample according to the reconstructed pixel sample of the image region outside the coded block and the pixel value of the representative pixel set, the information required to construct the representative pixel set and the need to construct the predicted value
- the information is encoded and written into the video code stream, which solves the problem that the position of the reconstructed pixels obtaining the predicted value is far from the current block, the correlation between the two is small, and the compression efficiency is low, and the problem is improved.
- the correlation between the predicted value of the image region from outside the current block and the pixel sample value effectively improves the compression efficiency.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
La présente invention concerne un procédé et un appareil de codage/décodage d'image. Le procédé de codage comprend les étapes consistant à : construire un ensemble de pixels représentatif d'un bloc de codage et déterminer les informations nécessaires pour construire l'ensemble de pixels représentatif ; construire une valeur prédite d'une valeur d'échantillon de pixels dans le bloc de codage, un mode de génération de la valeur prédite consistant à construire la valeur prédite de la valeur d'échantillon de pixels en fonction d'une valeur d'échantillon de pixels de reconstruction d'une zone d'image hors du bloc de codage et d'une valeur de pixel de l'ensemble de pixels représentatif ; coder les informations nécessaires pour construire l'ensemble de pixels représentatif et les informations nécessaires pour construire la valeur prédite et écrire un flux de code vidéo. La présente solution technique règle les problèmes d'une longue distance entre les positions des pixels de reconstruction, d'obtention de la valeur prédite et d'un bloc actuel et d'une médiocre efficacité de corrélation et de compression. De plus, la corrélation entre une valeur prédite d'une zone d'image hors du bloc actuel et une valeur d'échantillon de pixels actuelle est améliorée et l'efficacité de compression est efficacement accrue.
Priority Applications (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US15/551,945 US10666971B2 (en) | 2015-01-26 | 2016-01-25 | Method and device for image coding and method and device for image decoding |
| EP16742728.5A EP3253055B1 (fr) | 2015-01-26 | 2016-01-25 | Procédé et appareil de codage/décodage d'image |
| US16/853,895 US11082718B2 (en) | 2015-01-26 | 2020-04-21 | Method and device for coding image and method and device for decoding image |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201510037769.5 | 2015-01-26 | ||
| CN201510037769 | 2015-01-26 | ||
| CN201610045722.8 | 2016-01-22 | ||
| CN201610045722.8A CN105828080B (zh) | 2015-01-26 | 2016-01-22 | 图像编解码方法及装置 |
Related Child Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US15/551,945 A-371-Of-International US10666971B2 (en) | 2015-01-26 | 2016-01-25 | Method and device for image coding and method and device for image decoding |
| US16/853,895 Continuation US11082718B2 (en) | 2015-01-26 | 2020-04-21 | Method and device for coding image and method and device for decoding image |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2016119666A1 true WO2016119666A1 (fr) | 2016-08-04 |
Family
ID=56542419
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2016/072039 Ceased WO2016119666A1 (fr) | 2015-01-26 | 2016-01-25 | Procédé et appareil de codage/décodage d'image |
Country Status (1)
| Country | Link |
|---|---|
| WO (1) | WO2016119666A1 (fr) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN114979628A (zh) * | 2021-02-24 | 2022-08-30 | 腾讯科技(深圳)有限公司 | 图像块预测样本的确定方法及编解码设备 |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0790742A2 (fr) * | 1996-02-19 | 1997-08-20 | Fuji Xerox Co., Ltd. | Système de codage et décodage prédictif d'image |
| CN102984523A (zh) * | 2012-12-14 | 2013-03-20 | 北京大学 | 一种多方向的帧内预测编解码方法及装置 |
| CN104244007A (zh) * | 2013-06-13 | 2014-12-24 | 上海天荷电子信息有限公司 | 基于任意形状匹配的图像压缩方法和装置 |
-
2016
- 2016-01-25 WO PCT/CN2016/072039 patent/WO2016119666A1/fr not_active Ceased
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| EP0790742A2 (fr) * | 1996-02-19 | 1997-08-20 | Fuji Xerox Co., Ltd. | Système de codage et décodage prédictif d'image |
| CN102984523A (zh) * | 2012-12-14 | 2013-03-20 | 北京大学 | 一种多方向的帧内预测编解码方法及装置 |
| CN104244007A (zh) * | 2013-06-13 | 2014-12-24 | 上海天荷电子信息有限公司 | 基于任意形状匹配的图像压缩方法和装置 |
Non-Patent Citations (1)
| Title |
|---|
| See also references of EP3253055A4 * |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN114979628A (zh) * | 2021-02-24 | 2022-08-30 | 腾讯科技(深圳)有限公司 | 图像块预测样本的确定方法及编解码设备 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20230362374A1 (en) | Encoding and decoding methods and apparatuses with multiple image block division manners | |
| CN105704491B (zh) | 图像编码方法、解码方法、编码装置和解码装置 | |
| JP6523324B2 (ja) | 画像符号化/復号方法及び装置 | |
| CN105491376B (zh) | 图像编码、解码方法及装置 | |
| CN104754362B (zh) | 使用精细划分块匹配的图像压缩方法 | |
| US20220417535A1 (en) | Image encoding and decoding methods, image processing device, and computer storage medium | |
| US10575022B2 (en) | Image encoding and decoding method, image processing device and computer storage medium | |
| KR102532391B1 (ko) | 영상 부호화 방법과 장치 및 영상 복호화 방법과 장치 | |
| CN106254878B (zh) | 一种图像编码及解码方法、图像处理设备 | |
| CN105992003B (zh) | 依据排序或频度对调色板颜色编号的图像压缩方法和装置 | |
| WO2016202189A1 (fr) | Procédés de codage et de décodage d'image, dispositif de traitement d'image, et support de stockage informatique | |
| CN106303535B (zh) | 参考像素取自不同程度重构像素的图像压缩方法和装置 | |
| US11082718B2 (en) | Method and device for coding image and method and device for decoding image | |
| CN106303534B (zh) | 多种索引串与像素串融合复制方式的图像压缩方法和装置 | |
| WO2016119666A1 (fr) | Procédé et appareil de codage/décodage d'image | |
| WO2016197893A1 (fr) | Procédé de codage et de décodage d'image, dispositif de traitement d'image et support de stockage informatique | |
| WO2016124154A1 (fr) | Procédé et appareil de codage d'image ainsi que procédé et appareil de décodage d'image | |
| CN105828079B (zh) | 图像处理方法及装置 | |
| WO2016119667A1 (fr) | Procédé et appareil de traitement d'images |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 16742728 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| REEP | Request for entry into the european phase |
Ref document number: 2016742728 Country of ref document: EP |