EP3085087A1 - Procédé d'estimation d'un débit de codage d'une image d'une séquence d'images, procédé de codage, dispositif et programme d'ordinateur correspondants - Google Patents
Procédé d'estimation d'un débit de codage d'une image d'une séquence d'images, procédé de codage, dispositif et programme d'ordinateur correspondantsInfo
- Publication number
- EP3085087A1 EP3085087A1 EP14790557.4A EP14790557A EP3085087A1 EP 3085087 A1 EP3085087 A1 EP 3085087A1 EP 14790557 A EP14790557 A EP 14790557A EP 3085087 A1 EP3085087 A1 EP 3085087A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- image
- coding
- residue
- rate
- denotes
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 238000000034 method Methods 0.000 title claims abstract description 60
- 238000004590 computer program Methods 0.000 title claims description 8
- 238000013459 approach Methods 0.000 claims abstract description 33
- 230000009466 transformation Effects 0.000 claims abstract description 23
- 238000011002 quantification Methods 0.000 claims abstract description 7
- 238000013139 quantization Methods 0.000 claims description 37
- 238000004364 calculation method Methods 0.000 claims description 16
- 230000007246 mechanism Effects 0.000 claims description 7
- 230000001419 dependent effect Effects 0.000 claims description 4
- 230000001105 regulatory effect Effects 0.000 claims description 2
- 230000001131 transforming effect Effects 0.000 claims description 2
- 230000033228 biological regulation Effects 0.000 description 14
- 230000033001 locomotion Effects 0.000 description 10
- 238000007906 compression Methods 0.000 description 6
- 230000006835 compression Effects 0.000 description 6
- 238000009472 formulation Methods 0.000 description 6
- 239000000203 mixture Substances 0.000 description 6
- 238000004088 simulation Methods 0.000 description 4
- 230000002123 temporal effect Effects 0.000 description 4
- 239000013598 vector Substances 0.000 description 4
- 238000005259 measurement Methods 0.000 description 3
- 230000000717 retained effect Effects 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 230000006978 adaptation Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 238000011217 control strategy Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- VBRBNWWNRIMAII-WYMLVPIESA-N 3-[(e)-5-(4-ethylphenoxy)-3-methylpent-3-enyl]-2,2-dimethyloxirane Chemical compound C1=CC(CC)=CC=C1OC\C=C(/C)CCC1C(C)(C)O1 VBRBNWWNRIMAII-WYMLVPIESA-N 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000002457 bidirectional effect Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000000354 decomposition reaction Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000012417 linear regression Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000011664 signaling Effects 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 238000011282 treatment Methods 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/18—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a set of transform coefficients
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/115—Selection of the code volume for a coding unit prior to coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/124—Quantisation
- H04N19/126—Details of normalisation or weighting functions, e.g. normalisation matrices or variable uniform quantisers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
- H04N19/147—Data rate or code amount at the encoder output according to rate distortion criteria
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/172—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/189—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
- H04N19/196—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding being specially adapted for the computation of encoding parameters, e.g. by averaging previously computed encoding parameters
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/132—Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
- H04N19/14—Coding unit complexity, e.g. amount of activity or edge presence estimation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
- H04N19/149—Data rate or code amount at the encoder output by estimating the code amount by means of a model, e.g. mathematical model or statistical model
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/189—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding
- H04N19/19—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the adaptation method, adaptation tool or adaptation type used for the adaptive coding using optimisation based on Lagrange multipliers
Definitions
- the field of the invention is that of the coding of image sequences and the compression of video sequences, in particular, but not exclusively, in the context of H EVC / H.265 standards (for "High Efficiency Video Coding") and SHVC (for Scalable-High Efficiency Video Coding) established by standardization bodies such as the ITU-T Video Coding Expert Group (VCEG) and the ISO / IEC Motion Picture Expert Group (MPEG).
- H EVC / H.265 for "High Efficiency Video Coding”
- SHVC Scalable-High Efficiency Video Coding
- the invention relates to the estimation and the regulation of the data rate at the output of a video encoder, implemented in the context of such a compression.
- the compression of a video sequence consists in reducing the amount of data necessary to represent it, while seeking to minimize the phenomena of loss of information, and therefore of quality.
- it is necessary to adapt the data rate at the output of the encoding system to take account, for example, of the constraints of the network on which they must be transmitted, or the size of the data carrier on which they must be stored.
- This rate adaptation must also take into account the quality of reproduction of the video sequence that one wishes to achieve for the decoded video stream.
- flow control strategies aimed at modulating the amount of information at the output of the encoder to obtain an optimal bit rate and quality, are conventionally implemented. They allow to control certain parameters of the encoding to obtain a flow which respects a set of imposed constraints.
- One of the main challenges of such a flow control is therefore to find a model connecting the parameters of the encoding to the data rate at the output of the encoder.
- the video coding as proposed for example in the standard MPEG-4 AVC (for "Advanced Video Coding") or in the standard HEVC, implements the following steps:
- the images are first divided into blocks of variable size, depending on the complexity of the movement to be represented;
- Block Residue Block 0riginal - Block Predicts ;
- the residual signal of the prediction then undergoes a set of treatments aimed at removing as many redundancies as possible, namely a transformation, to make it pass in the frequency domain, followed by a quantization;
- the final stream, or bitstream, is then formed respecting the normative structure, an entropy encoding for compressing the information in the bitstream.
- QP quantization parameter
- R (p) 6 (1 - p).
- each image is encoded twice: the first pass is used to initialize the R-p rate model, by establishing a slope parameter ⁇ ; the optimal value of the quantization parameter QP is then chosen thanks to the flow model; a second encoding pass permanently encodes the image using the QP quantization parameter chosen to achieve the target rate.
- a disadvantage of this first two-pass encoding approach is that the pre-encoding phase necessary to initialize the bitrate model induces a sharp increase in encoding time: it therefore strongly penalizes the speed of the mechanism, and introduces a complexity in terms of calculations.
- the encoding is performed in a single pass, and the flow control on the current image is based on the information obtained after the encoding of the images. preceding. More specifically, for the rate regulation of a current image, the slope ⁇ of the rate model Rp is initialized to a value calculated on a previous image chosen as a reference for the rate model, namely the image which has the behavior closest to that of the image being controlled. This value of ⁇ is then corrected, as the encoding progresses, thanks to the feedback of the values of ⁇ measured on the previously coded images.
- this second approach has the advantage of inducing a reduced encoding time compared to the first approach above, thanks to the removal of the first encoding pass, and a very low cost in terms of complexity calculations, it induces a decrease in the accuracy of flow control. Indeed, the changes of content between successive images, even minimal, introduce a bias in the calculation of the optimal quantization parameter. In addition, if there is a significant change in activity between successive images (large variations in movement and detail), the results obtained according to this second approach are less efficient than in the first approach.
- p denotes a percentage of zero coefficients in at least one coding residue of the image, after transformation and quantization of the residue, the residue being obtained by difference between a block of the image and a prediction of the block
- ⁇ denotes a parameter of slope of said flow model.
- such an estimation method comprises a step of calculating the slope parameter ⁇ taking into account a multiple of a surface of the coded coding residue (s) for the image, the surface being the sum of the resolutions of each of the components of the image.
- the invention is based on a completely new and inventive approach to the estimation of the coding complexity of an image to be encoded, which may notably but not exclusively be applied to the regulation of output flow of a video encoder, for example according to H EVC or SHVC standard.
- such a link between the slope parameter ⁇ and the surface of a coding residue, or more generally, the image to be encoded may be a proportionality link.
- the slope parameter ⁇ is calculated as a multiple of this area.
- surface area of the image here and in the entirety of this document is meant the sum of the resolutions of each of the components of the image in the chosen decomposition space. So, in the YUV space for example, the surface of the image corresponds to the sum of the resolutions of each of the chrominance and luminance components of the image.
- Such a rate estimation method implemented in an H EVC / H.265 or SHVC type coder, advantageously makes it possible to achieve optimal compression as soon as the first coding pass, and to substantially reduce the algorithmic complexity of such an encoder.
- the inventors of the present patent application have established a formulation of the Rp model at a coding residue of the image, in which the slope parameter ⁇ depends on the size of the residue.
- the flow estimation method of the invention thus offers a precise and deterministic formulation of the flow rate model Rp, at the level of a residue, and thus allows a simplification of the algorithmic complexity with respect to the flow control methods of the invention. prior art.
- the adjustment parameter ⁇ depends on the statistical distribution of the coefficients and their location in the residue.
- the skip skip rate designates here, and in all of this document, the total area covered by the coding units CU (for "Coding Units") whose flag cu_skip_flag is worth ⁇ ', divided by the total area of the image.
- the inventors have demonstrated that, during the coding of an image, all the coding residues are not coded: when the prediction is sufficiently good, the residue can be zero.
- the associated macroblock may not be compressed: it is said that it is "skipped".
- the invention therefore advantageously proposes to refine the R-p flow model, by expressing the slope parameter ⁇ , not from the total surface of the image, but from the unskipped surface of the image. Such a formulation makes it possible to obtain an increased accuracy of the flow model.
- the adjustment parameter ⁇ depends on the statistical distribution of the coefficients and their location in the residue.
- the invention also relates to a coding method of an image sequence comprising a step of cutting each of the images into at least one image block, and an encoding phase comprising steps of:
- such a coding method comprises a step of calculating the slope parameter ⁇ taking into account a surface of the coded coding residue (s) for the image, the surface being the sum of the resolutions of each component of the image.
- such a slope parameter ⁇ is in particular calculated as a multiple of a surface of the coded encoding residue (s) for the image.
- such a coding method implements a single iteration of the encoding phase.
- the video coding method of the invention makes it possible, with respect to prior video coding methods based on a rate regulation according to a so-called p-domain approach, to realize only one encoding pass, the slope parameter ⁇ can be calculated directly, without having to evaluate it during a first initialization pass of the encoding. Deleting a password encoding, the encoding method of the invention can significantly improve latency latency encoders of the prior art.
- the method of the invention allows a more accurate estimation of the slope parameter ⁇ .
- the invention also relates to a coding device for an image sequence comprising a module for cutting each of the images into at least one image block, and an encoding module comprising:
- a prediction module of each of the blocks delivering a predicted block
- the regulation means comprise a module for calculating the slope parameter ⁇ taking into account a surface of the coded coding residue (s) for the image, the surface being the sum of the resolutions each of the components of the image, and each of the blocks is encoded by a single passage in the encoding module.
- the invention relates to a coding device having in combination all or some of the features set forth throughout this document.
- the invention also relates to computer programs comprising instructions for implementing a control method or coding method as described above when the program is executed by a processor.
- the invention relates to computer readable recording media on which is recorded a computer program comprising instructions for executing the steps of the control method or coding method as described above.
- FIG. 1 illustrates the architecture of a video coder of the prior art making it possible to deliver a video stream conforming to the H EVC standard
- FIGS. 2A and 2B show a comparison between the R-p model of the invention and simulation results for various residues derived from reference bitstreams;
- FIG. 3 illustrates a comparison between the R-p model according to a first embodiment of the invention and simulation results for different class A to E images
- FIG. 4 illustrates a comparison between the R-p model according to a second embodiment of the invention and simulation results for different class A to E images
- FIG. 5 illustrates a first variant embodiment of the encoder of FIG. 1, in which a regulation of flow rate at the residue level is implemented in accordance with one embodiment of the invention
- FIG. 6 illustrates a frame structure in Random Access configuration, with encoding order, in accordance with the H EVC standard
- FIG. 7 illustrates a second variant embodiment of the encoder of FIG. 1, in which frame level flow regulation is implemented in accordance with one embodiment of the invention.
- the general principle of the invention is based on a precise and deterministic calculation of the slope parameter ⁇ in a rate model Rp of a so-called p-domain approach. implemented as part of a mechanism for estimating the complexity of coding an image.
- Such a principle applies in particular advantageously, but not exclusively, to the regulation of the data rate at the output of a video encoder.
- the inventors of the present patent application have established that, in the context of a video coding of type H EVC or SHVC for example, this slope parameter ⁇ could simply be calculated from a surface of the coded residues of the image, and in particular, in the embodiment described below, as the quadruple of this surface.
- the invention makes it possible to estimate the coding complexity of an image to be encoded or of a coding residue, in the context of a predictive video coding.
- the HEVC standard has been designed to meet all existing H. 264 / MPEG-4 AVC standard applications, allowing for increased video resolution and increased use of parallel processing architectures.
- FIG. 1 illustrates the architecture of a video encoder making it possible to produce a video stream having a structure and syntax in accordance with the H EVC standard, and capable of implementing the rate control method of the invention.
- a video coder is known from the prior art, but its operating principle is recalled to better illustrate the principle of the invention.
- Each CTU is itself subdivided into coding units called CUs (for "Coding Units”, composed of CBs for “Coding Blocks”, namely one per color component), themselves divided into prediction units called PUs (for " Prediction Units ", composed of PB for” Prediction Blocks ", one per color component) and in transformation units called TUs (for" Transform Units ", composed of TB for” Transform Blocks ", one per colorimetric component).
- CUs for "Coding Units", composed of CBs for "Coding Blocks", namely one per color component
- PUs for " Prediction Units ", composed of PB for” Prediction Blocks ", one per color component
- TUs for Transform Units ", composed of TB for" Transform Blocks ", one per colorimetric component
- the first image of a video sequence (and each image used as a random access point to the video sequence) is encoded using the intra prediction mode (ie a spatial prediction from other regions of the same image, but without using other images of the sequence).
- the intra prediction mode ie a spatial prediction from other regions of the same image, but without using other images of the sequence.
- the block referenced 101 estimates which intra prediction mode is optimal, while the block referenced 102 realizes the prediction from this optimal mode.
- an inter-image temporal prediction is used for all the other images of the sequence, or between the different random access points to the sequence, for most blocks.
- the block referenced 103 estimates the optimal motion vector or vectors (from the buffer 110 containing the previously encoded images); the block referenced 104 then applies these movements by making motion compensation for the prediction.
- the important prediction information is stored and transmitted (via data signals referenced 105 and 106) to be encoded and placed in the bitstream.
- the encoder and decoder generate identical inter prediction signals, applying motion compensation from the motion vectors, and information about the prediction modes used, which are transmitted in the bitstream.
- the residual of the inter or intra prediction (applied to each PB of a PU block) is formed by the difference between the original block and its prediction. The more accurate the prediction, the less energy the residue contains. It is then transformed by a spatial linear transform (applied to each TB of a TU block). The coefficients of the transform are then scaled, quantized (block referenced 107). The thus transformed and quantized residue 108 is retained to be encoded and transmitted in the bitstream. In order for the decoder to reproduce the same reference images used for the temporal prediction, the encoder integrates a quantization and inverse transformation block 109. Block partitioning can create visible artifacts and level differences during the transformation phases. quantization, which may negatively impact the accuracy of the prediction.
- a first pre-analysis step 111 determines the optimal filters to be applied, then these filters are applied in the block referenced 112.
- the referenced information 113 relating to the choice of filters are retained to be encoded and transmitted in the bitstream.
- a control block 114 manages the CTUs and the prediction modes to be applied. The information relating to this management is also stored in the stream referenced 115 for encoding and insertion in the final bitstream.
- the bitstream 116 is formed respecting the normative structure of the H EVC stream.
- An entropic coder CABAC Arimetic Binary Encoding with Context Adaptation
- the quantization of the coefficients resulting from the transform, implemented in the block referenced 107 in FIG. 1, constitutes a very important step in the encoding process with respect to the problematic of the invention, since it makes it possible to vary the the rate needed to encode the signal by adjusting the amount of data lost.
- quantization parameter QP Quantization Parameter
- the present invention proposes a regulation of flow rate based on a so-called p-domain approach, which is interesting from the point of view of both the precision of its representation and its low complexity in algorithmic terms.
- constraints to be applied to the flow are determined according to the context envisaged
- the available resources are then calculated and distributed among the different levels of the flow;
- the value of the quantization parameter is chosen in accordance with the resources to be respected;
- the data is encoded using the chosen QP and the available resources are updated.
- the p-domain rate regulation approach is based on an expression of the relationship between the value of the quantization parameter and the output rate of the encoder.
- the adjustment parameter ⁇ can be neglected at first approach.
- a frame is conventionally composed of several components, for example the components YUV. If we consider more generally the case of a component C of the image (C can, in a particular case, be one of the components Y, U or V), the relation (4) can then be summed on this component C of the image, denoting by RZ the set of residues in this component C:
- FIGS. 2A and 2B show the curves of the rate R as a function of the parameter p, after entropic coding of the CABAC type, respectively for class A (FIG. 2A) and class B (FIG. 2B) images.
- class A contains two sequences of 2500x1600 resolution, the two sequences containing many movements.
- Class B contains 6 sequences of 1920x1080 resolution with varied content (tracking, fixed shots, with or without plane changes).
- JCTVC-O0022 prepared by the Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG 16 WP 3 and ISO / IEC JTC 1 / SC 29 / WG 11 "at the 15th meeting held in Geneva, Switzerland, from 23 October to 1 November 2013.
- the lines referenced 20, 21, 22 and 23 represent the theoretical model of the rate R as a function of the parameter p obtained according to the calculation method of the invention, considering the slope parameter ⁇ equal to 4 * N 2 , for coding residuals of size 4x4 (right referenced 20), 8X8 (right referenced 21), 16x16 (right referenced 22), 32x32 (right referenced 23), and neglecting the adjustment coefficient ⁇ .
- the scatter plots represent the flow values measured on residues from reference bitstreams, respectively of size 4x4, 8x8, 16x16 and 32x32. These reference bitstreams correspond to the sequences from the classes mentioned previously encoded using the reference software, in Random-Access configuration and for four fixed QP values (22, 27, 32 and 37).
- the HM reference software consists of an encoder and a decoder used for compliance and interoperability testing.
- the reference bitstreams are streams, encoded with the reference software, and made available by the BBC (registered trademark) to be able for example to test the decoders. Bitstreams are available and cover for each sequence the eight configurations (All-Intra, All-Intra 10, Low-Delay P, Low-Delay P 10, Low-Delay B, Low-Delay B 10, Random-Access and Random Access 10), for four different QP's (22, 27, 32 and 37).
- JCT-VC Joint Collaborative Team on Video Coding
- FIG. 3 illustrates the evolution of the rate R as a function of the parameter p, no longer at the level of a residual, but at the level of one frame, for different class A to E images. Indeed, to achieve a saving in In terms of calculations, it is sometimes preferable to perform rate control at the image level and to encode each macroblock with the same value of the quantization parameter QP.
- the test sequences are divided into 5 classes of resolutions and varied contents (A : 2560x1600, B: 1920x1080, C: 832x480, D: 416x240 and E: 1280x720).
- the clouds of points concentrated around the lines referenced 30 to 34 represent, for their part, the values measured on frames from the aforementioned reference bitstreams for these different classes A to E.
- the flow rate model Rp illustrated by the curves referenced 30 to 34 is therefore fairly close to reality, but does not prove to be as accurate as the flow rate model at the residue level illustrated in FIGS. 2A and 2B. Indeed, when encoding a frame, a non-negligible number of residues are not coded: the prediction being perfect, the residues are indeed null, which causes the "skip” (ie the non-compression ) of some elementary coding units in the image, or CL ) for "Coding Unit".
- an alternative embodiment of the invention relies on an approach making it possible to estimate it more precisely, and thus to move the lines referenced 30 to 34 of FIG. 3 to the associated point clouds, depending on the properties of the frame.
- the total area of the frame is not considered to be the coded surface, but the non-skippered surface in the image is taken into account.
- the rate of skip skip r, the parameter p and the flow rate outputted from the encoder were measured when decoding each frame, for all sequences. For each skip rate interval of 1%, the slope of the line is estimated by linear regression. These estimated values are then compared to the slope values determined using equation (15).
- Table 4 below makes it possible to observe the average accuracy obtained in the slope estimation using the method of the invention. This precision is increased on the range corresponding to the most frequent cases in practice (namely + 50% of skip).
- the flow control method of the invention thus makes it possible, compared with the known methods of the prior art implementing flow control in two encoding passes, to eliminate a first encoding pass. It also makes it possible to estimate the slope of the p-domain more precisely in the context of one-pass regulation.
- FIG. 5 illustrates a modified architecture of the video coder of FIG. 1 for implementing flow control at the CTU level, in accordance with the principle of the invention described above.
- the architecture of the encoder of FIG. 5 differs from that of FIG. 1 in that the transform-quantization block 107 is split into a first block referenced 107a operating the transformation of the residues, and a second block referenced 107b operating their quantization.
- the block referenced 118 receives the budget allocated for the coding, as well as the size of the residue, from the control block referenced 114. From the equation (4) proposed above, it calculates the value of the parameter p, it transmits to the block referenced 119.
- the block referenced 119 then calculates, from the value of p received from the block referenced 118, the appropriate quantization parameter QP.
- the block referenced 107b then proceeds to the quantization of the transformed residue by the block referenced 107a from the quantization parameter QP determined by the block referenced 119.
- Random-Access configuration is the case of use for broadcasting: the images are placed in the decoding order in the bitstream (which is different from the display order).
- Ramdom-Access Point pictures appear regularly in the bitstream so that you can resume decoding the current program.
- the numerical value associated with each of these frames indicates the order of encoding (and thus decoding) of these different images.
- These images are grouped into groups of images (or GOP, for Groups of Pictures): thus, the images numbered from 1 to 8 constitute a first GOP; the images referenced from 9 to 16 constitute a second GOP; etc.
- an Intra image uses only spatial prediction, and in particular allows to initialize the encoding of a sequence (RAP), because it is coded independently of the other images;
- an Inter image uses intra- and inter-image prediction and therefore exploits spatial redundancies in the image, and temporal in the image sequences.
- the images are encoded in successive time levels according to a pyramidal structure based on Inter images.
- the frame 1 is a unidirectional Inter picture
- the frames numbered 2 to 8 are bidirectional Inter pictures.
- the initialization for the first GOP may be as described in Table 5 below, with the POC (for "Picture Order Count") which numbers the position of the frame in the GOP:
- the skip rate ski p of the numbered frame 15 can be predicted with respect to the frames previously encoded according to the equation:
- the skip rate can be predicted according to the equation:
- the skip rate sk i P of the enhancement layer can be determined from the skip rate Sk i P measured on the layer. basic.
- the skip rate sk i P can be determined based on the correlation between an image and its nearest neighbors, which can give an indication of the proportion of the image that is motionless, and who could be skippered.
- the measurement of r skip skip rate during encoding can be carried out simply, using a counter: for a given frame, when a coding unit CU is skippered (ie uncompressed because the residue is zero) this counter is incremented with the surface of this skippered CU coding unit, taking into account the 3 characteristic YUV components of the color space used.
- the value S skip stored in the counter corresponding to the total area skipped for this frame, is divided by the total area of the image S Frame in order to obtain the r Sk i P -
- the counter is then reset for the next frame.
- the formula for calculating the r skip is then:
- FIG. 7 illustrates a variant of the architecture of the encoder of FIG. 1, making it possible to implement a flow rate regulation at the frame level, according to the principle of the invention.
- the architecture of the video encoder of FIG. 7 differs from that of the encoder of FIG. 1 by the elements described hereinafter.
- the block referenced 119 receives the value of the counter coming from the block referenced 118 and divides it by the total surface of the image (ie the sum of the surfaces of the components) in order to determine the value of the r skip parameter.
- the value of the ratio r skip is transmitted to a database 120 of r skips per frame (according to the hierarchical level of the frame in the GOP).
- the block referenced 121 determines the r skip for the next frame, for example by moving average (as explained above in relation to FIG. 6), using the database 120. It transmits the value thus determined to the block referenced 122. , which also receives from block referenced 114 the budget allocated to the coding and the total surface of the image. The block referenced 122 can then deduce the value of the parameter p from the equation (15) at the frame level, and transmits it to the block referenced 123.
- the block referenced 123 determines the value of the quantization parameter QP, and transmits it to the controller 114, which, in turn, communicates it to the quantization block referenced 107.
- the rate control method of the invention makes it possible to substantially reduce the complexity of the video encoder compared to the methods of the prior art. In particular, it makes it possible to halve the latency of a double-pass encoder. It also makes it possible to substantially increase the accuracy, the quality or the bit rate, according to the fixed constraints, of a simple pass encoder.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computing Systems (AREA)
- Theoretical Computer Science (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
Claims
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| FR1362827A FR3015163B1 (fr) | 2013-12-17 | 2013-12-17 | Procede d'estimation d'un debit de codage d'une image d'une sequence d'images, procede de codage, dispositif et programme d'ordinateur correspondants. |
| PCT/EP2014/072604 WO2015090682A1 (fr) | 2013-12-17 | 2014-10-22 | Procédé d'estimation d'un débit de codage d'une image d'une séquence d'images, procédé de codage, dispositif et programme d'ordinateur correspondants |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| EP3085087A1 true EP3085087A1 (fr) | 2016-10-26 |
Family
ID=50489240
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP14790557.4A Withdrawn EP3085087A1 (fr) | 2013-12-17 | 2014-10-22 | Procédé d'estimation d'un débit de codage d'une image d'une séquence d'images, procédé de codage, dispositif et programme d'ordinateur correspondants |
Country Status (3)
| Country | Link |
|---|---|
| EP (1) | EP3085087A1 (fr) |
| FR (1) | FR3015163B1 (fr) |
| WO (1) | WO2015090682A1 (fr) |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10223810B2 (en) | 2016-05-28 | 2019-03-05 | Microsoft Technology Licensing, Llc | Region-adaptive hierarchical transform and entropy coding for point cloud compression, and corresponding decompression |
| US11297346B2 (en) | 2016-05-28 | 2022-04-05 | Microsoft Technology Licensing, Llc | Motion-compensated compression of dynamic voxelized point clouds |
| US10694210B2 (en) | 2016-05-28 | 2020-06-23 | Microsoft Technology Licensing, Llc | Scalable point cloud compression with transform, and corresponding decompression |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5440344A (en) * | 1992-04-28 | 1995-08-08 | Mitsubishi Denki Kabushiki Kaisha | Video encoder using adjacent pixel difference for quantizer control |
| US5532940A (en) * | 1993-11-24 | 1996-07-02 | Intel Corporation | Process, apparatus and system for selecting quantization levels for encoding video signals |
| US7606427B2 (en) * | 2004-07-08 | 2009-10-20 | Qualcomm Incorporated | Efficient rate control techniques for video encoding |
| US8126283B1 (en) * | 2005-10-13 | 2012-02-28 | Maxim Integrated Products, Inc. | Video encoding statistics extraction using non-exclusive content categories |
| EP2882189A1 (fr) * | 2009-03-13 | 2015-06-10 | Telefonaktiebolaget L M Ericsson (publ) | Procédés et configurations permettant de traiter un flux binaire codé |
-
2013
- 2013-12-17 FR FR1362827A patent/FR3015163B1/fr active Active
-
2014
- 2014-10-22 WO PCT/EP2014/072604 patent/WO2015090682A1/fr not_active Ceased
- 2014-10-22 EP EP14790557.4A patent/EP3085087A1/fr not_active Withdrawn
Non-Patent Citations (2)
| Title |
|---|
| None * |
| See also references of WO2015090682A1 * |
Also Published As
| Publication number | Publication date |
|---|---|
| FR3015163A1 (fr) | 2015-06-19 |
| FR3015163B1 (fr) | 2017-04-28 |
| WO2015090682A1 (fr) | 2015-06-25 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US9143776B2 (en) | No-reference video/image quality measurement with compressed domain features | |
| US11743475B2 (en) | Advanced video coding method, system, apparatus, and storage medium | |
| US20150312575A1 (en) | Advanced video coding method, system, apparatus, and storage medium | |
| FR2932637A1 (fr) | Procede et dispositif de codage d'une sequence d'images | |
| MX2013004362A (es) | Codificacion de video utilizando mapeo de rango dinamico temporalmente coherente. | |
| FR2939593A1 (fr) | Procede et dispositif de codage video | |
| FR2889004A1 (fr) | Procede et dispositif de traitement d'une sequence d'images numeriques a scalabilite spatiale ou en qualite | |
| FR2951345A1 (fr) | Procede et dispositif de traitement d'une sequence video | |
| FR2881898A1 (fr) | Procede et dispositif de codage d'une image video en mode inter ou intra | |
| EP3085087A1 (fr) | Procédé d'estimation d'un débit de codage d'une image d'une séquence d'images, procédé de codage, dispositif et programme d'ordinateur correspondants | |
| EP3972247B1 (fr) | Procédé de codage et de décodage d'images, dispositif de codage et de décodage d'images et programmes d'ordinateur correspondants | |
| FR3026261A1 (fr) | Procede de codage et de decodage d'images integrales, dispositif de codage et de decodage d'images integrales et programmes d'ordinateur correspondants | |
| FR2996093A1 (fr) | Procede de codage et decodage d'images, dispositifs de codage et decodage et programmes d'ordinateur correspondants | |
| FR3139260A1 (fr) | Lissage hors boucle de codage d’une frontière entre deux zones d’image | |
| FR2933837A1 (fr) | Procede de codage, procede et dispositif de transcodage et flux de donnees image codees. | |
| EP2633687B1 (fr) | Codage et décodage vidéo a partir d'un épitome | |
| FR2959093A1 (fr) | Procede et dispositif de prediction d'une information de complexite de texture contenue dans une image | |
| FR2990319A1 (fr) | Procede de codage et de decodage d'un ensemble de blocs d'image et dispositifs correspondants | |
| FR3041851A1 (fr) | Procede d'allocation de debit, dispositif, codeur et programme d'ordinateur associes | |
| WO2018065698A1 (fr) | Procédé de codage d'une image numérique, procédé de décodage, dispositifs, équipement terminal et programmes d'ordinateurs associés | |
| FR3033114A1 (fr) | Procede de codage et decodage d'images, dispositif de codage et decodage et programmes d'ordinateur correspondants | |
| Sharma et al. | An improvement analysis on video compression using file segmentation | |
| FR2956552A1 (fr) | Procede de codage ou de decodage d'une sequence video, dispositifs associes | |
| EP2805486A1 (fr) | Procede de quantification dynamique pour le codage de flux de donnees | |
| FR3022095A1 (fr) | Dispositif et procede de transmission de donnees multimedia |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
| 17P | Request for examination filed |
Effective date: 20160517 |
|
| AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR |
|
| AX | Request for extension of the european patent |
Extension state: BA ME |
|
| DAX | Request for extension of the european patent (deleted) | ||
| 17Q | First examination report despatched |
Effective date: 20170328 |
|
| GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
| INTG | Intention to grant announced |
Effective date: 20180322 |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
| 18D | Application deemed to be withdrawn |
Effective date: 20180802 |