[go: up one dir, main page]

WO2025139841A1 - Methods and apparatus of enabling resolution change in loop filter stage for video coding - Google Patents

Methods and apparatus of enabling resolution change in loop filter stage for video coding Download PDF

Info

Publication number
WO2025139841A1
WO2025139841A1 PCT/CN2024/139042 CN2024139042W WO2025139841A1 WO 2025139841 A1 WO2025139841 A1 WO 2025139841A1 CN 2024139042 W CN2024139042 W CN 2024139042W WO 2025139841 A1 WO2025139841 A1 WO 2025139841A1
Authority
WO
WIPO (PCT)
Prior art keywords
picture
resolution
filtered
filters
level
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
PCT/CN2024/139042
Other languages
French (fr)
Inventor
Guan-hao CHEN
Shih-Chun Chiu
Tzu-Der Chuang
Ching-Yeh Chen
Chih-Wei Hsu
Yi-Wen Chen
Yu-Wen Huang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
MediaTek Inc
Original Assignee
MediaTek Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by MediaTek Inc filed Critical MediaTek Inc
Priority to TW113151254A priority Critical patent/TW202527543A/en
Publication of WO2025139841A1 publication Critical patent/WO2025139841A1/en
Pending legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/80Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
    • H04N19/82Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/59Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/86Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness

Definitions

  • the present invention is a non-Provisional Application of and claims priority to U.S. Provisional Patent Application No. 63/615,816, filed on December 29, 2023.
  • the U.S. Provisional Patent Application is hereby incorporated by reference in its entirety.
  • the present invention relates to video coding system.
  • the present invention relates to changing picture resolution for in-loop filter in a video coding system.
  • VVC Versatile video coding
  • JVET Joint Video Experts Team
  • MPEG ISO/IEC Moving Picture Experts Group
  • ISO/IEC 23090-3 2021
  • Information technology -Coded representation of immersive media -Part 3 Versatile video coding, published Feb. 2021.
  • VVC is developed based on its predecessor HEVC (High Efficiency Video Coding) by adding more coding tools to improve coding efficiency and also to handle various types of video sources including 3-dimensional (3D) video signals.
  • HEVC High Efficiency Video Coding
  • Fig. 1A illustrates an exemplary adaptive Inter/Intra video encoding system incorporating loop processing.
  • the prediction data is derived based on previously coded video data in the current picture (i.e., Intra prediction) or previous reconstructed reference picture (s) .
  • the prediction data is subtracted from the input data using Adder 112 to form prediction errors, also called residues.
  • the prediction errors are then processed by Transform (T) 114 followed by Quantization (Q) 116.
  • the transformed and quantized residues are then coded by Entropy Encoding 120 to be included in a video bitstream corresponding to the compressed video data.
  • the bitstream associated with the residues is then packed with side information such as motion and coding modes associated with Inter/Intra prediction, and other information such as parameters associated with loop filters applied to underlying video area.
  • the encoder also needs reconstructed data to derive the inter/intra prediction. Accordingly, the transformed and quantized residues are processed by Inverse Quantization (IQ) 126 and Inverse Transformation (IT) 124 to reconstruct the residues.
  • IQ Inverse Quantization
  • IT Inverse Transformation
  • the residues are then added back to prediction data at Reconstruction (REC) 128 to form reconstructed video data.
  • the reconstructed video data may be stored in Frame Buffer 140 and used for prediction of other frames.
  • incoming video data undergoes a series of processing in the encoding system.
  • the reconstructed video data from REC 128 may be subject to various impairments due to lossy processing (e.g. quantization) .
  • in-loop filter processing is often applied to the reconstructed video data before the reconstructed video data are stored in the Frame Buffer 140 in order to improve video quality.
  • deblocking filter (DF) 130 Sample Adaptive Offset (SAO) 132 and Adaptive Loop Filter (ALF) 134 may be used.
  • the loop filter information may need to be incorporated in the bitstream so that a decoder can properly recover the required information.
  • the decoder can use similar or portion of the reconstruction blocks as the encoder except for Transform 114 and Quantization 116 since the decoder only needs Inverse Quantization 126 and Inverse Transform 124.
  • Entropy Encoding 120 the decoder uses an Entropy Decoding 160 to decode the video bitstream into quantized transform coefficients and needed coding information (e.g. in-loop filter information, Intra prediction information and Inter prediction information) .
  • the Inter/Intra prediction 150 at the decoder side does not need to perform the Intra mode search nor motion estimation. Instead, the decoder only needs to generate Inter/Intra prediction according to Inter/Intra prediction information received from the encoder.
  • the decoder also uses Frame Buffer to store reconstructed pictures for Inter/Intra prediction.
  • the Inter/Intra prediction data derived by using Inter/Intra Prediction is added to the reconstructed residues from IT 124 using Adder 112.
  • the combined data is provided to REC 128 to form reconstructed pictures.
  • VVC images in the DF 130, SAO 132, and ALF 134 modules as shown in Fig. 1A and Fig. 1B all use the same resolution as those in the REC 128 module.
  • one of the target applications is real-time communication, such as video conference and live video.
  • VVC can adapt to dynamic network environment by adaptively adjust picture resolution. Therefore, the encoder can adapt to the worse network condition by encoding low-resolution frames, or transmit full-resolution video when network condition gets better.
  • the above mentioned applications can benefit from supporting spatial resolution changes within a video sequence under unstable network conditions, namely reference picture resampling (RPR) .
  • RPR reference picture resampling
  • a method and apparatus for video coding with in-loop filter at different resolutions form the reconstructed picture are disclosed.
  • reconstructed data associated with a current block or first filtered data of the reconstructed data is received.
  • One or more second filters are applied to the reconstructed data on the first filtered data to generate second filtered data, wherein at least one of said one or more second filters is performed with picture rescaling to change pre-filtering resolution to a different resolution.
  • the second filtered data is provided.
  • said one or more second filters comprise DF (Deblocking filter) , SAO (Sample Adaptive Offset) , ALF (Adaptive Loop Filter) , or a combination thereof.
  • DF Deblocking filter
  • SAO Sample Adaptive Offset
  • ALF Adaptive Loop Filter
  • the picture rescaling corresponds to up-sampling or down-sampling.
  • said at least one of said one or more second filters is determined and an on/off control syntax is signalled or parsed to indicate whether said at least one of said one or more second filters is performed with the picture rescaling to change the pre-filtering resolution.
  • the on/off control syntax is signalled or parsed in a slice level, picture level, sequence level, CTU-level, block level, or a combination thereof.
  • an index is signalled or parsed in a slice level, picture level, sequence level, CTU-level, block level, or a combination thereof to indicate whether said at least one of said one or more second filters is rescaled.
  • a number of CTUs being rescaled is changed if CTU size is kept the same as before the picture rescaling.
  • CTU size is rescaling to kept a number of CTUs the same as before the picture rescaling.
  • CTU size is unchanged and a number of CTUs being rescaled is re-calculated.
  • a filtered reconstruction picture corresponding to after said applying said one or more second filters to a reconstruction picture has a resolution different from the pre-filtering resolution
  • the filtered reconstruction picture is rescaled to a rescaled-filtered reconstruction picture having the pre-filtering resolution, and the filtered reconstruction picture or the rescaled-filtered reconstruction picture is stored in a buffer as one reference picture.
  • the filtered reconstruction picture is stored in a buffer as one reference picture.
  • a filtered reconstruction picture corresponding to after said applying said one or more second filters to a reconstruction picture has a resolution different from the pre-filtering resolution
  • the filtered reconstruction picture is rescaled to a rescaled-filtered reconstruction picture having the pre-filtering resolution, and both of the filtered reconstruction picture and the rescaled-filtered reconstruction picture are stored in a buffer as reference pictures.
  • Fig. 5 illustrates a flowchart of an exemplary video coding system that enables different resolution in the in-loop filter according to an embodiment of the present invention.
  • the steps shown in the flowchart may be implemented as program codes executable on one or more processors (e.g., one or more CPUs) at the encoder side.
  • the steps shown in the flowchart may also be implemented based hardware such as one or more electronic devices or processors arranged to perform the steps in the flowchart.
  • reconstructed data associated with a current block or first filtered data of the reconstructed data is received in step 510.
  • One or more second filters are applied to the reconstructed data on the first filtered data to generate second filtered data in step 520, wherein at least one of said one or more second filters is performed with picture rescaling to change pre-filtering resolution to a different resolution.
  • the second filtered data is provided in step 530.
  • the software code or firmware code may be developed in different programming languages and different formats or styles.
  • the software code may also be compiled for different target platforms.
  • different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Image Processing (AREA)

Abstract

A method and apparatus for video coding with in-loop filter at different resolutions form the reconstructed picture are disclosed. According to one method, reconstructed data associated with a current block or first filtered data of the reconstructed data is received. One or more second filters are applied to the reconstructed data on the first filtered data to generate second filtered data, wherein at least one of said one or more second filters is performed with picture rescaling to change pre-filtering resolution to a different resolution. The second filtered data is provided.

Description

METHODS AND APPARATUS OF ENABLING RESOLUTION CHANGE IN LOOP FILTER STAGE FOR VIDEO CODING
CROSS REFERENCE TO RELATED APPLICATIONS
The present invention is a non-Provisional Application of and claims priority to U.S. Provisional Patent Application No. 63/615,816, filed on December 29, 2023. The U.S. Provisional Patent Application is hereby incorporated by reference in its entirety.
FIELD OF THE INVENTION
The present invention relates to video coding system. In particular, the present invention relates to changing picture resolution for in-loop filter in a video coding system.
BACKGROUND AND RELATED ART
Versatile video coding (VVC) is the latest international video coding standard developed by the Joint Video Experts Team (JVET) of the ITU-T Video Coding Experts Group (VCEG) and the ISO/IEC Moving Picture Experts Group (MPEG) . The standard has been published as an ISO standard: ISO/IEC 23090-3: 2021, Information technology -Coded representation of immersive media -Part 3: Versatile video coding, published Feb. 2021. VVC is developed based on its predecessor HEVC (High Efficiency Video Coding) by adding more coding tools to improve coding efficiency and also to handle various types of video sources including 3-dimensional (3D) video signals.
Fig. 1A illustrates an exemplary adaptive Inter/Intra video encoding system incorporating loop processing. For Inter/Intra Prediction 110, the prediction data is derived based on previously coded video data in the current picture (i.e., Intra prediction) or previous reconstructed reference picture (s) . The prediction data is subtracted from the input data using Adder 112 to form prediction errors, also called residues. The prediction errors are then processed by Transform (T) 114 followed by Quantization (Q) 116. The transformed and quantized residues are then coded by Entropy Encoding 120 to be included in a video bitstream corresponding to the compressed video data. The bitstream associated with the residues is then packed with side information such as motion and coding modes associated with Inter/Intra prediction, and other information such as parameters associated with loop filters applied to underlying video area. The encoder also needs reconstructed data to derive the inter/intra prediction. Accordingly, the transformed and quantized residues are processed by Inverse Quantization (IQ) 126 and Inverse Transformation (IT) 124 to reconstruct the residues. The residues are then added back to prediction data at Reconstruction (REC) 128 to form reconstructed video data. The reconstructed video data may be stored in Frame Buffer 140 and used for prediction of other frames.
As shown in Fig. 1A, incoming video data undergoes a series of processing in the encoding system. The reconstructed video data from REC 128 may be subject to various impairments due to lossy processing (e.g. quantization) . Accordingly, in-loop filter processing is often applied to the reconstructed video data before the reconstructed video data are stored in the Frame Buffer 140 in order to improve video quality. For example, deblocking filter (DF) 130, Sample Adaptive Offset (SAO) 132 and Adaptive Loop Filter (ALF) 134 may be used. The loop filter information may need to be incorporated in the bitstream so that a decoder can properly recover the required information.
The decoder, as shown in Fig. 1B, can use similar or portion of the reconstruction blocks as the encoder except for Transform 114 and Quantization 116 since the decoder only needs Inverse Quantization 126 and Inverse Transform 124. Instead of Entropy Encoding 120, the decoder uses an Entropy Decoding 160 to decode the video bitstream into quantized transform coefficients and needed coding information (e.g. in-loop filter information, Intra prediction information and Inter prediction information) . The Inter/Intra prediction 150 at the decoder side does not need to perform the Intra mode search nor motion estimation. Instead, the decoder only needs to generate Inter/Intra prediction according to Inter/Intra prediction information received from the encoder. The decoder also uses Frame Buffer to store reconstructed pictures for Inter/Intra prediction. The Inter/Intra prediction data derived by using Inter/Intra Prediction is added to the reconstructed residues from IT 124 using Adder 112. The combined data is provided to REC 128 to form reconstructed pictures.
In VVC, images in the DF 130, SAO 132, and ALF 134 modules as shown in Fig. 1A and Fig. 1B all use the same resolution as those in the REC 128 module. In VVC, one of the target applications is real-time communication, such as video conference and live video. VVC can adapt to dynamic network environment by adaptively adjust picture resolution. Therefore, the encoder can adapt to the worse network condition by encoding low-resolution frames, or transmit full-resolution video when network condition gets better. The above mentioned applications can benefit from supporting spatial resolution changes within a video sequence under unstable network conditions, namely reference picture resampling (RPR) .
In the present invention, methods and apparatus to use adaptive picture resolution in the in-loop filter process are disclosed.
BRIEF SUMMARY OF THE INVENTION
A method and apparatus for video coding with in-loop filter at different resolutions form the reconstructed picture are disclosed. According to one method, reconstructed data associated with a current block or first filtered data of the reconstructed data is received. One or more second filters are applied to the reconstructed data on the first filtered data to generate second filtered data, wherein at least one of said one or more second filters is performed with picture rescaling to change pre-filtering resolution to a different resolution. The second filtered data is provided.
In one embodiment, said one or more second filters comprise DF (Deblocking filter) , SAO (Sample Adaptive Offset) , ALF (Adaptive Loop Filter) , or a combination thereof.
In one embodiment, the picture rescaling corresponds to up-sampling or down-sampling.
In one embodiment, said at least one of said one or more second filters is determined and an on/off control syntax is signalled or parsed to indicate whether said at least one of said one or more second filters is performed with the picture rescaling to change the pre-filtering resolution. In one embodiment, the on/off control syntax is signalled or parsed in a slice level, picture level, sequence level, CTU-level, block level, or a combination thereof.
In one embodiment, an index is signalled or parsed in a slice level, picture level, sequence level, CTU-level, block level, or a combination thereof to indicate whether said at least one of said one or more second filters is rescaled.
In one embodiment, for at least one of said one or more second filters, a number of CTUs being rescaled is changed if CTU size is kept the same as before the picture rescaling. In another embodiment, for at least one of said one or more second filters, CTU size is rescaling to kept a number of CTUs the same as before the picture rescaling. In yet another embodiment, for at least one of said one or more second filters, CTU size is unchanged and a number of CTUs being rescaled is re-calculated.
In one embodiment, for inter-coded pictures, if a filtered reconstruction picture corresponding to after said applying said one or more second filters to a reconstruction picture has a resolution different from the pre-filtering resolution, the filtered reconstruction picture is rescaled to a rescaled-filtered reconstruction picture having the pre-filtering resolution, and the filtered reconstruction picture or the rescaled-filtered reconstruction picture is stored in a buffer as one reference picture. In one embodiment, for inter-coded pictures, if a filtered reconstruction picture corresponding to after said applying said one or more second filters to a reconstruction picture has a resolution different from the pre-filtering resolution, the filtered reconstruction picture is stored in a buffer as one reference picture. In another embodiment, for inter-coded pictures, if a filtered reconstruction picture corresponding to after said applying said one or more second filters to a reconstruction picture has a resolution different from the pre-filtering resolution, the filtered reconstruction picture is rescaled to a rescaled-filtered reconstruction picture having the pre-filtering resolution, and both of the filtered reconstruction picture and the rescaled-filtered reconstruction picture are stored in a buffer as reference pictures.
In one embodiment, a picture-level and/or a slice-level control flag to control resolution of a target second filter is signalled or parsed after a second flag indicating whether the target second filter is on or off. In one embodiment, if the second flag indicates that the target second filter is off, the target second filter is not applied and target picture rescaling is not applied before and/or after the target second filter. In one embodiment, the picture-level and/or the slice-level control flag is signalled or parsed only if the second flag indicates that the target second filter is on. In one embodiment, the picture-level and/or the slice-level control flag is separate for different colour components. In one embodiment, if target resolution for the target second filter is allowed to be different, QP value for the target second filter is adaptively changed according to the target resolution.
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1A illustrates an exemplary adaptive Inter/Intra video coding system incorporating loop processing.
Fig. 1B illustrates a corresponding decoder for the encoder in Fig. 1A.
Fig. 2 illustrates an example of possible filter stages for applying different resolution according to embodiments of the present invention.
Fig. 3 illustrates an example of possible filter stages at DF, SAO and ALAF for applying different resolution according to embodiments of the present invention.
Fig. 4 illustrates an example of possible filter stages for applying different resolution and storing filtered reconstructed pictures at two different resolutions in the reference picture buffer according to embodiments of the present invention.
Fig. 5 illustrates a flowchart of an exemplary video coding system that enables different resolution in the in-loop filter according to an embodiment of the present invention.
DETAILED DESCRIPTION OF THE INVENTION
It will be readily understood that the components of the present invention, as generally described and illustrated in the figures herein, may be arranged and designed in a wide variety of different configurations. Thus, the following more detailed description of the embodiments of the systems and methods of the present invention, as represented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. References throughout this specification to “one embodiment, ” “an embodiment, ” or similar language mean that a particular feature, structure, or characteristic described in connection with the embodiment may be included in at least one embodiment of the present invention. Thus, appearances of the phrases “in one embodiment” or “in an embodiment” in various places throughout this specification are not necessarily all referring to the same embodiment.
Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. One skilled in the relevant art will recognize, however, that the invention can be practiced without one or more of the specific details, or with other methods, components, etc. In other instances, well-known structures, or operations are not shown or described in detail to avoid obscuring aspects of the invention. The illustrated embodiments of the invention will be best understood by reference to the drawings, wherein like parts are designated by like numerals throughout. The following description is intended only by way of example, and simply illustrates certain selected embodiments of apparatus and methods that are consistent with the invention as claimed herein.
In the present invention, methods and apparatus to enable in-loop filter at different resolutions are disclosed.
Scheme 1: Process In-Loop Filter at Desired Resolution
In one embodiment, the picture resolution can be different from the pre-LF resolution during in-loop filter (LF) processing at any stage. An example of possible location (or stage) of In-Loop Filter processing at desired resolution is shown in Fig. 2, where LR means low resolution and HR mean high resolution, where REC 128 and Inter/Intra Prediction 110 are performed at low resolution while DF 130, SAO 132 and ALF 134 are performed at high resolution.
For example, one picture is coded in smaller or larger resolution using the VVC reference picture rescaling (RPR) functionality at the pre-LF stage. Before processing the LF, we can rescale the picture to the original resolution or rescale to another resolution that is different from the resolution in pre-LF stage. The LF processing then can be applied at the original resolution or at another resolution that is different from the resolution in the pre-LF stage.
In another example, we can process partial stages of LF at the same resolution as pre-LF stage and do the rest of LF stages at desired resolution. In another example, the rescaling process can be applied after the partial stages LF stages.
For example, Fig. 3 shows the case of performing rescaling before ALF so that the processing in different resolutions is at ALF stage only, where only ALF is performed at HR while others (i.e., REC, DF and SAO) are performed at LR..
In one embodiment, the stage of LF to perform rescaling is fixed and an on/off control syntax is signalled at a high-level (e.g. at slice level, picture level and/or sequence level) , CTU-level, or block level to indicate whether the method is applied or not.
In another embodiment, and index is signalled at a high-level (e.g. slice level, picture level, and/or sequence level) , CTU-level, or block level to indicate which stage of LF to perform rescaling or not perform rescaling.
When the picture is rescaled at one of LF stages, the number of CTUs to be processed in LF stage may be changed if the CTU size is kept the same as pre-LF stage. In one embodiment, the CTU size is also rescaled to keep the number of CTU the same as the pre-LF stage.
For example, if we rescale the picture to make the width and height increased to 2 times, the CTU width and height are also rescaled by 2 times.
In another embodiment, the CTU size is unchanged, and the number of CTUs in LF stage should be re-calculated.
When inter-coded pictures are allowed and the above method is applied, the resolution of reconstruction after LF of the coded picture may be different from the pre-LF resolution of the current picture. In one embodiment, the reconstruction after LF of the coded picture is rescaled to the same resolution as pre-LF resolution of the current picture and put into frame buffer as one of the reference picture. In one embodiment, the reconstruction after LF is stored and put into the frame buffer as reference picture. In another embodiment, two reconstructions are stored into the frame buffer if the resolution is changed during LF. One is the after LF reconstruction and the other one is the same resolution as pre-LF which is rescaled from the after LF one. In another example, the reconstruction picture after LF is always rescaled to the high-resolution (maximum resolution, or the original resolution) if needed. The high-resolution picture can also be put in the reference frame buffer. In yet another example, the coded reference picture is rescaled from the high-resolution picture reconstructed picture to the same resolution as pre-LF of current picture.
For example, we choose to process ALF at different resolution. The reconstruction after ALF can be put into frame buffer directly (path 1) . Also, the reconstruction after ALF can be rescaled to the same resolution as pre-LF (path 2) . In above embodiment, we can use path 1, path 2, or both as shown in Fig. 4.
In one embodiment, a flag is signalled to indicate whether an LF is performed at a higher resolution or not, and the flag is signalled after the on/off control of the LF. Specifically, the flag is signalled only when the LF is turned on.
In another embodiment, an LF can be performed at a resolution different from the pre-LF resolution. If the LF is turned off, the reconstructed picture not only bypasses the LF, but also the rescaling process before and/or after the LF.
For example, we choose to process ALF at a different resolution, and the reconstruction after ALF is rescaled to the same resolution as pre-LF (i.e., path 2 in Fig. 4) . If ALF is turned off, not only ALF is bypassed, but also the rescaling process before and after ALF are bypassed.
In the above embodiments, the design can be separate for different luma/chroma components.
For example, a flag indicating whether a luma/chroma LF is performed at a higher resolution or not is signalled after the on/off control of the luma/chroma LF.
For another example, we choose to process ALF at a different resolution, and the reconstruction after ALF is rescaled to the same resolution as pre-LF (path 2 in Fig. 4) . If luma ALF is turned on but chroma ALF and CCALF are turned off, the luma component is processed normally, but for the chroma components, both ALF and the rescaling process before and after ALF are bypassed.
In one embodiment, LF can be processed at different resolutions. The QP value for the LF stage is adaptively changed according to the resolution.
For example, if ALF is processed at the RPR (down-scaled) resolution, the QP value for ALF fixed filter selection follows the signalled QP value; while if ALF is processed at the original resolution, the QP value for ALF fixed filter selection will be adjusted according to the RPR scaling ratio.
Any of the foregoing proposed methods of applying in-loop filter at different resolution can be implemented in encoders and/or decoders. For example, any of the proposed methods can be implemented in one module of an encoders and/or decoder. Alternatively, any of the proposed methods can be implemented as a circuit coupled to one module of the encoders and/or decoder, so as to provide the information needed by the module used in encoders and/or decoder. The proposed methods of applying in-loop filter at different resolution can be implemented in an encoder side or a decoder side. For example, with reference to the encoder and decoder in Fig. 1A and Fig. 1B, any of the proposed methods can be implemented in an Intra/Inter coding module (e.g. Intra Pred. 150/MC 152 in Fig. 1B) in a decoder or an Intra/Inter coding module is an encoder (e.g. Intra Pred. 110/Inter Pred. 112 in Fig. 1A) . However, the decoder or encoder may also use additional processing unit to implement the proposed method. While the Intra Pred. units (e.g. unit 110/112 in Fig. 1A and unit 150/152 in Fig. 1B) are shown as individual processing units, they may correspond to executable software or firmware codes stored on a media, such as hard disk or flash memory, for a CPU (Central Processing Unit) or programmable devices (e.g. DSP (Digital Signal Processor) or FPGA (Field Programmable Gate Array)) .
Fig. 5 illustrates a flowchart of an exemplary video coding system that enables different resolution in the in-loop filter according to an embodiment of the present invention. The steps shown in the flowchart may be implemented as program codes executable on one or more processors (e.g., one or more CPUs) at the encoder side. The steps shown in the flowchart may also be implemented based hardware such as one or more electronic devices or processors arranged to perform the steps in the flowchart. According to one method, reconstructed data associated with a current block or first filtered data of the reconstructed data is received in step 510. One or more second filters are applied to the reconstructed data on the first filtered data to generate second filtered data in step 520, wherein at least one of said one or more second filters is performed with picture rescaling to change pre-filtering resolution to a different resolution. The second filtered data is provided in step 530.
The flowchart shown is intended to illustrate an example of video coding according to the present invention. A person skilled in the art may modify each step, re-arranges the steps, split a step, or combine steps to practice the present invention without departing from the spirit of the present invention. In the disclosure, specific syntax and semantics have been used to illustrate examples to implement embodiments of the present invention. A skilled person may practice the present invention by substituting the syntax and semantics with equivalent syntax and semantics without departing from the spirit of the present invention.
The above description is presented to enable a person of ordinary skill in the art to practice the present invention as provided in the context of a particular application and its requirement. Various modifications to the described embodiments will be apparent to those with skill in the art, and the general principles defined herein may be applied to other embodiments. Therefore, the present invention is not intended to be limited to the particular embodiments shown and described, but is to be accorded the widest scope consistent with the principles and novel features herein disclosed. In the above detailed description, various specific details are illustrated in order to provide a thorough understanding of the present invention. Nevertheless, it will be understood by those skilled in the art that the present invention may be practiced.
Embodiment of the present invention as described above may be implemented in various hardware, software codes, or a combination of both. For example, an embodiment of the present invention can be one or more circuit circuits integrated into a video compression chip or program code integrated into video compression software to perform the processing described herein. An embodiment of the present invention may also be program code to be executed on a Digital Signal Processor (DSP) to perform the processing described herein. The invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or field programmable gate array (FPGA) . These processors can be configured to perform particular tasks according to the invention, by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention. The software code or firmware code may be developed in different programming languages and different formats or styles. The software code may also be compiled for different target platforms. However, different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.
The invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described examples are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.

Claims (19)

  1. A method of video coding, the method comprising:
    receiving reconstructed data associated with a current block or first filtered data of the reconstructed data;
    applying one or more second filters to the reconstructed data on the first filtered data to generate second filtered data, wherein at least one of said one or more second filters is performed with picture rescaling to change pre-filtering resolution to a different resolution; and
    providing the second filtered data.
  2. The method of Claim 1, wherein said one or more second filters comprise DF (Deblocking filter) , SAO (Sample Adaptive Offset) , ALF (Adaptive Loop Filter) , or a combination thereof.
  3. The method of Claim 1, wherein the picture rescaling corresponds to up-sampling or down-sampling.
  4. The method of Claim 1, wherein said at least one of said one or more second filters is determined and an on/off control syntax is signalled or parsed to indicate whether said at least one of said one or more second filters is performed with the picture rescaling to change the pre-filtering resolution.
  5. The method of Claim 4, wherein the on/off control syntax is signalled or parsed in a slice level, picture level, sequence level, CTU-level, block level, or a combination thereof.
  6. The method of Claim 1, wherein an index is signalled or parsed in a slice level, picture level, sequence level, CTU-level, block level, or a combination thereof to indicate whether said at least one of said one or more second filters is rescaled.
  7. The method of Claim 1, wherein for at least one of said one or more second filters, a number of CTUs being rescaled is changed if CTU size is kept the same as before the picture rescaling.
  8. The method of Claim 1, wherein for at least one of said one or more second filters, CTU size is rescaling to kept a number of CTUs the same as before the picture rescaling.
  9. The method of Claim 1, wherein for at least one of said one or more second filters, CTU size is unchanged and a number of CTUs being rescaled is re-calculated.
  10. The method of Claim 1, wherein for inter-coded pictures, if a filtered reconstruction picture corresponding to after said applying said one or more second filters to a reconstruction picture has a resolution different from the pre-filtering resolution, the filtered reconstruction picture is rescaled to a rescaled-filtered reconstruction picture having the pre-filtering resolution, and the filtered reconstruction picture or the rescaled-filtered reconstruction picture is stored in a buffer as one reference picture.
  11. The method of Claim 1, wherein for inter-coded pictures, if a filtered reconstruction picture corresponding to after said applying said one or more second filters to a reconstruction picture has a resolution different from the pre-filtering resolution, the filtered reconstruction picture is stored in a buffer as one reference picture.
  12. The method of Claim 1, wherein for inter-coded pictures, if a filtered reconstruction picture corresponding to after said applying said one or more second filters to a reconstruction picture has a resolution different from the pre-filtering resolution, the filtered reconstruction picture is rescaled to a rescaled-filtered reconstruction picture having the pre-filtering resolution, and both of the filtered reconstruction picture and the rescaled-filtered reconstruction picture are stored in a buffer as reference pictures.
  13. The method of Claim 1, wherein if a filtered reconstruction picture corresponding to after said applying said one or more second filters to a reconstruction picture has a same resolution as the pre-filtering resolution, the filtered reconstruction picture is stored in a buffer as one reference picture.
  14. The method of Claim 1, wherein a picture-level and/or a slice-level control flag to control resolution of a target second filter is signalled or parsed after a second flag indicating whether the target second filter is on or off.
  15. The method of Claim 14, wherein if the second flag indicates that the target second filter is off, the target second filter is not applied and target picture rescaling is not applied before and/or after the target second filter.
  16. The method of Claim 14, wherein the picture-level and/or the slice-level control flag is signalled or parsed only if the second flag indicates that the target second filter is on.
  17. The method of Claim 14, wherein the picture-level and/or the slice-level control flag is separate for different colour components.
  18. The method of Claim 14, wherein if target resolution for the target second filter is allowed to be different, QP value for the target second filter is adaptively changed according to the target resolution.
  19. An apparatus for video coding, the apparatus comprising one or more electronics or processors arranged to:
    receive reconstructed data associated with a current block or first filtered data of the reconstructed data;
    apply one or more second filters to the reconstructed data on the first filtered data to generate second filtered data, wherein at least one of said one or more second filters is performed with picture rescaling to change pre-filtering resolution to a different resolution; and
    provide the second filtered data.
PCT/CN2024/139042 2023-12-29 2024-12-13 Methods and apparatus of enabling resolution change in loop filter stage for video coding Pending WO2025139841A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW113151254A TW202527543A (en) 2023-12-29 2024-12-27 Methods and apparatus of enabling resolution change in loop filter stage for video coding

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US202363615816P 2023-12-29 2023-12-29
US63/615,816 2023-12-29

Publications (1)

Publication Number Publication Date
WO2025139841A1 true WO2025139841A1 (en) 2025-07-03

Family

ID=96216750

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2024/139042 Pending WO2025139841A1 (en) 2023-12-29 2024-12-13 Methods and apparatus of enabling resolution change in loop filter stage for video coding

Country Status (2)

Country Link
TW (1) TW202527543A (en)
WO (1) WO2025139841A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210092391A1 (en) * 2019-09-24 2021-03-25 Tencent America LLC Method for reference picture resampling with offset in video bitstream
CN115315945A (en) * 2020-12-23 2022-11-08 腾讯美国有限责任公司 Video coding and decoding method and device
WO2023048500A1 (en) * 2021-09-23 2023-03-30 엘지전자 주식회사 Image encoding/decoding method, method for transmitting bitstream, and recording medium in which bitstream is stored
CN116438795A (en) * 2020-10-06 2023-07-14 交互数字Vc控股法国有限公司 Spatial resolution adaptation for in-loop filtering and post-filtering of compressed video using metadata
CN116965024A (en) * 2021-08-02 2023-10-27 腾讯美国有限责任公司 Adaptive upsampling filters for luma and chroma with reference picture resampling (RPR)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210092391A1 (en) * 2019-09-24 2021-03-25 Tencent America LLC Method for reference picture resampling with offset in video bitstream
CN116438795A (en) * 2020-10-06 2023-07-14 交互数字Vc控股法国有限公司 Spatial resolution adaptation for in-loop filtering and post-filtering of compressed video using metadata
CN115315945A (en) * 2020-12-23 2022-11-08 腾讯美国有限责任公司 Video coding and decoding method and device
CN116965024A (en) * 2021-08-02 2023-10-27 腾讯美国有限责任公司 Adaptive upsampling filters for luma and chroma with reference picture resampling (RPR)
WO2023048500A1 (en) * 2021-09-23 2023-03-30 엘지전자 주식회사 Image encoding/decoding method, method for transmitting bitstream, and recording medium in which bitstream is stored

Also Published As

Publication number Publication date
TW202527543A (en) 2025-07-01

Similar Documents

Publication Publication Date Title
CN114073094B (en) Video encoding and decoding method and device
US10123048B2 (en) Method of filter control for block-based adaptive loop filtering
JP6634151B2 (en) Method and apparatus for adaptive inter prediction in video coding
US11483549B2 (en) Methods and apparatuses for transform skip mode information signaling
US10116933B2 (en) Method of lossless mode signaling for video system with lossless and lossy coding
US20100008428A1 (en) Multistandard video recorder
US20110103464A1 (en) Methods and Apparatus for Locally Adaptive Filtering for Motion Compensation Interpolation and Reference Picture Filtering
EP4005213A1 (en) Method and apparatus of cross-component adaptive loop filtering for video coding
US8565558B2 (en) Method and system for interpolating fractional video pixels
US20140198840A1 (en) Method and apparatus for adaptive loop filter with constrained filter coefficients
US20230007311A1 (en) Image encoding device, image encoding method and storage medium, image decoding device, and image decoding method and storage medium
US20210185332A1 (en) Method and Apparatus of Scaling Window Constraint for Worst Case Bandwidth Consideration for Reference Picture Resampling in Video Coding
KR20240089011A (en) Video coding using optional neural network-based coding tools
WO2025139841A1 (en) Methods and apparatus of enabling resolution change in loop filter stage for video coding
EP1599049A2 (en) Multistandard video decoder
WO2024008060A1 (en) Method and apparatus of dependent quantization for video coding
TWI784367B (en) Video encoding or decoding methods and apparatuses with scaling ratio constraint
WO2025139532A1 (en) Methods and apparatus of enabling tools on scaled reference picture for video coding
US20250392717A1 (en) Method and Apparatus of Dependent Quantization for Video Coding

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24910772

Country of ref document: EP

Kind code of ref document: A1