US20190364291A1 - Method and system for improved image compression - Google Patents
Method and system for improved image compression Download PDFInfo
- Publication number
- US20190364291A1 US20190364291A1 US16/533,992 US201916533992A US2019364291A1 US 20190364291 A1 US20190364291 A1 US 20190364291A1 US 201916533992 A US201916533992 A US 201916533992A US 2019364291 A1 US2019364291 A1 US 2019364291A1
- Authority
- US
- United States
- Prior art keywords
- plane
- pixel
- pixels
- image data
- original image
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
- 238000000034 method Methods 0.000 title claims description 29
- 230000006835 compression Effects 0.000 title abstract description 22
- 238000007906 compression Methods 0.000 title abstract description 22
- 238000007781 pre-processing Methods 0.000 claims abstract description 38
- 230000000694 effects Effects 0.000 description 11
- 238000010586 diagram Methods 0.000 description 6
- 230000000007 visual effect Effects 0.000 description 6
- 230000006870 function Effects 0.000 description 5
- 238000004891 communication Methods 0.000 description 3
- 230000000116 mitigating effect Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
- H04N19/423—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements
- H04N19/426—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements using memory downsizing methods
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/117—Filters, e.g. for pre-processing or post-processing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/167—Position within a video image, e.g. region of interest [ROI]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/182—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a pixel
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/80—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
- H04N19/86—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving reduction of coding artifacts, e.g. of blockiness
Definitions
- FIG. 1 illustrates an example block diagram of a system for performing deblocking on pre-compressed visual data according to some implementations.
- FIG. 2 illustrates an example graphical representation of deblocking planes generated with respect to an original image according to some implementations.
- FIG. 3 illustrates an example graphical representation of an original image with padding to accommodate shifting of the original image with respect to generation of one or more planes according to some implementations.
- FIG. 4 illustrates another example graphical representation of deblocking planes generated with respect to an original image according to some implementations.
- FIG. 5 illustrates an example graphical representation of weighting a pixel within a block of a deblocking plane according to some implementations.
- FIG. 6 another illustrates an example graphical representation of weighting a pixel within a block of a deblocking plane according to some implementations.
- FIG. 7 illustrates another example graphical representation of weighting a pixel within a block of a deblocking plane according to some implementations.
- FIG. 8 illustrates yet another example graphical representation of weighting a pixel within a block of a deblocking plane according to some implementations.
- FIG. 9 illustrates an example a deblocking unit for deblocking pre-processed planes according to some implementations.
- FIG. 10 illustrates example components an electronic device that may be configured to perform deblocking on an image prior to encoding according to some implementations
- FIG. 11 is an example flow diagram showing an illustrative process for deblocking an image prior to encoding according to some implementations.
- This disclosure includes techniques and implementations for deblocking of image data, including spatiotemporal three-dimensional video sequences, to improve effective compressing rates realized by a video encoder.
- image data is often pre-processed prior to compressing by a video encoder to improve the compression rate of the image data when compared with a non-processed image.
- many conventional systems utilize block based pre-processing systems that introduce imperceptible irregularities (e.g., data not noticeable or detectable by the human eye) along the edge of the blocks.
- the pre-processed image data including the imperceptible irregularities are then encoded and transmitted to a receiver (such as a set-top-box).
- the image data may be provided to the pre-processor to improve the overall compression rate of the image data.
- the pre-processed data may then be deblocked either by the pre-processor or by the encoder prior to compressing.
- the deblocking may include receiving a plurality of planes, each representative of at least a portion of the image data.
- each plane may be offset from a top left corner of the image, designated at (0,0), by X and Y coordinate combinations formed based on a predetermined offset value.
- one possible series of offsets for an 8 ⁇ 8 image with a preprocess that utilizes a block size of 4 ⁇ 4 may be based on a predetermined offset value of two (e.g., half the block size).
- the offset values for X and Y may be (0,0), (2,0), (0,2), and (2,2).
- the number of planes for an image may set to four.
- the offset may be based on a value other than half the block size as discussed above.
- the plane size may be based on the size of the predetermined offset value and the size of the plane.
- the plane size may be set to eight by eight and the coordinates of each plane offset from (0,0) at the top left corner may be (0,0), (0,2), (2,0), and (2,2).
- each plane having a size of eight by eight e.g., the size of the image minus the offset value
- the image data may be provided to the pre-processor to improve the overall compression rate with respect to the original image data.
- the pre-processed data may then be deblocked either by the pre-processor or by the encoder prior to compressing.
- the deblocking unit may receive a single plane representing at least a portion of the image to be deblocked, and may or may not carry associated data indicating the size and alignment of a plurality of processed blocks comprising the plane.
- other deblocking methods may be employed to determine block boundaries and remove pre-processing edge artifacts, rendering the plane more compressible.
- the number of planes for an image may set to four and the offset set to half the block size. For instance, if the image has a size of eight by eight and the predetermined offset value of four, the planes may be offset by X, Y values of (0,0), (0,2), (2,0), and (2,2), as discussed above.
- the boundary pixels may be ignored as in some systems (e.g., video processing systems) boundary pixels may not affect the compressibility of the image, as some types of pre-processing, such as motion estimation, do not extend outside the boundary of the image.
- the blocks for the plane at (0,0) may be four by four
- the blocks for the plane at (0,2) may be four by four or four by two
- the blocks for the plane at (2,0) may be two by four or four by four
- the blocks for the plane at (2,2) may be two by two, two by four, four by four, or four by two.
- the blocks of each of the four planes overlap such that none of the interior pixels of the image are boundary pixels for all of the blocks over all of the planes (e.g., each interior pixel of the image is also an interior pixel for at least one block within at least one plane).
- the number of planes for an image may set to four.
- the plane size may be based on the size of the predetermined offset value and the size of a padded version of the original image.
- the padding may cause boundary pixels of the original image to be interior pixels of at least one block of at least one plane.
- the pre-processor may extend or shift the image by mirroring the edges pixels or padding the original image.
- the padding may be equal to half a block size or in this case of a block size of 4 ⁇ 4, the padding may add two additional pixels around the exterior of the original image.
- the top left corner of a block within each plane offset from (0,0) of the original image may be ( ⁇ 2, ⁇ 2), ( ⁇ 2,0), (0, ⁇ 2), and (0,0).
- the four planes overlap such that none of the interior pixels nor the exterior pixels of the original image are boundary pixels for all of the blocks over all of the planes (e.g., each pixel of the original image is also an interior pixel for at least one block of at least one plane).
- FIG. 1 illustrates an example block diagram of a system 100 for performing deblocking on pre-compressed visual data according to some implementations.
- an image 102 such as one image of a video sequence
- a pre-processor 104 may process the image 102 in a manner to improve compressibility by the encoder or video encoder 106 .
- the pre-processing of the original image 102 by the pre-processor 104 may introduce imperceptible irregularities (e.g., data not noticeable or detectable by the human eye) along the edge of the block, particularly when block based pre-processing is performed.
- imperceptible irregularities may introduce additional data to be compressed, encoded, or otherwise processed by the encoder 106 reducing an overall achievable compression ratio and increasing the overall bandwidth usage of transmitting the image 102 or the video sequence associated with the image 102 .
- the pre-processor 104 may generate a plurality of planes 108 representative of shifted versions of the original image 102 and perform the pre-processing on each of the individual planes 108 . While the pre-processing of each plane 108 by the pre-processor 104 may introduce imperceptible irregularities along the boundary of blocks of each plane 108 , the planes 108 may be shifted with respect to the original image such that each pixel (or in other cases, each interior pixel) of the original image 102 is an interior pixel of at least one block of at least one plane 108 . Thus, mitigating the effect of the imperceptible irregularities introduced along the boundary of each block with respect to at least one copy of the pixel from the original image 102 .
- the planes 108 are then provided to a deblocking unit 110 .
- the deblocking unit 110 combines the plurality of planes 108 back into a single pre-processed and deblocked image 112 .
- the deblocking unit 110 may select a pixel position corresponding to a pixel of the original image 102 and utilize the pixels of at least one block of at least one plane 108 to generate the pixel of the pre-processed and deblocked image 112 at the pixel position.
- the deblocking unit 112 may identify each of the blocks within one of the planes 108 at which the pixel position exists and weight each pixel based on a distance from the center of the corresponding block.
- the pre-processed and deblocked image 112 generated by the deblocking unit 110 contains image data that is least likely to experience edge effects during pre-processing, thus, reducing the effect that the imperceptible irregularities has on compression by encoder 106 .
- the deblocking unit 110 may determine a weight associated with a pixel of a block based on a pixel's position relative to the center of the block, as pixels near the center of a block contain more accurate estimates than pixels near the edges of the block. Thus, the pixels that are near the center of the block are assigned a higher weight.
- a sum of the weights across all blocks having a pixel at a corresponding pixel position may be equal to 1.0. For instance, if N different planes 108 are used, the final pixel weight of a pixel at a selected position in the deblocked image 112 is calculated by summing the weighted pixels at the selected position for each plane 108 .
- the deblocked image 112 may be expressed as P′(i,j) and generated from the planes 108 as follows:
- the encoder 106 may compress the pre-processed and deblocked image 112 and generate a compressed image or date 114 .
- the compressed image 114 may then be transmitted, for example, to a decoder for display to a user.
- FIG. 2 illustrates an example graphical representation 200 of deblocking planes generated with respect to an original image 202 having pixels 204 according to some implementations.
- the pixels 204 may be referred to with respect to a position offset from a pixel 206 at position (0,0).
- the pixel 208 may have a position of (1,4) (e.g., one pixel to the right of the pixel 206 and four pixels below the pixel 206 ).
- planes such as planes 210 , 212 , 214 , and 216
- planes may be generated by a pre-processor.
- the planes 210 - 216 may be processed according to blocks, such as the illustrated blocks 218 - 268 .
- the plane 210 may include blocks 218 - 224
- the plane 212 may include the blocks 226 - 236
- the plane 214 may include the blocks 238 - 254
- the plane 216 may include blocks 256 - 266 .
- the pre-processor may generate the planes 210 - 216 such that each pixel 204 of the original image 202 are interior pixels of at least one block of at least one block of one of the planes 210 - 216 , as shown via the shading of the pixels 204 within the planes 210 - 216 .
- the planes are not illustrated with padding. However, if padding was used to extend the boundary of the original image 202 prior to processing by the pre-processor, each of the shaded pixels within a plane would correspond to an interior pixel of at least one block 218 - 266 .
- a deblocked image may be formed by using only interior pixels of the block 218 - 266 to further reduce the effect of the imperceptible irregularities on the compression rates associated with the original image 202 .
- the pixels 204 may be weighted based on a distance from the edge of a block 218 - 266 containing the pixel being weighted. The pixels may then be used to form the deblocked image based on the weight assigned.
- FIG. 3 illustrates an example graphical representation 300 of an original image 302 with padding, generally indicated by dashed line 304 , to accommodate shifting of the original image with respect to generation of one or more planes according to some implementations.
- the pre-processor or other image processing component may be configured to extend the border or edges of the image 302 by mirroring or adding pixel, such as shaded pixels 306 , around the exterior of the image pixels, such as pixels 308 .
- each pixel 308 of the original image may be an interior pixel of at least one block of at least one plane as will be discussed in more detail below with respect to FIG. 4 .
- FIG. 4 illustrates another example graphical representation 400 of deblocking planes 402 - 408 generated with respect to padded version of the original image 302 of FIG. 3 according to some implementations.
- the graphical representation 400 illustrates a pixel 410 of an original image 412 within each of the deblocking planes 402 - 408 and a deblocked image 414 .
- the pixel 410 is within a different block of each of the planes 402 - 408 .
- the pixel 410 is within the block 416 , within plane 404 , the pixel 410 is within the block 418 , within plane 406 , the pixel 410 is within the block 420 , and within plane 402 , the pixel 410 is within the block 422 .
- the pixel 410 is in the same position in each plane 402 - 408 with respect to the (0,0) pixel 424 of the original image 412 (e.g., the pixel 410 in each plane 402 - 408 is at a position (3,3) or three pixels to the right of pixel 424 and three pixels below the pixel 424 ).
- the deblocking unit selects, weights, and combines the image data from each of the planes 402 - 408 to generate the deblocked image 414 , the deblocking unit identifies each plane 402 - 408 having a pixel at a select position of the original image 412 and weights each of the identified pixels based on a distance from the center of the corresponding block, such as blocks 416 - 422 . For instance, in the current example, the pixel 410 is nearer the center of the block 416 than the pixel 410 within the blocks 418 , 420 , and 422 .
- the data from the plane 402 associated with the pixel 410 may have a higher weight than the corresponding pixel 410 within the planes 404 - 408 , and thus will contribute more data to the pixel 410 within the deblocked image 412 .
- the pixel 410 within planes 404 - 408 will have a lower weight than the pixel 410 within plane 402 and, thus, will contribute less image data to the deblocked image 412 with respect to pixel 410 .
- the pixels of the planes 402 - 408 extend beyond the edge of the original image 412 in each case.
- the edge pixels such as pixel 424 at (0,0)
- the pre-processor may generate the additional pixels of the planes 402 - 08 by padding, mirroring, or extending the original image 412 prior to performing additional pre-processing operations on each plane 402 - 408 .
- FIG. 5 illustrates an example graphical representation 500 of weighting a pixel 502 within a block 506 of a deblocking plane 504 according to some implementations
- weights may be assigned to individual pixels within a block, such as block 506 based on a distance, such as a distance 508 , between the individual pixel 502 and a center of the block 510 .
- the weight of a pixel may be calculated as:
- w(i,j) is the weight of a pixel at position (i,j)
- d is the distance
- B 2 is half the preprocessor block size (or plane size).
- FIG. 6 illustrates another example graphical representation 600 of weighting a pixel 602 within a block 604 of a deblocking plane according to some implementations.
- a block 604 of the plane is shown.
- the block 604 is a block having a size of an eight by eight.
- the weights may be assigned to individual pixels within the block 604 based on a distance, such as a distance 606 , between the individual pixel 602 and a center 608 of the block 604 . Similar to the weighting discussed above, the weight of a pixel may be calculated as:
- w(i,j) is the weight of a pixel at position (i,j)
- d is the distance
- B 2 is half the preprocessor block size (or plane size).
- the weight of the pixel 602 is being determined using the distance 606 .
- the pixels along the edge may be less than zero.
- the weight may be set to zero.
- the pixels 610 having a weight of zero are the pixels that fall along or outside of the line 612 .
- FIG. 7 illustrates an example graphical representation 700 of weighting a pixel 702 within a block 704 of a deblocking plane according to some implementations.
- a block 704 of the plane is shown and weights are again assigned to individual pixels of the block 704 based on a distance, such as a distance 706 , between the individual pixel and a center 708 of the block 704 .
- the weight of a pixel may be calculated as:
- w(i,j) is the weight of a pixel at position (i,j)
- d is the distance
- R 2 is the distance from the corner to the center of the block 704 times the square root of two.
- the corner pixels of a plane will have the lowest weight.
- the weight of the pixel 702 is being determined using the distance 706 .
- the weights are greater than zero as the zero condition is shown by line 710 .
- FIGS. 6 and 7 illustrate two examples weights that may be used by a deblocking unit to generate a deblocked image from a plurality of planes.
- weighting metrics may be used.
- the weight of a pixel may be determined to be proportional to the cosine of the distance from the edge of the block 704 rather than being linearly related to the distance from the center 708 .
- the weight may be calculated as follows:
- the weight of a pixel may be determined to be proportional to the cosine of the distance from the corner of a block as follows:
- each additional pixel may be ignored or assigned a weight of 0.0.
- using a block size of eight by eight only the four center pixels may be used from each plane when generating a deblocked image.
- each plane may be given an equal weight or set to a value of 1/K where K is the number of planes. It should be understood that additional weighting techniques may be utilized by the deblocking unit to merge or combine image data on a pixel by pixel level. In each case, the weight of corresponding pixels across the planes may be summed to a value of one.
- FIG. 8 illustrates an example graphical representation 800 of deblocking planes 802 - 830 generated with respect to an original image 832 and a deblocked image 834 according to some implementations.
- the planes 802 - 834 may include blocks having a five by five size having a starting positioned shifted by one pixel for each plane 802 - 830 .
- the highlighted pixel 836 is shown in the original image 832 , the deblocked image 834 , and planes 802 - 830 with respect to a shifted block within the planes.
- FIG. 9 illustrates an example a deblocking unit 900 for deblocking pre-processed planes 902 according to some implementations.
- a pre-processor (not shown) may have generated the plurality of planes 902 with respect to an original image to improve the overall compression rate with respect to the original image, as discussed above.
- Each of the planes may represent a portion of the original image data after the pre-processing operations are performed.
- each of the planes 902 may be a shifted representation of the original image with additional pixels added as padding to accommodate the shifting.
- the planes 902 may have less image data or be of a smaller size than the original image.
- a size of amount of data or position of the blocks within each plane 902 associated with the original image may vary.
- the deblocking unit 900 may include several modules or components, such as a pixel selection unit 904 , a pixel weighting unit 906 , and a merge unit 908 .
- the pixel selection unit 904 may be configured to select a current pixel to process and identify each plane 902 that has image data associated with the selected pixel.
- the pixel weighting unit 906 may be configured to determine a weight to assign to corresponding pixels within each plane 902 contributing image data to a particular pixel of the daglocked image 910 . For instance, as discussed above, weights are assigned to individual pixels of the planes 902 based on a distance between the pixel selected by the pixel selecting unit and a center of a corresponding block within a plane 902 . In one example, the weight of a pixel may be calculated as:
- weights may be calculated as:
- the weight of a pixel may be determined to be proportional to the cosine of the distance from the edge of the corresponding plane 902 rather than being linearly related to the distance from the center. For instance, the weight may be calculated as follows:
- the weight of a pixel may be determined to be proportional to the cosine of the distance from the corner of a block as follows:
- pixels of each plane 902 may be given an equal weight, such as set to a value of 1/K where K is the number of planes 902 .
- the merger unit 908 may be configured to merge the image data of the pixels from each plane 902 corresponding to the pixel selected by the pixel selecting unit 904 according to the weight applied. For example, if the planes 902 are represented as P 1 to P n , (i,j) are used to represent a position of each pixel within the original image, and w n (i,j) represents the weight of the pixel at the position (i,j), then the deblocked image 910 may be expressed as P′(i,j) and generated by the merge unit 908 from the pixels of the planes 902 as follows:
- FIG. 10 illustrates example components an electronic device 1000 that may be configured to perform deblocking on an image prior to encoding according to some implementations.
- electronic device 1000 may include processing resources, as represented by processors 1002 , and computer-readable storage media 1004 .
- the computer-readable storage media 1004 may include volatile and nonvolatile memory, removable and non-removable media implemented in any method or technology for storage of information, such as computer-readable instructions, data structures, program modules, or other data.
- Such memory includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, RAID storage systems, or any other medium which can be used to store the desired information and which can be accessed by a computing device.
- the electronic device 1000 may also include one or more communication interfaces 1006 , which may support both wired and wireless connection to various networks, such as cellular networks, radio (e.g., radio-frequency identification RFID), WiFi networks, short-range or near-field networks (e.g., Bluetooth®), infrared signals, local area networks, wide area networks, the Internet, and so forth.
- the communication interfaces 1006 may allow the electronic device 1000 to send or stream compressed video or image data over one or more networks, such as the Internet®.
- modules, sets of instructions, data stores, and so forth may be stored within the computer-readable media 1004 and configured to execute on the processors 1002 .
- a pre-processing module 1008 a deblocking module 1010 , and an encoding module 1012 , as well as other modules 1014 .
- the computer-readable media 1004 may store data, such as input images or original image 1016 , image planes 1018 , deblocked images 1020 , compressed images 1022 , block sizes 1024 , and weighting functions 1026 .
- the pre-processing module 1008 may generate a plurality of image planes 1018 representative of an original image 1016 and perform the pre-processing on each of the individual planes 1018 . While the pre-processing of each plane 1018 by the pre-processing module 1008 may introduce imperceptible irregularities along the boundary of each block, the planes 1018 and/or blocks within each plane 1018 may be shifted with respect to the original image 1016 such that each pixel (or in other cases, each interior pixel) of the original image 1016 is an interior pixel of at least one block of at least one plane 1018 . Thus, mitigating the effect of the imperceptible irregularities introduced along the boundary of each block with respect to at least one copy of the pixel from the original image 1016 .
- the deblocking module 1010 may receive a plurality of planes 1018 representative of the original image 1016 and merge image data of one or more of the plurality of planes 1018 back into a single pre-processed and deblocked image 1024 .
- the deblocking module 1010 may select a pixel position corresponding to a pixel of the original image 1016 and utilize the pixels of at least one block of at least one plane 1018 to generate the pixel of the pre-processed and deblocked image 1020 at the pixel position.
- the deblocking module 1010 may identify each of the planes 1018 at which the pixel position exists and weight each pixel based on a distance from the center of the corresponding plane 1018 and a weighting function 1026 .
- the pre-processed and deblocked image 1020 generated by the deblocking module 1010 contains image data that is least likely to experience edge effects during pre-processing, thus, reducing the effect that the imperceptible irregularities has on compression by encoder module 1012 .
- the deblocking module 1010 may determine a weight associated with a pixel of a plane 1018 based on a weighting function 1026 associated with a distance from the center of a block. For example, the pixels that are near the center of a block may be assigned a higher weight than the pixel near the edge of the block.
- the deblocking module 1010 may sum of the weights across all planes 1018 corresponding to a single pixel position may be equal to 1.0. For instance, if N different planes 1018 are used, the final pixel weight of a pixel at a select position in the deblocked image 1020 is calculated by summing the weighted pixels at the select position for each plane 1018 . For example, if the planes 1018 are represented as P 1 to P n , (i,j) are used to represent a position of each pixel within the original image 102 , and w n represents the weight, then the deblocked image 1020 may be expressed as P′(i,j) and generated from the planes 1018 as follows:
- the encoder module 1012 receives the pre-processed and deblocked image 1020 , and compress the pre-processed and deblocked image 1020 and generate a compressed image or date 1022 .
- the compressed image 1022 may then be transmitted, for example, to a decoder for display to a user by the communication interface 1006 .
- FIG. 11 is a flow diagram illustrating example processes associated with deblocking pre-processed image data prior to compression by an encoder according to some implementations.
- the processes are illustrated as a collection of blocks in a logical flow diagram, which represent a sequence of operations, some or all of which can be implemented in hardware, software or a combination thereof.
- the blocks represent computer-executable instructions stored on one or more computer-readable media that, which when executed by one or more processors, perform the recited operations.
- computer-executable instructions include routines, programs, objects, components, encryption, deciphering, compressing, recording, data structures and the like that perform particular functions or implement particular abstract data types.
- FIG. 11 is an example flow diagram showing an illustrative process 1100 for deblocking an image prior to encoding according to some implementations.
- image data is often pre-processed prior to compressing by a video encoder to improve the compression rate of the image data when compared with a non-processed image.
- many conventional systems utilize block based pre-processing systems that introduce imperceptible irregularities (e.g., data not noticeable or detectable by the human eye) along the edge of the blocks.
- the pre-processed image data including the imperceptible irregularities are then encoded and transmitted to a receiver (such as a set-top-box).
- a pre-processing system may receive an input frame or image to be transmitted in a compressed format.
- the pre-processing system may receive a frame from a video sequence being streamed to a set-top-box or other electronic device as part of a video streaming service.
- the pre-processing system may pre-process the input frame or image to generate a plurality of planes.
- the pre-processing system may generate a plurality of image planes representative of the frame or image received and perform the pre-processing on each of the individual planes. While the pre-processing of each plane may introduce imperceptible irregularities along the boundary of each block, the planes and/or blocks may be shifted with respect to the original frame or image such that each pixel (or in other cases, each interior pixel) of the original frame or image is an interior pixel of at least one block of at least one plane. Thus, mitigating the effect of the imperceptible irregularities introduced along the boundary of each block with respect to at least one copy of the pixel from the original frame or image.
- the pre-processing system may generate a deblocked image based at least in part on image data associated with the plurality of planes. For example, the pre-processing system may receive a plurality of planes representative of the original frame or image. Each of the planes having been pre-processed for compression. The pre-processing system may merge image data of one or more of the plurality of planes back into a single pre-processed and deblocked image. In general, the pre-processing system may select a pixel position corresponding to a pixel of the original image or frame and utilize the pixels of at least one block of at least one plane to generate the pixel of the pre-processed and deblocked image at the pixel position.
- the pre-processing system may identify each of the planes at which the pixel position exists and weight each pixel based on a distance from the center of a corresponding block and a weighting function.
- the pre-processed and deblocked image generated by the pre-processing system may contains image data that is least likely to experience edge effects during pre-processing, thus, reducing the effect that the imperceptible irregularities has on compression rates associated with the image data.
- the pre-processing system may compress the deblocked image and, at 1110 , the pre-processing system may transmit the compressed frame or image.
- the deblocked image may be compressed and transmitted via one or more networks.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
Description
- This application is a divisional of and claims priority to U.S. application Ser. No. 15/427,570, filed on Feb. 8, 2017 and entitled “METHOD AND SYSTEM FOR IMPROVED IMAGE COMPRESSION,” the entirety of which is incorporated herein by reference.
- Conventional compression systems, often perform pre-processing on visual data to remove undesirable noise from video or image sources prior to compressing the data. In some case, block based pre-processing may be performed to improve the overall compression associated with the visual data when compared to compression of non-processed visual data. However, the act of blocking in conventional systems often introduces edge effects or imperceptible irregularities that are not detectable by the human eye. Unfortunately, the imperceptible irregularities effectively introduce additional data prior to encoding that ultimately reduces a realized compression rate with respect to the visual data.
- The detailed description is described with reference to the accompanying figures. In the figures, the left-most digit(s) of a reference number identifies the figure in which the reference number first appears. The use of the same reference numbers in different figures indicates similar or identical components or features.
-
FIG. 1 illustrates an example block diagram of a system for performing deblocking on pre-compressed visual data according to some implementations. -
FIG. 2 illustrates an example graphical representation of deblocking planes generated with respect to an original image according to some implementations. -
FIG. 3 illustrates an example graphical representation of an original image with padding to accommodate shifting of the original image with respect to generation of one or more planes according to some implementations. -
FIG. 4 illustrates another example graphical representation of deblocking planes generated with respect to an original image according to some implementations. -
FIG. 5 illustrates an example graphical representation of weighting a pixel within a block of a deblocking plane according to some implementations. -
FIG. 6 another illustrates an example graphical representation of weighting a pixel within a block of a deblocking plane according to some implementations. -
FIG. 7 illustrates another example graphical representation of weighting a pixel within a block of a deblocking plane according to some implementations. -
FIG. 8 illustrates yet another example graphical representation of weighting a pixel within a block of a deblocking plane according to some implementations. -
FIG. 9 illustrates an example a deblocking unit for deblocking pre-processed planes according to some implementations. -
FIG. 10 illustrates example components an electronic device that may be configured to perform deblocking on an image prior to encoding according to some implementations -
FIG. 11 is an example flow diagram showing an illustrative process for deblocking an image prior to encoding according to some implementations. - This disclosure includes techniques and implementations for deblocking of image data, including spatiotemporal three-dimensional video sequences, to improve effective compressing rates realized by a video encoder. For example, image data is often pre-processed prior to compressing by a video encoder to improve the compression rate of the image data when compared with a non-processed image. However, many conventional systems utilize block based pre-processing systems that introduce imperceptible irregularities (e.g., data not noticeable or detectable by the human eye) along the edge of the blocks. The pre-processed image data including the imperceptible irregularities are then encoded and transmitted to a receiver (such as a set-top-box). Unfortunately, in the conventional system, the encoding of the imperceptible irregularities results in a reduction in the overall compression of the image data, increasing bandwidth usage, and overall costs associated with transmitting data. Thus, described herein, is a system and techniques to remove the imperceptible irregularities using deblocking on the pre-processed image data prior to compression by the encoder.
- For example, in one implementation, the image data may be provided to the pre-processor to improve the overall compression rate of the image data. The pre-processed data may then be deblocked either by the pre-processor or by the encoder prior to compressing. In some cases, the deblocking may include receiving a plurality of planes, each representative of at least a portion of the image data. In one example, each plane may be offset from a top left corner of the image, designated at (0,0), by X and Y coordinate combinations formed based on a predetermined offset value. For instance, one possible series of offsets for an 8×8 image with a preprocess that utilizes a block size of 4×4 may be based on a predetermined offset value of two (e.g., half the block size). In this example, the offset values for X and Y may be (0,0), (2,0), (0,2), and (2,2).
- In another specific example, the number of planes for an image may set to four. In this example, the offset may be based on a value other than half the block size as discussed above. For instance, the plane size may be based on the size of the predetermined offset value and the size of the plane. Thus, if the image has a size of ten by ten and the predetermined offset value of two, the plane size may be set to eight by eight and the coordinates of each plane offset from (0,0) at the top left corner may be (0,0), (0,2), (2,0), and (2,2). Using these offset coordinates for the top left corner of each plane having a size of eight by eight (e.g., the size of the image minus the offset value), results in four overlapping planes in which none of the interior pixels of the image are boundary pixels for all of the planes (e.g., each interior pixel of the image is also an interior pixel for at least one of the planes).
- For example, in one implementation, the image data may be provided to the pre-processor to improve the overall compression rate with respect to the original image data. The pre-processed data may then be deblocked either by the pre-processor or by the encoder prior to compressing. In one particular example, the deblocking unit may receive a single plane representing at least a portion of the image to be deblocked, and may or may not carry associated data indicating the size and alignment of a plurality of processed blocks comprising the plane. In this case, other deblocking methods may be employed to determine block boundaries and remove pre-processing edge artifacts, rendering the plane more compressible.
- In another specific example, the number of planes for an image may set to four and the offset set to half the block size. For instance, if the image has a size of eight by eight and the predetermined offset value of four, the planes may be offset by X, Y values of (0,0), (0,2), (2,0), and (2,2), as discussed above. However, in this example, the boundary pixels may be ignored as in some systems (e.g., video processing systems) boundary pixels may not affect the compressibility of the image, as some types of pre-processing, such as motion estimation, do not extend outside the boundary of the image. Thus, in this example, the blocks for the plane at (0,0) may be four by four, the blocks for the plane at (0,2) may be four by four or four by two, the blocks for the plane at (2,0) may be two by four or four by four, and the blocks for the plane at (2,2) may be two by two, two by four, four by four, or four by two. Again in this example, the blocks of each of the four planes overlap such that none of the interior pixels of the image are boundary pixels for all of the blocks over all of the planes (e.g., each interior pixel of the image is also an interior pixel for at least one block within at least one plane).
- In yet another specific example, the number of planes for an image may set to four. In this example, the plane size may be based on the size of the predetermined offset value and the size of a padded version of the original image. Unlike the example above, in this case, the padding may cause boundary pixels of the original image to be interior pixels of at least one block of at least one plane. For example, the pre-processor may extend or shift the image by mirroring the edges pixels or padding the original image. For instance, the padding may be equal to half a block size or in this case of a block size of 4×4, the padding may add two additional pixels around the exterior of the original image. Thus, if the original image has a size of eight by eight and the pre-process has a block size of 4×4, the top left corner of a block within each plane offset from (0,0) of the original image may be (−2,−2), (−2,0), (0,−2), and (0,0). Thus, by extending the planes beyond the boundary of the original image, the four planes overlap such that none of the interior pixels nor the exterior pixels of the original image are boundary pixels for all of the blocks over all of the planes (e.g., each pixel of the original image is also an interior pixel for at least one block of at least one plane).
-
FIG. 1 illustrates an example block diagram of asystem 100 for performing deblocking on pre-compressed visual data according to some implementations. For instance, animage 102, such as one image of a video sequence, may be received by a pre-processor 104, to process theimage 102 in a manner to improve compressibility by the encoder orvideo encoder 106. In some cases, the pre-processing of theoriginal image 102 by the pre-processor 104 may introduce imperceptible irregularities (e.g., data not noticeable or detectable by the human eye) along the edge of the block, particularly when block based pre-processing is performed. In conventional systems, these imperceptible irregularities may introduce additional data to be compressed, encoded, or otherwise processed by theencoder 106 reducing an overall achievable compression ratio and increasing the overall bandwidth usage of transmitting theimage 102 or the video sequence associated with theimage 102. - In this example, unlike conventional systems, the pre-processor 104 may generate a plurality of
planes 108 representative of shifted versions of theoriginal image 102 and perform the pre-processing on each of theindividual planes 108. While the pre-processing of eachplane 108 by the pre-processor 104 may introduce imperceptible irregularities along the boundary of blocks of eachplane 108, theplanes 108 may be shifted with respect to the original image such that each pixel (or in other cases, each interior pixel) of theoriginal image 102 is an interior pixel of at least one block of at least oneplane 108. Thus, mitigating the effect of the imperceptible irregularities introduced along the boundary of each block with respect to at least one copy of the pixel from theoriginal image 102. - The
planes 108 are then provided to adeblocking unit 110. Thedeblocking unit 110 combines the plurality ofplanes 108 back into a single pre-processed anddeblocked image 112. In general, thedeblocking unit 110 may select a pixel position corresponding to a pixel of theoriginal image 102 and utilize the pixels of at least one block of at least oneplane 108 to generate the pixel of the pre-processed anddeblocked image 112 at the pixel position. For example, thedeblocking unit 112 may identify each of the blocks within one of theplanes 108 at which the pixel position exists and weight each pixel based on a distance from the center of the corresponding block. Thus, the closer a pixel is to a boundary of the corresponding block the less weight the pixel is given by thedeblocking unit 110. As such, the pre-processed anddeblocked image 112 generated by thedeblocking unit 110 contains image data that is least likely to experience edge effects during pre-processing, thus, reducing the effect that the imperceptible irregularities has on compression byencoder 106. - For instance, in one implementation, the
deblocking unit 110 may determine a weight associated with a pixel of a block based on a pixel's position relative to the center of the block, as pixels near the center of a block contain more accurate estimates than pixels near the edges of the block. Thus, the pixels that are near the center of the block are assigned a higher weight. In some examples, a sum of the weights across all blocks having a pixel at a corresponding pixel position may be equal to 1.0. For instance, if Ndifferent planes 108 are used, the final pixel weight of a pixel at a selected position in thedeblocked image 112 is calculated by summing the weighted pixels at the selected position for eachplane 108. For example, if theplanes 108 are represented as P1 to Pn, (i,j) are used to represent a position of each pixel within theoriginal image 102, and wn represents the weight based on the distance from the center of a block, then thedeblocked image 112 may be expressed as P′(i,j) and generated from theplanes 108 as follows: -
- Once, the
encoder 106 receives the pre-processed anddeblocked image 112, theencoder 106 may compress the pre-processed anddeblocked image 112 and generate a compressed image ordate 114. Thecompressed image 114 may then be transmitted, for example, to a decoder for display to a user. -
FIG. 2 illustrates an examplegraphical representation 200 of deblocking planes generated with respect to anoriginal image 202 havingpixels 204 according to some implementations. In the current example, thepixels 204 may be referred to with respect to a position offset from apixel 206 at position (0,0). Thus, the pixel 208 may have a position of (1,4) (e.g., one pixel to the right of thepixel 206 and four pixels below the pixel 206). - In the current example, planes, such as
210, 212, 214, and 216, may be generated by a pre-processor. During the pre-processing the planes 210-216 may be processed according to blocks, such as the illustrated blocks 218-268. For instance, in the illustrated example, theplanes plane 210 may include blocks 218-224, theplane 212 may include the blocks 226-236, theplane 214 may include the blocks 238-254, and theplane 216 may include blocks 256-266. Thus, as illustrated, the pre-processor may generate the planes 210-216 such that eachpixel 204 of theoriginal image 202 are interior pixels of at least one block of at least one block of one of the planes 210-216, as shown via the shading of thepixels 204 within the planes 210-216. In the current example, the planes are not illustrated with padding. However, if padding was used to extend the boundary of theoriginal image 202 prior to processing by the pre-processor, each of the shaded pixels within a plane would correspond to an interior pixel of at least one block 218-266. In this example, a deblocked image (not shown) may be formed by using only interior pixels of the block 218-266 to further reduce the effect of the imperceptible irregularities on the compression rates associated with theoriginal image 202. In other examples, thepixels 204 may be weighted based on a distance from the edge of a block 218-266 containing the pixel being weighted. The pixels may then be used to form the deblocked image based on the weight assigned. -
FIG. 3 illustrates an examplegraphical representation 300 of anoriginal image 302 with padding, generally indicated by dashedline 304, to accommodate shifting of the original image with respect to generation of one or more planes according to some implementations. For example, the pre-processor or other image processing component may be configured to extend the border or edges of theimage 302 by mirroring or adding pixel, such asshaded pixels 306, around the exterior of the image pixels, such aspixels 308. In this manner, eachpixel 308 of the original image may be an interior pixel of at least one block of at least one plane as will be discussed in more detail below with respect toFIG. 4 . -
FIG. 4 illustrates another examplegraphical representation 400 of deblocking planes 402-408 generated with respect to padded version of theoriginal image 302 ofFIG. 3 according to some implementations. In this example, thegraphical representation 400 illustrates apixel 410 of an original image 412 within each of the deblocking planes 402-408 and adeblocked image 414. In the current illustration, thepixel 410 is within a different block of each of the planes 402-408. For example, withinplane 402, thepixel 410 is within theblock 416, withinplane 404, thepixel 410 is within theblock 418, withinplane 406, thepixel 410 is within theblock 420, and withinplane 402, thepixel 410 is within theblock 422. However, thepixel 410 is in the same position in each plane 402-408 with respect to the (0,0) pixel 424 of the original image 412 (e.g., thepixel 410 in each plane 402-408 is at a position (3,3) or three pixels to the right of pixel 424 and three pixels below the pixel 424). - In general, when the deblocking unit selects, weights, and combines the image data from each of the planes 402-408 to generate the
deblocked image 414, the deblocking unit identifies each plane 402-408 having a pixel at a select position of the original image 412 and weights each of the identified pixels based on a distance from the center of the corresponding block, such as blocks 416-422. For instance, in the current example, thepixel 410 is nearer the center of theblock 416 than thepixel 410 within the 418, 420, and 422. Thus the data from theblocks plane 402 associated with thepixel 410 may have a higher weight than thecorresponding pixel 410 within the planes 404-408, and thus will contribute more data to thepixel 410 within the deblocked image 412. Similarly, thepixel 410 within planes 404-408 will have a lower weight than thepixel 410 withinplane 402 and, thus, will contribute less image data to the deblocked image 412 with respect topixel 410. - Additionally, in this example, the pixels of the planes 402-408 extend beyond the edge of the original image 412 in each case. In this manner, the edge pixels, such as pixel 424 at (0,0), may be an interior pixel of at least one of the blocks (e.g., in this example block 426 of plane 408). In some cases, the pre-processor may generate the additional pixels of the planes 402-08 by padding, mirroring, or extending the original image 412 prior to performing additional pre-processing operations on each plane 402-408.
-
FIG. 5 illustrates an examplegraphical representation 500 of weighting apixel 502 within ablock 506 of adeblocking plane 504 according to some implementations In this example, weights may be assigned to individual pixels within a block, such asblock 506 based on a distance, such as adistance 508, between theindividual pixel 502 and a center of theblock 510. In this example, the weight of a pixel may be calculated as: -
w(i,j)=(B2−d)/B2 - where w(i,j) is the weight of a pixel at position (i,j), d is the distance, and B2 is half the preprocessor block size (or plane size). Thus, in the current example, the weight of the
pixel 502 is being determined using thedistance 508. -
FIG. 6 illustrates another examplegraphical representation 600 of weighting apixel 602 within ablock 604 of a deblocking plane according to some implementations. For instance, in the illustrated example, ablock 604 of the plane is shown. In this example, theblock 604 is a block having a size of an eight by eight. In this example, the weights may be assigned to individual pixels within theblock 604 based on a distance, such as adistance 606, between theindividual pixel 602 and acenter 608 of theblock 604. Similar to the weighting discussed above, the weight of a pixel may be calculated as: -
w(i,j)=(B2−d)/B2 - where w(i,j) is the weight of a pixel at position (i,j), d is the distance, and B2 is half the preprocessor block size (or plane size). Thus, in the current example, the weight of the
pixel 602 is being determined using thedistance 606. - In the current example, by using the equation above to weight the pixels of the
deblocking plane 604, the pixels along the edge, generally indicated byshaded pixels 610, may be less than zero. In the case, a pixel has a value of less than zero, the weight may be set to zero. In the current example, thepixels 610 having a weight of zero are the pixels that fall along or outside of theline 612. -
FIG. 7 illustrates an examplegraphical representation 700 of weighting apixel 702 within ablock 704 of a deblocking plane according to some implementations. For instance, in the illustrated example, ablock 704 of the plane is shown and weights are again assigned to individual pixels of theblock 704 based on a distance, such as adistance 706, between the individual pixel and acenter 708 of theblock 704. However, in this example, the weight of a pixel may be calculated as: -
w(i,j)=(R2−d)/R2 -
R2=B2*√{square root over (2.0)} - where w(i,j) is the weight of a pixel at position (i,j), d is the distance, and R2 is the distance from the corner to the center of the
block 704 times the square root of two.
Thus, the corner pixels of a plane will have the lowest weight. In the current example, the weight of thepixel 702 is being determined using thedistance 706. In this example, the weights are greater than zero as the zero condition is shown byline 710. -
FIGS. 6 and 7 illustrate two examples weights that may be used by a deblocking unit to generate a deblocked image from a plurality of planes. However, it should be understood that other weighting metrics may be used. For example, the weight of a pixel may be determined to be proportional to the cosine of the distance from the edge of theblock 704 rather than being linearly related to the distance from thecenter 708. For example, the weight may be calculated as follows: -
w(i,j)=cos((π/2.0)*(B2−d)/B2) - In another example, the weight of a pixel may be determined to be proportional to the cosine of the distance from the corner of a block as follows:
-
w(i,j)=cos((π·2.0)*(R2−d)/R2) - In another example, only the middle pixels in the center of each block may be used to generated the deblocked image. In this case, each additional pixel may be ignored or assigned a weight of 0.0. For instance, in one particular example, using a block size of eight by eight only the four center pixels may be used from each plane when generating a deblocked image.
- In yet another example, each plane may be given an equal weight or set to a value of 1/K where K is the number of planes. It should be understood that additional weighting techniques may be utilized by the deblocking unit to merge or combine image data on a pixel by pixel level. In each case, the weight of corresponding pixels across the planes may be summed to a value of one.
-
FIG. 8 illustrates an examplegraphical representation 800 of deblocking planes 802-830 generated with respect to anoriginal image 832 and adeblocked image 834 according to some implementations. In this example, the planes 802-834 may include blocks having a five by five size having a starting positioned shifted by one pixel for each plane 802-830. In the current illustration, the highlightedpixel 836 is shown in theoriginal image 832, thedeblocked image 834, and planes 802-830 with respect to a shifted block within the planes. -
FIG. 9 illustrates an example adeblocking unit 900 for deblocking pre-processed planes 902 according to some implementations. For instance, in the current example, a pre-processor (not shown) may have generated the plurality of planes 902 with respect to an original image to improve the overall compression rate with respect to the original image, as discussed above. Each of the planes may represent a portion of the original image data after the pre-processing operations are performed. For example, each of the planes 902 may be a shifted representation of the original image with additional pixels added as padding to accommodate the shifting. In other cases, the planes 902 may have less image data or be of a smaller size than the original image. Further, a size of amount of data or position of the blocks within each plane 902 associated with the original image may vary. - The
deblocking unit 900 may include several modules or components, such as apixel selection unit 904, apixel weighting unit 906, and amerge unit 908. For example, thepixel selection unit 904 may be configured to select a current pixel to process and identify each plane 902 that has image data associated with the selected pixel. - The
pixel weighting unit 906 may be configured to determine a weight to assign to corresponding pixels within each plane 902 contributing image data to a particular pixel of thedaglocked image 910. For instance, as discussed above, weights are assigned to individual pixels of the planes 902 based on a distance between the pixel selected by the pixel selecting unit and a center of a corresponding block within a plane 902. In one example, the weight of a pixel may be calculated as: -
w(i,j)=(B2−d)/B2 - where w(i,j) is the weight of a pixel at position (i,j), d is the distance, and B2 is half the preprocessor block size. In another example, weights may be calculated as:
-
w(i,j)=(R2−d)/R2 -
R2=B2*√{square root over (2.0)} - where w(i,j) is the weight of a pixel at position (i,j), d is the distance, and R2 is the distance from the corner to the center of the corresponding block times the square root of two. In other examples, the weight of a pixel may be determined to be proportional to the cosine of the distance from the edge of the corresponding plane 902 rather than being linearly related to the distance from the center. For instance, the weight may be calculated as follows:
-
w(i,j)=cos((π/2.0)*(B2−d)/B2) - In other cases, the weight of a pixel may be determined to be proportional to the cosine of the distance from the corner of a block as follows:
-
w(i,j)=cos((π·2.0)*(R2−d)/R2) - In yet other examples, only the middle pixels in the center of each block may be used to generated the deblocked image or pixels of each plane 902 may be given an equal weight, such as set to a value of 1/K where K is the number of planes 902.
- The
merger unit 908 may be configured to merge the image data of the pixels from each plane 902 corresponding to the pixel selected by thepixel selecting unit 904 according to the weight applied. For example, if the planes 902 are represented as P1 to Pn, (i,j) are used to represent a position of each pixel within the original image, and wn(i,j) represents the weight of the pixel at the position (i,j), then thedeblocked image 910 may be expressed as P′(i,j) and generated by themerge unit 908 from the pixels of the planes 902 as follows: -
-
FIG. 10 illustrates example components anelectronic device 1000 that may be configured to perform deblocking on an image prior to encoding according to some implementations. For example,electronic device 1000 may include processing resources, as represented byprocessors 1002, and computer-readable storage media 1004. The computer-readable storage media 1004 may include volatile and nonvolatile memory, removable and non-removable media implemented in any method or technology for storage of information, such as computer-readable instructions, data structures, program modules, or other data. Such memory includes, but is not limited to, RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, RAID storage systems, or any other medium which can be used to store the desired information and which can be accessed by a computing device. - The
electronic device 1000 may also include one ormore communication interfaces 1006, which may support both wired and wireless connection to various networks, such as cellular networks, radio (e.g., radio-frequency identification RFID), WiFi networks, short-range or near-field networks (e.g., Bluetooth®), infrared signals, local area networks, wide area networks, the Internet, and so forth. For example, thecommunication interfaces 1006 may allow theelectronic device 1000 to send or stream compressed video or image data over one or more networks, such as the Internet®. - Several modules, sets of instructions, data stores, and so forth may be stored within the computer-
readable media 1004 and configured to execute on theprocessors 1002. For example, apre-processing module 1008, adeblocking module 1010, and anencoding module 1012, as well asother modules 1014. In some implementations, the computer-readable media 1004 may store data, such as input images ororiginal image 1016,image planes 1018,deblocked images 1020, compressedimages 1022,block sizes 1024, and weighting functions 1026. - The
pre-processing module 1008 may generate a plurality ofimage planes 1018 representative of anoriginal image 1016 and perform the pre-processing on each of theindividual planes 1018. While the pre-processing of eachplane 1018 by thepre-processing module 1008 may introduce imperceptible irregularities along the boundary of each block, theplanes 1018 and/or blocks within eachplane 1018 may be shifted with respect to theoriginal image 1016 such that each pixel (or in other cases, each interior pixel) of theoriginal image 1016 is an interior pixel of at least one block of at least oneplane 1018. Thus, mitigating the effect of the imperceptible irregularities introduced along the boundary of each block with respect to at least one copy of the pixel from theoriginal image 1016. - The
deblocking module 1010 may receive a plurality ofplanes 1018 representative of theoriginal image 1016 and merge image data of one or more of the plurality ofplanes 1018 back into a single pre-processed anddeblocked image 1024. In general, thedeblocking module 1010 may select a pixel position corresponding to a pixel of theoriginal image 1016 and utilize the pixels of at least one block of at least oneplane 1018 to generate the pixel of the pre-processed anddeblocked image 1020 at the pixel position. For example, thedeblocking module 1010 may identify each of theplanes 1018 at which the pixel position exists and weight each pixel based on a distance from the center of the correspondingplane 1018 and aweighting function 1026. Thus, the closer a pixel is to a boundary of the correspondingplane 1018 the less weight the pixel is given by thedeblocking module 1010. As such, the pre-processed anddeblocked image 1020 generated by thedeblocking module 1010 contains image data that is least likely to experience edge effects during pre-processing, thus, reducing the effect that the imperceptible irregularities has on compression byencoder module 1012. - For instance, in one implementation, the
deblocking module 1010 may determine a weight associated with a pixel of aplane 1018 based on aweighting function 1026 associated with a distance from the center of a block. For example, the pixels that are near the center of a block may be assigned a higher weight than the pixel near the edge of the block. - In some implementations, the
deblocking module 1010 may sum of the weights across allplanes 1018 corresponding to a single pixel position may be equal to 1.0. For instance, if Ndifferent planes 1018 are used, the final pixel weight of a pixel at a select position in thedeblocked image 1020 is calculated by summing the weighted pixels at the select position for eachplane 1018. For example, if theplanes 1018 are represented as P1 to Pn, (i,j) are used to represent a position of each pixel within theoriginal image 102, and wn represents the weight, then thedeblocked image 1020 may be expressed as P′(i,j) and generated from theplanes 1018 as follows: -
- The
encoder module 1012 receives the pre-processed anddeblocked image 1020, and compress the pre-processed anddeblocked image 1020 and generate a compressed image ordate 1022. Thecompressed image 1022 may then be transmitted, for example, to a decoder for display to a user by thecommunication interface 1006. -
FIG. 11 is a flow diagram illustrating example processes associated with deblocking pre-processed image data prior to compression by an encoder according to some implementations. The processes are illustrated as a collection of blocks in a logical flow diagram, which represent a sequence of operations, some or all of which can be implemented in hardware, software or a combination thereof. In the context of software, the blocks represent computer-executable instructions stored on one or more computer-readable media that, which when executed by one or more processors, perform the recited operations. Generally, computer-executable instructions include routines, programs, objects, components, encryption, deciphering, compressing, recording, data structures and the like that perform particular functions or implement particular abstract data types. - The order in which the operations are described should not be construed as a limitation. Any number of the described blocks can be combined in any order and/or in parallel to implement the process, or alternative processes, and not all of the blocks need be executed. For discussion purposes, the processes herein are described with reference to the frameworks, architectures and environments described in the examples herein, although the processes may be implemented in a wide variety of other frameworks, architectures or environments.
-
FIG. 11 is an example flow diagram showing an illustrative process 1100 for deblocking an image prior to encoding according to some implementations. For instance, image data is often pre-processed prior to compressing by a video encoder to improve the compression rate of the image data when compared with a non-processed image. However, many conventional systems utilize block based pre-processing systems that introduce imperceptible irregularities (e.g., data not noticeable or detectable by the human eye) along the edge of the blocks. The pre-processed image data including the imperceptible irregularities are then encoded and transmitted to a receiver (such as a set-top-box). Unfortunately, in the conventional system, the encoding of the imperceptible irregularities results in a reduction in the overall compression of the image data, increasing bandwidth usage, and overall costs associated with transmitting data. Thus, described herein, is a process 1100 to remove the imperceptible irregularities from pre-processed image data by deblocking a plurality of copies or planes of the pre-processed data. - At 1102, a pre-processing system may receive an input frame or image to be transmitted in a compressed format. For example, the pre-processing system may receive a frame from a video sequence being streamed to a set-top-box or other electronic device as part of a video streaming service.
- At 1104, the pre-processing system may pre-process the input frame or image to generate a plurality of planes. For example, the pre-processing system may generate a plurality of image planes representative of the frame or image received and perform the pre-processing on each of the individual planes. While the pre-processing of each plane may introduce imperceptible irregularities along the boundary of each block, the planes and/or blocks may be shifted with respect to the original frame or image such that each pixel (or in other cases, each interior pixel) of the original frame or image is an interior pixel of at least one block of at least one plane. Thus, mitigating the effect of the imperceptible irregularities introduced along the boundary of each block with respect to at least one copy of the pixel from the original frame or image.
- At 1104, the pre-processing system may generate a deblocked image based at least in part on image data associated with the plurality of planes. For example, the pre-processing system may receive a plurality of planes representative of the original frame or image. Each of the planes having been pre-processed for compression. The pre-processing system may merge image data of one or more of the plurality of planes back into a single pre-processed and deblocked image. In general, the pre-processing system may select a pixel position corresponding to a pixel of the original image or frame and utilize the pixels of at least one block of at least one plane to generate the pixel of the pre-processed and deblocked image at the pixel position. For example, the pre-processing system may identify each of the planes at which the pixel position exists and weight each pixel based on a distance from the center of a corresponding block and a weighting function. Thus, the closer a pixel is to a boundary of the corresponding block the less weight the pixel is given by the pre-processing system during deblocking. As such, the pre-processed and deblocked image generated by the pre-processing system may contains image data that is least likely to experience edge effects during pre-processing, thus, reducing the effect that the imperceptible irregularities has on compression rates associated with the image data.
- At 1108, the pre-processing system may compress the deblocked image and, at 1110, the pre-processing system may transmit the compressed frame or image. For example, the deblocked image may be compressed and transmitted via one or more networks.
- Although the subject matter has been described in language specific to structural features, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features described. Rather, the specific features are disclosed as illustrative forms of implementing the claims.
Claims (20)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/533,992 US20190364291A1 (en) | 2017-02-08 | 2019-08-07 | Method and system for improved image compression |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US15/427,570 US10419771B2 (en) | 2017-02-08 | 2017-02-08 | Method and system for improved image compression |
| US16/533,992 US20190364291A1 (en) | 2017-02-08 | 2019-08-07 | Method and system for improved image compression |
Related Parent Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US15/427,570 Division US10419771B2 (en) | 2017-02-08 | 2017-02-08 | Method and system for improved image compression |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20190364291A1 true US20190364291A1 (en) | 2019-11-28 |
Family
ID=61257109
Family Applications (2)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US15/427,570 Expired - Fee Related US10419771B2 (en) | 2017-02-08 | 2017-02-08 | Method and system for improved image compression |
| US16/533,992 Abandoned US20190364291A1 (en) | 2017-02-08 | 2019-08-07 | Method and system for improved image compression |
Family Applications Before (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US15/427,570 Expired - Fee Related US10419771B2 (en) | 2017-02-08 | 2017-02-08 | Method and system for improved image compression |
Country Status (2)
| Country | Link |
|---|---|
| US (2) | US10419771B2 (en) |
| WO (1) | WO2018148179A1 (en) |
Families Citing this family (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112261408B (en) * | 2020-09-16 | 2023-04-25 | 青岛小鸟看看科技有限公司 | Image processing method and device for head-mounted display equipment and electronic equipment |
Family Cites Families (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP3646853B2 (en) * | 1999-02-12 | 2005-05-11 | Kddi株式会社 | Multi-path image transmission device |
| SG140508A1 (en) | 2006-08-31 | 2008-03-28 | St Microelectronics Asia | Multimode filter for de-blocking and de-ringing |
| US8792564B2 (en) * | 2008-10-28 | 2014-07-29 | Sony Corporation | Adaptive preprocessing method using feature-extracted video maps |
| US8644389B2 (en) | 2009-05-15 | 2014-02-04 | Texas Instruments Incorporated | Real-time video image processing |
| US8644374B2 (en) * | 2009-08-31 | 2014-02-04 | Cisco Technology, Inc. | Multiple description coding with spatial shifting |
| US9438881B2 (en) | 2010-07-19 | 2016-09-06 | Dolby Laboratories Licensing Corporation | Enhancement methods for sampled and multiplexed image and video data |
| US10075737B2 (en) * | 2011-08-26 | 2018-09-11 | Qualcomm Incorporated | Method and apparatus for shift DCT-based sharpening of a video image |
| BR112014008270A2 (en) * | 2011-10-10 | 2017-04-18 | Koninklijke Philips Nv | video method and device for processing a three-dimensional video signal, computer program, and computer readable medium |
| WO2013141609A1 (en) * | 2012-03-20 | 2013-09-26 | 삼성전자 주식회사 | Method and device for encoding scalable video on basis of encoding unit of tree structure, and method and device for decoding scalable video |
| US8942467B2 (en) * | 2012-03-23 | 2015-01-27 | Mitsubishi Electric Research Laboratories, Inc. | Method for reducing blocking artifacts in images |
| WO2014163241A1 (en) * | 2013-04-02 | 2014-10-09 | 주식회사 칩스앤미디어 | Method and apparatus for processing video |
| WO2015058320A1 (en) * | 2013-10-25 | 2015-04-30 | Acamar Corporation | Denoising raw image data using content adaptive orthonormal transformation with cycle spinning |
| US9995160B2 (en) * | 2014-12-22 | 2018-06-12 | General Electric Company | Airfoil profile-shaped seals and turbine components employing same |
| US9955160B1 (en) * | 2015-04-27 | 2018-04-24 | Harmonic, Inc. | Video encoding using adaptive pre-filtering |
| CN117061736A (en) * | 2017-01-13 | 2023-11-14 | 谷歌有限责任公司 | Composite prediction for video coding |
-
2017
- 2017-02-08 US US15/427,570 patent/US10419771B2/en not_active Expired - Fee Related
-
2018
- 2018-02-06 WO PCT/US2018/016984 patent/WO2018148179A1/en not_active Ceased
-
2019
- 2019-08-07 US US16/533,992 patent/US20190364291A1/en not_active Abandoned
Also Published As
| Publication number | Publication date |
|---|---|
| WO2018148179A1 (en) | 2018-08-16 |
| US20180227587A1 (en) | 2018-08-09 |
| US10419771B2 (en) | 2019-09-17 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN111047516B (en) | Image processing method, image processing device, computer equipment and storage medium | |
| US10003768B2 (en) | Apparatus and methods for frame interpolation based on spatial considerations | |
| TWI669939B (en) | Method and apparatus for selective filtering of cubic-face frames | |
| EP3648462B1 (en) | Image processing method and device | |
| EP2916543B1 (en) | Method for coding/decoding depth image and coding/decoding device | |
| US20170061592A1 (en) | Methods, systems and apparatus for over-exposure correction | |
| US8244054B2 (en) | Method, apparatus and integrated circuit capable of reducing image ringing noise | |
| US20100201870A1 (en) | System and method for frame interpolation for a compressed video bitstream | |
| US9294676B2 (en) | Choosing optimal correction in video stabilization | |
| US8885969B2 (en) | Method and apparatus for detecting coding artifacts in an image | |
| JP2008527932A (en) | Nonlinear In-Loop Denoising Filter for Quantization Noise Reduction in Hybrid Video Compression | |
| JP2012516637A5 (en) | ||
| CN111131837B (en) | Motion compensation correction method, encoding method, encoder, and storage medium | |
| US10706507B2 (en) | Hybrid denoising of images and videos based on interest metrics | |
| US20130051476A1 (en) | Video compression system and method using differencing and clustering | |
| CN115409716B (en) | Video processing method, device, storage medium and equipment | |
| JPH08154251A (en) | Image signal interpolation device | |
| JP5950605B2 (en) | Image processing system and image processing method | |
| US8111939B2 (en) | Image processing device and image processing method | |
| US20190364291A1 (en) | Method and system for improved image compression | |
| JP2024516550A (en) | Learning-Based Point Cloud Compression with Tearing Transform | |
| CN110677728B (en) | Method, device and equipment for playing video and storage medium | |
| CN114513662A (en) | QP (quantization parameter) adaptive in-loop filtering method and system, electronic equipment and storage medium | |
| JP5102810B2 (en) | Image correction apparatus and program thereof | |
| US9503756B2 (en) | Encoding and decoding using perceptual representations |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: APPLICATION DISPATCHED FROM PREEXAM, NOT YET DOCKETED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
| AS | Assignment |
Owner name: ZPEG, INC., TEXAS Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:WESTWATER, RAYMOND J.;PERRY, JEFFREY S.;REEL/FRAME:052452/0187 Effective date: 20170207 |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: FINAL REJECTION MAILED |
|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |