RU2467499C2

RU2467499C2 - Method of compressing digital video stream in television communication channel

Info

Publication number: RU2467499C2
Application number: RU2010137076/07A
Authority: RU
Inventors: Владимир Григорьевич Балобанов (RU); Владимир Григорьевич Балобанов; Вадим Николаевич Безруков (RU); Вадим Николаевич Безруков; Андрей Владимирович Балобанов (RU); Андрей Владимирович Балобанов
Priority date: 2010-09-06
Filing date: 2010-09-06
Publication date: 2012-11-20
Also published as: RU2010137076A

Abstract

FIELD: physics.

SUBSTANCE: invention relates to digital television and particularly to compression of a digital video stream in a television communication channel. A field sequence is divided into groups of three types: O-fields, which act as the reference; N-even fields, which are encoded via prediction based on the previous field within one frame; M-odd fields, which are coded with prediction based on the previous odd field from another frame. To improve visual quality of the image according to the disclosed method, odd and even fields change places in the entire video sequence or group of frames. Frames with a higher image definition are formed as a result. To increase efficiency of compressing a digital stream, the disclosed method employs cutting of readings in N-fields horizontally and vertically with subsequent restoration thereof at the receiving side, wherein two neighbouring N-fields form a full half-frame (field).

EFFECT: communication channel bandwidth reduction, as well as high efficiency of digital conversion of a video signal, lying in additional cutting of the volume of digital information and increasing visual quality of the image, with sufficiently easy implementation.

13 dwg, 1 tbl

Description

Предлагаемое изобретение относится к области цифрового телевидения и может быть использовано для сжатия цифрового потока видеосигнала в телевизионном канале связи.The present invention relates to the field of digital television and can be used to compress the digital video stream in a television communication channel.

Проблема сокращения цифрового потока в телевидении сохраняет свою актуальность и в настоящее время. Широкая полоса частот, занимаемая цифровым сигналом, является главным препятствием при передаче его по каналам связи. В то же время цифровые методы позволяют создать новый тип ТВ аппаратуры, более надежный, стабильный, компактный и технологичный.The problem of reducing the digital stream in television remains relevant today. The wide frequency band occupied by the digital signal is the main obstacle when transmitting it through communication channels. At the same time, digital methods make it possible to create a new type of TV equipment, more reliable, stable, compact and technologically advanced.

В настоящее время известно более 100 способов преобразования аналогового сигнала изображения в цифровой. Все способы основаны на использовании одного из видов модуляции: импульсно-кодовой (ИКМ), дифференциальной импульсно-кодовой (ДИКМ) и дельта-модуляции (ДМ). Кроме того, способы различаются видом обработки аналогового сигнала перед аналого-цифровым преобразованием (АЦП) и видом последующей обработки первичного цифрового сигнала. На всех этапах преобразования сигнала учитываются свойства изображения и зрительного анализатора, что позволяет достигнуть максимальной эффективности процесса преобразования сигнала в целом.Currently, more than 100 methods for converting an analog image signal to digital are known. All methods are based on the use of one of the types of modulation: pulse-code (PCM), differential pulse-code (DPCM) and delta modulation (DM). In addition, the methods differ in the type of processing of the analog signal before analog-to-digital conversion (ADC) and the type of subsequent processing of the primary digital signal. At all stages of the signal conversion, the properties of the image and the visual analyzer are taken into account, which allows to achieve maximum efficiency of the signal conversion process as a whole.

Резервы для уменьшения цифрового потока без ущерба качеству воспроизводимого изображения заключены в специфике ТВ сигнала, обладающего значительной информационной избыточностью, которую условно делят на статистическую, физиологическую (психофизическую), пространственную (внутрикадровую) и временную. Следует также отметить, что резких границ здесь нет. Психофизическая и статическая избыточности связаны с временной и пространственной избыточностями.Reserves to reduce the digital stream without compromising the quality of the reproduced image are contained in the specifics of the TV signal, which has significant information redundancy, which is conditionally divided into statistical, physiological (psychophysical), spatial (intraframe) and temporal. It should also be noted that there are no sharp boundaries. Psychophysical and static redundancies are associated with temporal and spatial redundancies.

Примером сокращения психофизической избыточности может служить способ передачи градаций яркости от крупных и мелких деталей (внутрикадровая избыточность) и особенности восприятия движущихся объектов - временная (межкадровая избыточность).An example of a reduction in psychophysical redundancy can be a method of transmitting gradations of brightness from large and small parts (intraframe redundancy) and features of the perception of moving objects - temporary (interframe redundancy).

В большинстве случаев видеопоследовательности (ТВ изображение) содержат избыточность в двух направлениях - временном и пространственном. Главное статистическое свойство, на котором основана аппаратура сжатия, - межэлементная корреляция, включающая предположение о коррелированности последовательных кадров видеоданных. Таким образом, значения отдельных элементов изображения могут быть предсказаны либо по значениям ближайших элементов внутри одного кадра (внутрикадровое кодирование), либо по значениям элементов, расположенных в ближайших кадрах (межкадровое кодирование и компенсация перемещения).In most cases, video sequences (TV image) contain redundancy in two directions - temporal and spatial. The main statistical property on which the compression equipment is based is the inter-element correlation, which includes the assumption of the correlation of successive frames of video data. Thus, the values of individual elements of the image can be predicted either by the values of the nearest elements within the same frame (intraframe coding), or by the values of elements located in the nearest frames (interframe coding and motion compensation).

В некоторых случаях, например, при смене сюжета, временная корреляция между ближайшими кадрами очень низка. В таких случаях решающую роль играет внутрикадровая корреляция, то есть пространственная корреляция элементов изображения. Однако если корреляция между последовательными кадрами видеоданных высока, желательно устранить межкадровую избыточность. При этом применяемые методы преобразования изображения основаны на том, что его цифровой эквивалент (сигнал ИКМ) приводится к виду, удобному для сокращения избыточной информации. Наиболее эффективным методом является преобразование видеоинформации из временной области в спектральную. Результат преобразования представляет собой совокупность спектральных коэффициентов, которые характеризуют амплитуды пространственных частот изображения.In some cases, for example, when changing the plot, the temporal correlation between the closest frames is very low. In such cases, the intraframe correlation, that is, the spatial correlation of image elements, plays a decisive role. However, if the correlation between successive frames of video data is high, it is desirable to eliminate interframe redundancy. Moreover, the applied image conversion methods are based on the fact that its digital equivalent (PCM signal) is reduced to a form convenient for reducing redundant information. The most effective method is to convert video information from the time domain to the spectral one. The conversion result is a combination of spectral coefficients that characterize the amplitudes of the spatial frequencies of the image.

Из рассмотренных ортогональных преобразований стандартом MPEG рекомендовано использовать дискретно-косинусное преобразование (ДКП), являющееся частным случаем двумерного преобразования Фурье.Of the considered orthogonal transforms, the MPEG standard recommended the use of the discrete cosine transform (DCT), which is a special case of the two-dimensional Fourier transform.

ДКП осуществляет перевод отдельных блоков изображения, размер которых определен 8×8 элементов (или 16×16), из пространственной области значений временного сигнала в область пространственных частот спектральной плоскости коэффициентов (фиг.1). В результате массиву исходных значений сигнала (фиг.1, а) соответствует массив из такого же числа коэффициентов, представляющих собой амплитуды этих косинусных составляющих (фиг.1, б).DCT translates individual image blocks, the size of which is determined by 8 × 8 elements (or 16 × 16), from the spatial domain of the values of the temporary signal to the spatial frequency domain of the spectral plane of the coefficients (Fig. 1). As a result, an array of initial signal values (Fig. 1, a) corresponds to an array of the same number of coefficients representing the amplitudes of these cosine components (Fig. 1, b).

Двумерное ДКП ведется по формуле:Two-dimensional DCT is carried out according to the formula:

где

Where

f_(x,y) - отсчеты изображения с пространственными координатами x, y (от 0 до N-1);f _{(x, y)} - image samples with spatial coordinates x, y (from 0 to N-1);

N - размер блока изображения (N×N элементов);N - image block size (N × N elements);

F_(m,n) - коэффициенты, характеризующие изображение в спектральной плоскости m, n (от 0 до N-1).F _{(m, n)} - coefficients characterizing the image in the spectral plane m, n (from 0 to N-1).

ДКП является обратимым: по распределению F_(m,n) обратным преобразованием однозначно восстанавливается в декодирующем устройстве f_(x,y) из имеющихся трасформант:DCT is reversible: according to the distribution F _{(m, n), the} inverse transform uniquely recovers f _{(x, y)} from the existing transformants in the decoding device:

Квантование является важным звеном обработки сигнала, на котором происходят потери. Оно определяет точность хранения результатов ДКП и коэффициент сжатия. При квантовании каждое из значений ДКП делится на коэффициент квантования Q_(m,n) (табл.1), индивидуальный для каждой пространственной частоты, который берется из заранее определенной таблицы коэффициентов квантования размером 8×8 (естественно, она должна быть одинаковой для кодера и декодера). Эта таблица может быть взята по умолчанию или формируется кодером для конкретных изображений и передается декодеру вместе со сжатыми данными. Для сигнала цветного изображения таблицы могут различаться для разных компонент (Y, C_R, C_B). При декодировании исходные величины приближенно восстанавливаются путем умножения на фактор квантования. Сказанное поясняется табл.1.Quantization is an important part of signal processing at which losses occur. It determines the accuracy of storing the results of DCT and the compression ratio. During quantization, each of the DCT values is divided by the quantization coefficient Q _{(m, n)} (Table 1), individual for each spatial frequency, which is taken from a predefined 8 × 8 quantization coefficient table (naturally, it should be the same for the encoder and decoder). This table can be taken by default or generated by the encoder for specific images and transmitted to the decoder along with the compressed data. For a color image signal, tables may vary for different components (Y, C _R , C _B ). When decoding, the original values are approximately restored by multiplying by the quantization factor. The foregoing is explained in Table 1.

Таблица 1Table 1 139139 144144 149149 153153 155155 155155 155155 155155 12601260 -1-one -12-12 -6-6 22 -2-2 -2-2 22 144144 151151 153153 156156 159159 156156 156156 156156 -22-22 -17-17 -6-6 -3-3 -3-3 00 1one -1-one 150150 155155 160160 163163 158158 156156 156156 156156 -11-eleven -10-10 -1-one 22 1one -1-one -1-one -1-one 159159 161161 165165 160160 160160 159159 159159 159159 -7-7 -2-2 00 22 1one 00 00 00 159159 160160 161161 162162 162162 155155 155155 155155 00 -1-one 1one 1one -1-one -1-one 1one 22 161161 161161 161161 161161 160160 157157 157157 157157 22 00 1one -1-one -1-one 22 22 00 162162 162162 161161 163163 162162 157157 157157 157157 -1-one -1-one 00 -1-one 00 22 1one -1-one 162162 162162 161161 161161 163163 158158 158158 158158 -3-3 1one -3-3 -1-one 22 1one -1-one -1-one Выборки оригинала f_(x,y) Original Samples f _{(x, y)} Коэффициенты ДКП (округлены) f_(m,n) DCT coefficients (rounded) f _{(m, n)} 88 11eleven 1010 1616 2424 4040 5151 6161 158158 00 -1-one 00 00 00 00 00 1212 1212 14fourteen 1919 2626 5858 6060 5555 -2-2 -1-one 00 00 00 00 00 00 14fourteen 1313 1616 2424 4040 5757 6969 5656 -1-one -1-one 00 00 00 00 00 00 14fourteen 1717 2222 2929th 5151 8787 8080 6262 -1-one 00 00 00 00 00 00 00 18eighteen 2222 3737 5656 6868 109109 103103 7777 00 00 00 00 00 00 00 00 2424 3535 5555 6464 8181 104104 113113 9292 00 00 00 00 00 00 00 00 4949 6464 7878 8787 103103 121121 120120 101101 00 00 00 00 00 00 00 00 7272 9292 9595 9898 112112 100one hundred 103103 9999 00 00 00 00 00 00 00 00 Таблица квантовая Q_(m,n) Quantum table Q _{(m, n)} Коэффициент после квантования C_q(m,n) The coefficient after quantization C _{q (m, n)}

12641264 00 -10-10 00 00 00 00 00 142142 144144 147147 150150 152152 153153 154154 154154 -24-24 -12-12 00 00 00 00 00 00 149149 150150 153153 155155 156156 157157 156156 156156 -14-fourteen -13-13 00 00 00 00 00 00 157157 158158 159159 161161 161161 160160 159159 158158 -14-fourteen 00 00 00 00 00 00 00 162162 162162 163163 163163 162162 160160 158158 157157 00 00 00 00 00 00 00 00 162162 162162 162162 162162 161161 158158 156156 155155 00 00 00 00 00 00 00 00 160160 161161 161161 161161 160160 158158 156156 154154 00 00 00 00 00 00 00 00 160160 160160 161161 162162 161161 160160 158158 157157 00 00 00 00 00 00 00 00 160160 161161 163163 164164 164164 163163 161161 160160 Коэффициенты после обратного квантования

Coefficients after inverse quantization

Recovered Sample Values

Из таблицы видно, что восстановленное изображение отличается от оригинала незначительно.The table shows that the restored image differs slightly from the original.

Наиболее известными стандартами сжатия изображений являются MPEG-1, MPEG-2, MPEG-4, MPEG-7, JPEG и вейвлет-преобразование. В телевизионном вещании нашел применение стандарт MPEG-2. Другие методы сжатия изображений при экспериментах дали худшие результаты: неудовлетворительно качество, сложность технической реализации, недостаточная эффективность сжатия цифрового потока.The most famous image compression standards are MPEG-1, MPEG-2, MPEG-4, MPEG-7, JPEG, and wavelet transform. In television broadcasting, the MPEG-2 standard has found application. Other methods of image compression during experiments yielded worse results: poor quality, complexity of technical implementation, insufficient digital stream compression efficiency.

Устранение внутрикадровой и межкадровой избыточности в стандарте MPEG является наиболее эффективным. Изменить стандарт в ТВ вещании практически невозможно, так как это связано с большими экономическими затратами. Но в прикладном телевидении и специальных системах цифрового телевидения, не связанных с принятыми стандартами, применение других методов сжатия цифрового потока не вызовет существенных трудностей.The elimination of intraframe and interframe redundancy in the MPEG standard is the most effective. It is almost impossible to change the standard in TV broadcasting, as this is associated with high economic costs. But in applied television and special digital television systems that are not related to accepted standards, the application of other methods of compressing the digital stream will not cause significant difficulties.

Стандарт MPEG-2 не регламентирует методы сжатия видеосигнала, а только определяет, как должен выглядеть битовый поток кодированного сигнала, поэтому конкретные алгоритмы являются коммерческой тайной фирм - производителей оборудования. Однако существуют общие принципы и процесс сжатия цифрового потока видеосигнала может быть разбит на ряд последовательных операций: преобразование аналогового сигнала в цифровую форму (АЦП), предварительная обработка, дискретное косинусное преобразование (ДКП), квантование и кодирование.The MPEG-2 standard does not regulate video compression methods, but only determines what the bitstream of the encoded signal should look like, so specific algorithms are a trade secret of equipment manufacturers. However, there are general principles and the process of compressing a digital video signal stream can be divided into a number of sequential operations: converting an analog signal to digital form (ADC), preprocessing, discrete cosine transform (DCT), quantization and coding.

MPEG-2, в отличие от MPEG-1, дает возможность обработки чересстрочных изображений. В MPEG-1 для кодирования таких изображений приходилось предварительно объединить два поля в один кадр и только после этого подавать сигнал на вход кодера. Однако процедура приводила к заметным искажениям типа жалюзи и расческа. В MPEG-2 введена концепция полевого и кадрового кодирования. При полевом кодировании два поля одного кадра кодируются раздельно как самостоятельные изображения. Каждое поле разбивается на непересекающиеся макроблоки (8×8) или (16×16) пикселей (элементов) и к ним применяется ДКП. Кадровое кодирование предполагает построчное объединение двух полей в один кадр и обработку его как обычного изображения с прогрессивным разложением.MPEG-2, unlike MPEG-1, provides the ability to process interlaced images. In MPEG-1, for encoding such images, it was necessary to first combine two fields into one frame and only after that send a signal to the encoder input. However, the procedure led to noticeable distortions such as blinds and a comb. MPEG-2 introduces the concept of field and frame coding. In field coding, two fields of one frame are encoded separately as separate images. Each field is divided into disjoint macroblocks (8 × 8) or (16 × 16) pixels (elements) and DCT is applied to them. Frame coding involves line-by-line combining of two fields into one frame and processing it as a regular image with progressive decomposition.

MPEG-2 определяет два типа ДКП для макроблоков: кадровое и полевое. Кадровое ДКП действует аналогично MPEG-1: макроблок отсчетов яркости 16×16 пикселей разбивается на 4 блока 8×8 пикселей в соответствии с их расположением. Полевое ДКП берет 8 строк из нечетного поля для верхних двух блоков и 8 строк из четного поля для нижних блоков, образуя нечетные и четные поля макроблока, как показано на фиг.2.MPEG-2 defines two types of DCT for macroblocks: frame and field. Frame DCT acts similarly to MPEG-1: a macroblock of brightness samples of 16 × 16 pixels is divided into 4 blocks of 8 × 8 pixels in accordance with their location. Field DCT takes 8 lines from an odd field for the upper two blocks and 8 lines from an even field for the lower blocks, forming the odd and even fields of the macroblock, as shown in Fig.2.

Полевое ДКП более эффективно при существенном различии между полями, например, при наличии движения по вертикали. Полевое кодирование может использовать только полевое предсказание, у кадрового кодирования возможности шире - допускается применение кадрового предсказания или полевого. В последнем случае ищется сопряжение отдельно нечетного и четного полей макроблока в каждом из двух полей опорного кадра и выбирается лучший результат.Field DCT is more effective with a significant difference between the fields, for example, in the presence of vertical movement. Field coding can use only field prediction; with frame coding, the possibilities are wider — frame prediction or field prediction is allowed. In the latter case, the pairing of the separately odd and even fields of the macroblock in each of the two fields of the reference frame is searched for and the best result is selected.

Таким образом, в стандарте MPEG-2 кодер и декодер трактуют видеоданные как состоящие либо из 25 изображений в секунду с постоянным разрешением по вертикали, либо из 50 изображений в секунду с половинным разрешением по вертикали. Очевидно, что в обоих случаях объем информации на входе кодирующего устройства один и тот же. Полевая структура лучше подходит для компрессии изображений с быстрым движением, обеспечивая меньше артефактов. То есть она хороша для сюжетов с большим количеством движения, но хуже подходит при пространственной избыточности, обеспечивая худшее сжатие неподвижных изображений с точки зрения минимизации артефактов. Для кадровой или прогрессивной структуры справедливы, соответственно, обратные выводы. Выбор между кадровой и полевой структурами осуществляет специалист для достижения максимального уровня качества изображения.Thus, in the MPEG-2 standard, the encoder and decoder interpret the video data as consisting of either 25 images per second with a constant vertical resolution, or 50 images per second with a half vertical resolution. Obviously, in both cases, the amount of information at the input of the encoder is the same. The field structure is better suited for compressing images with fast motion, providing fewer artifacts. That is, it is good for scenes with a lot of movement, but is less suitable for spatial redundancy, providing worse compression of still images in terms of minimizing artifacts. For a personnel or progressive structure, the opposite conclusions are true, respectively. The choice between personnel and field structures is carried out by a specialist to achieve the maximum level of image quality.

После предварительной обработки разностные ошибки предсказания или сами макроблоки подвергаются ДКП, в результате которого исходная матрица блоков преобразуется в матрицу коэффициентов (фиг.3). Коэффициенты в матрице располагаются по возрастанию частот в вертикальном и горизонтальном направлениях. Другими словами, ДКП-матрицу можно рассматривать как двумерный фильтр в двух направлениях. При этом основная энергия концентрируется около нулевых частот. После квантования этих коэффициентов происходит преобразование их в одномерную последовательность. Алгоритм упорядочивания коэффициентов заключается в том, что в результате сканирования матрицы объединяются в серии с описанием их длины и местоположения (фиг.3). Одним из вариантов такого алгоритма является зигзагообразное сканирование, при котором преобразование начинается с левого верхнего угла и заканчивается в правом нижнем. Квантование полученной после ДКП-матрицы производится с учетом чувствительности глаза к различным пространственным частотам. Зрение человека наиболее чувствительно к градациям яркости крупных деталей, поэтому для коэффициентов, соответствующих нулевым частотам, шаг квантования должен быть минимальным. Для более высоких пространственных частот применяется квантование на меньшее число уровней.After pre-processing, difference prediction errors or macroblocks themselves are subjected to DCT, as a result of which the original matrix of blocks is converted into a matrix of coefficients (Fig. 3). The coefficients in the matrix are arranged in increasing frequencies in the vertical and horizontal directions. In other words, the DCT matrix can be considered as a two-dimensional filter in two directions. In this case, the main energy is concentrated near zero frequencies. After quantization of these coefficients, they are converted into a one-dimensional sequence. The algorithm for ordering the coefficients is that as a result of scanning the matrices are combined in a series with a description of their length and location (Fig. 3). One of the options for this algorithm is a zigzag scan, in which the conversion starts from the upper left corner and ends in the lower right. The quantization obtained after the DCT matrix is made taking into account the sensitivity of the eye to various spatial frequencies. Human vision is most sensitive to gradations of brightness of large parts, therefore, for coefficients corresponding to zero frequencies, the quantization step should be minimal. For higher spatial frequencies, quantization to a smaller number of levels is applied.

При построении системы сжатия цифрового потока в канале связи необходимо учитывать особенности зрения при восприятии мелких деталей разной протяженности и ориентации в плоскости xy (фиг.1). Известно, что передача тонких вертикальных черно-белых линий по каналу связи требует широкой полосы частот, а тонкие горизонтальные линии передаются низкими частотами, хотя разрешение глаза при восприятии вертикальных и горизонтальных линий является одинаковым.When constructing a digital stream compression system in a communication channel, it is necessary to take into account the peculiarities of vision when perceiving small parts of different lengths and orientations in the xy plane (Fig. 1). It is known that the transmission of thin vertical black and white lines through a communication channel requires a wide band of frequencies, and thin horizontal lines are transmitted at low frequencies, although the resolution of the eye when perceiving vertical and horizontal lines is the same.

Это надо учитывать при сокращении цифрового потока, чтобы в некоторых полях превалировали тонкие горизонтальные линии, которые передаются низкими частотами. Эти особенности позволяют дополнительно уменьшить избыточность цифрового сигнала путем прореживания отсчетов цифрового потока в кодирующем устройстве. В декодирующем устройстве проводится обратная операция - интерполяция, которая восстанавливает исходное количество отсчетов.This should be taken into account when reducing the digital stream, so that in some fields thin horizontal lines prevail, which are transmitted by low frequencies. These features allow you to further reduce the redundancy of the digital signal by thinning out the samples of the digital stream in the encoder. In the decoding device, the inverse operation is performed - interpolation, which restores the original number of samples.

Подобные способы сокращения цифрового потока часто используются в телевидении, например, в вейвлет-преобразовании.Similar methods of reducing the digital stream are often used in television, for example, in wavelet transform.

В стандарте MPEG-2 используется также и межкадровое кодирование, которое значительно повышает эффективность сжатия цифрового потока.MPEG-2 also uses inter-frame coding, which greatly improves the compression efficiency of the digital stream.

При межкадровом кодировании, основанном на временной избыточности, возможны различные способы предсказания. В зависимости от этого изображения (кадры) в своей временной последовательности подразделяются на следующие типы:With inter-frame coding based on temporal redundancy, various prediction methods are possible. Depending on this image (frames) in their time sequence are divided into the following types:

- I-кадры (intra), опорные, являются основными и кодируются без обращения к другим кадрам, то есть с использованием информации только этого кадра. Вид кодирования - внутрикадровый, обеспечивающий умеренное сжатие. Предсказание для них не формируется. Все остальные кадры анализируются процессором, который сравнивает их с опорным, а также между собой.- I-frames (intra), reference, are basic and are encoded without reference to other frames, that is, using information from this frame only. Type of coding - intraframe, providing moderate compression. Prediction for them is not formed. All other frames are analyzed by the processor, which compares them with the reference, as well as with each other.

- Р-кадры (predictive - предсказанные), при передаче которых используется межкадровое кодирование путем предсказания с компенсацией движения по ближайшему предшествующему 1-кадру или Р-кадру. Если макроблок в Р-кадре не может быть описан с использованием компенсации движения, что случается при появлении неизвестного объекта, то он кодируется тем же способом, что и макроблок в I-кадре. Р-кадры сжаты в 3 раза сильнее, чем I-кадры, и служат опорными для поступающих Р и В-кадров.- P-frames (predictive - predicted), the transmission of which uses inter-frame coding by prediction with motion compensation for the nearest previous 1-frame or P-frame. If the macroblock in the P-frame cannot be described using motion compensation, which happens when an unknown object appears, then it is encoded in the same way as the macroblock in the I-frame. P-frames are compressed 3 times stronger than I-frames, and serve as reference for incoming P and B-frames.

- В-кадры (bidirectional - двунаправленные), которые передаются с межкадровым кодированием путем предсказания с компенсацией движения по ближайшим к ним как спереди, так и сзади I-кадрам и Р-кадрам, а сами не могут использоваться для предсказания других кадров (некоторые фрагменты В-кадра могут кодироваться внутрикадровым методом).- B-frames (bidirectional - bidirectional), which are transmitted with inter-frame coding by prediction with motion compensation for I-frames and P-frames closest to them both in front and behind, and cannot be used to predict other frames (some fragments B-frames can be encoded using the intraframe method).

Таким образом, в стандарте MPEG-2 используются 3 вида кодирования: внутрикадровое, межкадровое вперед с компенсацией движения, межкадровое двунаправленное также с компенсацией движения.Thus, the MPEG-2 standard uses 3 types of coding: intraframe, interframe forward with motion compensation, bidirectional interframe also with motion compensation.

Полученные кадры объединяются в группы последовательных кадров (GOP). Каждая последовательность начинается с I-кадра и состоит из переменного числа Р- и В-кадров.Received frames are combined into sequential frame groups (GOPs). Each sequence begins with an I-frame and consists of a variable number of P- and B-frames.

В начале сюжета должен стоять I-кадр, в конце В-кадр. Увеличивать долю В-кадров можно только в рамках одного сюжета, иначе возникнут большие ошибки предсказания и компенсации движения. Поскольку типичная длительность группы кадров (во временном представлении 0,5 с - 12 кадров) значительно меньше характерного расстояния между границами сюжетов, то в большинстве случаев жесткое задание структуры группы кадров не приводит к существенным визуальным ошибкам из-за того, что смена сюжета попадает внутрь группы кадров.At the beginning of the plot there should be an I-frame, at the end of a B-frame. It is possible to increase the share of B-frames only within the framework of one plot, otherwise large errors of prediction and motion compensation will occur. Since the typical duration of a group of frames (in the temporary representation of 0.5 s - 12 frames) is significantly less than the characteristic distance between the boundaries of the plots, in most cases, rigidly setting the structure of the group of frames does not lead to significant visual errors due to the fact that the change of plot falls inside groups of frames.

Рассмотрим пример последовательности кадров:Consider an example frame sequence:

1one 22 33 4four 55 66 77 88 99 1010 11eleven 1212 1313 14fourteen 15fifteen 1616 1717 18eighteen 1919 20twenty II ВAT ВAT РR ВAT ВAT РR ВAT ВAT РR ВAT ВAT РR ВAT ВAT II ВAT ВAT РR ВAT

Здесь кадры с 1 по 15 образуют группу кадров. Число кадров в группе может быть и другим, но они всегда начинаются с I-кадра. Р-кадр 4 предсказывается по I-кадру 1, Р-кадр 7 - по Р-кадру 4, Р-кадр 10 - по Р-кадру 7 и т.д. I-кадр передается с внутрикадровым кодированием независимо от всех предшествующих ему кадров. В-кадры 2 и 3 предсказываются по I-кадру 1 и Р-кадру 4, В-кадры 5 и 6 - по Р-кадрам 4 и 7 и т.д. В-кадры 14 и 15 предсказываются по I-кадру 16 и Р-кадру 13. Перед кодированием порядок следования кадров изменяется, так как каждый В-кадр должен идти после обоих кадров по которым он предсказывается:Here frames 1 to 15 form a group of frames. The number of frames in a group may be different, but they always start with an I-frame. P-frame 4 is predicted by I-frame 1, P-frame 7 - by P-frame 4, P-frame 10 - by P-frame 7, etc. The I-frame is transmitted with intraframe coding independently of all the frames preceding it. B-frames 2 and 3 are predicted from I-frame 1 and P-frame 4, B-frames 5 and 6 are predicted from P-frames 4 and 7, etc. B-frames 14 and 15 are predicted from I-frame 16 and P-frame 13. Before encoding, the order of the frames changes, since each B-frame must go after both frames over which it is predicted:

1one 4four 22 33 77 55 66 1010 88 99 1313 11eleven 1212 1616 14fourteen 15fifteen 1919 1717 18eighteen 2222 II РR ВAT ВAT РR ВAT ВAT РR ВAT ВAT РR ВAT ВAT II ВAT ВAT РR ВAT ВAT РR

В таком порядке кадры кодируются и передаются, а в процессе декодирования восстанавливается исходный порядок кадров. Изменение порядка передачи кадров В, В, Р объясняется тем, что при их декодировании на приемном конце системы потребуются кадры I и Р.In this order, frames are encoded and transmitted, and in the decoding process, the original frame order is restored. The change in the transmission order of frames B, B, P is explained by the fact that when they are decoded at the receiving end of the system, frames I and P will be required.

Изображение типа В компрессированы наиболее глубоко. Если Р-кадры требуют для своей передачи в 3 раза меньше битов, чем I-кадры, то в В изображениях число битов для большинства сюжетов в 2-5 раз меньше, чем в Р. Как следствие, страдает помехоустойчивость В-кадров. Поэтому для защиты от возможных ошибок изображения В не используются для предсказания других кадров.Type B image is most deeply compressed. If P-frames require 3 times fewer bits for their transmission than I-frames, then in the images the number of bits for most scenes is 2-5 times less than in P. As a result, the noise immunity of B-frames suffers. Therefore, to protect against possible errors, images B are not used to predict other frames.

Из рассмотренных способов сокращения (уменьшения) цифрового потока в канале связи, являющиеся наиболее близким и предлагаемому техническому решению, заслуживает внимания метод передачи и приема цифровой информации по стандарту MPEG-2, который возьмем за прототип предлагаемого изобретения. Проведенный выше анализ работы системы MPEG-2 показывает, что она имеет и недостатки. Известно, что степень сжатия цифрового потока напрямую зависит от корреляции между элементами изображения. В MPEG-2 межкадровое кодирование между I- и Р-кадрами осуществляется через 3 кадра (поля), а не через 1 кадр (поле). Это в сильной степени ослабляет корреляционные связи между элементами и соответственно ухудшает эффективность сокращения избыточности изображения. Хотя этот недостаток и компенсируется двунаправленным предсказанием В-кадров, но при этом снижается помехоустойчивость и качество изображения В-кадров.Of the considered methods of reducing (decreasing) the digital stream in the communication channel, which are the closest and the proposed technical solution, noteworthy is the method of transmitting and receiving digital information according to the MPEG-2 standard, which we take as a prototype of the invention. The above analysis of the MPEG-2 system shows that it also has disadvantages. It is known that the compression rate of a digital stream directly depends on the correlation between image elements. In MPEG-2, inter-frame coding between I- and P-frames is performed through 3 frames (fields), and not through 1 frame (field). This greatly weakens the correlation between the elements and, accordingly, worsens the effectiveness of reducing image redundancy. Although this drawback is compensated by the bi-directional prediction of B-frames, the noise immunity and image quality of B-frames are reduced.

Естественно, перестановка Р-, В1- и В2-кадров существенно усложняет построение системы, делая ее громоздкой и дорогой. При этом упомянутые усложнения системы не всегда себя оправдывают в смысле получения высокого качества изображения.Naturally, the permutation of P-, B1- and B2-frames significantly complicates the construction of the system, making it cumbersome and expensive. Moreover, the mentioned complications of the system do not always justify themselves in the sense of obtaining high image quality.

Следует отметить, что двунаправленное предсказание позволяет прервать возникший трек ошибки в группе кадров, если ошибка возникла в В-кадрах. В I- и Р-кадрах трек ошибки будет продолжаться до начала следующего опорного кадра I.It should be noted that bidirectional prediction allows you to interrupt the error track that occurred in a group of frames if an error occurred in B-frames. In I- and P-frames, the error track will continue until the beginning of the next reference frame I.

Повышение эффективности сжатия В-кадров сопровождается потерей качества изображения в этих кадрах, но за счет боле высокого качества изображения I- и Р-кадров визуально наш глаз ухудшения качества всего изображения не замечает. Кроме того, имеются определенные трудности при определении вектора движения объекта в соседних кадрах в связи с изменением порядка следования В- и Р-кадров. В то же время любые изменения стандарта MPEG-2 могут привести к снижению эффективности кодирования.An increase in the compression efficiency of B-frames is accompanied by a loss of image quality in these frames, but due to the higher image quality of I- and P-frames, our eye does not visually notice a deterioration in the quality of the entire image. In addition, there are certain difficulties in determining the motion vector of an object in adjacent frames due to a change in the sequence of B- and P-frames. At the same time, any changes to the MPEG-2 standard may lead to a decrease in coding efficiency.

Устранение вышеназванных недостатков позволит создать более простую и не менее эффективную систему сжатия цифрового потока для прикладного и вещательного телевидения.Elimination of the aforementioned shortcomings will make it possible to create a simpler and no less effective digital stream compression system for applied and broadcast television.

Как было отмечено выше, наиболее близким техническим решением (прототипом предлагаемого изобретения) является способ сжатия цифрового потока видеосигнала по стандарту MPEG-2, работа которого подробно была рассмотрена выше и в работе (Локшин В.А. Цифровое вещание: от студии к телезрителю. / Под ред. Л.С.Виленчика. - М.: Сайрус системе, 2001. - 448 с.)As noted above, the closest technical solution (the prototype of the present invention) is a method of compressing a digital video stream according to the MPEG-2 standard, the work of which was discussed in detail above in the work (Lokshin V.A. Digital broadcasting: from studio to viewer. / Edited by L.S. Vilenchik. - M.: Cyrus system, 2001. - 448 p.)

Реализация прототипа предполагает следующую последовательность действий (относительно яркостного сигнала):The implementation of the prototype involves the following sequence of actions (relative to the brightness signal):

- изображение достраивается до кратного «16-ти» количества элементов (пикселов) по строкам и столбцам, чтобы обеспечить разбиение изображения на целое число макроблоков;- the image is completed to a multiple of “16” the number of elements (pixels) in rows and columns to ensure that the image is divided into an integer number of macroblocks;

- изображение разбивается на последовательность макроблоков, каждый из которых состоит из 4 блоков 8*8 элементов, несущих информацию о яркости объекта (фиг.2);- the image is divided into a sequence of macroblocks, each of which consists of 4 blocks of 8 * 8 elements that carry information about the brightness of the object (figure 2);

- образование опорных (I-кадров (полей)) и разностных (Р-, В-кадров (полей)) сигналов;- the formation of reference (I-frames (fields)) and differential (P-, B-frames (fields)) signals;

- определение вектора движения блоков;- determination of the motion vector of blocks;

- дискретно-косинусное преобразование (ДКП);- discrete cosine transform (DCT);

- квантование и кодирование коэффициентов ДКП;- quantization and coding of DCT coefficients;

- мультиплексирование данных цифрового потока;- multiplexing digital stream data;

- регулирование скорости потока данных, подлежащих передаче.- regulation of the flow rate of data to be transmitted.

Отмеченные ранее недостатки прототипа затрудняют его внедрение в прикладное телевидение.The previously noted disadvantages of the prototype impede its implementation in applied television.

Целью предлагаемого изобретения является сужение полосы частот канала связи, а также повышение эффективности цифрового преобразования видеосигнала, заключающееся в дополнительном сокращении объема цифровой информации при достаточно простой технической реализации.The aim of the invention is to narrow the frequency band of the communication channel, as well as to increase the efficiency of digital video signal conversion, which consists in further reducing the amount of digital information with a fairly simple technical implementation.

Поставленная цель достигается тем, что, как и в прототипе, последовательность полей (а не кадров) делится на группы. В группе есть поля 3-х типов (фиг.4):The goal is achieved in that, as in the prototype, the sequence of fields (rather than frames) is divided into groups. In the group there are fields of 3 types (figure 4):

- O-поля - изображения, играющие роль опорных при восстановлении других изображений. Предсказание для них не формируется, используют внутриполевое кодирование;- O-fields - images playing the role of reference when restoring other images. Prediction for them is not formed, use in-field coding;

- N-четные поля - изображения, кодируемые путем предсказания на основе предыдущего поля в пределах одного кадра, используют межполевое кодирование, в результате которого образуется межстрочная разность двух соседних строк нечетного и четного полей;- N-even fields - images encoded by prediction based on the previous field within the same frame use inter-field coding, resulting in an inter-line difference between two adjacent lines of odd and even fields;

- М-нечетные поля - кодируемые с предсказанием на основе предыдущего нечетного поля из другого кадра.- M-odd fields - encoded with prediction based on the previous odd field from another frame.

Буквами на фиг.4 обозначается их тип, а цифрами 1, 2, 3,… - порядок поступления их на вход компрессора. В группе 10 полей (5 кадров). Начинается группа с изображения типа О, подвергается только внутриполевой компрессии. В изображениях полей N- и М-типа устраняется как пространственная, так и временная избыточности. Число полей (кадров) в группе может быть иным, чем это представлено на фиг.4, и легко варьируется путем изменения частоты следования импульсов U₁ и соответственно U₄. Стрелки на фиг.4 определяют порядок предсказания: для N - в пределах одного кадра, и М - в пределах 2-х соседних кадров для нечетных полей.The letters in Fig. 4 indicate their type, and the numbers 1, 2, 3, ... indicate the order of their arrival at the compressor input. There are 10 fields in the group (5 frames). The group begins with an image of type O, is subjected only to field compression. In images of N- and M-type fields, both spatial and temporal redundancies are eliminated. The number of fields (frames) in the group may be different than that shown in figure 4, and can be easily varied by changing the pulse repetition rate U ₁ and, accordingly, U ₄ . The arrows in FIG. 4 determine the prediction order: for N, within one frame, and M, within 2 adjacent frames for odd fields.

Изображения N-полей содержат в основном тонкие горизонтальные линии разной протяженности (низкочастотные составляющие). При движении в изображениях N-полей появляются ВЧ-составляющие.Images of N-fields mainly contain thin horizontal lines of different lengths (low-frequency components). When moving in images of N-fields, high-frequency components appear.

Изображения М-полей содержат в основном высокочастотные составляющие, отражают межкадровую разность в пределах нечетных полей соседних кадров. Здесь НЧ-составляющие появляются при резкой смене сюжетов.Images of M-fields contain mainly high-frequency components, reflect the inter-frame difference within the odd fields of adjacent frames. Here, the low-frequency components appear during a sharp change of plot.

При передаче малоподвижных объектов в изображениях М-полей сигнал практически равен нулю, а сигнал от N-полей отличен от нуля. ВЧ-составляющие в N-полях появляются от передачи наклонных линий. После образования разностных сигналов N-полей перед матрицей ДКП для дополнительного сокращения цифрового потока осуществим прореживание (децимацию) сигналов от N-полей как в горизонтальном, так и вертикальном направлениях. Эта операция правомерна и не ухудшает качество изображения, так как частота дискретизации для N-полей слишком велика. В декодирующем устройстве проведем обратную операцию - интерполяцию, восстанавливая потерянную информацию. Подобные операции положены в основу сокращения цифрового потока в вейвлет-преобразованиях в НЧ- и ВЧ-каналах.When transferring inactive objects in images of M-fields, the signal is practically zero, and the signal from N-fields is non-zero. The RF components in N-fields appear from the transmission of oblique lines. After the formation of difference signals of N-fields in front of the DCT matrix, to further reduce the digital stream, we will perform decimation (decimation) of signals from N-fields in both horizontal and vertical directions. This operation is legitimate and does not impair image quality, since the sampling frequency for N-fields is too high. In the decoding device, we carry out the inverse operation - interpolation, restoring the lost information. Such operations are the basis for reducing the digital stream in wavelet transforms in the LF and HF channels.

Операции прореживания отсчетов и интерполяции можно провести также и для сигналов от М-полей. В декодирующем устройстве восстановление пропущенных отсчетов осуществляется путем использования одномерного цифрового фильтра с конечной импульсной характеристикой.The operations of thinning samples and interpolation can also be performed for signals from M-fields. In the decoding device, the restoration of the missed samples is carried out by using a one-dimensional digital filter with a finite impulse response.

Такие устройства являются интерполяторами. Безусловно, реализация интерполирования связана с определенными неточностями восстановления сигнала по его отсчетам, однако, она необходима для осуществления операций с целью получения исходных значений сигналов. При отсутствии интерполяции в декодирующем устройстве высокочастотные компоненты разностных сигналов могут быть потеряны, а в видеосигнале появятся мешающие НЧ-компоненты.Such devices are interpolators. Of course, the implementation of interpolation is associated with certain inaccuracies in the restoration of the signal from its samples, however, it is necessary for operations to obtain the initial values of the signals. If there is no interpolation in the decoding device, the high-frequency components of the difference signals may be lost, and interfering low-frequency components will appear in the video signal.

В результате проведенных преобразований цифровой поток в N-полях дополнительно уменьшается примерно в 4 раза.As a result of the transformations, the digital stream in N-fields additionally decreases by about 4 times.

Как и в стандарте MPEG-2 применим в системе сжатия энтропийное кодирование и определение вектора движений в М-полях. В N-полях компенсацию движения можно не учитывать, так как они расположены в тех же кадрах что и О- и М-поля, кроме того, она нарушит работу интерполятора в декодирующем устройстве или значительно усложнит его построение.As in the MPEG-2 standard, entropy coding and motion vector determination in M-fields are applicable in the compression system. In N-fields, motion compensation can be ignored, since they are located in the same frames as the O- and M-fields, in addition, it will disrupt the operation of the interpolator in the decoding device or significantly complicate its construction.

Подытоживая сказанное, видим, что эффективность сокращения цифрового потока O-полей является наиболее низкой по сравнению с М- и N-полями и примерно равна по эффективности 1-полям в стандарте MPEG-2. Зато качество изображения здесь выше, чем в М- и N-полях. Эффективность сжатия в М-полях примерно одинакова с Р-полями, а в N-полях она несколько выше, чем в В-полях, так как в N-полях осуществляется прореживание (передискретизация) значений сигналов матрицы ДКП (до матрицы коэффициентов) в горизонтальном и вертикальном направлениях (фиг.5 и 6).Summing up the above, we see that the efficiency of reducing the digital stream of O-fields is the lowest in comparison with the M- and N-fields and is approximately equal in efficiency to 1-fields in the MPEG-2 standard. But the image quality is higher than in the M and N fields. Compression efficiency in M-fields is approximately the same with P-fields, and in N-fields it is slightly higher than in B-fields, since in N-fields thinning (oversampling) of the values of the DCT matrix signals (to the coefficient matrix) in the horizontal and vertical directions (FIGS. 5 and 6).

Кроме того, для улучшения визуального качества изображения по предлагаемому способу нечетные и четные поля в каждой группе кадров меняются местами: в 1 группе - (O,N), (M,N), (M,N), …, а во 2-й группе - (N,O), (N,M), (N,M), … и т.д. (фиг.4, а'). Такая перестановка полей в силу особенностей зрительного восприятия зрения приводит к существенному улучшению визуального качества изображения в целом, то есть визуально происходит образование опорного кадра (O,N)+(N,O). В приемнике, естественно, должна быть восстановлена исходная очередность следования полей.In addition, to improve the visual quality of the image according to the proposed method, the odd and even fields in each group of frames are interchanged: in the 1st group - (O, N), (M, N), (M, N), ..., and in 2- group - (N, O), (N, M), (N, M), ... etc. (figure 4, a '). Such a permutation of the fields due to the peculiarities of visual perception of vision leads to a significant improvement in the visual quality of the image as a whole, that is, visually the formation of a reference frame (O, N) + (N, O) occurs. In the receiver, of course, the initial sequence of the fields should be restored.

Возможен еще один вариант перестановки полей в кадрах (фиг.4, а'') - перестановка нечетных и четных полей во всей видеопоследовательности кадров: (O,N), (N,M), (M,N), (N,M), (M,N), (N,O), … и т.д. При нечетном числе (2n+1) полей в группе кадров О- и N-поля, а также (М- и N-поля), периодически меняются местами, что более предпочтительно. Здесь визуально формируется виртуальный опорный кадр: (O,N) плюс (N,O). В нашем случае (фиг.4, а'') визуально кадры (O,N) и (N,O) накладываются друг на друга через 4 кадра, образуя опорный кадр.Another variant of the permutation of fields in frames (Fig. 4, a``) is the permutation of odd and even fields in the entire video sequence of frames: (O, N), (N, M), (M, N), (N, M ), (M, N), (N, O), ... etc. With an odd number (2n + 1) of fields in the group of frames, the O- and N-fields, as well as (M- and N-fields), are interchanged periodically, which is more preferable. Here a virtual reference frame is visually formed: (O, N) plus (N, O). In our case (Fig. 4, a), visually the frames (O, N) and (N, O) overlap each other through 4 frames, forming a reference frame.

Из рассмотренных вариантов последний является предпочтительнее, так как частота перестановки полей здесь более высокая, чем в первом варианте.Of the options considered, the latter is preferable, since the frequency of the field permutation is higher than in the first option.

То есть если зрителю поочередно предъявлять кадры (поля) изображения с высокой и низкой четкостью, то глаз в целом воспринимает изображение с высокой четностью. В прототипе такая возможность повышения визуального качества изображения отсутствует.That is, if the viewer is presented alternately with frames (fields) of images with high and low definition, then the eye as a whole perceives the image with high definition. In the prototype, this possibility of increasing the visual quality of the image is missing.

Реализация способа предполагает следующую последовательность действий (относительно яркостного сигнала):The implementation of the method involves the following sequence of actions (relative to the brightness signal):

- изображение достраивается до кратного «16» количества элементов (пикселов) по строкам и столбцам, чтобы обеспечить разбиение изображения на целое число макроблоков:- the image is extended to a multiple of “16” the number of elements (pixels) in rows and columns to ensure that the image is divided into an integer number of macroblocks:

- 1-й вариант: перестановка местами нечетных и четных полей в каждой группе кадров для улучшения визуального качества изображения: в 1 группе - (O,N), (M,N), (M,N),…, а во 2-й группе - (N,O), (N,M), (N,M), … и т.д. (фиг.4, а');- 1st option: rearrangement of odd and even fields in each group of frames to improve visual image quality: in 1 group - (O, N), (M, N), (M, N), ..., and in 2- group - (N, O), (N, M), (N, M), ... etc. (figure 4, a ');

- 2-й вариант: перестановка местами нечетных и четных полей во всей видеопоследовательности кадров: (O,N), (N,M), (M,N), (N,M)…(N,O), (M,N), (N,M), (M,N), … и т.д. (фиг.4, а'');- 2nd option: rearrangement of odd and even fields in the entire video sequence of frames: (O, N), (N, M), (M, N), (N, M) ... (N, O), (M, N), (N, M), (M, N), ... etc. (Fig. 4, a``);

- изображение разбивается на последовательность макроблоков, каждый из которых состоит из четырех блоков 8*8 элементов, несущих информацию о яркости объекта (фиг.5) для кадрового и полевого кодирований;- the image is divided into a sequence of macroblocks, each of which consists of four blocks of 8 * 8 elements that carry information about the brightness of the object (figure 5) for frame and field coding;

- образование опорных (O-полей) и разностных (М- и N-полей) сигналов;- the formation of reference (O-fields) and difference (M- and N-fields) signals;

- прореживание отсчетов в матрицах значений разностных сигналов яркости по строкам и столбцам в четных полях каждого кадра (фиг.6 и 8);- thinning of samples in the matrices of values of the differential brightness signals in rows and columns in the even fields of each frame (Fig.6 and 8);

- дискретно-косинусное преобразование значений сигнальной матрицы в матрицу коэффициентов;- discrete cosine transformation of the values of the signal matrix into a matrix of coefficients;

- квантование и энтропийное кодирование коэффициентов ДКП;- quantization and entropy coding of DCT coefficients;

- определение векторов движения блоков в подвижном изображении и энтропийное кодирование;- determination of motion vectors of blocks in a moving image and entropy coding;

- регулирование скорости потока данных, подлежащих передаче;- regulation of the flow rate of data to be transmitted;

- декодирование принятых сообщений происходит в обратном порядке процессам, происходящим в кодирующем устройстве.- decoding of received messages occurs in reverse order to the processes occurring in the encoding device.

Сравнение заявляемого решения с прототипом выявило признаки, отличающие заявляемое решение от прототипа, что позволяет сделать вывод о соответствии заявляемого способа критерию «существенные отличия».Comparison of the proposed solutions with the prototype revealed signs that distinguish the claimed solution from the prototype, which allows us to conclude that the proposed method meets the criterion of "significant differences".

Существенными отличиями предлагаемого способа являются:Significant differences of the proposed method are:

- применяются для каждой группы кадров одновременно полевое и кадровое кодирование. В MPEG-2 используется только кадровое или только полевое кодирование в зависимости от характера передаваемого изображения, что значительно усложняет аппаратуру;- apply for each group of frames simultaneously field and frame coding. MPEG-2 uses only frame or only field coding, depending on the nature of the transmitted image, which greatly complicates the equipment;

- значительно более сильная корреляция между сравниваемыми сигналами, а следовательно, и более эффективное устранение избыточности из телевизионных сообщений. В прототипе корреляционные связи между сравниваемыми сигналами значительно слабее, чем в предлагаемом способе;- a much stronger correlation between the compared signals, and therefore, a more effective elimination of redundancy from television messages. In the prototype, the correlation between the compared signals is much weaker than in the proposed method;

- прореживание отсчетов в матрицах значений одного из разностных сигналов яркости по строкам и столбцам в четных полях каждого кадра;- thinning of samples in the matrices of values of one of the difference brightness signals along rows and columns in even fields of each frame;

- перестановка местами нечетных и четных полей во всей видеопоследовательности кадров для улучшения визуального качества изображения.- rearrangement of odd and even fields in the entire video sequence of frames to improve visual image quality.

Возможны различные схемные решения, обеспечивающие такое сокращение цифрового потока в канале связи.Various circuit solutions are possible that provide such a reduction in the digital stream in the communication channel.

Рассмотрим реализацию способа на примере устройства, изображенного на фиг.7 (кодирующее устройство) и фиг.9 (декодирующее устройство).Consider the implementation of the method on the example of the device depicted in Fig.7 (encoding device) and Fig.9 (decoding device).

Кратко рассмотрим работу кодирующего устройства.Briefly consider the operation of the encoder.

Структурная схема кодера (фиг.7) отображает лишь основные операции, выполняемые при кодировании и обеспечивающие получение выходного потока данных с требуемыми параметрами.The block diagram of the encoder (Fig. 7) displays only the basic operations performed during encoding and providing the output data stream with the required parameters.

В кодере реализуются два режима кодирования: внутрикадровое и межкадровое с предсказанием и компенсацией движения.The encoder implements two encoding modes: intra-frame and inter-frame with prediction and motion compensation.

ПримечаниеNote

Для облегчения понимания физических процессов, протекающих в схеме, будем рассматривать видеопоследовательность кадров без перестановки полей (фиг.4, а).To facilitate understanding of the physical processes taking place in the circuit, we will consider the video sequence of frames without rearranging the fields (Fig. 4, a).

На схеме представлены только те узлы и блоки, которые характеризуют особенности данного способа сокращения цифрового потока информации.The diagram shows only those nodes and blocks that characterize the features of this method of reducing the digital flow of information.

Работа схемы также поясняется диаграммами напряжений, представленными на фиг.4.The operation of the circuit is also illustrated by the voltage diagrams presented in figure 4.

Входной видеосигнал с ИКМ поступает на блок предварительной обработки сигналов (БПО).The input video signal from the PCM is fed to the signal preprocessing unit (BPO).

В БПО сигналов осуществляются следующие преобразования:The following transformations are carried out in BPO signals:

- изображение достраивается до кратного «16» количества элементов (пикселов) по строкам и столбца;- the image is completed to a multiple of “16” the number of elements (pixels) in rows and columns;

- перестановка местами нечетных и четных полей в видеопоследовательности кадров;- rearrangement of odd and even fields in the video sequence of frames;

- изображение разбивается на последовательность макроблоков, каждый из которых состоит из четырех блоков 8×8 элементов, несущих информацию о яркости.- the image is divided into a sequence of macroblocks, each of which consists of four blocks of 8 × 8 elements that carry information about the brightness.

Аналогичные преобразования осуществляются и с сигналами цветности в формате 4:2:0. Далее в блоках 2, 3, 4, 5, 8 происходит образование опорных и разностных сигналов М и N. Управляющие сигналы U1, U2, U5 (фиг.4) координируют работу названых блоков. В блоке 9 осуществляется передискретизация (прореживание) сигналов N-полей в горизонтальном и вертикальном направлениях.Similar conversions are carried out with color signals in the 4: 2: 0 format. Next, in blocks 2, 3, 4, 5, 8, the formation of reference and difference signals M and N. The control signals U1, U2, U5 (Fig. 4) coordinate the operation of these blocks. In block 9, oversampling (decimation) of N-field signals in the horizontal and vertical directions is carried out.

Все макроблоки O-полей кодируются в режиме внутриполевого кодирования. С этой целью входной сигнал ИКМ поступает на блок ДКП по верхнему каналу. В ДКП 6 и 10 происходит преобразование матриц значений сигналов f_x,y в матрицы коэффициентов Фурье F_(m,n). В квантователе происходит кодирование коэффициентов ДКП в соответствии с формулой:All macroblocks of O-fields are encoded in intra-field coding mode. For this purpose, the PCM input signal is supplied to the DCT unit via the upper channel. In DCT 6 and 10, the matrices of the signal values f _{x, y} are transformed into the matrix of Fourier coefficients F _{(m, n)} . In the quantizer, the DCT coefficients are encoded in accordance with the formula:

где Q(m,n) - коэффициенты квантования, задаваемые в виде таблицы из 8×8 целых чисел (таблица 1, Q);where Q (m, n) are the quantization coefficients specified in the form of a table of 8 × 8 integers (table 1, Q);

ρ - параметр, определяющий степень сжатия изображения;ρ is a parameter that determines the degree of image compression;

Round - операция округления до ближайшего целого значения;Round - operation of rounding to the nearest integer value;

C_q(m,n) - полученные в результате данной операции квантованные коэффициенты ДКП, которые могут быть как положительными, так и отрицательными (табл. 1, C_q).C _q (m, n) —the quantized DCT coefficients obtained as a result of this operation, which can be both positive and negative (Table 1, C _q ).

Квантование полученной после ДКП матрицы коэффициентов производится с учетом чувствительности глаза к различным пространственным частотам, причем при передаче более ВЧ компонентов возможна большая погрешность, то есть их можно квантовать на меньшее число уровней. Для крупных деталей коэффициенты ДКП квантуются на большее число уровней.The quantization of the matrix of coefficients obtained after DCT is made taking into account the sensitivity of the eye to various spatial frequencies, and when transmitting more RF components, a large error is possible, that is, they can be quantized to a smaller number of levels. For large parts, DCT coefficients are quantized to a greater number of levels.

В результате выполнения операций деления и округления многие коэффициенты ДКП становятся равными нулю. Именно квантование создает возможность уменьшения числа двоичных символов, необходимых для представления информации о коэффициентах ДКП, то есть сжатия изображения. В то же время квантование является источником необратимых потерь информации при сжатии. Выбор конкретной таблицы квантования Q предоставляется на усмотрение пользователя.As a result of the operations of division and rounding, many DCT coefficients become equal to zero. It is quantization that makes it possible to reduce the number of binary symbols needed to represent information about the DCT coefficients, that is, image compression. At the same time, quantization is a source of irreversible loss of information during compression. The selection of a particular Q quantization table is at the discretion of the user.

В блоках «Энтропийное кодирование» (21-23) осуществляется кодирование с переменной длиной кодовых слов.In the blocks "Entropy coding" (21-23), coding is performed with a variable length of code words.

В петле обратной связи (деквантователь Кв^-1 (15), блок обратного дискретного косинусного преобразования - ДКП^-1 (14) и предсказатели (12 и 19), ЗУ которых могут содержать несколько предыдущих полей (τ=Т_п+1/2Т_с, τ=2 Т_п)) происходит формирование предсказанного поля.In the feedback loop (KV ^-1 dequantizer (15), inverse discrete cosine transform block - DCT ^-1 (14) and predictors (12 and 19), whose memory can contain several previous fields (τ = T _p + 1 / 2T _s , τ = 2 T _p )), the formation of the predicted field occurs.

Оценка движения ОД (18) осуществляется путем сравнения текущего изображения, поступающего на вход кодера, с изображением, находящимся в ЗУ (19) и используемым для предсказания. Через блок 16 на вход ОД (18) поступают сигналы О- и М-полей.The motion of OD (18) is estimated by comparing the current image fed to the input of the encoder with the image located in the memory (19) and used for prediction. Through block 16, the input of the OD (18) receives the signals of O- and M-fields.

Предсказатель в системе не просто память для хранения предшествующих полей (кадров), но и устройство, которое при формировании предсказания отыскивает из массива данных, находящихся в его памяти, блок, согласованный с блоками текущего кадра. Для этого в предсказатель заводятся данные о векторах движения. Сигнал предсказания также подвергается энтропийному кодированию (23) и мультиплексируется (24) в общий цифровой поток с коэффициентами ДКП. Основные цифровые потоки О, М, N с выходов блоков 21 и 22 мультиплексируются (24) в один поток. В общий цифровой поток мультиплексируются сигналы с выхода определителя векторов движения ОД (18). В блоках 21, 22 и 23 осуществляется дополнительное сокращение цифрового потока.The predictor in the system is not just a memory for storing previous fields (frames), but also a device that, when generating a prediction, searches from the array of data in its memory for a block that is consistent with the blocks of the current frame. For this, data on motion vectors are entered into the predictor. The prediction signal is also subjected to entropy encoding (23) and is multiplexed (24) into a common digital stream with DCT coefficients. The main digital streams O, M, N from the outputs of blocks 21 and 22 are multiplexed (24) into one stream. Signals from the output of the OD motion vector determinant are multiplexed into the common digital stream (18). In blocks 21, 22 and 23, an additional reduction of the digital stream is carried out.

Общий цифровой поток после мультиплексора Мп (24) поступает на вход буферного запоминающего устройства БЗУ (26), работающего по принципу: «первым вошел - первым вышел». Необходимость введения в систему БЗУ объясняется следующими обстоятельствами. В зависимости от детальности движения и характера передаваемого движения в существенной степени может меняться скорость цифрового потока на выходе блоков кодирования 21, 22, 23 с переменной длиной. При возрастании в изображении уровня ВЧ компонентов, при быстроменяющихся сюжетах скорость потока данных на выходе компрессора возрастает. Это возрастание может приводить к превышению возможностей канала передачи по его пропускной способности. Ограничение скорости кодированного цифрового потока осуществляется реализацией обратной связи, в которую включена буферная память 26 и квантователи 7, 11.The total digital stream after the multiplexer Mp (24) is fed to the input of the buffer memory BZU (26), which works according to the principle: "first in, first out." The need for introducing into the system BZU due to the following circumstances. Depending on the detail of the movement and the nature of the transmitted movement, the speed of the digital stream at the output of the variable-length coding units 21, 22, 23 can substantially change. With increasing level of RF components in the image, with rapidly changing scenes, the data flow rate at the compressor output increases. This increase can lead to exceeding the capacity of the transmission channel in its bandwidth. The rate limitation of the encoded digital stream is implemented by the implementation of feedback, which includes a buffer memory 26 and quantizers 7, 11.

Для оптимизации работы системы желательно поддерживать уровень заполнения БЗУ приблизительно постоянным. Если БЗУ переполняется, то будет происходить потеря данных, то есть ухудшение качества изображения. Если БЗУ освобождается, то по каналу связи передаются "пустые" блоки, что приводит к снижению эффективности канала связи. Обратная связь степень заполнения БЗУ поддерживает постоянной.To optimize the operation of the system, it is desirable to maintain the filling level of the CCD approximately constant. If the BZU is full, then data loss will occur, that is, image quality deterioration. If the BZU is freed, then "empty" blocks are transmitted through the communication channel, which leads to a decrease in the efficiency of the communication channel. Feedback the degree of filling of the CCD keeps constant.

Сущность действия обратной связи (ОС) заключается в следующем. Если передается мелкоструктурное изображение и заполнение БЗУ увеличивается (память переполняется), то под воздействием ОС увеличивается параметр квантования ρ коэффициентов ДКП (формула 3). При этом число бит на каждый коэффициент уменьшается, и уровень потока данных поддерживается примерно постоянным. Наоборот, при передаче «гладких» изображений квантование становится более точным (параметр ρ в формуле 3 уменьшается, а коэффициенты квантования C_q - табл.1 - увеличиваются). Такой метод соответствует свойствам человеческого зрения: на мелкоструктурных изображениях менее заметны неточности в пределах уровней яркости. Конечно, изменение масштаба квантования в зависимости от содержания изображения отражается на качестве воспроизводимого изображения, изменяется уровень шумов квантования. Включенное в цепь обратной связи устройство управления коэффициентом сжатия УКС 25 повышает ее быстродействие. В результате, благодаря действию обратной связи, степень заполнения буферной памяти (БЗУ) в среднем поддерживается постоянной.The essence of feedback (OS) is as follows. If a fine-grained image is transmitted and the filling of the CCD increases (memory is full), then under the influence of the OS, the quantization parameter ρ of the DCT coefficients increases (formula 3). In this case, the number of bits per coefficient decreases, and the level of data flow is maintained approximately constant. On the contrary, when transmitting “smooth” images, quantization becomes more accurate (the parameter ρ in formula 3 decreases, and the quantization coefficients C _q - Table 1 - increase). This method corresponds to the properties of human vision: in fine-grained images, inaccuracies within brightness levels are less noticeable. Of course, a change in the quantization scale depending on the content of the image affects the quality of the reproduced image, the level of quantization noise changes. The UKS 25 compression ratio control device included in the feedback circuit increases its speed. As a result, due to the feedback action, the degree of filling of the buffer memory (BZU) is kept constant on average.

Наиболее предпочтительными схемами кодирующего устройства системы ONM является схема, изображенная на фиг.7, где прореживание отсчетов осуществляется до ДКП. Более высокая эффективность сжатия здесь объясняется более сильной связью между элементами в блоке, чем между коэффициентами ДКП. Схема фиг.7 обеспечивает более высокое качество изображения, чем схема, где прореживание осуществляется после ДКП. Выбор той или иной схемы кодирования определяется техническими данными на проектирование системы.The most preferred circuits of the encoder of the ONM system is the circuit shown in Fig.7, where the thinning of the samples is carried out before DCT. The higher compression efficiency here is due to the stronger coupling between the elements in the block than between the DCT coefficients. The scheme of Fig. 7 provides higher image quality than the scheme where decimation is carried out after DCT. The choice of a particular coding scheme is determined by the technical data for the design of the system.

Прореживание отчетов по сравнению с общеизвестными имеет свои особенности, которые улучшают визуальную четкость изображения N-полей (фиг.8):Thinning reports in comparison with the well-known has its own characteristics, which improve the visual clarity of the image of N-fields (Fig):

- в 2, 6, 10, …, (4n-2) N-полях передаются только 2, 6, 10, …, (4z-2) строки с половинным количеством отсчетов (2, 4, 6, 8, …, - прореживание) (фиг.8, в). Информация в 4, 8, 12,… строках не передается;- in 2, 6, 10, ..., (4n-2) N-fields, only 2, 6, 10, ..., (4z-2) lines with half the number of samples (2, 4, 6, 8, ..., - thinning) (Fig. 8, c). Information in 4, 8, 12, ... lines is not transmitted;

- в 4, 8, 12, …, (4n) N-полях передаются строки с номерами (фиг.8, г) 4, 8, 12, …, (4z), а информация в 2, 6, 10, … строках не передается. (Здесь n и z-порядковые номера: 1, 2, 3, 4, 5, 6, …).- in 4, 8, 12, ..., (4n) N-fields, lines with numbers are transmitted (Fig. 8, d) 4, 8, 12, ..., (4z), and information in 2, 6, 10, ... lines not getting through. (Here n and z-serial numbers: 1, 2, 3, 4, 5, 6, ...).

Такой порядок прореживания отсчетов в N-полях повышает визуальную четкость N-полей до исходной, так как создается виртуальный полноценный N-полукадр (поле) из двух смежных полей при использовании 1-го варианта перестановки полей. Следует отметить, что в результате прореживания N-полей ортогональная структура отсчетов сохраняется, что весьма важно для ДКП.This order of thinning samples in N-fields increases the visual clarity of N-fields to the original, since a virtual full-fledged N-half-frame (field) is created from two adjacent fields when using the 1st option of field permutation. It should be noted that as a result of thinning N-fields, the orthogonal sample structure is preserved, which is very important for DCT.

Декодирующее устройство (фиг.9) осуществляет обратный процесс преобразования видеосигнала из цифровой формы в аналоговую. В декодирующем устройстве (фиг.9) выполняются операции декодирования кодов переменной длины (декодирование энтропийного кода) 3, 6, деквантование 4, 7, обратное ДКП (ДКП^-1) 5, 8 и восстанавливается исходная последовательность полей с импульсно-кодовой модуляцией (ИКМ).The decoding device (Fig. 9) performs the inverse process of converting a video signal from a digital form to an analog one. In the decoding device (Fig. 9), the operations of decoding codes of variable lengths (decoding the entropy code) 3, 6, dequantization 4, 7, inverse DCT (DCT ^-1 ) 5, 8 are performed, and the original sequence of fields with pulse-code modulation (PCM) is restored )

Декодер содержит буферное запоминающее устройство (БЗУ) 1, демультиплексор (ДМп) 2 для разделения цифровых потоков, декодеры кодов с переменной длиной кодовых слов (декодирование энтропийного кода) сигналов полей О, N, М, деквантователи (Кв^-1) 4, 7, блоки обратного дискретного косинусного преобразования (ДКП^-1) 5, 8 и восстанавливания отсчетов (↑2) 9, предсказатели и запоминающие устройства 12, 14 с задержкой сигналов на

и τ=2Т_П (Т_П, Т_с - длительность поля и строки), аналогичные соответствующим блокам кодера.The decoder contains a buffer memory (CCD) 1, a demultiplexer (DMP) 2 for separating digital streams, decoders for codes with a variable length of code words (decoding of the entropy code) of the signals of the O, N, M fields, dequantizers (Qu ^-1 ) 4, 7, blocks of the inverse discrete cosine transform (DCT ^-1 ) 5, 8 and restoration of samples (↑ 2) 9, predictors and

storage devices

12, 14 with a delay of signals on

and τ = 2Т _П (Т _П , Т _с - field and line duration), similar to the corresponding encoder blocks.

Тактовая частота восстанавливается с использованием данных из декодирующего потока.The clock frequency is restored using data from the decoding stream.

БЗУ на входе декодера выполняет функцию согласования постоянной скорости передачи двоичных символов во входном потоке данных с процессами в декодере, то есть восстанавливается исходная скорость цифрового потока кодера без учета регулировки УКС (фиг.7).BZU at the input of the decoder performs the function of matching a constant bit rate in the input data stream with the processes in the decoder, that is, the original speed of the digital encoder stream is restored without taking into account the UKS adjustment (Fig. 7).

С выхода буферного запоминающего устройства кодированные данные изображения и значения параметров квантования поступают на блоки декодирования энтропийного кода 3,6 и далее на деквантователи (Кв^-1) 4, 7, а сигналы векторов движения поступают на блок декодирования энтропийного кода 10 и предсказатель 12.From the output of the buffer memory, the encoded image data and the values of the quantization parameters are sent to the decoding units of the entropy code 3.6 and then to the dequantizers (Qu ^-1 ) 4, 7, and the signals of the motion vectors are sent to the decoding unit of the entropy code 10 and the predictor 12.

В результате выполненных преобразований на выходе электронного коммутатора ЭК 17 получаем исходную последовательность O-, N- и М-полей с импульсно-кодовой модуляцией (ИКМ).As a result of the transformations performed at the output of the electronic switch EC 17, we obtain the initial sequence of O-, N- and M-fields with pulse-code modulation (PCM).

Рассмотрим более подробно процесс образования исходных сигналов O-, М- и N-полей.Let us consider in more detail the process of the formation of the initial signals of O-, M- and N-fields.

Цифровой сигнал с выхода блока ДКП^-1 5 содержит опорный сигнал O-полей, передаваемый с внутрикадровым кодированием, и разностный сигнал М-полей (межкадровая разность двух нечетных полей соседних кадров). Во втором канале, кроме декодирования энтропийного кода, деквантования и обратного ДКП 8, в сигнале N происходит восстановление (интерполяция) пропущенных отсчетов в блоке 9. Восстановление ИКМ в сигналах N и М происходит следующим образом.The digital signal from the output of the DCT ^{-1 -1} block contains the O-field reference signal transmitted with intraframe coding, and the M-field difference signal (interframe difference of two odd fields of adjacent frames). In the second channel, in addition to decoding the entropy code, dequantization, and inverse DCT 8, in the signal N, the missing samples are restored (interpolated) in block 9. The PCM is restored in the N and M signals as follows.

При появлении сигнала О на выходе ДКП^-1 5 электронный коммутатор 17 пропускает сигнал «О» на выход схемы (положение 1 переключателя). Одновременно сигнал О через сумматор 11 поступает на предсказатель 14 (запоминающее устройство ЗУ на время

- одного поля и полстроки

) и через схему И 13 на предсказатель 12 (ЗУ с τ_З=2Т_П). При появлении сигнала N в сумматоре 15 происходит сложение его с задержанным опорным O-сигналом и образование ИКМ сигнала N, который через блок 16 поступает через ЭК 17 на выход схемы (переключатель в положение 2).When a signal O appears at the output of DCT ^-1 5, the electronic switch 17 passes the signal “O” to the output of the circuit (position 1 of the switch). At the same time, the signal O through the adder 11 is fed to the predictor 14 (memory device for a while

- one field and half a line

) and through the circuit And 13 to the predictor 12 (memory with τ _W = 2T _P ). When the signal N appears in the adder 15, it is added to the delayed reference O-signal and the PCM signal N is formed, which, through block 16, enters the circuit output through EC 17 (switch to position 2).

Сигнал М поступает на сумматор 11, где происходит его сложение с задержанным на τ_З=2Т_П опорным сигналом "О". С выхода сумматора 11 восстановленный М-сигнал с ИКМ через блок 16 поступает через ЭК 17 (переключатель в положение 2) на выход схемы.The signal M supplied to the adder 11 where it is delayed by the addition with τ _W = 2T _n reference signal "O". From the output of the adder 11, the restored M-signal from the PCM through the block 16 is fed through the EC 17 (switch to position 2) to the output of the circuit.

Таким образом, все три сигнала (О, N и М) с ИКМ, пройдя в блоке 18 обратную перестановку полей, приобретают исходную очередность следования полей О, N, М (фиг.4, а) и могут быть поданы на ЦАП (на схеме не показан).Thus, all three signals (O, N and M) with PCM, having passed the field rearrangement in block 18, acquire the initial sequence of following the fields O, N, M (Fig. 4, a) and can be applied to the DAC (in the diagram not shown).

Подводя итоги сказанному, можно сделать следующие выводы:Summing up the above, we can draw the following conclusions:

1. Предложенный способ выгодно отличается от существующих способов сжатия цифрового потока видеосигнала в канале связи. Не уступая в эффективности сжатия цифрового потока стандарту MPEG-2, предложенный способ значительно проще и легче реализуется на практике. Известные способы сжатия цифрового потока MPEG-4, MPEG-7, вейвлет-преобразование сложнее предложенного способа в смысле технической реализации, поэтому ограничены в своем применении. В прототипе невозможна перестановка полей для повышения визуальной четкости: в предлагаемом способе поля с высокой и пониженной четкостью меняются местами это создает виртуальный полноценный кадр. Этого в MPEG-2 нет, так как перестановка кадров (полей) осуществляется только для получения межкадровой (полевой) разностей, а кадры (поля) высокой и низкой четкости занимают при воспроизведении всегда свое определенное место в группе кадров.1. The proposed method compares favorably with existing methods for compressing a digital video signal stream in a communication channel. Not inferior in the efficiency of digital stream compression to the MPEG-2 standard, the proposed method is much simpler and easier to implement in practice. Known methods for compressing the digital stream MPEG-4, MPEG-7, wavelet transform are more complicated than the proposed method in the sense of technical implementation, therefore, they are limited in their application. In the prototype it is impossible to rearrange the fields to increase visual clarity: in the proposed method, the fields with high and low definition are swapped, this creates a virtual full frame. This is not the case in MPEG-2, since the permutation of frames (fields) is carried out only to obtain interframe (field) differences, while high and low definition frames (fields) always take their specific place in the group of frames during playback.

Качество изображения в предложенном способе выше, чем в прототипе благодаря перестановке соседних полей в видеопоследовательности кадров.The image quality in the proposed method is higher than in the prototype due to the permutation of adjacent fields in the video sequence of frames.

2. При необходимости система сжатия цифрового потока по предлагаемому способу легко переходит в режим работы «опорного кадра», когда передается последовательность полей, состоящая из кадров: (О,N), (N,О), (O,N), …, то есть поля О и N в каждом кадре меняются местами. В результате произведенной перестановки полей визуальная четкость изображения повышается. Все кадры визуально воспроизводятся с качеством изображения опорного кадра (виртуального).2. If necessary, the digital stream compression system according to the proposed method easily switches to the “reference frame” mode of operation when a sequence of fields consisting of frames is transmitted: (O, N), (N, O), (O, N), ..., that is, the fields O and N in each frame are swapped. As a result of the permutation of the fields, the visual clarity of the image increases. All frames are visually reproduced with the image quality of the reference frame (virtual).

Переход работы системы в режим работы «опорного поля (кадра)» в прикладном телевидении может осуществляться автоматически по сигналу тревоги.The transition of the system to the operating mode of the "reference field (frame)" in applied television can be carried out automatically by an alarm signal.

3. В системе, построенной по предлагаемому способу, имеются большие резервы для повышения эффективности сжатия цифрового потока.3. In the system built according to the proposed method, there are large reserves to improve the compression efficiency of the digital stream.

Рассмотрим пример повышения эффективности сжатия цифрового потока по предлагаемому способу.Consider an example of increasing the compression efficiency of a digital stream by the proposed method.

Пусть на вход кодирующего устройства поступает сигнал ИКМ с ортогональным расположением отсчетов в кадре (фиг.10, а). В нечетных М-полях для сохранения визуальной четкости структура отсчетов прореживается один раз в горизонтальном направлении в шахматном порядке (фиг.10, г): в 1, 5, 9, 13, …, (4n-3) строках передаются только нечетные элементы, а в 3, 7, 11, 15, …, (4n-1) строках - четные элементы. Такая передискретизация улучшает визуальную горизонтальную четкость. В четных N-полях сигнал по горизонтали прореживается дважды, то есть передается только каждый 4-й элемент по строке, но в шахматном порядке. По вертикали передаются все строки. Методика прореживания и восстановления отсчетов остается прежней (фиг.10, в, г) и визуальная четкость изображения в целом понижается незначительно.Let the PCM signal with the orthogonal arrangement of the samples in the frame be received at the input of the encoding device (Fig. 10, a). In odd M-fields, in order to maintain visual clarity, the sample structure is thinned once in the horizontal direction in a checkerboard pattern (Fig. 10, d): in 1, 5, 9, 13, ..., (4n-3) lines, only odd elements are transmitted, and in 3, 7, 11, 15, ..., (4n-1) lines - even elements. Such oversampling improves visual horizontal clarity. In even N-fields, the horizontal signal is thinned twice, that is, only every 4th element is transmitted along the line, but in a checkerboard pattern. All lines are transmitted vertically. The method of thinning and restoration of samples remains the same (Fig. 10, c, d) and the visual clarity of the image as a whole decreases slightly.

В нашем случае предполагается восстановление частоты отсчетов N-полей до первоначального значения, поэтому заметной потери качества изображения не будет. Естественно, перед прореживанием для уменьшения искажений стоит ФНЧ, который в схеме (фиг.7) для упрощения не показан.In our case, it is supposed to restore the sampling frequency of N-fields to the original value, so there will be no noticeable loss in image quality. Naturally, before thinning to reduce distortion, there is a low-pass filter, which is not shown in the circuit (Fig. 7) for simplicity.

Дополнительное прореживание отсчетов в рассмотренном случае повышает эффективность системы в сокращении цифрового потока более чем в 2 раза. В прототипе такая возможность отсутствует.The additional thinning of the samples in the considered case increases the efficiency of the system in reducing the digital stream by more than 2 times. In the prototype, this is not possible.

4. Значительно более сильные, чем в прототипе, корреляционные связи между сравниваемыми элементами обеспечивают эффективное устранение избыточности из телевизионных сообщений и делают систему перспективной при ее модернизации.4. Significantly stronger than in the prototype, the correlation between the elements being compared ensures the effective elimination of redundancy from television messages and makes the system promising for its modernization.

Приведем схемы кодирующего и декодирующего устройств, реализующие предлагаемый способ сжатия цифрового потока видеосигнала, к виду, требуемому при оформлении заявки (фиг.11 и 12).Let us cite the schemes of the encoding and decoding devices that implement the proposed method of compressing the digital video signal stream to the form required when processing the application (Figs. 11 and 12).

На чертежах приведены следующие обозначения:The drawings indicate the following notation:

Для фиг.11:For figure 11:

- блок предварительной обработки - 1;- pre-processing unit - 1;

- устройство вычитания - 2;- subtraction device - 2;

- ключевая схема И-3, 4, 8, 16;- the key scheme of I-3, 4, 8, 16;

- устройство сложения - 5, 13, 17;- addition device - 5, 13, 17;

- дискретно-косинусное преобразование - 6, 10;- discrete cosine transform - 6, 10;

- квантователи - 7, 11;- quantizers - 7, 11;

- блок прореживания сигналов N - 9;- block thinning signals N - 9;

- предсказатели с τ₃=1Т_П и τ₃=2Т_П - 12, 19;- predictors with τ ₃ = 1T _P and τ ₃ = 2T _P - 12, 19;

- блок обратного ДКП^-1 - 14;- block reverse DCT ^-1 - 14;

- деквантователь кв - 15;- dequantizer sq - 15;

- определитель движения - 18;- motion determinant - 18;

- мультиплексор - 24;- multiplexer - 24;

- устройство управления коэффициентом сжатия - 25;- device for controlling the compression ratio - 25;

- буферное запоминающее устройство - 26.- buffer storage device - 26.

Для фиг.12:For Fig:

- буферное запоминающее устройство - 1;- buffer storage device - 1;

- демультиплексор - 2;- demultiplexer - 2;

- декодер энтропийного кода - 3, 6, 10;- decoder of the entropy code - 3, 6, 10;

- деквантователь - 4, 7;- dequantizer - 4, 7;

- блок обратного дискретного косинусного преобразования - 5, 8;- block inverse discrete cosine transform - 5, 8;

- блок интерполирования отсчетов в канале N-полей - 9;- block interpolation of samples in the channel of N-fields - 9;

- сумматоры - 11, 15, 16;- adders - 11, 15, 16;

- предсказатели: τ₃=2Т_П - 12, τ₃=1Т_П - 14;- predictors: τ ₃ = 2T _P - 12, τ ₃ = 1T _P - 14;

- электронный коммутатор - 17;- electronic switch - 17;

- блок обратной перестановки полей - 18.- block reverse permutation of fields - 18.

На схемах представлены только те узлы и блоки, которые характеризуют особенности данного способа сжатия цифрового потока видеосигнала в канале связи.The diagrams show only those nodes and blocks that characterize the features of this method of compressing a digital video signal stream in a communication channel.

Рассмотрим кратко последовательность преобразований цифрового видеосигнала в представленных схемах (фиг.11 и 12).Consider briefly the sequence of transformations of the digital video signal in the presented schemes (Figs. 11 and 12).

Цифровой видеосигнал с ИКМ поступает на блок предварительной обработки сигнала 1, в котором происходит образование блоков изображения и перестановка нечетных и четных полей в последовательности кадров. С выхода блока 1 сигнал по верхнему каналу через ключевую схему 3 и сумматор 5 поступает на блок дискретного косинусного преобразования ДКП 6. В вычитателе 2 образуются два разностных сигнала М- и N-полей. Сигнал М-полей через ключ 4 и сумматор 5 поступает на блок ДКП 6. В блоках ДКП происходит преобразование матриц значений сигналов в матрицу коэффициентов ряда Фурье. Другой сигнал N-полей через ключ 8 и блок прореживания 9 поступает на ДКП 10. С блоков ДКП 6 и 10 сигналы О, М и N, пройдя квантование 7 и 11 и энтропийное кодирование 21 и 22, поступают на мультиплексор 24.The digital video signal from the PCM enters the signal preprocessing unit 1, in which the image blocks are formed and the odd and even fields in the sequence of frames are rearranged. From the output of block 1, the signal along the upper channel through the key circuit 3 and adder 5 is fed to the discrete cosine transform block of DCT 6. In subtracter 2, two difference signals of M- and N-fields are generated. The signal of the M-fields through the key 4 and the adder 5 is fed to the DCT block 6. In the DCT blocks, the matrix of signal values is converted to the matrix of coefficients of the Fourier series. Another N-field signal through key 8 and decimation unit 9 is fed to DCT 10. From DCT 6 and 10, signals O, M and N, having passed quantization 7 and 11 and entropy coding 21 and 22, are sent to multiplexer 24.

В петле обратной связи, охватывающей блоки: деквантователь кв^-1 15, блок обратного косинусного преобразования ДКП^-1 14, сумматоры 13, 17 и 20, предсказатели 12 с

и 19 с τ₃=2Т_П, полученный задержанный сигнал обратной связи в блоке 2 вместе с сигналом, поступающем с БПО, образует разностные сигналы М и N. Кроме того, с выхода БПО 1 входной сигнал через ключевую схему 16 поступает на определитель движения 18, на второй вход которого поступает задержанный сигнал полей О и М и определяется вектор движения, который, пройдя блок энтропийного кодирования 23, поступает на вход мультиплексора 24. Далее объединенный в мультиплексоре цифровой поток поступает на вход буферного запоминающего устройства БЗУ 26. С помощью БЗУ 26, блока управления коэффициентом сжатия УКС 25, квантователей 7 и 11, мультиплексора 24 осуществляется выравнивание скорости цифрового потока в канале связи независимо от содержания передаваемого изображения. Этим самым повышается эффективность использования канала связи, пропускная способность которого ограничена. Скомпрессированный цифровой поток с выхода БЗУ 26 поступает в канал связи.In the feedback loop covering the blocks: dequantizer kv ^-1 15, block of the inverse cosine transform DCT ^-1 14,

adders

13, 17 and 20, predictors 12 s

and 19 with τ ₃ = 2T _P , the received delayed feedback signal in block 2, together with the signal from the BPO, forms the difference signals M and N. In addition, from the output of the BPO 1, the input signal through the key circuit 16 is fed to the motion detector 18 , the second input of which receives the delayed signal of the fields O and M and determines the motion vector, which, having passed the entropy coding block 23, is fed to the input of the multiplexer 24. Next, the digital stream combined in the multiplexer is fed to the input of the buffer storage device BZU 26. Using BZU 26, the control unit for the compression ratio UKS 25,

quantizers

7 and 11, the multiplexer 24 is equalized speed digital stream in the communication channel, regardless of the content of the transmitted image. This increases the efficiency of using a communication channel, the throughput of which is limited. Compressed digital stream from the output of the BZU 26 enters the communication channel.

В приемном (декодирующем) устройстве скомпрессированный сигнал поступает на вход буферного запоминающего устройства БЗУ 1, в котором восстанавливается исходная плотность цифрового потока, то есть данные из БЗУ считываются неравномерно во времени.In the receiving (decoding) device, the compressed signal is fed to the input of the buffer storage device BZU 1, in which the original density of the digital stream is restored, that is, the data from the BZU are read unevenly in time.

С выходов демультиплексора 2 кодированные данные изображения и векторов движения поступают на блоки энтропийного кодирования 3, 6, 10. Разностные и опорные сигналы О, N и М поступают на деквантователи Кв^-1 4, 7 и блоки обратного косинусного преобразования ДКП^-1 5, 8. В блоке 9 происходит восстановление (интерполяция) пропущенных отсчетов в изображении N-полей. Далее из разностных сигналов N- и М-полей в блоках 11-15 происходит восстановление исходной структуры N- и М-полей с ИКМ. С помощью блоков 10, 11 и 12 из сигналов векторов движения образуются сигналы М-полей подвижных изображений. В предсказателе 14 с запоминающим устройством ЗУ

хранится предыдущее значение сигнала для образования в сумматоре 15 текущего значения сигнала N. Аналогичную функцию выполняет и предсказатель 12 для сигналов М-полей.From the outputs of demultiplexer 2, the encoded image data and motion vectors arrive at the entropy coding blocks 3, 6, 10. Difference and reference signals O, N, and M are fed to the

dequantizers Kv

^-1 4, 7 and the blocks of the inverse

cosine transform DCT

^-1 5, 8 In block 9, the restoration (interpolation) of the missing samples in the image of N-fields occurs. Then, from the difference signals of N- and M-fields in blocks 11-15, the initial structure of N- and M-fields with PCM is restored. Using

blocks

10, 11 and 12 of the signals of the motion vectors, the signals of the M-fields of moving images are formed. In the predictor 14 with a memory device

the previous signal value is stored for the current signal N to be formed in adder 15. A predictor 12 for M-field signals performs a similar function.

При приеме сигналов опорного O-поля переключатель электронного коммутатора находится в положении 1 - сигнал О проходит на выход схемы. На управляющий вход ЭК 17 поступают импульсы U₁ опорного поля О. При приеме макроблоков М- и N-полей переключатель ЭК находится в положении 2. В этом случае формирование выходного сигнала происходит путем поэлементного сложения поступающих с блоков обратного ДКП^-1 5, 8 значений разностей с предсказанным макроблоком, формируемым из элементов ранее декодированных изображений О- и М-полей с использованием декодированных векторов движения в М-полях (блоки 10, 11, 12). С выхода ЭК 17 декомпрессированный цифровой поток с ИКМ поступает на блок обратной перестановки полей ОПП 18, в котором восстанавливается исходная последовательность полей.When receiving signals of the reference O-field, the switch of the electronic switch is in position 1 - the signal O passes to the output of the circuit. Pulses U _{1 of the} reference field O are received at the control input of EC 17. When receiving macroblocks of M- and N-fields, the EC switch is in position 2. In this case, the output signal is formed by elementwise addition of the inverted DCT ^-1 5, 8 values differences with the predicted macroblock formed from elements of previously decoded images of O- and M-fields using decoded motion vectors in M-fields (blocks 10, 11, 12). From the output of EC 17, the decompressed digital stream from the PCM enters the reverse field permutation block OPP 18, in which the original sequence of fields is restored.

Преимуществом предлагаемого изобретения является улучшение качества изображения при относительно более простой схемной его реализации, что имеет не только экономический, но и социальный эффект, так как может найти широкое применение во многих областях цифрового телевидения.An advantage of the invention is the improvement of image quality with a relatively simpler circuit implementation, which has not only economic, but also social effect, as it can be widely used in many areas of digital television.

ЛитератураLiterature

1. Лебедев Д.С., Цуккерман И.И. Телевидение и теория информации. - Л.: Энергия, 1965. - 219 с.1. Lebedev D.S., Tsukkerman I.I. Television and information theory. - L.: Energy, 1965 .-- 219 p.

2. Цифровое телевидение. / Кривошеев М.И., Виленчик Л.С. и др. - М.: Связь, 1980. - 260 с.2. Digital television. / Krivosheev M.I., Vilenchik L.S. et al. - M .: Communication, 1980. - 260 p.

3. Цифровое кодирование телевизионных изображений. / Цуккерман И.И., Кац Б.М. и др. - М.: Радио и связь, 1983. - 239 с.3. Digital coding of television images. / Zuckerman I.I., Katz B.M. et al. - M.: Radio and Communications, 1983 .-- 239 p.

4. Методы передачи изображений. Сокращение избыточности. / Уильям Прэтт, Давид Д.Сакрисон и др. / Перевод с англ. - М.: Радио и связь, 1983. - 284 с.4. Image transfer methods. Reduction of redundancy. / William Pratt, David D. Sacrison et al. / Translation from English. - M .: Radio and communications, 1983. - 284 p.

5. Трофимов Б.Е., Куликовский О.В. Передача изображений в цифровой форме. - М.: Связь, 1980. - 120 с.5. Trofimov B.E., Kulikovsky OV Digital image transmission. - M.: Communication, 1980 .-- 120 p.

6. У.Прэтт. Цифровая обработка изображений. - М.: Издательство "Мир", 1982, Т.1-Т.2. - 790 с.6. W. Pratt. Digital image processing. - M.: Mir Publishing House, 1982, T.1-T.2. - 790 s.

7. Кретцмер (Kretzmer E.R.) Reduce-alphabet representation of television signals, "Conv Rec. IRE", 1956, 4, - c.140.7. Kretzmer (Kretzmer E.R.) Reduce-alphabet representation of television signals, "Conv Rec. IRE", 1956, 4, p. 140.

8. Грэхем (Graham R.E.). Communication theory applied to television coding "Acta Electronica", 1957/58, N1-2. - c.333.8. Graham (Graham R.E.). Communication theory applied to television coding "Acta Electronica", 1957/58, N1-2. - c. 333.

9. Ануфриев И.К. Использование цифровых методов обработки сигналов в телевидении. // Средства связи. - 1989, №4. - с.3-6.9. Anufriev I.K. The use of digital signal processing methods in television. // Means of communication. - 1989, No. 4. - p. 3-6.

10. Смирнов В.М., Сорин В.Я. Передача цифровых ТВ сигналов методами двумерного кодирования. - Техника средств связи. Сер. Техника телевидения, вып.6, 1987. - С.31-37.10. Smirnov V.M., Sorin V.Ya. Transmission of digital TV signals by two-dimensional coding methods. - Communication technology. Ser. Technique of Television, issue 6, 1987. - P. 31-37.

11. Ярославский. Введение в цифровую обработку изображений. - М.: «Советское радио», 1979. - 312 с.11. Yaroslavsky. Introduction to digital image processing. - M .: "Soviet Radio", 1979. - 312 p.

12. Быков Р.Е. Теоретические основы телевидения. - Санкт-Петербург: Изд. «Лань», 1990. - 287 с.12. Bykov R.E. The theoretical foundations of television. - St. Petersburg: Publishing. "Doe", 1990. - 287 p.

13. Левченко В.Н. Спутниковое телевидение. - «BHV - Санкт-Петербург»: «Издательство Арлит», 1998. - 288 с.13. Levchenko V.N. Satellite television. - “BHV - St. Petersburg”: “Arlit Publishing House”, 1998. - 288 p.

14. Н.Н.Красильников. Теория передачи и восприятия изображений. - М.: Радио и связь, 1986. - 246 с.14.N.N.Krasilnikov. Theory of transmission and perception of images. - M .: Radio and communications, 1986. - 246 p.

15. Птачек М. Цифровое телевидение. Теория и техника. / Пер. с чеш. / Под ред. Л.С.Виленчика. - М.: Радио и связь. 1990. - 528 с.15. Ptachek M. Digital television. Theory and technology. / Per. from Czech / Ed. L.S. Vilenchik. - M .: Radio and communication. 1990 .-- 528 s.

16. Безруков В.Н. Разработка и применение элементов теории преобразование сигналов изображений в системах прикладного телевидения. Авторская диссертация на соискание ученой степени доктора технических наук. - М.: 1996. - 45 с.16. Bezrukov V.N. Development and application of elements of the theory of the transformation of image signals in applied television systems. Author's dissertation for the degree of Doctor of Technical Sciences. - M .: 1996. - 45 p.

17. Зубарев Ю.Б., Кривошеев М.И., Красносельский И.Н. Цифровое телевизионное вещание. - М.: 2001. - 521 с.17. Zubarev Yu.B., Krivosheev M.I., Krasnoselsky I.N. Digital television broadcasting. - M .: 2001. - 521 p.

18. Телевидение: Учебник для ВУЗов. / Под ред. В.Е.Джаконии. - М.: Радио и связь, 2004. - 615 с.18. Television: Textbook for high schools. / Ed. V.E.Jaconia. - M .: Radio and communications, 2004 .-- 615 p.

19. Цифровая обработка телевизионных и компьютерных изображений. / Под ред. Ю.Б.Зубарева и В.П.Дворковича. - М.: МЦНТИ. 1997. - 255 с.19. Digital processing of television and computer images. / Ed. Yu.B. Zubarev and V.P. Dvorkovich. - M .: ICSTI. 1997 .-- 255 p.

20. Локшин Б.А. Цифровое вещание: - от студии к телезрителю. - М.: 2001. - 446 с.20. Lokshin B.A. Digital broadcasting: - from studio to viewer. - M .: 2001. - 446 p.

21. Смирнов А.В. Основы цифрового телевидения. - М.: 2001. - 224 с.21. Smirnov A.V. The basics of digital television. - M .: 2001. - 224 p.

22. Брайс Р. Справочник по цифровому телевидению. Пер. с англ. - г.Жуковский: Эра, 2001. - 230 с.22. Bryce R. Handbook of Digital Television. Per. from English - Zhukovsky: Era, 2001 .-- 230 p.

23. Цифровое преобразование изображений. / Под ред. Проф. Р.Е.Быкова. - М.: Горячая линия - Телеком, 2003. - 230 с.23. Digital image conversion. / Ed. Prof. R.E.Bykova. - M .: Hot line - Telecom, 2003 .-- 230 p.

24. Смирнов А.В., Пескин А.Е. Цифровое телевидение: - от теории к практике. - М.: Горячая линия - Телеком, 2005. - 352 с.24. Smirnov A.V., Peskin A.E. Digital television: - from theory to practice. - M .: Hot line - Telecom, 2005 .-- 352 p.

25. Владо Демьяновски. Библия охранного телевидения. / Пер. с англ. - М.: «Ай-Эс-Эс Пресс», 2003. - 344 с.25. Vlado Demjanowski. Bible CCTV. / Per. from English - M .: "IS-ES Press", 2003. - 344 p.

26. Петраков А.В., Лагутин B.C. Телеохрана. - М.: Энергоиздат. - 1998. 372 с.26. Petrakov A.V., Lagutin B.C. Teleguard. - M.: Energoizdat. - 1998.372 s.

27. Гласман. К. «MPEG - это просто». // Журнал «625», 2000, №3. - с.5-45.27. Glassman. K. "MPEG is easy." // Magazine "625", 2000, No. 3. - p. 5-45.

28. Преобразование стандартов. Применение технических решений. // Журнал «625». 2005. №7. - с.79.28. Converting standards. Application of technical solutions. // Magazine "625". 2005. No. 7. - p. 79.

29. Гласман К. Видеокомпенсация. // Журнал «625». 1997. №7. - с.60.29. Glassman K. Videocompensation. // Magazine "625". 1997. No. 7. - p. 60.

30. Балобанов В.Г. Способ передачи и приема цифрового телевизионного сигнала. Авторское свидетельство №1642594. Бюл. 14. - 1991.30. Balobanov V.G. A method of transmitting and receiving a digital television signal. Copyright certificate No. 1642594. Bull. 14. - 1991.

31. Балобанов В.Г. Способ сжатия видеосигнала в цифровой форме. Патент №2012157. Бюл. №8. - 1994.31. Balobanov V.G. A method of compressing a video signal in digital form. Patent No. 2012157. Bull. No. 8. - 1994.

32. Безруков В.Н., Балобанов В.Г. Способ передачи дополнительной информации по телевизионному каналу. Авторское свидетельство №586572. Бюл. №48. - 1977.32. Bezrukov V.N., Balobanov V.G. A method of transmitting additional information on a television channel. Copyright certificate No. 586572. Bull. No. 48. - 1977.

33. Артюшенко В.М., Шелухин О.И., Афонин М.Ю. Цифровое сжатие видеоинформации и звука: Учебное пособие. / Под ред. Артюшенко В.М. - М.: Изд. Торг. Корпорации «Дашков и Ко». 2003. - 426 с.34. Катаев С.И., Хромой Б.П., Безруков В.Н., Балобанов В.Г. Авторское Свидетельство №301675. Бюллетень изобретений №14, 1971.33. Artyushenko V.M., Shelukhin O.I., Afonin M.Yu. Digital compression of video information and sound: a Training manual. / Ed. Artyushenko V.M. - M.: Publishing. Bargain. Corporation "Dashkov & Co." 2003 .-- 426 p. 34. Kataev S.I., Khromoi B.P., Bezrukov V.N., Balobanov V.G. Copyright Certificate No. 301675. Bulletin of inventions No. 14, 1971.

Claims

A method of compressing a digital video signal stream, namely, when converting a video signal in order to reduce the digital stream speed in a communication channel in a preprocessing unit (BPO), the image is divided into blocks of 8 × 8 elements and the odd and even fields in each second frame are interchanged in of the entire video sequence, then form a group of fields consisting of signals of the reference O-fields and two difference signals of N- and M-fields, subject them to discrete cosine transform (DCT), quantization, and entropy coding and multiplexing, and for moving objects in the image of M-fields, instead of the values of the elements of the moving blocks, only their coordinates are transmitted - motion vectors, which are also entropy encoded and multiplexed into a common compressed digital stream, the speed of which, thanks to feedback, consisting of a buffer memory (BZU), the control unit of the compression ratio (UKS), quantizers and multiplexer is made constant regardless of the image structure, characterized in that , in order to eliminate redundancy from TV images and, therefore, reduce the speed of the digital stream in the communication channel, the sequence of fields of the digital video signal with pulse-code modulation are divided into groups consisting of:
O-fields, which are transmitted with intraframe coding and are reference for decoding the remaining fields of the group, providing the ability to decode and play the received TV signal;
M-fields transmitted with inter-frame coding by motion-compensated prediction over the nearest field of an adjacent frame;
N-fields, the transmission of which uses interfield coding with prediction from the previous O- and M-fields, in order to increase the clarity of the image, the odd and even fields in each group of frames are interchanged with the necessary restoration of their sequence in the decoding device, and for additional reduction of the digital stream in the communication channel in N-fields, the samples are thinned out by rows and columns in the matrix of signal values with their subsequent restoration on the receiving side.