JP6236731B1

JP6236731B1 - Super-resolution processing apparatus, super-resolution processing method, and computer program

Info

Publication number: JP6236731B1
Application number: JP2017046113A
Authority: JP
Inventors: 修二奥野
Original assignee: TSUBASA FACTORY CO., LTD.
Current assignee: TSUBASA FACTORY CO., LTD.
Priority date: 2017-03-10
Filing date: 2017-03-10
Publication date: 2017-11-29
Anticipated expiration: 2037-03-10
Also published as: JP2018151747A

Abstract

Problem: To create a high-precision super-resolution image at high speed.
Solution: A super-resolution processing apparatus includes: a first-hierarchy filter acquisition unit that acquires, for each of a plurality of first-hierarchy learning subsets, each being a subset of a learning set including images of a third resolution higher than a second resolution, a first filter for super-resolution processing of a second-resolution image into a third-resolution image and a second filter for super-resolution processing of an image of a first resolution, lower than the second resolution, into a second-resolution image; a reduced image creation unit that creates a first-resolution reduced image from a second-resolution input image; a candidate image creation unit that creates second-resolution candidate images by super-resolution processing the first-resolution reduced image with each second filter; and a super-resolution processing unit that super-resolution processes the second-resolution input image using the first filter corresponding to the second filter for which the difference between the second-resolution candidate image and the second-resolution input image is smallest.
Selected drawing: FIG. 1

Description

The present invention relates to a super-resolution processing apparatus, a super-resolution processing method, and a computer program for performing super-resolution processing on an input image.

In recent years, super-resolution processing techniques that increase the resolution of an input image have been put into practical use (see, for example, Patent Document 1).

One known example of super-resolution processing feeds an input image into a machine-learned convolutional neural network (hereinafter "CNN"), which applies super-resolution processing to the input image and outputs a higher-resolution image. The CNN is built from original images and reduced versions of those originals: with the reduced image as input and the original image as output, the weights and biases of each layer of the network are machine-learned.

Patent Document 1: Japanese Unexamined Patent Application Publication (Translation of PCT Application) No. 2013-532878

Images come in many types, such as portrait photographs, landscape photographs, and illustrations, and their characteristics differ by type. For example, illustrations contain many edge components and therefore strong high-frequency components, whereas landscape images contain all frequency components evenly.

However, if these characteristics are ignored and a single CNN is used to super-resolution process every input image, the result may be unsatisfactory. For example, when a CNN machine-learned mostly from images rich in low-frequency components is applied to an image rich in high-frequency components, the reproducibility of the high-frequency components may be impaired. Conversely, when a CNN machine-learned mostly from images rich in high-frequency components is applied to an image rich in low-frequency components, high-frequency noise may be generated.

Moreover, it is generally difficult for a user to select the optimal one from a plurality of CNN presets. For this reason, a CNN conventionally had to be created from many types of learning images, as described above. When many types of learning images are used, the number of learning images becomes enormous, so machine learning takes time and the configuration of the CNN becomes complicated.

The present invention has been made in view of these circumstances, and its object is to provide a super-resolution processing apparatus, a super-resolution processing method, and a computer program that can create a high-precision super-resolution image at high speed.

To achieve the above object, a super-resolution processing apparatus according to one aspect of the present invention super-resolution processes a second-resolution image to create an image of a third resolution higher than the second resolution. The apparatus includes: a first-hierarchy filter acquisition unit that acquires, for each of a plurality of first-hierarchy learning subsets, each being a subset of a learning set including third-resolution images, a first filter and a second filter machine-learned using that learning subset, the first filter being for super-resolution processing of a second-resolution image into a third-resolution image and the second filter being for super-resolution processing of an image of a first resolution, lower than the second resolution, into a second-resolution image; a reduced image creation unit that creates a first-resolution reduced image from a second-resolution input image; a candidate image creation unit that creates second-resolution candidate images by super-resolution processing the first-resolution reduced image with each second filter acquired by the first-hierarchy filter acquisition unit; and a super-resolution processing unit that creates a third-resolution super-resolution image by super-resolution processing the second-resolution input image using the first filter corresponding to the second filter for which the difference between the second-resolution candidate image created by the candidate image creation unit and the second-resolution input image is smallest.

With this configuration, the second filter that minimizes the difference between the second-resolution candidate image and the second-resolution input image can be selected from the plurality of second filters, and the input image can then be super-resolution processed using the first filter corresponding to the selected second filter. Because each second filter is machine-learned from the same learning subset as its first filter, it has properties similar to those of the first filter, while the resolution of the images it takes as input is lower than that of the images the first filter takes as input. The second filter best suited to the input image can therefore be selected at high speed, and applying the corresponding first filter to the input image creates a high-precision super-resolution image at high speed.
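The selection described above can be sketched as follows. This is only an illustrative outline, not the patented implementation: images are modeled as nested lists of grayscale values, filters as plain callables, and the names `select_first_filter`, `reduce_image`, and `filter_pairs` are hypothetical. The sum of squared differences is used as the difference measure, which is one of the two measures the description mentions.

```python
from typing import Callable, List, Sequence, Tuple

Image = List[List[float]]          # grayscale image as rows of pixel values
Filter = Callable[[Image], Image]  # any callable mapping an image to an image


def sum_squared_difference(a: Image, b: Image) -> float:
    """Sum of squared pixel differences between two same-sized images."""
    return sum((pa - pb) ** 2 for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def select_first_filter(
    input_image: Image,
    reduce_image: Callable[[Image], Image],
    filter_pairs: Sequence[Tuple[Filter, Filter]],
) -> Filter:
    """Return the first filter whose paired second filter best restores the input.

    Each pair is (first_filter, second_filter), both learned from one subset.
    """
    reduced = reduce_image(input_image)  # second resolution -> first resolution
    best_first, best_diff = None, float("inf")
    for first_filter, second_filter in filter_pairs:
        candidate = second_filter(reduced)  # first resolution -> second resolution
        diff = sum_squared_difference(candidate, input_image)
        if diff < best_diff:
            best_first, best_diff = first_filter, diff
    if best_first is None:
        raise ValueError("filter_pairs must not be empty")
    return best_first
```

The returned first filter would then be applied to the full second-resolution input image; only the cheaper second filters run once per candidate.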

Preferably, the super-resolution processing apparatus further includes a second-hierarchy filter acquisition unit that acquires, for each of a plurality of second-hierarchy learning subsets, each being a subset of the first-hierarchy learning subset used to machine-learn the second filter with the smallest difference, a first filter and a second filter machine-learned using that learning subset. The candidate image creation unit further creates second-resolution candidate images by super-resolution processing the first-resolution reduced image with each second-hierarchy second filter acquired by the second-hierarchy filter acquisition unit. The super-resolution processing unit further creates the third-resolution super-resolution image by super-resolution processing the second-resolution input image using the first filter corresponding to the second filter for which the difference between the second-resolution candidate image, created by the candidate image creation unit using the second-hierarchy second filters, and the second-resolution input image is smallest.

With this configuration, a second filter is selected in the first hierarchy, and, based on that selection, a second filter is further selected in the second hierarchy. The input image can then be super-resolution processed using the first filter corresponding to the selected second filter. Because the first and second filters are organized hierarchically, a second filter and its corresponding first filter can be selected efficiently, so a high-precision super-resolution image can be created at high speed.

A super-resolution processing method according to another aspect of the present invention is a method for operating an apparatus that super-resolution processes a second-resolution image to create an image of a third resolution higher than the second resolution. The method includes the steps of: acquiring, for each of a plurality of first-hierarchy learning subsets, each being a subset of a learning set including third-resolution images, a first filter and a second filter machine-learned using that learning subset, the first filter being for super-resolution processing of a second-resolution image into a third-resolution image and the second filter being for super-resolution processing of an image of a first resolution, lower than the second resolution, into a second-resolution image; creating a first-resolution reduced image from a second-resolution input image; creating second-resolution candidate images by super-resolution processing the first-resolution reduced image with each acquired second filter; and creating a third-resolution super-resolution image by super-resolution processing the second-resolution input image using the first filter corresponding to the second filter for which the difference between the created second-resolution candidate image and the second-resolution input image is smallest.

This configuration includes steps corresponding to the processing units of the super-resolution processing apparatus described above, and therefore provides the same operation and effects as that apparatus.

A computer program according to another aspect of the present invention is a computer program for super-resolution processing a second-resolution image to create an image of a third resolution higher than the second resolution. The program causes a computer to function as: a first-hierarchy filter acquisition unit that acquires, for each of a plurality of first-hierarchy learning subsets, each being a subset of a learning set including third-resolution images, a first filter and a second filter machine-learned using that learning subset, the first filter being for super-resolution processing of a second-resolution image into a third-resolution image and the second filter being for super-resolution processing of an image of a first resolution, lower than the second resolution, into a second-resolution image; a reduced image creation unit that creates a first-resolution reduced image from a second-resolution input image; a candidate image creation unit that creates second-resolution candidate images by super-resolution processing the first-resolution reduced image with each second filter acquired by the first-hierarchy filter acquisition unit; and a super-resolution processing unit that creates a third-resolution super-resolution image by super-resolution processing the second-resolution input image using the first filter corresponding to the second filter for which the difference between the second-resolution candidate image created by the candidate image creation unit and the second-resolution input image is smallest.

With this configuration, a computer can be made to function as the super-resolution processing apparatus described above, so a high-precision super-resolution image can be created at high speed.

Needless to say, the computer program according to the present invention can be distributed on a computer-readable non-transitory recording medium such as a CD-ROM (Compact Disc-Read Only Memory) or via a communication network such as the Internet. The present invention can also be realized as a semiconductor integrated circuit that implements part or all of the super-resolution processing apparatus, or as a system that includes the super-resolution processing apparatus.

According to the present invention, a high-precision super-resolution image can be created at high speed.

FIG. 1 is a block diagram showing the configuration of a super-resolution processing apparatus according to an embodiment of the present invention.
FIG. 2 is a diagram for explaining the image sizes and filters handled in the embodiment of the present invention.
FIG. 3 is a diagram showing an example of a hierarchically structured learning set and filters.
FIG. 4 is a flowchart showing the procedure of the filter creation processing executed by the super-resolution processing apparatus according to the embodiment of the present invention.
FIG. 5 is a flowchart showing the procedure of the super-resolution processing of an input image executed by the super-resolution processing apparatus according to the embodiment of the present invention.

Embodiments of the present invention are described in detail below with reference to the drawings. Each embodiment described below shows a preferred specific example of the present invention. The numerical values, shapes, components, arrangement and connection of components, steps, order of steps, and the like shown in the following embodiments are examples and are not intended to limit the present invention. The present invention is specified by the claims. Accordingly, among the components in the following embodiments, those not recited in the independent claims, which represent the broadest concept of the present invention, are not necessarily required to achieve the object of the present invention but are described as constituting a more preferred form.

FIG. 1 is a block diagram showing the configuration of a super-resolution processing apparatus according to an embodiment of the present invention. FIG. 2 is a diagram for explaining the image sizes and filters handled in the embodiment.

Referring to FIG. 1, the super-resolution processing apparatus 1 applies super-resolution processing to an input image to create an image with a higher resolution than the input image. It includes an input image acquisition unit 10, a reduced image creation unit 11, a machine learning unit 12, a first-hierarchy filter acquisition unit 13, a candidate image creation unit 14, a second-hierarchy filter acquisition unit 15, a super-resolution processing unit 16, and a storage device 17.

This embodiment describes an example in which super-resolution processing is applied to a 2K-size input image (1920 × 1080 pixels) to create a 4K-size image (3840 × 2160 pixels). The image sizes handled by the super-resolution processing apparatus 1 are not, however, limited to these.

The input image acquisition unit 10 acquires the input image to be super-resolution processed. For example, the input image acquisition unit 10 may read from the storage device 17 an image that the user has selected, using a keyboard or the like, from among the images stored there, and acquire it as the input image. The input image acquisition unit 10 may also acquire an input image by downloading it via a network or the like.

As shown in FIG. 2, in this embodiment the size (resolution) of the input image is 2K (the second resolution).

Referring again to FIG. 1, the reduced image creation unit 11 creates a reduced image by reducing the input image acquired by the input image acquisition unit 10.

As shown in FIG. 2, in this embodiment the reduced image creation unit 11 reduces the input image to 1/2 both vertically and horizontally. The size (resolution) of the reduced image is QHD (Quarter High Definition) size (960 × 540 pixels) (the first resolution).

In addition to the input image, the reduced image creation unit 11 also reduces the learning images prepared for machine learning of the super-resolution filters.

Specifically, a learning set containing a plurality of 4K-size learning images is assumed to be stored in the storage device 17 in advance. The learning set is assumed to contain various types of learning images, such as portrait photographs, landscape photographs, and illustrations. The reduced image creation unit 11 reads the learning set from the storage device 17 and, for each learning image in the set, creates a 2K-size reduced image, reduced to 1/2 both vertically and horizontally, and a QHD-size reduced image, reduced to 1/4 both vertically and horizontally.
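The preparation of the three resolutions can be sketched as below. The description does not state which reduction method is used, so averaging each 2 × 2 block is an assumption made here purely for illustration, as are the names `halve` and `make_learning_triplet` and the nested-list image model. The 1/4 reduction is obtained by halving twice.

```python
from typing import List, Tuple

Image = List[List[float]]  # grayscale image as rows of pixel values


def halve(image: Image) -> Image:
    """Reduce width and height to 1/2 by averaging each 2x2 block (assumed method)."""
    height, width = len(image), len(image[0])
    if height % 2 or width % 2:
        raise ValueError("width and height must be even")
    return [
        [
            (image[y][x] + image[y][x + 1] + image[y + 1][x] + image[y + 1][x + 1]) / 4.0
            for x in range(0, width, 2)
        ]
        for y in range(0, height, 2)
    ]


def make_learning_triplet(original: Image) -> Tuple[Image, Image, Image]:
    """Return (original, 1/2-size, 1/4-size) versions of one learning image."""
    half = halve(original)   # e.g. 4K -> 2K
    quarter = halve(half)    # e.g. 2K -> QHD
    return original, half, quarter
```

For a 3840 × 2160 original this yields 1920 × 1080 and 960 × 540 images, matching the sizes named in the embodiment.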

In the following, the 2K-size and QHD-size reduced images created from the learning images are also referred to as learning images. The learning set is taken to include not only the plurality of 4K-size learning images but also the 2K-size and QHD-size learning images created from them. In this embodiment the input image has the same size as the 2K-size learning images in the learning set, but the two sizes need not be the same; the learning images may differ in size from the input image.

The machine learning unit 12 uses the images in the learning set to create, by machine learning, filters for super-resolution processing. The machine learning unit 12 organizes the images in the learning set hierarchically and creates filters at each hierarchy.

FIG. 3 is a diagram showing an example of a hierarchically structured learning set and filters.
The machine learning unit 12 creates the hierarchically structured learning set and filters by the following procedure.

First, the machine learning unit 12 uses the 2K-size and 4K-size learning images in the learning set to create a CNN (Convolutional Neural Network) for super-resolution processing of a 2K-size image into a 4K-size image. As shown in FIG. 2, a CNN that super-resolution processes a 2K-size image into a 4K-size image is called a first CNN or first filter. The machine learning unit 12 creates the first CNN by machine learning the weights and biases of each CNN layer, with the 2K-size learning images as the CNN input and the 4K-size learning images as the CNN output. The first CNN created from the whole learning set is called the first CNN 0, as shown in FIG. 3.

Although this embodiment uses a CNN as the example of a filter for super-resolution processing, the filter is not limited to a CNN; the present invention can also be applied to other filters that are machine-learned from a plurality of learning images.

Next, the machine learning unit 12 uses the first CNN 0 to classify the learning set into three subsets (learning subsets 1 to 3). For example, the machine learning unit 12 performs this classification based on the difference between the image super-resolution processed by the first CNN 0 and the learning image.

Specifically, the machine learning unit 12 inputs each 2K-size learning image in the learning set to the first CNN 0 to create a 4K-size image.

Next, the machine learning unit 12 calculates the difference between the 4K-size image created by the first CNN 0 and the 4K-size learning image from which that 2K-size learning image was created. The difference between images can be obtained, for example, as the sum of squares or the sum of absolute values of the pixel value differences.
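The two difference measures named above can be written as follows. This is a small sketch with hypothetical function names, again modeling a grayscale image as a nested list of pixel values; a real image would typically have several channels, which are left out here.

```python
from typing import List

Image = List[List[float]]  # grayscale image as rows of pixel values


def _check_same_size(a: Image, b: Image) -> None:
    """Raise if the two images do not have identical dimensions."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("images must have the same size")


def sum_of_squared_differences(a: Image, b: Image) -> float:
    """Sum over all pixels of (a - b) squared."""
    _check_same_size(a, b)
    return sum((pa - pb) ** 2 for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def sum_of_absolute_differences(a: Image, b: Image) -> float:
    """Sum over all pixels of |a - b|."""
    _check_same_size(a, b)
    return sum(abs(pa - pb) for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
```

Either value is zero only when the two images are identical, and grows as the super-resolved image departs from the reference.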

Next, based on the calculated difference, the machine learning unit 12 classifies each 4K-size learning image into one of the three subsets.

For example, the machine learning unit 12 classifies 4K-size learning images whose difference is greater than a first threshold into learning subset 1. It classifies 4K-size learning images whose difference is greater than a second threshold (the second threshold being smaller than the first) and not greater than the first threshold into learning subset 2. It classifies 4K-size learning images whose difference is not greater than the second threshold into learning subset 3. The machine learning unit 12 also places the 2K-size and QHD-size learning images that the reduced image creation unit 11 created from a 4K-size learning image into the same learning subset as that 4K-size learning image. As a result, the learning set is classified into learning subsets 1 to 3, as shown in FIG. 3.
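The threshold rule above can be sketched as follows. The function names, the use of string identifiers for learning images, and the dictionary layout are assumptions for illustration; the threshold values themselves are not given in the description and must be supplied by the caller.

```python
from typing import Dict, List


def classify_by_difference(
    difference: float, first_threshold: float, second_threshold: float
) -> int:
    """Return the learning subset number (1, 2 or 3) for one learning image."""
    if second_threshold >= first_threshold:
        raise ValueError("second_threshold must be smaller than first_threshold")
    if difference > first_threshold:
        return 1  # largest error under the current first CNN
    if difference > second_threshold:
        return 2  # greater than the second threshold, not greater than the first
    return 3      # not greater than the second threshold


def split_learning_set(
    differences: Dict[str, float], first_threshold: float, second_threshold: float
) -> Dict[int, List[str]]:
    """Group image identifiers by subset; reduced versions share the same subset."""
    subsets: Dict[int, List[str]] = {1: [], 2: [], 3: []}
    for image_id, diff in differences.items():
        subsets[classify_by_difference(diff, first_threshold, second_threshold)].append(image_id)
    return subsets
```

Because a learning image and its reduced versions are kept together, classifying the 4K-size image by identifier is enough to place all three resolutions.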

Because this difference represents the error of the super-resolution processed image, the learning set is in effect classified into a plurality of learning subsets according to error.

Using each of learning subsets 1 to 3, the machine learning unit 12 creates a first CNN and a CNN that super-resolution processes a QHD-size image into a 2K-size image (called a second CNN or second filter).

For example, the machine learning unit 12 creates a first CNN by machine learning the weights and biases of each CNN layer, with the 2K-size learning images in learning subset 1 as the CNN input and the 4K-size learning images in learning subset 1 as the CNN output.

Likewise, the machine learning unit 12 creates a second CNN by machine learning the weights and biases of each CNN layer, with the QHD-size learning images in learning subset 1 as the CNN input and the 2K-size learning images in learning subset 1 as the CNN output.

The machine learning unit 12 creates a first CNN and a second CNN for learning subsets 2 and 3 in the same way.

The first CNN and second CNN created from learning subset i are written as first CNN i and second CNN i, respectively (i = 1 to 3).

The machine learning unit 12 further classifies each of learning subsets 1 to 3 into three subsets.

For example, the machine learning unit 12 uses the first CNN 1 to classify learning subset 1 into three subsets (learning subsets 1-1 to 1-3). The classification method is the same as that used for the learning set, except for what is being classified, so its detailed description is not repeated here.

The machine learning unit 12 also creates a first CNN and a second CNN for each of learning subsets 1-1 to 1-3. The CNNs are created in the same way as the first CNN and second CNN for learning subset 1, except for the learning subset used, so the detailed description is not repeated here.

For learning subsets 2 and 3 as well, the machine learning unit 12 classifies the learning subset and creates CNNs from the resulting learning subsets.

A learning subset obtained by classifying learning subset i is written as learning subset i-j (i = 1 to 3, j = 1 to 3). The first CNN and second CNN created from learning subset i-j are written as first CNN i-j and second CNN i-j, respectively.

Through the processing executed by the machine learning unit 12, a hierarchically structured classification tree of learning sets and CNNs is created, as shown in FIG. 3. Taking the learning set and first CNN0 as layer 0, layer 1 contains learning subsets i, first CNNi, and second CNNi. Layer 2 contains learning subsets i-j, first CNNi-j, and second CNNi-j.
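
The layered structure described above can be pictured as a small tree. The following is only an illustrative sketch: the `Node` class, its field names, and `build_empty_tree` are hypothetical and do not appear in this document, and identity functions stand in for trained CNNs.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional

Image = List[List[float]]          # grayscale image as rows of pixel values
Filter = Callable[[Image], Image]  # stands in for a trained CNN


@dataclass
class Node:
    """One node of the classification tree (hypothetical layout)."""
    name: str                        # "0" for the learning set, "1-2" for subset 1-2
    first_filter: Filter             # first CNN: 2K -> 4K
    second_filter: Optional[Filter]  # second CNN: QHD -> 2K (none at layer 0)
    children: List["Node"] = field(default_factory=list)


def build_empty_tree(branching: int = 3, depth: int = 2) -> Node:
    """Build the tree shape only; every filter is an identity placeholder."""
    identity: Filter = lambda img: img

    def make(name: str, level: int) -> Node:
        node = Node(name, identity, identity if level > 0 else None)
        if level < depth:
            for k in range(1, branching + 1):
                child_name = str(k) if level == 0 else f"{name}-{k}"
                node.children.append(make(child_name, level + 1))
        return node

    return make("0", 0)


tree = build_empty_tree()
print([child.name for child in tree.children[0].children])
```

With three branches and two layers below the root, the tree holds first CNN0 at the root, three filter pairs in layer 1, and nine filter pairs in layer 2.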

The first-layer filter acquisition unit 13 through the super-resolution processing unit 16, described below, are the processing units that perform super-resolution processing on an input image.

The first-layer filter acquisition unit 13 acquires the first CNN and the second CNN for each of the first-layer learning subsets 1 to 3.

The candidate image creation unit 14 uses the second CNNi acquired by the first-layer filter acquisition unit 13 to perform super-resolution processing on a QHD-size reduced image obtained by reducing the input image, creating a 2K-size image. The created 2K-size image is called a candidate image. In other words, the candidate image creation unit 14 creates three 2K-size candidate images by inputting the QHD-size reduced image into each second CNNi (i = 1 to 3).

For each of the second-layer learning subsets i-j, the second-layer filter acquisition unit 15 acquires the first CNNi-j and second CNNi-j that were machine-learned using learning subset i-j (i = 1 to 3, j = 1 to 3).

The candidate image creation unit 14 further uses the second CNNi-j acquired by the second-layer filter acquisition unit 15 to perform super-resolution processing on the QHD-size reduced image obtained by reducing the input image, creating a 2K-size image (hereinafter, a "candidate image"). In other words, the candidate image creation unit 14 creates a 2K-size candidate image by inputting the QHD-size reduced image into the second CNNi-j.

The super-resolution processing unit 16 calculates the difference between the 2K-size input image and each of the three 2K-size candidate images that the candidate image creation unit 14 created with the second CNNi. The method of calculating the difference between images is as described above.

The super-resolution processing unit 16 identifies the second CNNm used to create the candidate image with the smallest calculated difference, and identifies learning subset m, which was used for machine learning of that second CNNm.

The super-resolution processing unit 16 identifies learning subsets m-j (j = 1 to 3), which are the subsets of the identified learning subset m. For example, when m = 1, learning subsets 1-1 to 1-3 are identified.

The super-resolution processing unit 16 further identifies, from among the three candidate images created using the second CNNm-j of learning subsets m-j (j = 1 to 3), the candidate image whose difference from the 2K-size input image is smallest.

The super-resolution processing unit 16 creates a 4K-size super-resolution image by performing super-resolution processing on the 2K-size input image using the first CNNm-n that corresponds to the second CNNm-n used to create the identified candidate image.

The storage device 17 stores images and various data, and is constituted by an HDD (Hard Disk Drive), nonvolatile memory, volatile memory, or the like.

Next, the procedure of the processing executed by the super-resolution processing apparatus 1 is described.

FIG. 4 is a flowchart showing the procedure of the filter creation processing executed by the super-resolution processing apparatus 1 according to the embodiment of the present invention.

As shown in FIG. 4, from each 4K-size learning image in the learning set stored in the storage device 17, the reduced image creation unit 11 creates a 2K-size learning image reduced to 1/2 in both height and width, and a QHD-size learning image reduced to 1/4 in both height and width (S1).
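
As a rough illustration of step S1, the sketch below shrinks an image by an integer factor. The reduction method is not specified in this passage, so box averaging is an assumption made here, and the `downscale` helper is hypothetical. A tiny 8x8 grid stands in for a 4K learning image.

```python
def downscale(img, factor):
    """Shrink a grayscale image by an integer factor using box averaging.

    ASSUMPTION: box averaging is used only for illustration; the text does
    not say which reduction method the reduced image creation unit 11 uses.
    """
    h, w = len(img), len(img[0])
    if h % factor or w % factor:
        raise ValueError("image size must be divisible by factor")
    out = []
    for y in range(0, h, factor):
        row = []
        for x in range(0, w, factor):
            block = [img[y + dy][x + dx]
                     for dy in range(factor) for dx in range(factor)]
            row.append(sum(block) / len(block))
        out.append(row)
    return out


# Tiny stand-in for a 4K learning image (8x8 pixels).
img_4k = [[float(x + y) for x in range(8)] for y in range(8)]
img_2k = downscale(img_4k, 2)    # 1/2 in each direction -> 4x4
img_qhd = downscale(img_4k, 4)   # 1/4 in each direction -> 2x2
print(len(img_2k), len(img_qhd))
```

Each 4K learning image thus yields a triple (QHD, 2K, 4K) that can serve as input and target for training the second and first filters.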

Based on the 4K-size learning images in the learning set and the 2K-size reduced images, the machine learning unit 12 uses machine learning to create a first filter (first CNN0) for converting a 2K-size image into a 4K-size image (S2).

The machine learning unit 12 creates a 4K-size image by inputting each 2K-size learning image in the learning set into first CNN0. Next, the machine learning unit 12 calculates the difference between the 4K-size image created by first CNN0 and the 4K-size learning image from which that 2K-size learning image was created. Based on the calculated difference, the machine learning unit 12 classifies the 4K-size learning image into one of the three learning subsets 1 to 3 (S3).
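
Step S3 can be sketched as follows. Two things here are assumptions rather than statements from this passage: the difference is taken to be the mean squared error, and the images are ranked by that error and cut into equal-size groups. The actual difference measure and classification rule are defined earlier in the document. `upscale2` is a nearest-neighbour placeholder for first CNN0, and all function names are hypothetical.

```python
def mse(a, b):
    """Mean squared difference between two equally sized images."""
    n = len(a) * len(a[0])
    return sum((pa - pb) ** 2
               for ra, rb in zip(a, b) for pa, pb in zip(ra, rb)) / n


def upscale2(img):
    """Nearest-neighbour 2x upscale; a placeholder for first CNN0."""
    return [[p for p in row for _ in range(2)] for row in img for _ in range(2)]


def classify_by_error(pairs, first_filter, groups=3):
    """Split (low_res, high_res) pairs into subsets by reconstruction error.

    ASSUMPTION: pairs are ranked by error and cut into equal-size groups.
    """
    ranked = sorted(pairs, key=lambda p: mse(first_filter(p[0]), p[1]))
    size = -(-len(ranked) // groups)  # ceiling division
    return [ranked[k * size:(k + 1) * size] for k in range(groups)]


# Six toy pairs whose reconstruction error grows with the offset e.
pairs = [([[1.0]], [[1.0 + e] * 2 for _ in range(2)])
         for e in (0.0, 3.0, 1.0, 2.0, 5.0, 4.0)]
subsets = classify_by_error(pairs, upscale2)
print([len(s) for s in subsets])
```

The same routine could be reused for step S5 by passing first CNNi and the images of learning subset i instead of first CNN0 and the whole learning set.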

The machine learning unit 12 uses learning subset i (i = 1 to 3) to create first CNNi and second CNNi (S4), and stores the created CNNs in the storage device 17. The machine learning unit 12 may also create first CNNi and second CNNi by repeating machine learning while exchanging the learning subsets i (i = 1 to 3) as appropriate, so that the error with respect to each learning subset becomes smaller.

For each of learning subsets 1 to 3, the machine learning unit 12 creates a 4K-size image by inputting each 2K-size learning image in learning subset i (i = 1 to 3) into first CNNi. Next, the machine learning unit 12 calculates the difference between the 4K-size image created by first CNNi and the 4K-size learning image from which that 2K-size learning image was created. Based on the calculated difference, the machine learning unit 12 classifies the 4K-size learning image into one of the three learning subsets i-j (j = 1 to 3) (S5). Each of learning subsets 1 to 3 is thereby further classified into three learning subsets.

The machine learning unit 12 uses learning subsets i-j (i = 1 to 3, j = 1 to 3) to create first CNNi-j and second CNNi-j by machine learning (S6), and stores the created CNNs in the storage device 17. The machine learning unit 12 may also create first CNNi-j and second CNNi-j by repeating machine learning while exchanging the learning subsets i-j (i = 1 to 3, j = 1 to 3) as appropriate, so that the error with respect to each learning subset becomes smaller.

The filter creation processing shown in FIG. 4 creates the hierarchically structured CNNs shown in FIG. 3.

FIG. 5 is a flowchart showing the procedure of the super-resolution processing of an input image executed by the super-resolution processing apparatus 1 according to the embodiment of the present invention.

As shown in FIG. 5, the input image acquisition unit 10 acquires a 2K-size input image to be subjected to super-resolution processing (S11).

The reduced image creation unit 11 creates a QHD-size reduced image by reducing the 2K-size input image acquired by the input image acquisition unit 10 (S12).

The first-layer filter acquisition unit 13 acquires the first-layer filters of the classification tree shown in FIG. 3 (S13). That is, it acquires first CNN1 to first CNN3 and second CNN1 to second CNN3 from the storage device 17.

The candidate image creation unit 14 creates three 2K-size candidate images by inputting the QHD-size reduced image, obtained by reducing the input image, into second CNN1 to second CNN3 (S14).

The super-resolution processing unit 16 calculates the difference between the 2K-size input image and each of the three 2K-size candidate images created in step S14 (S15).

The super-resolution processing unit 16 identifies the second CNNm used to create the candidate image for which the difference calculated in step S15 is smallest (S16). For example, the second CNN used to create the candidate image with the smallest difference is identified as second CNN1.

The input to second CNN1 to second CNN3 is not limited to the QHD-size reduced image. For example, a partial image cut out from the input image may be input to second CNN1 to second CNN3. The partial image may be created by cutting out small regions from the input image at regular intervals, or by thinning out the pixels of the input image at regular intervals. The partial image may also be created by cutting out a portion of some of the images among a plurality of images expected to have properties similar to those of the input image. For example, when the input image is part of a moving image, some images may be taken from the moving image of the same scene as the input image, and a portion may be cut out from those images.

Second CNNm may then be identified in the same manner as above, from the difference between the partial image and the candidate images that second CNN1 to second CNN3 create from it. Such processing shortens the time needed to identify second CNNm.
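
The two ways of forming a partial image mentioned above (cutting out small regions at regular intervals, and thinning pixels at regular intervals) might look like the following sketch. The helper names `decimate` and `crop_patches` and their parameters are hypothetical, chosen only for illustration.

```python
def decimate(img, step):
    """Keep every `step`-th pixel in both directions (pixel thinning)."""
    return [row[::step] for row in img[::step]]


def crop_patches(img, patch, stride):
    """Cut out patch x patch regions whose top-left corners lie on a regular grid."""
    h, w = len(img), len(img[0])
    return [
        [row[x:x + patch] for row in img[y:y + patch]]
        for y in range(0, h - patch + 1, stride)
        for x in range(0, w - patch + 1, stride)
    ]


# 8x8 toy image whose pixel value encodes its position (10 * row + column).
img = [[10 * y + x for x in range(8)] for y in range(8)]
thinned = decimate(img, 2)            # 4x4 image
patches = crop_patches(img, 2, 4)     # four 2x2 patches
print(len(thinned), len(patches))
```

Either result is much smaller than the full image, which is why evaluating the candidate filters on it takes less time.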

The second-layer filter acquisition unit 15 acquires the second-layer filters, which are one layer below the first-layer second CNNm identified in step S16 (S17). That is, it reads first CNNm-j and second CNNm-j (j = 1 to 3) of the classification tree shown in FIG. 3 from the storage device 17. For example, it reads first CNN1-1 to 1-3 and second CNN1-1 to 1-3 from the storage device 17.

The candidate image creation unit 14 creates three 2K-size candidate images by inputting the QHD-size reduced image, obtained by reducing the input image, into second CNNm-j (j = 1 to 3) (S18).

The super-resolution processing unit 16 calculates the difference between the 2K-size input image and each of the three 2K-size candidate images created in step S18 (S19).

The super-resolution processing unit 16 identifies the second CNNm-n used to create the candidate image for which the difference calculated in step S19 is smallest (S20). For example, the second CNN used to create the candidate image with the smallest difference is identified as second CNN1-2.

The input to second CNNm-j (j = 1 to 3) is not limited to the QHD-size reduced image. For example, a partial image cut out from the input image may be input to second CNNm-j (j = 1 to 3). The partial image may be created by cutting out small regions from the input image at regular intervals, or by thinning out the pixels of the input image at regular intervals. The partial image may also be created by cutting out a portion of some of the images among a plurality of images expected to have properties similar to those of the input image. For example, when the input image is part of a moving image, some images may be taken from the moving image of the same scene as the input image, and a portion may be cut out from those images.

Second CNNm-n may then be identified in the same manner as above, from the difference between the partial image and the candidate images that second CNNm-j (j = 1 to 3) create from it. Such processing shortens the time needed to identify second CNNm-n.

The super-resolution processing unit 16 creates a 4K-size super-resolution image by performing super-resolution processing on the 2K-size input image using the first CNNm-n corresponding to the second-layer second CNNm-n identified in step S20 (S21). In the above example, super-resolution processing is performed using first CNN1-2.
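
Steps S13 to S21 amount to walking down the classification tree and, at each layer, keeping the child whose second filter best reproduces the input image from its reduced copy. The sketch below illustrates that descent under stated assumptions: nodes are plain dictionaries, the difference is taken to be the mean squared error (the actual measure is defined earlier in the document), and simple gain-scaled upscalers stand in for trained CNNs. All names are hypothetical.

```python
def mse(a, b):
    """Mean squared difference between two equally sized images (assumed metric)."""
    n = len(a) * len(a[0])
    return sum((pa - pb) ** 2
               for ra, rb in zip(a, b) for pa, pb in zip(ra, rb)) / n


def upscale2(img, gain=1.0):
    """Nearest-neighbour 2x upscale with a gain; a toy stand-in for a CNN."""
    return [[p * gain for p in row for _ in range(2)]
            for row in img for _ in range(2)]


def node(name, gain, children=()):
    """Tree node holding a paired first filter and second filter."""
    return {"name": name,
            "first": lambda img: upscale2(img, gain),
            "second": lambda img: upscale2(img, gain),
            "children": list(children)}


def select_leaf(input_2k, reduced, root):
    """At each layer, keep the child whose candidate image is closest to the input."""
    current = root
    while current["children"]:
        current = min(current["children"],
                      key=lambda c: mse(c["second"](reduced), input_2k))
    return current


root = node("0", 1.0, [
    node("1", 0.5, [node("1-1", 0.4), node("1-2", 0.5), node("1-3", 0.6)]),
    node("2", 1.0, [node("2-1", 0.9), node("2-2", 1.0), node("2-3", 1.2)]),
    node("3", 2.0, [node("3-1", 1.8), node("3-2", 2.0), node("3-3", 2.2)]),
])
input_2k = [[2.0] * 4 for _ in range(4)]   # toy stand-in for the 2K input image
reduced = [[2.0] * 2 for _ in range(2)]    # its reduced copy
chosen = select_leaf(input_2k, reduced, root)
output_4k = chosen["first"](input_2k)      # paired first filter produces the output
print(chosen["name"])
```

Only the children of the node chosen in layer 1 are evaluated in layer 2, so six second filters are run here rather than all twelve.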

As described above, according to the embodiment of the present invention, the classification tree can be used to search for the second CNN that minimizes the error when super-resolution processing is applied to the QHD-size reduced image obtained by reducing the 2K-size input image. The second CNN has properties similar to those of the first CNN, and the resolution of the input image handled by the second CNN is lower than that of the input image handled by the first CNN. The second CNN can therefore be searched for efficiently. Furthermore, performing super-resolution processing on the input image with the first CNN corresponding to the second CNN found in this way makes it possible to create a highly accurate super-resolution image with little error.

That is, the first CNN and the second CNN paired with it are machine-learned using the same learning images at different sizes. An image that the second CNN can super-resolve accurately can therefore be expected to be super-resolved accurately by the first CNN paired with that second CNN as well. Accordingly, selecting the first CNN paired with the second CNN that minimizes the error, and performing super-resolution processing on the input image with the selected first CNN, can be expected to reduce the error of the super-resolved input image.

In addition, because the second CNN is configured to be smaller than the first CNN, and the first CNN corresponding to the selected second CNN is chosen after the second CNN has been selected, the processing speed of selecting the first CNN can be improved. However, the second CNN need not necessarily be smaller than the first CNN.

In the embodiment described above, the learning set and each learning subset were classified into three learning subsets. The number of learning subsets is not limited to three, however; any number of two or more may be used.

Also, although the learning subsets form two layers as shown in FIG. 3, there may be one layer, or three or more layers.

In the embodiment described above, the learning set and the learning subsets were classified based on the difference between the super-resolved image and the learning image, but the classification method is not limited to this. For example, the type of image (illustration, landscape photograph, portrait photograph) may additionally be taken into account.

In the embodiment described above, the first CNN and the second CNN were described as receiving the entire area of an image as input and performing super-resolution processing on it. However, each CNN may instead receive as input each of a plurality of block regions into which the image is divided, perform super-resolution processing on each block region, and then merge the results. Each block region may overlap adjacent block regions, and block regions may differ in size. Configuring the CNNs in this way makes it possible to train a CNN using only part of a learning image, without inputting its entire area.
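
The block-wise variant can be sketched as follows. This is only one possible reading: fixed-size square blocks are used, overlapping output pixels are merged by averaging (the text does not say how the merge is done), and a nearest-neighbour upscaler stands in for the CNN. The function names and parameters are hypothetical.

```python
def upscale2(img):
    """Nearest-neighbour 2x upscale; a placeholder for a super-resolution CNN."""
    return [[p for p in row for _ in range(2)] for row in img for _ in range(2)]


def block_starts(length, block, step):
    """Start offsets of blocks along one axis, covering the whole axis."""
    starts = list(range(0, max(length - block, 0) + 1, step))
    if starts[-1] + block < length:
        starts.append(length - block)  # extra block so the far edge is covered
    return starts


def process_in_blocks(img, block, overlap, sr, scale=2):
    """Super-resolve overlapping blocks and merge them.

    ASSUMPTION: overlapping output pixels are averaged.
    """
    if not 0 <= overlap < block:
        raise ValueError("overlap must satisfy 0 <= overlap < block")
    h, w = len(img), len(img[0])
    acc = [[0.0] * (w * scale) for _ in range(h * scale)]  # sum of contributions
    cnt = [[0] * (w * scale) for _ in range(h * scale)]    # number of contributions
    step = block - overlap
    for y in block_starts(h, block, step):
        for x in block_starts(w, block, step):
            tile = [row[x:x + block] for row in img[y:y + block]]
            for dy, out_row in enumerate(sr(tile)):
                for dx, value in enumerate(out_row):
                    acc[y * scale + dy][x * scale + dx] += value
                    cnt[y * scale + dy][x * scale + dx] += 1
    return [[a / c for a, c in zip(ra, rc)] for ra, rc in zip(acc, cnt)]


img = [[float(6 * y + x) for x in range(6)] for y in range(6)]
merged = process_in_blocks(img, block=4, overlap=2, sr=upscale2)
print(len(merged), len(merged[0]))
```

With this placeholder filter the merged result equals the result of upscaling the whole image at once. A real CNN would generally give slightly different values near block borders, which is one reason to let the blocks overlap.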

The images processed by the super-resolution processing apparatus 1 are not limited to ordinary images such as illustrations and photographs. For example, distribution maps of measurement data, such as medical scan data including MRI (Magnetic Resonance Imaging) images and CT (Computed Tomography) images, may be processed. Any object to which super-resolution processing can be applied, such as a vector structure or a three-dimensional structure like a voxel image, can be used as input to the super-resolution processing apparatus 1.

Specifically, the super-resolution processing apparatus 1 may be configured as a computer system comprising a microprocessor, ROM, RAM, a hard disk drive, a display unit, a keyboard, a mouse, and the like. A computer program is stored in the RAM or on the hard disk drive. The super-resolution processing apparatus 1 achieves its functions through the microprocessor operating in accordance with the computer program. Here, the computer program is composed of a combination of instruction codes, each indicating a command to the computer, for achieving predetermined functions.

Furthermore, some or all of the components of the super-resolution processing apparatus 1 may be constituted by a single system LSI (Large Scale Integration). A system LSI is a super-multifunctional LSI manufactured by integrating a plurality of components on a single chip; specifically, it is a computer system that includes a microprocessor, ROM, RAM, and the like. A computer program is stored in the RAM. The system LSI achieves its functions through the microprocessor operating in accordance with the computer program.

Furthermore, some or all of the components of the super-resolution processing apparatus 1 may be constituted by an IC card or a stand-alone module that can be attached to and detached from the super-resolution processing apparatus 1. The IC card or module is a computer system comprising a microprocessor, ROM, RAM, and the like. The IC card or module may include the super-multifunctional LSI described above. The IC card or module achieves its functions through the microprocessor operating in accordance with the computer program. The IC card or module may be tamper-resistant.

The present invention may also be the method described above. The present invention may also be a computer program that implements the method on a computer, or a digital signal comprising the computer program.

Furthermore, the present invention may be the computer program or the digital signal recorded on a computer-readable non-transitory recording medium, such as a flexible disk, hard disk drive, CD-ROM, MO, DVD, DVD-ROM, DVD-RAM, BD (Blu-ray (registered trademark) Disc), or semiconductor memory. It may also be the digital signal recorded on such a non-transitory recording medium.

The present invention may also transmit the computer program or the digital signal via a telecommunication line, a wireless or wired communication line, a network typified by the Internet, data broadcasting, or the like.

The present invention may also be a computer system comprising a microprocessor and a memory, in which the memory stores the computer program and the microprocessor operates in accordance with the computer program.

The present invention may also be implemented by another independent computer system, by recording the program or the digital signal on the non-transitory recording medium and transferring it, or by transferring the program or the digital signal via the network or the like.

Furthermore, the above embodiment and the above modifications may be combined with one another.

The embodiment disclosed here should be considered illustrative in all respects and not restrictive. The scope of the present invention is indicated by the claims rather than by the description above, and is intended to include all modifications within the meaning and scope equivalent to the claims.

The present invention is useful in, for example, super-resolution processing apparatuses that create an image with a higher resolution than the input image.

Description of symbols
1 Super-resolution processing apparatus
10 Input image acquisition unit
11 Reduced image creation unit
12 Machine learning unit
13 First-layer filter acquisition unit
14 Candidate image creation unit
15 Second-layer filter acquisition unit
16 Super-resolution processing unit
17 Storage device

Claims

A super-resolution processing apparatus that performs super-resolution processing on a second-resolution image to create a third-resolution image having a higher resolution than the second resolution, the apparatus comprising:
a first-layer filter acquisition unit that acquires, for each of a plurality of first-layer learning subsets that are subsets of a learning set including third-resolution images, a first filter for super-resolution processing of a second-resolution image into a third-resolution image and a second filter for super-resolution processing of a first-resolution image, lower in resolution than the second resolution, into a second-resolution image, both filters having been machine-learned using that learning subset;
a reduced image creation unit that creates a first-resolution reduced image from a second-resolution input image;
a candidate image creation unit that creates second-resolution candidate images by performing super-resolution processing on the first-resolution reduced image using each of the second filters acquired by the first-layer filter acquisition unit; and
a super-resolution processing unit that creates a third-resolution super-resolution image by performing super-resolution processing on the second-resolution input image using the first filter corresponding to the second filter that minimizes the difference between the second-resolution candidate image created by the candidate image creation unit and the second-resolution input image.

The super-resolution processing apparatus according to claim 1, further comprising:
a second-layer filter acquisition unit that acquires, for each of a plurality of second-layer learning subsets that are subsets of the first-layer learning subset used for machine learning of the second filter that minimizes the difference, a first filter and a second filter machine-learned using that learning subset,
wherein the candidate image creation unit further creates second-resolution candidate images by performing super-resolution processing on the first-resolution reduced image using each of the second-layer second filters acquired by the second-layer filter acquisition unit, and
the super-resolution processing unit creates the third-resolution super-resolution image by performing super-resolution processing on the second-resolution input image using the first filter corresponding to the second-layer second filter that minimizes the difference between the second-resolution input image and the second-resolution candidate image created by the candidate image creation unit using that second-layer second filter.

A super-resolution processing method performed by an apparatus that performs super-resolution processing on a second-resolution image to create a third-resolution image having a higher resolution than the second resolution, the method comprising:
acquiring, for each of a plurality of first-layer learning subsets that are subsets of a learning set including third-resolution images, a first filter for super-resolution processing of a second-resolution image into a third-resolution image and a second filter for super-resolution processing of a first-resolution image, lower in resolution than the second resolution, into a second-resolution image, both filters having been machine-learned using that learning subset;
creating a first-resolution reduced image from a second-resolution input image;
creating second-resolution candidate images by performing super-resolution processing on the first-resolution reduced image using each of the acquired second filters; and
creating a third-resolution super-resolution image by performing super-resolution processing on the second-resolution input image using the first filter corresponding to the second filter that minimizes the difference between the created second-resolution candidate image and the second-resolution input image.

A computer program for performing super-resolution processing on a second-resolution image to create a third-resolution image having a higher resolution than the second resolution, the computer program causing a computer to function as:
a first-layer filter acquisition unit that acquires, for each of a plurality of first-layer learning subsets that are subsets of a learning set including third-resolution images, a first filter for super-resolution processing of a second-resolution image into a third-resolution image and a second filter for super-resolution processing of a first-resolution image, lower in resolution than the second resolution, into a second-resolution image, both filters having been machine-learned using that learning subset;
a reduced image creation unit that creates a first-resolution reduced image from a second-resolution input image;
a candidate image creation unit that creates second-resolution candidate images by performing super-resolution processing on the first-resolution reduced image using each of the second filters acquired by the first-layer filter acquisition unit; and
a super-resolution processing unit that creates a third-resolution super-resolution image by performing super-resolution processing on the second-resolution input image using the first filter corresponding to the second filter that minimizes the difference between the second-resolution candidate image created by the candidate image creation unit and the second-resolution input image.