[go: up one dir, main page]

WO2018235697A1 - Transmission apparatus, communication system, communication method, and recording medium - Google Patents

Transmission apparatus, communication system, communication method, and recording medium Download PDF

Info

Publication number
WO2018235697A1
WO2018235697A1 PCT/JP2018/022595 JP2018022595W WO2018235697A1 WO 2018235697 A1 WO2018235697 A1 WO 2018235697A1 JP 2018022595 W JP2018022595 W JP 2018022595W WO 2018235697 A1 WO2018235697 A1 WO 2018235697A1
Authority
WO
WIPO (PCT)
Prior art keywords
throughput
pattern
video data
division
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/JP2018/022595
Other languages
French (fr)
Japanese (ja)
Inventor
夏季 甲斐
裕志 吉田
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by NEC Corp filed Critical NEC Corp
Priority to JP2019525486A priority Critical patent/JP6825703B2/en
Publication of WO2018235697A1 publication Critical patent/WO2018235697A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F13/00Interconnection of, or transfer of information or other signals between, memories, input/output devices or central processing units
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N17/00Diagnosis, testing or measuring for television systems or their details
    • H04N17/02Diagnosis, testing or measuring for television systems or their details for colour television signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/156Availability of hardware or computational resources, e.g. encoding based on power-saving criteria
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/177Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/238Interfacing the downstream path of the transmission network, e.g. adapting the transmission rate of a video stream to network bandwidth; Processing of multiplex streams

Definitions

  • the present invention relates to a transmitting apparatus, a communication system, a communication method, and a recording medium, and, for example, to a transmitting apparatus that transmits compressed video data via a network.
  • LTE Long Term Evolution
  • Wi-Fi registered trademark
  • the encoding parameters include, for example, bit rate, frame rate, and GOP (Group of picture) size (or key frame interval).
  • the coding parameters can be changed by resetting the codec. The smaller the GOP size, the more granular the bit rate can be controlled.
  • the voice synthesizer (transmitting device) described in Patent Document 1 improves the image quality of video by increasing the GOP size of video data when the loss rate (loss rate of packets) is small, that is, when the throughput is high. Plan. On the other hand, when the loss rate is large, that is, when the throughput is low, the GOP size of video data is reduced. Thus, the speech synthesizer described in Patent Document 1 can control the bit rate with high granularity.
  • An object of the present invention is to improve the quality of experience of video transmitted through a network.
  • a transmitter in order to solve the above-mentioned problems, a throughput measuring means for measuring a throughput of communication with a receiving device, and a future based on the throughput measured by the throughput measuring means.
  • the pattern generation unit configured to generate a plurality of patterns each configured by a plurality of division times by dividing the predetermined time by dividing the predetermined time.
  • a bodily sensation quality calculating means for calculating a bodily sensation quality, and before the pattern generation means generates Among the plurality of patterns, pattern selection means for selecting one pattern that maximizes the sensory quality of the image calculated by the sensory quality calculation means, and the division according to the one pattern selected by the pattern selection means And parameter determining means for determining encoding parameters for the compression process of the video data for each time.
  • a communication method measures the throughput of communication with a receiving device to solve the above problem, predicts the future throughput based on the measured throughput, and determines a predetermined time. By dividing, a plurality of patterns respectively formed of a plurality of division times are generated, and at each maximum division rate which does not exceed the lower limit value of the future throughput for each division time according to the generated patterns
  • the sensory quality of the video to be reproduced is calculated from the video data, and one of the plurality of patterns is selected which makes the sensory quality of the video maximum.
  • An encoding parameter for the compression process of the video data is determined for each of the division times according to the selected one pattern.
  • a recording medium that measures throughput of communication with a receiving device and predicts future throughput based on the measured throughput. Generating a plurality of patterns respectively configured by a plurality of division times by dividing a predetermined time, and not exceeding the lower limit value of the future throughput for each of the division times according to the generated patterns
  • video data is transmitted to the receiving apparatus at such a maximum bit rate
  • the sensory quality of the video to be reproduced from the video data is calculated, and the sensory quality of the video among the plurality of patterns is maximized.
  • Selection of one pattern, and compression processing of the video data at each division time according to the selected one pattern Storing a program for executing the method comprising: determining a coding parameter, to the computer.
  • An object of the present invention is to provide a transmission apparatus that improves the quality of experience of video transmitted through a network.
  • FIG. 5 is a diagram illustrating an example of throughput of communication between a transmitting terminal and a receiving terminal according to the first embodiment and an example of predicted future throughput. It is a figure which shows the example of the encoding parameter for every division time determined according to the prediction value of the future throughput.
  • FIG. 5 is a sequence diagram showing a flow of communication between a transmitting terminal and a receiving terminal of the communication system according to Embodiment 1.
  • FIG. 7 is a block diagram of a transmission terminal according to Embodiment 2. It is a figure which shows the hardware constitutions of the transmission terminal concerning Embodiment 3.
  • FIG. 5 is a sequence diagram showing a flow of communication between a transmitting terminal and a receiving terminal of the communication system according to Embodiment 1.
  • FIG. 7 is a block diagram of a transmission terminal according to Embodiment 2. It is a figure which shows the hardware constitutions of the transmission terminal concerning Embodiment 3.
  • FIG. 5 is a sequence diagram showing a flow of communication between a transmitting terminal and a receiving terminal of the communication system according to Embodiment 1.
  • FIG. 7 is a block diagram of a transmission terminal according to Embodiment 2. It is a figure which shows the hardware constitutions of the transmission terminal concerning Embodiment 3.
  • FIG. 5 is a sequence diagram showing a
  • FIG. 1 is a block diagram of a communication system according to the first embodiment.
  • the communication system includes a transmitting terminal 1 and a receiving terminal 2.
  • the transmitting terminal 1 and the receiving terminal 2 are terminal devices such as a personal computer or a smartphone.
  • the transmitting terminal 1 and the receiving terminal 2 communicate via a network.
  • the network of the communication system according to the first embodiment is the Internet, a mobile network, or the like, and has a heterogeneous structure.
  • the transmitting terminal 1 includes an encoder 101, a transmitting unit 102, a data receiving unit 103, a throughput measuring unit 104, a division pattern generating unit 105, a QoE calculating unit 106, and a parameter determining unit 107. And have.
  • the encoder 101 compresses and encodes a video and outputs video data.
  • the transmitting unit 102 transmits the video data compressed and encoded by the encoder 101 to the receiving terminal 2.
  • the data receiving unit 103 receives data for measuring communication throughput from the receiving terminal 2.
  • the throughput measuring unit 104 measures the throughput based on the data received by the data receiving unit 103.
  • the division pattern generation unit 105 generates a plurality of patterns (described later) each configured by a plurality of division times by dividing a predetermined time.
  • the QoE calculation unit 106 predicts the future throughput based on the throughput measured by the throughput measurement unit 104. In addition, the QoE calculation unit 106 calculates QoE, that is, the quality of experience of the user who views the video, for each of the patterns generated by the divided pattern generation unit 105.
  • the parameter determination unit 107 selects one pattern having the largest QoE calculated by the QoE calculation unit 106 from the plurality of patterns generated by the divided pattern generation unit 105, and based on the selected one pattern, Determine the coding parameters of
  • the receiving terminal 2 includes a receiving unit 201 and a data transmitting unit 202.
  • the receiving unit 201 receives the video data transmitted from the transmitting unit 102 of the transmitting terminal 1.
  • the data transmission unit 202 acquires or generates data necessary for calculating the throughput from the reception unit 201, and transmits the acquired or generated data to the transmission terminal 1.
  • the division pattern generation unit 105 divides a predetermined time T from time 0 to a predetermined time T (> 0) at a maximum division number m (m is a natural number) or less.
  • the predetermined time T and the maximum division number m may be any value. Alternatively, the predetermined time T and the maximum division number m may be set in advance by the user.
  • the divided pattern generation unit 105 has four patterns: ⁇ 0, t1, t2, t3 ⁇ , ⁇ 0, t2, t3 ⁇ , ⁇ 0, t1, t3 ⁇ , Or generate ⁇ 0, t3 ⁇ .
  • 0 ⁇ t1 ⁇ t2 ⁇ t3 T.
  • the total number of patterns is 2 ⁇ (m-1).
  • corresponding elements of different sets R_j of the set R are not necessarily the same.
  • the element T_1 of the set R_1 is t2, but the element T_1 of the set R_2 is t1.
  • the “distance” (on the time axis) between elements adjacent to each other is not necessarily constant.
  • the “distance” between elements is not constant.
  • the “distance” between the element T_i and the element T_ ⁇ i-1 ⁇ of the set R_j will be referred to as a division time (T_i ⁇ T_ ⁇ i-1 ⁇ ).
  • the QoE calculation unit 106 calculates QoE, that is, the quality of sensation for each division time (T_i-T_ ⁇ i-1 ⁇ ).
  • QoE is represented by a real number from 1 to 5. The lower the QoE, the lower the quality of sensation. On the other hand, the higher the QoE, the higher the quality of sensation.
  • the calculation function of QoE follows the standard on streaming quality of audiovisual media (for example, ITU-T P. 1201 created by the International Telecommunication Union / ITU Telecommunication Sector).
  • the calculation function of QoE is represented as Q (b, g).
  • b is a bit rate
  • g is a GOP size. That is, Q (b, g) depends on the bit rate and the GOP size.
  • the QoE calculation unit 106 calculates a function x (t) (t is time) representing the future throughput using the communication throughput prediction method.
  • the communication throughput prediction method is a method of obtaining the expectation value and the variance of the probability density function representing the future throughput by analyzing the information on the past throughput along the time series. Techniques related to communication throughput prediction methods are described, for example, in Non-Patent Document 1 or Non-Patent Document 2. In the present embodiment, the description of the technology related to the communication throughput prediction method is omitted.
  • the QoE calculation unit 106 determines the maximum bit rate from the throughput in the future.
  • the maximum bit rate is the maximum bit rate that does not exceed the future throughput lower limit value represented by the function x (t). Therefore, the maximum bit rate matches the lower limit of the future throughput.
  • a a sensitivity coefficient set in advance by the user.
  • the maximum bit rate decreases with time.
  • FIG. 3 is a flowchart showing the operation of the QoE calculation unit 106.
  • represents the number of elements of the set R.
  • j
  • (YES in S2) the operation of the QoE calculation unit 106 ends.
  • the QoE calculation unit 106 calculates the division time (T_i ⁇ T_ ⁇ i) based on the function x (t) (t is time) representing the predicted throughput value calculated in advance. Calculate the maximum bit rate b_i in -1).
  • represents the number of elements of the set R_j.
  • the QoE calculation unit 106 calculates the GOP size g_i in the division time (T_i-T_ ⁇ i-1 ⁇ ) according to the following calculation formula (S5).
  • g_i f ⁇ (T_i-T_ ⁇ i-1 ⁇ )
  • f is a frame rate.
  • f is a constant.
  • the QoE calculation unit 106 calculates QoE in each division time of the set R_j (S6).
  • QoE in the division time (T_i-T_ ⁇ i-1 ⁇ ) of the set R_j is represented as Q_j_i.
  • the time represented by T_i differs depending on the set R_j. Therefore, it should be noted that Q_j_i takes different values depending on the set R_j.
  • the maximum bit rate and the GOP size are optimally controlled based on the predicted throughput value, it is possible to suppress the decrease in QoE.
  • the parameter determination unit 107 receives the set R generated by the divided pattern generation unit 105, and receives the QoE calculated by the QoE calculation unit 106, that is, Q_j_i.
  • the parameter determination unit 107 weights Q_j_i for each division time (T_i-T_ ⁇ i-1 ⁇ ) of the set R_j by the division time (T_i-T_ ⁇ i-1 ⁇ ).
  • the set R_j that maximizes the sum of values is selected.
  • the parameter determination unit 107 determines the maximum bit rate b_i and the GOP size g_i in each division time (T_i-T_ ⁇ i-1 ⁇ ) of the selected set R_j as a coding parameter for compression encoding of video data. .
  • the encoder 101 receives information on the maximum bit rate and the GOP size from the parameter determination unit 107. Then, the encoder 101 resets the codec, and updates the coding parameters based on the received information of the maximum bit rate and the GOP size.
  • FIG. 4 and FIG. 5 are graphs showing an example of time change of throughput.
  • the horizontal axis t of the graph represents time
  • the vertical axis b of the graph represents throughput.
  • the solid line graph indicates the measured throughput value.
  • all three broken line graphs indicate predicted throughput values.
  • the reproduction of the video may be interrupted at the receiving terminal 2 side. Therefore, the maximum bit rate is limited so as not to exceed the lower limit of throughput.
  • the frame rate f is a constant.
  • , 0 ⁇ j ⁇
  • the sum represented by equation (2) is Q_0_0 + Q_0_1. Also, in the case of R_1, the sum is 2 ⁇ Q_1_0. It can be said that QoE at a predetermined time T is larger as the sum is larger.
  • the parameter determination unit 107 selects the set R_0.
  • FIG. 6 is a graph showing another example of the time change of throughput.
  • the graph shown in FIG. 6 is smaller in fluctuation of the measured throughput than the graph shown in FIG.
  • the variance of predicted throughput values is small.
  • the graph x ′ (t) shown in FIG. 6, that is, the lower limit value of the predicted throughput is gradually reduced as compared with the graph x (t) shown in FIG.
  • the configuration of the first embodiment it is possible to select an optimal combination of GOP size and bit rate so as to maximize the quality of experience (QoE) of the user. This is because QoE is calculated for each of a plurality of patterns that divide the predetermined time, and a pattern that maximizes QoE is selected. Further, according to the configuration of the first embodiment, even if the fluctuation of the throughput is large, it is possible to suppress the decrease in QoE. The reason is that since the maximum bit rate does not exceed the lower limit of the predicted throughput, there is a low possibility that the reproduction of the video will be interrupted.
  • FIG. 7 is a sequence diagram showing a flow of each operation of the transmitting terminal 1 and the receiving terminal 2 when the transmitting terminal 1 and the receiving terminal 2 that make up the communication system communicate.
  • the data transmitting unit 202 of the receiving terminal 2 transmits data related to throughput, in other words, data necessary for calculating the throughput, to the transmitting terminal 1 (S201).
  • the data transmission unit 202 transmits information on the amount of video data received by the reception unit 201 of the reception terminal 2 per unit time and information on loss rate to the transmission terminal 1 as data on throughput.
  • the data reception unit 103 of the transmission terminal 1 receives data on throughput from the reception terminal 2.
  • the throughput measuring unit 104 measures the throughput of communication between the transmitting terminal 1 and the receiving terminal 2 using the data received by the data receiving unit 103 (S101).
  • the QoE calculation unit 106 of the transmission terminal 1 predicts the future throughput based on the throughput measured by the throughput measurement unit 104 (S102).
  • the division pattern generation unit 105 of the transmission terminal 1 generates a pattern composed of a plurality of division times by dividing a predetermined time (for example, from 0 to t3 in FIG. 2) (S103).
  • the QoE calculation unit 106 transmits an image to the receiving terminal 2 at a maximum bit rate that does not exceed the lower limit value of the future throughput predicted above for each division time according to the pattern generated by the division pattern generation unit 105.
  • the sensory quality of the video to be reproduced is calculated from the video data (S104). The details of S104 have been described above with reference to FIG.
  • the parameter determination unit 107 selects one pattern that maximizes the sensation quality calculated by the QoE calculation unit 106 among the plurality of patterns generated by the divided pattern generation unit 105 (S105).
  • the parameter determination unit 107 of the transmission terminal 1 determines a coding parameter for each division time according to one selected pattern (S106).
  • the coding parameter (bit rate b_1) in the division time (T_1-T_0) is selected.
  • the encoder 101 of the transmission terminal 1 compresses and encodes video data using the encoding parameter determined by the parameter determination unit 107 (S107).
  • the transmitting unit 102 of the transmitting terminal 1 transmits the video data compressed and encoded by the encoder 101 to the receiving terminal 2 (S108).
  • the receiving unit 201 of the receiving terminal 2 receives the compression-coded video data from the transmitting terminal 1 (S202). Thereafter, the flow returns to S201.
  • FIG. 8 is a schematic block diagram showing the configuration of the transmission terminal 3 according to the second embodiment.
  • the transmission terminal 3 according to the second embodiment includes a throughput measurement unit 301, a throughput prediction unit 302, a pattern generation unit 303, a sensation quality calculation unit 304, a pattern selection unit 305, and parameters. And a determination unit 306.
  • the throughput measurement unit 301 of the second embodiment corresponds to, for example, the throughput measurement unit 104 of the first embodiment.
  • the throughput prediction unit 302 and the sensation quality calculation unit 304 of the second embodiment correspond to, for example, the QoE calculation unit 106 of the first embodiment.
  • the pattern generation unit 303 of the second embodiment corresponds to, for example, the divided pattern generation unit 105 of the first embodiment.
  • the pattern selection unit 305 and the parameter determination unit 306 of the second embodiment correspond to, for example, the parameter determination unit 107 of the first embodiment.
  • the transmitting terminal 3 according to the second embodiment may further include components corresponding to the encoder 101 and the transmitting unit 102 provided in the transmitting terminal 1 of the first embodiment.
  • the throughput measuring unit 301 measures the throughput of communication with a receiving terminal (not shown).
  • the throughput prediction unit 302 predicts the future throughput based on the throughput measured by the throughput measurement unit 301.
  • the pattern generation unit 303 generates a plurality of patterns each configured of a plurality of division times by dividing a predetermined time.
  • the perceptual quality calculation unit 304 transmits video data to the receiving terminal at the maximum bit rate that does not exceed the lower limit value of the future throughput predicted by the throughput prediction unit 302 every division time according to the pattern generated by the pattern generation unit 303. Is transmitted, the sensory quality of the video reproduced from the video data is calculated.
  • the pattern selection unit 305 selects, from among the plurality of patterns generated by the pattern generation unit 303, one pattern that maximizes the sensation quality of the image calculated by the sensation quality calculation unit 304.
  • the parameter determination unit 306 determines, for each division time corresponding to one pattern selected by the pattern selection unit 305, a coding parameter for compression processing of video data.
  • the quality of experience of the video can be improved.
  • FIG. 9 is a block diagram showing the hardware configuration of the transmission terminal 4 according to the third embodiment.
  • the transmission terminal according to the third embodiment includes a central processing unit (CPU) 401, an encoder 402, and an interface 403.
  • the transmission terminal 4 according to the present embodiment corresponds to, for example, the transmission terminal 1 according to the first embodiment or the transmission terminal 3 according to the second embodiment.
  • the CPU 401 implements the control function of the transmission terminal 4 according to the present embodiment.
  • the encoder 402 corresponds to, for example, the encoder 101 of the first embodiment.
  • the interface 403 corresponds to, for example, the transmission unit 102 and the data reception unit 103 of the first embodiment.
  • the transmission terminal 4 has a control function of the transmission terminal 1 according to the first embodiment (throughput measurement unit 104, division pattern generation unit 105, QoE calculation unit 106, parameter determination unit 107, and so on shown in FIG.
  • a program for realizing all or part of the encoder 101) is recorded in a computer readable recording medium, and the program recorded in the recording medium is read by the CPU 401 and executed to perform processing of each part. It is also good.
  • the “computer system” includes an OS and hardware such as peripheral devices.
  • the transmitting terminal 4 has the control function of the transmitting terminal 3 according to the second embodiment (throughput measuring unit 301, throughput predicting unit 302, pattern generating unit 303, quality of experience calculating unit 304 shown in FIG. , A program for realizing all or part of the pattern selection unit 305 and the parameter determination unit 306) in a computer readable recording medium, and causing the CPU 401 to read the program recorded in the recording medium, and executing the program The processing of each part may be performed by doing this.
  • “computer readable recording medium” means portable media such as a flexible disk, a magneto-optical disk, a ROM (Read Only Memory), a CD (Compact Disc) -ROM, and a hard disk incorporated in a computer system. It refers to a storage device.
  • the program read and executed by the CPU 401 may be for realizing a part of the above-described functions, and the above-described functions may be realized in combination with the program already recorded in the computer system. It may be

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Information Transfer Between Computers (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

スループット計測部(104)は、受信端末(2)との間の通信のスループットを計測し、分割パターン生成部(105)は、所定時間を分割することによって、複数の分割時間でそれぞれ構成される複数のパターンを生成し、QoE算出部(106)は、分割パターン生成部(105)が生成したパターンごとに、未来のスループットの予測値から、映像の体感品質を算出し、パラメータ決定部(107)は、映像の体感品質が最大になる1つのパターンの分割時間ごとに、符号化パラメータを決定する。The throughput measurement unit (104) measures the throughput of communication with the reception terminal (2), and the division pattern generation unit (105) is configured with a plurality of division times by dividing a predetermined time. A plurality of patterns are generated, and the QoE calculation unit (106) calculates the perceived quality of the image from the predicted value of the future throughput for each pattern generated by the divided pattern generation unit (105), and the parameter determination unit (107 ) Determines the coding parameter for each division time of one pattern that maximizes the quality of perceived video.

Description

送信装置、通信システム、通信方法、及び記録媒体Transmission apparatus, communication system, communication method, and recording medium

 本発明は、送信装置、通信システム、通信方法、及び記録媒体に関し、例えば、ネットワークを介して、圧縮した映像データを送信する送信装置に関する。 The present invention relates to a transmitting apparatus, a communication system, a communication method, and a recording medium, and, for example, to a transmitting apparatus that transmits compressed video data via a network.

 LTE(Long Term Evolution)やWi-Fi(登録商標)などのベストエフォート型ネットワークを介した通信のスループットは、大きく変動しうる。そのため、通信のスループットの変動に応じて、映像データを圧縮するための符号化パラメータを動的に変更する技術が必要とされている。 The throughput of communication via a best effort network such as Long Term Evolution (LTE) or Wi-Fi (registered trademark) may vary greatly. Therefore, there is a need for a technique for dynamically changing coding parameters for compressing video data in accordance with fluctuations in communication throughput.

 符号化パラメータには、例えば、ビットレート、フレームレート、およびGOP(Group of picture)サイズ(あるいはキーフレーム間隔)が含まれる。コーデックをリセットすることによって、符号化パラメータを変更することができる。GOPサイズが小さいほど、ビットレートをより高い粒度で制御することができる。 The encoding parameters include, for example, bit rate, frame rate, and GOP (Group of picture) size (or key frame interval). The coding parameters can be changed by resetting the codec. The smaller the GOP size, the more granular the bit rate can be controlled.

 特許文献1に記載の音声合成装置(送信装置)は、ロス率(パケットの損失率)が小さい場合、すなわちスループットが高い場合、映像データのGOPサイズを大きくすることによって、映像の画質の向上を図る。一方、ロス率が大きい場合、すなわちスループットが低い場合、映像データのGOPサイズを小さくする。これにより、特許文献1に記載の音声合成装置は、ビットレートを高い粒度で制御可能である。 The voice synthesizer (transmitting device) described in Patent Document 1 improves the image quality of video by increasing the GOP size of video data when the loss rate (loss rate of packets) is small, that is, when the throughput is high. Plan. On the other hand, when the loss rate is large, that is, when the throughput is low, the GOP size of video data is reduced. Thus, the speech synthesizer described in Patent Document 1 can control the bit rate with high granularity.

特開2005-151600号公報JP 2005-151600 A

「TCPスループットの確率的拡散予測に基づく映像配信制御(Video Streaming Control by Predicting Stochastic Diffusion of TCP Throughput)」 吉田裕志ら インターネットコンファレンス2011(IC2011) (2011年10月27日)"Video Streaming Control by Predictive Stochastic Diffusion of TCP Throughput" Yoshida Yoshida et al. Internet Conference 2011 (IC2011) (Oct. 27, 2011) Constructing Stochastic Model of TCP Throughput on Basis of Stationarity Analysis, Hiroshi Yoshida et al. Global Communications Conference (GLOBECOM), 2013 IEEE, date of conference: 9-13 Dec. 2013Constructing Stochastic Model of TCP Throughput on Basis of Stationarity Analysis, Hiroshi Yoshida et al. Global Communications Conference (GLOBECOM), 2013 IEEE, date of conference: 9-13 Dec. 2013

 しかしながら、特許文献1に記載の音声合成装置(送信装置)による通信方法では、GOPサイズとビットレートの組み合わせがQoE(Quality of Experience)に与える影響を考慮していない。そのため、GOPサイズとビットレートとの組み合わせによって、QoEが著しく低下する可能性がある。 However, in the communication method by the speech synthesizer (transmitting device) described in Patent Document 1, the influence of the combination of the GOP size and the bit rate on QoE (Quality of Experience) is not taken into consideration. Therefore, the QoE may be significantly reduced by the combination of the GOP size and the bit rate.

 例えば、特許文献1に記載の送信装置が、ビットレートを高い粒度で制御するために、GOPサイズを小さくした場合、符号量の制御が破綻して、ブロックに歪み(ブロックノイズ)が生じる可能性がある。ブロックノイズは、QoEに大きく影響する。 For example, if the transmitting apparatus described in Patent Document 1 reduces the GOP size to control the bit rate with high granularity, the control of the code amount may be broken and distortion (block noise) may occur in the block. There is. Block noise greatly affects QoE.

 本発明の目的は、ネットワークを介して伝送される映像の体感品質を向上させることにある。 An object of the present invention is to improve the quality of experience of video transmitted through a network.

 本発明の一様態に係わる送信装置は、上記課題を解決するために、受信装置との間の通信のスループットを計測するスループット計測手段と、前記スループット計測手段が計測した前記スループットに基づいて、未来のスループットを予測するスループット予測手段と、所定時間を分割することによって、複数の分割時間でそれぞれ構成される複数のパターンを生成するパターン生成手段と、前記パターン生成手段が生成した前記パターンに応じた前記分割時間ごとに、前記スループット予測手段が予測した前記未来のスループットの下限値を超えないような最大ビットレートで前記受信端末へ映像データを送信した場合に、該映像データから再生される映像の体感品質を算出する体感品質算出手段と、前記パターン生成手段が生成した前記複数のパターンのうち、前記体感品質算出手段が算出した前記映像の体感品質が最大になる1つのパターンを選択するパターン選択手段と、前記パターン選択手段が選択した前記1つのパターンに応じた前記分割時間ごとに、前記映像データの圧縮処理のための符号化パラメータを決定するパラメータ決定手段と、を有する。 A transmitter according to an aspect of the present invention, in order to solve the above-mentioned problems, a throughput measuring means for measuring a throughput of communication with a receiving device, and a future based on the throughput measured by the throughput measuring means. According to the pattern generated by the pattern generation unit, the pattern generation unit configured to generate a plurality of patterns each configured by a plurality of division times by dividing the predetermined time by dividing the predetermined time. When video data is transmitted to the receiving terminal at a maximum bit rate that does not exceed the lower limit value of the future throughput predicted by the throughput prediction unit for each of the division times, the video data reproduced from the video data A bodily sensation quality calculating means for calculating a bodily sensation quality, and before the pattern generation means generates Among the plurality of patterns, pattern selection means for selecting one pattern that maximizes the sensory quality of the image calculated by the sensory quality calculation means, and the division according to the one pattern selected by the pattern selection means And parameter determining means for determining encoding parameters for the compression process of the video data for each time.

 本発明の一様態に係わる通信方法は、上記課題を解決するために、受信装置との間の通信のスループットを計測し、計測した前記スループットに基づいて、未来のスループットを予測し、所定時間を分割することによって、複数の分割時間でそれぞれ構成される複数のパターンを生成し、生成した前記パターンに応じた前記分割時間ごとに、前記未来のスループットの下限値を超えないような最大ビットレートで前記受信装置へ映像データを送信した場合に、該映像データから再生される映像の体感品質を算出し、前記複数のパターンのうち、前記映像の体感品質が最大になる1つのパターンを選択し、 選択した前記1つのパターンに応じた前記分割時間ごとに、前記映像データの圧縮処理のための符号化パラメータを決定する。 A communication method according to an aspect of the present invention measures the throughput of communication with a receiving device to solve the above problem, predicts the future throughput based on the measured throughput, and determines a predetermined time. By dividing, a plurality of patterns respectively formed of a plurality of division times are generated, and at each maximum division rate which does not exceed the lower limit value of the future throughput for each division time according to the generated patterns When video data is transmitted to the receiving apparatus, the sensory quality of the video to be reproduced is calculated from the video data, and one of the plurality of patterns is selected which makes the sensory quality of the video maximum. An encoding parameter for the compression process of the video data is determined for each of the division times according to the selected one pattern.

 本発明の一様態に係わる記録媒体は、上記課題を解決するために、受信装置との間の通信のスループットを計測することと、計測した前記スループットに基づいて、未来のスループットを予測することと、所定時間を分割することによって、複数の分割時間でそれぞれ構成される複数のパターンを生成することと、生成した前記パターンに応じた前記分割時間ごとに、前記未来のスループットの下限値を超えないような最大ビットレートで前記受信装置へ映像データを送信した場合に、該映像データから再生される映像の体感品質を算出することと、前記複数のパターンのうち、前記映像の体感品質が最大になる1つのパターンを選択することと、選択した前記1つのパターンに応じた前記分割時間ごとに、前記映像データの圧縮処理のための符号化パラメータを決定することと、をコンピュータに実行させるためのプログラムを格納している。 According to an aspect of the present invention, there is provided a recording medium that measures throughput of communication with a receiving device and predicts future throughput based on the measured throughput. Generating a plurality of patterns respectively configured by a plurality of division times by dividing a predetermined time, and not exceeding the lower limit value of the future throughput for each of the division times according to the generated patterns When video data is transmitted to the receiving apparatus at such a maximum bit rate, the sensory quality of the video to be reproduced from the video data is calculated, and the sensory quality of the video among the plurality of patterns is maximized. Selection of one pattern, and compression processing of the video data at each division time according to the selected one pattern. Storing a program for executing the method comprising: determining a coding parameter, to the computer.

 本発明の目的は、ネットワークを介して伝送される映像の体感品質を向上させる送信装置を提供することにある。 An object of the present invention is to provide a transmission apparatus that improves the quality of experience of video transmitted through a network.

実施形態1に係わる通信システムのブロック図であり、通信システムを構成する送信端末および受信端末の各構成を示す図である。It is a block diagram of the communication system concerning Embodiment 1, and is a figure showing each composition of a transmitting terminal and a receiving terminal which constitute a communication system. 所定時間を構成する複数の分割時間の例を示す図である。It is a figure showing an example of a plurality of division time which constitutes predetermined time. 実施形態1に係わる送信端末が実行するQoE(Quality of Experience)算出処理の流れを示すフローチャートである。It is a flowchart which shows the flow of QoE (Quality of Experience) calculation processing which the transmission terminal concerning Embodiment 1 performs. 実施形態1に係わる送信端末と受信端末との間の通信のスループットの例、および、予測される未来のスループットの例を示す図である。FIG. 5 is a diagram illustrating an example of throughput of communication between a transmitting terminal and a receiving terminal according to the first embodiment and an example of predicted future throughput. 未来のスループットの予測値に応じて決定される、分割時間ごとの符号化パラメータの例を示す図である。It is a figure which shows the example of the encoding parameter for every division time determined according to the prediction value of the future throughput. 未来のスループットの予測値に応じて決定される、分割時間ごとの符号化パラメータの他の例を示す図である。It is a figure which shows the other example of the encoding parameter for every division | segmentation time determined according to the prediction value of the future throughput. 実施形態1に係わる通信システムの送信端末と受信端末との間の通信の流れを示すシーケンス図である。FIG. 5 is a sequence diagram showing a flow of communication between a transmitting terminal and a receiving terminal of the communication system according to Embodiment 1. 実施形態2に係わる送信端末のブロック図である。FIG. 7 is a block diagram of a transmission terminal according to Embodiment 2. 実施形態3に係わる送信端末のハードウェア構成を示す図である。It is a figure which shows the hardware constitutions of the transmission terminal concerning Embodiment 3. FIG.

 本発明の実施の形態について、図面を参照して詳細に説明する。
〔実施形態1〕
 (通信システム)
 図1は、本実施形態1に係わる通信システムのブロック図である。図1に示すように、通信システムは、送信端末1および受信端末2を備えている。送信端末1および受信端末2は、それぞれ、パーソナルコンピュータまたはスマートフォンなどの端末装置である。送信端末1および受信端末2は、ネットワークを介して通信する。本実施形態1に係わる通信システムのネットワークは、インターネットやモバイルネットワークなどであり、ヘテロジニアス(Heterogeneous)な構造を有する。
Embodiments of the present invention will be described in detail with reference to the drawings.
Embodiment 1
(Communications system)
FIG. 1 is a block diagram of a communication system according to the first embodiment. As shown in FIG. 1, the communication system includes a transmitting terminal 1 and a receiving terminal 2. The transmitting terminal 1 and the receiving terminal 2 are terminal devices such as a personal computer or a smartphone. The transmitting terminal 1 and the receiving terminal 2 communicate via a network. The network of the communication system according to the first embodiment is the Internet, a mobile network, or the like, and has a heterogeneous structure.

 (送信端末1)
 図1に示すように、送信端末1は、エンコーダ101と、送信部102と、データ受信部103と、スループット計測部104と、分割パターン生成部105と、QoE算出部106と、パラメータ決定部107とを備えている。
(Transmission terminal 1)
As shown in FIG. 1, the transmitting terminal 1 includes an encoder 101, a transmitting unit 102, a data receiving unit 103, a throughput measuring unit 104, a division pattern generating unit 105, a QoE calculating unit 106, and a parameter determining unit 107. And have.

 エンコーダ101は、映像を圧縮符号化して、映像データを出力する。送信部102は、エンコーダ101によって圧縮符号化された映像データを受信端末2に送信する。データ受信部103は、通信のスループットを計測するためのデータを受信端末2から受信する。スループット計測部104は、データ受信部103が受信したデータに基づいて、スループットを計測する。 The encoder 101 compresses and encodes a video and outputs video data. The transmitting unit 102 transmits the video data compressed and encoded by the encoder 101 to the receiving terminal 2. The data receiving unit 103 receives data for measuring communication throughput from the receiving terminal 2. The throughput measuring unit 104 measures the throughput based on the data received by the data receiving unit 103.

 分割パターン生成部105は、所定時間を分割することによって、複数の分割時間でそれぞれ構成される複数のパターン(後述)を生成する。 The division pattern generation unit 105 generates a plurality of patterns (described later) each configured by a plurality of division times by dividing a predetermined time.

 QoE算出部106は、スループット計測部104が計測したスループットに基づいて、未来のスループットを予測する。加えて、QoE算出部106は、分割パターン生成部105が生成したパターンごとに、QoE、すなわち映像を視聴するユーザの体感品質を算出する。 The QoE calculation unit 106 predicts the future throughput based on the throughput measured by the throughput measurement unit 104. In addition, the QoE calculation unit 106 calculates QoE, that is, the quality of experience of the user who views the video, for each of the patterns generated by the divided pattern generation unit 105.

 パラメータ決定部107は、分割パターン生成部105が生成した複数のパターンの中から、QoE算出部106が算出したQoEが最大となる1つのパターンを選択し、選択した1つのパターンに基づいて、映像の符号化パラメータを決定する。 The parameter determination unit 107 selects one pattern having the largest QoE calculated by the QoE calculation unit 106 from the plurality of patterns generated by the divided pattern generation unit 105, and based on the selected one pattern, Determine the coding parameters of

 分割パターン生成部105、QoE算出部106、およびパラメータ決定部107がそれぞれ実行する処理の詳細を後で順番に説明する。 The details of processing executed by the divided pattern generation unit 105, the QoE calculation unit 106, and the parameter determination unit 107 will be described later in order.

 (受信端末2)
 図1に示すように、受信端末2は、受信部201と、データ送信部202とを備えている。受信部201は、送信端末1の送信部102から送信された映像データを受信する。
(Receiver 2)
As shown in FIG. 1, the receiving terminal 2 includes a receiving unit 201 and a data transmitting unit 202. The receiving unit 201 receives the video data transmitted from the transmitting unit 102 of the transmitting terminal 1.

 データ送信部202は、受信部201から、スループットを算出するために必要なデータを取得あるいは生成して、取得あるいは生成したデータを送信端末1に送信する。
(分割パターン生成部105)
 送信端末1の分割パターン生成部105の動作を説明する。分割パターン生成部105は、時刻0から所定の時刻T(>0)までの所定時間Tを、最大分割数m(mは自然数)以下で分割する。所定時間Tおよび最大分割数mは、任意の値であってよい。あるいは、所定時間Tおよび最大分割数mは、ユーザが予め設定することが可能であってもよい。
The data transmission unit 202 acquires or generates data necessary for calculating the throughput from the reception unit 201, and transmits the acquired or generated data to the transmission terminal 1.
(Division pattern generation unit 105)
The operation of the division pattern generation unit 105 of the transmission terminal 1 will be described. The division pattern generation unit 105 divides a predetermined time T from time 0 to a predetermined time T (> 0) at a maximum division number m (m is a natural number) or less. The predetermined time T and the maximum division number m may be any value. Alternatively, the predetermined time T and the maximum division number m may be set in advance by the user.

 図2は、最大分割数m=3である場合に、所定時間T=t3から生成される複数のパターンの例を示す。図2に示すように、この場合、分割パターン生成部105は、4通りのパターン、すなわち、{0,t1,t2,t3}、{0,t2,t3}、{0,t1,t3}、または{0,t3}を生成する。ここで、0<t1<t2<t3=Tである。一般的に、最大分割数がmである場合、パターンの総数は2^(m-1)である。 FIG. 2 shows an example of a plurality of patterns generated from the predetermined time T = t3 when the maximum division number m = 3. As shown in FIG. 2, in this case, the divided pattern generation unit 105 has four patterns: {0, t1, t2, t3}, {0, t2, t3}, {0, t1, t3}, Or generate {0, t3}. Here, 0 <t1 <t2 <t3 = T. In general, if the maximum division number is m, the total number of patterns is 2 ^ (m-1).

 所定時間Tのパターンを、以下では、集合R_j(ただし0<=j<2^(m-1))と呼ぶ。また、集合R_jの集合を、集合Rと呼ぶ。各集合R_jの要素は、分割時刻T_i(i=0、1、2、・・・)である。集合R_jごとに、要素すなわち分割時刻T_iの数は異なる。 The pattern of the predetermined time T is hereinafter referred to as a set R_j (where 0 <= j <2 ^ (m-1)). Also, a set of the set R_j is called a set R. The elements of each set R_j are division times T_i (i = 0, 1, 2,...). For each set R_j, the number of elements, ie, division times T_i is different.

 図2に示すように、最大分割数m=3である場合、集合Rは、分割時刻T_i(0<=i<=3)を用いて、R={R_0={T_0,T_1,T_2,T_3},R_1={T_0,T_1,T_2},R_2={T_0,T_1,T_2},R_3={T_0,T_1}}と表わされる。 As shown in FIG. 2, when the maximum division number m = 3, the set R uses R = {R_0 = {T_0, T_1, T_2, T_3, using division time T_i (0 <= i <= 3). }, R_1 = {T_0, T_1, T_2}, R_2 = {T_0, T_1, T_2}, R_3 = {T_0, T_1}}.

 ここで、集合Rの異なる集合R_jの対応する要素同士は、必ずしも同一ではない。例えば、図2を参照すれば、集合R_1の要素T_1はt2であるが、集合R_2の要素T_1はt1である。また、互いに隣接する要素間の(時間軸上における)“距離”は、必ずしも一定ではない。 Here, corresponding elements of different sets R_j of the set R are not necessarily the same. For example, referring to FIG. 2, the element T_1 of the set R_1 is t2, but the element T_1 of the set R_2 is t1. Also, the “distance” (on the time axis) between elements adjacent to each other is not necessarily constant.

 例えば、図2を参照すれば、集合R_0では、要素間の“距離”は一定(ただし、t1=t2-t1=t3-t2である場合)である。一方、集合R_2では、要素間の“距離”は一定ではない。以下では、集合R_jの要素T_iと要素T_{i-1}との間の“距離”を、分割時間(T_i-T_{i-1})と呼ぶ。 For example, referring to FIG. 2, in the set R_0, the "distance" between elements is constant (where t1 = t2-t1 = t3-t2). On the other hand, in the set R_2, the "distance" between elements is not constant. Hereinafter, the “distance” between the element T_i and the element T_ {i-1} of the set R_j will be referred to as a division time (T_i−T_ {i-1}).

 (QoE算出部106)
 QoE算出部106は、分割時間(T_i-T_{i-1})ごとに、QoEすなわち体感品質を算出する。通常、QoEは、1から5までの実数で表される。QoEが低いほど、体感品質が低い。一方、QoEが高いほど、体感品質が高い。
(QoE calculation unit 106)
The QoE calculation unit 106 calculates QoE, that is, the quality of sensation for each division time (T_i-T_ {i-1}). Usually, QoE is represented by a real number from 1 to 5. The lower the QoE, the lower the quality of sensation. On the other hand, the higher the QoE, the higher the quality of sensation.

 QoEの算出関数は、オーディオビジュアルメディアのストリーミング品質に関する規格(例えばInternational Telecommunication Union/ITU Telecommunication Sectorが作成したITU-T P.1201)にしたがう。本実施形態1では、QoEの算出関数を、Q(b,g)と表す。ここで、bはビットレートであり、gはGOPサイズである。すなわち、Q(b,g)は、ビットレートおよびGOPサイズに依存する。 The calculation function of QoE follows the standard on streaming quality of audiovisual media (for example, ITU-T P. 1201 created by the International Telecommunication Union / ITU Telecommunication Sector). In the first embodiment, the calculation function of QoE is represented as Q (b, g). Here, b is a bit rate, and g is a GOP size. That is, Q (b, g) depends on the bit rate and the GOP size.

 本実施形態では、QoE算出部106は、通信スループット予測法を用いて、未来のスループットを表す関数x(t)(tは時刻)を算出する。通信スループット予測法とは、過去のスループットに関する情報を、時系列に沿って解析することによって、未来のスループットを表す確率密度関数の期待値および分散を得る方法である。通信スループット予測法に関する技術は、例えば、非特許文献1または非特許文献2に記載されている。本実施形態では、通信スループット予測法に関する技術の説明を省略する。 In the present embodiment, the QoE calculation unit 106 calculates a function x (t) (t is time) representing the future throughput using the communication throughput prediction method. The communication throughput prediction method is a method of obtaining the expectation value and the variance of the probability density function representing the future throughput by analyzing the information on the past throughput along the time series. Techniques related to communication throughput prediction methods are described, for example, in Non-Patent Document 1 or Non-Patent Document 2. In the present embodiment, the description of the technology related to the communication throughput prediction method is omitted.

 また、QoE算出部106は、未来のスループットから、最大ビットレートを決定する。最大ビットレートとは、関数x(t)で表される未来のスループットの下限値を超えないビットレートの最大値である。したがって、最大ビットレートは、未来のスループットの下限値と一致する。 Also, the QoE calculation unit 106 determines the maximum bit rate from the throughput in the future. The maximum bit rate is the maximum bit rate that does not exceed the future throughput lower limit value represented by the function x (t). Therefore, the maximum bit rate matches the lower limit of the future throughput.

 あるいは、QoE算出部106は、通信スループット予測法を用いずに、未来のスループットの下限値を算出してもよい。例えば、QoE算出部106は、スループットの計測値に基づいて、未来のスループットを予測することができる。この場合、QoE算出部106は、分割時間(T_i-T_{i-1})における最大ビットレートb_iを、以下の計算式にしたがって算出してもよい。
b_i=a^i×x_0
 ここで、aは、ユーザが予め設定した感度係数である。また、x_0は、現在時刻T_0=0におけるスループットの計測値である。一般的に、最大ビットレートは、時間が経過するとともに低下する。
Alternatively, the QoE calculation unit 106 may calculate the lower limit value of the future throughput without using the communication throughput prediction method. For example, the QoE calculation unit 106 can predict the future throughput based on the measurement value of the throughput. In this case, the QoE calculation unit 106 may calculate the maximum bit rate b_i in the division time (T_i-T_ {i-1}) according to the following calculation formula.
b_i = a ^ i × x_0
Here, a is a sensitivity coefficient set in advance by the user. Further, x_0 is a measurement value of throughput at current time T_0 = 0. In general, the maximum bit rate decreases with time.

 送信端末1から受信端末2へ伝送される映像データのビットレートが未来のスループットの下限値を超えない限り、受信端末2側で映像データの再生が途絶する可能性は十分に低い。
(QoEの算出方法)
 図3を参照して、本実施形態1に係わるQoE算出部106が、QoEを算出する方法の流れを説明する。図3は、QoE算出部106の動作を示すフローチャートである。
As long as the bit rate of the video data transmitted from the transmitting terminal 1 to the receiving terminal 2 does not exceed the lower limit value of the throughput in the future, the possibility that the reproduction of the video data on the receiving terminal 2 side is low is sufficiently low.
(How to calculate QoE)
A flow of a method of calculating QoE by the QoE calculating unit 106 according to the first embodiment will be described with reference to FIG. FIG. 3 is a flowchart showing the operation of the QoE calculation unit 106.

 図3に示すように、QoE算出部106は、まず、分割パターン生成部105から、集合Rを受け取る。そして、QoE算出部106は、集合Rから、一つの集合R_jを取り出す(S1)。最初、j=0である。 As shown in FIG. 3, the QoE calculation unit 106 first receives the set R from the divided pattern generation unit 105. Then, the QoE calculation unit 106 extracts one set R_j from the set R (S1). Initially j = 0.

 j<|R|の場合(S2でNO)、QoE算出部106は、R_jの要素(分割時刻)T_iを取り出し、iを初期化(すなわちi=0)する(S3)。ここで、|R|は、集合Rの要素数を表す。j=|R|の場合(S2でYES)、QoE算出部106の動作は終了する。 If j <| R | (NO at S2), the QoE calculation unit 106 takes out the element (division time) T_i of R_j and initializes i (that is, i = 0) (S3). Here, | R | represents the number of elements of the set R. When j = | R | (YES in S2), the operation of the QoE calculation unit 106 ends.

 i<|R_j|の場合(S4でNO)、QoE算出部106は、予め算出したスループットの予測値を表す関数x(t)(tは時刻)に基づいて、分割時間(T_i-T_{i-1})における最大ビットレートb_iを算出する。ここで、|R_j|は、集合R_jの要素数を表す。 In the case of i <| R_j | (NO in S4), the QoE calculation unit 106 calculates the division time (T_i−T_ {i) based on the function x (t) (t is time) representing the predicted throughput value calculated in advance. Calculate the maximum bit rate b_i in -1). Here, | R_j | represents the number of elements of the set R_j.

 QoE算出部106は、以下の計算式にしたがって、分割時間(T_i-T_{i-1})におけるGOPサイズg_iを算出する(S5)。
g_i=f×(T_i-T_{i-1})
ここで、fはフレームレートである。本実施形態1では、フレームレートは一定であるので、fは定数である。
The QoE calculation unit 106 calculates the GOP size g_i in the division time (T_i-T_ {i-1}) according to the following calculation formula (S5).
g_i = f × (T_i-T_ {i-1})
Here, f is a frame rate. In the first embodiment, since the frame rate is constant, f is a constant.

 次に、QoE算出部106は、集合R_jの各分割時間におけるQoEを算出する(S6)。以下では、集合R_jの分割時間(T_i-T_{i-1})におけるQoEを、Q_j_iと表す。Q_j_iは、以下のように表される。
Q_j_i=Q(b_i,g_i)=Q(b_i,f×(T_i-T_{i-1}))
 なお、前述したように、集合R_jによって、T_iが表わす時刻は異なる。したがって、集合R_jによって、Q_j_iは異なる値を取ることを注意する。
Next, the QoE calculation unit 106 calculates QoE in each division time of the set R_j (S6). Hereinafter, QoE in the division time (T_i-T_ {i-1}) of the set R_j is represented as Q_j_i. Q_j_i is expressed as follows.
Q_j_i = Q (b_i, g_i) = Q (b_i, f × (T_i-T_ {i-1}))
As described above, the time represented by T_i differs depending on the set R_j. Therefore, it should be noted that Q_j_i takes different values depending on the set R_j.

 S6の後、iに1が加算されて(S7)、S4に戻る。i=|R_j|の場合(S4でYES)、jに1が加算されて(S8)、S2に戻る。 After S6, 1 is added to i (S7), and the process returns to S4. If i = | R_j | (YES at S4), 1 is added to j (S8), and the process returns to S2.

 本実施形態1の構成によれば、スループットの予測値に基づいて、最大ビットレートおよびGOPサイズを最適に制御するので、QoEが低下することを抑制することができる。 According to the configuration of the first embodiment, since the maximum bit rate and the GOP size are optimally controlled based on the predicted throughput value, it is possible to suppress the decrease in QoE.

 (パラメータ決定部107)
 パラメータ決定部107は、分割パターン生成部105が生成した集合Rを受け取るとともに、QoE算出部106が算出したQoE、すなわちQ_j_iを受け取る。
(Parameter determination unit 107)
The parameter determination unit 107 receives the set R generated by the divided pattern generation unit 105, and receives the QoE calculated by the QoE calculation unit 106, that is, Q_j_i.

 パラメータ決定部107は、集合Rの全ての集合R_j(0<=j<|R|)の中から、以下の計算式(1)にしたがって、いずれか1つの集合R_jを選択する。 The parameter determination unit 107 selects any one set R_j from all the sets R_j (0 <= j <| R |) of the set R according to the following calculation formula (1).

Figure JPOXMLDOC01-appb-I000002
Figure JPOXMLDOC01-appb-I000002

 式(1)によれば、パラメータ決定部107は、集合R_jの分割時間(T_i-T_{i-1})ごとのQ_j_iを該分割時間(T_i-T_{i-1})で重みづけした値の総和が最大になる集合R_jを選択する。 According to Equation (1), the parameter determination unit 107 weights Q_j_i for each division time (T_i-T_ {i-1}) of the set R_j by the division time (T_i-T_ {i-1}). The set R_j that maximizes the sum of values is selected.

 パラメータ決定部107は、選択した集合R_jの各分割時間(T_i-T_{i-1})における最大ビットレートb_iおよびGOPサイズg_iを、映像データの圧縮符号化のための符号化パラメータとして決定する。 The parameter determination unit 107 determines the maximum bit rate b_i and the GOP size g_i in each division time (T_i-T_ {i-1}) of the selected set R_j as a coding parameter for compression encoding of video data. .

 (エンコーダ101)
 エンコーダ101は、パラメータ決定部107から、最大ビットレートおよびGOPサイズの情報を受け取る。そして、エンコーダ101は、コーデックをリセットして、受け取った最大ビットレートおよびGOPサイズの情報に基づいて、符号化パラメータを更新する。
(Encoder 101)
The encoder 101 receives information on the maximum bit rate and the GOP size from the parameter determination unit 107. Then, the encoder 101 resets the codec, and updates the coding parameters based on the received information of the maximum bit rate and the GOP size.

 (符号化パラメータを決定する方法)
 図4および図5を参照して、本実施形態1に係わる送信端末1が符号化パラメータを決定する方法の流れを説明する。図4および図5は、スループットの時間変化の一例を示すグラフである。グラフの横軸tは時刻を表し、グラフの縦軸bはスループットを表す。図4において、実線のグラフは、スループットの計測値を示している。また、3つの破線のグラフは、いずれも、スループットの予測値を示している。
(Method of determining coding parameters)
The flow of a method in which the transmitting terminal 1 according to the first embodiment determines the coding parameters will be described with reference to FIGS. 4 and 5. FIG. 4 and FIG. 5 are graphs showing an example of time change of throughput. The horizontal axis t of the graph represents time, and the vertical axis b of the graph represents throughput. In FIG. 4, the solid line graph indicates the measured throughput value. Also, all three broken line graphs indicate predicted throughput values.

 図4において、3つの破線のグラフは、それぞれ、スループットの“期待値”(図4ではy(t))、“期待値+(プラス)分散”(図4ではz(t))、および“期待値-(マイナス)分散”(図4ではx(t))に対応している。これらの破線のグラフのうち、“期待値-分散” x(t)は、予測されるスループットの下限値を表す。 In FIG. 4, three broken line graphs respectively indicate “expected value” (y (t) in FIG. 4), “expected value + (plus) dispersion” (z (t)) in FIG. It corresponds to the expected value-(minus) variance "(x (t) in FIG. 4). Of these broken line graphs, "expected value-variance" x (t) represents the lower limit value of the predicted throughput.

 最大ビットレートがスループットの下限値を上回った場合、受信端末2側において、映像の再生が途絶する可能性がある。したがって、最大ビットレートは、スループットの下限値を超えないように制限される。 When the maximum bit rate exceeds the lower limit value of the throughput, the reproduction of the video may be interrupted at the receiving terminal 2 side. Therefore, the maximum bit rate is limited so as not to exceed the lower limit of throughput.

 所定時間T=2[sec]であり、最大分割数m=2であるとする。図5では、T_0=0[sec]~T_2=2[sec]における最大ビットレートb_0~b_2は、それぞれ、b_0=2[Mbps]、b_1=1.7[Mbps]、b_2=1.3[Mbps]である。 It is assumed that the predetermined time T = 2 [sec] and the maximum division number m = 2. In FIG. 5, the maximum bit rates b_0 to b_2 for T_0 = 0 [sec] to T_2 = 2 [sec] are b_0 = 2 [Mbps], b_1 = 1.7 [Mbps], and b_2 = 1.3 [b], respectively. Mbps].

 この場合、分割パターン生成部105は、2つのパターンR_0={T_0,T_1,T_2}およびR_1={T_0,T_2}}を生成する。図5では、T_0=0、T_1=1、T_2=2である。 In this case, the division pattern generation unit 105 generates two patterns R_0 = {T_0, T_1, T_2} and R_1 = {T_0, T_2}}. In FIG. 5, T_0 = 0, T_1 = 1, and T_2 = 2.

 QoE算出部106は、集合R_0の各分割時間についてのQoE、すなわちQ_0_0=Q(b_1,f×T_1)およびQ_0_1=Q(b_2,f×(T_2-T_1))を算出する。また、集合R_1についてのQoE、すなわちQ_1_0=Q(b_2,f×T_2)を算出する。ここで、フレームレートfは定数である。 The QoE calculation unit 106 calculates QoE for each division time of the set R_0, that is, Q_0_0 = Q (b_1, f × T_1) and Q_0_1 = Q (b_2, f × (T_2−T_1)). Also, QoE for the set R_1, that is, Q_1_0 = Q (b_2, f × T_2) is calculated. Here, the frame rate f is a constant.

 その後、パラメータ決定部107は、下記計算式(2)に基づいて、分割時間(T_i-T_{i-1})で重み付けしたQ_j_i(0<=i<|R_j|,0<=j<|R|)の総和を算出する。 After that, the parameter determination unit 107 calculates Q_j_i (0 <= i <| R_j |, 0 <= j <|) weighted by the division time (T_i−T_ {i−1}) based on the following formula (2). The sum of R |) is calculated.

Figure JPOXMLDOC01-appb-I000003
Figure JPOXMLDOC01-appb-I000003

 R_0の場合、式(2)で表される総和は、Q_0_0+Q_0_1である。また、R_1の場合、総和は、2×Q_1_0である。総和が大きいほど、所定時間TにおけるQoEは大きいと言える。 In the case of R_0, the sum represented by equation (2) is Q_0_0 + Q_0_1. Also, in the case of R_1, the sum is 2 × Q_1_0. It can be said that QoE at a predetermined time T is larger as the sum is larger.

 例えば、Q_0_0=3.5、Q_0_1=3.2、Q_1_0=3.3であるとする。この場合、R_0についての総和は、Q_0_0+Q_0_1=3.5+3.2=6.7である。また、R_1についての総和は、2×Q_1_0=2×3.3=6.6である。 For example, it is assumed that Q_0_0 = 3.5, Q_0_1 = 3.2, and Q_1_0 = 3.3. In this case, the sum for R_0 is Q_0_0 + Q_0_1 = 3.5 + 3.2 = 6.7. Also, the sum for R_1 is 2 × Q_1_0 = 2 × 3.3 = 6.6.

 したがって、R_0についての総和のほうが、R_1についての総和よりも大きい。そのため、パラメータ決定部107は、集合R_0を選択する。 Thus, the sum for R_0 is greater than the sum for R_1. Therefore, the parameter determination unit 107 selects the set R_0.

 また、パラメータ決定部107は、選択した集合R_0の分割時間(T_1-T_0)における最大ビットレートb_1=1.7[Mbps]、および、分割時間(T_2-T_1)における最大ビットレートb_2=1.3[Mbps]を、エンコーダ101に通知する。 Also, the parameter determination unit 107 determines that the maximum bit rate b_1 = 1.7 [Mbps] in the division time (T_1-T_0) of the selected set R_0 and the maximum bit rate b_2 = 1 in the division time (T_2-T_1). The encoder 101 is notified of 3 [Mbps].

 また、パラメータ決定部107は、符号化パラメータとして、分割時間(T_1-T_0)におけるGOPサイズg=f×(T_1-T_0)、および、分割時間(T_2-T_1)におけるGOPサイズg=f×(T_2-T_1)も、エンコーダ101に通知する。 Further, parameter determination section 107 sets, as coding parameters, GOP size g = f × (T_1−T_0) in division time (T_1−T_0) and GOP size g = f × (division time (T_2−T_1)). T_2-T_1) also notifies the encoder 101.

 他の例では、Q_0_0=3.83、Q_0_1=3.72、Q_1_0=3.81であるとする。この場合、R_0についての総和(式(2)参照)は、Q_0_0+Q_0_1=3.83+3.72=7.55である。一方、R_1についての総和は、2×Q_1_0=7.62である。 In another example, it is assumed that Q_0_0 = 3.83, Q_0_1 = 3.72, and Q_1_0 = 3.81. In this case, the sum (see equation (2)) for R_0 is Q_0_0 + Q_0_1 = 3.83 + 3.72 = 7.55. On the other hand, the sum of R_1 is 2 × Q_1_0 = 7.62.

 したがって、R_1についての総和のほうが、R_0についての総和よりも大きい。そのため、パラメータ決定部107は、集合R_1を選択する。また、パラメータ決定部107は、集合R_1の分割時間(T_2-T_0)における最大ビットレートb_2=1.3[Mbps]を、エンコーダ101に通知する。 Thus, the sum for R_1 is greater than the sum for R_0. Therefore, the parameter determination unit 107 selects the set R_1. Also, the parameter determination unit 107 notifies the encoder 101 of the maximum bit rate b_2 = 1.3 [Mbps] in the division time (T_2−T_0) of the set R_1.

 また、パラメータ決定部107は、符号化パラメータとして、分割時間(T_2-T_0)におけるGOPサイズg=f×(T_2-T_0)も、エンコーダ101に通知する。 The parameter determination unit 107 also notifies the encoder 101 of the GOP size g = f × (T_2−T_0) in the division time (T_2−T_0) as the coding parameter.

 一般的に、スループットの計測値の揺らぎが大きい(小さい)ほど、未来のスループットの予測値の分散も大きく(小さく)なる。図6は、スループットの時間変化の他の一例を示すグラフである。図6に示すグラフは、図5に示すグラフと比較して、スループットの計測値の揺らぎが小さい。その結果、スループットの予測値の分散が小さい。図6に示すグラフx´(t)、すなわち予測されるスループットの下限値は、図5に示すグラフx(t)と比較して、緩やかに減少している。実際、図5に示すグラフx(t)では、x(t=1)=1.7、x(t=2)=1.3である一方、図6に示すグラフx´(t)では、x´(t=1)=1.8、x´(t=2)=1.7である。 Generally, the larger (smaller) the fluctuation of the throughput measurement value, the larger (smaller) the variance of the future throughput prediction value. FIG. 6 is a graph showing another example of the time change of throughput. The graph shown in FIG. 6 is smaller in fluctuation of the measured throughput than the graph shown in FIG. As a result, the variance of predicted throughput values is small. The graph x ′ (t) shown in FIG. 6, that is, the lower limit value of the predicted throughput, is gradually reduced as compared with the graph x (t) shown in FIG. In fact, in the graph x (t) shown in FIG. 5, x (t = 1) = 1.7 and x (t = 2) = 1.3, while in the graph x ′ (t) shown in FIG. It is x '(t = 1) = 1.8 and x' (t = 2) = 1.7.

 本実施形態1の構成によれば、ユーザの体感品質(QoE)を最大化するように、GOPサイズおよびビットレートの最適な組み合わせを選択することができる。なぜならば、所定時間を分割する複数のパターンについて、それぞれQoEを算出して、QoEが最大になるパターンを選択するからである。また、本実施形態1の構成によれば、スループットの変動が大きい場合であっても、QoEが低下することを抑制することができる。なぜならば、最大ビットレートは、予測されるスループットの下限値を超えないので、映像の再生が途絶する可能性が低いからである。 According to the configuration of the first embodiment, it is possible to select an optimal combination of GOP size and bit rate so as to maximize the quality of experience (QoE) of the user. This is because QoE is calculated for each of a plurality of patterns that divide the predetermined time, and a pattern that maximizes QoE is selected. Further, according to the configuration of the first embodiment, even if the fluctuation of the throughput is large, it is possible to suppress the decrease in QoE. The reason is that since the maximum bit rate does not exceed the lower limit of the predicted throughput, there is a low possibility that the reproduction of the video will be interrupted.

 (通信方法)
 図7を参照して、本実施形態1に係わる通信システムによる通信方法を説明する。図7は、通信システムを構成する送信端末1と受信端末2とが通信を行う際の送信端末1および受信端末2の各動作の流れを示すシーケンス図である。
(Communication method)
A communication method by the communication system according to the first embodiment will be described with reference to FIG. FIG. 7 is a sequence diagram showing a flow of each operation of the transmitting terminal 1 and the receiving terminal 2 when the transmitting terminal 1 and the receiving terminal 2 that make up the communication system communicate.

 図7に示すように、受信端末2のデータ送信部202が、スループットに関するデータ、言い換えれば、スループットを算出するために必要なデータを、送信端末1へ送信する(S201)。 As shown in FIG. 7, the data transmitting unit 202 of the receiving terminal 2 transmits data related to throughput, in other words, data necessary for calculating the throughput, to the transmitting terminal 1 (S201).

 例えば、データ送信部202は、受信端末2の受信部201が単位時間に受信する映像データの量の情報、および、ロス率の情報を、スループットに関するデータとして、送信端末1へ送信する。 For example, the data transmission unit 202 transmits information on the amount of video data received by the reception unit 201 of the reception terminal 2 per unit time and information on loss rate to the transmission terminal 1 as data on throughput.

 送信端末1のデータ受信部103は、受信端末2から、スループットに関するデータを受信する。スループット計測部104は、データ受信部103が受信したデータを用いて、送信端末1と受信端末2との間の通信のスループットを計測する(S101)。 The data reception unit 103 of the transmission terminal 1 receives data on throughput from the reception terminal 2. The throughput measuring unit 104 measures the throughput of communication between the transmitting terminal 1 and the receiving terminal 2 using the data received by the data receiving unit 103 (S101).

 次に、送信端末1のQoE算出部106は、スループット計測部104が計測したスループットに基づいて、未来のスループットを予測する(S102)。 Next, the QoE calculation unit 106 of the transmission terminal 1 predicts the future throughput based on the throughput measured by the throughput measurement unit 104 (S102).

 また、送信端末1の分割パターン生成部105は、所定時間(例えば、図2では0からt3まで)を分割することによって、複数の分割時間で構成されるパターンを生成する(S103)。 Also, the division pattern generation unit 105 of the transmission terminal 1 generates a pattern composed of a plurality of division times by dividing a predetermined time (for example, from 0 to t3 in FIG. 2) (S103).

 次に、QoE算出部106は、分割パターン生成部105が生成したパターンに応じた分割時間ごとに、先に予測した未来のスループットの下限値を超えないような最大ビットレートで受信端末2へ映像データを送信した場合に、該映像データから再生される映像の体感品質を算出する(S104)。なお、S104の詳細については、先に、図3に基づいて説明した。 Next, the QoE calculation unit 106 transmits an image to the receiving terminal 2 at a maximum bit rate that does not exceed the lower limit value of the future throughput predicted above for each division time according to the pattern generated by the division pattern generation unit 105. When data is transmitted, the sensory quality of the video to be reproduced is calculated from the video data (S104). The details of S104 have been described above with reference to FIG.

 その後、パラメータ決定部107は、分割パターン生成部105が生成した複数のパターンのうち、QoE算出部106が算出する体感品質が最大になるような1つのパターンを選択する(S105)。 After that, the parameter determination unit 107 selects one pattern that maximizes the sensation quality calculated by the QoE calculation unit 106 among the plurality of patterns generated by the divided pattern generation unit 105 (S105).

 続いて、送信端末1のパラメータ決定部107は、選択した1つのパターンに応じた分割時間ごとに、符号化パラメータを決定する(S106)。 Subsequently, the parameter determination unit 107 of the transmission terminal 1 determines a coding parameter for each division time according to one selected pattern (S106).

 例えば、図4に示す例において、パラメータ決定部107は、前記1つのパターンとして集合R_0={T_0,T_1,T_2}を選択した場合、分割時間(T_1-T_0)における符号化パラメータ(ビットレートb_1およびGOPサイズg=f×T_1)、および、分割時間(T_2-T_1)における符号化パラメータ(ビットレートb_2およびGOPサイズg=f×T_2)をそれぞれ決定する。 For example, in the example illustrated in FIG. 4, when the parameter determination unit 107 selects the set R_0 = {T_0, T_1, T_2} as the one pattern, the coding parameter (bit rate b_1) in the division time (T_1-T_0) is selected. And GOP size g = f × T_1) and coding parameters (bit rate b_2 and GOP size g = f × T_2) in division time (T_2−T_1) are determined, respectively.

 送信端末1のエンコーダ101は、パラメータ決定部107が決定した符号化パラメータを用いて、映像データを圧縮符号化する(S107)。 The encoder 101 of the transmission terminal 1 compresses and encodes video data using the encoding parameter determined by the parameter determination unit 107 (S107).

 送信端末1の送信部102は、エンコーダ101が圧縮符号化した映像データを、受信端末2へ送信する(S108)。 The transmitting unit 102 of the transmitting terminal 1 transmits the video data compressed and encoded by the encoder 101 to the receiving terminal 2 (S108).

 受信端末2の受信部201は、送信端末1から、圧縮符号化された映像データを受信する(S202)。その後、フローはS201へ戻る。 The receiving unit 201 of the receiving terminal 2 receives the compression-coded video data from the transmitting terminal 1 (S202). Thereafter, the flow returns to S201.

 〔実施形態2〕
 本実施形態2として、本発明の最小構成について説明する。
Second Embodiment
The minimum configuration of the present invention will be described as the second embodiment.

 図8は、本実施形態2に係わる送信端末3の構成を示す概略構成図である。図8に示すように、本実施形態2に係わる送信端末3は、スループット計測部301と、スループット予測部302と、パターン生成部303と、体感品質算出部304と、パターン選択部305と、パラメータ決定部306とを備える。 FIG. 8 is a schematic block diagram showing the configuration of the transmission terminal 3 according to the second embodiment. As shown in FIG. 8, the transmission terminal 3 according to the second embodiment includes a throughput measurement unit 301, a throughput prediction unit 302, a pattern generation unit 303, a sensation quality calculation unit 304, a pattern selection unit 305, and parameters. And a determination unit 306.

 本実施形態2のスループット計測部301は、例えば、前記実施形態1のスループット計測部104と対応する。本実施形態2のスループット予測部302および体感品質算出部304は、例えば、前記実施形態1のQoE算出部106に対応する。 The throughput measurement unit 301 of the second embodiment corresponds to, for example, the throughput measurement unit 104 of the first embodiment. The throughput prediction unit 302 and the sensation quality calculation unit 304 of the second embodiment correspond to, for example, the QoE calculation unit 106 of the first embodiment.

 また、本実施形態2のパターン生成部303は、例えば、前記実施形態1の分割パターン生成部105と対応する。本実施形態2のパターン選択部305およびパラメータ決定部306は、例えば、前記実施形態1のパラメータ決定部107と対応する。 The pattern generation unit 303 of the second embodiment corresponds to, for example, the divided pattern generation unit 105 of the first embodiment. The pattern selection unit 305 and the parameter determination unit 306 of the second embodiment correspond to, for example, the parameter determination unit 107 of the first embodiment.

 なお、本実施形態2に係わる送信端末3は、前記実施形態1の送信端末1が備えたエンコーダ101および送信部102と対応する構成要素を更に備えていてもよい。 The transmitting terminal 3 according to the second embodiment may further include components corresponding to the encoder 101 and the transmitting unit 102 provided in the transmitting terminal 1 of the first embodiment.

 スループット計測部301は、受信端末(図示せず)との間の通信のスループットを計測する。スループット予測部302は、スループット計測部301が計測したスループットに基づいて、未来のスループットを予測する。 The throughput measuring unit 301 measures the throughput of communication with a receiving terminal (not shown). The throughput prediction unit 302 predicts the future throughput based on the throughput measured by the throughput measurement unit 301.

 パターン生成部303は、所定時間を分割することによって、複数の分割時間でそれぞれ構成される複数のパターンを生成する。 The pattern generation unit 303 generates a plurality of patterns each configured of a plurality of division times by dividing a predetermined time.

 体感品質算出部304は、パターン生成部303が生成したパターンに応じた分割時間ごとに、スループット予測部302が予測した未来のスループットの下限値を超えないような最大ビットレートで受信端末へ映像データを送信した場合に、該映像データから再生される映像の体感品質を算出する。 The perceptual quality calculation unit 304 transmits video data to the receiving terminal at the maximum bit rate that does not exceed the lower limit value of the future throughput predicted by the throughput prediction unit 302 every division time according to the pattern generated by the pattern generation unit 303. Is transmitted, the sensory quality of the video reproduced from the video data is calculated.

 パターン選択部305は、パターン生成部303が生成した複数のパターンのうち、体感品質算出部304が算出した映像の体感品質が最大になる1つのパターンを選択する。 The pattern selection unit 305 selects, from among the plurality of patterns generated by the pattern generation unit 303, one pattern that maximizes the sensation quality of the image calculated by the sensation quality calculation unit 304.

 パラメータ決定部306は、パターン選択部305が選択した1つのパターンに応じた分割時間ごとに、映像データの圧縮処理のための符号化パラメータを決定する。 The parameter determination unit 306 determines, for each division time corresponding to one pattern selected by the pattern selection unit 305, a coding parameter for compression processing of video data.

 本実施形態2の構成によれば、所定時間をどのように分割するかを規定する複数のパターンの中から、映像の体感品質が最大になる1つのパターンを特定し、特定したパターンに基づいて、符号化パラメータを決定する。したがって、ネットワークを介した映像伝送において、映像の体感品質を向上させることができる。 According to the configuration of the second embodiment, from among a plurality of patterns that define how to divide the predetermined time, one pattern that maximizes the quality of sensation of the video is specified, and based on the specified pattern , Determine the coding parameters. Therefore, in the video transmission via the network, the quality of experience of the video can be improved.

 〔実施形態3〕
 図9は、本実施形態3に係わる送信端末4のハードウェア構成を示すブロック図である。図9に示すように、本実施形態3に係わる送信端末は、CPU(Central Processing Unit)401、エンコーダ402、およびインターフェース403を備えている。本実施形態に係わる送信端末4は、例えば、前記実施形態1に係わる送信端末1、または、前記実施形態2に係わる送信端末3に対応する。
Third Embodiment
FIG. 9 is a block diagram showing the hardware configuration of the transmission terminal 4 according to the third embodiment. As shown in FIG. 9, the transmission terminal according to the third embodiment includes a central processing unit (CPU) 401, an encoder 402, and an interface 403. The transmission terminal 4 according to the present embodiment corresponds to, for example, the transmission terminal 1 according to the first embodiment or the transmission terminal 3 according to the second embodiment.

 CPU401は、本実施形態に係わる送信端末4の制御機能を実現する。エンコーダ402は、例えば、前記実施形態1のエンコーダ101に対応する。インターフェース403は、例えば、前記実施形態1の送信部102およびデータ受信部103に対応する。 The CPU 401 implements the control function of the transmission terminal 4 according to the present embodiment. The encoder 402 corresponds to, for example, the encoder 101 of the first embodiment. The interface 403 corresponds to, for example, the transmission unit 102 and the data reception unit 103 of the first embodiment.

 本実施形態に係わる送信端末4は、前記実施形態1に係わる送信端末1の制御機能(図1に示す、スループット計測部104、分割パターン生成部105、QoE算出部106、パラメータ決定部107、およびエンコーダ101)の全部または一部を実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをCPU401に読み込ませ、実行することにより各部の処理を行ってもよい。なお、ここでいう「コンピュータシステム」とは、OSや周辺機器等のハードウェアを含むものとする。 The transmission terminal 4 according to the present embodiment has a control function of the transmission terminal 1 according to the first embodiment (throughput measurement unit 104, division pattern generation unit 105, QoE calculation unit 106, parameter determination unit 107, and so on shown in FIG. A program for realizing all or part of the encoder 101) is recorded in a computer readable recording medium, and the program recorded in the recording medium is read by the CPU 401 and executed to perform processing of each part. It is also good. Here, the “computer system” includes an OS and hardware such as peripheral devices.

 あるいは、本実施形態に係わる送信端末4は、前記実施形態2に係わる送信端末3の制御機能(図8に示す、スループット計測部301、スループット予測部302、パターン生成部303、体感品質算出部304、パターン選択部305、およびパラメータ決定部306)の全部または一部を実現するためのプログラムをコンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをCPU401に読み込ませ、実行することにより各部の処理を行ってもよい。 Alternatively, the transmitting terminal 4 according to the present embodiment has the control function of the transmitting terminal 3 according to the second embodiment (throughput measuring unit 301, throughput predicting unit 302, pattern generating unit 303, quality of experience calculating unit 304 shown in FIG. , A program for realizing all or part of the pattern selection unit 305 and the parameter determination unit 306) in a computer readable recording medium, and causing the CPU 401 to read the program recorded in the recording medium, and executing the program The processing of each part may be performed by doing this.

 ここで、「コンピュータ読み取り可能な記録媒体」とは、フレキシブルディスク、光磁気ディスク、ROM(Read Only Memory)、CD(Compact Disc)-ROM等の可搬媒体、コンピュータシステムに内蔵されるハードディスク等の記憶装置のことをいう。 Here, “computer readable recording medium” means portable media such as a flexible disk, a magneto-optical disk, a ROM (Read Only Memory), a CD (Compact Disc) -ROM, and a hard disk incorporated in a computer system. It refers to a storage device.

 また、CPU401が読み込んで実行するプログラムは、前述した機能の一部を実現するためのものであっても良く、さらに前述した機能をコンピュータシステムにすでに記録されているプログラムとの組み合わせで実現できるものであっても良い。 Further, the program read and executed by the CPU 401 may be for realizing a part of the above-described functions, and the above-described functions may be realized in combination with the program already recorded in the computer system. It may be

 本実施形態3の構成によれば、前記実施形態1または前記実施形態2と同様の効果を奏する。 According to the configuration of the third embodiment, the same effect as that of the first embodiment or the second embodiment can be obtained.

 以上、図面を参照して、本発明の実施形態を詳述してきたが、本発明を実施するための具体的な構成はこれらの実施形態に限られるものではなく、本発明の要旨を逸脱しない範囲の設計等も含まれる。 As mentioned above, although the embodiment of the present invention has been described in detail with reference to the drawings, the specific configuration for carrying out the present invention is not limited to these embodiments, and does not deviate from the gist of the present invention The design of the range is also included.

 この出願は、2017年6月19日に出願された日本出願特願2017-119257を基礎とする優先権を主張し、その開示の全てをここに取り込む。 This application claims priority based on Japanese Patent Application No. 2017-119257 filed on Jun. 19, 2017, the entire disclosure of which is incorporated herein.

   1 送信端末(送信装置)
   2 受信端末(受信装置)
   3 送信端末(送信装置)
   4 送信端末(送信装置)
 101 エンコーダ
 102 送信部(送信手段)
 104 スループット計測部(スループット計測手段)
 105 分割パターン生成部(パターン生成手段、パターン選択手段)
 106 QoE算出部(スループット予測手段、体感品質算出手段)
 107 パラメータ決定部(パラメータ決定手段)
 201 受信部(受信手段)
 202 データ送信部
 301 スループット計測部(スループット計測手段)
 302 スループット予測部(スループット予測手段)
 303 パターン生成部(パターン生成手段)
 304 体感品質算出部(体感品質算出手段)
 305 パターン選択部(パターン選択手段)
 306 パターン決定部(パターン決定手段)
 402 エンコーダ
1 Transmission terminal (transmission device)
2 Receiving terminal (receiving device)
3 Sending terminal (sending device)
4 Sending terminal (sending device)
101 encoder 102 transmission unit (transmission means)
104 Throughput Measurement Unit (Throughput Measurement Means)
105 Division pattern generation unit (pattern generation unit, pattern selection unit)
106 QoE calculation unit (throughput prediction means, quality of experience calculation means)
107 Parameter determination unit (parameter determination means)
201 Receiver (Receiver)
202 Data Transmission Unit 301 Throughput Measurement Unit (Throughput Measurement Means)
302 Throughput prediction unit (throughput prediction means)
303 Pattern Generator (Pattern Generator)
304 Feeling Quality Calculator (Sensing Quality Calculator)
305 Pattern Selection Unit (Pattern Selection Means)
306 Pattern Determination Unit (Pattern Determination Unit)
402 encoder

Claims (10)

 受信装置との間の通信のスループットを計測するスループット計測手段と、
前記スループット計測手段が計測した前記スループットに基づいて、未来のスループットを予測するスループット予測手段と、
 所定時間を分割することによって、複数の分割時間でそれぞれ構成される複数のパターンを生成するパターン生成手段と、
 前記パターン生成手段が生成した前記パターンに応じた前記分割時間ごとに、前記スループット予測手段が予測した前記未来のスループットの下限値を超えないような最大ビットレートで前記受信装置へ映像データを送信した場合に、該映像データから再生される映像の体感品質を算出する体感品質算出手段と、
 前記パターン生成手段が生成した前記複数のパターンのうち、前記体感品質算出手段が算出した前記映像の体感品質が最大になる1つのパターンを選択するパターン選択手段と、
 前記パターン選択手段が選択した前記1つのパターンに応じた前記分割時間ごとに、前記映像データの圧縮処理のための符号化パラメータを決定するパラメータ決定手段と、
を有する送信装置。
Throughput measuring means for measuring the throughput of communication with the receiving device;
A throughput prediction unit that predicts a future throughput based on the throughput measured by the throughput measurement unit;
Pattern generation means for generating a plurality of patterns each constituted by a plurality of division times by dividing a predetermined time;
The video data is transmitted to the receiving apparatus at a maximum bit rate which does not exceed the lower limit value of the future throughput predicted by the throughput prediction unit at each division time according to the pattern generated by the pattern generation unit. A sensory quality calculation means for calculating the sensory quality of the video to be reproduced from the video data;
Pattern selection means for selecting one of the plurality of patterns generated by the pattern generation means, the pattern having the highest perceived quality of the image calculated by the sensory quality calculation means;
Parameter determining means for determining encoding parameters for the compression process of the video data for each of the division times according to the one pattern selected by the pattern selecting means;
A transmitter having
 前記パラメータ決定手段が決定した前記符号化パラメータを用いて、前記映像データの圧縮処理を行うエンコーダと、
 前記エンコーダが圧縮した前記映像データを前記受信装置へ送信する送信手段と、
をさらに有することを特徴とする請求項1に記載の送信装置。
An encoder for compressing the video data using the coding parameter determined by the parameter determination unit;
Transmitting means for transmitting the video data compressed by the encoder to the receiving device;
The transmitter according to claim 1, further comprising:
 前記符号化パラメータは、GOP(Group of picture)サイズおよびビットレートを少なくとも含むことを特徴とする請求項1または2に記載の送信装置。 The transmission apparatus according to claim 1, wherein the encoding parameter includes at least a group of picture (GOP) size and a bit rate.  前記パターン生成手段が生成する前記パターンは、複数の分割時刻T_i(i=1、2、・・・)を含み、前記複数の分割時刻T_iは、前記複数の分割時間(T_i-T_{i-1})を規定し、
 前記体感品質算出手段は、前記複数の分割時間(T_i-T_{i-1})の各々における前記最大ビットレートb_iおよびGOPサイズg_iを変数として含む算出関数Q(b_i,g_i)に基づいて、前記体感品質を算出し、
 算出される前記体感品質は以下の数式で表されることを特徴とする請求項1から3のいずれか1項に記載の送信装置。
Figure JPOXMLDOC01-appb-I000001
(ここで、g_i=f*(T_i-T_{i-1})であり、fは前記映像データのフレームレートである)
The pattern generated by the pattern generation unit includes a plurality of division times T_i (i = 1, 2,...), And the plurality of division times T_i are the plurality of division times (T_i-T_ {i− 1)),
The sensory quality calculation means is based on a calculation function Q (b_i, g_i) including the maximum bit rate b_i and the GOP size g_i in each of the plurality of divided times (T_i-T_ {i-1}) as variables. Calculate the quality of experience,
The transmitting apparatus according to any one of claims 1 to 3, wherein the perceived quality calculated is represented by the following formula.
Figure JPOXMLDOC01-appb-I000001
(Here, g_i = f * (T_i-T_ {i-1}), and f is a frame rate of the video data)
 前記パラメータ決定手段は、前記パターン選択手段が選択した前記パターンの分割時間(T_i-T_{i-1})における前記最大ビットレートb_iを、前記分割時間(T_i-T_{i-1})における前記符号化パラメータとして決定することを特徴とする請求項4に記載の送信装置。 The parameter determining means determines the maximum bit rate b_i in the division time (T_i-T_ {i-1}) selected by the pattern selection means in the division time (T_i-T_ {i-1}). The transmitter according to claim 4, characterized in that it is determined as the coding parameter.  前記スループット予測手段は、前記スループット計測手段が計測した前記スループットに基づいて、前記未来のスループットの期待値および分散を算出し、
 前記体感品質算出手段は、前記スループット予測手段が算出した前記期待値マイナス前記分散を、前記下限値として設定することを特徴とする請求項1から5のいずれか1項に記載の送信装置。
The throughput predicting means calculates an expected value and a variance of the future throughput based on the throughput measured by the throughput measuring means;
The transmitting apparatus according to any one of claims 1 to 5, wherein the feeling quality calculation means sets the expected value minus the variance calculated by the throughput prediction means as the lower limit value.
 請求項1から6のいずれか1項に記載の送信装置と、
 前記受信装置と、を含む通信システム。
A transmitter according to any one of claims 1 to 6,
A communication system including the receiving device.
 前記受信装置は、
 前記送信装置から前記映像データを受信する受信手段と、
 前記スループット計測手段が前記スループットを計測するための情報を前記送信装置へ送信するスループット計測用データ送信手段と、
を有することを特徴とする請求項7に記載の通信システム。
The receiving device is
Receiving means for receiving the video data from the transmitting device;
A throughput measurement data transmission unit for transmitting information for measuring the throughput to the transmission device by the throughput measurement unit;
The communication system according to claim 7, characterized in that:
受信装置との間の通信のスループットを計測し、
計測した前記スループットに基づいて、未来のスループットを予測し、
所定時間を分割することによって、複数の分割時間でそれぞれ構成される複数のパターンを生成し、
生成した前記パターンに応じた前記分割時間ごとに、前記未来のスループットの下限値を超えないような最大ビットレートで前記受信装置へ映像データを送信した場合に、該映像データから再生される映像の体感品質を算出し、
前記複数のパターンのうち、前記映像の体感品質が最大になる1つのパターンを選択し、
 選択した前記1つのパターンに応じた前記分割時間ごとに、前記映像データの圧縮処理のための符号化パラメータを決定する、
 通信方法。
Measure the throughput of communication with the receiver,
Predict future throughput based on the measured throughput,
By dividing a predetermined time, a plurality of patterns each formed of a plurality of divided times are generated,
When video data is transmitted to the receiving apparatus at a maximum bit rate that does not exceed the lower limit value of the future throughput for each of the divided times according to the generated pattern, the video data reproduced from the video data Calculate quality of experience,
Among the plurality of patterns, one pattern which maximizes the quality of sensation of the image is selected,
The encoding parameter for the compression process of the video data is determined for each of the division times according to the selected one pattern.
Communication method.
 受信装置との間の通信のスループットを計測することと、
 計測した前記スループットに基づいて、未来のスループットを予測することと、
 所定時間を分割することによって、複数の分割時間でそれぞれ構成される複数のパターンを生成することと、
 生成した前記パターンに応じた前記分割時間ごとに、前記未来のスループットの下限値を超えないような最大ビットレートで前記受信装置へ映像データを送信した場合に、該映像データから再生される映像の体感品質を算出することと、
 前記複数のパターンのうち、前記映像の体感品質が最大になる1つのパターンを選択することと、
 選択した前記1つのパターンに応じた前記分割時間ごとに、前記映像データの圧縮処理のための符号化パラメータを決定することと、
 をコンピュータに実行させるプログラムを格納した、一時的でない記録媒体。
Measuring the throughput of communication with the receiving device;
Predicting future throughput based on the measured throughput;
Generating a plurality of patterns each configured of a plurality of division times by dividing the predetermined time;
When video data is transmitted to the receiving apparatus at a maximum bit rate that does not exceed the lower limit value of the future throughput for each of the divided times according to the generated pattern, the video data reproduced from the video data Calculating quality of experience,
Among the plurality of patterns, selecting one pattern that maximizes the perceived quality of the image;
Determining an encoding parameter for the compression process of the video data for each of the division times according to the selected one pattern;
A non-transitory recording medium storing a program that causes a computer to execute.
PCT/JP2018/022595 2017-06-19 2018-06-13 Transmission apparatus, communication system, communication method, and recording medium Ceased WO2018235697A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP2019525486A JP6825703B2 (en) 2017-06-19 2018-06-13 Transmitters, communication systems, communication methods, and programs

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2017119257 2017-06-19
JP2017-119257 2017-06-19

Publications (1)

Publication Number Publication Date
WO2018235697A1 true WO2018235697A1 (en) 2018-12-27

Family

ID=64737091

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2018/022595 Ceased WO2018235697A1 (en) 2017-06-19 2018-06-13 Transmission apparatus, communication system, communication method, and recording medium

Country Status (2)

Country Link
JP (1) JP6825703B2 (en)
WO (1) WO2018235697A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2022119167A (en) * 2021-02-03 2022-08-16 日本放送協会 Receiving device and program
JPWO2022185534A1 (en) * 2021-03-05 2022-09-09

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011125295A1 (en) * 2010-04-02 2011-10-13 パナソニック株式会社 Wireless communication device and wireless communication method
WO2012101709A1 (en) * 2011-01-28 2012-08-02 日本電気株式会社 Throughput estimation device

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011125295A1 (en) * 2010-04-02 2011-10-13 パナソニック株式会社 Wireless communication device and wireless communication method
WO2012101709A1 (en) * 2011-01-28 2012-08-02 日本電気株式会社 Throughput estimation device

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
HIROSHI YOSHIDA ET AL.: "Constructing stochastic model of TCP throughput on basis of stationarity analysis", GLOBECOM 2013 - COMMUNICATIONS QOS, RELIABILITY AND MODELLING SYMPOSIUM, 13 December 2013 (2013-12-13), pages 1544 - 1550, XP032604658, ISBN: 978-1-4799-1353-4, DOI: 10.1109/GLOCOM.2013.6831293 *
KOICHI NIHEI ; HIROSHI YOSHIDA ; NATSUKI KAI ; DAI KANETOMO ; KOZO SATODA: "QoE maximizing bitrate control for live video streaming on a mobile uplink", 2017 14TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS (CONTEL), 28 June 2017 (2017-06-28) - 30 June 2017 (2017-06-30), XP033135963, DOI: 10.23919/ConTEL.2017.8000044 *
NIHEI, K. ET AL.: "A QoE-driven video bitrate control method for live-streaming on mobile uplink", IEICE TECHNICAL REPORT, vol. 116, no. 171, 25 July 2016 (2016-07-25), pages 1 - 6, ISSN: 0913-5685 *
YOSHIDA, H. ET AL.: "Video streaming control by predicting stochastic diffusion of TCP troughput", 27 October 2011 (2011-10-27), pages 57 - 66, Retrieved from the Internet <URL:https://www.internetconference.org/ic2011/PDF/ic2011-paper5.pdf> [retrieved on 20180703] *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2022119167A (en) * 2021-02-03 2022-08-16 日本放送協会 Receiving device and program
JP7759210B2 (en) 2021-02-03 2025-10-23 日本放送協会 Receiving device and program
JPWO2022185534A1 (en) * 2021-03-05 2022-09-09
WO2022185534A1 (en) * 2021-03-05 2022-09-09 日本電信電話株式会社 Coding parameter derivation device, coding parameter derivation method, and program
JP7525045B2 (en) 2021-03-05 2024-07-30 日本電信電話株式会社 Coding parameter derivation device, coding parameter derivation method, and program

Also Published As

Publication number Publication date
JPWO2018235697A1 (en) 2020-04-02
JP6825703B2 (en) 2021-02-03

Similar Documents

Publication Publication Date Title
CN110620892B (en) Video Annotation Technology in Video Communication
JP7229261B2 (en) VIDEO ENCODER BITRATE CONTROL METHOD, APPARATUS, DEVICE, STORAGE MEDIUM AND PROGRAM
JP6851678B2 (en) Intraframe bit rate allocation method, computer equipment, and storage media
CN117793373A (en) Machine learning video processing systems and methods
CN114845106A (en) Video coding method, video coding device, storage medium and electronic equipment
US20140119456A1 (en) Encoding video into lower resolution streams
CN101656887B (en) Method and device for selecting rate control algorithm
TW202002632A (en) Method of adaptive filtering for multiple reference line of intra prediction in video coding, video encoding apparatus and video decoding apparatus therewith
Reznik et al. Optimal multi-codec adaptive bitrate streaming
Hu et al. TVG-streaming: Learning user behaviors for QoE-optimized 360-degree video streaming
JP5333128B2 (en) Moving picture encoding apparatus and program, and moving picture distribution system
CN106713901A (en) Video quality evaluation method and apparatus thereof
JP6825703B2 (en) Transmitters, communication systems, communication methods, and programs
JP7136084B2 (en) Communication device, media delivery system, media delivery method and program
JP6867275B2 (en) Video coding parameter adjustment device, video coding parameter adjustment method and program
US9319680B2 (en) Image encoding apparatus and its control method
JP6145069B2 (en) Subjective image quality estimation device and subjective image quality estimation program
US11699212B2 (en) Method of controlling encoding of display data
JP2017076892A (en) Buffer size-deciding device, system, buffer size-deciding method, video data reproducing method and program
CN103959783A (en) Video quality measurement considering multiple artifacts
JP2022119167A (en) Receiving device and program
KR20150114959A (en) Method and apparatus for context-based video quality assessment
JP6373681B2 (en) Objective image quality evaluation apparatus, objective image quality evaluation method, and program
JP6954289B2 (en) Bit rate indicator, bit rate indicator method, and bit rate indicator program
JP7280535B2 (en) Encoded data generation method, encoded data generation device, and program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18820297

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2019525486

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18820297

Country of ref document: EP

Kind code of ref document: A1