[go: up one dir, main page]

WO2010027142A2 - Transmitting/receiving system and transmitting/receiving method for multi-view video - Google Patents

Transmitting/receiving system and transmitting/receiving method for multi-view video Download PDF

Info

Publication number
WO2010027142A2
WO2010027142A2 PCT/KR2009/003190 KR2009003190W WO2010027142A2 WO 2010027142 A2 WO2010027142 A2 WO 2010027142A2 KR 2009003190 W KR2009003190 W KR 2009003190W WO 2010027142 A2 WO2010027142 A2 WO 2010027142A2
Authority
WO
WIPO (PCT)
Prior art keywords
video
bitstream
view
enhancement layer
communication terminal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/KR2009/003190
Other languages
French (fr)
Korean (ko)
Other versions
WO2010027142A3 (en
Inventor
λ°°νƒœλ©΄
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SK Telecom Co Ltd
Original Assignee
SK Telecom Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SK Telecom Co Ltd filed Critical SK Telecom Co Ltd
Publication of WO2010027142A2 publication Critical patent/WO2010027142A2/en
Publication of WO2010027142A3 publication Critical patent/WO2010027142A3/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/266Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel
    • H04N21/2662Controlling the complexity of the video stream, e.g. by scaling the resolution or bitrate of the video stream based on the client capabilities
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and clientΒ 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • H04N21/21805Source of audio or video content, e.g. local disk arrays enabling multiple viewpoints, e.g. using a plurality of cameras
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234327Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by decomposing into layers, e.g. base layer and one or more enhancement layers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/12Systems in which the television signal is transmitted via one channel or a plurality of parallel channels, the bandwidth of each channel being less than the bandwidth of the television signal

Definitions

  • the present invention relates to a system and method for transmitting and receiving a multiview video, and more particularly, a basic video is encoded into a base layer from a video photographed by a plurality of photographing means, and provided to a communication terminal through a broadcasting network.
  • the present invention relates to a multi-view video transmission / reception system and method for encoding an enhancement layer bitstream using encoding information of the base layer and providing the same to a communication terminal through a communication network.
  • the basic principle of compressing data is to eliminate spatial and temporal redundancy of the data.
  • Spatial redundancy is the repetition of the same color or object in a video.
  • temporal duplication means that the adjacent picture is almost unchanged in the moving picture or the same sound is repeatedly repeated in the audio.
  • temporal redundancy is eliminated by temporal filtering based on motion compensation and spatial redundancy is removed by spatial transform.
  • a transmission medium is required to transmit multimedia data generated after data duplication is removed.
  • the performance varies depending on the transmission medium.
  • scalable video coding technology is available.
  • Scalable video coding technology cuts a portion of a bit stream according to surrounding conditions such as transmission bit rate, transmission error rate, and system resources, and adjusts the resolution, frame rate, and signal to noise ratio (SNR) of the already compressed bitstream.
  • SNR signal to noise ratio
  • a temporal scalability is realized by constructing a motion prediction image with a hierarchical structure of a frame temporally, and spatially scalable coding is performed through layered coding in which layers are separately encoded according to each resolution.
  • image quality image quality scalability is implemented by using layered coding or fine grain scalability.
  • Multi-view video compression is for simultaneously compressing, storing, and transmitting video input from multiple cameras.
  • the size of the data is very large. Therefore, a large transmission bandwidth is required to transmit data to a user through a broadcasting network or a wired / wireless internet in real time.
  • multi-view video coding and decoding may be performed by independently coding and transmitting each video sequence and then decoding the video sequence. This can be easily implemented through existing MPEG-1 / 2/4 or H.261 / 263/264.
  • the conventional technology as described above has a disadvantage in that it is not possible to remove redundancy between videos caused by photographing the same object with a plurality of cameras.
  • the conventional technology as described above has a disadvantage in that it is not possible to remove redundancy between videos caused by photographing the same object with a plurality of cameras.
  • the broadcast network there was a disadvantage that you can not watch the video at different views.
  • a method of multiplexing a bitstream that can be independently decoded and a bitstream that is independently decoded and transmitted through a broadcasting network or a network is generally known.
  • An object of the present invention for solving the above-mentioned problems of the prior art is to transmit the additional video information through the communication network to watch the video at the desired time when the user wants to watch the video at a different time while watching the video through the broadcasting network.
  • Another object of the present invention is to provide a multi-view video transmission / reception system and method for compressing a base video into a base layer and compressing a video of another view into an enhancement layer.
  • Another object of the present invention is to provide a multi-view video transmission / reception system and method for providing a bitstream encoded in a base layer through a broadcasting network and providing a bitstream encoded in an enhancement layer through a communication network.
  • Another object of the present invention is to provide a multi-view video transmission / reception system and method for removing and encoding redundancy as an enhancement layer for a video at another point in time except for a basic video.
  • a communication terminal which transmits a message to a multi-view video transmission system, receives an enhancement layer bitstream at a corresponding time point, and decodes the enhancement layer bitstream using decoding information of the base layer;
  • a base video is encoded into a base layer among videos taken by a plurality of photographing means and provided to the communication terminal through a broadcasting network, and a video is encoded into an enhancement layer bitstream using encoding information of the base layer for a video of another viewpoint.
  • a multi-view video transmission / reception system including a multi-view video transmission system provided to the communication terminal is provided.
  • the communication terminal converts the video through the broadcasting network into the video through the communication network when the video is received through the communication network.
  • the communication terminal outputs a video through a broadcast network and a video through a communication network as a PIP or POP.
  • a base video is encoded into a base layer among multiview videos captured by a plurality of photographing means, and encoding information of the base layer is encoded for a video of another viewpoint.
  • a multiview video encoding apparatus encoding the enhancement layer bitstream using an encoding layer;
  • a broadcast server providing a basic video coded by the multi-view video encoding apparatus to a communication terminal through a broadcast network;
  • a multiview including a streaming server that selects a bitstream of a corresponding view from among bitstreams of an enhancement layer encoded by the multiview video encoding apparatus and provides the bitstream at a corresponding time to the communication terminal.
  • the video request message includes viewpoint information selected by the communication terminal user.
  • the bitstream of each enhancement layer encoded by the multiview video encoding apparatus includes view information.
  • the multi-view video encoding apparatus compresses each view in a two-layer structure of MPEG-2 MVP in the case of MPEG-2, and compresses each view in a two-layer structure of SVC in the case of H.264.
  • a multiview video encoding apparatus comprising: a base layer encoder configured to encode a base video into a base layer from a multiview video photographed by a plurality of photographing means; A multi-view video encoding apparatus including a plurality of enhancement layer encoders encoding a bitstream of an enhancement layer by using encoding information of a base layer encoded by the base layer encoder for a video of another viewpoint is provided.
  • the enhancement layer encoder encodes the video of the base layer by removing redundancy of the video of the base layer.
  • the enhancement layer encoder sets view information in the bitstream of each encoded enhancement layer.
  • a video request message including viewpoint information is received from a communication unit for communication with a communication terminal through a communication network, and the communication terminal through the communication unit, the video request message is analyzed and the corresponding time point is analyzed.
  • a streaming server including a time determination unit for determining a switching unit for selecting the enhancement layer bitstream corresponding to the time determined by the time determination unit to be provided to the communication terminal.
  • the streaming server further includes a storage unit for receiving and storing a plurality of enhancement layer bitstreams from a multiview video encoding apparatus.
  • the switching unit extracts a corresponding enhancement layer bitstream from a storage unit or a multi-view video encoding apparatus and provides the same to the communication terminal.
  • a communication terminal including a communication unit, an input unit, and an output unit for outputting a multi-view video using a broadcasting network and a communication network, which decodes and outputs a bitstream of a base layer received through the broadcasting network.
  • a decoder which decodes the bitstream of the enhancement layer by using the decoding information of the base layer when the bitstream of the enhancement layer is received through the communication network;
  • a communication terminal including a service request unit generating a video request message including corresponding point information and transmitting the same to the multi-view video transmission system through the communication unit is provided.
  • the decoding unit may include: a base layer decoding unit decoding a bitstream of a base layer received through the broadcasting network, and an enhancement of decoding a bitstream of an enhancement layer received through the communication network using information decoded by the base layer decoding unit. It includes a layer decoding unit.
  • a multi-view video encoding apparatus encodes a basic video as a basic layer among videos captured by a plurality of photographing means, and encodes a video of another viewpoint using encoding information of the base layer.
  • B providing a bitstream of a base layer encoded by the multi-view video encoding apparatus to a communication terminal through a broadcasting network, and (c) in the communication terminal, encoding the enhancement layer bisstream.
  • a video request message including corresponding view information to a streaming server when a video output of another view through a communication network is requested while decoding and outputting a base layer bitstream from a server,
  • Enhancement layer at that time in response to video requests from communication terminals Selecting and providing a bitstream to the communication terminal through a communication network;
  • step (a) the multi-view video encoding apparatus provides a bitstream encoded with a base layer to the broadcast server, and provides a bisstream encoded with an enhancement layer to the streaming server.
  • the enhancement layer bitstream encoded in step (a) includes view information.
  • a method of encoding a multiview video by a multiview video encoding apparatus comprising: encoding a basic video from a multiview video photographed by a plurality of photographing means into a base layer bitstream;
  • a multi-view video encoding method is provided, which comprises encoding a video stream of another view by using the encoding information of the base layer bitstream into a bitstream of an enhancement layer.
  • the method comprises: receiving and storing a plurality of enhancement layer bisstreams from a multiview video encoding apparatus; When the video request message including the view information is received from the communication terminal, determining the corresponding view by analyzing the video request message; A method for transmitting a multi-view video through a communication network is provided, the method including selecting an enhancement layer bitstream corresponding to the determined time point from the stored enhancement layer bitstream and transmitting the selected enhancement layer bitstream to the communication terminal.
  • a video output of another point in time through the communication network is requested while decoding and outputting the bitstream of the base layer received through the broadcast network, a video request message including the view information is provided.
  • a video request message including the view information is provided.
  • the bitstream of the enhancement layer corresponding to the request time point is received from the multiview video transmission system, decoding information of the base layer bitstream from the bitstream of the enhancement layer.
  • Step (a) may include: decoding and outputting a bitstream of a base layer received through a broadcasting network; Determining a corresponding point in time when a video output of another point in time is requested by the user; Generating a video request message including the determined time point and transmitting the multi-view video transmission system.
  • a multi-view video encoding method can be recorded as a program on a computer-readable medium such as a CD or a USB memory.
  • a multi-view video of which a user can watch a video at a desired point of view by transmitting additional video information through a communication network. It is possible to provide a transmission and reception system and a method.
  • a multi-view video transmission / reception system and method may be provided for removing and encoding redundancy as an enhancement layer for a video at a view point other than the basic video.
  • FIG. 1 is a diagram illustrating a system for transmitting and receiving multiview video according to the present invention.
  • FIG. 2 is a block diagram schematically illustrating a configuration of a multi-view video encoding apparatus according to the present invention.
  • Figure 3 is a block diagram schematically showing the configuration of a streaming server according to the present invention.
  • FIG. 4 is a block diagram schematically illustrating a configuration of a communication terminal for outputting a multiview video using a broadcasting network and a communication network according to the present invention.
  • FIG. 5 is a flowchart illustrating a method of encoding a multiview video by a multiview video encoding apparatus according to the present invention.
  • FIG. 6 is a flowchart illustrating a method for providing a multi-view video to a communication terminal according to the present invention.
  • FIG. 7 is a flowchart illustrating a method for providing an enhancement layer bitstream to a communication terminal by a streaming server according to the present invention.
  • FIG. 1 is a diagram illustrating a system for transmitting and receiving multiview video according to the present invention.
  • a multi-view video transmission / reception system receives a video through a broadcasting network and a communication network, and outputs a video to a request of the communication terminal 200 by encoding a video photographed by a plurality of photographing means. Accordingly, the multi-view video transmission system 100 provides a corresponding video to the communication terminal 200 through a broadcasting network or a communication network.
  • the communication terminal 200 re-views the video request message including the corresponding view information when the video output of another view through the communication network is requested while decoding and outputting the video of the base layer received through the broadcasting network.
  • the video signal is transmitted to the video transmission system 100 to receive an enhancement layer bitstream at a corresponding time point, and to decode and output the enhancement layer bitstream using decoding information of the base layer.
  • the communication terminal 200 includes a mobile communication terminal, a wired communication terminal, and the like, capable of receiving a video through a broadcasting network and a communication network.
  • the communication terminal 200 converts the currently output video into a video through the communication network and outputs the video.
  • the communication terminal 200 may output the video through the broadcasting network and the video through the communication network as PIP or POP.
  • the multi-view video transmission system 100 encodes a basic video into a base layer from a video photographed by a plurality of photographing means and provides it to the communication terminal 200 through a broadcasting network.
  • the multi-view video transmission system 100 encodes a video of another view into an enhancement layer bitstream using encoding information of the base layer, and when a video request message is received from the communication terminal 200, The bitstream is provided to the communication terminal 200 through a communication network.
  • the multiview video transmission system 100 includes a multiview video encoding apparatus 120, a broadcast server 150, and a streaming server 170.
  • the multi-view video encoding apparatus 120 encodes the base video from the multi-view video captured by the plurality of photographing means into a base layer, and encodes the base video into an enhancement layer bitstream using encoding information of the base layer with respect to the video from another viewpoint. do.
  • the broadcast server 150 provides a basic video coded by the multi-view video encoding apparatus 120 to the communication terminal 200 through a broadcast network.
  • the streaming server 170 selects a bitstream of the corresponding view from among bitstreams of the enhancement layer encoded by the multiview video encoding apparatus 120 to communicate with the communication terminal. Provided at 200.
  • the video request message includes viewpoint information selected by the user of the communication terminal.
  • the multi-view video transmission system 100 receives the view request information when the video request message is received from the communication terminal 200.
  • a bitstream of an enhancement layer may be selected and provided.
  • the multi-view video encoding apparatus 120 compresses each viewpoint by using a two-layer structure of MPEG-2 MVP in the case of MPEG-2, and compresses each viewpoint by using a two-layer structure of SVC in the case of H.264. do.
  • the broadcast server 150 corresponds to a current digital terrestrial broadcast, satellite digital broadcast, T-DMB, and S-DMB transmission system, and transmits a base layer of a video encoded by the multiview video encoding apparatus 120.
  • the streaming server 170 receives information on a time point desired by a user from the communication terminal 200 through a communication network and performs streaming of an enhancement layer bitstream corresponding to the IP layer.
  • a multi-view video is compressed using a block-based codec of a hierarchical encoder structure.
  • the base layer can be decoded by the codec of the broadcast network, and the video of the other time is compressed by using a layer-based multi-view video compression method that compresses by reducing the redundancy between the images using the information of the base layer.
  • a user having only a broadcast receiving device watches a video of a base layer, and a user connected to an additional communication network selects a desired time point and requests the streaming server 170 from the streaming server 170 to meet the request time. Additional bitstreams can be received and viewed.
  • FIG. 2 is a block diagram schematically illustrating a configuration of a multiview video encoding apparatus according to the present invention.
  • the multiview video encoding apparatus 120 may include a base layer encoder 122 that encodes a base video as a base layer among multiview videos photographed by a plurality of photographing means, and at different viewpoints.
  • a plurality of enhancement layer encoders 124a, 124b,..., 124n, 124, which are encoded into a beast stream of an enhancement layer by using encoding information of the base layer encoded by the base layer encoder 122 for a video, are hereinafter referred to as 124. ).
  • the base layer encoder 122 encodes the basic video using a codec compatible with an existing broadcasting system such as H.264 or MPEG-2.
  • the enhancement layer encoder 124 encodes the view point 1 to output the first enhancement layer bitstream, and the second enhancement layer encodes the view point 2 to output the second enhancement layer bitstream. And an enhancement layer encoder 124b.
  • the enhancement layer encoder 124 sets view information in the bitstream of each encoded enhancement layer.
  • the viewpoint information set as described above may select an enhancement layer bitstream of the viewpoint information when a video request message is received from the communication terminal.
  • the enhancement layer encoder 124 Since the enhancement layer encoder 124 performs encoding by using the encoding information of the base layer encoded by the base layer encoder 122, the enhancement layer encoder 124 removes redundancy of the video of the base layer from the video of another viewpoint.
  • the enhancement layer encoder 124 compresses each view in a two-layer structure of MPEG-2 MVP in the case of MPEG-2, and compresses each view in a two-layer structure of SVC in the case of H.264. .
  • FIG. 3 is a block diagram schematically showing the configuration of a streaming server according to the present invention.
  • the streaming server 170 requests a video when a video request message including viewpoint information is received from a communication terminal 172 and a communication terminal 172 for communication with a communication terminal through a communication network.
  • a switching unit 176 for selecting a enhancement layer bitstream corresponding to the viewpoint determined by the viewpoint determination unit 174 by analyzing the message and determining the corresponding viewpoint. .
  • the streaming server 170 may further include a storage unit 178 that receives and stores the plurality of enhancement layer bitstreams from the multiview video encoding apparatus.
  • the switching unit 176 extracts an enhancement layer bitstream corresponding to the viewpoint determined by the viewpoint determination unit 174 from the storage unit 178 or the multiview video encoding apparatus 120 and provides the extracted communication layer 200 to the communication terminal 200. Play a role.
  • FIG. 4 is a block diagram schematically illustrating a configuration of a communication terminal for outputting a multi-view video using a broadcasting network and a communication network according to the present invention.
  • a communication terminal 200 for outputting a multiview video using a broadcasting network and a communication network includes a communication unit 202, an input unit 204, a storage unit 206, an output unit 208, and a decoding unit 210. ), The service request unit 214.
  • the decoding unit 210 decodes the encoded bitstream.
  • the decoding unit 210 bases the bitstream of the enhancement layer. Decode using the decoding information of the layer.
  • the decoding unit 210 may decode the base layer decoding unit 211 for decoding the bitstream of the base layer received through the broadcasting network, and decode the bitstream of the enhancement layer received through the communication network in the base layer decoding unit 211.
  • an enhancement layer decoding unit 212 that decodes the received information.
  • the service requesting unit 214 When a video of another point in time is requested through the input unit 204, the service requesting unit 214 generates a video request message including corresponding point information and transmits it to the multi-view video transmission system through the communication unit 202.
  • the video requested by the service requester 214 is received through the communication unit 202, decoded by the decoding unit 210, and output to the output unit 208.
  • FIG. 5 is a flowchart illustrating a method of encoding a multiview video by a multiview video encoding apparatus according to the present invention.
  • the multiview video encoding apparatus 120 when the multiview video encoding apparatus 120 receives a multiview video photographed by a plurality of photographing means (S500), the multiview video encoding apparatus 120 encodes the base video into a base layer bitstream from the received multiview video ( S502).
  • the multi-view video encoding apparatus 120 encodes the video of the other view into the bitstream of the enhancement layer by using encoding information of the base layer bitstream (S504).
  • the base layer bitstream is provided to the communication terminal through the broadcast network, and the enhancement layer bitstream is provided to the communication terminal through the communication network.
  • FIG. 6 is a flowchart illustrating a method of providing a multi-view video to a communication terminal according to the present invention.
  • the broadcast server 150 provides a bitstream of a base layer encoded by the multiview video encoding apparatus 120 to the communication terminal 200 through a broadcast network (S600).
  • the multi-view video encoding apparatus 120 provides a bitstream encoded with a base layer to the broadcast server 150 and provides a bisstream encoded with an enhancement layer to the streaming server 170.
  • the communication terminal 200 decodes and outputs a base layer bitstream from the broadcast server 150 (S602).
  • the communication terminal 200 When outputting a video at another point in time through the communication network while outputting a video through the broadcasting network (S604), the communication terminal 200 transmits a video request message including the corresponding view information to the streaming server 170 (S606). .
  • the communication terminal 200 receives a video request message including view information selected by the user. It generates and transmits to the streaming server 170.
  • the streaming server 170 When the streaming server 170 receives the video request message from the communication terminal 200 (S608), the streaming server 170 selects an enhancement layer bitstream at the corresponding point in time according to the video request and provides it to the communication terminal 200 through the communication network (S610). .
  • the streaming server 170 analyzes the video request message, extracts view information, and selects an enhancement layer bitstream corresponding to the extracted view information through a communication network. Provided to the communication terminal 200.
  • the communication terminal 200 decodes and outputs the enhancement layer bitstream transmitted from the streaming server 170 using the decoding information of the base layer (S612).
  • FIG. 7 is a flowchart illustrating a method for providing an enhancement layer bitstream to a communication terminal by a streaming server according to the present invention.
  • the streaming server 170 receives and stores a plurality of enhancement layer bisstreams from the multiview video encoding apparatus 120 (S700).
  • the streaming server 170 receives the video request message including the viewpoint information from the communication terminal 200 (S702), the streaming server 170 analyzes the video request message to determine the corresponding viewpoint (S704).
  • the streaming server 170 selects the enhancement layer bitstream corresponding to the determined time point from the stored enhancement layer bitstream as described above and transmits it to the communication terminal 200 (S706).
  • the system and method for transmitting and receiving a multi-view video transmits additional video information through a communication network so that a user can watch a video at a desired time point while watching a video through a broadcasting network. It can be used for the technique.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The present invention relates to a transmitting/receiving system and a transmitting/receiving method for multi-view video. The transmitting/receiving system for multi-view video according to the present invention includes a communication terminal and a multi-view video transmission system. The communication terminal transmits a motion picture request message containing relevant time information to the multi-view video transmission system and receives an improved layer bitstream of the relevant time, when the output of a motion picture for another time is requested through a communication network while the motion picture of the basic layer received through a broadcast network is being decoded and output. The multi-view video transmission system encodes the basic motion picture among the motion pictures taken by a plurality of photographing means to a basic layer, provides the basic motion picture to the communication terminal through the broadcast network, encodes the motion picture of another time to the improved layer bitstream using encoding information of the basic layer, and selects the improved layer bitstream of the relevant time and provides the selected bitstream to the communication terminal through the communication network, when the motion picture request message is received from the communication terminal. Thereby, the present invention transmits additional motion picture information through a communication network to enable a user to view the motion picture of the time desired by the user, in the event that the user wants to view the motion picture of another time while viewing a motion picture via a broadcast network.

Description

λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μ†‘μˆ˜μ‹  μ‹œμŠ€ν…œ 및 방법Transmitting and Receiving System and Method of Multiview Video

λ³Έ 발λͺ…은 λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μ†‘μˆ˜μ‹  μ‹œμŠ€ν…œ 및 방법에 κ΄€ν•œ κ²ƒμœΌλ‘œμ„œ, λ”μš± μƒμ„Έν•˜κ²ŒλŠ” 볡수의 μ΄¬μ˜μˆ˜λ‹¨μ— μ˜ν•΄ 촬영된 λ™μ˜μƒ μ€‘μ—μ„œ κΈ°λ³Έ λ™μ˜μƒμ„ κΈ°λ³Έ λ ˆμ΄μ–΄λ‘œ λΆ€ν˜Έν™”ν•˜μ—¬ 방솑망을 톡해 톡신 λ‹¨λ§κΈ°λ‘œ μ œκ³΅ν•˜λ©°, λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ— λŒ€ν•΄ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΆ€ν˜Έν™” 정보λ₯Ό μ΄μš©ν•œ ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦ΌμœΌλ‘œ λΆ€ν˜Έν™”ν•˜μ—¬ 톡신망을 톡해 톡신 단말기에 μ œκ³΅ν•˜λŠ”, λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μ†‘μˆ˜μ‹  μ‹œμŠ€ν…œ 및 방법에 κ΄€ν•œ 것이닀.The present invention relates to a system and method for transmitting and receiving a multiview video, and more particularly, a basic video is encoded into a base layer from a video photographed by a plurality of photographing means, and provided to a communication terminal through a broadcasting network. The present invention relates to a multi-view video transmission / reception system and method for encoding an enhancement layer bitstream using encoding information of the base layer and providing the same to a communication terminal through a communication network.

데이터λ₯Ό μ••μΆ•ν•˜λŠ” 기본적인 μ›λ¦¬λŠ” λ°μ΄ν„°μ˜ 곡간적 쀑볡 및 μ‹œκ°„μ  쀑볡 μš”μ†Œλ₯Ό μ œκ±°ν•˜λŠ” 것이닀. 곡간적 쀑볡은 λΉ„λ””μ˜€μ—μ„œ λ™μΌν•œ μƒ‰μ΄λ‚˜ 객체가 λ°˜λ³΅λ˜λŠ” 것을 λ§ν•œλ‹€. 그리고 μ‹œκ°„μ  쀑볡은 λ™μ˜μƒ ν”½μ²˜μ—μ„œ 인접 ν”½μ²˜κ°€ 거의 λ³€ν™”μ—†λŠ” 것 λ˜λŠ” μ˜€λ””μ˜€μ—μ„œ 같은 음이 계속 λ°˜λ³΅λ˜λŠ” 것을 λ§ν•œλ‹€. 일반적인 λΉ„λ””μ˜€ μ½”λ”© λ°©λ²•μ—μ„œ μ‹œκ°„μ  쀑볡은 μ›€μ§μž„ 보상에 κ·Όκ±°ν•œ μ‹œκ°„μ  필터링에 μ˜ν•΄ 제거되고 곡간적 쀑볡은 곡간적 λ³€ν™˜μ— μ˜ν•΄ μ œκ±°λœλ‹€. The basic principle of compressing data is to eliminate spatial and temporal redundancy of the data. Spatial redundancy is the repetition of the same color or object in a video. And temporal duplication means that the adjacent picture is almost unchanged in the moving picture or the same sound is repeatedly repeated in the audio. In the general video coding method, temporal redundancy is eliminated by temporal filtering based on motion compensation and spatial redundancy is removed by spatial transform.

λ°μ΄ν„°μ˜ 쀑볡이 제거된 ν›„ μƒμ„±λ˜λŠ” λ©€ν‹°λ―Έλ””μ–΄ 데이터λ₯Ό μ „μ†‘ν•˜κΈ° μœ„ν•΄μ„œλŠ” 전솑맀체가 ν•„μš”ν•œλ°, κ·Έ μ„±λŠ₯은 전솑 λ§€μ²΄λ³„λ‘œ 차이가 μžˆλ‹€. λ‹€μ–‘ν•œ μ†λ„μ˜ 전솑맀체λ₯Ό μ§€μ›ν•˜κΈ° μœ„ν•˜μ—¬ λ˜λŠ” μ „μ†‘ν™˜κ²½μ— 따라 이에 μ ν•©ν•œ 전솑λ₯ λ‘œ λ©€ν‹°λ―Έλ””μ–΄ 데이터λ₯Ό μ „μ†‘ν•˜κΈ° μœ„ν•΄ μŠ€μΌ€μΌλŸ¬λΈ” λΉ„λ””μ˜€ μ½”λ”©(Scalable Video Coding) 기술이 μžˆλ‹€.A transmission medium is required to transmit multimedia data generated after data duplication is removed. The performance varies depending on the transmission medium. In order to support various speed transmission media or to transmit multimedia data at a data rate suitable for a transmission environment, scalable video coding technology is available.

μŠ€μΌ€μΌλŸ¬λΈ” λΉ„λ””μ˜€ μ½”λ”© κΈ°μˆ μ΄λž€ 이미 μ••μΆ•λœ λΉ„νŠΈμŠ€νŠΈλ¦Όμ— λŒ€ν•˜μ—¬ 전솑 λΉ„νŠΈμœ¨, 전솑 μ—λŸ¬μœ¨, μ‹œμŠ€ν…œ μžμ› λ“±μ˜ μ£Όλ³€ 쑰건에 따라 λΉ„νŠΈ 슀트림의 일뢀λ₯Ό μž˜λΌλ‚΄μ–΄ λΉ„λ””μ˜€μ˜ 해상도, ν”„λ ˆμž„μœ¨ 및 SNR(Signal to Noise Ratio) 등을 μ‘°μ ˆν•  수 있게 ν•΄μ£ΌλŠ” μ½”λ”© κΈ°μˆ μ„ μ˜λ―Έν•œλ‹€.Scalable video coding technology cuts a portion of a bit stream according to surrounding conditions such as transmission bit rate, transmission error rate, and system resources, and adjusts the resolution, frame rate, and signal to noise ratio (SNR) of the already compressed bitstream. Means a coding technology that allows

일반적인 μŠ€μΌ€μΌλŸ¬λΈ” λΉ„λ””μ˜€ 코딩은 μ‹œκ°„μ μœΌλ‘œ ν”„λ ˆμž„μ„ hierarchical structure둜 μ›€μ§μž„ 예츑 μ˜μƒμ„ κ΅¬μ„±ν•˜μ—¬ μ‹œκ°„μ  μŠ€μΌ€μΌλŸ¬λΉŒλŸ¬ν‹°λ₯Ό κ΅¬ν˜„ν•˜κ³ , 각 해상도에 따라 λ ˆμ΄μ–΄λ₯Ό λ³„λ„λ‘œ 두어 λΆ€ν˜Έν™”ν•˜λŠ” λ ˆμ΄μ–΄λ“œ 코딩을 톡해 κ³΅κ°„μ μœΌλ‘œ μŠ€μΌ€μΌλŸ¬λΈ”ν•˜κ²Œ λΆ€ν˜Έν™”ν•œλ‹€. ν™”μ§ˆμ˜ 경우, λ ˆμ΄μ–΄λ“œ 코딩을 ν†΅ν•œ λ°©λ²•μ΄λ‚˜, Fine grain scalabilityλ“±μ˜ 방법을 μ΄μš©ν•˜μ—¬ ν™”μ§ˆ μŠ€μΌ€μΌλŸ¬λΉŒλŸ¬ν‹°λ₯Ό κ΅¬ν˜„ν•œλ‹€.In general scalable video coding, a temporal scalability is realized by constructing a motion prediction image with a hierarchical structure of a frame temporally, and spatially scalable coding is performed through layered coding in which layers are separately encoded according to each resolution. In the case of image quality, image quality scalability is implemented by using layered coding or fine grain scalability.

μ΅œκ·Όμ—λŠ” 기술적 λ°œμ „κ³Ό μ‚¬μš©μžμ˜ μš”κ΅¬μ— μ˜ν•΄ λ‹€μ–‘ν•œ μ‹œμ μ—μ„œμ˜ 광경에 λŒ€ν•œ λΉ„λ””μ˜€ 정보λ₯Ό μ‹œμ²­μžμ—κ²Œ μ œκ³΅ν•˜κ³ , ν•΄λ‹Ή λΉ„λ””μ˜€ 정보 μ€‘μ—μ„œ 보고 싢은 광경에 λŒ€ν•œ μ‘°μž‘μ„ 톡해 각각의 μ‹œμ²­μžλ§ˆλ‹€ λ°©μ†‘κ΅­μ—μ„œ λ³΄λ‚΄μ§€λŠ” 정보λ₯Ό νŽΈμ§‘ν•˜μ—¬ μ‹œμ²­ν•  수 μžˆλŠ” μ„œλΉ„μŠ€κ°€ 개발되고 μžˆλ‹€. 이와 같은 μ„œλΉ„μŠ€λ₯Ό μ œκ³΅ν•˜κΈ° μœ„ν•΄μ„œλŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ— λŒ€ν•œ λΉ„λ””μ˜€ μ••μΆ• 기술과 λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ 전솑 기술 및 μ‹œμŠ€ν…œμ΄ ν•„μš”ν•˜λ‹€.Recently, due to technological developments and user demands, video information about a scene at various points of time is provided to viewers, and the information sent from a broadcasting station for each viewer can be edited and watched by manipulating the scene that the viewer wants to see. Services are being developed. In order to provide such a service, a video compression technique for multiview video and a transmission technique and system for multiview video are required.

λ‹€μ‹œμ  λΉ„λ””μ˜€ 압좕은 볡수 μΉ΄λ©”λΌλ‘œλΆ€ν„° μž…λ ₯된 λΉ„λ””μ˜€λ₯Ό λ™μ‹œμ— μ½”λ”©ν•˜μ—¬ μ••μΆ• μ €μž₯ 및 μ „μ†‘ν•˜κΈ° μœ„ν•œ 것이닀. λ‹€μ‹œμ  λΉ„λ””μ˜€λ₯Ό μ••μΆ•ν•˜μ§€ μ•Šκ³  μ €μž₯ 및 전솑할 경우 λ°μ΄ν„°μ˜ 크기가 맀우 크기 λ•Œλ¬Έμ— 데이터λ₯Ό μ‹€μ‹œκ°„μœΌλ‘œ 방솑망 λ˜λŠ” 유/무선 인터넷을 ν†΅ν•˜μ—¬ μ‚¬μš©μžμ—κ²Œ μ „λ‹¬ν•˜κΈ° μœ„ν•΄μ„œλŠ” 큰 크기의 전솑 λŒ€μ—­ν­μ΄ μš”κ΅¬λœλ‹€. Multi-view video compression is for simultaneously compressing, storing, and transmitting video input from multiple cameras. When storing and transmitting a multiview video without compressing the data, the size of the data is very large. Therefore, a large transmission bandwidth is required to transmit data to a user through a broadcasting network or a wired / wireless internet in real time.

톡상 λ‹€μ‹œμ  λΉ„λ””μ˜€ μ½”λ”© 및 디코딩은 각각의 λΉ„λ””μ˜€ μ‹œν€€μŠ€λ₯Ό λ…λ¦½μ μœΌλ‘œ μ½”λ”©ν•˜μ—¬ μ „μ†‘ν•œ ν›„ λ””μ½”λ”©ν•˜λŠ” λ°©λ²•μœΌλ‘œ μ΄λ£¨μ–΄μ§ˆ 수 μžˆλ‹€. μ΄λŠ” 기쑴의 MPEG-1/2/4λ‚˜ H.261/263/264λ₯Ό ν†΅ν•˜μ—¬ μ‰½κ²Œ κ΅¬ν˜„λ  수 μžˆλ‹€. In general, multi-view video coding and decoding may be performed by independently coding and transmitting each video sequence and then decoding the video sequence. This can be easily implemented through existing MPEG-1 / 2/4 or H.261 / 263/264.

κ·ΈλŸ¬λ‚˜ μ „μˆ ν•œ 바와 같은 μ’…λž˜μ˜ κΈ°μˆ μ—λŠ” λ™μΌν•œ 물체λ₯Ό 볡수의 μΉ΄λ©”λΌλ‘œ μ΄¬μ˜ν•˜κΈ° λ•Œλ¬Έμ— μƒκΈ°λŠ” λΉ„λ””μ˜€ κ°„μ˜ 쀑볡성을 μ œκ±°ν•  수 μ—†λŠ” 단점이 μžˆλ‹€. λ˜ν•œ, 방솑망을 톡해 λ™μ˜μƒμ„ μ‹œμ²­ν•˜κ³  μžˆλŠ” μ€‘μ—λŠ” λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ„ μ‹œμ²­ν•  수 μ—†λŠ” 단점이 μžˆμ—ˆλ‹€.However, the conventional technology as described above has a disadvantage in that it is not possible to remove redundancy between videos caused by photographing the same object with a plurality of cameras. In addition, while watching a video through the broadcast network there was a disadvantage that you can not watch the video at different views.

또 λ‹€λ₯Έ λ°©λ²•μœΌλ‘œ Scalable Video Coding λ°©λ²•μ˜ λ ˆμ΄μ–΄λ“œ μ½”λ”© 방법을 μ΄μš©ν•˜μ—¬ λΆ€ν˜Έν™”ν•˜λŠ” 방법이 μžˆλ‹€. 이 경우, μ‹œμ κ°„μ˜ 쀑볡성을 λ ˆμ΄μ–΄λ“œ μ½”λ”©λ°©λ²•μœΌλ‘œ μ œκ±°ν•  수 μžˆμœΌλ―€λ‘œ 높은 μ••μΆ•λ₯ μ„ 얻을 수 μžˆμ§€λ§Œ, μΈμ ‘ν•œ μ‹œμ κ°„μ˜ 쀑볡성을 μ œκ±°ν•˜λŠ” νŠΉμ„±μœΌλ‘œ 인해, μ‹œμ μ „ν™˜μ΄ 쉽지 μ•Šμ€ λ¬Έμ œκ°€ λ°œμƒν•œλ‹€. As another method, there is a method of encoding using a layered coding method of the scalable video coding method. In this case, since the redundancy between viewpoints can be removed by a layered coding method, a high compression ratio can be obtained. However, due to the property of removing redundancy between adjacent viewpoints, a problem arises in that the viewpoint switching is not easy.

λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ 전솑 μ‹œμŠ€ν…œμ˜ 경우, independentν•˜κ²Œ λ³΅ν˜Έν™”ν•  수 μžˆλŠ” λΉ„νŠΈμŠ€νŠΈλ¦Όκ³Ό dependentν•˜κ²Œ λ³΅ν˜Έν™”λ˜λŠ” λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ multiplexing ν•˜μ—¬ λ°©μ†‘λ§μ΄λ‚˜ λ„€νŠΈμ›Œν¬λ§μ„ 톡해 μ „μ†‘ν•˜λŠ” 방법이 일반적으둜 μ•Œλ €μ Έ μžˆλ‹€. κ·ΈλŸ¬λ‚˜ λ‹€μ‹œμ  λΉ„λ””μ˜€λ₯Ό 방솑망을 톡해 μ „μ†‘ν•˜κΈ° μœ„ν•΄μ„œλŠ” μ‹œμ λ§ŒνΌμ˜ 채널이 μΆ”κ°€μ μœΌλ‘œ ν•„μš”ν•œ λ¬Έμ œκ°€ λ°œμƒν•œλ‹€.In the multi-view video transmission system, a method of multiplexing a bitstream that can be independently decoded and a bitstream that is independently decoded and transmitted through a broadcasting network or a network is generally known. However, in order to transmit a multiview video through a broadcasting network, a problem arises in that an additional channel as much as a viewpoint is required.

μ „μˆ ν•œ μ’…λž˜ 기술의 λ¬Έμ œμ μ„ ν•΄κ²°ν•˜κΈ° μœ„ν•œ λ³Έ 발λͺ…μ˜ λͺ©μ μ€ 방솑망을 톡해 λ™μ˜μƒμ„ μ‹œμ²­ν•˜λŠ” 쀑에 μ‚¬μš©μžκ°€ λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒ μ‹œμ²­μ„ μ›ν•˜λŠ” 경우 톡신망을 톡해 μΆ”κ°€ λ™μ˜μƒ 정보λ₯Ό μ „μ†‘ν•˜μ—¬ μ‚¬μš©μžκ°€ μ›ν•˜λŠ” μ‹œμ μ˜ λ™μ˜μƒμ„ μ‹œμ²­ν•  수 μžˆλŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μ†‘μˆ˜μ‹  μ‹œμŠ€ν…œ 및 방법을 μ œκ³΅ν•˜λŠ”λ° μžˆλ‹€. An object of the present invention for solving the above-mentioned problems of the prior art is to transmit the additional video information through the communication network to watch the video at the desired time when the user wants to watch the video at a different time while watching the video through the broadcasting network. To provide a multi-view video transmission and reception system and method.

λ˜ν•œ, λ³Έ 발λͺ…μ˜ λ‹€λ₯Έ λͺ©μ μ€ κΈ°λ³Έ λ™μ˜μƒμ„ κΈ°λ³Έ λ ˆμ΄μ–΄(base layer)둜 μ••μΆ•ν•˜κ³ , λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ„ ν–₯상 λ ˆμ΄μ–΄(enhancement layer)둜 μ••μΆ•ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μ†‘μˆ˜μ‹  μ‹œμŠ€ν…œ 및 방법을 μ œκ³΅ν•˜λŠ”λ° μžˆλ‹€. Another object of the present invention is to provide a multi-view video transmission / reception system and method for compressing a base video into a base layer and compressing a video of another view into an enhancement layer.

λ˜ν•œ, λ³Έ 발λͺ…μ˜ λ˜λ‹€λ₯Έ λͺ©μ μ€ 방솑망을 ν†΅ν•΄μ„œ κΈ°λ³Έ λ ˆμ΄μ–΄λ‘œ λΆ€ν˜Έν™”λœ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ œκ³΅ν•˜κ³ , 톡신망을 ν†΅ν•΄μ„œ ν–₯상 λ ˆμ΄μ–΄λ‘œ λΆ€ν˜Έν™”λœ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ œκ³΅ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μ†‘μˆ˜μ‹  μ‹œμŠ€ν…œ 및 방법을 μ œκ³΅ν•˜λŠ”λ° μžˆλ‹€. Another object of the present invention is to provide a multi-view video transmission / reception system and method for providing a bitstream encoded in a base layer through a broadcasting network and providing a bitstream encoded in an enhancement layer through a communication network.

그리고, λ³Έ 발λͺ…μ˜ λ˜λ‹€λ₯Έ λͺ©μ μ€ κΈ°λ³Έ λ™μ˜μƒμ„ μ œμ™Έν•œ λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ— λŒ€ν•΄ ν–₯상 λ ˆμ΄μ–΄λ‘œμ„œ 쀑볡성을 μ œκ±°ν•˜μ—¬ λΆ€ν˜Έν™”ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μ†‘μˆ˜μ‹  μ‹œμŠ€ν…œ 및 방법을 μ œκ³΅ν•˜λŠ”λ° μžˆλ‹€.Another object of the present invention is to provide a multi-view video transmission / reception system and method for removing and encoding redundancy as an enhancement layer for a video at another point in time except for a basic video.

μ „μˆ ν•œ λͺ©μ λ“€μ„ λ‹¬μ„±ν•˜κΈ° μœ„ν•œ λ³Έ 발λͺ…μ˜ 일 츑면에 λ”°λ₯΄λ©΄, 방솑망을 톡해 μˆ˜μ‹ λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λ™μ˜μƒμ„ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•˜λŠ” 쀑에 톡신망을 ν†΅ν•œ λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒ 좜λ ₯이 μš”μ²­λ˜λŠ” 경우 ν•΄λ‹Ή μ‹œμ  정보λ₯Ό ν¬ν•¨ν•˜λŠ” λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œμœΌλ‘œ μ „μ†‘ν•˜μ—¬ ν•΄λ‹Ή μ‹œμ μ˜ ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μˆ˜μ‹ ν•˜κ³ , κ·Έ ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λ””μ½”λ”© 정보λ₯Ό μ΄μš©ν•˜μ—¬ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•˜λŠ” 톡신 단말기; 볡수의 μ΄¬μ˜μˆ˜λ‹¨μ— μ˜ν•΄ 촬영된 λ™μ˜μƒμ€‘μ—μ„œ κΈ°λ³Έ λ™μ˜μƒμ„ κΈ°λ³Έ λ ˆμ΄μ–΄λ‘œ λΆ€ν˜Έν™”ν•˜μ—¬ 방솑망을 톡해 상기 톡신 λ‹¨λ§κΈ°λ‘œ μ œκ³΅ν•˜κ³ , λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ— λŒ€ν•΄ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΆ€ν˜Έν™” 정보λ₯Ό μ΄μš©ν•œ ν–₯μƒλ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦ΌμœΌλ‘œ λΆ€ν˜Έν™”ν•˜μ—¬ 톡신망을 톡해 상기 톡신 λ‹¨λ§κΈ°λ‘œ μ œκ³΅ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œμ„ ν¬ν•¨ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μ†‘μˆ˜μ‹  μ‹œμŠ€ν…œμ΄ μ œκ³΅λœλ‹€. According to an aspect of the present invention for achieving the above object, when decoding the video of the base layer received through the broadcast network and outputs a video of a different point in time through the communication network is requested video request including the corresponding point information A communication terminal which transmits a message to a multi-view video transmission system, receives an enhancement layer bitstream at a corresponding time point, and decodes the enhancement layer bitstream using decoding information of the base layer; A base video is encoded into a base layer among videos taken by a plurality of photographing means and provided to the communication terminal through a broadcasting network, and a video is encoded into an enhancement layer bitstream using encoding information of the base layer for a video of another viewpoint. A multi-view video transmission / reception system including a multi-view video transmission system provided to the communication terminal is provided.

상기 톡신 λ‹¨λ§κΈ°λŠ” 톡신망을 톡해 λ™μ˜μƒμ΄ μˆ˜μ‹ λœ 경우 방솑망을 ν†΅ν•œ λ™μ˜μƒμ„ 톡신망을 ν†΅ν•œ λ™μ˜μƒμœΌλ‘œ μ „ν™˜ν•œλ‹€. The communication terminal converts the video through the broadcasting network into the video through the communication network when the video is received through the communication network.

λ˜ν•œ, 상기 톡신 λ‹¨λ§κΈ°λŠ” 방솑망을 ν†΅ν•œ λ™μ˜μƒκ³Ό 톡신망을 ν†΅ν•œ λ™μ˜μƒμ„ PIP λ˜λŠ” POP둜 좜λ ₯ν•œλ‹€. In addition, the communication terminal outputs a video through a broadcast network and a video through a communication network as a PIP or POP.

λ³Έ 발λͺ…μ˜ λ‹€λ₯Έ 츑면에 λ”°λ₯΄λ©΄, λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œμ— μžˆμ–΄μ„œ, 볡수의 μ΄¬μ˜μˆ˜λ‹¨μ— μ˜ν•΄ 촬영된 λ‹€μ‹œμ  λ™μ˜μƒ μ€‘μ—μ„œ κΈ°λ³Έ λ™μ˜μƒμ„ κΈ°λ³Έ λ ˆμ΄μ–΄λ‘œ λΆ€ν˜Έν™”ν•˜κ³ , λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ— λŒ€ν•΄ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΆ€ν˜Έν™” 정보λ₯Ό μ΄μš©ν•˜μ—¬ ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦ΌμœΌλ‘œ λΆ€ν˜Έν™”ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치; 상기 λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜μ—μ„œ λΆ€ν˜Έν™”λœ κΈ°λ³Έ λ™μ˜μƒμ„ 방솑망을 톡해 톡신 단말기에 μ œκ³΅ν•˜λŠ” 방솑 μ„œλ²„; 톡신망을 톡해 톡신 λ‹¨λ§κΈ°λ‘œλΆ€ν„° λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€κ°€ μˆ˜μ‹ λ˜λŠ” 경우, 상기 λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜μ—μ„œ λΆ€ν˜Έν™”λœ ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ€‘μ—μ„œ ν•΄λ‹Ή μ‹œμ μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ„ νƒν•˜μ—¬ 상기 톡신 λ‹¨λ§κΈ°λ‘œ μ œκ³΅ν•˜λŠ” 슀트리밍 μ„œλ²„λ₯Ό ν¬ν•¨ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œμ΄ μ œκ³΅λœλ‹€. According to another aspect of the present invention, in a multiview video transmission system, a base video is encoded into a base layer among multiview videos captured by a plurality of photographing means, and encoding information of the base layer is encoded for a video of another viewpoint. A multiview video encoding apparatus encoding the enhancement layer bitstream using an encoding layer; A broadcast server providing a basic video coded by the multi-view video encoding apparatus to a communication terminal through a broadcast network; When a video request message is received from a communication terminal through a communication network, a multiview including a streaming server that selects a bitstream of a corresponding view from among bitstreams of an enhancement layer encoded by the multiview video encoding apparatus and provides the bitstream at a corresponding time to the communication terminal. A video transmission system is provided.

상기 λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λŠ” 톡신 단말기 μ‚¬μš©μžμ— μ˜ν•΄ μ„ νƒλœ μ‹œμ  정보λ₯Ό ν¬ν•¨ν•œλ‹€. The video request message includes viewpoint information selected by the communication terminal user.

상기 λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜μ—μ„œ λΆ€ν˜Έν™”λœ 각 ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ€ μ‹œμ  정보λ₯Ό ν¬ν•¨ν•œλ‹€. The bitstream of each enhancement layer encoded by the multiview video encoding apparatus includes view information.

상기 λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜λŠ” MPEG-2의 경우 MPEG-2 MVP의 2λ ˆμ΄μ–΄ ꡬ쑰둜 각 μ‹œμ μ— λŒ€ν•œ 압좕을 μˆ˜ν–‰ν•˜κ³ , H.264의 경우 SVC의 2 λ ˆμ΄μ–΄ ꡬ쑰둜 각 μ‹œμ μ— λŒ€ν•œ 압좕을 μˆ˜ν–‰ν•œλ‹€. The multi-view video encoding apparatus compresses each view in a two-layer structure of MPEG-2 MVP in the case of MPEG-2, and compresses each view in a two-layer structure of SVC in the case of H.264.

λ³Έ 발λͺ…μ˜ 또 λ‹€λ₯Έ 츑면에 λ”°λ₯΄λ©΄, λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜μ— μžˆμ–΄μ„œ, 볡수의 μ΄¬μ˜μˆ˜λ‹¨μ— μ˜ν•΄ 촬영된 λ‹€μ‹œμ  λ™μ˜μƒ μ€‘μ—μ„œ κΈ°λ³Έ λ™μ˜μƒμ„ κΈ°λ³Έ λ ˆμ΄μ–΄λ‘œ λΆ€ν˜Έν™”ν•˜λŠ” κΈ°λ³Έ λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€; λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ— λŒ€ν•΄ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€μ—μ„œ λΆ€ν˜Έν™”λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΆ€ν˜Έν™” 정보λ₯Ό μ΄μš©ν•˜μ—¬ ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦ΌμœΌλ‘œ λΆ€ν˜Έν™”ν•˜λŠ” 볡수의 ν–₯상 λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€λ₯Ό ν¬ν•¨ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜κ°€ μ œκ³΅λœλ‹€. According to still another aspect of the present invention, there is provided a multiview video encoding apparatus, comprising: a base layer encoder configured to encode a base video into a base layer from a multiview video photographed by a plurality of photographing means; A multi-view video encoding apparatus including a plurality of enhancement layer encoders encoding a bitstream of an enhancement layer by using encoding information of a base layer encoded by the base layer encoder for a video of another viewpoint is provided.

λ˜ν•œ, 상기 ν–₯상 λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€λŠ” λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ— λŒ€ν•΄ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λ™μ˜μƒμ— λŒ€ν•΄ 쀑볡성을 μ œκ±°ν•˜μ—¬ λΆ€ν˜Έν™”ν•œλ‹€. In addition, the enhancement layer encoder encodes the video of the base layer by removing redundancy of the video of the base layer.

λ˜ν•œ, 상기 ν–₯상 λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€λŠ” λΆ€ν˜Έν™”λœ 각 ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ— μ‹œμ  정보λ₯Ό μ„€μ •ν•œλ‹€. The enhancement layer encoder sets view information in the bitstream of each encoded enhancement layer.

λ³Έ 발λͺ…μ˜ 또 λ‹€λ₯Έ 츑면에 λ”°λ₯΄λ©΄, 톡신망을 톡해 톡신 λ‹¨λ§κΈ°μ™€μ˜ 톡신을 μœ„ν•œ 톡신뢀, 상기 톡신뢀λ₯Ό 톡해 톡신 λ‹¨λ§κΈ°λ‘œλΆ€ν„° μ‹œμ  정보가 ν¬ν•¨λœ λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€κ°€ μˆ˜μ‹ λ˜λŠ” 경우, 상기 λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό λΆ„μ„ν•˜μ—¬ ν•΄λ‹Ή μ‹œμ μ„ νŒλ‹¨ν•˜λŠ” μ‹œμ  νŒλ‹¨λΆ€, 상기 μ‹œμ  νŒλ‹¨λΆ€μ—μ„œ νŒλ‹¨λœ μ‹œμ μ— ν•΄λ‹Ήν•˜λŠ” ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ„ νƒν•˜μ—¬ 상기 톡신 단말기에 μ œκ³΅λ˜λ„λ‘ ν•˜λŠ” μŠ€μœ„μΉ­λΆ€λ₯Ό ν¬ν•¨ν•˜λŠ” 슀트리밍 μ„œλ²„κ°€ μ œκ³΅λœλ‹€.According to another aspect of the present invention, when a video request message including viewpoint information is received from a communication unit for communication with a communication terminal through a communication network, and the communication terminal through the communication unit, the video request message is analyzed and the corresponding time point is analyzed. There is provided a streaming server including a time determination unit for determining a switching unit for selecting the enhancement layer bitstream corresponding to the time determined by the time determination unit to be provided to the communication terminal.

상기 슀트리밍 μ„œλ²„λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜λ‘œλΆ€ν„° 볡수의 ν–₯μƒΉ λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μˆ˜μ‹ ν•˜μ—¬ μ €μž₯ν•˜λŠ” μ €μž₯λΆ€λ₯Ό 더 ν¬ν•¨ν•œλ‹€. The streaming server further includes a storage unit for receiving and storing a plurality of enhancement layer bitstreams from a multiview video encoding apparatus.

상기 μŠ€μœ„μΉ­λΆ€λŠ” μ €μž₯λΆ€ λ˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜λ‘œλΆ€ν„° ν•΄λ‹Ή ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μΆ”μΆœν•˜μ—¬ 상기 톡신 단말기에 μ œκ³΅ν•œλ‹€. The switching unit extracts a corresponding enhancement layer bitstream from a storage unit or a multi-view video encoding apparatus and provides the same to the communication terminal.

λ³Έ 발λͺ…μ˜ 또 λ‹€λ₯Έ 츑면에 λ”°λ₯΄λ©΄, 톡신뢀, μž…λ ₯λΆ€ 및 좜λ ₯λΆ€κ°€ κ΅¬λΉ„λ˜μ–΄ 방솑망 및 톡신망을 μ΄μš©ν•˜μ—¬ λ‹€μ‹œμ  λ™μ˜μƒμ„ 좜λ ₯ν•˜λŠ” 톡신 단말기에 μžˆμ–΄μ„œ, 상기 방솑망을 톡해 μˆ˜μ‹ λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•˜λŠ” 쀑에 톡신망을 톡해 ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ΄ μˆ˜μ‹ λœ 경우 ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λ””μ½”λ”© 정보λ₯Ό μ΄μš©ν•˜μ—¬ λ””μ½”λ”©ν•˜λŠ” λ””μ½”λ”©λΆ€; 상기 μž…λ ₯λΆ€λ₯Ό 톡해 λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ΄ μš”μ²­λ˜λŠ” 경우, ν•΄λ‹Ή μ‹œμ  정보λ₯Ό ν¬ν•¨ν•˜λŠ” λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό μƒμ„±ν•˜μ—¬ 상기 톡신뢀λ₯Ό 톡해 λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œμœΌλ‘œ μ „μ†‘ν•˜λŠ” μ„œλΉ„μŠ€ μš”μ²­λΆ€λ₯Ό ν¬ν•¨ν•˜λŠ” 톡신 단말기가 μ œκ³΅λœλ‹€. According to still another aspect of the present invention, a communication terminal including a communication unit, an input unit, and an output unit for outputting a multi-view video using a broadcasting network and a communication network, which decodes and outputs a bitstream of a base layer received through the broadcasting network. A decoder which decodes the bitstream of the enhancement layer by using the decoding information of the base layer when the bitstream of the enhancement layer is received through the communication network; When a video of another point in time is requested through the input unit, a communication terminal including a service request unit generating a video request message including corresponding point information and transmitting the same to the multi-view video transmission system through the communication unit is provided.

상기 λ””μ½”λ”©λΆ€λŠ”, 상기 방솑망을 톡해 μˆ˜μ‹ λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ λ””μ½”λ”©ν•˜λŠ” κΈ°λ³Έλ ˆμ΄μ–΄ λ””μ½”λ”©λΆ€, 상기 톡신망을 톡해 μˆ˜μ‹ λœ ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄ λ””μ½”λ”©λΆ€μ—μ„œ λ””μ½”λ”©λœ 정보λ₯Ό μ΄μš©ν•˜μ—¬ λ””μ½”λ”©ν•˜λŠ” ν–₯상 λ ˆμ΄μ–΄ λ””μ½”λ”©λΆ€λ₯Ό ν¬ν•¨ν•œλ‹€. The decoding unit may include: a base layer decoding unit decoding a bitstream of a base layer received through the broadcasting network, and an enhancement of decoding a bitstream of an enhancement layer received through the communication network using information decoded by the base layer decoding unit. It includes a layer decoding unit.

λ³Έ 발λͺ…μ˜ 또 λ‹€λ₯Έ 츑면에 λ”°λ₯΄λ©΄, (a)λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜μ—μ„œ 볡수의 μ΄¬μ˜μˆ˜λ‹¨μ— μ˜ν•΄ 촬영된 λ™μ˜μƒ μ€‘μ—μ„œ κΈ°λ³Έ λ™μ˜μƒμ„ κΈ°λ³Έλ ˆμ΄μ–΄λ‘œ λΆ€ν˜Έν™”ν•˜κ³ , λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ„ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΆ€ν˜Έν™” 정보λ₯Ό μ΄μš©ν•œ ν–₯μƒλ ˆμ΄μ–΄ λΉ„μŠ€μŠ€νŠΈλ¦ΌμœΌλ‘œ λΆ€ν˜Έν™”ν•˜λŠ” 단계, (b)방솑 μ„œλ²„κ°€ 상기 λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜μ—μ„œ λΆ€ν˜Έν™”λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 방솑망을 톡해 톡신 λ‹¨λ§κΈ°λ‘œ μ œκ³΅ν•˜λŠ” 단계, (c)상기 톡신 λ‹¨λ§κΈ°μ—μ„œ, 상기 방솑 μ„œλ²„λ‘œλΆ€ν„°μ˜ κΈ°λ³Έ λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•˜λŠ” 쀑에 톡신망을 ν†΅ν•œ λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒ 좜λ ₯이 μš”μ²­λ˜λŠ” 경우 ν•΄λ‹Ή μ‹œμ  정보λ₯Ό ν¬ν•¨ν•˜λŠ” λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό 슀트리밍 μ„œλ²„λ‘œ μ „μ†‘ν•˜λŠ” 단계, (d)상기 슀트리밍 μ„œλ²„κ°€ 상기 톡신 λ‹¨λ§κΈ°λ‘œλΆ€ν„°μ˜ λ™μ˜μƒ μš”μ²­μ— 따라 ν•΄λ‹Ή μ‹œμ μ˜ ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ„ νƒν•˜μ—¬ 톡신망을 톡해 상기 톡신 단말기에 μ œκ³΅ν•˜λŠ” 단계, (e)상기 톡신 단말기가 상기 슀트리밍 μ„œλ²„λ‘œλΆ€ν„° μ „μ†‘λœ ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λ””μ½”λ”© 정보λ₯Ό μ΄μš©ν•˜μ—¬ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•˜λŠ” 단계λ₯Ό ν¬ν•¨ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μ†‘μˆ˜μ‹  방법이 μ œκ³΅λœλ‹€. According to still another aspect of the present invention, (a) a multi-view video encoding apparatus encodes a basic video as a basic layer among videos captured by a plurality of photographing means, and encodes a video of another viewpoint using encoding information of the base layer. (B) providing a bitstream of a base layer encoded by the multi-view video encoding apparatus to a communication terminal through a broadcasting network, and (c) in the communication terminal, encoding the enhancement layer bisstream. Transmitting a video request message including corresponding view information to a streaming server when a video output of another view through a communication network is requested while decoding and outputting a base layer bitstream from a server, (d) Enhancement layer at that time in response to video requests from communication terminals Selecting and providing a bitstream to the communication terminal through a communication network; (e) decoding and outputting, by the communication terminal, an enhancement layer bitstream transmitted from the streaming server using decoding information of the base layer. A method of transmitting and receiving a multiview video is provided.

상기 (a)λ‹¨κ³„μ—μ„œ, λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜λŠ” κΈ°λ³Έ λ ˆμ΄μ–΄λ‘œ λΆ€ν˜Έν™”λœ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 상기 방솑 μ„œλ²„λ‘œ μ œκ³΅ν•˜κ³ , ν–₯μƒλ ˆμ΄μ–΄λ‘œ λΆ€ν˜Έν™”λœ λΉ„μŠ€μŠ€νŠΈλ¦Όμ„ 상기 슀트리밍 μ„œλ²„λ‘œ μ œκ³΅ν•œλ‹€. In step (a), the multi-view video encoding apparatus provides a bitstream encoded with a base layer to the broadcast server, and provides a bisstream encoded with an enhancement layer to the streaming server.

상기 (a)λ‹¨κ³„μ—μ„œ λΆ€ν˜Έν™”λœ ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ€ μ‹œμ  정보λ₯Ό ν¬ν•¨ν•œλ‹€. The enhancement layer bitstream encoded in step (a) includes view information.

λ³Έ 발λͺ…μ˜ 또 λ‹€λ₯Έ 츑면에 λ”°λ₯΄λ©΄, λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜κ°€ λ‹€μ‹œμ  λΉ„λ””μ˜€λ₯Ό λΆ€ν˜Έν™”ν•˜λŠ” 방법에 μžˆμ–΄μ„œ, 볡수의 μ΄¬μ˜μˆ˜λ‹¨μ— μ˜ν•΄ 촬영된 λ‹€μ‹œμ  λ™μ˜μƒ μ€‘μ—μ„œ κΈ°λ³Έ λ™μ˜μƒμ„ κΈ°λ³Έλ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦ΌμœΌλ‘œ λΆ€ν˜Έν™”ν•˜λŠ” 단계; λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ— λŒ€ν•΄ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ˜ λΆ€ν˜Έν™” 정보λ₯Ό μ΄μš©ν•˜μ—¬ ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„μŠ€νŠΈμŠ€νŠΈλ¦ΌμœΌλ‘œ λΆ€ν˜Έν™”ν•˜λŠ” 단계λ₯Ό ν¬ν•¨ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ λΆ€ν˜Έν™” 방법이 μ œκ³΅λœλ‹€. According to still another aspect of the present invention, there is provided a method of encoding a multiview video by a multiview video encoding apparatus, the method comprising: encoding a basic video from a multiview video photographed by a plurality of photographing means into a base layer bitstream; A multi-view video encoding method is provided, which comprises encoding a video stream of another view by using the encoding information of the base layer bitstream into a bitstream of an enhancement layer.

λ³Έ 발λͺ…μ˜ 또 λ‹€λ₯Έ 츑면에 λ”°λ₯΄λ©΄, λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜λ‘œλΆ€ν„° 볡수의 ν–₯상 λ ˆμ΄μ–΄ λΉ„μŠ€μŠ€νŠΈλ¦Όμ„ μˆ˜μ‹ ν•˜μ—¬ μ €μž₯ν•˜λŠ” 단계; 톡신 λ‹¨λ§κΈ°λ‘œλΆ€ν„° μ‹œμ  정보가 ν¬ν•¨λœ λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€κ°€ μˆ˜μ‹ λ˜λŠ” 경우, 상기 λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό λΆ„μ„ν•˜μ—¬ ν•΄λ‹Ή μ‹œμ μ„ νŒλ‹¨ν•˜λŠ” 단계; 상기 μ €μž₯된 ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ€‘μ—μ„œ 상기 νŒλ‹¨λœ μ‹œμ μ— ν•΄λ‹Ήν•˜λŠ” ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ„ νƒν•˜μ—¬ 상기 톡신 λ‹¨λ§κΈ°λ‘œ μ „μ†‘ν•˜λŠ” 단계λ₯Ό ν¬ν•¨ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ 톡신망을 ν†΅ν•œ 솑신 방법이 μ œκ³΅λœλ‹€. According to another aspect of the present invention, the method comprises: receiving and storing a plurality of enhancement layer bisstreams from a multiview video encoding apparatus; When the video request message including the view information is received from the communication terminal, determining the corresponding view by analyzing the video request message; A method for transmitting a multi-view video through a communication network is provided, the method including selecting an enhancement layer bitstream corresponding to the determined time point from the stored enhancement layer bitstream and transmitting the selected enhancement layer bitstream to the communication terminal.

λ³Έ 발λͺ…μ˜ 또 λ‹€λ₯Έ 츑면에 λ”°λ₯΄λ©΄, (a)방솑망을 톡해 μˆ˜μ‹ λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•˜λŠ” 쀑에 톡신망을 ν†΅ν•œ λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒ 좜λ ₯이 μš”μ²­λ˜λŠ” 경우 ν•΄λ‹Ή μ‹œμ  정보λ₯Ό ν¬ν•¨ν•˜λŠ” λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œμœΌλ‘œ μ „μ†‘ν•˜λŠ” 단계, (b)상기 λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œμœΌλ‘œλΆ€ν„° μš”μ²­ μ‹œμ μ— ν•΄λ‹Ήν•˜λŠ” ν–₯μƒΉ λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ΄ μˆ˜μ‹ λ˜λŠ” 경우, κ·Έ ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ˜ λ””μ½”λ”© 정보λ₯Ό μ΄μš©ν•˜μ—¬ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•˜λŠ” 단계λ₯Ό ν¬ν•¨ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μˆ˜μ‹  방법이 μ œκ³΅λœλ‹€. According to another aspect of the present invention, (a) when a video output of another point in time through the communication network is requested while decoding and outputting the bitstream of the base layer received through the broadcast network, a video request message including the view information is provided. Transmitting to the multiview video transmission system, (b) when the bitstream of the enhancement layer corresponding to the request time point is received from the multiview video transmission system, decoding information of the base layer bitstream from the bitstream of the enhancement layer Provided is a method of receiving a multiview video, comprising: decoding and outputting the signal by using a.

상기 (a)λ‹¨κ³„λŠ”, 방솑망을 톡해 μˆ˜μ‹ λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•˜λŠ” 단계; μ‚¬μš©μžμ— μ˜ν•΄ 톡신망을 ν†΅ν•œ λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒ 좜λ ₯이 μš”μ²­λ˜λŠ” 경우 ν•΄λ‹Ή μ‹œμ μ„ νŒλ‹¨ν•˜λŠ” 단계; 상기 νŒλ‹¨λœ μ‹œμ μ„ ν¬ν•¨ν•˜λŠ” λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό μƒμ„±ν•˜μ—¬ 상기 λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œμ„ μ „μ†‘ν•˜λŠ” 단계λ₯Ό ν¬ν•¨ν•œλ‹€. Step (a) may include: decoding and outputting a bitstream of a base layer received through a broadcasting network; Determining a corresponding point in time when a video output of another point in time is requested by the user; Generating a video request message including the determined time point and transmitting the multi-view video transmission system.

λ³Έ 발λͺ…μ˜ 또 λ‹€λ₯Έ 츑면에 λ”°λ₯΄λ©΄, λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ λΆ€ν˜Έν™” 방법을 ν”„λ‘œκ·Έλž¨μœΌλ‘œμ„œ CDλ‚˜ USB λ©”λͺ¨λ¦¬ λ“± μ»΄ν“¨ν„°λ‘œ 읽을 수 μžˆλŠ” 맀체에 기둝할 수 μžˆλ‹€.According to another aspect of the present invention, a multi-view video encoding method can be recorded as a program on a computer-readable medium such as a CD or a USB memory.

μ „μˆ ν•œ 바와 같이 λ³Έ 발λͺ…에 λ”°λ₯΄λ©΄, 방솑망을 톡해 λ™μ˜μƒμ„ μ‹œμ²­ν•˜λŠ” 쀑에 μ‚¬μš©μžκ°€ λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒ μ‹œμ²­μ„ μ›ν•˜λŠ” 경우 톡신망을 톡해 μΆ”κ°€ λ™μ˜μƒ 정보λ₯Ό μ „μ†‘ν•˜μ—¬ μ‚¬μš©μžκ°€ μ›ν•˜λŠ” μ‹œμ μ˜ λ™μ˜μƒμ„ μ‹œμ²­ν•  수 μžˆλŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μ†‘μˆ˜μ‹  μ‹œμŠ€ν…œ 및 방법을 μ œκ³΅ν•  수 μžˆλ‹€. As described above, according to the present invention, when a user wants to watch a video at a different point in time while watching a video through a broadcasting network, a multi-view video of which a user can watch a video at a desired point of view by transmitting additional video information through a communication network. It is possible to provide a transmission and reception system and a method.

λ˜ν•œ, κΈ°λ³Έ λ™μ˜μƒμ„ μ œμ™Έν•œ λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ— λŒ€ν•΄ ν–₯상 λ ˆμ΄μ–΄λ‘œμ„œ 쀑볡성을 μ œκ±°ν•˜μ—¬ λΆ€ν˜Έν™”ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μ†‘μˆ˜μ‹  μ‹œμŠ€ν…œ 및 방법을 μ œκ³΅ν•  수 μžˆλ‹€.A multi-view video transmission / reception system and method may be provided for removing and encoding redundancy as an enhancement layer for a video at a view point other than the basic video.

도 1은 λ³Έ 발λͺ…에 λ”°λ₯Έ λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μ†‘μˆ˜μ‹  μ‹œμŠ€ν…œμ„ λ‚˜νƒ€λ‚Έ 도면. 1 is a diagram illustrating a system for transmitting and receiving multiview video according to the present invention.

*도 2λŠ” λ³Έ 발λͺ…에 λ”°λ₯Έ λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치의 ꡬ성을 개랡적으둜 λ‚˜νƒ€λ‚Έ λΈ”λŸ­λ„. 2 is a block diagram schematically illustrating a configuration of a multi-view video encoding apparatus according to the present invention.

도 3은 λ³Έ 발λͺ…에 λ”°λ₯Έ 슀트리밍 μ„œλ²„μ˜ ꡬ성을 개랡적으둜 λ‚˜νƒ€λ‚Έ λΈ”λŸ­λ„. Figure 3 is a block diagram schematically showing the configuration of a streaming server according to the present invention.

도 4λŠ” λ³Έ 발λͺ…에 λ”°λ₯Έ 방솑망 및 톡신망을 μ΄μš©ν•˜μ—¬ λ‹€μ‹œμ  λ™μ˜μƒμ„ 좜λ ₯ν•˜λŠ” 톡신 λ‹¨λ§κΈ°μ˜ ꡬ성을 개랡적으둜 λ‚˜νƒ€λ‚Έ λΈ”λŸ­λ„. 4 is a block diagram schematically illustrating a configuration of a communication terminal for outputting a multiview video using a broadcasting network and a communication network according to the present invention.

도 5λŠ” λ³Έ 발λͺ…에 λ”°λ₯Έ λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜κ°€ λ‹€μ‹œμ  λΉ„λ””μ˜€λ₯Ό λΆ€ν˜Έν™”ν•˜λŠ” 방법을 λ‚˜νƒ€λ‚Έ 흐름도. 5 is a flowchart illustrating a method of encoding a multiview video by a multiview video encoding apparatus according to the present invention.

도 6은 λ³Έ 발λͺ…에 λ”°λ₯Έ λ‹€μ‹œμ  λ™μ˜μƒμ„ 톡신 단말기에 μ œκ³΅ν•˜λŠ” 방법을 λ‚˜νƒ€λ‚Έ 흐름도. 6 is a flowchart illustrating a method for providing a multi-view video to a communication terminal according to the present invention.

도 7은 λ³Έ 발λͺ…에 λ”°λ₯Έ 슀트리밍 μ„œλ²„κ°€ 톡신 단말기에 ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ œκ³΅ν•˜λŠ” 방법을 λ‚˜νƒ€λ‚Έ 흐름도.7 is a flowchart illustrating a method for providing an enhancement layer bitstream to a communication terminal by a streaming server according to the present invention.

도 1은 λ³Έ 발λͺ…에 λ”°λ₯Έ λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μ†‘μˆ˜μ‹  μ‹œμŠ€ν…œμ„ λ‚˜νƒ€λ‚Έ 도면이닀. 1 is a diagram illustrating a system for transmitting and receiving multiview video according to the present invention.

도 1을 μ°Έμ‘°ν•˜λ©΄, λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μ†‘μˆ˜μ‹  μ‹œμŠ€ν…œμ€ 방솑망 및 톡신망을 톡해 λ™μ˜μƒμ„ μˆ˜μ‹ ν•˜μ—¬ 좜λ ₯ν•˜λŠ” 톡신 단말기(200), 볡수의 μ΄¬μ˜μˆ˜λ‹¨μ— μ˜ν•΄ 촬영된 λ™μ˜μƒμ„ λΆ€ν˜Έν™”ν•˜μ—¬ 톡신 단말기(200)의 μš”μ²­μ— 따라 방솑망 λ˜λŠ” 톡신망을 톡해 ν•΄λ‹Ή λ™μ˜μƒμ„ 톡신 단말기(200)둜 μ œκ³΅ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œ(100)을 ν¬ν•¨ν•œλ‹€. Referring to FIG. 1, a multi-view video transmission / reception system receives a video through a broadcasting network and a communication network, and outputs a video to a request of the communication terminal 200 by encoding a video photographed by a plurality of photographing means. Accordingly, the multi-view video transmission system 100 provides a corresponding video to the communication terminal 200 through a broadcasting network or a communication network.

톡신 단말기(200)λŠ” 방솑망을 톡해 μˆ˜μ‹ λœ κΈ°λ³Έ λ ˆμ΄μ–΄(base layer)의 λ™μ˜μƒμ„ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•˜λŠ” 쀑에 톡신망을 ν†΅ν•œ λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒ 좜λ ₯이 μš”μ²­λ˜λŠ” 경우, ν•΄λ‹Ή μ‹œμ  정보λ₯Ό ν¬ν•¨ν•˜λŠ” λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œ(100)으둜 μ „μ†‘ν•˜μ—¬ ν•΄λ‹Ή μ‹œμ μ˜ ν–₯상 λ ˆμ΄μ–΄(enhancement layer) λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μˆ˜μ‹ ν•˜κ³ , κ·Έ ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λ””μ½”λ”© 정보λ₯Ό μ΄μš©ν•˜μ—¬ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•œλ‹€.The communication terminal 200 re-views the video request message including the corresponding view information when the video output of another view through the communication network is requested while decoding and outputting the video of the base layer received through the broadcasting network. The video signal is transmitted to the video transmission system 100 to receive an enhancement layer bitstream at a corresponding time point, and to decode and output the enhancement layer bitstream using decoding information of the base layer.

톡신 단말기(200)λŠ” 방솑망 및 톡신망을 톡해 λ™μ˜μƒμ˜ μˆ˜μ‹ μ΄ κ°€λŠ₯ν•œ 이동톡신 단말기, μœ μ„ ν†΅μ‹  단말기 등을 ν¬ν•¨ν•œλ‹€. The communication terminal 200 includes a mobile communication terminal, a wired communication terminal, and the like, capable of receiving a video through a broadcasting network and a communication network.

톡신 단말기(200)λŠ” 톡신망을 톡해 λ™μ˜μƒμ΄ μˆ˜μ‹ λœ 경우, ν˜„μž¬ 좜λ ₯되고 μžˆλŠ” λ™μ˜μƒμ„ 톡신망을 ν†΅ν•œ λ™μ˜μƒμœΌλ‘œ μ „ν™˜ν•˜μ—¬ 좜λ ₯ν•œλ‹€. When a video is received through the communication network, the communication terminal 200 converts the currently output video into a video through the communication network and outputs the video.

λ˜ν•œ, 톡신 단말기(200)λŠ” 방솑망을 톡해 μˆ˜μ‹ λœ λ™μ˜μƒμ„ 좜λ ₯ν•˜λŠ” 쀑에 톡신망을 톡해 λ™μ˜μƒμ΄ μˆ˜μ‹ λœ 경우, 방솑망을 ν†΅ν•œ λ™μ˜μƒκ³Ό 톡신망을 ν†΅ν•œ λ™μ˜μƒμ„ PIP λ˜λŠ” POP둜 좜λ ₯ν•  수 μžˆλ‹€.In addition, when a video is received through the communication network while outputting the video received through the broadcasting network, the communication terminal 200 may output the video through the broadcasting network and the video through the communication network as PIP or POP.

λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œ(100)은 볡수의 μ΄¬μ˜μˆ˜λ‹¨μ— μ˜ν•΄ 촬영된 λ™μ˜μƒ μ€‘μ—μ„œ κΈ°λ³Έ λ™μ˜μƒμ„ κΈ°λ³Έ λ ˆμ΄μ–΄λ‘œ λΆ€ν˜Έν™”ν•˜μ—¬ 방솑망을 톡해 톡신 단말기(200)둜 μ œκ³΅ν•œλ‹€. The multi-view video transmission system 100 encodes a basic video into a base layer from a video photographed by a plurality of photographing means and provides it to the communication terminal 200 through a broadcasting network.

λ˜ν•œ, λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œ(100)은 λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ„ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΆ€ν˜Έν™” 정보λ₯Ό μ΄μš©ν•œ ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦ΌμœΌλ‘œ λΆ€ν˜Έν™”ν•˜κ³ , 톡신 단말기(200)λ‘œλΆ€ν„° λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€κ°€ μˆ˜μ‹ λ˜λŠ” 경우 ν•΄λ‹Ή μ‹œμ μ˜ ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 톡신망을 톡해 톡신 단말기(200)둜 μ œκ³΅ν•œλ‹€.Also, the multi-view video transmission system 100 encodes a video of another view into an enhancement layer bitstream using encoding information of the base layer, and when a video request message is received from the communication terminal 200, The bitstream is provided to the communication terminal 200 through a communication network.

λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œ(100)은 λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치(120), 방솑 μ„œλ²„(150), 슀트리밍 μ„œλ²„(170)λ₯Ό ν¬ν•¨ν•œλ‹€. The multiview video transmission system 100 includes a multiview video encoding apparatus 120, a broadcast server 150, and a streaming server 170.

λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치(120)λŠ” 볡수의 μ΄¬μ˜μˆ˜λ‹¨μ— μ˜ν•΄ 촬영된 λ‹€μ‹œμ  λ™μ˜μƒ μ€‘μ—μ„œ κΈ°λ³Έ λ™μ˜μƒμ„ κΈ°λ³Έ λ ˆμ΄μ–΄λ‘œ λΆ€ν˜Έν™”ν•˜κ³ , λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ— λŒ€ν•΄ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΆ€ν˜Έν™” 정보λ₯Ό μ΄μš©ν•˜μ—¬ ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦ΌμœΌλ‘œ λΆ€ν˜Έν™”ν•œλ‹€.The multi-view video encoding apparatus 120 encodes the base video from the multi-view video captured by the plurality of photographing means into a base layer, and encodes the base video into an enhancement layer bitstream using encoding information of the base layer with respect to the video from another viewpoint. do.

방솑 μ„œλ²„(150)λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치(120)μ—μ„œ λΆ€ν˜Έν™”λœ κΈ°λ³Έ λ™μ˜μƒμ„ 방솑망을 톡해 톡신 단말기(200)둜 μ œκ³΅ν•œλ‹€.The broadcast server 150 provides a basic video coded by the multi-view video encoding apparatus 120 to the communication terminal 200 through a broadcast network.

슀트리밍 μ„œλ²„(170)λŠ” 톡신망을 톡해 톡신 단말기(200)λ‘œλΆ€ν„° λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€κ°€ μˆ˜μ‹ λ˜λŠ” 경우, λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치(120)μ—μ„œ λΆ€ν˜Έν™”λœ ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Ό μ€‘μ—μ„œ ν•΄λ‹Ή μ‹œμ μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ„ νƒν•˜μ—¬ 톡신 단말기(200)둜 μ œκ³΅ν•œλ‹€. When the video server request message is received from the communication terminal 200 through the communication network, the streaming server 170 selects a bitstream of the corresponding view from among bitstreams of the enhancement layer encoded by the multiview video encoding apparatus 120 to communicate with the communication terminal. Provided at 200.

μ—¬κΈ°μ„œ, λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λŠ” 톡신 λ‹¨λ§κΈ°μ˜ μ‚¬μš©μžμ— μ˜ν•΄ μ„ νƒλœ μ‹œμ  정보λ₯Ό ν¬ν•¨ν•œλ‹€. Here, the video request message includes viewpoint information selected by the user of the communication terminal.

λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치(120)μ—μ„œ λΆ€ν˜Έν™”λœ 각 ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ΄ μ‹œμ  정보λ₯Ό ν¬ν•¨ν•˜κ³  μžˆμœΌλ―€λ‘œ, λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œ(100)은 톡신 단말기(200)λ‘œλΆ€ν„° λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€κ°€ μˆ˜μ‹ λœ 경우 ν•΄λ‹Ή μ‹œμ  μ •λ³΄μ˜ ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ„ νƒν•˜μ—¬ μ œκ³΅ν•  수 μžˆλ‹€. Since the bitstream of each enhancement layer coded by the multi-view video encoding apparatus 120 includes view information, the multi-view video transmission system 100 receives the view request information when the video request message is received from the communication terminal 200. A bitstream of an enhancement layer may be selected and provided.

λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치(120)λŠ” MPEG-2의 경우 MPEG-2 MVP의 2λ ˆμ΄μ–΄ ꡬ쑰둜 각 μ‹œμ μ— λŒ€ν•œ 압좕을 μˆ˜ν–‰ν•˜κ³ , H.264의 경우 SVC의 2 λ ˆμ΄μ–΄ ꡬ쑰둜 각 μ‹œμ μ— λŒ€ν•œ 압좕을 μˆ˜ν–‰ν•œλ‹€.The multi-view video encoding apparatus 120 compresses each viewpoint by using a two-layer structure of MPEG-2 MVP in the case of MPEG-2, and compresses each viewpoint by using a two-layer structure of SVC in the case of H.264. do.

방솑 μ„œλ²„(150)λŠ” ν˜„μž¬μ˜ λ””μ§€ν„Έ μ§€μƒνŒŒ 방솑, μœ„μ„± λ””μ§€ν„Έ 방솑, T-DMB, S-DMB μ†‘μΆœ μ‹œμŠ€ν…œμ— λŒ€μ‘ν•˜λŠ” μ„œλ²„μ΄λ©°, λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치(120)μ—μ„œ λΆ€ν˜Έν™”λœ λ™μ˜μƒμ˜ κΈ°λ³Έ λ ˆμ΄μ–΄λ₯Ό μ†‘μΆœν•œλ‹€. The broadcast server 150 corresponds to a current digital terrestrial broadcast, satellite digital broadcast, T-DMB, and S-DMB transmission system, and transmits a base layer of a video encoded by the multiview video encoding apparatus 120.

슀트리밍 μ„œλ²„(170)λŠ” 톡신망을 톡해 톡신 단말기(200)λ‘œλΆ€ν„° μ‚¬μš©μžκ°€ μ›ν•˜λŠ” μ‹œμ μ— λŒ€ν•œ 정보λ₯Ό μˆ˜μ‹ ν•˜μ—¬ 이에 ν•΄λ‹Ήν•˜λŠ” ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ IP망 기반으둜 μŠ€νŠΈλ¦¬λ°μ„ μˆ˜ν–‰ν•œλ‹€. The streaming server 170 receives information on a time point desired by a user from the communication terminal 200 through a communication network and performs streaming of an enhancement layer bitstream corresponding to the IP layer.

μ „μˆ ν•œ 바와 같이 κ΅¬μ„±λœ μ‹œμŠ€ν…œμ—μ„œλŠ” 계측적 λΆ€ν˜Έν™”κΈ° ꡬ쑰의 λΈ”λ‘κΈ°λ°˜ 코덱을 μ΄μš©ν•˜μ—¬ λ‹€μ‹œμ  λ™μ˜μƒμ„ μ••μΆ•ν•œλ‹€. μ΄λ•Œ, κΈ°λ³Έ λ ˆμ΄μ–΄λŠ” λ°©μ†‘λ§μ˜ μ½”λ±μœΌλ‘œ 디코딩이 κ°€λŠ₯ν•˜λ„λ‘ ν•˜κ³ , λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ€ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ 정보λ₯Ό μ΄μš©ν•˜μ—¬ μ˜μƒ κ°„ 쀑볡성을 쀄여 μ••μΆ•ν•˜λŠ” λ ˆμ΄μ–΄ 기반 λ‹€μ‹œμ  λ™μ˜μƒ μ••μΆ• 방법을 톡해 λ™μ˜μƒμ„ μ••μΆ•ν•œλ‹€. 이후, 방솑 μˆ˜μ‹  μž₯λΉ„λ§Œμ„ κ°€μ§„ μ‚¬μš©μžλŠ” κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λ™μ˜μƒμ„ μ‹œμ²­ν•˜λ„λ‘ ν•˜κ³ , 좔가적인 톡신망과 μ—°κ²°λœ μ‚¬μš©μžλŠ” μžμ‹ μ΄ μ›ν•˜λŠ” μ‹œμ μ„ μ„ νƒν•˜μ—¬ 슀트리밍 μ„œλ²„(170)에 μš”μ²­ν•˜λ©΄, 슀트리밍 μ„œλ²„(170)λ‘œλΆ€ν„° μš”μ²­ μ‹œμ μ— λ§žλŠ” 좔가적인 λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 전솑받아 μ‹œμ²­ν•  수 μžˆλ‹€.In the system configured as described above, a multi-view video is compressed using a block-based codec of a hierarchical encoder structure. At this time, the base layer can be decoded by the codec of the broadcast network, and the video of the other time is compressed by using a layer-based multi-view video compression method that compresses by reducing the redundancy between the images using the information of the base layer. Subsequently, a user having only a broadcast receiving device watches a video of a base layer, and a user connected to an additional communication network selects a desired time point and requests the streaming server 170 from the streaming server 170 to meet the request time. Additional bitstreams can be received and viewed.

도 2λŠ” λ³Έ 발λͺ…에 λ”°λ₯Έ λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치의 ꡬ성을 개랡적으둜 λ‚˜νƒ€λ‚Έ λΈ”λŸ­λ„μ΄λ‹€. 2 is a block diagram schematically illustrating a configuration of a multiview video encoding apparatus according to the present invention.

도 2λ₯Ό μ°Έμ‘°ν•˜λ©΄, λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치(120)λŠ” 볡수의 μ΄¬μ˜μˆ˜λ‹¨μ— μ˜ν•΄ 촬영된 λ‹€μ‹œμ  λ™μ˜μƒ μ€‘μ—μ„œ κΈ°λ³Έ λ™μ˜μƒμ„ κΈ°λ³Έ λ ˆμ΄μ–΄λ‘œ λΆ€ν˜Έν™”ν•˜λŠ” κΈ°λ³Έ λ ˆμ΄μ–΄(base layer) λΆ€ν˜Έν™”λΆ€(122), λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ— λŒ€ν•΄ κΈ°λ³Έ λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€(122)μ—μ„œ λΆ€ν˜Έν™”λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΆ€ν˜Έν™” 정보λ₯Ό μ΄μš©ν•˜μ—¬ ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„μŠ€νŠΈμŠ€νŠΈλ¦ΌμœΌλ‘œ λΆ€ν˜Έν™”ν•˜λŠ” 볡수의 ν–₯상 λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€(124a, 124b,..., 124n, μ΄ν•˜ 124둜 칭함)λ₯Ό ν¬ν•¨ν•œλ‹€.Referring to FIG. 2, the multiview video encoding apparatus 120 may include a base layer encoder 122 that encodes a base video as a base layer among multiview videos photographed by a plurality of photographing means, and at different viewpoints. A plurality of enhancement layer encoders 124a, 124b,..., 124n, 124, which are encoded into a beast stream of an enhancement layer by using encoding information of the base layer encoded by the base layer encoder 122 for a video, are hereinafter referred to as 124. ).

κΈ°λ³Έ λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€(122)λŠ” κΈ°λ³Έ λ™μ˜μƒμ— λŒ€ν•΄ H.264 λ˜λŠ” MPEG-2와 같은 κΈ°μ‘΄ 방솑 μ‹œμŠ€ν…œκ³Ό ν˜Έν™˜λ˜λŠ” 코덱을 μ‚¬μš©ν•˜μ—¬ λΆ€ν˜Έν™”ν•œλ‹€.The base layer encoder 122 encodes the basic video using a codec compatible with an existing broadcasting system such as H.264 or MPEG-2.

ν–₯상 λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€(124)λŠ” view point 1을 λΆ€ν˜Έν™”ν•˜μ—¬ 제1 ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 좜λ ₯ν•˜λŠ” 제1 ν–₯상 λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€(124a), view point 2λ₯Ό λΆ€ν˜Έν™”ν•˜μ—¬ 제2 ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 좜λ ₯ν•˜λŠ” 제2 ν–₯상 λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€(124b) λ“±μœΌλ‘œ κ΅¬μ„±λœλ‹€. The enhancement layer encoder 124 encodes the view point 1 to output the first enhancement layer bitstream, and the second enhancement layer encodes the view point 2 to output the second enhancement layer bitstream. And an enhancement layer encoder 124b.

λ˜ν•œ, ν–₯상 λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€(124)λŠ” λΆ€ν˜Έν™”λœ 각 ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ— μ‹œμ  정보λ₯Ό μ„€μ •ν•œλ‹€. μ΄λ ‡κ²Œ μ„€μ •λœ μ‹œμ  μ •λ³΄λŠ” 톡신 λ‹¨λ§κΈ°λ‘œλΆ€ν„° λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€κ°€ μˆ˜μ‹ λœ 경우 ν•΄λ‹Ή μ‹œμ  μ •λ³΄μ˜ ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 선택할 수 μžˆλ„λ‘ ν•œλ‹€. In addition, the enhancement layer encoder 124 sets view information in the bitstream of each encoded enhancement layer. The viewpoint information set as described above may select an enhancement layer bitstream of the viewpoint information when a video request message is received from the communication terminal.

ν–₯상 λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€(124)λŠ” κΈ°λ³Έ λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€(122)μ—μ„œ λΆ€ν˜Έν™”λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΆ€ν˜Έν™” 정보λ₯Ό μ΄μš©ν•˜μ—¬ λΆ€ν˜Έν™”λ₯Ό μˆ˜ν–‰ν•˜λ―€λ‘œ, λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ— λŒ€ν•΄ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λ™μ˜μƒμ— λŒ€ν•΄ 쀑볡성을 μ œκ±°ν•œλ‹€. Since the enhancement layer encoder 124 performs encoding by using the encoding information of the base layer encoded by the base layer encoder 122, the enhancement layer encoder 124 removes redundancy of the video of the base layer from the video of another viewpoint.

ν–₯상 λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€(124)λŠ” MPEG-2의 경우 MPEG-2 MVP의 2λ ˆμ΄μ–΄ ꡬ쑰둜 각 μ‹œμ μ— λŒ€ν•œ 압좕을 μˆ˜ν–‰ν•˜κ³ , H.264의 경우 SVC의 2 λ ˆμ΄μ–΄ ꡬ쑰둜 각 μ‹œμ μ— λŒ€ν•œ 압좕을 μˆ˜ν–‰ν•œλ‹€.The enhancement layer encoder 124 compresses each view in a two-layer structure of MPEG-2 MVP in the case of MPEG-2, and compresses each view in a two-layer structure of SVC in the case of H.264. .

도 3은 λ³Έ 발λͺ…에 λ”°λ₯Έ 슀트리밍 μ„œλ²„μ˜ ꡬ성을 개랡적으둜 λ‚˜νƒ€λ‚Έ λΈ”λŸ­λ„μ΄λ‹€. 3 is a block diagram schematically showing the configuration of a streaming server according to the present invention.

도 3을 μ°Έμ‘°ν•˜λ©΄, 슀트리밍 μ„œλ²„(170)λŠ” 톡신망을 톡해 톡신 λ‹¨λ§κΈ°μ™€μ˜ 톡신을 μœ„ν•œ 톡신뢀(172), 톡신뢀(172)λ₯Ό 톡해 톡신 λ‹¨λ§κΈ°λ‘œλΆ€ν„° μ‹œμ  정보가 ν¬ν•¨λœ λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€κ°€ μˆ˜μ‹ λ˜λŠ” 경우 λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό λΆ„μ„ν•˜μ—¬ ν•΄λ‹Ή μ‹œμ μ„ νŒλ‹¨ν•˜λŠ” μ‹œμ  νŒλ‹¨λΆ€(174), μ‹œμ  νŒλ‹¨λΆ€(174)μ—μ„œ νŒλ‹¨λœ μ‹œμ μ— ν•΄λ‹Ήν•˜λŠ” ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ„ νƒν•˜μ—¬ 톡신 단말기에 μ œκ³΅λ˜λ„λ‘ ν•˜λŠ” μŠ€μœ„μΉ­λΆ€(176)λ₯Ό ν¬ν•¨ν•œλ‹€. Referring to FIG. 3, the streaming server 170 requests a video when a video request message including viewpoint information is received from a communication terminal 172 and a communication terminal 172 for communication with a communication terminal through a communication network. And a switching unit 176 for selecting a enhancement layer bitstream corresponding to the viewpoint determined by the viewpoint determination unit 174 by analyzing the message and determining the corresponding viewpoint. .

λ˜ν•œ, 슀트리밍 μ„œλ²„(170)λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜λ‘œλΆ€ν„° 볡수의 ν–₯μƒΉ λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μˆ˜μ‹ ν•˜μ—¬ μ €μž₯ν•˜λŠ” μ €μž₯λΆ€(178)λ₯Ό 더 포함할 수 μžˆλ‹€. In addition, the streaming server 170 may further include a storage unit 178 that receives and stores the plurality of enhancement layer bitstreams from the multiview video encoding apparatus.

상기 μŠ€μœ„μΉ­λΆ€(176)λŠ” μ‹œμ  νŒλ‹¨λΆ€(174)μ—μ„œ νŒλ‹¨λœ μ‹œμ μ— ν•΄λ‹Ήν•˜λŠ” ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ €μž₯λΆ€(178) λ˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치(120)λ‘œλΆ€ν„° μΆ”μΆœν•˜μ—¬ 톡신 단말기(200)에 μ œκ³΅ν•˜λŠ” 역할을 μˆ˜ν–‰ν•œλ‹€. The switching unit 176 extracts an enhancement layer bitstream corresponding to the viewpoint determined by the viewpoint determination unit 174 from the storage unit 178 or the multiview video encoding apparatus 120 and provides the extracted communication layer 200 to the communication terminal 200. Play a role.

도 4λŠ” λ³Έ 발λͺ…에 λ”°λ₯Έ 방솑망 및 톡신망을 μ΄μš©ν•˜μ—¬ λ‹€μ‹œμ  λ™μ˜μƒμ„ 좜λ ₯ν•˜λŠ” 톡신 λ‹¨λ§κΈ°μ˜ ꡬ성을 개랡적으둜 λ‚˜νƒ€λ‚Έ λΈ”λŸ­λ„μ΄λ‹€. 4 is a block diagram schematically illustrating a configuration of a communication terminal for outputting a multi-view video using a broadcasting network and a communication network according to the present invention.

도 4λ₯Ό μ°Έμ‘°ν•˜λ©΄, 방솑망 및 톡신망을 μ΄μš©ν•˜μ—¬ λ‹€μ‹œμ  λ™μ˜μƒμ„ 좜λ ₯ν•˜λŠ” 톡신 단말기(200)λŠ” 톡신뢀(202), μž…λ ₯λΆ€(204), μ €μž₯λΆ€(206), 좜λ ₯λΆ€(208), λ””μ½”λ”©λΆ€(210), μ„œλΉ„μŠ€ μš”μ²­λΆ€(214)λ₯Ό ν¬ν•¨ν•œλ‹€. Referring to FIG. 4, a communication terminal 200 for outputting a multiview video using a broadcasting network and a communication network includes a communication unit 202, an input unit 204, a storage unit 206, an output unit 208, and a decoding unit 210. ), The service request unit 214.

λ””μ½”λ”©λΆ€(210)λŠ” μΈμ½”λ”©λœ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ λ””μ½”λ”©ν•˜λŠ” κ²ƒμœΌλ‘œ, 방솑망을 톡해 μˆ˜μ‹ λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•˜λŠ” 쀑에 톡신망을 톡해 ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ΄ μˆ˜μ‹ λœ 경우 ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λ””μ½”λ”© 정보λ₯Ό μ΄μš©ν•˜μ—¬ λ””μ½”λ”©ν•œλ‹€. The decoding unit 210 decodes the encoded bitstream. When the bitstream of the enhancement layer is received through the communication network while decoding and outputting the bitstream of the base layer received through the broadcasting network, the decoding unit 210 bases the bitstream of the enhancement layer. Decode using the decoding information of the layer.

*λ˜ν•œ, λ””μ½”λ”©λΆ€(210)λŠ” 방솑망을 톡해 μˆ˜μ‹ λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ λ””μ½”λ”©ν•˜λŠ” κΈ°λ³Έ λ ˆμ΄μ–΄ λ””μ½”λ”©λΆ€(211), 톡신망을 톡해 μˆ˜μ‹ λœ ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ κΈ°λ³Έ λ ˆμ΄μ–΄ λ””μ½”λ”©λΆ€(211)μ—μ„œ λ””μ½”λ”©λœ 정보λ₯Ό μ΄μš©ν•˜μ—¬ λ””μ½”λ”©ν•˜λŠ” ν–₯상 λ ˆμ΄μ–΄ λ””μ½”λ”©λΆ€(212)둜 κ΅¬μ„±λœλ‹€. In addition, the decoding unit 210 may decode the base layer decoding unit 211 for decoding the bitstream of the base layer received through the broadcasting network, and decode the bitstream of the enhancement layer received through the communication network in the base layer decoding unit 211. And an enhancement layer decoding unit 212 that decodes the received information.

μ„œλΉ„μŠ€ μš”μ²­λΆ€(214)λŠ” μž…λ ₯λΆ€(204)λ₯Ό 톡해 λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ΄ μš”μ²­λ˜λŠ” 경우, ν•΄λ‹Ή μ‹œμ  정보λ₯Ό ν¬ν•¨ν•˜λŠ” λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό μƒμ„±ν•˜μ—¬ 톡신뢀(202)λ₯Ό 톡해 λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œμœΌλ‘œ μ „μ†‘ν•œλ‹€. When a video of another point in time is requested through the input unit 204, the service requesting unit 214 generates a video request message including corresponding point information and transmits it to the multi-view video transmission system through the communication unit 202.

μ„œλΉ„μŠ€ μš”μ²­λΆ€(214)에 μ˜ν•΄ μš”μ²­λœ λ™μ˜μƒμ€ 톡신뢀(202)λ₯Ό 톡해 μˆ˜μ‹ ν•˜μ—¬ λ””μ½”λ”©λΆ€(210)에 μ˜ν•΄ λ””μ½”λ”©λ˜μ–΄ 좜λ ₯λΆ€(208)에 좜λ ₯λœλ‹€. The video requested by the service requester 214 is received through the communication unit 202, decoded by the decoding unit 210, and output to the output unit 208.

도 5λŠ” λ³Έ 발λͺ…에 λ”°λ₯Έ λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜κ°€ λ‹€μ‹œμ  λΉ„λ””μ˜€λ₯Ό λΆ€ν˜Έν™”ν•˜λŠ” 방법을 λ‚˜νƒ€λ‚Έ 흐름도이닀. 5 is a flowchart illustrating a method of encoding a multiview video by a multiview video encoding apparatus according to the present invention.

도 5λ₯Ό μ°Έμ‘°ν•˜λ©΄, λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치(120)λŠ” 볡수의 μ΄¬μ˜μˆ˜λ‹¨μ— μ˜ν•΄ 촬영된 λ‹€μ‹œμ  λ™μ˜μƒμ΄ μˆ˜μ‹ λ˜λ©΄(S500), μˆ˜μ‹ λœ λ‹€μ‹œμ  λ™μ˜μƒ μ€‘μ—μ„œ κΈ°λ³Έ λ™μ˜μƒμ„ κΈ°λ³Έ λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦ΌμœΌλ‘œ λΆ€ν˜Έν™”ν•œλ‹€(S502). Referring to FIG. 5, when the multiview video encoding apparatus 120 receives a multiview video photographed by a plurality of photographing means (S500), the multiview video encoding apparatus 120 encodes the base video into a base layer bitstream from the received multiview video ( S502).

그런 λ‹€μŒ λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치(120)λŠ” λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ— λŒ€ν•΄ κΈ°λ³Έ λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ˜ λΆ€ν˜Έν™” 정보λ₯Ό μ΄μš©ν•˜μ—¬ ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„μŠ€νŠΈμŠ€νŠΈλ¦ΌμœΌλ‘œ λΆ€ν˜Έν™”ν•œλ‹€(S504).Then, the multi-view video encoding apparatus 120 encodes the video of the other view into the bitstream of the enhancement layer by using encoding information of the base layer bitstream (S504).

그리고, κΈ°λ³Έ λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ€ 방솑망을 톡해 톡신 단말기에 제곡되고, ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ€ 톡신망을 톡해 톡신 단말기에 μ œκ³΅λœλ‹€. The base layer bitstream is provided to the communication terminal through the broadcast network, and the enhancement layer bitstream is provided to the communication terminal through the communication network.

도 6은 λ³Έ 발λͺ…에 λ”°λ₯Έ λ‹€μ‹œμ  λ™μ˜μƒμ„ 톡신 단말기에 μ œκ³΅ν•˜λŠ” 방법을 λ‚˜νƒ€λ‚Έ 흐름도이닀. 6 is a flowchart illustrating a method of providing a multi-view video to a communication terminal according to the present invention.

도 6을 μ°Έμ‘°ν•˜λ©΄, 방솑 μ„œλ²„(150)λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치(120)μ—μ„œ λΆ€ν˜Έν™”λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 방솑망을 톡해 톡신 단말기(200)둜 μ œκ³΅ν•œλ‹€(S600).Referring to FIG. 6, the broadcast server 150 provides a bitstream of a base layer encoded by the multiview video encoding apparatus 120 to the communication terminal 200 through a broadcast network (S600).

λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치(120)λŠ” κΈ°λ³Έ λ ˆμ΄μ–΄λ‘œ λΆ€ν˜Έν™”λœ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 방솑 μ„œλ²„(150)에 μ œκ³΅ν•˜κ³ , ν–₯상 λ ˆμ΄μ–΄λ‘œ λΆ€ν˜Έν™”λœ λΉ„μŠ€μŠ€νŠΈλ¦Όμ„ 슀트리밍 μ„œλ²„(170)에 μ œκ³΅ν•œλ‹€. The multi-view video encoding apparatus 120 provides a bitstream encoded with a base layer to the broadcast server 150 and provides a bisstream encoded with an enhancement layer to the streaming server 170.

이어, 톡신 단말기(200)λŠ” 방솑 μ„œλ²„(150)λ‘œλΆ€ν„°μ˜ κΈ°λ³Έ λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•œλ‹€(S602).Subsequently, the communication terminal 200 decodes and outputs a base layer bitstream from the broadcast server 150 (S602).

톡신 단말기(200)λŠ” 방솑망을 톡해 λ™μ˜μƒμ„ 좜λ ₯ν•˜λŠ” 쀑에 톡신망을 ν†΅ν•œ λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒ 좜λ ₯이 μš”μ²­λ˜λ©΄(S604), ν•΄λ‹Ή μ‹œμ  정보λ₯Ό ν¬ν•¨ν•˜λŠ” λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό 슀트리밍 μ„œλ²„(170)둜 μ „μ†‘ν•œλ‹€(S606).When outputting a video at another point in time through the communication network while outputting a video through the broadcasting network (S604), the communication terminal 200 transmits a video request message including the corresponding view information to the streaming server 170 (S606). .

즉, 톡신 λ‹¨λ§κΈ°μ˜ μ‚¬μš©μžκ°€ 방솑망을 ν†΅ν•œ λ™μ˜μƒμ„ μ‹œμ²­ν•˜λŠ” 쀑에 λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ„ μ›ν•˜μ—¬ 톡신망을 톡해 λ‹€λ₯Έ μ‹œμ μ„ μ„ νƒν•˜λ©΄, 톡신 단말기(200)λŠ” μ‚¬μš©μžμ— μ˜ν•΄ μ„ νƒλœ μ‹œμ  정보λ₯Ό ν¬ν•¨ν•˜λŠ” λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό μƒμ„±ν•˜μ—¬ 슀트리밍 μ„œλ²„(170)둜 μ „μ†‘ν•œλ‹€.That is, when a user of the communication terminal selects a different time point through the communication network while watching a video through the broadcasting network and selects a different time point through the communication network, the communication terminal 200 receives a video request message including view information selected by the user. It generates and transmits to the streaming server 170.

슀트리밍 μ„œλ²„(170)λŠ” 톡신 단말기(200)λ‘œλΆ€ν„° λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€κ°€ μˆ˜μ‹ λ˜λ©΄(S608), λ™μ˜μƒ μš”μ²­μ— 따라 ν•΄λ‹Ή μ‹œμ μ˜ ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ„ νƒν•˜μ—¬ 톡신망을 톡해 톡신 단말기(200)에 μ œκ³΅ν•œλ‹€(S610).When the streaming server 170 receives the video request message from the communication terminal 200 (S608), the streaming server 170 selects an enhancement layer bitstream at the corresponding point in time according to the video request and provides it to the communication terminal 200 through the communication network (S610). .

즉, 슀트리밍 μ„œλ²„(170)λŠ” 톡신 단말기(200)λ‘œλΆ€ν„° λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€κ°€ μˆ˜μ‹ λ˜λ©΄, λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό λΆ„μ„ν•˜μ—¬ μ‹œμ  정보λ₯Ό μΆ”μΆœν•˜κ³ , μΆ”μΆœλœ μ‹œμ  정보에 ν•΄λ‹Ήν•˜λŠ” ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ„ νƒν•˜μ—¬ 톡신망을 톡해 톡신 단말기(200)에 μ œκ³΅ν•œλ‹€. That is, when the streaming server 170 receives the video request message from the communication terminal 200, the streaming server 170 analyzes the video request message, extracts view information, and selects an enhancement layer bitstream corresponding to the extracted view information through a communication network. Provided to the communication terminal 200.

그러면, 톡신 단말기(200)λŠ” 슀트리밍 μ„œλ²„(170)λ‘œλΆ€ν„° μ „μ†‘λœ ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λ””μ½”λ”© 정보λ₯Ό μ΄μš©ν•˜μ—¬ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•œλ‹€(S612).Then, the communication terminal 200 decodes and outputs the enhancement layer bitstream transmitted from the streaming server 170 using the decoding information of the base layer (S612).

도 7은 λ³Έ 발λͺ…에 λ”°λ₯Έ 슀트리밍 μ„œλ²„κ°€ 톡신 단말기에 ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ œκ³΅ν•˜λŠ” 방법을 λ‚˜νƒ€λ‚Έ 흐름도이닀. 7 is a flowchart illustrating a method for providing an enhancement layer bitstream to a communication terminal by a streaming server according to the present invention.

도 7을 μ°Έμ‘°ν•˜λ©΄, 슀트리밍 μ„œλ²„(170)λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치(120)λ‘œλΆ€ν„° 볡수의 ν–₯상 λ ˆμ΄μ–΄ λΉ„μŠ€μŠ€νŠΈλ¦Όμ„ μˆ˜μ‹ ν•˜μ—¬ μ €μž₯ν•œλ‹€(S700).Referring to FIG. 7, the streaming server 170 receives and stores a plurality of enhancement layer bisstreams from the multiview video encoding apparatus 120 (S700).

κ·ΈλŸ°λ‹€μŒ 슀트리밍 μ„œλ²„(170)λŠ” 톡신 단말기(200)λ‘œλΆ€ν„° μ‹œμ  정보가 ν¬ν•¨λœ λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€κ°€ μˆ˜μ‹ λ˜λ©΄(S702), λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό λΆ„μ„ν•˜μ—¬ ν•΄λ‹Ή μ‹œμ μ„ νŒλ‹¨ν•œλ‹€(S704).Then, when the streaming server 170 receives the video request message including the viewpoint information from the communication terminal 200 (S702), the streaming server 170 analyzes the video request message to determine the corresponding viewpoint (S704).

그런 λ‹€μŒ 슀트리밍 μ„œλ²„(170)λŠ” μ „μˆ ν•œ 바와 같이 μ €μž₯된 ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Ό μ€‘μ—μ„œ νŒλ‹¨λœ μ‹œμ μ— ν•΄λ‹Ήν•˜λŠ” ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ„ νƒν•˜μ—¬ 톡신 단말기(200)둜 μ „μ†‘ν•œλ‹€(S706).Then, the streaming server 170 selects the enhancement layer bitstream corresponding to the determined time point from the stored enhancement layer bitstream as described above and transmits it to the communication terminal 200 (S706).

μ „μˆ ν•œ 바와 같이, λ³Έ 발λͺ…이 μ†ν•˜λŠ” κΈ°μˆ λΆ„μ•Όμ˜ λ‹Ήμ—…μžλŠ” λ³Έ 발λͺ…이 κ·Έ 기술적 μ‚¬μƒμ΄λ‚˜ ν•„μˆ˜μ  νŠΉμ§•μ„ λ³€κ²½ν•˜μ§€ μ•Šκ³ μ„œ λ‹€λ₯Έ ꡬ체적인 ν˜•νƒœλ‘œ μ‹€μ‹œλ  수 μžˆλ‹€λŠ” 것을 이해할 수 μžˆμ„ 것이닀. κ·ΈλŸ¬λ―€λ‘œ μ΄μƒμ—μ„œ κΈ°μˆ ν•œ μ‹€μ‹œμ˜ˆλ“€μ€ λͺ¨λ“  λ©΄μ—μ„œ μ˜ˆμ‹œμ μΈ 것이며 ν•œμ •μ μΈ 것이 μ•„λ‹Œ κ²ƒμœΌλ‘œμ„œ μ΄ν•΄ν•΄μ•Όλ§Œ ν•œλ‹€. λ³Έ 발λͺ…μ˜ λ²”μœ„λŠ” 상기 μƒμ„Έν•œ μ„€λͺ…λ³΄λ‹€λŠ” ν›„μˆ ν•˜λŠ” νŠΉν—ˆμ²­κ΅¬λ²”μœ„μ— μ˜ν•˜μ—¬ λ‚˜νƒ€λ‚΄μ–΄μ§€λ©°, νŠΉν—ˆμ²­κ΅¬λ²”μœ„μ˜ 의미 및 λ²”μœ„ 그리고 κ·Έ λ“±κ°€κ°œλ…μœΌλ‘œλΆ€ν„° λ„μΆœλ˜λŠ” λͺ¨λ“  λ³€κ²½ λ˜λŠ” λ³€ν˜•λœ ν˜•νƒœκ°€ λ³Έ 발λͺ…μ˜ λ²”μœ„μ— ν¬ν•¨λ˜λŠ” κ²ƒμœΌλ‘œ ν•΄μ„λ˜μ–΄μ•Ό ν•œλ‹€.As described above, those skilled in the art will appreciate that the present invention can be implemented in other specific forms without changing the technical spirit or essential features. Therefore, the above-described embodiments are to be understood as illustrative in all respects and not as restrictive. The scope of the present invention is shown by the following claims rather than the detailed description, and all changes or modifications derived from the meaning and scope of the claims and their equivalents should be construed as being included in the scope of the present invention. do.

λ³Έ 발λͺ…에 λ”°λ₯Έ λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μ†‘μˆ˜μ‹  μ‹œμŠ€ν…œ 및 방법은 방솑망을 톡해 λ™μ˜μƒμ„ μ‹œμ²­ν•˜λŠ” 쀑에 μ‚¬μš©μžκ°€ λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒ μ‹œμ²­μ„ μ›ν•˜λŠ” 경우 톡신망을 톡해 μΆ”κ°€ λ™μ˜μƒ 정보λ₯Ό μ „μ†‘ν•˜μ—¬ μ‚¬μš©μžκ°€ μ›ν•˜λŠ” μ‹œμ μ˜ λ™μ˜μƒμ„ μ‹œμ²­ν•  수 μžˆλ„λ‘ ν•˜λŠ” κΈ°μˆ μ— μ΄μš©ν•  수 μžˆλ‹€. The system and method for transmitting and receiving a multi-view video according to the present invention transmits additional video information through a communication network so that a user can watch a video at a desired time point while watching a video through a broadcasting network. It can be used for the technique.

Claims (12)

λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜μ— μžˆμ–΄μ„œ,In the multi-view video encoding apparatus, 볡수의 μ΄¬μ˜μˆ˜λ‹¨μ— μ˜ν•΄ 촬영된 λ‹€μ‹œμ  λ™μ˜μƒ μ€‘μ—μ„œ κΈ°λ³Έ λ™μ˜μƒμ„ κΈ°λ³Έ λ ˆμ΄μ–΄λ‘œ λΆ€ν˜Έν™”ν•˜λŠ” κΈ°λ³Έ λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€; 및A base layer encoder configured to encode a base video as a base layer among the multi-view videos captured by the plurality of photographing means; And λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ— λŒ€ν•΄ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€μ—μ„œ λΆ€ν˜Έν™”λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΆ€ν˜Έν™” 정보λ₯Ό μ΄μš©ν•˜μ—¬ ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦ΌμœΌλ‘œ λΆ€ν˜Έν™”ν•˜λŠ” 볡수의 ν–₯상 λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€;A plurality of enhancement layer encoders encoding the bitstream of the enhancement layer by using encoding information of the base layer encoded by the base layer encoder for a video of another viewpoint; λ₯Ό ν¬ν•¨ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치.Multiview video encoding apparatus comprising a. 제1항에 μžˆμ–΄μ„œ, The method of claim 1, 상기 ν–₯상 λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€λŠ” λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ— λŒ€ν•΄ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λ™μ˜μƒμ— λŒ€ν•΄ 쀑볡성을 μ œκ±°ν•˜μ—¬ λΆ€ν˜Έν™”ν•˜λŠ” 것을 νŠΉμ§•μœΌλ‘œ ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치.The enhancement layer encoder is a multi-view video encoding apparatus, characterized in that the video is removed from the redundancy of the video of the base layer to remove the video. 제1항에 μžˆμ–΄μ„œ, The method of claim 1, 상기 ν–₯상 λ ˆμ΄μ–΄ λΆ€ν˜Έν™”λΆ€λŠ” λΆ€ν˜Έν™”λœ 각 ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ— μ‹œμ  정보λ₯Ό μ„€μ •ν•˜λŠ” 것을 νŠΉμ§•μœΌλ‘œ ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치.And the enhancement layer encoder sets view information in a bitstream of each encoded enhancement layer. 톡신망을 톡해 λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯치 및 톡신 단말기와 ν†΅μ‹ ν•˜κΈ° μœ„ν•œ 톡신뢀;A communication unit for communicating with a multi-view video encoding apparatus and a communication terminal through a communication network; 상기 톡신뢀λ₯Ό 톡해 상기 톡신 λ‹¨λ§κΈ°λ‘œλΆ€ν„° μ‹œμ  정보가 ν¬ν•¨λœ λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€κ°€ μˆ˜μ‹ λ˜λŠ” 경우, 상기 λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό λΆ„μ„ν•˜μ—¬ ν•΄λ‹Ή μ‹œμ μ„ νŒλ‹¨ν•˜λŠ” μ‹œμ  νŒλ‹¨λΆ€; 및A time determination unit determining a corresponding time point by analyzing the video request message when a video request message including view information is received from the communication terminal through the communication unit; And 상기 μ‹œμ  νŒλ‹¨λΆ€μ—μ„œ νŒλ‹¨λœ μ‹œμ μ— ν•΄λ‹Ήν•˜λŠ” ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μ„ νƒν•˜μ—¬ 상기 톡신 단말기에 μ œκ³΅λ˜λ„λ‘ ν•˜λŠ” μŠ€μœ„μΉ­λΆ€;A switching unit for selecting an enhancement layer bitstream corresponding to the time determined by the time determining unit and providing the enhanced layer bitstream to the communication terminal; λ₯Ό ν¬ν•¨ν•˜λŠ” 슀트리밍 μ„œλ²„.Streaming server comprising a. 제4항에 μžˆμ–΄μ„œ, The method of claim 4, wherein 상기 λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜λ‘œλΆ€ν„° 볡수의 ν–₯μƒΉ λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μˆ˜μ‹ ν•˜μ—¬ μ €μž₯ν•˜λŠ” μ €μž₯λΆ€λ₯Ό 더 ν¬ν•¨ν•˜λŠ” 슀트리밍 μ„œλ²„.And a storage unit configured to receive and store a plurality of enhancement layer bitstreams from the multiview video encoding apparatus. 제4항에 μžˆμ–΄μ„œ, The method of claim 4, wherein 상기 μŠ€μœ„μΉ­λΆ€λŠ” μ €μž₯λΆ€ λ˜λŠ” 상기 λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜λ‘œλΆ€ν„° ν•΄λ‹Ή ν–₯상 λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ μΆ”μΆœν•˜μ—¬ 상기 톡신 단말기에 μ œκ³΅ν•˜λŠ” 것을 νŠΉμ§•μœΌλ‘œ ν•˜λŠ” 슀트리밍 μ„œλ²„.And the switching unit extracts a corresponding enhancement layer bitstream from a storage unit or the multi-view video encoding apparatus and provides the extracted enhancement layer bitstream to the communication terminal. 톡신뢀, μž…λ ₯λΆ€ 및 좜λ ₯λΆ€κ°€ κ΅¬λΉ„λ˜μ–΄ 방솑망 및 톡신망을 μ΄μš©ν•˜μ—¬ λ‹€μ‹œμ  λ™μ˜μƒμ„ 좜λ ₯ν•˜λŠ” 톡신 단말기에 μžˆμ–΄μ„œ,In the communication terminal, the input unit and the output unit is provided, a communication terminal for outputting a multi-view video using a broadcasting network and a communication network, 상기 방솑망을 톡해 μˆ˜μ‹ λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•˜λŠ” 쀑에 톡신망을 톡해 ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ΄ μˆ˜μ‹ λœ 경우, ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λ””μ½”λ”© 정보λ₯Ό μ΄μš©ν•˜μ—¬ λ””μ½”λ”©ν•˜λŠ” λ””μ½”λ”©λΆ€; 및A decoding unit for decoding the bitstream of the enhancement layer using decoding information of the base layer when the bitstream of the enhancement layer is received through the communication network while decoding and outputting the bitstream of the base layer received through the broadcasting network; And 상기 μž…λ ₯λΆ€λ₯Ό 톡해 λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ΄ μš”μ²­λ˜λŠ” 경우, ν•΄λ‹Ή μ‹œμ  정보λ₯Ό ν¬ν•¨ν•˜λŠ” λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό μƒμ„±ν•˜μ—¬ 상기 톡신뢀λ₯Ό 톡해 λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œμœΌλ‘œ μ „μ†‘ν•˜λŠ” μ„œλΉ„μŠ€ μš”μ²­λΆ€;A service request unit for generating a video request message including corresponding view information and transmitting the video request message including corresponding view information to the multi-view video transmission system through the input unit; λ₯Ό ν¬ν•¨ν•˜λŠ” 톡신 단말기.Communication terminal comprising a. 제7항에 μžˆμ–΄μ„œ, The method of claim 7, wherein 상기 λ””μ½”λ”©λΆ€λŠ”, The decoding unit, 상기 방솑망을 톡해 μˆ˜μ‹ λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ λ””μ½”λ”©ν•˜λŠ” κΈ°λ³Έ λ ˆμ΄μ–΄ λ””μ½”λ”©λΆ€; 및A base layer decoder for decoding a bitstream of a base layer received through the broadcasting network; And 상기 톡신망을 톡해 μˆ˜μ‹ λœ μˆ˜μ‹ λœ ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄ λ””μ½”λ”©λΆ€μ—μ„œ λ””μ½”λ”©λœ 정보λ₯Ό μ΄μš©ν•˜μ—¬ λ””μ½”λ”©ν•˜λŠ” ν–₯상 λ ˆμ΄μ–΄ λ””μ½”λ”©λΆ€;An enhancement layer decoding unit decoding the bitstream of the received enhancement layer received through the communication network using the information decoded by the base layer decoding unit; λ₯Ό ν¬ν•¨ν•˜λŠ” 것을 νŠΉμ§•μœΌλ‘œ ν•˜λŠ” 톡신 단말기.Communication terminal comprising a. λ‹€μ‹œμ  λΉ„λ””μ˜€ λΆ€ν˜Έν™” μž₯μΉ˜κ°€ λ‹€μ‹œμ  λΉ„λ””μ˜€λ₯Ό λΆ€ν˜Έν™”ν•˜λŠ” 방법에 μžˆμ–΄μ„œ,In the multi-view video encoding apparatus encodes a multi-view video, 볡수의 μ΄¬μ˜μˆ˜λ‹¨μ— μ˜ν•΄ 촬영된 λ‹€μ‹œμ  λ™μ˜μƒ μ€‘μ—μ„œ κΈ°λ³Έ λ™μ˜μƒμ„ κΈ°λ³Έ λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦ΌμœΌλ‘œ λΆ€ν˜Έν™”ν•˜λŠ” 단계; 및Encoding the base video into a base layer bitstream among the multi-view videos captured by the plurality of photographing means; And λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒμ— λŒ€ν•΄ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ˜ λΆ€ν˜Έν™” 정보λ₯Ό μ΄μš©ν•˜μ—¬ ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„μŠ€νŠΈμŠ€νŠΈλ¦ΌμœΌλ‘œ λΆ€ν˜Έν™”ν•˜λŠ” 단계;Encoding a video stream of another view using a encoding information of the base layer bitstream into a bitstream of an enhancement layer; λ₯Ό ν¬ν•¨ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ λΆ€ν˜Έν™” 방법.The multi-view video encoding method comprising a. (a) 방솑망을 톡해 μˆ˜μ‹ λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•˜λŠ” 쀑에 톡신망을 ν†΅ν•œ λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒ 좜λ ₯이 μš”μ²­λ˜λŠ” 경우, ν•΄λ‹Ή μ‹œμ  정보λ₯Ό ν¬ν•¨ν•˜λŠ” λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œμœΌλ‘œ μ „μ†‘ν•˜λŠ” 단계;(a) When a video output of another view through the communication network is requested while decoding and outputting a bitstream of the base layer received through the broadcasting network, transmitting a video request message including the view information to the multi-view video transmission system. step; (b) 상기 μš”μ²­λœ μ‹œμ μ— ν•΄λ‹Ήν•˜λŠ” ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ΄ 상기 λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œμœΌλ‘œλΆ€ν„° μˆ˜μ‹ λ˜λŠ” 경우, 상기 ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ 상기 κΈ°λ³Έ λ ˆμ΄μ–΄ λΉ„νŠΈμŠ€νŠΈλ¦Όμ˜ λ””μ½”λ”© 정보λ₯Ό μ΄μš©ν•˜μ—¬ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•˜λŠ” 단계;(b) when the bitstream of the enhancement layer corresponding to the requested time point is received from the multi-view video transmission system, decoding and outputting the bitstream of the enhancement layer using decoding information of the base layer bitstream. ; λ₯Ό ν¬ν•¨ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μˆ˜μ‹  방법.Receiving a multi-view video comprising a. 제10항에 μžˆμ–΄μ„œ, The method of claim 10, 상기 (a)λ‹¨κ³„λŠ”, In step (a), 방솑망을 톡해 μˆ˜μ‹ λœ κΈ°λ³Έ λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ„ λ””μ½”λ”©ν•˜μ—¬ 좜λ ₯ν•˜λŠ” 단계;Decoding and outputting a bitstream of a base layer received through a broadcasting network; μ‚¬μš©μžμ— μ˜ν•΄ 톡신망을 ν†΅ν•œ λ‹€λ₯Έ μ‹œμ μ˜ λ™μ˜μƒ 좜λ ₯이 μš”μ²­λ˜λŠ” 경우 ν•΄λ‹Ή μ‹œμ μ„ νŒλ‹¨ν•˜λŠ” 단계; 및Determining a corresponding point in time when a video output of another point in time is requested by the user; And 상기 νŒλ‹¨λœ μ‹œμ μ„ ν¬ν•¨ν•˜λŠ” λ™μ˜μƒ μš”μ²­ λ©”μ‹œμ§€λ₯Ό μƒμ„±ν•˜μ—¬ 상기 λ‹€μ‹œμ  λΉ„λ””μ˜€ 전솑 μ‹œμŠ€ν…œμœΌλ‘œ μ „μ†‘ν•˜λŠ” 단계;Generating a video request message including the determined time point and transmitting the generated video request message to the multi-view video transmission system; λ₯Ό ν¬ν•¨ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μˆ˜μ‹  방법.Receiving a multi-view video comprising a. 제10항에 μžˆμ–΄μ„œ, The method of claim 10, 상기 (b)λ‹¨κ³„μ—μ„œ, In step (b), 상기 λ””μ½”λ”©λœ ν–₯상 λ ˆμ΄μ–΄μ˜ λΉ„νŠΈμŠ€νŠΈλ¦Όμ€ PIP, POP, 전체 ν™”λ©΄ 쀑 적어도 ν•˜λ‚˜μ˜ 방법에 μ˜ν•΄ 좜λ ₯λ˜λŠ” 것을 νŠΉμ§•μœΌλ‘œ ν•˜λŠ” λ‹€μ‹œμ  λΉ„λ””μ˜€μ˜ μˆ˜μ‹  방법.The bitstream of the decoded enhancement layer is output by at least one method of PIP, POP, full screen.
PCT/KR2009/003190 2008-09-05 2009-06-15 Transmitting/receiving system and transmitting/receiving method for multi-view video Ceased WO2010027142A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR1020080087620A KR101012760B1 (en) 2008-09-05 2008-09-05 Transmitting and Receiving System and Method of Multiview Video
KR10-2008-0087620 2008-09-05

Publications (2)

Publication Number Publication Date
WO2010027142A2 true WO2010027142A2 (en) 2010-03-11
WO2010027142A3 WO2010027142A3 (en) 2010-04-29

Family

ID=41797607

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2009/003190 Ceased WO2010027142A2 (en) 2008-09-05 2009-06-15 Transmitting/receiving system and transmitting/receiving method for multi-view video

Country Status (2)

Country Link
KR (1) KR101012760B1 (en)
WO (1) WO2010027142A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2024077616A1 (en) * 2022-10-14 2024-04-18 OppoεΉΏδΈœη§»εŠ¨ι€šδΏ‘ζœ‰ι™ε…¬εΈ Coding and decoding method and coding and decoding apparatus, device, and storage medium

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2013157817A1 (en) * 2012-04-16 2013-10-24 μ‚Όμ„±μ „μž μ£Όμ‹νšŒμ‚¬ Method for multi-view video encoding based on tree structure encoding unit and apparatus for same, and method for multi-view video decoding based on tree structure encoding unit and apparatus for same

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100732961B1 (en) * 2005-04-01 2007-06-27 κ²½ν¬λŒ€ν•™κ΅ μ‚°ν•™ν˜‘λ ₯단 Multiview scalable image encoding, decoding method and its apparatus
MY159176A (en) * 2005-10-19 2016-12-30 Thomson Licensing Multi-view video coding using scalable video coding
US8699583B2 (en) * 2006-07-11 2014-04-15 Nokia Corporation Scalable video coding and decoding
JP5170786B2 (en) * 2006-10-25 2013-03-27 ιŸ“εœ‹ι›»ε­ι€šδΏ‘η ”η©Άι™’ Multi-view video scalable coding and decoding method, and coding and decoding apparatus

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2024077616A1 (en) * 2022-10-14 2024-04-18 OppoεΉΏδΈœη§»εŠ¨ι€šδΏ‘ζœ‰ι™ε…¬εΈ Coding and decoding method and coding and decoding apparatus, device, and storage medium

Also Published As

Publication number Publication date
KR20100028749A (en) 2010-03-15
WO2010027142A3 (en) 2010-04-29
KR101012760B1 (en) 2011-02-08

Similar Documents

Publication Publication Date Title
RU2530740C2 (en) Signalling characteristics of multiview video coding (mvc) operation point
CN109068136B (en) Image processing apparatus, image processing method, and computer-readable storage medium
WO2010053246A2 (en) Apparatus and method for synchronizing stereoscopic image, and apparatus and method for providing stereoscopic image based on the same
WO2010068020A2 (en) Multi- view video coding/decoding method and apparatus
KR20010020147A (en) Dual Compressed Video Bitstream Camera for Universal Serial Bus Connection
WO2011108903A2 (en) Method and apparatus for transmission and reception in the provision of a plurality of transport interactive 3dtv broadcasting services
WO2012064123A2 (en) Method and apparatus for determining a video compression standard in a 3dtv service
WO2012070875A2 (en) Method and apparatus for creating a media file for multilayer images in a multimedia system, and media-file-reproducing apparatus using same
WO2020189817A1 (en) Method and system for distributed decoding of split image for tile-based streaming
WO2013150943A1 (en) Decoder and decoding method, as well as encoder and encoding method
KR100606785B1 (en) Synchronization method of video and image data in remote multimedia service providing system through network
WO2016129981A1 (en) Method and device for transmitting/receiving media data
WO2015009098A1 (en) Method and apparatus for processing video signal
WO2014073877A1 (en) Method and apparatus for processing multiview video signal
WO2012144795A2 (en) Apparatus for outputting broadcast recorded by schedule recording and control method thereof
WO2015009091A1 (en) Method and apparatus for processing video signal
WO2010027142A2 (en) Transmitting/receiving system and transmitting/receiving method for multi-view video
WO2014054896A1 (en) Method and device for processing video signal
WO2014054897A1 (en) Method and device for processing video signal
RU2649758C2 (en) Image processing device and method
KR101096802B1 (en) Apparatus for mixing video stream and method thereof
WO2012141551A2 (en) High speed scalable video coding device and method using multi-track video
WO2014109563A1 (en) Method and apparatus for processing video signals
WO2018016879A1 (en) Method and device for providing 360-degree virtual reality broadcasting service
WO2014042459A1 (en) Method and apparatus for processing video signal

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09811641

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC

122 Ep: pct application non-entry in european phase

Ref document number: 09811641

Country of ref document: EP

Kind code of ref document: A2