[go: up one dir, main page]

TW202536802A - A tile selection method for tile-based 360-degree video streaming and computer-readable storage media - Google Patents

A tile selection method for tile-based 360-degree video streaming and computer-readable storage media

Info

Publication number
TW202536802A
TW202536802A TW113109308A TW113109308A TW202536802A TW 202536802 A TW202536802 A TW 202536802A TW 113109308 A TW113109308 A TW 113109308A TW 113109308 A TW113109308 A TW 113109308A TW 202536802 A TW202536802 A TW 202536802A
Authority
TW
Taiwan
Prior art keywords
quality
tile
combination
tiles
bandwidth
Prior art date
Application number
TW113109308A
Other languages
Chinese (zh)
Inventor
林喬文
陳怡孜
王永祥
周毓修
廖婉君
楊鈺清
林念珍
白莊
Original Assignee
中華電信股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中華電信股份有限公司 filed Critical 中華電信股份有限公司
Priority to TW113109308A priority Critical patent/TW202536802A/en
Publication of TW202536802A publication Critical patent/TW202536802A/en

Links

Landscapes

  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

A tile selection method for tile-based 360-degree video streaming and computer- readable storage media is provided, the method includes cutting a 360-Degree video into a plurality of tiles, and each tile encoded to a plurality of qualities; obtaining and storing importance of each tile; estimating a current bandwidth used for tile transmission according to past bandwidth record, bandwidth variation range and player buffer length; and obtaining a final tile quality combination according to a tile quality selection algorithm, outputting and storing the tiles in the final tile quality combination to a player buffer.

Description

基於圖塊全景影片串流之圖塊選擇方法及電腦可讀儲存介質A method for selecting tiles in a tile-based panoramic video streaming system and a computer-readable storage medium.

本發明是有關於基於圖塊全景影片串流之圖塊選擇,且特別是有關於一種基於圖塊全景影片串流之圖塊選擇方法及電腦可讀儲存介質。This invention relates to tile selection based on tile panoramic video streaming, and more particularly to a method for tile selection based on tile panoramic video streaming and a computer-readable storage medium.

隨著5G網路及AI人工智慧的發展,沉浸式(Immersive)科技結合VR虛擬實境的應用也日益蓬勃,顛覆了以往人們觀看影片的傳統思維。在沉浸式的應用裡,人們可搭配頭戴式裝置(Head Mounted Device, HMD),自由選擇觀看視角,感受到360度環場互動的虛擬體驗。With the development of 5G networks and AI artificial intelligence, the application of immersive technology combined with VR virtual reality is booming, subverting people's traditional thinking about watching videos. In immersive applications, people can use head-mounted devices (HMDs) to freely choose the viewing angle and experience a 360-degree surround interactive virtual experience.

目前業界在傳輸360度全景影片串流內容時,多採用全視角傳輸方法。該方法是將全部360度全景影像傳送到使用者終端,終端將全部視角的內容解碼後,依據當前視角,呈現相對應的影像內容。此方法的優點是延遲低,當使用者轉換視角,可立即切換影片的畫面,但缺點是易造成網路頻寬及終端運算資源的浪費,因需花費大量頻寬及運算量在沒有被播放的影像。Currently, the industry mostly uses the full-view transmission method when transmitting 360-degree panoramic video streaming content. This method transmits the entire 360-degree panoramic image to the user's terminal. The terminal decodes the content from all views and presents the corresponding image content according to the current view. The advantage of this method is low latency; when the user changes viewpoints, the video frame can be switched immediately. However, the disadvantage is that it easily leads to a waste of network bandwidth and terminal computing resources, as a large amount of bandwidth and computing power is required for images that are not being played.

為了解決頻寬浪費的議題,MPEG组織(Moving Picture Experts Group)在2015年上提出OMAF(Omnidirectional Media Application Format,全景影像的應用格式)標準,提出了FOV (Field-of-View)視角傳輸方式,來降低資料的遞送量。所謂的FOV視角傳輸方式,就是影音伺服器僅傳輸使用者正在觀看的視角內容。因考量到使用者透過終端設備觀看全景影像時,往往視線範圍只會落在全景影像中的一小部份區域,基於這個特性OMAF定義了基於圖塊(tile-based)編碼的規範:將全景影像切割成多個可獨立編碼成多種解析度的模塊,依據使用者觀看行為,動態調整欲傳送的合適模塊(在視角範圍內的影像以高解析度方式傳送,視角外的影像以低解析度方式傳輸),終端收到混合畫質影像後再進行解碼呈現畫面。此方法的優點是可大量減少頻寬,提升傳輸效能,缺點則是當視角變換時,會需要額外時間重新下載視角變換後所對應的高畫質影片,進而導致影像畫質會由差轉好、或畫面停頓的現象發生,進而對用戶產生暈眩及不適感。To address the issue of bandwidth waste, the Moving Picture Experts Group (MPEG) proposed the OMAF (Omnidirectional Media Application Format) standard in 2015, which introduced the FOV (Field-of-View) transmission method to reduce data transmission volume. The so-called FOV transmission method means that the video server only transmits the content from the perspective of the user's current viewing angle. Considering that when users view panoramic images through terminal devices, their field of vision often only falls on a small part of the panoramic image, OMAF defines a tile-based encoding specification based on this characteristic: the panoramic image is cut into multiple modules that can be independently encoded into multiple resolutions. According to the user's viewing behavior, the appropriate module to be transmitted is dynamically adjusted (the image within the viewpoint is transmitted in high resolution, and the image outside the viewpoint is transmitted in low resolution). The terminal receives the mixed image quality and then decodes and displays the image. The advantage of this method is that it can significantly reduce bandwidth and improve transmission performance. The disadvantage is that when the viewpoint changes, it will take extra time to re-download the corresponding high-definition video after the viewpoint change, which will cause the image quality to change from bad to good or the image to freeze, thus causing dizziness and discomfort to the user.

為確保能提供良好的360度全景影片之使用者觀看體驗,通常會以MTP(Motion To Photon)延遲做為評斷的標準;MTP指的是當使用者移動視角到顯示器呈現影像的時間延遲。MTP延遲愈短,使用者在VR設備的沉浸感體驗愈好,MTP延遲愈長,使用者在VR設備的不適感/暈眩感可能將會更為強烈。因此,在觀看全景影像時,一旦使用者轉移視角,使用者會在短時間內先看到低品質畫面,隨後再切換成高品質畫面,但倘若切換時間過久,使用者則可能會移轉到其他視角、甚至放棄收看,將嚴重影響使用者的收視體驗。To ensure a good viewing experience for users of 360-degree panoramic videos, MTP (Motion To Photon) latency is usually used as the evaluation standard. MTP refers to the time delay between when the user moves their viewpoint and when the image is displayed on the screen. The shorter the MTP latency, the better the user's immersive experience in the VR device; the longer the MTP latency, the more intense the user's discomfort/dizziness may be. Therefore, when watching panoramic images, once the user changes their viewpoint, they will see a low-quality image for a short time before switching to a high-quality image. However, if the switching time is too long, the user may switch to another viewpoint or even give up watching, which will seriously affect the user's viewing experience.

為了減少MTP延遲,視角預測是一個有效的做法。因使用者在觀看全景影像並移動視角的同時,終端需先取得使用者當前視角後,再由伺服器端下載對應的影音模塊播放。然而,每次下載皆需要經過請求、下載、解碼、播放等程序,這些程序皆需要額外的處理時間,進而增加MTP延遲。若能結合視角預測,高機率預測出使用者視角的移動,預先下載可能會播放的影音模塊並存入終端緩衝區內,後續當使用者切換到該視角時,終端設備就可直接存取緩衝內容,以減少MTP延遲、確保觀看品質。To reduce MTP latency, viewpoint prediction is an effective approach. When a user views a panoramic image and moves their viewpoint, the terminal needs to obtain the user's current viewpoint before the server downloads the corresponding audio/video module for playback. However, each download involves requesting, downloading, decoding, and playback, all of which require additional processing time, increasing MTP latency. By combining viewpoint prediction with the ability to predict user viewpoint movement with a high probability, and pre-downloading potentially playable audio/video modules and storing them in the terminal's buffer, the terminal can directly access the buffered content when the user switches to that viewpoint, reducing MTP latency and ensuring viewing quality.

雖然全景影片FOV視角傳輸方式搭配視角預測可以預先下載所預測之高、低品質圖塊組合,嘗試減少MTP延遲及維持觀看品質,但是此作法仍面臨實務上挑戰,如:依照視角預測結果預先下載對應圖塊品質組合時,若當時網路品質不穩定、無法完整下載既定圖塊組合時,將可能出現因部分圖塊未完整下載,而導致使用者觀看到的頭戴顯示器視野中出現部分黑屏畫面狀況而影響使用者體驗。Although the FOV (Field of View) transmission method for panoramic video, combined with view prediction, can pre-download the predicted combination of high and low quality tiles to try to reduce MTP latency and maintain viewing quality, this approach still faces practical challenges. For example, if the network quality is unstable and the predetermined combination of tiles cannot be downloaded completely when pre-downloading the corresponding tile combination based on the view prediction results, some tiles may not be downloaded completely, resulting in a black screen in the user's head-mounted display field of view, thus affecting the user experience.

本發明提供一種基於圖塊全景影片串流之圖塊選擇方法及電腦可讀儲存介質,可減少MTP延遲,確保不會因使用者轉頭之視角預測錯誤而有黑屏畫面,保證了使用者觀看全景影片的品質。This invention provides a method for selecting tiles based on tile-based panoramic video streaming and a computer-readable storage medium, which can reduce MTP latency and ensure that there will be no black screen due to incorrect prediction of the user's viewing angle, thus guaranteeing the quality of panoramic video viewing.

本發明的一種基於圖塊全景影片串流之圖塊選擇方法,包括:將全景影片切割成多個圖塊,且每個圖塊可編碼成複數品質;獲取並儲存各圖塊的重要度;依據過去頻寬記錄、頻寬變動幅度及終端影片暫存長度來預估當前用於圖塊傳輸的頻寬;以及依據圖塊品質選擇演算法來獲取最終圖塊品質組合,輸出並且儲存最終圖塊品質組合中的圖塊至終端影片暫存區。The present invention provides a tile selection method based on tile-based panoramic video streaming, comprising: cutting the panoramic video into multiple tiles, each tile being coded into multiple qualities; acquiring and storing the importance of each tile; estimating the current bandwidth used for tile transmission based on past bandwidth records, bandwidth variation range, and terminal video storage length; and acquiring the final tile quality combination according to a tile quality selection algorithm, outputting and storing the tiles in the final tile quality combination in the terminal video storage area.

本發明的一種電腦可讀儲存介質,其中,電腦可讀儲存介質用於儲存電腦程式,電腦程式被執行時,如基於圖塊全景影片串流之圖塊選擇方法所述的方法被執行。The present invention provides a computer-readable storage medium, wherein the computer-readable storage medium is used to store computer programs, and when the computer programs are executed, methods such as the tile selection method based on tile panoramic video streaming are executed.

基於上述,本發明提供一種基於圖塊全景影片串流之圖塊選擇方法及電腦可讀儲存介質,可依據預估的當前可用於圖塊傳輸的頻寬及頭戴式顯示器視野的視角預測結果,選擇最佳使用者體驗指標的圖塊品質組合,供終端設備預先下載至終端影片暫存區內,後續當使用者切換到該視角時,終端設備可直接存取終端影片暫存區內的緩衝內容進行播放,減少MTP延遲,確保不會因使用者轉頭之視角預測錯誤而有黑屏畫面,保證了使用者觀看全景影片的品質。Based on the above, this invention provides a method for selecting tiles based on tile-based panoramic video streaming and a computer-readable storage medium. It can select the optimal combination of tile quality indicators for user experience based on the estimated bandwidth currently available for tile transmission and the predicted view angle of the head-mounted display. This combination is pre-downloaded to the terminal device's video buffer. Subsequently, when the user switches to that view angle, the terminal device can directly access and play the buffered content in the terminal video buffer, reducing MTP latency and ensuring that there are no black screens due to incorrect prediction of the user's head-turning view angle, thus guaranteeing the quality of the panoramic video viewing experience.

為讓本發明的上述特徵和優點能更明顯易懂,下文特舉實施例,並配合所附圖式作詳細說明如下。To make the above features and advantages of this invention more apparent and understandable, specific examples are given below, and detailed explanations are provided in conjunction with the accompanying drawings.

本發明的部份實施例接下來將會配合附圖來詳細描述,以下的描述所引用的元件符號,當不同附圖出現相同的元件符號將視為相同或相似的元件。這些實施例只是本發明的一部份,並未揭示所有本發明的可實施方式。Some embodiments of the present invention will be described in detail below with reference to the accompanying drawings. Component symbols used in the following description are considered to be the same or similar components when the same component symbol appears in different accompanying drawings. These embodiments are only a part of the present invention and do not disclose all possible embodiments of the present invention.

說明書與發明申請專利範圍中所使用的序數例如「第一」、「第二」等之用詞用以修飾元件,其本身並不意含及代表該,或該些,組件有任何之前的序數,也不代表某一元件與另一元件的順序、或是製造方法上的順序,該些序數的使用僅用來使具有某命名的元件得以和另一具有相同命名的元件能作出清楚區分。發明申請專利範圍與說明書中可不使用相同用詞,據此,說明書中的第一構件在發明申請專利範圍中可能為第二構件。須知悉的是,以下所舉實施例可以在不脫離本揭露的精神下,將數個不同實施例中的技術特徵進行替換、重組、混合以完成其他實施例。The use of ordinal numbers such as "first" and "second" in the specification and scope of the invention application to modify components does not imply or represent any prior ordinal number for that component, nor does it represent the order of one component with another, or the order of manufacturing methods. The use of these ordinal numbers is solely for the purpose of clearly distinguishing one component with a given name from another component with the same name. The invention application and the specification may not use the same terminology; therefore, a first component in the specification may be a second component in the scope of the invention application. It should be understood that the following embodiments can be modified by substituting, recombining, or mixing technical features in several different embodiments to complete other embodiments without departing from the spirit of this disclosure.

圖1是依照本發明的一實施例的一種基於圖塊全景影片串流之圖塊選擇方法的流程圖。Figure 1 is a flowchart of a tile selection method based on a tile panoramic video stream according to an embodiment of the present invention.

本揭露的基於圖塊全景影片串流之圖塊選擇方法依據預估的當前可用於圖塊傳輸的頻寬及頭戴式顯示器視野的視角預測結果,選擇最佳使用者體驗指標的圖塊品質組合,供終端設備預先下載至終端影片暫存區內,後續當使用者切換到該視角時,終端設備就可直接存取終端影片暫存區內的緩衝內容進行播放。The tile selection method disclosed herein for tile-based panoramic video streaming selects the optimal combination of tile quality indicators based on the estimated bandwidth currently available for tile transmission and the predicted field of view of the head-mounted display. This combination is pre-downloaded to the terminal device's video buffer. Subsequently, when the user switches to that viewpoint, the terminal device can directly access and play the cached content in the terminal video buffer.

圖2是依據圖塊品質選擇演算法來獲取最終圖塊品質組合的流程圖。Figure 2 is a flowchart of obtaining the final combination of image quality based on the image quality selection algorithm.

請結合圖1及圖2,於步驟S101中,將全景影片切割成多個圖塊,且各圖塊編碼成複數品質,可例如編碼成高品質、低品質,本發明並不以此為限。Referring to Figures 1 and 2, in step S101, the panoramic video is cut into multiple tiles, and each tile is encoded into multiple qualities, such as high quality and low quality. This invention is not limited to this.

於步驟S102中,依據重要度公式獲取並儲存各圖塊的重要度(即被使用者看見之可能性或機率)。In step S102, the importance of each tile (i.e. the probability or likelihood of it being seen by the user) is obtained and stored according to the importance formula.

於步驟S103中,依據過去頻寬記錄、頻寬變動幅度及終端影片暫存長度來預估當前用於圖塊傳輸的頻寬。In step S103, the bandwidth currently used for tile transmission is estimated based on past bandwidth records, bandwidth variation range, and terminal video storage length.

於步驟S104中,依據圖塊品質選擇演算法來獲取最終圖塊品質組合,輸出並且儲存最終圖塊品質組合中的圖塊至終端影片暫存區,以確保不會因使用者轉頭之視角預測錯誤而有黑屏畫面,同時也根據三個使用者體驗指標(即平均圖塊品質、空間品質差異與時間品質差異)來提升高重要度圖塊的品質,最後達成節省頻寬並維持使用者體驗的目的。In step S104, the final tile quality combination is obtained based on the tile quality selection algorithm. The tiles in the final tile quality combination are output and stored in the terminal video storage area to ensure that there will be no black screen due to incorrect prediction of the user's head turning angle. At the same time, the quality of high-importance tiles is improved based on three user experience indicators (i.e. average tile quality, spatial quality difference, and temporal quality difference), ultimately achieving the goal of saving bandwidth and maintaining user experience.

再請結合圖2,為步驟S104的具體流程圖。步驟S104包括步驟S1041至S1045。Please refer to Figure 2 for a detailed flowchart of step S104. Step S104 includes steps S1041 to S1045.

具體而言,於步驟S1041中,選取圖塊中的多個低品質圖塊形成初始品質組合。Specifically, in step S1041, multiple low-quality tiles are selected to form an initial quality combination.

於步驟S1042中,建構與初始品質組合的圖塊總數相同的多個第一新品質組合。將初始品質組合的圖塊中的第一圖塊的品質由低品質調為高品質,並且將圖塊中除第一圖塊的圖塊與第一圖塊組合為第一新品質組合,將初始品質組合的圖塊中的第二圖塊的品質由低品質調為高品質,並且將圖塊中除第二圖塊的圖塊與第二圖塊組合為第一新品質組合,其中第一圖塊與第二圖塊為初始品質組合的圖塊中的不同圖塊。In step S1042, construct multiple new first quality combinations with the same total number of tiles as the initial quality combination. Adjust the quality of the first tile in the initial quality combination from low to high, and combine all tiles except the first tile with the first tile to form a new first quality combination. Adjust the quality of the second tile in the initial quality combination from low to high, and combine all tiles except the second tile with the second tile to form a new first quality combination. The first tile and the second tile are different tiles from the initial quality combination.

於步驟S1043中,獲取初始品質組合及第一新品質組合的使用者體驗指標,從初始品質組合及第一新品質組合中選取具有最佳化使用者體驗指標的品質組合作為更新後第一品質組合。In step S1043, user experience indicators of the initial quality combination and the first new quality combination are obtained, and the quality combination with the optimized user experience indicators is selected from the initial quality combination and the first new quality combination as the updated first quality combination.

於步驟S1044中,判斷將初始品質組合的圖塊中的任一個圖塊的品質由低品質調為高品質時是否會導致傳輸圖塊所需的頻寬超過當前用於圖塊傳輸的頻寬,或者,判斷將初始品質組合的圖塊中的任一個圖塊的品質由低品質調為高品質時是否會導致第一新品質組合的使用者體驗指標下降。In step S1044, it is determined whether changing the quality of any tile in the initial quality combination from low to high quality will cause the bandwidth required for tile transmission to exceed the current bandwidth used for tile transmission, or whether changing the quality of any tile in the initial quality combination from low to high quality will cause the user experience index of the first new quality combination to decrease.

倘若將初始品質組合的圖塊中的任一個圖塊的品質由低品質調為高品質會導致傳輸圖塊所需的頻寬超過當前用於圖塊傳輸的頻寬,或者,將初始品質組合的圖塊中的任一個圖塊的品質由低品質調為高品質會導致第一新品質組合的使用者體驗指標下降,則於步驟S1045中,將更新後第一品質組合作為最終圖塊品質組合,輸出並且儲存最終圖塊品質組合中的圖塊至終端影片暫存區。If changing the quality of any tile in the initial quality combination from low to high would cause the bandwidth required for tile transmission to exceed the current bandwidth used for tile transmission, or if changing the quality of any tile in the initial quality combination from low to high would cause the user experience index of the first new quality combination to decrease, then in step S1045, the updated first quality combination is used as the final tile quality combination, and the tiles in the final tile quality combination are output and stored in the terminal video buffer.

倘若將初始品質組合的圖塊中的任一個圖塊的品質由低品質調為高品質不會導致傳輸圖塊所需的頻寬超過當前用於圖塊傳輸的頻寬,或者,將初始品質組合的圖塊中的任一個圖塊的品質由低品質調為高品質不會導致第一新品質組合的使用者體驗指標下降,則返回至步驟S1042。If changing the quality of any tile in the initial quality combination from low to high will not cause the bandwidth required to transmit the tile to exceed the current bandwidth used for tile transmission, or if changing the quality of any tile in the initial quality combination from low to high will not cause a decrease in the user experience index of the first new quality combination, then return to step S1042.

於一實施例中,使用者體驗指標可包括平均圖塊品質、空間品質差異及時間品質差異。其中平均圖塊品質可為由圖塊重要度加權後計算之平均圖塊品質,品質指標使用峰值訊噪比(Peak signal-to-noise ratio,PSNR)表示。空間品質差異可為由圖塊重要度加權後計算之圖塊品質標準差。時間品質差異為當前平均圖塊品質與上一時段平均圖塊品質之間的差異大小。將平均圖塊品質、空間品質差異與時間品質差異三者結合後即可得到使用者體驗指標,平均圖塊品質、空間品質差異與時間品質差異必須同時進行最佳化,因三者會互相影響(例如降低空間品質差異可能導致平均圖塊品質下降)。In one embodiment, user experience metrics may include average tile quality, spatial quality variation, and temporal quality variation. Average tile quality may be the average tile quality calculated by weighting tile importance, and the quality metric is represented using peak signal-to-noise ratio (PSNR). Spatial quality variation may be the standard deviation of tile quality calculated by weighting tile importance. Temporal quality variation is the magnitude of the difference between the current average tile quality and the average tile quality of the previous time period. By combining average tile quality, spatial quality variation, and temporal quality variation, a user experience index can be obtained. Average tile quality, spatial quality variation, and temporal quality variation must be optimized simultaneously, as they can affect each other (for example, reducing spatial quality variation may lead to a decrease in average tile quality).

下文中,將搭配圖3至圖8說明本發明圖1至圖2的實施例所述之方法。本方法的各個流程可依照實施情形而隨之調整,且並不僅限於此。The method described in the embodiments of Figures 1 and 2 of this invention will be explained below with reference to Figures 3 to 8. The various processes of this method may be adjusted according to the implementation situation, and are not limited thereto.

圖3是依照本發明的一實施例的使用者轉頭之視角預測模型在某時刻所預測之頭戴式顯示器視野中心與水平線的示意圖。圖4是依照本發明的一實施例的大圓距離的示意圖。Figure 3 is a schematic diagram of the center of the head-mounted display's field of view and the horizontal line predicted by a user's head-turning angle prediction model at a certain moment according to an embodiment of the present invention. Figure 4 is a schematic diagram of the great circle distance according to an embodiment of the present invention.

請結合圖3及圖4,於一實施例中,由於越靠近頭戴式顯示器視野302的視野中心Q與水平線303之圖塊301有更高的機率被使用者注視,因此圖塊301的圖塊中心P與水平線303之間的弧度差 越小以及圖塊中心P與頭戴式顯示器視野302的視野中心Q之間的大圓距離 越小的圖塊的重要度越高,而重要度越高的圖塊以高品質呈現的機率也越高,從而提升使用者體驗。 Referring to Figures 3 and 4, in one embodiment, since the tile 301 closer to the center Q of the field of view 302 of the head-mounted display and the horizontal line 303 is more likely to be viewed by the user, the curvature difference between the tile center P of tile 301 and the horizontal line 303 is... The smaller the distance between the center P of the tile and the center Q of the field of view of the head-mounted display 302, the greater the great circle distance. Smaller tiles are more important, and more important tiles are more likely to be presented in high quality, thus improving the user experience.

如圖3所示,將全景影片30切割成寬6個、高3個的圖塊總數為18個的圖塊,每個圖塊可設置為高品質與低品質,其中高品質的圖塊可消耗的頻寬為2 Mbps且PSNR為50;低品質的圖塊可消耗的頻寬為1 Mbps且PSNR為40。於本實施例中,以一使用者轉頭之視角預測模型在某時刻所預測之頭戴式顯示器視野302的視野中心Q為星號所示位置,該頭戴式顯示器視野302覆蓋了4個圖塊。根據上述條件對圖1至圖8所述方法各步驟進行實作說明。As shown in Figure 3, the panoramic video 30 is divided into 18 tiles, each 6 pixels wide and 3 pixels high. Each tile can be set to high quality or low quality. The high-quality tile consumes 2 Mbps of bandwidth and has a PSNR of 50; the low-quality tile consumes 1 Mbps of bandwidth and has a PSNR of 40. In this embodiment, the center Q of the field of view 302 of the head-mounted display, predicted by a user's head-turning angle prediction model at a certain moment, is shown as the position indicated by the asterisk. The field of view 302 of the head-mounted display covers 4 tiles. The steps of the method described in Figures 1 to 8 are explained in practice based on the above conditions.

於步驟S102中,具體而言,依據各圖塊的圖塊中心3011與頭戴式顯示器的視野中心之間的大圓距離、各圖塊中心與水平線之間的弧度差以及重要度公式來計算各圖塊的該重要度,其中重要度公式為: …公式1 In step S102, specifically, the importance of each tile is calculated based on the great circle distance between the center 3011 of each tile and the field of view center of the head-mounted display, the difference in curvature between the center of each tile and the horizontal line, and the importance formula, wherein the importance formula is: …Formula 1

其中 為圖塊中心與水平線之間的弧度差(即圖塊中心在球面上之緯度), 為圖塊中心與頭戴式顯示器的視野中心之間的大圓距離(球體40的半徑為1),大圓距離為從球面的一點出發到達球面上的另一點所經過的最短路徑(如圖4中P到Q)的長度, 為圖塊的重要度。於本實施例中,共有18個圖塊,故 (圖塊的編號順序如表一所示圖3各圖塊之編號順序 )。 in This is the difference in radians between the center of the tile and the horizontal line (i.e., the latitude of the tile center on the sphere). Let be the great circle distance between the center of the tile and the center of the field of view of the head-mounted display (the radius of sphere 40 is 1). The great circle distance is the length of the shortest path (from P to Q in Figure 4) from one point on the sphere to another point on the sphere. This represents the importance of the tiles. In this embodiment, there are a total of 18 tiles, therefore... (The tile numbering order is shown in Table 1. Figure 3 shows the tile numbering order.) ).

表一: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 Table 1: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18

於本實施例中,圖3中各圖塊之 分別如以下表二、表三所示,則可將各圖塊之 套入公式1即可計算出圖3中各圖塊的重要度 (結果如以下表四所示)。 In this embodiment, the tiles in Figure 3 are... , As shown in Tables 2 and 3 below, the tiles can be... , The importance of each tile in Figure 3 can be calculated by plugging in Formula 1. (The results are shown in Table 4 below).

表二:各圖塊之 0.73 0.73 0.73 0.73 0.73 0.73 0 0 0 0 0 0 0.73 0.73 0.73 0.73 0.73 0.73 Table 2: Each tile 0.73 0.73 0.73 0.73 0.73 0.73 0 0 0 0 0 0 0.73 0.73 0.73 0.73 0.73 0.73

表三:各圖塊之 2.517 2.407 1.652 1.036 1.115 1.793 2.660 2.487 1.477 0.482 0.655 1.664 2.106 2.027 1.349 0.624 0.735 1.489 Table 3: Each tile 2.517 2.407 1.652 1.036 1.115 1.793 2.660 2.487 1.477 0.482 0.655 1.664 2.106 2.027 1.349 0.624 0.735 1.489

表四:各圖塊的重要度 0.334 0.349 0.509 0.812 0.754 0.469 0.591 0.632 1.063 3.262 2.398 0.944 0.399 0.415 0.624 1.348 1.145 0.565 Table 4: Importance of Each Piece 0.334 0.349 0.509 0.812 0.754 0.469 0.591 0.632 1.063 3.262 2.398 0.944 0.399 0.415 0.624 1.348 1.145 0.565

其中頭戴式顯示器視野所覆蓋的4個圖塊的區域(如表四虛線框標示),虛線框左上角的圖塊因與水平線303、頭戴式顯示器視野302的視野中心Q最為接近,故計算得到之重要度(3.262)最高,以此類推,虛線框右上角的圖塊第二重要(重要度為2.398),接著依序為虛線框左下的圖塊(重要度為1.348)與虛線框右下的圖塊(重要度為1.145)。上述之各圖塊重要度,將用於步驟S104中使用者體驗指標的計算。Of the four tile areas covered by the head-mounted display's field of view (as indicated by the dashed boxes in Table 4), the tile in the upper left corner of the dashed box has the highest importance (3.262) because it is closest to the horizontal line 303 and the center Q of the head-mounted display's field of view 302. Similarly, the tile in the upper right corner of the dashed box is the second most important (importance 2.398), followed by the tile in the lower left corner (importance 1.348) and the tile in the lower right corner (importance 1.145). The importance of each tile will be used in the calculation of user experience indicators in step S104.

於步驟S103中,具體而言,依據過去頻寬記錄、頻寬變動幅度、終端影片暫存長度及公式2來獲取當前用於圖塊傳輸的頻寬,其中公式2為: …公式2 In step S103, specifically, the current bandwidth used for tile transmission is obtained based on past bandwidth records, bandwidth variation range, terminal video temporary storage length, and Formula 2, where Formula 2 is: …Formula 2

其中 為預估網路頻寬, 為終端影片暫存長度, 為終端目標影片暫存長度, 為單一圖塊影片長度, 為網路穩定度指標。 in To estimate network bandwidth, To extend the video's length on the terminal, To temporarily store the target video length on the terminal, For the length of a single tile video, It is an indicator of network stability.

依據公式3獲取網路穩定度指標 ,公式3為: …公式3 Obtain network stability indicators according to Formula 3. Formula 3 is: …Formula 3

其中 為過去 時間內的頻寬記錄之平均值, 為過去 時間內的頻寬記錄之標準差。網路穩定度指標 越高代表網路越穩定。 in For the past The average bandwidth recorded over a period of time. For the past Standard deviation of bandwidth records over a period of time. A network stability indicator. The higher the value, the more stable the network.

於本實施例中,可設置 秒、 秒、 ,則依據公式3可計算出 ,依據公式2可計算出 ,表示所有圖塊總共不得使用超過22 Mbps之頻寬。 In this embodiment, it is possible to set Second, Second, , , Then, according to Formula 3, the result can be calculated. According to Formula 2, the following can be calculated: This means that all tiles must not use a total bandwidth exceeding 22 Mbps.

於步驟S104中,具體而言,依據公式4獲取平均圖塊品質,依據公式5獲取空間品質差異。 …公式4 …公式5 In step S104, specifically, the average tile quality is obtained according to Formula 4, and the spatial quality difference is obtained according to Formula 5. …Formula 4 …Formula 5

其中 為平均圖塊品質, 為空間品質差異, 為圖塊編號 在品質編號 時的峰值訊噪比(Peak signal-to-noise ratio,PSNR), 為圖塊i的重要度,品質編號 越大表示圖塊的品質越好。於本實施例中,圖塊可設置為高品質或低品質,即 為低品質; 為高品質。 in For average tile quality, Due to differences in space quality, Number the blocks In quality number Peak signal-to-noise ratio (PSNR) at that time. Quality number to indicate the importance of tile i A larger value indicates a higher quality tile. In this embodiment, the tile can be set to high quality or low quality, i.e. ; Low quality; For high quality.

依據公式6獲取時間品質差異, …公式6 According to Formula 6, the difference in quality over time is obtained. …Formula 6

其中 為時間品質差異, 為前次之平均圖塊品質, 為平均圖塊品質。 in Due to time and quality differences, The average tile quality from the previous time. This represents the average tile quality.

依據平均圖塊品質、空間品質差異、時間品質差異及公式7獲取使用者體驗指標, …公式7 User experience indicators are obtained based on average tile quality, spatial quality variation, temporal quality variation, and Formula 7. …Formula 7

其中, 為使用者體驗指標, 分別為空間品質差異與時間品質差異在使用者體驗指標中的權重。 in, As a user experience indicator, and These represent the weights of spatial quality differences and temporal quality differences in user experience indicators.

以下結合圖5至圖8來具體說明步驟S104中第一輪至第四輪圖塊品質選擇作業。The following, in conjunction with Figures 5 to 8, will specifically explain the first to fourth rounds of tile quality selection in step S104.

圖5是依照本發明的一實施例的第一輪圖塊品質選擇作業的示意圖。Figure 5 is a schematic diagram of the first round of tile quality selection operation according to an embodiment of the present invention.

請參考圖5,於第一輪圖塊品質選擇作業中,可設置初始 ,則步驟S1041應產生18個新品質組合,但於本實施例中,簡化為被頭戴式顯示器視野302覆蓋之4個圖塊組成的初始品質組合501進行說明,而其他未覆蓋的14個圖塊皆為低品質,可消耗1Mbps頻寬,依次將初始品質組合501中的4個圖塊的品質由低品質調高為高品質,因此會分別建構或產生4個第一新品質組合502,並且此4個第一新品質組合502均與初始品質組合501中圖塊的品質有一個高品質圖塊的些微差異。對此4個第一新品質組合502及初始品質組合501計算其各圖塊之使用者體驗指標。 Please refer to Figure 5. In the first round of tile quality selection, you can set the initial settings. , If step S1041 should generate 18 new quality combinations, in this embodiment, it is simplified to an initial quality combination 501 consisting of 4 tiles covered by the head-mounted display's field of view 302, while the other 14 uncovered tiles are all of low quality and consume 1 Mbps of bandwidth. The quality of the 4 tiles in the initial quality combination 501 is sequentially increased from low to high quality, thus constructing or generating 4 first new quality combinations 502. Each of these 4 first new quality combinations 502 differs slightly from the quality of one high-quality tile in the initial quality combination 501. The user experience indicators for each tile in these 4 first new quality combinations 502 and the initial quality combination 501 are calculated.

頭戴式顯示器視野覆蓋區域(圖5中虛線框內的第一新品質組合503)之左上角的圖塊由低品質轉為高品質且虛線框內的第一新品質組合503所產生之使用者體驗指標最高,此時結合上述如表四所示之各圖塊重要度數值 及如表五所示之圖塊品質 The top-left corner of the head-mounted display's field of view (the first new quality combination 503 within the dashed box in Figure 5) changes from low quality to high quality, and the user experience index generated by the first new quality combination 503 within the dashed box is the highest. At this point, the importance values of each tile as shown in Table 4 above are considered. and the quality of the tiles as shown in Table 5 .

表五:第一輪各圖塊品質之PSNR 40 40 40 40 40 40 40 40 40 50 40 40 40 40 40 40 40 40 Table 5: PSNR of each tile quality in the first round 40 40 40 40 40 40 40 40 40 50 40 40 40 40 40 40 40 40

虛線框內的第一新品質組合503之平均圖塊品質可由公式4獲取 ;空間品質差異可由公式5算出 ;時間品質差異可由公式6算出 ;使用者體驗指標可由公式7算出 The average tile quality of the first new quality combination 503 within the dashed box can be obtained using Formula 4. The difference in spatial quality can be calculated using formula 5. The time-quality difference can be calculated using formula 6. User experience metrics can be calculated using Formula 7. .

由於虛線框內的第一新品質組合503包含1個高品質且使用頻寬2Mbps的圖塊及17個低品質且使用頻寬1Mbps的圖塊(如表六所示),故虛線框內的第一新品質組合503共使用頻寬為2*1+1*17=19Mbps,由於19Mbps仍小於步驟S103中算出可用於圖塊傳輸之頻寬大小( ),換言之,第一輪圖塊品質選擇作業中,將初始品質組合的任一個圖塊的品質調為高品質時不會導致傳輸圖塊所需的頻寬超過當前用於圖塊傳輸的頻寬,故第一輪圖塊品質選擇作業所選出的更新後的第一新品質組合504可作為第二輪圖塊品質選擇作業的初始品質組合601。 Since the first new quality combination 503 within the dashed frame contains one high-quality tile using 2Mbps bandwidth and 17 low-quality tiles using 1Mbps bandwidth (as shown in Table 6), the total bandwidth used by the first new quality combination 503 within the dashed frame is 2*1 + 1*17 = 19Mbps. Since 19Mbps is still less than the bandwidth available for tile transmission calculated in step S103 (…),… In other words, in the first round of tile quality selection, setting the quality of any tile in the initial quality combination to high quality will not cause the bandwidth required for tile transmission to exceed the current bandwidth used for tile transmission. Therefore, the updated first new quality combination 504 selected in the first round of tile quality selection can be used as the initial quality combination 601 for the second round of tile quality selection.

表六:第一輪各圖塊使用之頻寬(單位: Mbps) 1 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1 Table 6: Bandwidth used by each tile in the first round (unit: Mbps) 1 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1 1 1

圖6是依照本發明的一實施例的第二輪圖塊品質選擇作業的示意圖。Figure 6 is a schematic diagram of the second round of tile quality selection operation according to an embodiment of the present invention.

請參考圖6,於第二輪圖塊品質選擇作業中,依次將初始品質組合601中的低品質的3個圖塊調高為高品質,因此會分別建構或產生3個第一新品質組合602,並且此3個第一新品質組合602均與初始品質組合601中圖塊的品質有一個高品質圖塊的些微差異。Please refer to Figure 6. In the second round of tile quality selection, the three low-quality tiles in the initial quality combination 601 are upgraded to high quality in turn. As a result, three first new quality combinations 602 will be constructed or generated respectively. Each of these three first new quality combinations 602 has a slight difference in quality from the tiles in the initial quality combination 601, which is one high-quality tile.

頭戴式顯示器視野覆蓋區域(圖6中虛線框內的第一新品質組合603)之右上角的圖塊由低品質轉為高品質且虛線框內的第一新品質組合603所產生之使用者體驗指標最高,此時結合上述如表四所示之各圖塊重要度數值 及如表七所示之圖塊品質 The top right corner of the head-mounted display's field of view (the first new quality combination 603 within the dashed box in Figure 6) changes from low quality to high quality, and the user experience index generated by the first new quality combination 603 within the dashed box is the highest. At this point, the importance values of each tile as shown in Table 4 above are considered. and the quality of the tiles as shown in Table 7 .

表七:第二輪各圖塊品質之PSNR 40 40 40 40 40 40 40 40 40 50 50 40 40 40 40 40 40 40 Table 7: PSNR of each tile quality in the second round 40 40 40 40 40 40 40 40 40 50 50 40 40 40 40 40 40 40

虛線框內的第一新品質組合603之平均圖塊品質可由公式4獲取 ;空間品質差異可由公式5算出 ;時間品質差異可由公式6算出 ;使用者體驗指標可由公式7算出 The average tile quality of the first new quality combination 603 within the dashed box can be obtained using Formula 4. The difference in spatial quality can be calculated using formula 5. The time-quality difference can be calculated using formula 6. User experience metrics can be calculated using Formula 7. .

由於虛線框內的第一新品質組合603包含2個高品質且使用頻寬2Mbps的圖塊及16個低品質且使用頻寬1Mbps的圖塊(如表八所示),故虛線框內的第一新品質組合603共使用頻寬為2*2+1*16=20Mbps。由於仍小於步驟S103中算出可用於圖塊傳輸之頻寬大小( ),換言之,第二輪圖塊品質選擇作業中,將初始品質組合的任一個圖塊的品質調為高品質時不會導致傳輸圖塊所需的頻寬超過當前用於圖塊傳輸的頻寬,且使用者體驗指標 較第一輪提升( ),故第二輪圖塊品質選擇作業所選出的更新後的第一新品質組合604可作為第三輪圖塊品質選擇作業的初始品質組合701。 Since the first new quality combination 603 within the dashed frame contains 2 high-quality tiles using 2Mbps bandwidth and 16 low-quality tiles using 1Mbps bandwidth (as shown in Table 8), the total bandwidth used by the first new quality combination 603 within the dashed frame is 2*2 + 1*16 = 20Mbps. This is still smaller than the bandwidth available for tile transmission calculated in step S103. In other words, during the second round of tile quality selection, setting the quality of any tile in the initial quality combination to high quality will not cause the bandwidth required for tile transmission to exceed the current bandwidth used for tile transmission, and user experience indicators will remain unchanged. Compared to the first round of improvements ( Therefore, the updated first new quality combination 604 selected in the second round of tile quality selection can be used as the initial quality combination 701 in the third round of tile quality selection.

表八:第二輪各圖塊使用之頻寬(單位: Mbps) 1 1 1 1 1 1 1 1 1 2 2 1 1 1 1 1 1 1 Table 8: Bandwidth used by each tile in the second round (unit: Mbps) 1 1 1 1 1 1 1 1 1 2 2 1 1 1 1 1 1 1

圖7是依照本發明的一實施例的第三輪圖塊品質選擇作業的示意圖。Figure 7 is a schematic diagram of the third round of tile quality selection operation according to an embodiment of the present invention.

請參考圖7,於第三輪圖塊品質選擇作業中,依次將初始品質組合701中的低品質的2個圖塊調高為高品質,因此會分別建構或產生2個第一新品質組合702,並且此2個第一新品質組合702均與初始品質組合701中圖塊的品質有一個高品質圖塊的些微差異。Please refer to Figure 7. In the third round of tile quality selection, the two low-quality tiles in the initial quality combination 701 are upgraded to high quality in turn. As a result, two first new quality combinations 702 will be constructed or generated respectively. The quality of these two first new quality combinations 702 is slightly different from that of the high-quality tiles in the initial quality combination 701.

頭戴式顯示器視野覆蓋區域(圖7中虛線框內的第一新品質組合703)之左下角的圖塊由低品質轉為高品質且虛線框內的第一新品質組合703所產生之使用者體驗指標最高,此時結合上述如表四所示之各圖塊重要度數值 及如表九所示之圖塊品質 The lower left corner of the head-mounted display's field of view (the first new quality combination 703 within the dashed box in Figure 7) changes from low quality to high quality, and the user experience index generated by the first new quality combination 703 within the dashed box is the highest. At this point, the importance values of each tile as shown in Table 4 above are considered. and the quality of the tiles as shown in Table 9 .

表九:第三輪各圖塊品質之PSNR 40 40 40 40 40 40 40 40 40 50 50 40 40 40 40 50 40 40 Table 9: PSNR of each tile quality in the third round 40 40 40 40 40 40 40 40 40 50 50 40 40 40 40 50 40 40

虛線框內的第一新品質組合703之平均圖塊品質可由公式4獲取 ;空間品質差異可由公式5算出 ;時間品質差異可由公式6算出 ;使用者體驗指標可由公式7算出 The average tile quality of the first new quality combination 703 within the dashed box can be obtained using Formula 4. The difference in spatial quality can be calculated using formula 5. The time-quality difference can be calculated using formula 6. User experience metrics can be calculated using Formula 7. .

由於虛線框內的第一新品質組合703包含3個高品質且使用頻寬2Mbps的圖塊及15個低品質且使用頻寬1Mbps的圖塊(如表十所示),故虛線框內的第一新品質組合703共使用頻寬為2*3+1*15=21Mbps。由於仍小於步驟S103中算出可用於圖塊傳輸之頻寬大小( ),換言之,第三輪圖塊品質選擇作業中,將初始品質組合的任一個圖塊的品質調為高品質時不會導致傳輸圖塊所需的頻寬超過當前用於圖塊傳輸的頻寬,且使用者體驗指標 較第二輪提升( ),故第三輪圖塊品質選擇作業所選出的更新後的第一新品質組合704可作為第四輪圖塊品質選擇作業的初始品質組合801。 Since the first new quality combination 703 within the dashed frame contains 3 high-quality tiles using 2Mbps bandwidth and 15 low-quality tiles using 1Mbps bandwidth (as shown in Table 10), the total bandwidth used by the first new quality combination 703 within the dashed frame is 2*3 + 1*15 = 21Mbps. This is still less than the bandwidth available for tile transmission calculated in step S103. In other words, during the third round of tile quality selection, setting the quality of any tile in the initial quality combination to high quality will not cause the bandwidth required for tile transmission to exceed the current bandwidth used for tile transmission, and user experience indicators will remain unchanged. Compared to the second round of improvements ( Therefore, the updated first new quality combination 704 selected in the third round of tile quality selection can be used as the initial quality combination 801 in the fourth round of tile quality selection.

表十:第三輪各圖塊使用之頻寬(單位: Mbps) 1 1 1 1 1 1 1 1 1 2 2 1 1 1 1 2 1 1 Table 10: Bandwidth used by each tile in the third round (unit: Mbps) 1 1 1 1 1 1 1 1 1 2 2 1 1 1 1 2 1 1

圖8是依照本發明的一實施例的第四輪圖塊品質選擇作業的示意圖。Figure 8 is a schematic diagram of the fourth round of tile quality selection operation according to an embodiment of the present invention.

請參考圖8,於第四輪圖塊品質選擇作業中,將初始品質組合801中的低品質的1個圖塊調高為高品質,因此會建構或產生1個第一新品質組合802,並且此第一新品質組合802與初始品質組合801中圖塊的品質有一個高品質圖塊的些微差異。Please refer to Figure 8. In the fourth round of tile quality selection, one low-quality tile in the initial quality combination 801 is upgraded to high quality. This will construct or generate a first new quality combination 802. The quality of this first new quality combination 802 is slightly different from that of the high-quality tile in the initial quality combination 801.

頭戴式顯示器視野覆蓋區域(圖8中虛線框內的第一新品質組合803)之右下角的圖塊由低品質轉為高品質且虛線框內的第一新品質組合803所產生之使用者體驗指標最高,此時結合上述如表四所示之各圖塊重要度數值 及如表十一所示之圖塊品質 The tile in the lower right corner of the head-mounted display's field of view (the first new quality combination 803 within the dashed box in Figure 8) changes from low quality to high quality, and the user experience index generated by the first new quality combination 803 within the dashed box is the highest. At this point, the importance values of each tile as shown in Table 4 above are considered. and the quality of the tiles as shown in Table 11 .

表十一:第四輪各圖塊品質之PSNR 40 40 40 40 40 40 40 40 40 50 50 40 40 40 40 50 50 40 Table 11: PSNR of each tile quality in the fourth round 40 40 40 40 40 40 40 40 40 50 50 40 40 40 40 50 50 40

虛線框內的第一新品質組合803之平均圖塊品質可由公式4獲取 ;空間品質差異可由公式5算出 ;時間品質差異可由公式6算出 ;使用者體驗指標可由公式7算出 The average tile quality of the first new quality combination 803 within the dashed box can be obtained using Formula 4. The difference in spatial quality can be calculated using formula 5. The time-quality difference can be calculated using formula 6. User experience metrics can be calculated using Formula 7. .

由於虛線框內的第一新品質組合803包含4個高品質且使用頻寬2Mbps的圖塊及14個低品質且使用頻寬1Mbps的圖塊(如表十二所示),故虛線框內的第一新品質組合803共使用頻寬為2*4+1*14=22Mbps。雖然使用者體驗指標 較第三輪提升( ),但由於已用盡可用於圖塊傳輸之頻寬大小( ),所以不再滿足步驟S1044中的判斷條件,故停止圖塊選擇作業並將更新後第一品質組合804作為最終圖塊品質組合,輸出並且儲存更新後第一品質組合804中的圖塊至終端影片暫存區。 Since the first new quality combination 803 within the dashed box contains 4 high-quality tiles using 2Mbps bandwidth and 14 low-quality tiles using 1Mbps bandwidth (as shown in Table XII), the total bandwidth used by the first new quality combination 803 within the dashed box is 2*4 + 1*14 = 22Mbps. Although user experience indicators... Compared to the third round of improvements ( However, since the bandwidth available for tile transmission has been exhausted ( Therefore, the judgment condition in step S1044 is no longer met, so the tile selection operation is stopped and the updated first quality combination 804 is used as the final tile quality combination. The tiles in the updated first quality combination 804 are output and stored in the terminal video temporary storage area.

表十二:第四輪各圖塊使用之頻寬(單位: Mbps) 1 1 1 1 1 1 1 1 1 2 2 1 1 1 1 2 2 1 Table 12: Bandwidth used by each tile in the fourth round (unit: Mbps) 1 1 1 1 1 1 1 1 1 2 2 1 1 1 1 2 2 1

本申請實施例還提供一種電腦可讀儲存介質,該電腦可讀儲存介質中儲存有電腦程式代碼,當上述處理器執行該電腦程式代碼時,電子設備執行前述任一實施例中的方法。This application embodiment also provides a computer-readable storage medium storing computer program code, wherein when the processor executes the computer program code, the electronic device executes the method in any of the foregoing embodiments.

本領域技術人員應該可以意識到,在上述一個或多個示例中,本申請所描述的功能可以用硬體、軟體、固件或它們的任意組合來實現。當使用軟體實現時,可以將這些功能儲存在電腦可讀介質中或者作為電腦可讀介質上的一個或多個指令或代碼進行傳輸。電腦可讀介質包括電腦可讀儲存介質和通信介質,其中通信介質包括便於從一個地方向另一個地方傳送電腦程式的任何介質。儲存介質可以是通用或專用電腦能夠存取的任何可用介質。Those skilled in the art will appreciate that, in one or more of the examples above, the functions described in this application can be implemented using hardware, software, firmware, or any combination thereof. When implemented using software, these functions can be stored in a computer-readable medium or transmitted as one or more instructions or code on a computer-readable medium. Computer-readable media include computer-readable storage media and communication media, wherein communication media include any medium that facilitates the transmission of computer programs from one location to another. Storage media can be any available medium accessible to general-purpose or special-purpose computers.

基於上述,本發明提供一種基於圖塊全景影片串流之圖塊選擇方法及電腦可讀儲存介質,可依據預估的當前可用於圖塊傳輸的頻寬及頭戴式顯示器視野的視角預測結果,選擇最佳使用者體驗指標的圖塊品質組合,供終端設備預先下載至終端影片暫存區內,後續當使用者切換到該視角時,終端設備可直接存取終端影片暫存區內的緩衝內容進行播放,減少MTP延遲,確保不會因使用者轉頭之視角預測錯誤而有黑屏畫面,保證了使用者觀看全景影片的品質。Based on the above, this invention provides a method for selecting tiles based on tile-based panoramic video streaming and a computer-readable storage medium. It can select the optimal combination of tile quality indicators for user experience based on the estimated bandwidth currently available for tile transmission and the predicted view angle of the head-mounted display. This combination is pre-downloaded to the terminal device's video buffer. Subsequently, when the user switches to that view angle, the terminal device can directly access and play the buffered content in the terminal video buffer, reducing MTP latency and ensuring that there are no black screens due to incorrect prediction of the user's head-turning view angle, thus guaranteeing the quality of the panoramic video viewing experience.

雖然本發明已以實施例揭露如上,然其並非用以限定本發明,任何所屬技術領域中具有通常知識者,在不脫離本發明的精神和範圍內,當可作些許的更動與潤飾,故本發明的保護範圍當視後附的申請專利範圍所界定者為準。Although the present invention has been disclosed above by way of embodiments, it is not intended to limit the present invention. Anyone with ordinary skill in the art may make some modifications and refinements without departing from the spirit and scope of the present invention. Therefore, the scope of protection of the present invention shall be determined by the appended patent application.

30:全景影片 301:圖塊 302:頭戴式顯示器視野 303:水平線 3011:圖塊中心 40:球體 501、601、701、801:初始品質組合 502、503、602、603、702、703、802、803:第一新品質組合 504、604、704、804:更新後的第一新品質組合 S101、S102、S103、S104、S1041、S1042、S1043、S1044、S1045:步驟 30: Panoramic Video 301: Tile 302: Head-Mounted Display Field of View 303: Horizon Line 3011: Tile Center 40: Sphere 501, 601, 701, 801: Initial Quality Combination 502, 503, 602, 603, 702, 703, 802, 803: First New Quality Combination 504, 604, 704, 804: Updated First New Quality Combination S101, S102, S103, S104, S1041, S1042, S1043, S1044, S1045: Steps

圖1是依照本發明的一實施例的一種基於圖塊全景影片串流之圖塊選擇方法的流程圖。 圖2是依據圖塊品質選擇演算法來獲取最終圖塊品質組合的流程圖。 圖3是依照本發明的一實施例的使用者轉頭之視角預測模型在某時刻所預測之頭戴式顯示器視野中心與水平線的示意圖。 圖4是依照本發明的一實施例的大圓距離的示意圖。 圖5是依照本發明的一實施例的第一輪圖塊品質選擇作業的示意圖。 圖6是依照本發明的一實施例的第二輪圖塊品質選擇作業的示意圖。 圖7是依照本發明的一實施例的第三輪圖塊品質選擇作業的示意圖。 圖8是依照本發明的一實施例的第四輪圖塊品質選擇作業的示意圖。 Figure 1 is a flowchart of a tile selection method based on a tile panoramic video stream according to an embodiment of the present invention. Figure 2 is a flowchart of obtaining the final tile quality combination according to a tile quality selection algorithm. Figure 3 is a schematic diagram of the center of the head-mounted display's field of view and the horizontal line predicted by a user's head-turning view angle prediction model at a certain moment according to an embodiment of the present invention. Figure 4 is a schematic diagram of the great circle distance according to an embodiment of the present invention. Figure 5 is a schematic diagram of the first round of tile quality selection operation according to an embodiment of the present invention. Figure 6 is a schematic diagram of the second round of tile quality selection operation according to an embodiment of the present invention. Figure 7 is a schematic diagram of the third round of tile quality selection operation according to an embodiment of the present invention. Figure 8 is a schematic diagram of the fourth round of tile quality selection according to an embodiment of the present invention.

S101、S102、S103、S104:步驟 S101, S102, S103, S104: Steps

Claims (6)

一種基於圖塊全景影片串流之圖塊選擇方法,包括: 將全景影片切割成多個圖塊,且各該圖塊編碼成複數品質; 獲取並儲存各該圖塊的重要度; 依據過去頻寬記錄、頻寬變動幅度及終端影片暫存長度來預估當前用於該些圖塊傳輸的頻寬;以及 依據圖塊品質選擇演算法來獲取最終圖塊品質組合,輸出並且儲存該最終圖塊品質組合中的該些圖塊至終端影片暫存區。 A method for tile selection based on tile-based panoramic video streaming includes: segmenting the panoramic video into multiple tiles, and encoding each tile into multiple qualities; acquiring and storing the importance of each tile; estimating the bandwidth currently used for transmitting the tiles based on past bandwidth records, bandwidth variation, and terminal video storage length; and obtaining a final tile quality combination according to a tile quality selection algorithm, outputting and storing the tiles in the final tile quality combination in the terminal video storage area. 如請求項1所述的圖塊選擇方法,其中該獲取並儲存各該圖塊的該重要度的步驟中包括: 依據各該圖塊的圖塊中心與頭戴式顯示器的視野中心之間的大圓距離以及各該圖塊中心與水平線之間的弧度差來獲取各該圖塊的該重要度。 The tile selection method as described in claim 1, wherein the step of acquiring and storing the importance of each tile includes: acquiring the importance of each tile based on the great circle distance between the center of each tile and the center of the field of view of the head-mounted display, and the difference in curvature between the center of each tile and the horizontal line. 如請求項1所述的圖塊選擇方法,其中該依據該過去頻寬記錄、該頻寬變動幅度及該終端影片暫存長度來預估當前用於圖塊傳輸的該頻寬的步驟中包括: 依據過去時間間隔內的頻寬記錄之平均值,該過去時間間隔內的頻寬記錄之標準差來獲取網路穩定度指標, 依據該過去頻寬記錄、該頻寬變動幅度、該終端影片暫存長度、終端目標影片暫存長度、單一圖塊影片長度及該網路穩定度指標來獲取當前用於圖塊傳輸的該頻寬。 The tile selection method as described in claim 1, wherein the step of estimating the current bandwidth used for tile transmission based on the past bandwidth records, the bandwidth variation range, and the terminal video storage length includes: Obtaining a network stability index based on the average value of bandwidth records within past time intervals and the standard deviation of bandwidth records within the past time intervals; Obtaining the current bandwidth used for tile transmission based on the past bandwidth records, the bandwidth variation range, the terminal video storage length, the terminal target video storage length, the length of a single tile video, and the network stability index. 如請求項2所述的圖塊選擇方法,其中該依據該圖塊品質選擇演算法來獲取該最終圖塊品質組合,輸出並且儲存該最終圖塊品質組合中的該些圖塊至該終端影片暫存區的步驟中包括: 選取該些圖塊中的多個低品質圖塊形成初始品質組合; 建構與該初始品質組合的圖塊總數相同的多個第一新品質組合,將該初始品質組合的該些圖塊中的第一圖塊的品質由低品質調為高品質,並且將該些圖塊中除該第一圖塊的圖塊與該第一圖塊組合為該第一新品質組合,將該初始品質組合的該些圖塊中的第二圖塊的品質由該低品質調為該高品質,並且將該些圖塊中除該第二圖塊的圖塊與該第二圖塊組合為該第一新品質組合,其中該第一圖塊與該第二圖塊為該初始品質組合的該些圖塊中的不同圖塊; 獲取該初始品質組合及該些第一新品質組合的使用者體驗指標,從該初始品質組合及該些第一新品質組合中選取具有最佳化使用者體驗指標的品質組合作為更新後第一品質組合;以及 倘若將該初始品質組合的該些圖塊中的任一個圖塊的品質調為高品質會導致傳輸該些圖塊所需的頻寬超過當前用於圖塊傳輸的該頻寬,或者,將該初始品質組合的該些圖塊中的任一個圖塊的品質由該低品質調為該高品質會導致該些第一新品質組合的該使用者體驗指標下降,則將該更新後第一品質組合作為該最終圖塊品質組合,輸出並且儲存該最終圖塊品質組合中的該些圖塊至該終端影片暫存區。 The tile selection method as described in claim 2, wherein the step of obtaining the final tile quality combination according to the tile quality selection algorithm, outputting and storing the tiles in the final tile quality combination to the terminal video temporary storage area includes: Selecting multiple low-quality tiles from the selected tiles to form an initial quality combination; Construct multiple new first quality combinations with the same total number of tiles as the initial quality combination. Adjust the quality of the first tile in the initial quality combination from low to high quality, and combine the tiles from these new combinations (excluding the first tile) with the first tile to form the new first quality combination. Adjust the quality of the second tile in the initial quality combination from low to high quality, and combine the tiles from these new combinations (excluding the second tile) with the second tile to form the new first quality combination. The first tile and the second tile are different tiles from the initial quality combination. Obtain user experience metrics for the initial quality mix and the first new quality mixes; select the quality mix with optimized user experience metrics from the initial quality mix and the first new quality mixes to form the updated first quality mix; and If upgrading the quality of any tile in the initial quality combination to high quality would cause the bandwidth required to transmit those tiles to exceed the current bandwidth used for tile transmission, or if upgrading the quality of any tile in the initial quality combination from low quality to high quality would cause a decrease in the user experience index of those first new quality combinations, then the updated first quality combination is designated as the final tile quality combination, and the tiles in the final tile quality combination are output and stored in the terminal video storage area. 如請求項4所述的圖塊選擇方法,其中該獲取該初始品質組合及該些第一新品質組合的該使用者體驗指標,從該初始品質組合及該些第一新品質組合中選取具有該最佳化使用者體驗指標的品質組合作為該更新後第一品質組合的步驟中,包括: 依據 編號 在品質編號 時的峰值訊噪比(Peak signal-to-noise ratio,PSNR)及該圖塊的該重要度獲取平均圖塊品質, 依據 編號 在該品質編號 時的該PSNR、該圖塊的該重要度及該平均圖塊品質獲取空間品質差異, 依據前次之平均圖塊品質及該平均圖塊品質獲取時間品質差異, 依據該平均圖塊品質、該空間品質差異及該時間品質差異獲取該使用者體驗指標。 The tile selection method as described in claim 4, wherein the step of obtaining the user experience index of the initial quality combination and the first new quality combinations, and selecting the quality combination with the optimized user experience index from the initial quality combination and the first new quality combinations as the updated first quality combination, includes: Based on Number In quality number The average tile quality is obtained by using the peak signal-to-noise ratio (PSNR) and the importance of the tile at that time. Number In this quality number The spatial quality difference is obtained based on the PSNR, the importance of the tile, and the average tile quality. The temporal quality difference is obtained based on the previous average tile quality and the average tile quality. The user experience index is obtained based on the average tile quality, the spatial quality difference, and the temporal quality difference. 一種電腦可讀儲存介質,其中,該電腦可讀儲存介質用於儲存電腦程式,該電腦程式被執行時,如請求項1-5任一項所述的方法被執行。A computer-readable storage medium, wherein the computer-readable storage medium is used to store a computer program, which, when executed, performs the method described in any one of claims 1-5.
TW113109308A 2024-03-13 2024-03-13 A tile selection method for tile-based 360-degree video streaming and computer-readable storage media TW202536802A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW113109308A TW202536802A (en) 2024-03-13 2024-03-13 A tile selection method for tile-based 360-degree video streaming and computer-readable storage media

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW113109308A TW202536802A (en) 2024-03-13 2024-03-13 A tile selection method for tile-based 360-degree video streaming and computer-readable storage media

Publications (1)

Publication Number Publication Date
TW202536802A true TW202536802A (en) 2025-09-16

Family

ID=97831406

Family Applications (1)

Application Number Title Priority Date Filing Date
TW113109308A TW202536802A (en) 2024-03-13 2024-03-13 A tile selection method for tile-based 360-degree video streaming and computer-readable storage media

Country Status (1)

Country Link
TW (1) TW202536802A (en)

Similar Documents

Publication Publication Date Title
US11694316B2 (en) Method and apparatus for determining experience quality of VR multimedia
JP4996603B2 (en) Video game system using pre-encoded macroblocks
JP2022141586A (en) Cloud Gaming GPU with Integrated NIC and Shared Frame Buffer Access for Low Latency
TWI786572B (en) Immersive media providing method and acquiring method, device, equipment and storage medium
US11120615B2 (en) Dynamic rendering of low frequency objects in a virtual reality system
CN104639951A (en) Video bitstream frame extraction process and device
US20250023936A1 (en) Method and apparatus for processing media stream, computer device, and storage medium
CN110858388B (en) Method and device for enhancing video quality
US10002644B1 (en) Restructuring video streams to support random access playback
WO2023071469A1 (en) Video processing method, electronic device and storage medium
CN113286146B (en) Media data processing method, device, equipment and storage medium
TW202536802A (en) A tile selection method for tile-based 360-degree video streaming and computer-readable storage media
US11134236B2 (en) Image processing device and system
CN115942049B (en) VR video-oriented visual angle switching method, device, equipment and medium
CN117579843B (en) Video coding processing method and electronic equipment
CN115706828A (en) Data processing method and device, equipment and storage medium
WO2021193361A1 (en) Image data transfer device, image display system, and image transfer method
WO2021199128A1 (en) Image data transfer device, image generation method and computer program
CN116170609B (en) Video transcoding method, device, live broadcast server, terminal device and storage medium
JP2015519824A (en) A mechanism that facilitates cost-effective and low-latency video stream coding
JP7349863B2 (en) Video processing method, video processing device, and program
HK40076011B (en) Immersive media providing method, acquisition method, device, equipment and storage medium
WO2025161977A1 (en) Rendering method and apparatus, and device-cloud collaborative system
CN119545109A (en) Video playback method, device and equipment
WO2025112925A1 (en) Screen mirroring method, electronic device, storage medium, and program product