[go: up one dir, main page]

TWI880378B - Voice conference system and voice conference method - Google Patents

Voice conference system and voice conference method Download PDF

Info

Publication number
TWI880378B
TWI880378B TW112137880A TW112137880A TWI880378B TW I880378 B TWI880378 B TW I880378B TW 112137880 A TW112137880 A TW 112137880A TW 112137880 A TW112137880 A TW 112137880A TW I880378 B TWI880378 B TW I880378B
Authority
TW
Taiwan
Prior art keywords
invitee
voice
initiator
encryption
key
Prior art date
Application number
TW112137880A
Other languages
Chinese (zh)
Other versions
TW202516492A (en
Inventor
涂雅晴
朱瑞琪
張峻嘉
周自強
邱仲毅
Original Assignee
中華電信股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中華電信股份有限公司 filed Critical 中華電信股份有限公司
Priority to TW112137880A priority Critical patent/TWI880378B/en
Application granted granted Critical
Publication of TWI880378B publication Critical patent/TWI880378B/en
Publication of TW202516492A publication Critical patent/TW202516492A/en

Links

Images

Landscapes

  • Telephonic Communication Services (AREA)

Abstract

A voice conference system and a voice conference method are provided. The method includes: a first-invitee-device uses a first-invitee-voice, a first-voice-coding-method associated with a first-network, an encryption-and-decryption-algorithm, and a first-key to obtain a first-encoded-and-encrypted-invitee-voice; an initiator-device uses the first-encoded-and-encrypted-invitee-voice, the encryption-and-decryption-algorithm, the first-key, and the first-voice-coding-method to obtain the first-invitee-voice; the initiator-device performs mixing on an initiator-voice to obtain a mixing-result; the initiator-device uses the mixing-result, the first-voice-coding-method, the encryption-and-decryption-algorithm, and the first-key to obtain an encoded-and-encrypted-mixed-result; and the first-invitee-device uses the encoded-and-encrypted-mixed-result, the encryption-and-decryption-algorithm and the first-key to obtain an encoded-mixed-result.

Description

語音會議系統及方法Voice conferencing system and method

本發明是有關於一種語音會議系統及方法。The present invention relates to a voice conferencing system and method.

目前VoIP的語音會議大多是由終端裝置對語音進行加密,接著由伺服器依序進行解密、混音以及加密。然後,伺服器才傳送至終端裝置,以由終端裝置解密並播放。此種作法容易產生資安問題。另一方面,若需在語音會議中加入新成員,則所有終端裝置都需要重新進行金鑰協商流程,因此較耗時且複雜度高。Currently, most VoIP voice conferences are encrypted by the terminal device, and then decrypted, mixed and encrypted by the server in sequence. The server then transmits the voice to the terminal device for decryption and playback. This approach is prone to security issues. On the other hand, if a new member needs to be added to the voice conference, all terminal devices need to re-negotiate the key process, which is time-consuming and complicated.

本發明的語音會議系統包括發起者裝置、伺服器以及多個被邀請者裝置。發起者裝置對應於發起者語音。伺服器通訊連接至發起者裝置。多個被邀請者裝置包括第一被邀請者裝置,且第一被邀請者裝置經由第一網路通訊連接至伺服器,且第一被邀請者裝置對應於第一被邀請者語音,其中當發起者裝置以及第一被邀請者裝置完成語音會議註冊流程以及第一金鑰協商流程時,第一被邀請者裝置利用第一被邀請者語音、關聯於第一網路的第一語音編碼方式、加解密演算法以及第一金鑰獲得第一編碼與加密後被邀請者語音;發起者裝置利用第一編碼與加密後被邀請者語音、加解密演算法、第一金鑰以及第一語音編碼方式獲得第一被邀請者語音;發起者裝置對發起者語音執行混音以獲得混音結果;發起者裝置利用混音結果、第一語音編碼方式、加解密演算法以及第一金鑰獲得編碼與加密後混音結果;第一被邀請者裝置利用編碼與加密後混音結果、加解密演算法以及第一金鑰獲得編碼後混音結果。The voice conference system of the present invention includes an initiator device, a server, and a plurality of invitee devices. The initiator device corresponds to the initiator voice. The server is communicatively connected to the initiator device. The plurality of invitee devices include a first invitee device, and the first invitee device is communicatively connected to the server via a first network, and the first invitee device corresponds to the first invitee voice, wherein when the initiator device and the first invitee device complete the voice conference registration process and the first key negotiation process, the first invitee device uses the first invitee voice, the first voice encoding method associated with the first network, the encryption and decryption algorithm, and the first key to obtain the first coded and encrypted invitee voice The initiator device obtains the first invitee voice by using the first coded and encrypted invitee voice, the encryption and decryption algorithm, the first key and the first voice encoding method; the initiator device performs mixing on the initiator voice to obtain a mixing result; the initiator device obtains the coded and encrypted mixed result by using the mixing result, the first voice encoding method, the encryption and decryption algorithm and the first key; the first invitee device obtains the coded mixed result by using the coded and encrypted mixed result, the encryption and decryption algorithm and the first key.

本發明的語音會議方法,適於由包括發起者裝置、伺服器以及多個被邀請者裝置的語音會議系統執行,其中發起者裝置對應於發起者語音,其中多個被邀請者裝置包括第一被邀請者裝置,且第一被邀請者裝置經由第一網路通訊連接至伺服器,且第一被邀請者裝置對應於第一被邀請者語音,其中所述方法包括以下步驟:當發起者裝置以及第一被邀請者裝置完成語音會議註冊流程以及第一金鑰協商流程時,由第一被邀請者裝置利用第一被邀請者語音、關聯於第一網路的第一語音編碼方式、加解密演算法以及第一金鑰獲得第一編碼與加密後被邀請者語音;由發起者裝置利用第一編碼與加密後被邀請者語音、加解密演算法、第一金鑰以及第一語音編碼方式獲得第一被邀請者語音;由發起者裝置對發起者語音執行混音以獲得混音結果;由發起者裝置利用混音結果、第一語音編碼方式、加解密演算法以及第一金鑰獲得編碼與加密後混音結果;以及由第一被邀請者裝置利用編碼與加密後混音結果、加解密演算法以及第一金鑰獲得編碼後混音結果。The voice conference method of the present invention is suitable for being executed by a voice conference system including an initiator device, a server and a plurality of invitee devices, wherein the initiator device corresponds to the initiator voice, wherein the plurality of invitee devices include a first invitee device, and the first invitee device is connected to the server via a first network communication, and the first invitee device corresponds to the first invitee voice, wherein the method comprises the following steps: when the initiator device and the first invitee device complete the voice conference registration process and the first key negotiation process, the first invitee device uses the first invitee voice, the first network associated with the first network, and the first key negotiation process to register the voice conference. The first coded and encrypted invitee voice is obtained by using a voice coding method, an encryption and decryption algorithm, and a first key; the initiator device obtains the first invitee voice using the first coded and encrypted invitee voice, an encryption and decryption algorithm, a first key, and the first voice coding method; the initiator device performs mixing on the initiator voice to obtain a mixing result; the initiator device obtains the coded and encrypted mixed result by using the mixing result, the first voice coding method, an encryption and decryption algorithm, and a first key; and the first invitee device obtains the coded mixed result by using the coded and encrypted mixed result, an encryption and decryption algorithm, and the first key.

圖1是根據本發明的一實施例繪示的語音會議系統的示意圖。本發明的語音會議系統可包括發起者裝置、伺服器以及多個被邀請者裝置。伺服器可通訊連接至發起者裝置。所述多個被邀請者裝置的每一者可通訊連接至伺服器。詳細而言,所述多個被邀請者裝置可包括第一被邀請者裝置,且第一被邀請者裝置可經由第一網路通訊連接至伺服器。進一步而言,所述多個被邀請者裝置可包括第二被邀請者裝置,且第二被邀請者裝置可經由第二網路通訊連接至伺服器。需說明的是,圖1所示的被邀請者裝置的數量僅為示意,本發明不對此限制。FIG1 is a schematic diagram of a voice conferencing system according to an embodiment of the present invention. The voice conferencing system of the present invention may include an initiator device, a server, and multiple invitee devices. The server may be communicatively connected to the initiator device. Each of the multiple invitee devices may be communicatively connected to the server. In detail, the multiple invitee devices may include a first invitee device, and the first invitee device may be communicatively connected to the server via a first network. Further, the multiple invitee devices may include a second invitee device, and the second invitee device may be communicatively connected to the server via a second network. It should be noted that the number of invitee devices shown in FIG1 is for illustration only, and the present invention is not limited thereto.

發起者裝置可包括發起者註冊單元111、發起者錄音單元112、發起者收送單元113、發起者解密單元114、發起者加密單元115以及混音單元116。伺服器可包括伺服器端註冊單元210以及轉送單元220。所述多個被邀請者裝置的每一者可包括被邀請者註冊單元121、被邀請者錄音單元122、被邀請者收送單元123、被邀請者加密單元124以及被邀請者解密單元125。The initiator device may include an initiator registration unit 111, an initiator recording unit 112, an initiator transceiver unit 113, an initiator decryption unit 114, an initiator encryption unit 115, and a mixing unit 116. The server may include a server-side registration unit 210 and a transfer unit 220. Each of the plurality of invitee devices may include an invitee registration unit 121, an invitee recording unit 122, an invitee transceiver unit 123, an invitee encryption unit 124, and an invitee decryption unit 125.

發起者裝置、伺服器以及所述多個被邀請者裝置的每一者可包括通訊晶片(未繪示於圖中)、儲存裝置(未繪示於圖中)及處理模組(未繪示於圖中)。Each of the initiator device, the server, and the plurality of invitee devices may include a communication chip (not shown in the figure), a storage device (not shown in the figure), and a processing module (not shown in the figure).

圖2 是根據本發明的一實施例繪示的語音會議註冊流程及金鑰協商流程的流程圖。請同時參照圖1及圖2。為了便於說明,圖2僅繪示發起者裝置、伺服器以及第一被邀請者裝置執行語音會議註冊流程及金鑰協商流程的流程。在本實施例中,(第一被邀請者裝置的)被邀請者註冊單元121、發起者註冊單元111以及伺服器端註冊單元210可執行語音會議註冊流程。FIG. 2 is a flow chart of a voice conference registration process and a key negotiation process according to an embodiment of the present invention. Please refer to FIG. 1 and FIG. 2 at the same time. For the sake of convenience, FIG. 2 only shows the process of the initiator device, the server, and the first invitee device executing the voice conference registration process and the key negotiation process. In this embodiment, the invitee registration unit 121 (of the first invitee device), the initiator registration unit 111, and the server-side registration unit 210 can execute the voice conference registration process.

詳細而言,在步驟S201中,發起者註冊單元111可傳送註冊資訊至伺服器端註冊單元210。Specifically, in step S201, the initiator registration unit 111 may transmit registration information to the server registration unit 210.

在步驟S202中,發起者註冊單元111可從伺服器端註冊單元210接收對應於上述註冊資訊的註冊成功通知。In step S202, the initiator registration unit 111 may receive a registration success notification corresponding to the above registration information from the server-side registration unit 210.

在步驟S203中,發起者註冊單元111可經由伺服器端註冊單元210傳送推播通知至(第一被邀請者裝置的)被邀請者註冊單元121。接著,被邀請者註冊單元121可振鈴以通知第一被邀請者。In step S203, the initiator register 111 may send a push notification to the invitee register 121 (of the first invitee's device) via the server register 210. Then, the invitee register 121 may ring to notify the first invitee.

在步驟S204中,(第一被邀請者裝置的)被邀請者註冊單元121可傳送註冊資訊至伺服器端註冊單元210。In step S204 , the invitee registration unit 121 (of the first invitee device) may transmit the registration information to the server registration unit 210 .

在步驟S205中,(第一被邀請者裝置的)被邀請者註冊單元121可從伺服器端註冊單元210接收對應於(步驟S204中)註冊資訊的註冊成功通知。In step S205, the invitee registration unit 121 (of the first invitee device) may receive a registration success notification corresponding to the registration information (in step S204) from the server-side registration unit 210.

在步驟S206中,(第一被邀請者裝置的)被邀請者註冊單元121可經由伺服器端註冊單元210從發起者註冊單元111接收Invite訊息。接著,(第一被邀請者裝置的)被邀請者註冊單元121可並進行接通語音會議功能的工作。In step S206, the invitee registration unit 121 (of the first invitee device) may receive the Invite message from the initiator registration unit 111 via the server registration unit 210. Then, the invitee registration unit 121 (of the first invitee device) may perform the work of connecting the voice conference function.

在執行完步驟S201~步驟S206的語音會議註冊流程之後,發起者裝置、伺服器以及第一被邀請者裝置將繼續執行金鑰協商流程。詳細而言,在步驟S207中,發起者加密單元115可經由轉送單元220傳送金鑰交換請求至(第一被邀請者裝置的)被邀請者加密單元124。After executing the voice conference registration process of step S201 to step S206, the initiator device, the server and the first invitee device will continue to execute the key negotiation process. Specifically, in step S207, the initiator encryption unit 115 can transmit the key exchange request to the invitee encryption unit 124 (of the first invitee device) via the transfer unit 220.

在步驟S208中,(第一被邀請者裝置的)被邀請者加密單元124、發起者加密單元115以及轉送單元220可執行第一金鑰協商流程。接著,當第一被邀請者裝置的被邀請者加密單元124、發起者加密單元115以及轉送單元220完成第一金鑰協商流程時,第一被邀請者裝置的被邀請者加密單元124可獲得第一金鑰,且發起者加密單元115可獲得第一金鑰。In step S208, the invitee encryption unit 124 (of the first invitee device), the initiator encryption unit 115, and the transfer unit 220 may perform a first key negotiation process. Then, when the invitee encryption unit 124, the initiator encryption unit 115, and the transfer unit 220 of the first invitee device complete the first key negotiation process, the invitee encryption unit 124 of the first invitee device may obtain the first key, and the initiator encryption unit 115 may obtain the first key.

在執行完步驟S207~步驟S208的金鑰協商流程之後,在步驟S209中,發起者裝置、伺服器以及第一被邀請者裝置可開始進行語音會議。詳細而言,發起者收送單元113以及第一被邀請者裝置的被邀請者收送單元123可接收封包,並且可基於UDP/RTP傳輸協定來進行封包拆解,以開始進行加密語音會議。After executing the key negotiation process of step S207 to step S208, in step S209, the initiator device, the server and the first invitee device can start the voice conference. Specifically, the initiator transceiver unit 113 and the invitee transceiver unit 123 of the first invitee device can receive the packet and can decompose the packet based on the UDP/RTP transmission protocol to start the encrypted voice conference.

圖3是根據本發明的一實施例繪示的語音會議的流程圖。請同時參照圖1、圖2及圖3。為了便於說明,圖3僅繪示發起者裝置、伺服器以及第一被邀請者裝置執行語音會議的流程。在本實 例中,發起者裝置可對應於發起者語音。另一方面,第一被邀請者裝置可對應於第一被邀請者語音。當發起者裝置以及第一被邀請者裝置完成語音會議註冊流程以及第一金鑰協商流程時,第一被邀請者裝置可利用第一被邀請者語音、關聯於第一網路的第一語音編碼方式、加解密演算法以及第一金鑰獲得第一編碼與加密後被邀請者語音。以下將繼續說明。FIG. 3 is a flow chart of a voice conference according to an embodiment of the present invention. Please refer to FIG. 1, FIG. 2 and FIG. 3 at the same time. For the sake of explanation, FIG. 3 only shows the process of the initiator device, the server and the first invitee device performing the voice conference. In this example, the initiator device may correspond to the initiator voice. On the other hand, the first invitee device may correspond to the first invitee voice. When the initiator device and the first invitee device complete the voice conference registration process and the first key negotiation process, the first invitee device may use the first invitee voice, the first voice encoding method associated with the first network, the encryption and decryption algorithm and the first key to obtain the first coded and encrypted invitee voice. The following will continue to explain.

在步驟S301中,發起者錄音單元112可傳送發起者語音至混音單元116。詳細而言,發起者錄音單元112可收錄來自發起者的類比訊號形式的發起者語音,並且將類比訊號形式的發起者語音轉換為數位訊號形式的發起者語音。然後,發起者錄音單元112可傳送數位訊號的發起者語音至混音單元116。另一方面,第一被邀請者裝置的被邀請者錄音單元122可收錄來自第一被邀請者的第一被邀請者語音,並且將類比訊號形式的第一被邀請者語音轉換為數位訊號形式的第一被邀請者語音。然後,第一被邀請者裝置的被邀請者錄音單元122可根據第一網路的第一網路環境來選擇出第一語音編碼方式。然後,第一被邀請者裝置的被邀請者錄音單元122可利用(關聯於第一網路的)第一語音編碼方式對第一被邀請者語音執行編碼以獲得第一編碼後被邀請者語音。在獲得第一編碼後被邀請者語音之後,第一被邀請者裝置的被邀請者錄音單元122可傳送第一編碼後被邀請者語音至第一被邀請者裝置的被邀請者加密單元124。In step S301, the originator recording unit 112 may transmit the originator's voice to the mixing unit 116. Specifically, the originator recording unit 112 may record the originator's voice in the form of an analog signal from the originator, and convert the originator's voice in the form of an analog signal into the originator's voice in the form of a digital signal. Then, the originator recording unit 112 may transmit the originator's voice in the form of a digital signal to the mixing unit 116. On the other hand, the invitee recording unit 122 of the first invitee device may record the first invitee's voice from the first invitee, and convert the first invitee's voice in the form of an analog signal into the first invitee's voice in the form of a digital signal. Then, the invitee recording unit 122 of the first invitee device may select a first voice encoding method according to the first network environment of the first network. Then, the invitee recording unit 122 of the first invitee device may encode the first invitee voice using the first voice encoding method (associated with the first network) to obtain the first encoded invitee voice. After obtaining the first encoded invitee voice, the invitee recording unit 122 of the first invitee device may transmit the first encoded invitee voice to the invitee encryption unit 124 of the first invitee device.

在步驟S302中,第一被邀請者裝置的被邀請者加密單元124可利用加解密演算法以及第一金鑰對第一編碼後被邀請者語音執行加密以獲得第一編碼與加密後被邀請者語音。然後,第一被邀請者裝置的被邀請者加密單元124可傳送第一編碼與加密後被邀請者語音至第一被邀請者裝置的被邀請者收送單元123。In step S302, the invitee encryption unit 124 of the first invitee device may encrypt the first coded invitee voice using an encryption algorithm and a first key to obtain the first coded and encrypted invitee voice. Then, the invitee encryption unit 124 of the first invitee device may transmit the first coded and encrypted invitee voice to the invitee transceiver unit 123 of the first invitee device.

在步驟S303中,第一被邀請者裝置的被邀請者收送單元123可經由轉送單元220傳送第一編碼與加密後被邀請者語音至發起者收送單元113。詳細而言,第一被邀請者裝置的被邀請者收送單元123可基於UDP/RTP傳輸協定傳送第一編碼與加密後被邀請者語音至轉送單元220。接著,轉送單元220可基於UDP/RTP傳輸協定傳送第一編碼與加密後被邀請者語音至發起者收送單元113。In step S303, the invitee transceiver unit 123 of the first invitee device may transmit the first coded and encrypted invitee voice to the initiator transceiver unit 113 via the forwarding unit 220. Specifically, the invitee transceiver unit 123 of the first invitee device may transmit the first coded and encrypted invitee voice to the forwarding unit 220 based on the UDP/RTP transmission protocol. Then, the forwarding unit 220 may transmit the first coded and encrypted invitee voice to the initiator transceiver unit 113 based on the UDP/RTP transmission protocol.

在步驟S304中,發起者收送單元113可獲得第一編碼與加密後被邀請者語音。詳細而言,發起者收送單元113可基於UDP/RTP傳輸協定來進行封包拆解以獲得第一編碼與加密後被邀請者語音。然後,發起者收送單元113可傳送第一編碼與加密後被邀請者語音至發起者解密單元114。In step S304, the initiator transceiver unit 113 may obtain the first coded and encrypted invitee voice. Specifically, the initiator transceiver unit 113 may perform packet decomposition based on the UDP/RTP transmission protocol to obtain the first coded and encrypted invitee voice. Then, the initiator transceiver unit 113 may transmit the first coded and encrypted invitee voice to the initiator decryption unit 114.

在本實施例中,發起者裝置可利用第一編碼與加密後被邀請者語音、加解密演算法、第一金鑰以及(前述步驟S301中第一被邀請者裝置的被邀請者錄音單元122所使用的)第一語音編碼方式獲得第一被邀請者語音。然後,發起者裝置可對發起者語音執行混音以獲得混音結果。然後,發起者裝置可利用混音結果、第一語音編碼方式、加解密演算法以及第一金鑰獲得編碼與加密後混音結果。以下將繼續說明。In this embodiment, the initiator device can obtain the first invitee voice using the first coded and encrypted invitee voice, the encryption and decryption algorithm, the first key, and the first voice encoding method (used by the invitee recording unit 122 of the first invitee device in the aforementioned step S301). Then, the initiator device can perform mixing on the initiator voice to obtain a mixing result. Then, the initiator device can obtain the coded and encrypted mixing result using the mixing result, the first voice encoding method, the encryption and decryption algorithm, and the first key. The following will continue to explain.

在步驟S305中,發起者解密單元114可利用加解密演算法以及第一金鑰對第一編碼與加密後被邀請者語音執行解密以獲得第一編碼後被邀請者語音。然後,發起者解密單元114可傳送第一編碼後被邀請者語音至混音單元116。In step S305, the initiator decryption unit 114 may use the encryption and decryption algorithm and the first key to decrypt the first coded and encrypted invitee voice to obtain the first coded invitee voice. Then, the initiator decryption unit 114 may transmit the first coded invitee voice to the mixing unit 116.

在步驟S306中,混音單元116可利用第一語音編碼方式對第一編碼後被邀請者語音執行解碼以獲得第一被邀請者語音。接著,發起者裝置(的混音單元116)可對發起者語音執行混音以獲得混音結果。接著,混音單元116可利用(前述步驟S301中第一被邀請者裝置的被邀請者錄音單元122所使用的)第一語音編碼方式對混音結果執行編碼以獲得編碼後混音結果。接著,混音單元116可傳送編碼後混音結果至發起者加密單元115。In step S306, the mixing unit 116 may decode the first coded invitee voice using the first voice coding method to obtain the first invitee voice. Then, the initiator device (the mixing unit 116) may mix the initiator voice to obtain a mixing result. Then, the mixing unit 116 may encode the mixing result using the first voice coding method (used by the invitee recording unit 122 of the first invitee device in the aforementioned step S301) to obtain an encoded mixing result. Then, the mixing unit 116 may transmit the encoded mixing result to the initiator encryption unit 115.

在步驟S307中,發起者加密單元115可利用加解密演算法以及第一金鑰對編碼後混音結果執行加密以獲得編碼與加密後混音結果。接著,發起者加密單元115可傳送編碼與加密後混音結果至發起者收送單元113。In step S307, the initiator encryption unit 115 may encrypt the coded mixing result using the encryption and decryption algorithm and the first key to obtain the coded and encrypted mixing result. Then, the initiator encryption unit 115 may transmit the coded and encrypted mixing result to the initiator transceiver unit 113.

在步驟S308中,發起者收送單元113可經由轉送單元220傳送編碼與加密後混音結果至第一被邀請者裝置的被邀請者收送單元123。詳細而言,發起者收送單元113可基於UDP/RTP傳輸協定傳送編碼與加密後混音結果至轉送單元220。然後,轉送單元220可基於UDP/RTP傳輸協定傳送編碼與加密後混音結果至第一被邀請者裝置的被邀請者收送單元123。In step S308, the initiator transceiver unit 113 may transmit the encoded and encrypted mixing result to the invitee transceiver unit 123 of the first invitee device via the transfer unit 220. Specifically, the initiator transceiver unit 113 may transmit the encoded and encrypted mixing result to the transfer unit 220 based on the UDP/RTP transmission protocol. Then, the transfer unit 220 may transmit the encoded and encrypted mixing result to the invitee transceiver unit 123 of the first invitee device based on the UDP/RTP transmission protocol.

在本實施例中,第一被邀請者裝置可利用編碼與加密後混音結果、加解密演算法以及第一金鑰獲得編碼後混音結果。詳細而言,在步驟S309中,第一被邀請者裝置的被邀請者收送單元123可基於UDP/RTP傳輸協定來進行封包拆解以獲得編碼與加密後混音結果。接著,第一被邀請者裝置的被邀請者收送單元123可傳送編碼與加密後混音結果至第一被邀請者裝置的被邀請者解密單元125。接著,第一被邀請者裝置的被邀請者解密單元125可利用編碼與加密後混音結果、加解密演算法以及第一金鑰獲得編碼後混音結果。然後,第一被邀請者裝置的被邀請者解密單元125可播放編碼後混音結果。In this embodiment, the first invitee device may obtain the encoded mixed audio result using the encoded and encrypted mixed audio result, the encryption and decryption algorithm, and the first key. In detail, in step S309, the invitee transceiver unit 123 of the first invitee device may perform packet decomposition based on the UDP/RTP transmission protocol to obtain the encoded and encrypted mixed audio result. Then, the invitee transceiver unit 123 of the first invitee device may transmit the encoded and encrypted mixed audio result to the invitee decryption unit 125 of the first invitee device. Then, the invitee decryption unit 125 of the first invitee device may obtain the encoded mixed audio result using the encoded and encrypted mixed audio result, the encryption and decryption algorithm, and the first key. Then, the invitee decryption unit 125 of the first invitee device may play the encoded mixed result.

在一實施例中,所述加解密演算法為ECDH/DH演算法,然而本發明不限於此。In one embodiment, the encryption and decryption algorithm is the ECDH/DH algorithm, but the present invention is not limited thereto.

在此需說明的是,如圖1所示,由於所述多個被邀請者裝置包括第一被邀請者裝置以及第二被邀請者裝置,因此發起者裝置、伺服器以及第一被邀請者裝置可利用圖2的方式來執行第一語音會議註冊流程及第一金鑰協商流程。另一方面,發起者裝置、伺服器以及第二被邀請者裝置也可以利用相似於圖2的方式來執行第二語音會議註冊流程及第二金鑰協商流程。更進一步來說,發起者裝置、伺服器、第一被邀請者裝置以及第二被邀請者裝置可利用相似於圖3的方式來執行語音會議。換言之,語音會議可包括發起者、第一被邀請者以及第二被邀請者。It should be noted that, as shown in FIG. 1 , since the multiple invitee devices include a first invitee device and a second invitee device, the initiator device, the server, and the first invitee device can use the method of FIG. 2 to perform the first voice conference registration process and the first key negotiation process. On the other hand, the initiator device, the server, and the second invitee device can also use a method similar to FIG. 2 to perform the second voice conference registration process and the second key negotiation process. Furthermore, the initiator device, the server, the first invitee device, and the second invitee device can use a method similar to FIG. 3 to perform a voice conference. In other words, the voice conference may include the initiator, the first invitee, and the second invitee.

具體而言,當第一被邀請者裝置的被邀請者加密單元124、發起者加密單元115以及轉送單元220完成(如圖2步驟S207~步驟S208的)第一金鑰協商流程時,第一被邀請者裝置的被邀請者加密單元124可獲得第一金鑰,且發起者加密單元115可獲得第一金鑰。另一方面,當第二被邀請者裝置的被邀請者加密單元124、發起者加密單元115以及轉送單元220完成(如圖2步驟S207~步驟S208的)第二金鑰協商流程時,第二被邀請者裝置的被邀請者加密單元124可獲得第二金鑰,且發起者加密單元115可獲得第二金鑰。Specifically, when the invitee encryption unit 124 of the first invitee device, the initiator encryption unit 115 and the transfer unit 220 complete the first key negotiation process (such as step S207 to step S208 in Figure 2), the invitee encryption unit 124 of the first invitee device can obtain the first key, and the initiator encryption unit 115 can obtain the first key. On the other hand, when the invitee encryption unit 124 of the second invitee device, the initiator encryption unit 115 and the transfer unit 220 complete the second key negotiation process (such as step S207 to step S208 in Figure 2), the invitee encryption unit 124 of the second invitee device can obtain the second key, and the initiator encryption unit 115 can obtain the second key.

進一步而言,第二被邀請者裝置可對應於第二被邀請者語音。所述混音結果可包括第一混音結果以及第二混音結果,其中所述編碼與加密後混音結果可包括第一編碼與加密後混音結果以及第二編碼與加密後混音結果,其中所述編碼後混音結果可包括第一編碼後混音結果以及第二編碼後混音結果。更進一步而言,相似於上述步驟S301,第二被邀請者裝置的被邀請者錄音單元122可根據第二網路的第二網路環境來選擇出第二語音編碼方式。以下將繼續說明。Furthermore, the second invitee device may correspond to the second invitee voice. The mixing result may include a first mixing result and a second mixing result, wherein the encoded and encrypted mixing result may include a first encoded and encrypted mixing result and a second encoded and encrypted mixing result, wherein the encoded mixing result may include a first encoded mixing result and a second encoded mixing result. Further, similar to the above step S301, the invitee recording unit 122 of the second invitee device may select a second voice encoding method according to the second network environment of the second network. The following will be described.

相似於上述步驟S301以及步驟S302,當發起者裝置以及第二被邀請者裝置完成語音會議註冊流程時,第二被邀請者裝置可利用第二被邀請者語音、關聯於第二網路的第二語音編碼方式、加解密演算法以及第二金鑰獲得第二編碼與加密後被邀請者語音。Similar to the above steps S301 and S302, when the initiator device and the second invitee device complete the voice conference registration process, the second invitee device can use the second invitee voice, the second voice encoding method associated with the second network, the encryption and decryption algorithm and the second key to obtain the second coded and encrypted invitee voice.

相似於上述步驟S305以及步驟S306,發起者裝置可利用第二編碼與加密後被邀請者語音、加解密演算法、第二金鑰以及第二語音編碼方式獲得第二被邀請者語音。Similar to the above-mentioned step S305 and step S306, the initiator device can obtain the second invitee's voice by using the second coded and encrypted invitee's voice, the encryption and decryption algorithm, the second key and the second voice encoding method.

然後,發起者裝置可對發起者語音以及第一被邀請者語音執行混音以獲得第一混音結果,且發起者裝置可對發起者語音以及第二被邀請者語音執行混音以獲得第二混音結果。然後,發起者裝置可利用第二混音結果、第一語音編碼方式、加解密演算法以及第一金鑰獲得第一編碼與加密後混音結果,且發起者裝置可利用第一混音結果、第二語音編碼方式、加解密演算法以及第二金鑰獲得第二編碼與加密後混音結果。然後,第一被邀請者裝置可利用第一編碼與加密後混音結果、加解密演算法以及第一金鑰獲得第一編碼後混音結果,且第二被邀請者裝置可利用第二編碼與加密後混音結果、加解密演算法以及第二金鑰獲得第二編碼後混音結果。Then, the initiator device may perform mixing on the initiator's voice and the first invitee's voice to obtain a first mixing result, and the initiator device may perform mixing on the initiator's voice and the second invitee's voice to obtain a second mixing result. Then, the initiator device may use the second mixing result, the first voice coding method, the encryption algorithm, and the first key to obtain a first coded and encrypted mixing result, and the initiator device may use the first mixing result, the second voice coding method, the encryption algorithm, and the second key to obtain a second coded and encrypted mixing result. Then, the first invitee device may obtain a first encoded mixed result using the first encoded and encrypted mixed result, the encryption and decryption algorithm, and the first key, and the second invitee device may obtain a second encoded mixed result using the second encoded and encrypted mixed result, the encryption and decryption algorithm, and the second key.

圖4是根據本發明的一實施例繪示的金鑰協商流程與端對端加密的示意圖。請同時參照圖1、圖2、圖3及圖4。在本實施例中,所述多個被邀請者裝置包括第一被邀請者裝置、第二被邀請者裝置以及第三被邀請者裝置。在本實施例的金鑰協商流程中,發起者加密單元115與各被邀請者裝置的被邀請者加密單元124透過網際網路傳送協商資訊而非傳送金鑰本身。進一步而言,只有參與特定金鑰協商流程的發起者裝置與特定被邀請者裝置可以計算出金鑰。FIG. 4 is a schematic diagram of a key negotiation process and end-to-end encryption according to an embodiment of the present invention. Please refer to FIG. 1, FIG. 2, FIG. 3 and FIG. 4 simultaneously. In this embodiment, the plurality of invitee devices include a first invitee device, a second invitee device and a third invitee device. In the key negotiation process of this embodiment, the initiator encryption unit 115 and the invitee encryption unit 124 of each invitee device transmit negotiation information via the Internet instead of transmitting the key itself. Furthermore, only the initiator device and the specific invitee device participating in a specific key negotiation process can calculate the key.

更進一步而言,發起者裝置與各被邀請者裝置所計算出的金鑰為各自獨立。如圖4所示,發起者裝置與第一被邀請者裝置計算出的金鑰為第一金鑰,且發起者裝置與第二被邀請者裝置計算出的金鑰為第二金鑰,且發起者裝置與第三被邀請者裝置計算出的金鑰為第三金鑰。若需在加密語音會議進行中加入第四被邀請者裝置,則僅有第四被邀請者裝置需與發起者裝置進行金鑰協商流程以計算出第四金鑰。換言之,第一被邀請者裝置、第二被邀請者裝置以及第三被邀請者裝置不需重新進行金鑰協商流程。Furthermore, the keys calculated by the initiator device and each invitee device are independent of each other. As shown in FIG. 4 , the key calculated by the initiator device and the first invitee device is the first key, the key calculated by the initiator device and the second invitee device is the second key, and the key calculated by the initiator device and the third invitee device is the third key. If a fourth invitee device is to be added during the encrypted voice conference, only the fourth invitee device needs to perform a key negotiation process with the initiator device to calculate the fourth key. In other words, the first invitee device, the second invitee device, and the third invitee device do not need to perform a key negotiation process again.

圖5是根據本發明的一實施例繪示的終端混音的示意圖。請同時參照圖1、圖2、圖3、圖4及圖5。在本實施例中,所述多個被邀請者裝置包括第一被邀請者裝置、第二被邀請者裝置以及第三被邀請者裝置。發起者裝置可對從各被邀請者裝置接收的各編碼與加密後被邀請者語音進行解密、解碼以及混音。詳細而言,如圖5所示,在發起者裝置分別接收第一編碼與加密後被邀請者語音、第二編碼與加密後被邀請者語音以及第三編碼與加密後被邀請者語音之後,發起者裝置可對第一編碼與加密後被邀請者語音、第二編碼與加密後被邀請者語音以及第三編碼與加密後被邀請者語音分別進行解密及解碼。然後,發起者裝置可執行混音。第一被邀請者裝置將會收到發起者語音、第二被邀請者語音以及第三被邀請者語音的混音結果。再者,第二被邀請者裝置將會收到發起者語音、第一被邀請者語音以及第三被邀請者語音的混音結果。另外,第三被邀請者裝置將會收到發起者語音、第一被邀請者語音以及第二被邀請者語音的混音結果。FIG. 5 is a schematic diagram of terminal mixing according to an embodiment of the present invention. Please refer to FIG. 1, FIG. 2, FIG. 3, FIG. 4 and FIG. 5 at the same time. In this embodiment, the plurality of invitee devices include a first invitee device, a second invitee device and a third invitee device. The initiator device can decrypt, decode and mix each coded and encrypted invitee voice received from each invitee device. Specifically, as shown in FIG. 5, after the initiator device receives the first coded and encrypted invitee voice, the second coded and encrypted invitee voice and the third coded and encrypted invitee voice respectively, the initiator device can decrypt and decode the first coded and encrypted invitee voice, the second coded and encrypted invitee voice and the third coded and encrypted invitee voice respectively. Then, the initiator device can perform mixing. The first invitee device will receive the mixing result of the initiator's voice, the second invitee's voice, and the third invitee's voice. Furthermore, the second invitee device will receive the mixing result of the initiator's voice, the first invitee's voice, and the third invitee's voice. In addition, the third invitee device will receive the mixing result of the initiator's voice, the first invitee's voice, and the second invitee's voice.

若需在加密語音會議進行中加入第四被邀請者裝置,則發起者裝置只需把從第四被邀請者裝置接收的第四編碼與加密後被邀請者語音進行解密以及解碼以獲得第四被邀請者語音,再將第四被邀請者語音與(解密及解碼後的)第一被邀請者語音、第二被邀請者語音以及第三被邀請者語音分別混音,然後再分別送至各被邀請者裝置即可,並不影響已進行之加密語音會議。If a fourth invitee device needs to be added during an encrypted voice conference, the initiator device only needs to decrypt and decode the fourth coded and encrypted invitee voice received from the fourth invitee device to obtain the fourth invitee voice, and then mix the fourth invitee voice with the (decrypted and decoded) first invitee voice, second invitee voice and third invitee voice respectively, and then send them to each invitee device respectively, without affecting the encrypted voice conference that has been carried out.

圖6是根據本發明的一實施例繪示的不同網路環境下的不同語音編碼方式的示意圖。請同時參照圖1、圖2、圖3、圖4、圖5及圖6。若在第一網路(Wi-Fi網路環境)的第一被邀請者裝置選擇G.711格式做為第一語音編碼方式,則發起者裝置也會以G.711格式對混音結果進行編碼以回傳給第一被邀請者裝置。再者,若在第二網路(行動網路環境)下的第二被邀請者裝置選擇Opus格式做為第二語音編碼方式,則發起者裝置也會以Opus格式對混音結果進行編碼以回傳給第二被邀請者裝置。另外,若在第三網路(衛星網路環境)下的第三被邀請者裝置選擇Lyra格式做為第三語音編碼方式,則發起者裝置也會以Lyra格式對混音結果進行編碼以回傳給第三被邀請者裝置。FIG6 is a schematic diagram of different voice coding methods in different network environments according to an embodiment of the present invention. Please refer to FIG1, FIG2, FIG3, FIG4, FIG5 and FIG6 at the same time. If the first invitee device in the first network (Wi-Fi network environment) selects the G.711 format as the first voice coding method, the initiator device will also encode the mixing result in the G.711 format and return it to the first invitee device. Furthermore, if the second invitee device in the second network (mobile network environment) selects the Opus format as the second voice coding method, the initiator device will also encode the mixing result in the Opus format and return it to the second invitee device. In addition, if the third invitee device in the third network (satellite network environment) selects the Lyra format as the third voice encoding method, the initiator device will also encode the mixing result in the Lyra format and return it to the third invitee device.

圖7是根據本發明的一實施例繪示的跨國加密語音會議情境的示意圖。請同時參照圖1、圖2、圖3、圖4、圖5、圖6以及圖7。在具有行動網路或Wi-Fi網路的環境下,使用發起者裝置(位置為台灣總公司)的發起者可經由發起者註冊單元111傳送註冊資訊至伺服器的伺服器端註冊單元210。使用第一被邀請者裝置(位置為美國分公司)的第一被邀請者可經由第一被邀請者裝置的被邀請者註冊單元121傳送註冊資訊至伺服器的伺服器端註冊單元210。再者,使用第二被邀請者裝置(位置為中國分公司)的第二被邀請者可經由第二被邀請者裝置的被邀請者註冊單元121傳送註冊資訊至伺服器的伺服器端註冊單元210。在完成語音會議註冊流程以及各金鑰協商流程之後,發起者裝置、第一被邀請者裝置以及第二被邀請者裝置可進行跨國的三方加密語音會議。FIG7 is a schematic diagram of a cross-border encrypted voice conference scenario according to an embodiment of the present invention. Please refer to FIG1, FIG2, FIG3, FIG4, FIG5, FIG6 and FIG7 simultaneously. In an environment with a mobile network or a Wi-Fi network, the initiator using the initiator device (located at the Taiwan head office) can transmit registration information to the server-side registration unit 210 of the server via the initiator registration unit 111. The first invitee using the first invitee device (located at the US branch) can transmit registration information to the server-side registration unit 210 of the server via the invitee registration unit 121 of the first invitee device. Furthermore, the second invitee using the second invitee device (located at the Chinese branch) can transmit registration information to the server-side registration unit 210 of the server via the invitee registration unit 121 of the second invitee device. After completing the voice conference registration process and the key negotiation process, the initiator device, the first invitee device and the second invitee device can conduct a cross-border three-party encrypted voice conference.

發起者、第一被邀請者以及第二被邀請者可分別對著發起者裝置、第一被邀請者裝置以及第二被邀請者裝置開始說話。對話內容可經由發起者裝置的發起者錄音單元112、第一被邀請者裝置的被邀請者錄音單元122以及第二被邀請者裝置的被邀請者錄音單元122分別收錄,其中第一被邀請者裝置收錄的第一被邀請者語音以及第二被邀請者裝置收錄的第二被邀請者語音將分別依其選擇的語音編碼方式編碼後加密,並以密文的形式透過伺服器轉送至發起者裝置。發起者裝置會藉由發起者解密單元114進行解密,並且送至混音單元116以分別針對第一被邀請者裝置以及第二被邀請者裝置進行混音,然後採用原本使用之語音編碼方式進行編碼。最後,在發起者加密單元115加密後,分別以密文形式將混音結果傳送至第一被邀請者裝置的被邀請者解密單元125以及第二被邀請者裝置的被邀請者解密單元125,以分別解密及播放。The initiator, the first invitee and the second invitee can start talking to the initiator device, the first invitee device and the second invitee device respectively. The conversation content can be recorded by the initiator recording unit 112 of the initiator device, the invitee recording unit 122 of the first invitee device and the invitee recording unit 122 of the second invitee device respectively, wherein the first invitee's voice recorded by the first invitee device and the second invitee's voice recorded by the second invitee device will be encoded and encrypted according to the voice encoding method selected by them, and transferred to the initiator device through the server in the form of ciphertext. The initiator device will decrypt the audio through the initiator decryption unit 114 and send it to the mixing unit 116 for mixing with the first invitee device and the second invitee device respectively, and then encode it using the original voice encoding method. Finally, after encryption by the initiator encryption unit 115, the mixing result is sent to the invitee decryption unit 125 of the first invitee device and the invitee decryption unit 125 of the second invitee device in the form of ciphertext for decryption and playback respectively.

若在加密語音會議進行中有第三被邀請者裝置欲臨時加入,待第三被邀請者裝置註冊成功後,僅需與發起者裝置進行金鑰協商流程取得金鑰。換言之,已在會議中之第一被邀請者裝置以及第二被邀請者裝置不需重新取得金鑰即可持續進行加密語音會議。If a third invitee device wants to temporarily join the encrypted voice conference, after the third invitee device is successfully registered, it only needs to go through the key negotiation process with the initiator device to obtain the key. In other words, the first invitee device and the second invitee device already in the conference do not need to obtain the key again to continue the encrypted voice conference.

圖8是根據本發明的一實施例繪示的衛星網路環境下的語音會議情境的示意圖。本實施例與圖7的差異在於,由於第一被邀請者裝置的位置國家B處於戰爭期間可能無法使用行動網路或Wi-Fi網路,第一被邀請者可利用衛星電話分享衛星網路給第一被邀請者裝置。基此,第一被邀請者裝置可與具有行動網路或Wi-Fi網路之發起者裝置及第二被邀請者裝置進行加密語音會議。FIG8 is a schematic diagram of a voice conference scenario in a satellite network environment according to an embodiment of the present invention. The difference between this embodiment and FIG7 is that, since the location of the first invitee device in country B is in a war and may not be able to use a mobile network or Wi-Fi network, the first invitee can use a satellite phone to share the satellite network with the first invitee device. Based on this, the first invitee device can conduct an encrypted voice conference with the initiator device and the second invitee device that have a mobile network or Wi-Fi network.

圖9是根據本發明的一實施例繪示的語音會議方法的流程圖,其中所述語音會議系統方法可由圖1所示的語音會議系統實施。在步驟S910中,當發起者裝置以及第一被邀請者裝置完成語音會議註冊流程以及第一金鑰協商流程時,由第一被邀請者裝置利用第一被邀請者語音、關聯於第一網路的第一語音編碼方式、加解密演算法以及第一金鑰獲得第一編碼與加密後被邀請者語音。在步驟S920中,由發起者裝置利用第一編碼與加密後被邀請者語音、加解密演算法、第一金鑰以及第一語音編碼方式獲得第一被邀請者語音。在步驟S930中,由發起者裝置對發起者語音執行混音以獲得混音結果。在步驟S940中,由發起者裝置利用混音結果、第一語音編碼方式、加解密演算法以及第一金鑰獲得編碼與加密後混音結果。在步驟S950中,由第一被邀請者裝置利用編碼與加密後混音結果、加解密演算法以及第一金鑰獲得編碼後混音結果。所述方法已於前述實施例說明,於此不再贅述。FIG9 is a flow chart of a voice conference method according to an embodiment of the present invention, wherein the voice conference system method can be implemented by the voice conference system shown in FIG1. In step S910, when the initiator device and the first invitee device complete the voice conference registration process and the first key negotiation process, the first invitee device uses the first invitee voice, the first voice encoding method associated with the first network, the encryption and decryption algorithm, and the first key to obtain the first coded and encrypted invitee voice. In step S920, the initiator device uses the first coded and encrypted invitee voice, the encryption and decryption algorithm, the first key, and the first voice encoding method to obtain the first invitee voice. In step S930, the initiator device performs mixing on the initiator voice to obtain a mixing result. In step S940, the initiator device uses the mixing result, the first voice encoding method, the encryption and decryption algorithm, and the first key to obtain a coded and encrypted mixing result. In step S950, the first invitee device uses the coded and encrypted mixing result, the encryption and decryption algorithm, and the first key to obtain a coded mixing result. The method has been described in the above-mentioned embodiment and will not be repeated here.

綜上所述,本發明的語音會議系統及方法可利用端對端加密以及終端混音來執行語音會議。當語音會議中加入新成員時,僅新成員與發起者裝置之間需進行金鑰協商流程。再者,由於伺服器不需進行解密、混音以及加密,因此可提高語音資訊的安全性。除此之外,被邀請者裝置可依據網路環境來選擇適合的語音編碼方式,更提高了語音會議的彈性。In summary, the voice conferencing system and method of the present invention can use end-to-end encryption and terminal mixing to perform voice conferencing. When a new member is added to the voice conference, only the new member and the initiator device need to perform a key negotiation process. Furthermore, since the server does not need to perform decryption, mixing and encryption, the security of voice information can be improved. In addition, the invitee device can choose a suitable voice encoding method according to the network environment, which further improves the flexibility of the voice conference.

111:發起者註冊單元 112:發起者錄音單元 113:發起者收送單元 114:發起者解密單元 115:發起者加密單元 116:混音單元 121:被邀請者註冊單元 122:被邀請者錄音單元 123:被邀請者收送單元 124:被邀請者加密單元 125:被邀請者解密單元 210:伺服器端註冊單元 220:轉送單元 S201~S209、S301~S309、S910~S950:步驟 111: Originator registration unit 112: Originator recording unit 113: Originator receiving and sending unit 114: Originator decryption unit 115: Originator encryption unit 116: Mixing unit 121: Invitee registration unit 122: Invitee recording unit 123: Invitee receiving and sending unit 124: Invitee encryption unit 125: Invitee decryption unit 210: Server registration unit 220: Transfer unit S201~S209, S301~S309, S910~S950: Steps

圖1是根據本發明的一實施例繪示的語音會議系統的示意圖。 圖2是根據本發明的一實施例繪示的語音會議註冊流程及金鑰協商流程的流程圖。 圖3是根據本發明的一實施例繪示的語音會議的流程圖。 圖4是根據本發明的一實施例繪示的金鑰協商流程與端對端加密的示意圖。 圖5是根據本發明的一實施例繪示的終端混音的示意圖。 圖6是根據本發明的一實施例繪示的不同網路環境下的不同語音編碼方式的示意圖。 圖7是根據本發明的一實施例繪示的跨國加密語音會議情境的示意圖。 圖8是根據本發明的一實施例繪示的衛星網路環境下的語音會議情境的示意圖。 圖9是根據本發明的一實施例繪示的語音會議方法的流程圖。 FIG. 1 is a schematic diagram of a voice conference system according to an embodiment of the present invention. FIG. 2 is a flow chart of a voice conference registration process and a key negotiation process according to an embodiment of the present invention. FIG. 3 is a flow chart of a voice conference according to an embodiment of the present invention. FIG. 4 is a schematic diagram of a key negotiation process and end-to-end encryption according to an embodiment of the present invention. FIG. 5 is a schematic diagram of terminal mixing according to an embodiment of the present invention. FIG. 6 is a schematic diagram of different voice encoding methods in different network environments according to an embodiment of the present invention. FIG. 7 is a schematic diagram of a cross-border encrypted voice conference scenario according to an embodiment of the present invention. FIG8 is a schematic diagram of a voice conferencing scenario in a satellite network environment according to an embodiment of the present invention. FIG9 is a flow chart of a voice conferencing method according to an embodiment of the present invention.

S910~S950:步驟 S910~S950: Steps

Claims (9)

一種語音會議系統,包括:發起者裝置,對應於發起者語音;伺服器,通訊連接至所述發起者裝置;以及多個被邀請者裝置,其中所述多個被邀請者裝置包括第一被邀請者裝置,且所述第一被邀請者裝置經由第一網路通訊連接至所述伺服器,且所述第一被邀請者裝置對應於第一被邀請者語音,其中當所述發起者裝置以及所述第一被邀請者裝置完成語音會議註冊流程以及第一金鑰協商流程時,所述第一被邀請者裝置利用所述第一被邀請者語音、關聯於所述第一網路的第一語音編碼方式、加解密演算法以及第一金鑰獲得第一編碼與加密後被邀請者語音;所述發起者裝置利用所述第一編碼與加密後被邀請者語音、所述加解密演算法、所述第一金鑰以及所述第一語音編碼方式獲得所述第一被邀請者語音;所述發起者裝置對所述發起者語音執行混音以獲得混音結果;所述發起者裝置利用所述混音結果、所述第一語音編碼方式、所述加解密演算法以及所述第一金鑰獲得編碼與加密後混音結果;所述第一被邀請者裝置利用所述編碼與加密後混音結果、 所述加解密演算法以及所述第一金鑰獲得編碼後混音結果,其中所述第一被邀請者裝置根據所述第一網路的第一網路環境來選擇出所述第一語音編碼方式。 A voice conference system includes: an initiator device corresponding to the initiator voice; a server communicatively connected to the initiator device; and a plurality of invitee devices, wherein the plurality of invitee devices include a first invitee device, and the first invitee device is communicatively connected to the server via a first network, and the first invitee device corresponds to the first invitee voice, wherein when the initiator device and the first invitee device complete the voice conference registration process and the first key negotiation process, the first invitee device uses the first invitee voice, the first voice encoding method associated with the first network, the encryption and decryption algorithm, and the first key to obtain the first coded and encrypted invitee voice ; The initiator device obtains the first invitee voice using the first coded and encrypted invitee voice, the encryption and decryption algorithm, the first key, and the first voice encoding method; the initiator device performs mixing on the initiator voice to obtain a mixing result; the initiator device obtains the coded and encrypted mixing result using the mixing result, the first voice encoding method, the encryption and decryption algorithm, and the first key; the first invitee device obtains the coded mixing result using the coded and encrypted mixing result, the encryption and decryption algorithm, and the first key, wherein the first invitee device selects the first voice encoding method according to the first network environment of the first network. 如請求項1所述的語音會議系統,其中所述多個被邀請者裝置更包括第二被邀請者裝置,且所述第二被邀請者裝置經由第二網路通訊連接至所述伺服器,且所述第二被邀請者裝置對應於第二被邀請者語音,其中所述混音結果包括第一混音結果以及第二混音結果,其中所述編碼與加密後混音結果包括第一編碼與加密後混音結果以及第二編碼與加密後混音結果,其中所述編碼後混音結果包括第一編碼後混音結果以及第二編碼後混音結果,其中當所述發起者裝置以及所述第二被邀請者裝置完成所述語音會議註冊流程時,所述第二被邀請者裝置利用所述第二被邀請者語音、關聯於所述第二網路的第二語音編碼方式、所述加解密演算法以及第二金鑰獲得第二編碼與加密後被邀請者語音;所述發起者裝置利用所述第二編碼與加密後被邀請者語音、所述加解密演算法、所述第二金鑰以及所述第二語音編碼方式獲得所述第二被邀請者語音;所述發起者裝置對所述發起者語音以及所述第一被邀請者語音執行混音以獲得所述第一混音結果,且所述發起者裝置對所述發起者語音以及所述第二被邀請者語音執行混音以獲得所述第二混音結果; 所述發起者裝置利用所述第二混音結果、所述第一語音編碼方式、所述加解密演算法以及所述第一金鑰獲得所述第一編碼與加密後混音結果,且所述發起者裝置利用所述第一混音結果、所述第二語音編碼方式、所述加解密演算法以及所述第二金鑰獲得所述第二編碼與加密後混音結果;所述第一被邀請者裝置利用所述第一編碼與加密後混音結果、所述加解密演算法以及所述第一金鑰獲得所述第一編碼後混音結果,且所述第二被邀請者裝置利用所述第二編碼與加密後混音結果、所述加解密演算法以及所述第二金鑰獲得所述第二編碼後混音結果。 A voice conference system as described in claim 1, wherein the plurality of invitee devices further include a second invitee device, and the second invitee device is connected to the server via a second network communication, and the second invitee device corresponds to a second invitee voice, wherein the mixing result includes a first mixing result and a second mixing result, wherein the encoded and encrypted mixing result includes a first encoded and encrypted mixing result and a second encoded and encrypted mixing result, wherein the encoded mixing result includes a first The initiator device and the second invitee device complete the voice conference registration process, wherein the second invitee device uses the second invitee voice, the second voice coding method associated with the second network, the encryption algorithm and the second key to obtain the second coded and encrypted invitee voice; the initiator device uses the second coded and encrypted invitee voice, the encryption algorithm, the second key and the second key to obtain the second coded and encrypted invitee voice. The initiator device mixes the initiator voice and the first invitee voice to obtain the first mixing result, and the initiator device mixes the initiator voice and the second invitee voice to obtain the second mixing result; the initiator device uses the second mixing result, the first voice encoding method, the encryption and decryption algorithm and the first key to obtain the first encoding and encrypted mixing result, and ... The initiator device uses the first mixing result, the second voice coding method, the encryption algorithm and the second key to obtain the second encoding and encrypted mixing result; the first invitee device uses the first encoding and encrypted mixing result, the encryption algorithm and the first key to obtain the first encoding and mixed result, and the second invitee device uses the second encoding and encrypted mixing result, the encryption algorithm and the second key to obtain the second encoding and mixed result. 如請求項2所述的語音會議系統,其中所述多個被邀請者裝置的每一者包括被邀請者錄音單元,其中所述第二被邀請者裝置的所述被邀請者錄音單元根據所述第二網路的第二網路環境來選擇出所述第二語音編碼方式。 A voice conference system as described in claim 2, wherein each of the plurality of invitee devices includes an invitee recording unit, wherein the invitee recording unit of the second invitee device selects the second voice encoding method according to the second network environment of the second network. 如請求項2所述的語音會議系統,其中所述多個被邀請者裝置的每一者包括被邀請者加密單元,且所述發起者裝置包括發起者加密單元,且所述伺服器包括轉送單元,其中當所述第一被邀請者裝置的所述被邀請者加密單元、所述發起者加密單元以及所述轉送單元完成所述第一金鑰協商流程時,所述第一被邀請者裝置的所述被邀請者加密單元獲得所述第一金鑰,且所述發起者加密單元獲得所述第一金鑰;當所述第二被邀請者裝置的所述被邀請者加密單元、所述發 起者加密單元以及所述轉送單元完成第二金鑰協商流程時,所述第二被邀請者裝置的所述被邀請者加密單元獲得所述第二金鑰,且所述發起者加密單元獲得所述第二金鑰。 A voice conference system as described in claim 2, wherein each of the plurality of invitee devices includes an invitee encryption unit, and the initiator device includes an initiator encryption unit, and the server includes a transfer unit, wherein when the invitee encryption unit of the first invitee device, the initiator encryption unit, and the transfer unit complete the first key negotiation process, the invitee encryption unit of the first invitee device obtains the first key, and the initiator encryption unit obtains the first key; when the invitee encryption unit of the second invitee device, the initiator encryption unit, and the transfer unit complete the second key negotiation process, the invitee encryption unit of the second invitee device obtains the second key, and the initiator encryption unit obtains the second key. 如請求項1所述的語音會議系統,其中所述多個被邀請者裝置的每一者包括被邀請者錄音單元、被邀請者加密單元以及被邀請者解密單元,其中所述第一被邀請者裝置的所述被邀請者錄音單元利用所述第一語音編碼方式對所述第一被邀請者語音執行編碼以獲得第一編碼後被邀請者語音;所述第一被邀請者裝置的所述被邀請者加密單元利用所述加解密演算法以及所述第一金鑰對所述第一編碼後被邀請者語音執行加密以獲得所述第一編碼與加密後被邀請者語音;所述第一被邀請者裝置的所述被邀請者解密單元利用所述編碼與加密後混音結果、所述加解密演算法以及所述第一金鑰獲得所述編碼後混音結果。 A voice conference system as described in claim 1, wherein each of the plurality of invitee devices comprises an invitee recording unit, an invitee encryption unit and an invitee decryption unit, wherein the invitee recording unit of the first invitee device uses the first voice encoding method to encode the first invitee voice to obtain the first encoded invitee voice; the invitee encryption unit of the first invitee device uses the encryption and decryption algorithm and the first key to encrypt the first encoded invitee voice to obtain the first encoded and encrypted invitee voice; the invitee decryption unit of the first invitee device uses the encoding and encryption mixed sound result, the encryption and decryption algorithm and the first key to obtain the encoded mixed sound result. 如請求項1所述的語音會議系統,其中所述發起者裝置包括發起者解密單元、混音單元以及發起者加密單元,其中所述發起者解密單元利用所述加解密演算法以及所述第一金鑰對所述第一編碼與加密後被邀請者語音執行解密以獲得第一編碼後被邀請者語音;所述混音單元利用所述第一語音編碼方式對所述第一編碼後被邀請者語音執行解碼以獲得所述第一被邀請者語音; 所述混音單元利用所述第一語音編碼方式對所述混音結果執行編碼以獲得所述編碼後混音結果;所述發起者加密單元利用所述加解密演算法以及所述第一金鑰對所述編碼後混音結果執行加密以獲得所述編碼與加密後混音結果。 The voice conference system as described in claim 1, wherein the initiator device includes an initiator decryption unit, a mixing unit and an initiator encryption unit, wherein the initiator decryption unit uses the encryption and decryption algorithm and the first key to decrypt the first coded and encrypted invitee voice to obtain the first coded invitee voice; the mixing unit uses the first voice encoding method to decode the first coded invitee voice to obtain the first invitee voice; the mixing unit uses the first voice encoding method to encode the mixing result to obtain the encoded mixing result; the initiator encryption unit uses the encryption and decryption algorithm and the first key to encrypt the coded mixing result to obtain the coded and encrypted mixing result. 如請求項1所述的語音會議系統,其中所述加解密演算法為ECDH/DH演算法。 A voice conferencing system as described in claim 1, wherein the encryption and decryption algorithm is an ECDH/DH algorithm. 如請求項1所述的語音會議系統,其中所述多個被邀請者裝置的每一者包括被邀請者註冊單元,且所述發起者裝置包括發起者註冊單元,且所述伺服器包括伺服器端註冊單元,其中所述被邀請者註冊單元、所述發起者註冊單元以及所述伺服器端註冊單元執行所述語音會議註冊流程。 A voice conference system as described in claim 1, wherein each of the multiple invitee devices includes an invitee registration unit, and the initiator device includes an initiator registration unit, and the server includes a server-side registration unit, wherein the invitee registration unit, the initiator registration unit, and the server-side registration unit execute the voice conference registration process. 一種語音會議方法,適於由包括發起者裝置、伺服器以及多個被邀請者裝置的語音會議系統執行,其中所述發起者裝置對應於發起者語音,其中所述多個被邀請者裝置包括第一被邀請者裝置,且所述第一被邀請者裝置經由第一網路通訊連接至所述伺服器,且所述第一被邀請者裝置對應於第一被邀請者語音,其中所述方法包括以下步驟:當所述發起者裝置以及所述第一被邀請者裝置完成語音會議註冊流程以及第一金鑰協商流程時,由所述第一被邀請者裝置利用所述第一被邀請者語音、關聯於所述第一網路的第一語音編碼方式、加解密演算法以及第一金鑰獲得第一編碼與加密後被邀請 者語音;由所述發起者裝置利用所述第一編碼與加密後被邀請者語音、所述加解密演算法、所述第一金鑰以及所述第一語音編碼方式獲得所述第一被邀請者語音;由所述發起者裝置對所述發起者語音執行混音以獲得混音結果;由所述發起者裝置利用所述混音結果、所述第一語音編碼方式、所述加解密演算法以及所述第一金鑰獲得編碼與加密後混音結果;以及由所述第一被邀請者裝置利用所述編碼與加密後混音結果、所述加解密演算法以及所述第一金鑰獲得編碼後混音結果,其中所述方法更包括以下步驟:由所述第一被邀請者裝置根據所述第一網路的第一網路環境來選擇出所述第一語音編碼方式。 A voice conference method is suitable for being executed by a voice conference system including an initiator device, a server and a plurality of invitee devices, wherein the initiator device corresponds to the initiator voice, wherein the plurality of invitee devices include a first invitee device, and the first invitee device is connected to the server via a first network communication, and the first invitee device corresponds to the first invitee voice, wherein the method includes the following steps: when the initiator device and the first invitee device complete the voice conference registration process and the first key negotiation process, the first invitee device uses the first invitee voice, the first voice encoding method associated with the first network, the encryption and decryption algorithm and the first key to obtain the first coded and encrypted invitee voice ; the initiator device uses the first coded and encrypted invitee voice, the encryption and decryption algorithm, the first key and the first voice encoding method to obtain the first invitee voice; the initiator device performs mixing on the initiator voice to obtain a mixing result; the initiator device uses the mixing result, the first voice encoding method, the encryption and decryption algorithm and the first key to obtain the coded and encrypted mixing result; and the first invitee device uses the coded and encrypted mixing result, the encryption and decryption algorithm and the first key to obtain the coded mixing result, wherein the method further includes the following steps: the first invitee device selects the first voice encoding method according to the first network environment of the first network.
TW112137880A 2023-10-03 2023-10-03 Voice conference system and voice conference method TWI880378B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW112137880A TWI880378B (en) 2023-10-03 2023-10-03 Voice conference system and voice conference method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW112137880A TWI880378B (en) 2023-10-03 2023-10-03 Voice conference system and voice conference method

Publications (2)

Publication Number Publication Date
TWI880378B true TWI880378B (en) 2025-04-11
TW202516492A TW202516492A (en) 2025-04-16

Family

ID=96141663

Family Applications (1)

Application Number Title Priority Date Filing Date
TW112137880A TWI880378B (en) 2023-10-03 2023-10-03 Voice conference system and voice conference method

Country Status (1)

Country Link
TW (1) TWI880378B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI889612B (en) * 2024-12-06 2025-07-01 中華電信股份有限公司 System and method for dynamically switching mixing terminal and computer program product thereof

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201116012A (en) * 2009-09-29 2011-05-01 O2Micro Inc Integrated firewall / VPN system and integrated circuit thereof
US20130246529A1 (en) * 1996-03-26 2013-09-19 Joseph Salesky Presenting information in a conference
CN107667511A (en) * 2015-05-29 2018-02-06 耐瑞唯信有限公司 For carrying out the method and system of safe VOIP Multi-Way Callings
CN110727638A (en) * 2012-11-12 2020-01-24 伊诺卡姆公司 Data system and data method in electronic system and machine readable medium

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130246529A1 (en) * 1996-03-26 2013-09-19 Joseph Salesky Presenting information in a conference
TW201116012A (en) * 2009-09-29 2011-05-01 O2Micro Inc Integrated firewall / VPN system and integrated circuit thereof
CN110727638A (en) * 2012-11-12 2020-01-24 伊诺卡姆公司 Data system and data method in electronic system and machine readable medium
CN107667511A (en) * 2015-05-29 2018-02-06 耐瑞唯信有限公司 For carrying out the method and system of safe VOIP Multi-Way Callings

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI889612B (en) * 2024-12-06 2025-07-01 中華電信股份有限公司 System and method for dynamically switching mixing terminal and computer program product thereof

Also Published As

Publication number Publication date
TW202516492A (en) 2025-04-16

Similar Documents

Publication Publication Date Title
JP4401849B2 (en) System and method for establishing a secondary channel
JP6800169B2 (en) Systems and methods for making secure VOIP multi-party calls
US7991419B2 (en) Press-talk server, transcoder, and communication system
US7127487B1 (en) System and method for sidebar functionality in a regular conference system
US9325671B2 (en) System and method for merging encryption data using circular encryption key switching
CN103475793A (en) Making calls using an additional terminal
US8121057B1 (en) Wide area voice environment multi-channel communications system and method
TWI880378B (en) Voice conference system and voice conference method
EP2822239B1 (en) Electronic conference system, bandwidth management method and storage medium in which bandwidth management program is stored
CN104980406B (en) Call recording method, recording server, user class interchanger and recording system
US8243903B1 (en) Method and apparatus for implementing a secure side conversation on a telephone conference call
US20060215822A1 (en) Exchange system and communication recording method
US9628450B2 (en) System and method for merging encryption data without sharing a private key
CN102594794B (en) Access method and device of media encryption conference
CN105187678A (en) Bridging method for telephone conference room, and VoIP server
US9461974B2 (en) System and method to merge encrypted signals in distributed communication system
US9369273B2 (en) System and method for mixing VoIP streaming data for encrypted processing
KR100986113B1 (en) How to share media codec between terminals for conference
JP5549017B2 (en) Gateway device and data transmission method
TWI889612B (en) System and method for dynamically switching mixing terminal and computer program product thereof
JP2020053836A (en) Relay device
JP2007013254A (en) Speech recording method and system in ip telephon call
JP6729901B1 (en) Telephone conference system, telephone terminal, and program
CN108696512A (en) Across encrypted bit stream machinery of consultation, device and the conference facility of agreement
JP4778838B2 (en) Mobile station