WO2017181508A1 - Multimedia meeting control method and server - Google Patents
Multimedia meeting control method and server Download PDFInfo
- Publication number
- WO2017181508A1 WO2017181508A1 PCT/CN2016/085049 CN2016085049W WO2017181508A1 WO 2017181508 A1 WO2017181508 A1 WO 2017181508A1 CN 2016085049 W CN2016085049 W CN 2016085049W WO 2017181508 A1 WO2017181508 A1 WO 2017181508A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- server
- control terminal
- conference control
- speaking
- video
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/18—Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/18—Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
- H04N7/181—Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast for receiving images from a plurality of remote sources
Definitions
- the present invention relates to the field of multimedia conferences, and in particular, to a multimedia conference control method and a server.
- Multimedia conference rooms are rapidly adopted for their functional diversity (such as on-site conferences, academic reports, training and teaching). popular.
- the multimedia conference system refers to the integration of sound, light, electrical equipment and software that are interrelated with the conference.
- the multimedia conference room whether it is for reporting, summarizing, reporting, introducing products, etc., the use of computer interactive operation of pictures, texts, sounds, shadows, paintings, fully mobilized the participants' sensory perception, greatly improving the effectiveness of the meeting.
- Multimedia is increasingly showing its advantages in the office field.
- the cameras of the venue are mostly fixed, and it is impossible to track the video of the speaker, which greatly reduces the user experience.
- the camera cannot track the problem of shooting the speaker video, and the problem in this aspect needs to be solved by the inventor.
- the main object of the present invention is to solve the problem that the camera cannot track the video of the speaker in the multimedia conference system.
- the present invention provides a multimedia conference control method, where the multimedia conference control method includes the following steps:
- the server determines, according to the speaking instruction, the orientation information corresponding to the corresponding speaking seat and the speaking seat;
- the server adjusts a camera shooting speaker video according to the determined orientation information
- the server sends the speaker video to a display screen for display.
- the server before the step of determining, by the server according to the speaking instruction, the step of determining the orientation information corresponding to the speaker and the speaker according to the speaking instruction, the server further includes:
- the server displays a preset agent list by using the conference control terminal, so that the user determines the speaker based on the agent list and triggers a corresponding speaking instruction;
- the server receives a speaking instruction sent by the conference control terminal.
- the method before the step of the server displaying the preset agent list by the conference control terminal for the user to determine the speaker based on the agent list and triggering the corresponding speaking instruction, the method further includes:
- the server saves the received agent list and the orientation information corresponding to each agent.
- the method further includes:
- the server receives video data of each of the sub-sites through a network connection
- the server performs jigsaw processing on the video data of each of the sub-sites to obtain a puzzle video
- the server sends the puzzle video to a display for display.
- the server before the step of determining, by the server according to the speaking instruction, the step of determining the orientation information corresponding to the speaker and the speaker according to the speaking instruction, the server further includes:
- the server displays a preset agent list by using the conference control terminal, so that the user determines the speaker based on the agent list and triggers a corresponding speaking instruction;
- the server receives a speaking instruction sent by the conference control terminal.
- the method before the step of the server displaying the preset agent list by the conference control terminal for the user to determine the speaker based on the agent list and triggering the corresponding speaking instruction, the method further includes:
- the server saves the received agent list and the orientation information corresponding to each agent.
- the step of the server receiving the video data of each of the sub-sites through the network connection includes:
- the server detects the network bandwidth of the network connection in real time when receiving the video data of the conference site through the network connection;
- the server determines a video bit rate and a video resolution corresponding to the changed network bandwidth when detecting that the network bandwidth changes;
- the server switches to the determined video bit rate and video resolution to continue receiving video data.
- the server before the step of determining, by the server according to the speaking instruction, the step of determining the orientation information corresponding to the speaker and the speaker according to the speaking instruction, the server further includes:
- the server displays a preset agent list by using the conference control terminal, so that the user determines the speaker based on the agent list and triggers a corresponding speaking instruction;
- the server receives a speaking instruction sent by the conference control terminal.
- the method before the step of the server displaying the preset agent list by the conference control terminal for the user to determine the speaker based on the agent list and triggering the corresponding speaking instruction, the method further includes:
- the server saves the received agent list and the orientation information corresponding to each agent.
- the present invention further provides a multimedia conference server, where the multimedia conference server includes:
- the receiving module is configured to: when receiving the speaking instruction sent by the conference control terminal, determine, according to the speaking instruction, the orientation information corresponding to the corresponding speaking seat and the speaking seat;
- control module configured to adjust a camera shooting speaker video according to the determined orientation information
- a sending module configured to send the speaker video to a display screen for display.
- the multimedia conference server further includes a display module
- the display module is configured to display a preset agent list by using the conference control terminal, so that the user determines a speaker based on the agent list and triggers a corresponding speaking instruction;
- the receiving module is further configured to receive a speaking instruction sent by the conference control terminal.
- the multimedia conference server further includes a storage module
- the receiving module is further configured to: when receiving the setting instruction sent by the conference control terminal, receive the agent list input by the user based on the conference control terminal and the orientation information corresponding to each agent;
- the storage module is configured to save the received agent list and the orientation information corresponding to each agent.
- the multimedia conference server further includes a multimedia module
- the receiving module is further configured to receive video data of each of the sub-sites through a network connection
- the multimedia module is configured to perform jigsaw processing on video data of each of the sub-sites to obtain a puzzle video
- the sending module is further configured to send the puzzle video to a display screen for display.
- the multimedia conference server further includes a display module
- the display module is configured to display a preset agent list by using the conference control terminal, so that the user determines a speaker based on the agent list and triggers a corresponding speaking instruction;
- the receiving module is further configured to receive a speaking instruction sent by the conference control terminal.
- the multimedia conference server further includes a storage module
- the receiving module is further configured to: when receiving the setting instruction sent by the conference control terminal, receive the agent list input by the user based on the conference control terminal and the orientation information corresponding to each agent;
- the storage module is configured to save the received agent list and the orientation information corresponding to each agent.
- the receiving module includes a detecting unit, a determining unit, and a switching unit;
- the detecting unit is configured to detect a network bandwidth of the network connection in real time when receiving video data of a sub-site through a network connection;
- the determining unit is configured to determine a video bit rate and a video resolution corresponding to the changed network bandwidth when detecting that the network bandwidth changes;
- the switching unit is configured to switch to the determined video bit rate and video resolution to continue receiving video data.
- the multimedia conference server further includes a display module
- the display module is configured to display a preset agent list by using the conference control terminal, so that the user determines a speaker based on the agent list and triggers a corresponding speaking instruction;
- the receiving module is further configured to receive a speaking instruction sent by the conference control terminal.
- the multimedia conference server further includes a storage module
- the receiving module is further configured to: when receiving the setting instruction sent by the conference control terminal, receive the agent list input by the user based on the conference control terminal and the orientation information corresponding to each agent;
- the storage module is configured to save the received agent list and the orientation information corresponding to each agent.
- the invention receives the speaking instruction sent by the user based on the conference control terminal by the server, and controls the camera to aim at the corresponding orientation according to the speaking instruction, so as to realize the automatic positioning of the camera in the multimedia conference system, the speaker video It is automatically displayed on the display screen, so that the chairman station can trigger the speaking command to indicate who is speaking through the conference terminal.
- the corresponding speaker video is displayed on the display screen of the conference site, which greatly improves the conference effect and improves the user experience.
- FIG. 1 is a hardware architecture diagram of a multimedia conference system implementing various embodiments of the present invention
- FIG. 2 is a schematic flowchart of a first embodiment of a multimedia conference control method according to the present invention
- FIG. 3 is a schematic flowchart of a second embodiment of a multimedia conference control method according to the present invention.
- FIG. 4 is a schematic flowchart diagram of a third embodiment of a multimedia conference control method according to the present invention.
- FIG. 5 is a schematic flowchart diagram of a fourth embodiment of a multimedia conference control method according to the present invention.
- FIG. 6 is a schematic diagram of an effect of an embodiment of an agent list displayed by a conference control terminal according to the present invention.
- FIG. 7 is a schematic diagram of functional modules of a first embodiment of a multimedia conference server according to the present invention.
- FIG. 8 is a schematic diagram of functional modules of a second embodiment of a multimedia conference server according to the present invention.
- FIG. 9 is a schematic diagram of functional modules of a third embodiment of a multimedia conference server according to the present invention.
- FIG. 10 is a schematic diagram of functional modules of a fourth embodiment of a multimedia conference server according to the present invention.
- FIG. 1 is a hardware architecture diagram of a multimedia conference system implementing various embodiments of the present invention.
- the multimedia conference system may include a server 100, a conference control terminal 200, and external devices such as a camera 301, a microphone 302, a display screen 303, an audio 304, and the like.
- the conference control terminal 200 is configured to generate a corresponding instruction according to a command input by the host user and send it to the server 100 to control various operations of the conference service.
- the conference control terminal 200 can be a terminal of a mobile phone, a smart phone, a notebook computer, a PAD (tablet computer), a desktop computer, or the like.
- the camera 301 and the microphone 302 are used to collect audio and video data.
- the display screen 303 and the audio 304 device are configured to output audio and video processed by the multimedia device 102.
- the server 100 may include a multimedia device 102, a softswitch device 103, a resource access device 104, a controller 101, and the like.
- FIG. 1 illustrates a server 100 having various devices, but it should be understood that implementation is not required. All the devices shown. More or fewer devices can be implemented instead.
- the control signaling between the devices in the server 100 can be implemented through the SIP protocol, and the multimedia data passes the RTP protocol (Real-time). Transport Protocol, real-time transport protocol) bearer transmission.
- the softswitch device 103 is configured to control the registration of the various resources (such as camera resources, display resources, microphone resources, etc.) of the terminal 200 and the conference room, call routing, and the like.
- the controller 101 is used for control and management of conference services.
- the multimedia device 102 is used for processing audio and video, such as audio mixing, video puzzles, and the like.
- the resource access device 104 is configured to access a display 303, a camera 301, a microphone 302, an audio 304, and the like in the conference room.
- the present invention provides a multimedia conference control method.
- FIG. 2 is a schematic flowchart diagram of a first embodiment of a multimedia conference control method according to the present invention.
- the multimedia conference control method includes:
- Step S10 When receiving the speaking instruction sent by the conference control terminal, the server determines, according to the speaking instruction, the orientation information corresponding to the corresponding speaking seat and the speaking seat;
- the host user may trigger a speaking instruction for instructing the corresponding speaker to speak by the conference control terminal, and the conference control terminal sends the speaking instruction to the server, and when the server receives the speaking instruction, according to the The speaking instruction determines the orientation information corresponding to the corresponding speaker and the speaker to control the camera to align the corresponding orientation to perform the shooting of the speaker video.
- the conference control terminal may add the agent information corresponding to the speaker as the speaker information to the speaking instruction, and when receiving the speaking instruction, the server determines the corresponding speaker information according to the speaking instruction. And querying the orientation information corresponding to the speaker locally saved by the server according to the speaker information, so as to adjust the camera to shoot the video according to the orientation information.
- the server can communicate with the conference control terminal through a SIP protocol.
- the speaking instruction may be transmitted between the server and the conference control terminal in a format of an INFO message.
- Step S20 the server adjusts a camera shooting speaker video according to the determined orientation information
- the server adjusts, according to the determined orientation information, that the corresponding camera is aligned with the speaker to perform the shooting of the video of the speaker.
- the orientation information may include a preset shooting angle for the server to adjust a corresponding camera angle according to the shooting angle to align the speaker.
- the camera may be a single one or a plurality of cameras. When there are a plurality of cameras for capturing the video of the speaker seat, the orientation information of each camera is respectively set corresponding to the same agent, and the server is configured according to each camera. The corresponding orientation information controls the angle adjustment of each camera.
- the server may further determine the corresponding speaker information when receiving the speaking instruction of the conference control terminal, and control to open the microphone device corresponding to the speaker to collect the speaker audio data, and collect the speaker's data. After the audio data is transmitted, it is mixed by the media server in the server and sent to the audio device for output.
- Step S30 the server sends the video of the speaker to a display screen for display.
- the server can send the video of the speaker shot by the camera to the display screen for display by using the RTP protocol.
- the conference control terminal may further add, to the speaking instruction, a control command for displaying a video of the speaker selected by the host user, and the server determines, according to the speaking instruction, whether to send the corresponding video of the speaker. Displaying to the display screen, if yes, the server sends the video of the speaker to the display screen for display; if not, deleting the video of the speaker.
- the server can be connected to the display through a VGA/HDMI/DVI/SDI interface.
- the server receives the speaking instruction sent by the user based on the conference control terminal, and controls the camera to align the corresponding orientation according to the speaking instruction to perform the shooting of the speaker video, thereby realizing the automatic alignment of the camera in the multimedia conference system.
- the video is automatically displayed on the display screen, so that the chairman station can trigger the speaking command to indicate who is speaking through the conference terminal.
- the corresponding speaker video is displayed on the display screen of the conference site, which greatly improves the conference effect and improves the user experience.
- FIG. 3 is a schematic flowchart diagram of a second embodiment of a multimedia conference control method according to the present invention. Based on the first embodiment of the foregoing multimedia conference control method, after the step S30, the method further includes:
- Step S40 the server receives video data of each of the sub-sites through a network connection
- the server can receive video data of each sub-site through the RTP protocol.
- the server may connect to a remote conference site server or a SIP conference terminal of the conference site through the network to receive video data of each of the conference sites.
- Step S50 the server performs the puzzle processing on the video data of each of the sub-sites to obtain a puzzle video
- the server can implement the puzzle processing of the video data of each of the sub-sites through the multimedia device in the server to obtain a puzzle video containing the video of each of the sub-sites.
- the server can perform puzzle processing in various ways, for example, 1+1 (1 main conference video + 1 sub-site video), 4 sub-screen, 6-screen, 1+4 (1 main conference video + 4) Video of the sub-site), 1+5 (1 main venue video + 5 sub-site videos), 9-screen and so on.
- Step S60 the server sends the puzzle video to a display screen for display.
- the server sends the puzzle video to a display for display.
- the display screen accessed by the resource access device of the server may be a single display screen or multiple display screens. For example, when multiple screens are accessed, the One display is used to display the puzzle video of all the venues, the second display is used to display the speaker video, and the third display is used to display documents such as PPT.
- the video data of each sub-site is received by the server, and the jigsaw video is displayed according to the video data, and the video of each sub-site is displayed, which improves the conference effect and improves the user experience.
- FIG. 4 is a schematic flowchart diagram of a third embodiment of a multimedia conference control method according to the present invention. Based on the second embodiment of the foregoing multimedia conference control method, the step S40 includes:
- Step S41 the server detects the network bandwidth of the network connection in real time when receiving the video data of the conference site through the network connection;
- Step S42 the server determines a video bit rate and a video resolution corresponding to the changed network bandwidth when detecting that the network bandwidth changes.
- step S43 the server switches to the determined video bit rate and video resolution to continue receiving video data.
- the server detects the network bandwidth of the network connection in real time during the process of receiving the video data of the sub-site through the network connection; when detecting the change of the network bandwidth, the server determines the video code corresponding to the changed network bandwidth. Rate and video resolution; the server switches to the determined video bit rate and video resolution to continue receiving video data. For example, the server receives the video data of the conference site according to the code rate of 2000 kbps, detects that the network bandwidth changes, and the changed network bandwidth conforms to the 800 kbps code rate, and the server switches to the 800 kbps code rate to continue receiving the current location. Video data.
- the resolution and the code rate of the video are adjusted according to the network bandwidth, which avoids problems such as video jamming and flower screen caused by network deterioration during the conference, and can automatically adjust the video resolution and code rate to adapt to the network bandwidth when the network is deteriorated. , to achieve the best video effects under the current network bandwidth conditions, improve the user experience.
- FIG. 5 is a schematic flowchart diagram of a fourth embodiment of a multimedia conference control method according to the present invention. Based on the first embodiment of the foregoing multimedia conference control method, before the step S10, the method further includes:
- step S11 the server displays a preset agent list through the conference control terminal, so that the user determines the speaker based on the agent list and triggers a corresponding speaking instruction;
- Step S12 The server receives the speaking instruction sent by the conference control terminal.
- the server displays a preset agent list through the conference control terminal, so that the user determines the speaker based on the agent list and triggers a corresponding speaking instruction, and the server receives the statement sent by the conference control terminal to The speaking instruction performs a corresponding operation.
- the agent list may be stored in the server, and when the server receives a display instruction triggered by the conference terminal, the server sends the agent list to the conference terminal for display.
- the conference control terminal may trigger a corresponding speaking instruction to indicate that the participant in the agent speaks when detecting the click operation of the host user based on the agent list.
- FIG. 6 is a schematic diagram of an effect of an embodiment of a seat list displayed by a conference control terminal according to the present invention.
- the server may further receive, according to the setting instruction sent by the conference control terminal, the agent list input by the user based on the conference terminal and the orientation information corresponding to each agent; the server The received seat list and the orientation information corresponding to each seat are saved.
- the conference control terminal triggers the corresponding speaking instruction, and receives the speaking instruction sent by the conference control terminal through the server, and controls the camera to align the corresponding orientation according to the speaking instruction to perform the shooting of the speaker video, thereby realizing the automatic camera pair in the multimedia conference system.
- the prospective spokesperson and the spokesperson video are automatically displayed on the display screen, so that the podium user can trigger the speaking command to indicate who is speaking through the conference control terminal, and the corresponding speaker video is displayed on the display screen of the conference site, which greatly improves the conference effect. , improved user experience.
- the execution bodies of the multimedia conference control methods of the foregoing first to fourth embodiments may each be a multimedia conference system or a server disposed in the multimedia conference system. Further, the multimedia conference control method may be implemented by a client control program installed in the multimedia conference system or the multimedia conference server.
- the invention further provides a multimedia conference server.
- FIG. 7 is a schematic diagram of functional modules of a first embodiment of a multimedia conference server according to the present invention.
- the multimedia conference server includes: a receiving module 10, a control module 20, and a sending module 30.
- the receiving module 10 is configured to, according to the speaking instruction, determine the orientation information corresponding to the speaking seat and the speaking seat according to the speaking instruction when receiving the speaking instruction sent by the conference controlling terminal;
- the host user may trigger a speaking instruction for instructing the corresponding speaker to speak by the conference control terminal, and the conference control terminal sends the speaking instruction to the server, and when the server receives the speaking instruction, according to the The speaking instruction determines the orientation information corresponding to the corresponding speaker and the speaker to control the camera to align the corresponding orientation to perform the shooting of the speaker video.
- the conference control terminal may add the agent information corresponding to the speaker as the speaker information to the speaking instruction, and when receiving the speaking instruction, the server determines the corresponding speaker information according to the speaking instruction. And querying the orientation information corresponding to the speaker locally saved by the server according to the speaker information, so as to adjust the camera to shoot the video according to the orientation information.
- the server can communicate with the conference control terminal through a SIP protocol.
- the speaking instruction may be transmitted between the server and the conference control terminal in a format of an INFO message.
- the control module 20 is configured to adjust a camera shooting speaker video according to the determined orientation information
- the server adjusts, according to the determined orientation information, that the corresponding camera is aligned with the speaker to perform the shooting of the video of the speaker.
- the orientation information may include a preset shooting angle for the server to adjust a corresponding camera angle according to the shooting angle to align the speaker.
- the camera may be a single one or a plurality of cameras. When there are a plurality of cameras for capturing the video of the speaker seat, the orientation information of each camera is respectively set corresponding to the same agent, and the server is configured according to each camera. The corresponding orientation information controls the angle adjustment of each camera.
- the server may further determine the corresponding speaker information when receiving the speaking instruction of the conference control terminal, and control to open the microphone device corresponding to the speaker to collect the speaker audio data, and collect the speaker's data. After the audio data is transmitted, it is mixed by the media server in the server and sent to the audio device for output.
- the sending module 30 is configured to send the video of the speaker to a display screen for display.
- the server can send the video of the speaker shot by the camera to the display screen for display by using the RTP protocol.
- the conference control terminal may further add, to the speaking instruction, a control command for displaying a video of the speaker selected by the host user, and the server determines, according to the speaking instruction, whether to send the corresponding video of the speaker. Displaying to the display screen, if yes, the server sends the video of the speaker to the display screen for display; if not, deleting the video of the speaker.
- the server can be connected to the display through a VGA/HDMI/DVI/SDI interface.
- the server receives the speaking instruction sent by the user based on the conference control terminal, and controls the camera to align the corresponding orientation according to the speaking instruction to perform the shooting of the speaker video, thereby realizing the automatic alignment of the camera in the multimedia conference system.
- the video is automatically displayed on the display screen, so that the chairman station can trigger the speaking command to indicate who is speaking through the conference terminal.
- the corresponding speaker video is displayed on the display screen of the conference site, which greatly improves the conference effect and improves the user experience.
- FIG 8 is a schematic diagram of functional modules of a second embodiment of the apparatus of the present invention.
- the multimedia conference server further includes a multimedia module 40.
- the receiving module 10 is further configured to receive video data of each of the sub-sites through a network connection;
- the server can receive video data of each sub-site through the RTP protocol.
- the server may connect to a remote conference site server or a SIP conference terminal of the conference site through the network to receive video data of each of the conference sites.
- the multimedia module 40 is configured to perform puzzle processing on video data of each of the sub-sites to obtain a puzzle video
- the server can implement the puzzle processing of the video data of each of the sub-sites through the multimedia device in the server to obtain a puzzle video containing the video of each of the sub-sites.
- the server can perform puzzle processing in various ways, for example, 1+1 (1 main conference video + 1 sub-site video), 4 sub-screen, 6-screen, 1+4 (1 main conference video + 4) Video of the sub-site), 1+5 (1 main venue video + 5 sub-site videos), 9-screen and so on.
- the sending module 30 is further configured to send the puzzle video to a display screen for display.
- the server sends the puzzle video to a display for display.
- the display screen accessed by the resource access device of the server may be a single display screen or multiple display screens. For example, when multiple screens are accessed, the One display is used to display the puzzle video of all the venues, the second display is used to display the speaker video, and the third display is used to display documents such as PPT.
- the video data of each sub-site is received by the server, and the jigsaw video is displayed according to the video data, and the video of each sub-site is displayed, which improves the conference effect and improves the user experience.
- FIG. 9 is a schematic diagram of functional modules of a third embodiment of the apparatus of the present invention.
- the receiving module 10 includes a detecting unit 11, a determining unit 12, and a switching unit 13 based on the second embodiment of the multimedia conference server.
- the detecting unit 11 is configured to detect a network bandwidth of the network connection in real time when receiving video data of a sub-site through a network connection;
- the determining unit 12 is configured to determine a video bit rate and a video resolution corresponding to the changed network bandwidth when detecting that the network bandwidth changes;
- the switching unit 13 is configured to switch to the determined video bit rate and video resolution to continue receiving video data.
- the server detects the network bandwidth of the network connection in real time during the process of receiving the video data of the sub-site through the network connection; when detecting the change of the network bandwidth, the server determines the video code corresponding to the changed network bandwidth. Rate and video resolution; the server switches to the determined video bit rate and video resolution to continue receiving video data. For example, the server receives the video data of the conference site according to the code rate of 2000 kbps, detects that the network bandwidth changes, and the changed network bandwidth conforms to the 800 kbps code rate, and the server switches to the 800 kbps code rate to continue receiving the current location. Video data.
- the resolution and the code rate of the video are adjusted according to the network bandwidth, which avoids problems such as video jamming and flower screen caused by network deterioration during the conference, and can automatically adjust the video resolution and code rate to adapt to the network bandwidth when the network is deteriorated. , to achieve the best video effects under the current network bandwidth conditions, improve the user experience.
- FIG. 10 is a schematic diagram of functional modules of a fourth embodiment of the apparatus of the present invention.
- the multimedia conference server further includes a display module 50, based on the first embodiment of the multimedia conference server;
- the display module 50 is configured to display a preset agent list by using the conference control terminal, so that the user determines a speaker based on the agent list and triggers a corresponding speaking instruction;
- the receiving module 10 is further configured to receive a speaking instruction sent by the conference control terminal.
- the server displays a preset agent list through the conference control terminal, so that the user determines the speaker based on the agent list and triggers a corresponding speaking instruction, and the server receives the statement sent by the conference control terminal to The speaking instruction performs a corresponding operation.
- the agent list may be stored in the server, and when the server receives a display instruction triggered by the conference terminal, the server sends the agent list to the conference terminal for display.
- the conference control terminal may trigger a corresponding speaking instruction to indicate that the participant in the agent speaks when detecting the click operation of the host user based on the agent list.
- FIG. 6 is a schematic diagram of an effect of an embodiment of a seat list displayed by a conference control terminal according to the present invention.
- the multimedia conference server further includes a storage module, and the receiving module is further configured to: when receiving the setting instruction sent by the conference control terminal, receive a seat list and each agent input by the user based on the conference control terminal Corresponding orientation information; the storage module is configured to save the received agent list and the orientation information corresponding to each agent.
- the conference control terminal triggers the corresponding speaking instruction, and receives the speaking instruction sent by the conference control terminal through the server, and controls the camera to align the corresponding orientation according to the speaking instruction to perform the shooting of the speaker video, thereby realizing the automatic camera pair in the multimedia conference system.
- the prospective spokesperson and the spokesperson video are automatically displayed on the display screen, so that the podium user can trigger the speaking command to indicate who is speaking through the conference control terminal, and the corresponding speaker video is displayed on the display screen of the conference site, which greatly improves the conference effect. , improved user experience.
- the foregoing embodiment method can be implemented by means of software plus a necessary general hardware platform, and of course, can also be through hardware, but in many cases, the former is better.
- Implementation Based on such understanding, the technical solution of the present invention, which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk,
- the optical disc includes a number of instructions for causing a terminal device (which may be a cell phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in various embodiments of the present invention.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Telephonic Communication Services (AREA)
Abstract
Description
技术领域Technical field
本发明涉及多媒体会议领域,尤其涉及一种多媒体会议控制方法及服务器。The present invention relates to the field of multimedia conferences, and in particular, to a multimedia conference control method and a server.
背景技术Background technique
随着多媒体技术的普及和发展,使得视频会议、远程教学等可视化信息技术在会议室领域得到广泛应用,多媒体会议室以其功能的多样性(如现场会议、学术报告、培训教学等)得到迅速普及。多媒体会议系统是泛指与会议相互关联的声、光、电设备及软件的集成。在多媒体会议室里不管是作报告、总结、汇报、介绍产品等等,用电脑互动操作的图、文、声、影、画展示,充分调动了与会者的感官知觉,大大提高了会议效果。多媒体在办公领域中,也越来越体现出它的优势。但是,在现有的多媒体会议系统中,会场的摄像头多是固定的,无法跟踪拍摄发言人视频,极大的降低了用户体验,With the popularization and development of multimedia technology, visual information technology such as video conferencing and distance learning has been widely used in the conference room field. Multimedia conference rooms are rapidly adopted for their functional diversity (such as on-site conferences, academic reports, training and teaching). popular. The multimedia conference system refers to the integration of sound, light, electrical equipment and software that are interrelated with the conference. In the multimedia conference room, whether it is for reporting, summarizing, reporting, introducing products, etc., the use of computer interactive operation of pictures, texts, sounds, shadows, paintings, fully mobilized the participants' sensory perception, greatly improving the effectiveness of the meeting. Multimedia is increasingly showing its advantages in the office field. However, in the existing multimedia conference system, the cameras of the venue are mostly fixed, and it is impossible to track the video of the speaker, which greatly reduces the user experience.
因此,在多媒体会议系统中摄像头无法跟踪拍摄发言人视频的问题,此方面的问题亟待发明人解决。Therefore, in the multimedia conference system, the camera cannot track the problem of shooting the speaker video, and the problem in this aspect needs to be solved by the inventor.
上述内容仅用于辅助理解本发明的技术方案,并不代表承认上述内容是现有技术。The above content is only used to assist in understanding the technical solutions of the present invention, and does not constitute an admission that the above is prior art.
发明内容Summary of the invention
本发明的主要目的在于解决在多媒体会议系统中,摄像头无法跟踪拍摄发言人视频的问题。The main object of the present invention is to solve the problem that the camera cannot track the video of the speaker in the multimedia conference system.
为实现上述目的,本发明提供一种多媒体会议控制方法,所述多媒体会议控制方法包括以下步骤:To achieve the above objective, the present invention provides a multimedia conference control method, where the multimedia conference control method includes the following steps:
服务器在接收到会控终端发送的发言指令时,根据所述发言指令确定对应的发言席及所述发言席对应的方位信息;When receiving the speaking instruction sent by the conference control terminal, the server determines, according to the speaking instruction, the orientation information corresponding to the corresponding speaking seat and the speaking seat;
所述服务器根据所确定的方位信息调整摄像头拍摄发言席视频;The server adjusts a camera shooting speaker video according to the determined orientation information;
所述服务器将所述发言席视频发送至显示屏进行显示。The server sends the speaker video to a display screen for display.
可选的,所述服务器在接收到会控终端的发言指令时,根据所述发言指令确定对应的发言席及所述发言席对应的方位信息的步骤之前,还包括:Optionally, before the step of determining, by the server according to the speaking instruction, the step of determining the orientation information corresponding to the speaker and the speaker according to the speaking instruction, the server further includes:
所述服务器通过所述会控终端显示预设的坐席列表,以供用户基于所述坐席列表确定发言席并触发对应的发言指令;The server displays a preset agent list by using the conference control terminal, so that the user determines the speaker based on the agent list and triggers a corresponding speaking instruction;
所述服务器接收所述会控终端发送的发言指令。The server receives a speaking instruction sent by the conference control terminal.
可选的,所述服务器通过所述会控终端显示预设的坐席列表,以供用户基于所述坐席列表确定发言席并触发对应的发言指令的步骤之前,还包括:Optionally, before the step of the server displaying the preset agent list by the conference control terminal for the user to determine the speaker based on the agent list and triggering the corresponding speaking instruction, the method further includes:
所述服务器在接收到所述会控终端发送的设置指令时,接收用户基于所述会控终端输入的坐席列表及各个坐席对应的方位信息;Receiving, by the server, a setting instruction sent by the conference control terminal, receiving, by the user, the agent list input by the conference control terminal and the orientation information corresponding to each agent;
所述服务器保存所接收到的坐席列表及各个坐席对应的方位信息。The server saves the received agent list and the orientation information corresponding to each agent.
可选的,所述服务器将所述发言席视频发送至显示屏进行显示的步骤之后,还包括:Optionally, after the step of the server sending the video to the display screen for display, the method further includes:
所述服务器通过网络连接接收各个分会场的视频数据;The server receives video data of each of the sub-sites through a network connection;
所述服务器将各个分会场的视频数据进行拼图处理,得到拼图视频;The server performs jigsaw processing on the video data of each of the sub-sites to obtain a puzzle video;
所述服务器将所述拼图视频发送至显示屏进行显示。The server sends the puzzle video to a display for display.
可选的,所述服务器在接收到会控终端的发言指令时,根据所述发言指令确定对应的发言席及所述发言席对应的方位信息的步骤之前,还包括:Optionally, before the step of determining, by the server according to the speaking instruction, the step of determining the orientation information corresponding to the speaker and the speaker according to the speaking instruction, the server further includes:
所述服务器通过所述会控终端显示预设的坐席列表,以供用户基于所述坐席列表确定发言席并触发对应的发言指令;The server displays a preset agent list by using the conference control terminal, so that the user determines the speaker based on the agent list and triggers a corresponding speaking instruction;
所述服务器接收所述会控终端发送的发言指令。The server receives a speaking instruction sent by the conference control terminal.
可选的,所述服务器通过所述会控终端显示预设的坐席列表,以供用户基于所述坐席列表确定发言席并触发对应的发言指令的步骤之前,还包括:Optionally, before the step of the server displaying the preset agent list by the conference control terminal for the user to determine the speaker based on the agent list and triggering the corresponding speaking instruction, the method further includes:
所述服务器在接收到所述会控终端发送的设置指令时,接收用户基于所述会控终端输入的坐席列表及各个坐席对应的方位信息;Receiving, by the server, a setting instruction sent by the conference control terminal, receiving, by the user, the agent list input by the conference control terminal and the orientation information corresponding to each agent;
所述服务器保存所接收到的坐席列表及各个坐席对应的方位信息。The server saves the received agent list and the orientation information corresponding to each agent.
可选的,所述服务器通过网络连接接收各个分会场的视频数据的步骤包括:Optionally, the step of the server receiving the video data of each of the sub-sites through the network connection includes:
所述服务器在通过网络连接接收分会场的视频数据时,实时检测所述网络连接的网络带宽;The server detects the network bandwidth of the network connection in real time when receiving the video data of the conference site through the network connection;
所述服务器在检测到所述网络带宽发生变化时,确定变化后的网络带宽对应的视频码率及视频分辨率;The server determines a video bit rate and a video resolution corresponding to the changed network bandwidth when detecting that the network bandwidth changes;
所述服务器切换至所确定的视频码率及视频分辨率继续接收视频数据。The server switches to the determined video bit rate and video resolution to continue receiving video data.
可选的,所述服务器在接收到会控终端的发言指令时,根据所述发言指令确定对应的发言席及所述发言席对应的方位信息的步骤之前,还包括:Optionally, before the step of determining, by the server according to the speaking instruction, the step of determining the orientation information corresponding to the speaker and the speaker according to the speaking instruction, the server further includes:
所述服务器通过所述会控终端显示预设的坐席列表,以供用户基于所述坐席列表确定发言席并触发对应的发言指令;The server displays a preset agent list by using the conference control terminal, so that the user determines the speaker based on the agent list and triggers a corresponding speaking instruction;
所述服务器接收所述会控终端发送的发言指令。The server receives a speaking instruction sent by the conference control terminal.
可选的,所述服务器通过所述会控终端显示预设的坐席列表,以供用户基于所述坐席列表确定发言席并触发对应的发言指令的步骤之前,还包括:Optionally, before the step of the server displaying the preset agent list by the conference control terminal for the user to determine the speaker based on the agent list and triggering the corresponding speaking instruction, the method further includes:
所述服务器在接收到所述会控终端发送的设置指令时,接收用户基于所述会控终端输入的坐席列表及各个坐席对应的方位信息;Receiving, by the server, a setting instruction sent by the conference control terminal, receiving, by the user, the agent list input by the conference control terminal and the orientation information corresponding to each agent;
所述服务器保存所接收到的坐席列表及各个坐席对应的方位信息。The server saves the received agent list and the orientation information corresponding to each agent.
此外,为实现上述目的,本发明还提供一种多媒体会议服务器,所述多媒体会议服务器包括:In addition, to achieve the above object, the present invention further provides a multimedia conference server, where the multimedia conference server includes:
接收模块,用于在接收到会控终端发送的发言指令时,根据所述发言指令确定对应的发言席及所述发言席对应的方位信息;The receiving module is configured to: when receiving the speaking instruction sent by the conference control terminal, determine, according to the speaking instruction, the orientation information corresponding to the corresponding speaking seat and the speaking seat;
控制模块,用于根据所确定的方位信息调整摄像头拍摄发言席视频;a control module, configured to adjust a camera shooting speaker video according to the determined orientation information;
发送模块,用于将所述发言席视频发送至显示屏进行显示。a sending module, configured to send the speaker video to a display screen for display.
可选的,所述多媒体会议服务器还包括显示模块;Optionally, the multimedia conference server further includes a display module;
所述显示模块,用于通过所述会控终端显示预设的坐席列表,以供用户基于所述坐席列表确定发言席并触发对应的发言指令;The display module is configured to display a preset agent list by using the conference control terminal, so that the user determines a speaker based on the agent list and triggers a corresponding speaking instruction;
所述接收模块,还用于接收所述会控终端发送的发言指令。The receiving module is further configured to receive a speaking instruction sent by the conference control terminal.
可选的,所述多媒体会议服务器还包括存储模块;Optionally, the multimedia conference server further includes a storage module;
所述接收模块,还用于在接收到所述会控终端发送的设置指令时,接收用户基于所述会控终端输入的坐席列表及各个坐席对应的方位信息;The receiving module is further configured to: when receiving the setting instruction sent by the conference control terminal, receive the agent list input by the user based on the conference control terminal and the orientation information corresponding to each agent;
所述存储模块,用于保存所接收到的坐席列表及各个坐席对应的方位信息。The storage module is configured to save the received agent list and the orientation information corresponding to each agent.
可选的,所述多媒体会议服务器还包括多媒体模块;Optionally, the multimedia conference server further includes a multimedia module;
所述接收模块,还用于通过网络连接接收各个分会场的视频数据;The receiving module is further configured to receive video data of each of the sub-sites through a network connection;
所述多媒体模块,用于将各个分会场的视频数据进行拼图处理,得到拼图视频;The multimedia module is configured to perform jigsaw processing on video data of each of the sub-sites to obtain a puzzle video;
所述发送模块,还用于将所述拼图视频发送至显示屏进行显示。The sending module is further configured to send the puzzle video to a display screen for display.
可选的,所述多媒体会议服务器还包括显示模块;Optionally, the multimedia conference server further includes a display module;
所述显示模块,用于通过所述会控终端显示预设的坐席列表,以供用户基于所述坐席列表确定发言席并触发对应的发言指令;The display module is configured to display a preset agent list by using the conference control terminal, so that the user determines a speaker based on the agent list and triggers a corresponding speaking instruction;
所述接收模块,还用于接收所述会控终端发送的发言指令。The receiving module is further configured to receive a speaking instruction sent by the conference control terminal.
可选的,所述多媒体会议服务器还包括存储模块;Optionally, the multimedia conference server further includes a storage module;
所述接收模块,还用于在接收到所述会控终端发送的设置指令时,接收用户基于所述会控终端输入的坐席列表及各个坐席对应的方位信息;The receiving module is further configured to: when receiving the setting instruction sent by the conference control terminal, receive the agent list input by the user based on the conference control terminal and the orientation information corresponding to each agent;
所述存储模块,用于保存所接收到的坐席列表及各个坐席对应的方位信息。The storage module is configured to save the received agent list and the orientation information corresponding to each agent.
可选的,所述接收模块包括检测单元、确定单元和切换单元;Optionally, the receiving module includes a detecting unit, a determining unit, and a switching unit;
所述检测单元,用于在通过网络连接接收分会场的视频数据时,实时检测所述网络连接的网络带宽;The detecting unit is configured to detect a network bandwidth of the network connection in real time when receiving video data of a sub-site through a network connection;
所述确定单元,用于在检测到所述网络带宽发生变化时,确定变化后的网络带宽对应的视频码率及视频分辨率;The determining unit is configured to determine a video bit rate and a video resolution corresponding to the changed network bandwidth when detecting that the network bandwidth changes;
所述切换单元,用于切换至所确定的视频码率及视频分辨率继续接收视频数据。The switching unit is configured to switch to the determined video bit rate and video resolution to continue receiving video data.
可选的,所述多媒体会议服务器还包括显示模块;Optionally, the multimedia conference server further includes a display module;
所述显示模块,用于通过所述会控终端显示预设的坐席列表,以供用户基于所述坐席列表确定发言席并触发对应的发言指令;The display module is configured to display a preset agent list by using the conference control terminal, so that the user determines a speaker based on the agent list and triggers a corresponding speaking instruction;
所述接收模块,还用于接收所述会控终端发送的发言指令。The receiving module is further configured to receive a speaking instruction sent by the conference control terminal.
可选的,所述多媒体会议服务器还包括存储模块;Optionally, the multimedia conference server further includes a storage module;
所述接收模块,还用于在接收到所述会控终端发送的设置指令时,接收用户基于所述会控终端输入的坐席列表及各个坐席对应的方位信息;The receiving module is further configured to: when receiving the setting instruction sent by the conference control terminal, receive the agent list input by the user based on the conference control terminal and the orientation information corresponding to each agent;
所述存储模块,用于保存所接收到的坐席列表及各个坐席对应的方位信息。The storage module is configured to save the received agent list and the orientation information corresponding to each agent.
本发明通过服务器接收用户基于会控终端发送的发言指令,并根据该发言指令控制摄像头对准对应的方位进行发言人视频的拍摄,实现了多媒体会议系统中摄像头自动对准发言人,发言人视频自动显示到显示屏,使得主席台用户能够通过会控终端触发发言指令指示谁发言,对应的发言人视频就显示在会场的显示屏上,极大的提高了会议效果,提升了用户体验。The invention receives the speaking instruction sent by the user based on the conference control terminal by the server, and controls the camera to aim at the corresponding orientation according to the speaking instruction, so as to realize the automatic positioning of the camera in the multimedia conference system, the speaker video It is automatically displayed on the display screen, so that the chairman station can trigger the speaking command to indicate who is speaking through the conference terminal. The corresponding speaker video is displayed on the display screen of the conference site, which greatly improves the conference effect and improves the user experience.
附图说明DRAWINGS
图1为实现本发明各个实施例的多媒体会议系统的硬件架构图;1 is a hardware architecture diagram of a multimedia conference system implementing various embodiments of the present invention;
图2为本发明多媒体会议控制方法的第一实施例的流程示意图;2 is a schematic flowchart of a first embodiment of a multimedia conference control method according to the present invention;
图3为本发明多媒体会议控制方法的第二实施例的流程示意图;3 is a schematic flowchart of a second embodiment of a multimedia conference control method according to the present invention;
图4为本发明多媒体会议控制方法的第三实施例的流程示意图;4 is a schematic flowchart diagram of a third embodiment of a multimedia conference control method according to the present invention;
图5为本发明多媒体会议控制方法的第四实施例的流程示意图;FIG. 5 is a schematic flowchart diagram of a fourth embodiment of a multimedia conference control method according to the present invention; FIG.
图6为本发明中通过会控终端显示的坐席列表的一实施例的效果示意图;6 is a schematic diagram of an effect of an embodiment of an agent list displayed by a conference control terminal according to the present invention;
图7为本发明多媒体会议服务器的第一实施例的功能模块示意图;7 is a schematic diagram of functional modules of a first embodiment of a multimedia conference server according to the present invention;
图8为本发明多媒体会议服务器的第二实施例的功能模块示意图;8 is a schematic diagram of functional modules of a second embodiment of a multimedia conference server according to the present invention;
图9为本发明多媒体会议服务器的第三实施例的功能模块示意图;9 is a schematic diagram of functional modules of a third embodiment of a multimedia conference server according to the present invention;
图10为本发明多媒体会议服务器的第四实施例的功能模块示意图。FIG. 10 is a schematic diagram of functional modules of a fourth embodiment of a multimedia conference server according to the present invention.
本发明目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The implementation, functional features, and advantages of the present invention will be further described in conjunction with the embodiments.
具体实施方式detailed description
应当理解,此处所描述的具体实施例仅仅用以解释本发明,并不用于限定本发明。It is understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
现在将参考附图描述实现本发明各个实施例的多媒体会议系统。图1为实现本发明各个实施例的多媒体会议系统的硬件架构图。多媒体会议系统可以包括服务器100、会控终端200以及诸如摄像头301、麦克风302、显示屏303、音响304等等的外部设备。A multimedia conference system implementing various embodiments of the present invention will now be described with reference to the accompanying drawings. FIG. 1 is a hardware architecture diagram of a multimedia conference system implementing various embodiments of the present invention. The multimedia conference system may include a server 100, a conference control terminal 200, and external devices such as a camera 301, a microphone 302, a display screen 303, an audio 304, and the like.
所述会控终端200用于根据主持人用户所输入的命令生成对应的指令并发送至服务器100,以控制会议业务的各种操作。所述会控终端200可以为移动电话、智能电话、笔记本电脑、PAD(平板电脑)、台式计算机等等的终端。The conference control terminal 200 is configured to generate a corresponding instruction according to a command input by the host user and send it to the server 100 to control various operations of the conference service. The conference control terminal 200 can be a terminal of a mobile phone, a smart phone, a notebook computer, a PAD (tablet computer), a desktop computer, or the like.
所述摄像头301、麦克风302用于采集音视频数据。所述显示屏303及所述音响304设备用于输出多媒体设备102处理后的音视频。The camera 301 and the microphone 302 are used to collect audio and video data. The display screen 303 and the audio 304 device are configured to output audio and video processed by the multimedia device 102.
所述服务器100可以包括多媒体设备102、软交换设备103、资源接入设备104和控制器101等等,图1示出了具有各种设备的服务器100,但是应理解的是,并不要求实施所有示出的设备。可以替代地实施更多或更少的设备。所述服务器100内部的各个设备之间的控制信令可以通过SIP协议实现,多媒体数据通过RTP协议(Real-time Transport Protocol,实时传输协议)承载传输。所述软交换设备103用于会控终端200及会议室各种资源(如摄像头资源、显示屏资源、麦克风资源等)的注册、呼叫路由等。所述控制器101用于会议业务的控制与管理。所述多媒体设备102用于音视频的处理,例如:音频的混音、视频的拼图等。所述资源接入设备104用于接入会议室内的显示屏303、摄像头301、麦克风302、音响304等设备。The server 100 may include a multimedia device 102, a softswitch device 103, a resource access device 104, a controller 101, and the like. FIG. 1 illustrates a server 100 having various devices, but it should be understood that implementation is not required. All the devices shown. More or fewer devices can be implemented instead. The control signaling between the devices in the server 100 can be implemented through the SIP protocol, and the multimedia data passes the RTP protocol (Real-time). Transport Protocol, real-time transport protocol) bearer transmission. The softswitch device 103 is configured to control the registration of the various resources (such as camera resources, display resources, microphone resources, etc.) of the terminal 200 and the conference room, call routing, and the like. The controller 101 is used for control and management of conference services. The multimedia device 102 is used for processing audio and video, such as audio mixing, video puzzles, and the like. The resource access device 104 is configured to access a display 303, a camera 301, a microphone 302, an audio 304, and the like in the conference room.
基于上述多媒体会议系统的硬件架构,本发明提供一种多媒体会议控制方法。Based on the hardware architecture of the multimedia conference system described above, the present invention provides a multimedia conference control method.
参照图2,图2为本发明多媒体会议控制方法的第一实施例的流程示意图。Referring to FIG. 2, FIG. 2 is a schematic flowchart diagram of a first embodiment of a multimedia conference control method according to the present invention.
在本实施例中,所述多媒体会议控制方法包括:In this embodiment, the multimedia conference control method includes:
步骤S10,服务器在接收到会控终端发送的发言指令时,根据所述发言指令确定对应的发言席及所述发言席对应的方位信息;Step S10: When receiving the speaking instruction sent by the conference control terminal, the server determines, according to the speaking instruction, the orientation information corresponding to the corresponding speaking seat and the speaking seat;
可以由主持人用户通过会控终端触发用于指示对应的发言人进行发言的发言指令,所述会控终端将所述发言指令发送至服务器,所述服务器在接收到所述发言指令时,根据所述发言指令确定对应的发言席及所述发言席对应的方位信息,以控制摄像头对准对应方位进行发言人视频的拍摄。The host user may trigger a speaking instruction for instructing the corresponding speaker to speak by the conference control terminal, and the conference control terminal sends the speaking instruction to the server, and when the server receives the speaking instruction, according to the The speaking instruction determines the orientation information corresponding to the corresponding speaker and the speaker to control the camera to align the corresponding orientation to perform the shooting of the speaker video.
所述会控终端可以将发言人所对应的坐席信息作为发言席信息添加至所述发言指令中,所述服务器在接收到所述发言指令时,根据所述发言指令确定对应的发言席信息,并根据所述发言席信息查询所述服务器本地保存的所述发言席对应的方位信息,以根据所述方位信息调整摄像头拍摄发言席视频。The conference control terminal may add the agent information corresponding to the speaker as the speaker information to the speaking instruction, and when receiving the speaking instruction, the server determines the corresponding speaker information according to the speaking instruction. And querying the orientation information corresponding to the speaker locally saved by the server according to the speaker information, so as to adjust the camera to shoot the video according to the orientation information.
所述服务器可以通过SIP协议与所述会控终端之间进行通信。所述发言指令可以以INFO消息的格式在所述服务器与所述会控终端之间进行传输。The server can communicate with the conference control terminal through a SIP protocol. The speaking instruction may be transmitted between the server and the conference control terminal in a format of an INFO message.
步骤S20,所述服务器根据所确定的方位信息调整摄像头拍摄发言席视频;Step S20, the server adjusts a camera shooting speaker video according to the determined orientation information;
所述服务器根据所确定的方位信息调整对应的摄像头对准所述发言席进行发言席视频的拍摄。所述方位信息可以包括预设的拍摄角度,以供服务器根据所述拍摄角度调整对应的摄像头角度以对准所述发言席。进一步的,所述摄像头可以为单独的一个或者也可以是多个,当用于拍摄发言席视频的摄像头为多个时,对应于同一坐席分别设置各个摄像头的方位信息,所述服务器根据各个摄像头对应的方位信息控制各个摄像头的角度调整。The server adjusts, according to the determined orientation information, that the corresponding camera is aligned with the speaker to perform the shooting of the video of the speaker. The orientation information may include a preset shooting angle for the server to adjust a corresponding camera angle according to the shooting angle to align the speaker. Further, the camera may be a single one or a plurality of cameras. When there are a plurality of cameras for capturing the video of the speaker seat, the orientation information of each camera is respectively set corresponding to the same agent, and the server is configured according to each camera. The corresponding orientation information controls the angle adjustment of each camera.
进一步的,所述服务器还可以在接收到会控终端的发言指令时,确定对应的发言席信息,并控制打开所述发言席对应的麦克风设备以采集发言人音频数据,在采集到发言人的音频数据后,通过服务器内的媒体服务器进行混音处理后发送至音响设备输出。Further, the server may further determine the corresponding speaker information when receiving the speaking instruction of the conference control terminal, and control to open the microphone device corresponding to the speaker to collect the speaker audio data, and collect the speaker's data. After the audio data is transmitted, it is mixed by the media server in the server and sent to the audio device for output.
步骤S30,所述服务器将所述发言席视频发送至显示屏进行显示。Step S30, the server sends the video of the speaker to a display screen for display.
所述服务器可以通过RTP协议将摄像头拍摄的发言席视频发送至显示屏进行显示。进一步的,所述会控终端还可以将主持人用户所选择的是否显示发言席视频的控制命令添加至所述发言指令中,所述服务器根据所述发言指令判断是否将对应的发言席视频发送至显示屏进行显示,若是,则所述服务器将所述发言席视频发送至显示屏进行显示;若否,则删除所述发言席视频。The server can send the video of the speaker shot by the camera to the display screen for display by using the RTP protocol. Further, the conference control terminal may further add, to the speaking instruction, a control command for displaying a video of the speaker selected by the host user, and the server determines, according to the speaking instruction, whether to send the corresponding video of the speaker. Displaying to the display screen, if yes, the server sends the video of the speaker to the display screen for display; if not, deleting the video of the speaker.
所述服务器可以通过VGA/HDMI/DVI/SDI接口与所述显示屏进行连接。The server can be connected to the display through a VGA/HDMI/DVI/SDI interface.
本实施例通过服务器接收用户基于会控终端发送的发言指令,并根据该发言指令控制摄像头对准对应的方位进行发言人视频的拍摄,实现了多媒体会议系统中摄像头自动对准发言人,发言人视频自动显示到显示屏,使得主席台用户能够通过会控终端触发发言指令指示谁发言,对应的发言人视频就显示在会场的显示屏上,极大的提高了会议效果,提升了用户体验。In this embodiment, the server receives the speaking instruction sent by the user based on the conference control terminal, and controls the camera to align the corresponding orientation according to the speaking instruction to perform the shooting of the speaker video, thereby realizing the automatic alignment of the camera in the multimedia conference system. The video is automatically displayed on the display screen, so that the chairman station can trigger the speaking command to indicate who is speaking through the conference terminal. The corresponding speaker video is displayed on the display screen of the conference site, which greatly improves the conference effect and improves the user experience.
参照图3,图3为本发明多媒体会议控制方法的第二实施例的流程示意图。基于上述多媒体会议控制方法的第一实施例,所述步骤S30之后,还包括:Referring to FIG. 3, FIG. 3 is a schematic flowchart diagram of a second embodiment of a multimedia conference control method according to the present invention. Based on the first embodiment of the foregoing multimedia conference control method, after the step S30, the method further includes:
步骤S40,所述服务器通过网络连接接收各个分会场的视频数据;Step S40, the server receives video data of each of the sub-sites through a network connection;
所述服务器可以通过RTP协议接收各个分会场的视频数据。所述服务器可以通过网络连接远程的分会场服务器或者分会场的SIP会议终端,以接收各个分会场的视频数据。The server can receive video data of each sub-site through the RTP protocol. The server may connect to a remote conference site server or a SIP conference terminal of the conference site through the network to receive video data of each of the conference sites.
步骤S50,所述服务器将各个分会场的视频数据进行拼图处理,得到拼图视频;Step S50, the server performs the puzzle processing on the video data of each of the sub-sites to obtain a puzzle video;
所述服务器可以通过所述服务器内的多媒体设备实现对各个分会场的视频数据的拼图处理,以得到含有各个分会场视频的拼图视频。所述服务器可以按照各种方式进行拼图处理,例如:1+1(1个主会场视频+1个分会场视频),4分屏,6分屏,1+4(1个主会场视频+4个分会场视频),1+5(1个主会场视频+5个分会场视频),9分屏等等。The server can implement the puzzle processing of the video data of each of the sub-sites through the multimedia device in the server to obtain a puzzle video containing the video of each of the sub-sites. The server can perform puzzle processing in various ways, for example, 1+1 (1 main conference video + 1 sub-site video), 4 sub-screen, 6-screen, 1+4 (1 main conference video + 4) Video of the sub-site), 1+5 (1 main venue video + 5 sub-site videos), 9-screen and so on.
步骤S60,所述服务器将所述拼图视频发送至显示屏进行显示。Step S60, the server sends the puzzle video to a display screen for display.
所述服务器将所述拼图视频发送至显示屏进行显示。进一步的,通过所述服务器的资源接入设备所接入的显示屏可以是一个单独的显示屏或者也可以是多个显示屏,例如:当接入的显示屏为多个时,可以将第一显示屏用于显示所有会场的拼图视频,将第二显示屏用于显示发言人视频,将第三显示屏用于显示PPT等文档。The server sends the puzzle video to a display for display. Further, the display screen accessed by the resource access device of the server may be a single display screen or multiple display screens. For example, when multiple screens are accessed, the One display is used to display the puzzle video of all the venues, the second display is used to display the speaker video, and the third display is used to display documents such as PPT.
本实施例通过服务器接收各个分会场的视频数据,并根据所述视频数据进行拼图处理得到拼图视频进行显示,实现了各个分会场视频的显示,提高了会议效果,提升了用户体验。In this embodiment, the video data of each sub-site is received by the server, and the jigsaw video is displayed according to the video data, and the video of each sub-site is displayed, which improves the conference effect and improves the user experience.
参照图4,图4为本发明多媒体会议控制方法的第三实施例的流程示意图。基于上述多媒体会议控制方法的第二实施例,所述步骤S40包括:Referring to FIG. 4, FIG. 4 is a schematic flowchart diagram of a third embodiment of a multimedia conference control method according to the present invention. Based on the second embodiment of the foregoing multimedia conference control method, the step S40 includes:
步骤S41,所述服务器在通过网络连接接收分会场的视频数据时,实时检测所述网络连接的网络带宽;Step S41, the server detects the network bandwidth of the network connection in real time when receiving the video data of the conference site through the network connection;
步骤S42,所述服务器在检测到所述网络带宽发生变化时,确定变化后的网络带宽对应的视频码率及视频分辨率;Step S42, the server determines a video bit rate and a video resolution corresponding to the changed network bandwidth when detecting that the network bandwidth changes.
步骤S43,所述服务器切换至所确定的视频码率及视频分辨率继续接收视频数据。In step S43, the server switches to the determined video bit rate and video resolution to continue receiving video data.
所述服务器在通过网络连接接收分会场的视频数据过程中,实时检测所述网络连接的网络带宽;所述服务器在检测到所述网络带宽发生变化时,确定变化后的网络带宽对应的视频码率及视频分辨率;所述服务器切换至所确定的视频码率及视频分辨率继续接收视频数据。例如:所述服务器按照2000kbps码率接收分会场的视频数据,检测测到网络带宽发生变化,变化后的网络带宽符合800kbps码率,则所述服务器切换至800kbps码率从当前位置继续接收所述视频数据。The server detects the network bandwidth of the network connection in real time during the process of receiving the video data of the sub-site through the network connection; when detecting the change of the network bandwidth, the server determines the video code corresponding to the changed network bandwidth. Rate and video resolution; the server switches to the determined video bit rate and video resolution to continue receiving video data. For example, the server receives the video data of the conference site according to the code rate of 2000 kbps, detects that the network bandwidth changes, and the changed network bandwidth conforms to the 800 kbps code rate, and the server switches to the 800 kbps code rate to continue receiving the current location. Video data.
本实施例根据网络带宽调整视频的分辨率及码率,避免了开会过程中由于网络恶化造成视频卡顿、花屏等问题,在网络恶化时,能够自动调整视频分辨率及码率以适应网络带宽,实现了在当前网络带宽条件下达到最好的视频效果,提高了用户体验。In this embodiment, the resolution and the code rate of the video are adjusted according to the network bandwidth, which avoids problems such as video jamming and flower screen caused by network deterioration during the conference, and can automatically adjust the video resolution and code rate to adapt to the network bandwidth when the network is deteriorated. , to achieve the best video effects under the current network bandwidth conditions, improve the user experience.
参照图5,图5为本发明多媒体会议控制方法的第四实施例的流程示意图。基于上述多媒体会议控制方法的第一实施例,所述步骤S10之前,还包括:Referring to FIG. 5, FIG. 5 is a schematic flowchart diagram of a fourth embodiment of a multimedia conference control method according to the present invention. Based on the first embodiment of the foregoing multimedia conference control method, before the step S10, the method further includes:
步骤S11,所述服务器通过所述会控终端显示预设的坐席列表,以供用户基于所述坐席列表确定发言席并触发对应的发言指令;In step S11, the server displays a preset agent list through the conference control terminal, so that the user determines the speaker based on the agent list and triggers a corresponding speaking instruction;
步骤S12,所述服务器接收所述会控终端发送的发言指令。Step S12: The server receives the speaking instruction sent by the conference control terminal.
所述服务器通过所述会控终端显示预设的坐席列表,以供用户基于所述坐席列表确定发言席并触发对应的发言指令,所述服务器接收所述会控终端发送的发言至,以根据所述发言指令进行对应的操作。The server displays a preset agent list through the conference control terminal, so that the user determines the speaker based on the agent list and triggers a corresponding speaking instruction, and the server receives the statement sent by the conference control terminal to The speaking instruction performs a corresponding operation.
所述坐席列表可以保存在所述服务器内,在所述服务器接收到用户基于会控终端触发的显示指令时,将所述坐席列表发送至会控终端进行显示。所述会控终端可以在侦测到主持人用户基于所述坐席列表的点击操作时,触发对应的发言指令以指示处于该坐席的与会人员进行发言。具体的,参照图6,图6为本发明中通过会控终端所显示的坐席列表的一实施例的效果示意图。The agent list may be stored in the server, and when the server receives a display instruction triggered by the conference terminal, the server sends the agent list to the conference terminal for display. The conference control terminal may trigger a corresponding speaking instruction to indicate that the participant in the agent speaks when detecting the click operation of the host user based on the agent list. Specifically, referring to FIG. 6, FIG. 6 is a schematic diagram of an effect of an embodiment of a seat list displayed by a conference control terminal according to the present invention.
进一步的,在步骤S11之前,所述服务器还可以在接收到所述会控终端发送的设置指令时,接收用户基于所述会控终端输入的坐席列表及各个坐席对应的方位信息;所述服务器保存所接收到的坐席列表及各个坐席对应的方位信息。Further, before the step S11, the server may further receive, according to the setting instruction sent by the conference control terminal, the agent list input by the user based on the conference terminal and the orientation information corresponding to each agent; the server The received seat list and the orientation information corresponding to each seat are saved.
本实施例会控终端触发对应的发言指令,并通过服务器接收会控终端发送的发言指令,根据该发言指令控制摄像头对准对应的方位进行发言人视频的拍摄,实现了多媒体会议系统中摄像头自动对准发言人,发言人视频自动显示到显示屏,使得主席台用户能够通过会控终端触发发言指令指示谁发言,对应的发言人视频就显示在会场的显示屏上,极大的提高了会议效果,提升了用户体验。In this embodiment, the conference control terminal triggers the corresponding speaking instruction, and receives the speaking instruction sent by the conference control terminal through the server, and controls the camera to align the corresponding orientation according to the speaking instruction to perform the shooting of the speaker video, thereby realizing the automatic camera pair in the multimedia conference system. The prospective spokesperson and the spokesperson video are automatically displayed on the display screen, so that the podium user can trigger the speaking command to indicate who is speaking through the conference control terminal, and the corresponding speaker video is displayed on the display screen of the conference site, which greatly improves the conference effect. , improved user experience.
上述第一至第四实施例的多媒体会议控制方法的执行主体均可以为多媒体会议系统或设置在所述多媒体会议系统内的服务器。更进一步地,该多媒体会议控制方法可以由安装在所述多媒体会议系统或者所述多媒体会议服务器内的客户端控制程序实现。The execution bodies of the multimedia conference control methods of the foregoing first to fourth embodiments may each be a multimedia conference system or a server disposed in the multimedia conference system. Further, the multimedia conference control method may be implemented by a client control program installed in the multimedia conference system or the multimedia conference server.
本发明进一步提供一种多媒体会议服务器。The invention further provides a multimedia conference server.
参照图7,图7为本发明多媒体会议服务器的第一实施例的功能模块示意图。Referring to FIG. 7, FIG. 7 is a schematic diagram of functional modules of a first embodiment of a multimedia conference server according to the present invention.
在本实施例中,所述多媒体会议服务器包括:接收模块10、控制模块20及发送模块30。In this embodiment, the multimedia conference server includes: a receiving module 10, a control module 20, and a sending module 30.
所述接收模块10,用于在接收到会控终端发送的发言指令时,根据所述发言指令确定对应的发言席及所述发言席对应的方位信息;The receiving module 10 is configured to, according to the speaking instruction, determine the orientation information corresponding to the speaking seat and the speaking seat according to the speaking instruction when receiving the speaking instruction sent by the conference controlling terminal;
可以由主持人用户通过会控终端触发用于指示对应的发言人进行发言的发言指令,所述会控终端将所述发言指令发送至服务器,所述服务器在接收到所述发言指令时,根据所述发言指令确定对应的发言席及所述发言席对应的方位信息,以控制摄像头对准对应方位进行发言人视频的拍摄。The host user may trigger a speaking instruction for instructing the corresponding speaker to speak by the conference control terminal, and the conference control terminal sends the speaking instruction to the server, and when the server receives the speaking instruction, according to the The speaking instruction determines the orientation information corresponding to the corresponding speaker and the speaker to control the camera to align the corresponding orientation to perform the shooting of the speaker video.
所述会控终端可以将发言人所对应的坐席信息作为发言席信息添加至所述发言指令中,所述服务器在接收到所述发言指令时,根据所述发言指令确定对应的发言席信息,并根据所述发言席信息查询所述服务器本地保存的所述发言席对应的方位信息,以根据所述方位信息调整摄像头拍摄发言席视频。The conference control terminal may add the agent information corresponding to the speaker as the speaker information to the speaking instruction, and when receiving the speaking instruction, the server determines the corresponding speaker information according to the speaking instruction. And querying the orientation information corresponding to the speaker locally saved by the server according to the speaker information, so as to adjust the camera to shoot the video according to the orientation information.
所述服务器可以通过SIP协议与所述会控终端之间进行通信。所述发言指令可以以INFO消息的格式在所述服务器与所述会控终端之间进行传输。The server can communicate with the conference control terminal through a SIP protocol. The speaking instruction may be transmitted between the server and the conference control terminal in a format of an INFO message.
所述控制模块20,用于根据所确定的方位信息调整摄像头拍摄发言席视频;The control module 20 is configured to adjust a camera shooting speaker video according to the determined orientation information;
所述服务器根据所确定的方位信息调整对应的摄像头对准所述发言席进行发言席视频的拍摄。所述方位信息可以包括预设的拍摄角度,以供服务器根据所述拍摄角度调整对应的摄像头角度以对准所述发言席。进一步的,所述摄像头可以为单独的一个或者也可以是多个,当用于拍摄发言席视频的摄像头为多个时,对应于同一坐席分别设置各个摄像头的方位信息,所述服务器根据各个摄像头对应的方位信息控制各个摄像头的角度调整。The server adjusts, according to the determined orientation information, that the corresponding camera is aligned with the speaker to perform the shooting of the video of the speaker. The orientation information may include a preset shooting angle for the server to adjust a corresponding camera angle according to the shooting angle to align the speaker. Further, the camera may be a single one or a plurality of cameras. When there are a plurality of cameras for capturing the video of the speaker seat, the orientation information of each camera is respectively set corresponding to the same agent, and the server is configured according to each camera. The corresponding orientation information controls the angle adjustment of each camera.
进一步的,所述服务器还可以在接收到会控终端的发言指令时,确定对应的发言席信息,并控制打开所述发言席对应的麦克风设备以采集发言人音频数据,在采集到发言人的音频数据后,通过服务器内的媒体服务器进行混音处理后发送至音响设备输出。Further, the server may further determine the corresponding speaker information when receiving the speaking instruction of the conference control terminal, and control to open the microphone device corresponding to the speaker to collect the speaker audio data, and collect the speaker's data. After the audio data is transmitted, it is mixed by the media server in the server and sent to the audio device for output.
所述发送模块30,用于将所述发言席视频发送至显示屏进行显示。The sending module 30 is configured to send the video of the speaker to a display screen for display.
所述服务器可以通过RTP协议将摄像头拍摄的发言席视频发送至显示屏进行显示。进一步的,所述会控终端还可以将主持人用户所选择的是否显示发言席视频的控制命令添加至所述发言指令中,所述服务器根据所述发言指令判断是否将对应的发言席视频发送至显示屏进行显示,若是,则所述服务器将所述发言席视频发送至显示屏进行显示;若否,则删除所述发言席视频。The server can send the video of the speaker shot by the camera to the display screen for display by using the RTP protocol. Further, the conference control terminal may further add, to the speaking instruction, a control command for displaying a video of the speaker selected by the host user, and the server determines, according to the speaking instruction, whether to send the corresponding video of the speaker. Displaying to the display screen, if yes, the server sends the video of the speaker to the display screen for display; if not, deleting the video of the speaker.
所述服务器可以通过VGA/HDMI/DVI/SDI接口与所述显示屏进行连接。The server can be connected to the display through a VGA/HDMI/DVI/SDI interface.
本实施例通过服务器接收用户基于会控终端发送的发言指令,并根据该发言指令控制摄像头对准对应的方位进行发言人视频的拍摄,实现了多媒体会议系统中摄像头自动对准发言人,发言人视频自动显示到显示屏,使得主席台用户能够通过会控终端触发发言指令指示谁发言,对应的发言人视频就显示在会场的显示屏上,极大的提高了会议效果,提升了用户体验。In this embodiment, the server receives the speaking instruction sent by the user based on the conference control terminal, and controls the camera to align the corresponding orientation according to the speaking instruction to perform the shooting of the speaker video, thereby realizing the automatic alignment of the camera in the multimedia conference system. The video is automatically displayed on the display screen, so that the chairman station can trigger the speaking command to indicate who is speaking through the conference terminal. The corresponding speaker video is displayed on the display screen of the conference site, which greatly improves the conference effect and improves the user experience.
参照图8,图8为本发明装置的第二实施例的功能模块示意图。基于上述多媒体会议服务器的第一实施例,所述多媒体会议服务器还包括多媒体模块40。Referring to Figure 8, Figure 8 is a schematic diagram of functional modules of a second embodiment of the apparatus of the present invention. Based on the first embodiment of the multimedia conference server, the multimedia conference server further includes a multimedia module 40.
所述接收模块10,还用于通过网络连接接收各个分会场的视频数据;The receiving module 10 is further configured to receive video data of each of the sub-sites through a network connection;
所述服务器可以通过RTP协议接收各个分会场的视频数据。所述服务器可以通过网络连接远程的分会场服务器或者分会场的SIP会议终端,以接收各个分会场的视频数据。The server can receive video data of each sub-site through the RTP protocol. The server may connect to a remote conference site server or a SIP conference terminal of the conference site through the network to receive video data of each of the conference sites.
所述多媒体模块40,用于将各个分会场的视频数据进行拼图处理,得到拼图视频;The multimedia module 40 is configured to perform puzzle processing on video data of each of the sub-sites to obtain a puzzle video;
所述服务器可以通过所述服务器内的多媒体设备实现对各个分会场的视频数据的拼图处理,以得到含有各个分会场视频的拼图视频。所述服务器可以按照各种方式进行拼图处理,例如:1+1(1个主会场视频+1个分会场视频),4分屏,6分屏,1+4(1个主会场视频+4个分会场视频),1+5(1个主会场视频+5个分会场视频),9分屏等等。The server can implement the puzzle processing of the video data of each of the sub-sites through the multimedia device in the server to obtain a puzzle video containing the video of each of the sub-sites. The server can perform puzzle processing in various ways, for example, 1+1 (1 main conference video + 1 sub-site video), 4 sub-screen, 6-screen, 1+4 (1 main conference video + 4) Video of the sub-site), 1+5 (1 main venue video + 5 sub-site videos), 9-screen and so on.
所述发送模块30,还用于将所述拼图视频发送至显示屏进行显示。The sending module 30 is further configured to send the puzzle video to a display screen for display.
所述服务器将所述拼图视频发送至显示屏进行显示。进一步的,通过所述服务器的资源接入设备所接入的显示屏可以是一个单独的显示屏或者也可以是多个显示屏,例如:当接入的显示屏为多个时,可以将第一显示屏用于显示所有会场的拼图视频,将第二显示屏用于显示发言人视频,将第三显示屏用于显示PPT等文档。The server sends the puzzle video to a display for display. Further, the display screen accessed by the resource access device of the server may be a single display screen or multiple display screens. For example, when multiple screens are accessed, the One display is used to display the puzzle video of all the venues, the second display is used to display the speaker video, and the third display is used to display documents such as PPT.
本实施例通过服务器接收各个分会场的视频数据,并根据所述视频数据进行拼图处理得到拼图视频进行显示,实现了各个分会场视频的显示,提高了会议效果,提升了用户体验。In this embodiment, the video data of each sub-site is received by the server, and the jigsaw video is displayed according to the video data, and the video of each sub-site is displayed, which improves the conference effect and improves the user experience.
参照图9,图9为本发明装置的第三实施例的功能模块示意图。基于上述多媒体会议服务器的第二实施例,所述接收模块10包括检测单元11、确定单元12和切换单元13;Referring to FIG. 9, FIG. 9 is a schematic diagram of functional modules of a third embodiment of the apparatus of the present invention. The receiving module 10 includes a detecting unit 11, a determining unit 12, and a switching unit 13 based on the second embodiment of the multimedia conference server.
所述检测单元11,用于在通过网络连接接收分会场的视频数据时,实时检测所述网络连接的网络带宽;The detecting unit 11 is configured to detect a network bandwidth of the network connection in real time when receiving video data of a sub-site through a network connection;
所述确定单元12,用于在检测到所述网络带宽发生变化时,确定变化后的网络带宽对应的视频码率及视频分辨率;The determining unit 12 is configured to determine a video bit rate and a video resolution corresponding to the changed network bandwidth when detecting that the network bandwidth changes;
所述切换单元13,用于切换至所确定的视频码率及视频分辨率继续接收视频数据。The switching unit 13 is configured to switch to the determined video bit rate and video resolution to continue receiving video data.
所述服务器在通过网络连接接收分会场的视频数据过程中,实时检测所述网络连接的网络带宽;所述服务器在检测到所述网络带宽发生变化时,确定变化后的网络带宽对应的视频码率及视频分辨率;所述服务器切换至所确定的视频码率及视频分辨率继续接收视频数据。例如:所述服务器按照2000kbps码率接收分会场的视频数据,检测测到网络带宽发生变化,变化后的网络带宽符合800kbps码率,则所述服务器切换至800kbps码率从当前位置继续接收所述视频数据。The server detects the network bandwidth of the network connection in real time during the process of receiving the video data of the sub-site through the network connection; when detecting the change of the network bandwidth, the server determines the video code corresponding to the changed network bandwidth. Rate and video resolution; the server switches to the determined video bit rate and video resolution to continue receiving video data. For example, the server receives the video data of the conference site according to the code rate of 2000 kbps, detects that the network bandwidth changes, and the changed network bandwidth conforms to the 800 kbps code rate, and the server switches to the 800 kbps code rate to continue receiving the current location. Video data.
本实施例根据网络带宽调整视频的分辨率及码率,避免了开会过程中由于网络恶化造成视频卡顿、花屏等问题,在网络恶化时,能够自动调整视频分辨率及码率以适应网络带宽,实现了在当前网络带宽条件下达到最好的视频效果,提高了用户体验。In this embodiment, the resolution and the code rate of the video are adjusted according to the network bandwidth, which avoids problems such as video jamming and flower screen caused by network deterioration during the conference, and can automatically adjust the video resolution and code rate to adapt to the network bandwidth when the network is deteriorated. , to achieve the best video effects under the current network bandwidth conditions, improve the user experience.
参照图10,图10为本发明装置的第四实施例的功能模块示意图。基于上述多媒体会议服务器的第一实施例,所述多媒体会议服务器还包括显示模块50;Referring to FIG. 10, FIG. 10 is a schematic diagram of functional modules of a fourth embodiment of the apparatus of the present invention. The multimedia conference server further includes a display module 50, based on the first embodiment of the multimedia conference server;
所述显示模块50,用于通过所述会控终端显示预设的坐席列表,以供用户基于所述坐席列表确定发言席并触发对应的发言指令;The display module 50 is configured to display a preset agent list by using the conference control terminal, so that the user determines a speaker based on the agent list and triggers a corresponding speaking instruction;
所述接收模块10,还用于接收所述会控终端发送的发言指令。The receiving module 10 is further configured to receive a speaking instruction sent by the conference control terminal.
所述服务器通过所述会控终端显示预设的坐席列表,以供用户基于所述坐席列表确定发言席并触发对应的发言指令,所述服务器接收所述会控终端发送的发言至,以根据所述发言指令进行对应的操作。The server displays a preset agent list through the conference control terminal, so that the user determines the speaker based on the agent list and triggers a corresponding speaking instruction, and the server receives the statement sent by the conference control terminal to The speaking instruction performs a corresponding operation.
所述坐席列表可以保存在所述服务器内,在所述服务器接收到用户基于会控终端触发的显示指令时,将所述坐席列表发送至会控终端进行显示。所述会控终端可以在侦测到主持人用户基于所述坐席列表的点击操作时,触发对应的发言指令以指示处于该坐席的与会人员进行发言。具体的,参照图6,图6为本发明中通过会控终端所显示的坐席列表的一实施例的效果示意图。The agent list may be stored in the server, and when the server receives a display instruction triggered by the conference terminal, the server sends the agent list to the conference terminal for display. The conference control terminal may trigger a corresponding speaking instruction to indicate that the participant in the agent speaks when detecting the click operation of the host user based on the agent list. Specifically, referring to FIG. 6, FIG. 6 is a schematic diagram of an effect of an embodiment of a seat list displayed by a conference control terminal according to the present invention.
进一步的,所述多媒体会议服务器还包括存储模块;所述接收模块,还用于在接收到所述会控终端发送的设置指令时,接收用户基于所述会控终端输入的坐席列表及各个坐席对应的方位信息;所述存储模块,用于保存所接收到的坐席列表及各个坐席对应的方位信息。Further, the multimedia conference server further includes a storage module, and the receiving module is further configured to: when receiving the setting instruction sent by the conference control terminal, receive a seat list and each agent input by the user based on the conference control terminal Corresponding orientation information; the storage module is configured to save the received agent list and the orientation information corresponding to each agent.
本实施例会控终端触发对应的发言指令,并通过服务器接收会控终端发送的发言指令,根据该发言指令控制摄像头对准对应的方位进行发言人视频的拍摄,实现了多媒体会议系统中摄像头自动对准发言人,发言人视频自动显示到显示屏,使得主席台用户能够通过会控终端触发发言指令指示谁发言,对应的发言人视频就显示在会场的显示屏上,极大的提高了会议效果,提升了用户体验。In this embodiment, the conference control terminal triggers the corresponding speaking instruction, and receives the speaking instruction sent by the conference control terminal through the server, and controls the camera to align the corresponding orientation according to the speaking instruction to perform the shooting of the speaker video, thereby realizing the automatic camera pair in the multimedia conference system. The prospective spokesperson and the spokesperson video are automatically displayed on the display screen, so that the podium user can trigger the speaking command to indicate who is speaking through the conference control terminal, and the corresponding speaker video is displayed on the display screen of the conference site, which greatly improves the conference effect. , improved user experience.
需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括该要素的过程、方法、物品或者装置中还存在另外的相同要素。It is to be understood that the term "comprises", "comprising", or any other variants thereof, is intended to encompass a non-exclusive inclusion, such that a process, method, article, or device comprising a series of elements includes those elements. It also includes other elements that are not explicitly listed, or elements that are inherent to such a process, method, article, or device. An element that is defined by the phrase "comprising a ..." does not exclude the presence of additional equivalent elements in the process, method, item, or device that comprises the element.
上述本发明实施例序号仅仅为了描述,不代表实施例的优劣。The serial numbers of the embodiments of the present invention are merely for the description, and do not represent the advantages and disadvantages of the embodiments.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,空调器,或者网络设备等)执行本发明各个实施例所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the foregoing embodiment method can be implemented by means of software plus a necessary general hardware platform, and of course, can also be through hardware, but in many cases, the former is better. Implementation. Based on such understanding, the technical solution of the present invention, which is essential or contributes to the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, disk, The optical disc includes a number of instructions for causing a terminal device (which may be a cell phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in various embodiments of the present invention.
以上仅为本发明的优选实施例,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围内。The above are only the preferred embodiments of the present invention, and are not intended to limit the scope of the invention, and the equivalent structure or equivalent process transformations made by the description of the present invention and the drawings are directly or indirectly applied to other related technical fields. The same is included in the scope of patent protection of the present invention.
Claims (18)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201610255434.5A CN105812717A (en) | 2016-04-21 | 2016-04-21 | Multimedia conference control method and server |
| CN201610255434.5 | 2016-04-21 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2017181508A1 true WO2017181508A1 (en) | 2017-10-26 |
Family
ID=56458395
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2016/085049 Ceased WO2017181508A1 (en) | 2016-04-21 | 2016-06-07 | Multimedia meeting control method and server |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN105812717A (en) |
| WO (1) | WO2017181508A1 (en) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112616035A (en) * | 2020-11-23 | 2021-04-06 | 深圳市捷视飞通科技股份有限公司 | Multi-picture splicing method and device, computer equipment and storage medium |
Families Citing this family (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN106789914B (en) * | 2016-11-24 | 2020-04-14 | 邦彦技术股份有限公司 | Multimedia conference control method and system |
| WO2018098780A1 (en) * | 2016-12-01 | 2018-06-07 | 深圳前海达闼云端智能科技有限公司 | Interactive advertisement presentation method, terminal, and smart city interaction system |
| CN109246383B (en) | 2017-07-11 | 2022-03-29 | 中兴通讯股份有限公司 | Control method of multimedia conference terminal and multimedia conference server |
| US10356362B1 (en) * | 2018-01-16 | 2019-07-16 | Google Llc | Controlling focus of audio signals on speaker during videoconference |
| US10511808B2 (en) * | 2018-04-10 | 2019-12-17 | Facebook, Inc. | Automated cinematic decisions based on descriptive models |
| CN109698928B (en) * | 2018-11-15 | 2021-04-13 | 贵阳朗玛信息技术股份有限公司 | Method and device for adjusting video stream in video conference system |
| CN111212218A (en) * | 2018-11-22 | 2020-05-29 | 阿里巴巴集团控股有限公司 | Shooting control method and device and shooting system |
| CN109547735B (en) * | 2019-01-18 | 2024-04-16 | 海南科先电子科技有限公司 | Conference integration system |
| CN111245823A (en) * | 2020-01-09 | 2020-06-05 | 福建星网智慧科技股份有限公司 | Movable wireless private network audio and video communication system based on LTE protocol |
| CN114067668B (en) * | 2020-08-04 | 2024-12-20 | 广州艾美网络科技有限公司 | Adjustable multimedia system and control method thereof |
| CN116366961A (en) * | 2021-12-24 | 2023-06-30 | 广西三诺数字科技有限公司 | Video conference method and device and computer equipment |
| CN114449205B (en) * | 2022-04-08 | 2022-07-29 | 浙江华创视讯科技有限公司 | Data processing method, terminal device, electronic device and storage medium |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR20030013017A (en) * | 2001-08-06 | 2003-02-14 | 주식회사 호스트이엔아이 | Speaker recognition method in presentation system |
| CN102469295A (en) * | 2010-10-29 | 2012-05-23 | 华为终端有限公司 | Conference control method, related equipment and system |
| CN102625077A (en) * | 2011-01-27 | 2012-08-01 | 深圳市合智创盈电子有限公司 | Conference recording method, conference photographing device, client and system |
| CN103327250A (en) * | 2013-06-24 | 2013-09-25 | 深圳锐取信息技术股份有限公司 | Method for controlling camera lens based on pattern recognition |
| CN103986914A (en) * | 2014-05-27 | 2014-08-13 | 东南大学 | Adaptive code rate method based on the number of clients in wireless video surveillance system |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| NO333026B1 (en) * | 2008-09-17 | 2013-02-18 | Cisco Systems Int Sarl | Control system for a local telepresence video conferencing system and method for establishing a video conferencing call. |
| CN101742222A (en) * | 2009-12-30 | 2010-06-16 | 华为终端有限公司 | Operation method for camera position and video conference terminal |
| CN101877706B (en) * | 2010-06-24 | 2013-04-17 | 北京邮电大学 | Multi-terminal multimedia conference control system and implementation method |
| CN104144315B (en) * | 2013-05-06 | 2017-12-29 | 华为技术有限公司 | The display methods and multi-spot video conference system of a kind of multipoint videoconference |
| US20150146078A1 (en) * | 2013-11-27 | 2015-05-28 | Cisco Technology, Inc. | Shift camera focus based on speaker position |
| CN204119373U (en) * | 2014-04-02 | 2015-01-21 | 中国舰船研究设计中心 | A kind of digital session face tracking system |
| CN105163134B (en) * | 2015-08-03 | 2018-09-07 | 腾讯科技(深圳)有限公司 | Video coding parameter setting method, device and the video encoder of live video |
-
2016
- 2016-04-21 CN CN201610255434.5A patent/CN105812717A/en active Pending
- 2016-06-07 WO PCT/CN2016/085049 patent/WO2017181508A1/en not_active Ceased
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR20030013017A (en) * | 2001-08-06 | 2003-02-14 | 주식회사 호스트이엔아이 | Speaker recognition method in presentation system |
| CN102469295A (en) * | 2010-10-29 | 2012-05-23 | 华为终端有限公司 | Conference control method, related equipment and system |
| CN102625077A (en) * | 2011-01-27 | 2012-08-01 | 深圳市合智创盈电子有限公司 | Conference recording method, conference photographing device, client and system |
| CN103327250A (en) * | 2013-06-24 | 2013-09-25 | 深圳锐取信息技术股份有限公司 | Method for controlling camera lens based on pattern recognition |
| CN103986914A (en) * | 2014-05-27 | 2014-08-13 | 东南大学 | Adaptive code rate method based on the number of clients in wireless video surveillance system |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112616035A (en) * | 2020-11-23 | 2021-04-06 | 深圳市捷视飞通科技股份有限公司 | Multi-picture splicing method and device, computer equipment and storage medium |
| CN112616035B (en) * | 2020-11-23 | 2023-09-19 | 深圳市捷视飞通科技股份有限公司 | Multi-picture splicing method, device, computer equipment and storage medium |
Also Published As
| Publication number | Publication date |
|---|---|
| CN105812717A (en) | 2016-07-27 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| WO2017181508A1 (en) | Multimedia meeting control method and server | |
| WO2018094791A1 (en) | Multimedia conferencing control method and system | |
| WO2019019374A1 (en) | Method, apparatus, and system for controlling household appliance with intelligent voice device | |
| WO2017107388A1 (en) | Hdmi version switching method and display device | |
| WO2017135585A2 (en) | Main speaker, sub speaker and system including the same | |
| WO2018120457A1 (en) | Data processing method, apparatus, device, and computer readable storage medium | |
| WO2017201899A1 (en) | Method and apparatus for connecting to bluetooth device | |
| WO2019114269A1 (en) | Method for resuming playing program, television device, and computer readable storage medium | |
| WO2020010671A1 (en) | Display method and device, television set, and storage medium | |
| WO2018000856A1 (en) | Method implementing sdn overlay network message forwarding, terminal, apparatus, and computer readable storage medium | |
| WO2019024336A1 (en) | Data query method and device, and computer readable storage medium | |
| WO2017096671A1 (en) | Network conferencing method and device | |
| WO2017113614A1 (en) | Method and device for intercut playing of advertisement during video playing | |
| WO2018233221A1 (en) | Multi-window sound output method, television, and computer-readable storage medium | |
| WO2019031735A1 (en) | Image processing apparatus, image processing method, and image display system | |
| WO2017063369A1 (en) | Method of establishing wireless direct connection and device utilizing same | |
| WO2017045441A1 (en) | Smart television-based audio playback method and apparatus | |
| WO2019071762A1 (en) | Floor positioning method and system, server and computer-readable storage medium | |
| WO2017181504A1 (en) | Method and television set for intelligently adjusting subtitle size | |
| WO2017185480A1 (en) | Multi-screen interaction connection method, device and system | |
| WO2017113596A1 (en) | Method and system for listen-only control, mobile terminal, and smart television | |
| WO2018205514A1 (en) | Set-top box wireless-compatibility automatic testing method, system, and readable storage medium | |
| WO2017152527A1 (en) | Method for controlling slave device application of smart television, and smart television | |
| WO2017148028A1 (en) | Remote network connection method and system based on smart television | |
| WO2017084298A1 (en) | Warning method and system for television set |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 16899095 Country of ref document: EP Kind code of ref document: A1 |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 16899095 Country of ref document: EP Kind code of ref document: A1 |