WO2017181508A1 - Multimedia conference control method and server - Google Patents
Multimedia conference control method and server
- Publication number
- WO2017181508A1 (application PCT/CN2016/085049)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- server
- control terminal
- conference control
- speaking
- video
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/18—Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/18—Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
- H04N7/181—Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast for receiving images from a plurality of remote sources
Definitions
- The present invention relates to the field of multimedia conferences, and in particular to a multimedia conference control method and a server.
- Multimedia conference rooms are rapidly becoming popular because of their functional diversity (for example, on-site conferences, academic reports, training, and teaching).
- A multimedia conference system is the integration of the sound, lighting, and electrical equipment and the software associated with a conference.
- In a multimedia conference room, whether the meeting is for reporting, summarizing, or introducing products, the interactive use of pictures, text, sound, video, and drawings on a computer fully engages the participants' senses and greatly improves the effectiveness of the meeting.
- Multimedia is increasingly showing its advantages in the office field.
- However, the cameras at a venue are mostly fixed and cannot track the speaker, which greatly degrades the user experience.
- The inability of the camera to track and shoot video of the speaker is therefore a problem that needs to be solved.
- The main object of the present invention is to solve the problem that the camera in a multimedia conference system cannot track and shoot video of the speaker.
- The present invention provides a multimedia conference control method, which includes the following steps:
- the server determines, according to the speaking instruction, the corresponding speaking seat and the orientation information corresponding to that speaking seat;
- the server adjusts a camera according to the determined orientation information to shoot the speaker video;
- the server sends the speaker video to a display screen for display.
- before the step in which the server determines, according to the speaking instruction, the corresponding speaking seat and its orientation information, the method further includes:
- the server displays a preset agent (seat) list through the conference control terminal, so that the user selects the speaker from the agent list and triggers a corresponding speaking instruction;
- the server receives the speaking instruction sent by the conference control terminal.
- before the step in which the server displays the preset agent list through the conference control terminal, the method further includes:
- the server saves the received agent list and the orientation information corresponding to each agent.
- the method further includes:
- the server receives video data of each sub-site through a network connection;
- the server performs jigsaw (split-screen composition) processing on the video data of the sub-sites to obtain a puzzle video;
- the server sends the puzzle video to a display screen for display.
- before the step in which the server determines, according to the speaking instruction, the corresponding speaking seat and its orientation information, the method further includes:
- the server displays a preset agent (seat) list through the conference control terminal, so that the user selects the speaker from the agent list and triggers a corresponding speaking instruction;
- the server receives the speaking instruction sent by the conference control terminal.
- before the step in which the server displays the preset agent list through the conference control terminal, the method further includes:
- the server saves the received agent list and the orientation information corresponding to each agent.
- the step of the server receiving the video data of each of the sub-sites through the network connection includes:
- the server detects the network bandwidth of the network connection in real time while receiving the video data of a sub-site through the network connection;
- the server determines a video bit rate and a video resolution corresponding to the changed network bandwidth when detecting that the network bandwidth changes;
- the server switches to the determined video bit rate and video resolution to continue receiving video data.
- before the step in which the server determines, according to the speaking instruction, the corresponding speaking seat and its orientation information, the method further includes:
- the server displays a preset agent (seat) list through the conference control terminal, so that the user selects the speaker from the agent list and triggers a corresponding speaking instruction;
- the server receives the speaking instruction sent by the conference control terminal.
- before the step in which the server displays the preset agent list through the conference control terminal, the method further includes:
- the server saves the received agent list and the orientation information corresponding to each agent.
- the present invention further provides a multimedia conference server, where the multimedia conference server includes:
- a receiving module configured to, when receiving the speaking instruction sent by the conference control terminal, determine, according to the speaking instruction, the corresponding speaking seat and the orientation information corresponding to that speaking seat;
- a control module configured to adjust a camera according to the determined orientation information to shoot the speaker video;
- a sending module configured to send the speaker video to a display screen for display.
- the multimedia conference server further includes a display module
- the display module is configured to display a preset agent list by using the conference control terminal, so that the user determines a speaker based on the agent list and triggers a corresponding speaking instruction;
- the receiving module is further configured to receive a speaking instruction sent by the conference control terminal.
- the multimedia conference server further includes a storage module
- the receiving module is further configured to: when receiving the setting instruction sent by the conference control terminal, receive the agent list input by the user based on the conference control terminal and the orientation information corresponding to each agent;
- the storage module is configured to save the received agent list and the orientation information corresponding to each agent.
- the multimedia conference server further includes a multimedia module
- the receiving module is further configured to receive video data of each of the sub-sites through a network connection
- the multimedia module is configured to perform jigsaw processing on video data of each of the sub-sites to obtain a puzzle video
- the sending module is further configured to send the puzzle video to a display screen for display.
- the multimedia conference server further includes a display module
- the display module is configured to display a preset agent list by using the conference control terminal, so that the user determines a speaker based on the agent list and triggers a corresponding speaking instruction;
- the receiving module is further configured to receive a speaking instruction sent by the conference control terminal.
- the multimedia conference server further includes a storage module
- the receiving module is further configured to: when receiving the setting instruction sent by the conference control terminal, receive the agent list input by the user based on the conference control terminal and the orientation information corresponding to each agent;
- the storage module is configured to save the received agent list and the orientation information corresponding to each agent.
- the receiving module includes a detecting unit, a determining unit, and a switching unit;
- the detecting unit is configured to detect a network bandwidth of the network connection in real time when receiving video data of a sub-site through a network connection;
- the determining unit is configured to determine a video bit rate and a video resolution corresponding to the changed network bandwidth when detecting that the network bandwidth changes;
- the switching unit is configured to switch to the determined video bit rate and video resolution to continue receiving video data.
- the multimedia conference server further includes a display module
- the display module is configured to display a preset agent list by using the conference control terminal, so that the user determines a speaker based on the agent list and triggers a corresponding speaking instruction;
- the receiving module is further configured to receive a speaking instruction sent by the conference control terminal.
- the multimedia conference server further includes a storage module
- the receiving module is further configured to: when receiving the setting instruction sent by the conference control terminal, receive the agent list input by the user based on the conference control terminal and the orientation information corresponding to each agent;
- the storage module is configured to save the received agent list and the orientation information corresponding to each agent.
- In the invention, the server receives the speaking instruction that the user sends from the conference control terminal and, according to that instruction, controls the camera to aim at the corresponding orientation. The camera in the multimedia conference system is thereby positioned automatically and the speaker video is displayed on the display screen automatically, so that the user at the chairman's station can trigger a speaking instruction through the conference control terminal to indicate who is speaking.
- The corresponding speaker video is displayed on the display screen of the conference site, which greatly improves the conference effect and the user experience.
- FIG. 1 is a hardware architecture diagram of a multimedia conference system implementing various embodiments of the present invention
- FIG. 2 is a schematic flowchart of a first embodiment of a multimedia conference control method according to the present invention
- FIG. 3 is a schematic flowchart of a second embodiment of a multimedia conference control method according to the present invention.
- FIG. 4 is a schematic flowchart diagram of a third embodiment of a multimedia conference control method according to the present invention.
- FIG. 5 is a schematic flowchart diagram of a fourth embodiment of a multimedia conference control method according to the present invention.
- FIG. 6 is a schematic diagram of an effect of an embodiment of an agent list displayed by a conference control terminal according to the present invention.
- FIG. 7 is a schematic diagram of functional modules of a first embodiment of a multimedia conference server according to the present invention.
- FIG. 8 is a schematic diagram of functional modules of a second embodiment of a multimedia conference server according to the present invention.
- FIG. 9 is a schematic diagram of functional modules of a third embodiment of a multimedia conference server according to the present invention.
- FIG. 10 is a schematic diagram of functional modules of a fourth embodiment of a multimedia conference server according to the present invention.
- FIG. 1 is a hardware architecture diagram of a multimedia conference system implementing various embodiments of the present invention.
- The multimedia conference system may include a server 100, a conference control terminal 200, and external devices such as a camera 301, a microphone 302, a display screen 303, and an audio device 304.
- The conference control terminal 200 is configured to generate a corresponding instruction according to a command input by the host user and send it to the server 100 to control various operations of the conference service.
- The conference control terminal 200 can be a terminal such as a mobile phone, a smartphone, a notebook computer, a PAD (tablet computer), or a desktop computer.
- The camera 301 and the microphone 302 are used to collect audio and video data.
- The display screen 303 and the audio device 304 are configured to output the audio and video processed by the multimedia device 102.
- The server 100 may include a multimedia device 102, a softswitch device 103, a resource access device 104, a controller 101, and the like.
- FIG. 1 illustrates a server 100 having various devices, but it should be understood that not all of the devices shown are required; more or fewer devices may be implemented instead.
- The control signaling between the devices in the server 100 can be implemented through the SIP protocol, and the multimedia data is carried over RTP (Real-time Transport Protocol).
- The softswitch device 103 is configured to control registration, call routing, and the like for the conference control terminal 200 and the various resources of the conference room (such as camera resources, display resources, and microphone resources).
- The controller 101 is used for control and management of conference services.
- The multimedia device 102 is used for processing audio and video, such as audio mixing and video puzzles.
- The resource access device 104 is configured to access the display screen 303, the camera 301, the microphone 302, the audio device 304, and the like in the conference room.
- the present invention provides a multimedia conference control method.
- FIG. 2 is a schematic flowchart diagram of a first embodiment of a multimedia conference control method according to the present invention.
- the multimedia conference control method includes:
- Step S10: when receiving the speaking instruction sent by the conference control terminal, the server determines, according to the speaking instruction, the corresponding speaking seat and the orientation information corresponding to that speaking seat;
- The host user may use the conference control terminal to trigger a speaking instruction that instructs the corresponding speaker to speak, and the conference control terminal sends the speaking instruction to the server. When the server receives the speaking instruction, it determines the corresponding speaker and the orientation information corresponding to the speaker, so as to control the camera to aim at that orientation and shoot the speaker video.
- The conference control terminal may add the agent (seat) information corresponding to the speaker to the speaking instruction as the speaker information. When receiving the speaking instruction, the server determines the speaker information from it and uses that information to look up the speaker's orientation information saved locally on the server, so as to adjust the camera to shoot the video according to the orientation information.
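The lookup in step S10 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the `Orientation` fields, the seat identifiers, the `speaker` key, and the function name are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Orientation:
    """Preset camera orientation for one seat (field names are assumptions)."""
    pan_deg: float
    tilt_deg: float


# Hypothetical table saved locally on the server: seat id -> orientation.
SEAT_ORIENTATIONS = {
    "seat-01": Orientation(pan_deg=-30.0, tilt_deg=-5.0),
    "seat-02": Orientation(pan_deg=0.0, tilt_deg=-5.0),
    "seat-03": Orientation(pan_deg=30.0, tilt_deg=-5.0),
}


def resolve_speaking_instruction(instruction: dict) -> tuple:
    """Return (seat id, orientation) for the speaker named in the instruction."""
    seat_id = instruction["speaker"]  # speaker information added by the terminal
    if seat_id not in SEAT_ORIENTATIONS:
        raise LookupError(f"no orientation saved for {seat_id!r}")
    return seat_id, SEAT_ORIENTATIONS[seat_id]


seat, orientation = resolve_speaking_instruction({"speaker": "seat-02"})
print(seat, orientation.pan_deg, orientation.tilt_deg)
```

Raising an error for an unknown seat is a design choice of this sketch; the source does not say how the server handles a speaker with no saved orientation.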
- the server can communicate with the conference control terminal through a SIP protocol.
- the speaking instruction may be transmitted between the server and the conference control terminal in a format of an INFO message.
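As an illustration of carrying a speaking instruction in a SIP INFO message, the sketch below assembles the request text. The source only states that an INFO message is used; the addresses, tags, Call-ID, the JSON body, and its field names are assumptions, and a real deployment would rely on a SIP stack rather than hand-built strings.

```python
import json


def build_info_request(seat_id: str, show_video: bool = True) -> str:
    """Build the text of a SIP INFO request carrying a speaking instruction."""
    # Hypothetical body format: the patent does not define the payload.
    body = json.dumps({"type": "speak", "speaker": seat_id, "show_video": show_video})
    headers = [
        "INFO sip:server@conference.example SIP/2.0",
        "Via: SIP/2.0/UDP terminal.conference.example;branch=z9hG4bK-demo-1",
        "Max-Forwards: 70",
        "From: <sip:terminal@conference.example>;tag=demo-from",
        "To: <sip:server@conference.example>;tag=demo-to",
        "Call-ID: demo-call-1@terminal.conference.example",
        "CSeq: 2 INFO",
        "Content-Type: application/json",
        f"Content-Length: {len(body.encode('utf-8'))}",
    ]
    # SIP uses CRLF line endings and a blank line between headers and body.
    return "\r\n".join(headers) + "\r\n\r\n" + body


print(build_info_request("seat-02"))
```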
- Step S20: the server adjusts a camera according to the determined orientation information to shoot the speaker video;
- According to the determined orientation information, the server adjusts the corresponding camera so that it is aimed at the speaker and shoots the speaker video.
- The orientation information may include a preset shooting angle, according to which the server adjusts the angle of the corresponding camera to aim at the speaker.
- There may be a single camera or a plurality of cameras. When a plurality of cameras shoot video of the speaking seat, orientation information is set for each camera for the same agent (seat), and the server controls the angle adjustment of each camera according to that camera's orientation information.
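The multi-camera case can be sketched as a per-seat table that holds one preset per camera. The camera identifiers, the angle values, and the command tuple shape are assumptions made for this example.

```python
# Hypothetical per-seat presets: seat id -> {camera id -> (pan, tilt) in degrees}.
CAMERA_PRESETS = {
    "seat-01": {"cam-front": (-30.0, -5.0), "cam-side": (45.0, -10.0)},
    "seat-02": {"cam-front": (0.0, -5.0), "cam-side": (60.0, -10.0)},
}


def camera_commands(seat_id: str) -> list:
    """Return one (camera id, pan, tilt) command per camera covering the seat."""
    presets = CAMERA_PRESETS[seat_id]
    return [(camera_id, pan, tilt) for camera_id, (pan, tilt) in sorted(presets.items())]


for command in camera_commands("seat-01"):
    print(command)
```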
- When receiving the speaking instruction from the conference control terminal, the server may further determine the corresponding speaker information and switch on the microphone device corresponding to the speaker to collect the speaker's audio data. The collected audio data is transmitted to the server, mixed by the media server in the server, and sent to the audio device for output.
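The source does not specify how the audio is mixed. As one common approach, offered here only as an assumption, 16-bit PCM streams can be mixed by summing samples and clamping the result to the valid range:

```python
def mix_pcm16(*streams):
    """Mix equal-rate 16-bit PCM sample lists by summing and clamping."""
    length = max((len(s) for s in streams), default=0)
    mixed = []
    for i in range(length):
        # Streams that have already ended contribute nothing at this index.
        total = sum(s[i] for s in streams if i < len(s))
        mixed.append(max(-32768, min(32767, total)))  # clamp to the int16 range
    return mixed


print(mix_pcm16([1000, 30000], [2000, 10000, -5]))
```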
- Step S30: the server sends the speaker video to a display screen for display.
- The server can use the RTP protocol to send the speaker video shot by the camera to the display screen for display.
- The conference control terminal may further add to the speaking instruction a control command, selected by the host user, indicating whether to display the speaker video. The server determines from the speaking instruction whether to send the corresponding speaker video to the display screen: if so, the server sends the speaker video to the display screen for display; if not, it deletes the speaker video.
- the server can be connected to the display through a VGA/HDMI/DVI/SDI interface.
- The server receives the speaking instruction that the user sends from the conference control terminal and, according to that instruction, controls the camera to aim at the corresponding orientation and shoot the speaker video, so that the camera in the multimedia conference system is aimed automatically.
- The speaker video is displayed on the display screen automatically, so that the user at the chairman's station can trigger a speaking instruction through the conference control terminal to indicate who is speaking.
- The corresponding speaker video is displayed on the display screen of the conference site, which greatly improves the conference effect and the user experience.
- FIG. 3 is a schematic flowchart diagram of a second embodiment of a multimedia conference control method according to the present invention. Based on the first embodiment of the foregoing multimedia conference control method, after the step S30, the method further includes:
- Step S40: the server receives video data of each sub-site through a network connection;
- The server can receive the video data of each sub-site through the RTP protocol.
- The server may connect through the network to a remote sub-site server or to a SIP conference terminal of the sub-site to receive the video data of each sub-site.
- Step S50: the server performs puzzle processing on the video data of the sub-sites to obtain a puzzle video;
- The server can perform the puzzle processing through its multimedia device to obtain a puzzle video containing the video of each sub-site.
- The server can perform puzzle processing in various layouts, for example 1+1 (1 main-venue video + 1 sub-site video), 4-screen, 6-screen, 1+4 (1 main-venue video + 4 sub-site videos), 1+5 (1 main-venue video + 5 sub-site videos), 9-screen, and so on.
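The layouts listed above amount to dividing the output frame into tiles. The sketch below computes tile rectangles for the equal-grid layouts (4-, 6-, and 9-screen) and for one possible 1+5 arrangement. The source names the layouts but not their geometry, so the 3x3 cell arrangement used for 1+5 and both function names are assumptions.

```python
import math


def grid_layout(n_tiles, width, height):
    """Equal-size grid of (x, y, w, h) tiles, e.g. 4 -> 2x2, 6 -> 3x2, 9 -> 3x3."""
    cols = math.ceil(math.sqrt(n_tiles))
    rows = math.ceil(n_tiles / cols)
    tile_w, tile_h = width // cols, height // rows
    return [((i % cols) * tile_w, (i // cols) * tile_h, tile_w, tile_h)
            for i in range(n_tiles)]


def one_plus_five_layout(width, height):
    """1 large main-venue tile (2x2 cells) plus 5 small tiles on a 3x3 cell grid."""
    cell_w, cell_h = width // 3, height // 3
    main = (0, 0, 2 * cell_w, 2 * cell_h)
    # Small tiles fill the right-hand column and the bottom row (col, row).
    small_cells = [(2, 0), (2, 1), (0, 2), (1, 2), (2, 2)]
    return [main] + [(c * cell_w, r * cell_h, cell_w, cell_h) for c, r in small_cells]


print(grid_layout(4, 1920, 1080))
print(one_plus_five_layout(1920, 1080))
```

Each rectangle says where a scaled copy of one site's video would be placed in the composed frame; the scaling and compositing themselves are outside this sketch.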
- Step S60: the server sends the puzzle video to a display screen for display.
- The server sends the puzzle video to a display screen for display.
- The resource access device of the server may access a single display screen or multiple display screens. For example, when multiple screens are accessed, the first display is used to display the puzzle video of all the venues, the second display is used to display the speaker video, and the third display is used to display documents such as PPT.
- The server receives the video data of each sub-site and composes and displays the puzzle video from it, so that the video of every sub-site is shown, which improves the conference effect and the user experience.
- FIG. 4 is a schematic flowchart diagram of a third embodiment of a multimedia conference control method according to the present invention. Based on the second embodiment of the foregoing multimedia conference control method, the step S40 includes:
- Step S41: the server detects the network bandwidth of the network connection in real time while receiving the video data of a sub-site through the network connection;
- Step S42: when detecting that the network bandwidth has changed, the server determines a video bit rate and a video resolution corresponding to the changed network bandwidth;
- Step S43: the server switches to the determined video bit rate and video resolution to continue receiving video data.
- While receiving the video data of a sub-site through the network connection, the server detects the network bandwidth of the connection in real time. When it detects that the bandwidth has changed, it determines the video bit rate and video resolution corresponding to the changed bandwidth and switches to them to continue receiving video data. For example, the server is receiving the video data of a sub-site at a bit rate of 2000 kbps and detects that the network bandwidth has changed; if the changed bandwidth suits a bit rate of 800 kbps, the server switches to 800 kbps to continue receiving the video data of that sub-site.
- The video resolution and bit rate are adjusted according to the network bandwidth, which avoids problems such as stuttering and corrupted pictures caused by network deterioration during the conference. When the network deteriorates, the resolution and bit rate are adjusted automatically to fit the available bandwidth, achieving the best video quality possible under the current bandwidth and improving the user experience.
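Steps S41 to S43 can be sketched as a lookup in a bit-rate ladder plus a switch when the selected profile changes. The 2000 kbps and 800 kbps steps come from the example in the text; the resolutions, the 300 kbps step, and all names are assumptions, and bandwidth measurement itself is not shown.

```python
# Bit-rate ladder, highest first: (bit rate in kbps, (width, height)).
# Only the 2000 and 800 kbps values appear in the source; the rest is assumed.
LADDER = [
    (2000, (1280, 720)),
    (800, (640, 360)),
    (300, (320, 180)),
]


def select_profile(bandwidth_kbps):
    """Pick the highest (bit rate, resolution) that fits the measured bandwidth."""
    for bitrate, resolution in LADDER:
        if bandwidth_kbps >= bitrate:
            return bitrate, resolution
    return LADDER[-1]  # below the lowest step: fall back to the smallest profile


class AdaptiveReceiver:
    """Tracks the current profile and switches when the bandwidth changes."""

    def __init__(self, bandwidth_kbps):
        self.profile = select_profile(bandwidth_kbps)

    def on_bandwidth_sample(self, bandwidth_kbps):
        """Return True if the sample caused a switch to a different profile."""
        new_profile = select_profile(bandwidth_kbps)
        switched = new_profile != self.profile
        self.profile = new_profile
        return switched


receiver = AdaptiveReceiver(2500)
receiver.on_bandwidth_sample(900)
print(receiver.profile)
```

A production system would usually add hysteresis so that small fluctuations around a threshold do not cause repeated switching; that refinement is not described in the source.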
- FIG. 5 is a schematic flowchart diagram of a fourth embodiment of a multimedia conference control method according to the present invention. Based on the first embodiment of the foregoing multimedia conference control method, before the step S10, the method further includes:
- Step S11: the server displays a preset agent list through the conference control terminal, so that the user selects the speaker from the agent list and triggers a corresponding speaking instruction;
- Step S12: the server receives the speaking instruction sent by the conference control terminal.
- The server displays a preset agent list through the conference control terminal so that the user selects the speaker from the agent list and triggers a corresponding speaking instruction; the server then receives the speaking instruction sent by the conference control terminal and performs the corresponding operation according to it.
- The agent list may be stored in the server; when the server receives a display instruction triggered by the conference control terminal, it sends the agent list to the conference control terminal for display.
- When detecting a click by the host user on an entry in the agent list, the conference control terminal may trigger the corresponding speaking instruction to indicate that the participant at that seat is to speak.
- FIG. 6 is a schematic diagram of an effect of an embodiment of a seat list displayed by a conference control terminal according to the present invention.
- When receiving a setting instruction sent by the conference control terminal, the server may further receive the seat list that the user enters on the conference control terminal together with the orientation information corresponding to each seat; the server saves the received seat list and the orientation information corresponding to each seat.
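Saving the seat list and per-seat orientation information can be sketched as a small store. The class name, the entry fields (`seat`, `pan`, `tilt`), and the JSON serialization are assumptions; the source only says that the server saves the list and the orientation information.

```python
import json


class SeatStore:
    """Keeps the seat list and each seat's orientation (in-memory sketch)."""

    def __init__(self):
        self._seats = {}

    def save(self, seats):
        """Store the entries received with a setting instruction."""
        for entry in seats:
            self._seats[entry["seat"]] = {"pan": entry["pan"], "tilt": entry["tilt"]}

    def seat_list(self):
        """Seat list to send to the conference control terminal for display."""
        return sorted(self._seats)

    def orientation(self, seat_id):
        """Orientation saved for one seat."""
        return self._seats[seat_id]

    def dump(self):
        """Serialize to JSON, for example before writing to disk."""
        return json.dumps(self._seats, sort_keys=True)


store = SeatStore()
store.save([{"seat": "B", "pan": 10, "tilt": -5}, {"seat": "A", "pan": -10, "tilt": -5}])
print(store.seat_list(), store.orientation("B"))
```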
- The conference control terminal triggers the corresponding speaking instruction; the server receives it and, according to it, controls the camera to aim at the corresponding orientation and shoot the speaker video, so that the camera in the multimedia conference system is aimed at the speaker automatically and the speaker video is displayed on the display screen automatically.
- The user at the podium can thus trigger a speaking instruction through the conference control terminal to indicate who is speaking, and the corresponding speaker video is displayed on the display screen of the conference site, which greatly improves the conference effect and the user experience.
- the execution bodies of the multimedia conference control methods of the foregoing first to fourth embodiments may each be a multimedia conference system or a server disposed in the multimedia conference system. Further, the multimedia conference control method may be implemented by a client control program installed in the multimedia conference system or the multimedia conference server.
- the invention further provides a multimedia conference server.
- FIG. 7 is a schematic diagram of functional modules of a first embodiment of a multimedia conference server according to the present invention.
- the multimedia conference server includes: a receiving module 10, a control module 20, and a sending module 30.
- The receiving module 10 is configured to, when receiving the speaking instruction sent by the conference control terminal, determine, according to the speaking instruction, the corresponding speaking seat and the orientation information corresponding to that speaking seat;
- The host user may use the conference control terminal to trigger a speaking instruction that instructs the corresponding speaker to speak, and the conference control terminal sends the speaking instruction to the server. When the server receives the speaking instruction, it determines the corresponding speaker and the orientation information corresponding to the speaker, so as to control the camera to aim at that orientation and shoot the speaker video.
- The conference control terminal may add the agent (seat) information corresponding to the speaker to the speaking instruction as the speaker information. When receiving the speaking instruction, the server determines the speaker information from it and uses that information to look up the speaker's orientation information saved locally on the server, so as to adjust the camera to shoot the video according to the orientation information.
- the server can communicate with the conference control terminal through a SIP protocol.
- the speaking instruction may be transmitted between the server and the conference control terminal in a format of an INFO message.
- The control module 20 is configured to adjust a camera according to the determined orientation information to shoot the speaker video;
- According to the determined orientation information, the server adjusts the corresponding camera so that it is aimed at the speaker and shoots the speaker video.
- The orientation information may include a preset shooting angle, according to which the server adjusts the angle of the corresponding camera to aim at the speaker.
- There may be a single camera or a plurality of cameras. When a plurality of cameras shoot video of the speaking seat, orientation information is set for each camera for the same agent (seat), and the server controls the angle adjustment of each camera according to that camera's orientation information.
- When receiving the speaking instruction from the conference control terminal, the server may further determine the corresponding speaker information and switch on the microphone device corresponding to the speaker to collect the speaker's audio data. The collected audio data is transmitted to the server, mixed by the media server in the server, and sent to the audio device for output.
- The sending module 30 is configured to send the speaker video to a display screen for display.
- The server can use the RTP protocol to send the speaker video shot by the camera to the display screen for display.
- The conference control terminal may further add to the speaking instruction a control command, selected by the host user, indicating whether to display the speaker video. The server determines from the speaking instruction whether to send the corresponding speaker video to the display screen: if so, the server sends the speaker video to the display screen for display; if not, it deletes the speaker video.
- the server can be connected to the display through a VGA/HDMI/DVI/SDI interface.
- The server receives the speaking instruction that the user sends from the conference control terminal and, according to that instruction, controls the camera to aim at the corresponding orientation and shoot the speaker video, so that the camera in the multimedia conference system is aimed automatically.
- The speaker video is displayed on the display screen automatically, so that the user at the chairman's station can trigger a speaking instruction through the conference control terminal to indicate who is speaking.
- The corresponding speaker video is displayed on the display screen of the conference site, which greatly improves the conference effect and the user experience.
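The division of work among the receiving module 10, the control module 20, and the sending module 30 can be sketched as three small classes wired into one handler. The class and method names, the fake camera and display, and the instruction format are assumptions for illustration only.

```python
class ReceivingModule:
    """Resolves a speaking instruction to a seat and its orientation."""

    def __init__(self, orientations):
        self.orientations = orientations  # seat id -> (pan, tilt), saved beforehand

    def handle(self, instruction):
        seat_id = instruction["speaker"]
        return seat_id, self.orientations[seat_id]


class ControlModule:
    """Points the camera at the orientation and returns the captured video."""

    def __init__(self, camera):
        self.camera = camera

    def shoot(self, orientation):
        self.camera.point(*orientation)
        return self.camera.capture()


class SendingModule:
    """Forwards the speaker video to the display screen."""

    def __init__(self, display):
        self.display = display

    def send(self, video):
        self.display.show(video)


class FakeCamera:
    """Stand-in for a real pan/tilt camera."""

    def __init__(self):
        self.pose = None

    def point(self, pan, tilt):
        self.pose = (pan, tilt)

    def capture(self):
        return f"video@{self.pose}"


class FakeDisplay:
    """Stand-in for a display screen; records what it was asked to show."""

    def __init__(self):
        self.shown = []

    def show(self, video):
        self.shown.append(video)


def handle_speaking_instruction(instruction, receiving, control, sending):
    """Run the receive -> control -> send sequence for one instruction."""
    seat_id, orientation = receiving.handle(instruction)
    sending.send(control.shoot(orientation))
    return seat_id


camera, display = FakeCamera(), FakeDisplay()
handle_speaking_instruction(
    {"speaker": "seat-02"},
    ReceivingModule({"seat-02": (0.0, -5.0)}),
    ControlModule(camera),
    SendingModule(display),
)
print(camera.pose, display.shown)
```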
- FIG. 8 is a schematic diagram of functional modules of a second embodiment of the multimedia conference server of the present invention.
- the multimedia conference server further includes a multimedia module 40.
- the receiving module 10 is further configured to receive video data of each of the sub-sites through a network connection;
- the server can receive video data of each sub-site through the RTP protocol.
- the server may connect to a remote conference site server or a SIP conference terminal of the conference site through the network to receive video data of each of the conference sites.
- the multimedia module 40 is configured to perform puzzle processing on video data of each of the sub-sites to obtain a puzzle video;
- the server can implement the puzzle processing of the video data of each of the sub-sites through the multimedia device in the server to obtain a puzzle video containing the video of each of the sub-sites.
- The server can perform puzzle processing in various layouts, for example 1+1 (1 main venue video + 1 sub-site video), 4-screen, 6-screen, 1+4 (1 main venue video + 4 sub-site videos), 1+5 (1 main venue video + 5 sub-site videos), 9-screen, and so on.
- the sending module 30 is further configured to send the puzzle video to a display screen for display.
- the server sends the puzzle video to a display for display.
- The display screen accessed through the resource access device of the server may be a single display screen or multiple display screens. For example, when multiple screens are connected, the first display is used to display the puzzle video of all the venues, the second display is used to display the speaker video, and the third display is used to display documents such as PPT.
- The server receives the video data of each sub-site, performs puzzle processing on it, and displays the resulting puzzle video, so that the video of every sub-site is shown; this improves the conference effect and the user experience.
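To make the puzzle layouts of this second embodiment concrete, the sketch below computes where each site's video would be placed on the output frame. It is an illustration only: the description names the layouts (4-screen, 6-screen, 9-screen, 1+4, 1+5) but not their geometry, so the equal grid and the "main picture on the left two thirds" arrangement are assumptions, as are the function names.

```python
import math
from typing import List, Tuple

Rect = Tuple[int, int, int, int]  # x, y, width, height in pixels


def grid_tiles(count: int, width: int, height: int) -> List[Rect]:
    """Equal-sized tiles for N-screen layouts such as 4-, 6- or 9-screen."""
    cols = math.ceil(math.sqrt(count))
    rows = math.ceil(count / cols)
    tile_w, tile_h = width // cols, height // rows
    return [
        ((i % cols) * tile_w, (i // cols) * tile_h, tile_w, tile_h)
        for i in range(count)
    ]


def main_plus_tiles(sub_count: int, width: int, height: int) -> List[Rect]:
    """1+N layout: main venue on the left, N sub-sites stacked on the right.

    The 2/3 : 1/3 split is an assumed geometry; sub_count must be at least 1.
    """
    main_w = width * 2 // 3
    sub_w = width - main_w
    sub_h = height // sub_count
    tiles = [(0, 0, main_w, height)]  # first tile is the main venue video
    tiles += [(main_w, i * sub_h, sub_w, sub_h) for i in range(sub_count)]
    return tiles
```

For a 1920x1080 output, `grid_tiles(4, 1920, 1080)` yields four 960x540 tiles, and `main_plus_tiles(4, 1920, 1080)` yields one 1280x1080 main tile plus four 640x270 tiles in the right-hand column.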
- FIG. 9 is a schematic diagram of functional modules of a third embodiment of the apparatus of the present invention.
- Based on the second embodiment of the multimedia conference server, the receiving module 10 includes a detecting unit 11, a determining unit 12, and a switching unit 13.
- the detecting unit 11 is configured to detect a network bandwidth of the network connection in real time when receiving video data of a sub-site through a network connection;
- the determining unit 12 is configured to determine a video bit rate and a video resolution corresponding to the changed network bandwidth when detecting that the network bandwidth changes;
- the switching unit 13 is configured to switch to the determined video bit rate and video resolution to continue receiving video data.
- While receiving the video data of a sub-site through the network connection, the server detects the network bandwidth of that connection in real time; when it detects that the network bandwidth has changed, the server determines the video bit rate and video resolution corresponding to the changed network bandwidth, then switches to the determined video bit rate and video resolution to continue receiving video data. For example, the server is receiving the video data of a conference site at a bit rate of 2000 kbps and detects that the network bandwidth has changed; if the changed network bandwidth matches a bit rate of 800 kbps, the server switches to 800 kbps to continue receiving the video data of that site.
- Adjusting the video resolution and bit rate according to the network bandwidth avoids problems such as video stuttering and screen corruption caused by network deterioration during the conference. When the network deteriorates, the video resolution and bit rate are automatically adjusted to suit the network bandwidth, achieving the best video quality possible under the current bandwidth conditions and improving the user experience.
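The detect, determine, and switch steps of this third embodiment can be sketched as a simple lookup over a table of profiles. The 2000 kbps and 800 kbps tiers come from the example above; the 400 kbps tier, the resolutions paired with each tier, and the names `LADDER`, `pick_profile`, and `Receiver` are assumptions made for illustration, since the description does not give the actual mapping.

```python
from typing import List, Tuple

Profile = Tuple[int, Tuple[int, int]]  # (bit rate in kbps, (width, height))

# Assumed profile table, highest bit rate first.
LADDER: List[Profile] = [
    (2000, (1280, 720)),
    (800, (640, 360)),
    (400, (320, 180)),
]


def pick_profile(bandwidth_kbps: float, ladder: List[Profile] = LADDER) -> Profile:
    """Determining unit: highest profile whose bit rate fits the bandwidth."""
    for profile in ladder:
        if profile[0] <= bandwidth_kbps:
            return profile
    return ladder[-1]  # below every tier: fall back to the lowest profile


class Receiver:
    """Tracks the active profile and switches when the bandwidth changes."""

    def __init__(self, bandwidth_kbps: float) -> None:
        self.profile = pick_profile(bandwidth_kbps)

    def on_bandwidth_sample(self, bandwidth_kbps: float) -> bool:
        """Detecting + switching units: returns True if the profile changed."""
        new_profile = pick_profile(bandwidth_kbps)
        if new_profile != self.profile:
            self.profile = new_profile
            return True
        return False
```

A receiver that starts at 2500 kbps selects the 2000 kbps profile; a later sample of 900 kbps moves it to the 800 kbps profile, while a further sample of 850 kbps causes no switch. A real system would likely add hysteresis so that small fluctuations around a tier boundary do not trigger repeated switching.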
- FIG. 10 is a schematic diagram of functional modules of a fourth embodiment of the apparatus of the present invention.
- the multimedia conference server further includes a display module 50, based on the first embodiment of the multimedia conference server;
- the display module 50 is configured to display a preset seat list through the conference control terminal, so that the user determines a speaker based on the seat list and triggers a corresponding speaking instruction;
- the receiving module 10 is further configured to receive a speaking instruction sent by the conference control terminal.
- The server displays a preset seat list through the conference control terminal, so that the user determines the speaker based on the seat list and triggers a corresponding speaking instruction; the server then receives the speaking instruction sent by the conference control terminal and performs the corresponding operation according to it.
- The seat list may be stored in the server; when the server receives a display instruction triggered by the conference control terminal, the server sends the seat list to the conference control terminal for display.
- On detecting a click by the host user on the seat list, the conference control terminal may trigger the corresponding speaking instruction to indicate that the participant at that seat is to speak.
- FIG. 6 is a schematic diagram of an effect of an embodiment of a seat list displayed by a conference control terminal according to the present invention.
- The multimedia conference server further includes a storage module. The receiving module is further configured to receive, when a setting instruction sent by the conference control terminal is received, the seat list and the orientation information corresponding to each seat as input by the user through the conference control terminal; the storage module is configured to save the received seat list and the orientation information corresponding to each seat.
- The conference control terminal triggers the corresponding speaking instruction; the server receives the speaking instruction sent by the conference control terminal and, according to it, controls the camera to point at the corresponding orientation and shoot the speaker video. This achieves automatic alignment of the camera with the speaker in the multimedia conference system, and the speaker video is automatically displayed on the display screen. The chairman-seat user can thus trigger a speaking instruction through the conference control terminal to indicate who is speaking, and the corresponding speaker video is displayed on the display screen of the conference site, which greatly improves the conference effect and the user experience.
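The seat-list handling of this fourth embodiment (save on a setting instruction, return on a display instruction, build a speaking instruction on a click) might look like the sketch below. The class and function names and the dictionary message format are hypothetical; the description does not define how the seat list or the speaking instruction is encoded.

```python
from typing import Dict, List, Tuple

Orientation = Tuple[float, float]  # hypothetical (pan, tilt) in degrees


class SeatRegistry:
    """Plays the role of the storage module: seats and their orientations."""

    def __init__(self) -> None:
        self._seats: Dict[str, Orientation] = {}

    def apply_setting(self, seats: Dict[str, Orientation]) -> None:
        """Save the seat list and orientations sent with a setting instruction."""
        self._seats = dict(seats)

    def seat_list(self) -> List[str]:
        """Seat list returned to the terminal on a display instruction."""
        return list(self._seats)

    def orientation_of(self, seat_id: str) -> Orientation:
        return self._seats[seat_id]


def on_seat_clicked(registry: SeatRegistry, seat_id: str) -> Dict[str, str]:
    """Terminal side: turn a click on the seat list into a speaking instruction."""
    if seat_id not in registry.seat_list():
        raise ValueError(f"unknown seat: {seat_id}")
    return {"type": "speak", "seat": seat_id}  # assumed message format
```

On the server, the `seat` field of the resulting instruction would be passed to `orientation_of` to obtain the orientation at which the camera should be aimed.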
- The methods of the foregoing embodiments can be implemented by software plus a necessary general-purpose hardware platform, and of course also by hardware, but in many cases the former is the better implementation. Based on this understanding, the technical solution of the present invention, in essence or in the part that contributes over the prior art, may be embodied in the form of a software product stored in a storage medium (such as ROM/RAM, a magnetic disk, or an optical disc) and including a number of instructions for causing a terminal device (which may be a cell phone, a computer, a server, an air conditioner, or a network device, etc.) to perform the methods described in the various embodiments of the present invention.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Telephonic Communication Services (AREA)
Abstract
The invention relates to a multimedia conference control method comprising the following steps: on receiving a speaking instruction sent by a conference control terminal, a server determines, according to the speaking instruction, the corresponding speaker seat and the orientation information corresponding to that speaker seat; the server adjusts a camera, according to the determined orientation information, to shoot video of the speaker seat; and the server sends the video of the speaker seat to a display screen for display. The invention also relates to a multimedia conference server that implements automatic alignment of a camera with a speaker in a multimedia conference system. A video of the speaker is automatically displayed on a display screen, so that a chairman-seat user can trigger, through the conference control terminal, a speaking instruction to indicate who is speaking. The corresponding speaker video is thus displayed on the display screen at the conference site, which significantly improves the conference effect and the user experience.
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201610255434.5A CN105812717A (zh) | 2016-04-21 | 2016-04-21 | 多媒体会议控制方法及服务器 |
| CN201610255434.5 | 2016-04-21 |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2017181508A1 (fr) | 2017-10-26 |
Family
ID=56458395
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2016/085049 Ceased WO2017181508A1 (fr) | 2016-04-21 | 2016-06-07 | Procédé et serveur de gestion de réunion multimédia |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN105812717A (fr) |
| WO (1) | WO2017181508A1 (fr) |
Cited By (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112616035A (zh) * | 2020-11-23 | 2021-04-06 | 深圳市捷视飞通科技股份有限公司 | 多画面拼接方法、装置、计算机设备和存储介质 |
Families Citing this family (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN106789914B (zh) * | 2016-11-24 | 2020-04-14 | 邦彦技术股份有限公司 | 一种多媒体会议控制方法和系统 |
| WO2018098780A1 (fr) * | 2016-12-01 | 2018-06-07 | 深圳前海达闼云端智能科技有限公司 | Procédé de présentation de publicité interactive, terminal et système d'interaction de ville intelligente |
| CN109246383B (zh) | 2017-07-11 | 2022-03-29 | 中兴通讯股份有限公司 | 一种多媒体会议终端的控制方法及多媒体会议服务器 |
| US10356362B1 (en) * | 2018-01-16 | 2019-07-16 | Google Llc | Controlling focus of audio signals on speaker during videoconference |
| US10511808B2 (en) * | 2018-04-10 | 2019-12-17 | Facebook, Inc. | Automated cinematic decisions based on descriptive models |
| CN109698928B (zh) * | 2018-11-15 | 2021-04-13 | 贵阳朗玛信息技术股份有限公司 | 一种调节视频会议系统中视频流的方法及装置 |
| CN111212218A (zh) * | 2018-11-22 | 2020-05-29 | 阿里巴巴集团控股有限公司 | 拍摄控制方法、设备及拍摄系统 |
| CN109547735B (zh) * | 2019-01-18 | 2024-04-16 | 海南科先电子科技有限公司 | 一种会议集成系统 |
| CN111245823A (zh) * | 2020-01-09 | 2020-06-05 | 福建星网智慧科技股份有限公司 | 一种基于lte协议可移动的无线专网音视频通信系统 |
| CN114067668B (zh) * | 2020-08-04 | 2024-12-20 | 广州艾美网络科技有限公司 | 可调多媒体系统及其控制方法 |
| CN116366961A (zh) * | 2021-12-24 | 2023-06-30 | 广西三诺数字科技有限公司 | 视频会议方法、装置及计算机设备 |
| CN114449205B (zh) * | 2022-04-08 | 2022-07-29 | 浙江华创视讯科技有限公司 | 数据处理方法、终端设备、电子设备及存储介质 |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR20030013017A (ko) * | 2001-08-06 | 2003-02-14 | 주식회사 호스트이엔아이 | 프리젠테이션 시스템에서의 화자 인식 방법 |
| CN102469295A (zh) * | 2010-10-29 | 2012-05-23 | 华为终端有限公司 | 会议控制方法及相关设备和系统 |
| CN102625077A (zh) * | 2011-01-27 | 2012-08-01 | 深圳市合智创盈电子有限公司 | 一种会议记录方法、会议摄像装置、客户机及系统 |
| CN103327250A (zh) * | 2013-06-24 | 2013-09-25 | 深圳锐取信息技术股份有限公司 | 基于模式识别镜头控制方法 |
| CN103986914A (zh) * | 2014-05-27 | 2014-08-13 | 东南大学 | 无线视频监控系统中基于客户端数量的码率自适应方法 |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| NO333026B1 (no) * | 2008-09-17 | 2013-02-18 | Cisco Systems Int Sarl | Styringssystem for et lokalt telepresencevideokonferansesystem og fremgangsmate for a etablere en videokonferansesamtale. |
| CN101742222A (zh) * | 2009-12-30 | 2010-06-16 | 华为终端有限公司 | 摄像头位置的操作方法及视频会议终端 |
| CN101877706B (zh) * | 2010-06-24 | 2013-04-17 | 北京邮电大学 | 多终端的多媒体会议控制系统及实现方法 |
| CN104144315B (zh) * | 2013-05-06 | 2017-12-29 | 华为技术有限公司 | 一种多点视频会议的显示方法及多点视频会议系统 |
| US20150146078A1 (en) * | 2013-11-27 | 2015-05-28 | Cisco Technology, Inc. | Shift camera focus based on speaker position |
| CN204119373U (zh) * | 2014-04-02 | 2015-01-21 | 中国舰船研究设计中心 | 一种数字会议人脸跟踪系统 |
| CN105163134B (zh) * | 2015-08-03 | 2018-09-07 | 腾讯科技(深圳)有限公司 | 直播视频的视频编码参数设置方法、装置及视频编码设备 |
-
2016
- 2016-04-21 CN CN201610255434.5A patent/CN105812717A/zh active Pending
- 2016-06-07 WO PCT/CN2016/085049 patent/WO2017181508A1/fr not_active Ceased
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112616035A (zh) * | 2020-11-23 | 2021-04-06 | 深圳市捷视飞通科技股份有限公司 | 多画面拼接方法、装置、计算机设备和存储介质 |
| CN112616035B (zh) * | 2020-11-23 | 2023-09-19 | 深圳市捷视飞通科技股份有限公司 | 多画面拼接方法、装置、计算机设备和存储介质 |
Also Published As
| Publication number | Publication date |
|---|---|
| CN105812717A (zh) | 2016-07-27 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | NENP | Non-entry into the national phase | Ref country code: DE |
| | 121 | EP: the EPO has been informed by WIPO that EP was designated in this application | Ref document number: 16899095; Country of ref document: EP; Kind code of ref document: A1 |
| | 122 | EP: PCT application non-entry in European phase | Ref document number: 16899095; Country of ref document: EP; Kind code of ref document: A1 |