WO2019174622A1 - Voice control device and voice control method - Google Patents
Voice control device and voice control method Download PDFInfo
- Publication number
- WO2019174622A1 WO2019174622A1 PCT/CN2019/078193 CN2019078193W WO2019174622A1 WO 2019174622 A1 WO2019174622 A1 WO 2019174622A1 CN 2019078193 W CN2019078193 W CN 2019078193W WO 2019174622 A1 WO2019174622 A1 WO 2019174622A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- volume
- value
- changed
- volume value
- voice
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03G—CONTROL OF AMPLIFICATION
- H03G1/00—Details of arrangements for controlling amplification
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03G—CONTROL OF AMPLIFICATION
- H03G3/00—Gain control in amplifiers or frequency changers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
Definitions
- the present embodiment relates to a voice control device and a voice control method.
- the operation using the remote control method or the like of the power-on/off, volume adjustment, and the like of the device can be remotely operated in a hands-free manner, and thus the convenience is greatly improved for the user.
- Patent Document 1 Japanese Laid-Open Patent Publication No. 2017-175397
- An object of the present invention is to provide a voice control device and a voice control method capable of suppressing a change in volume to an unintended size while reducing volume by remote operation.
- the voice control device of the present invention includes: an instruction server that analyzes an instruction of volume control input from the outside, and outputs an operation signal to a voice control device that is a controlled object; and a volume control unit that determines to change the volume value based on the operation signal, And comparing the current volume value of the voice control device with the changed volume value, where the changed volume value is greater than the set maximum volume value, or the difference between the changed volume value and the current volume value is greater than In the case of the set maximum volume change amount, the volume control unit resets the changed volume value.
- Fig. 1 is a schematic view showing an example of a configuration of a sound control system using a sound control device according to the present invention.
- FIG. 2 is a block diagram showing a broadcast receiving apparatus which is an example of a voice control device according to the present invention.
- Fig. 3 is a flow chart showing an example of a method of setting the maximum value of the volume and the maximum value of the volume change range.
- FIG. 4 is a flow chart for explaining an example of a voice control method according to the present invention.
- Fig. 5 is a flow chart showing an example of a method for determining whether or not the volume can be changed.
- Fig. 6 is a flow chart showing an example of a method of resetting the volume after the change.
- Fig. 7 is a flow chart showing another example of the voice control method according to the present invention.
- Fig. 1 is a schematic view showing an example of a configuration of a sound control system using a sound control device according to the present invention.
- the voice control device 100 has a voice control device 1 and an instruction server 1a. Further, the speaker 200 with a microphone is connected to the command server 1a via the network line 400 and the voice recognition server 300.
- the voice control device 1 can adjust the volume of the built-in audio device according to a control instruction from the outside.
- a broadcast receiving device will be described as an example of the voice control device 1.
- the broadcast receiving device 1 as a voice control device has a control unit 10.
- the control unit 10 is configured by a processor using a CPU or the like, and can control each unit in accordance with a program stored in the ROM 10a using the RAM 10b, or can realize some or all of the functions by using an electronic circuit of hardware.
- a nonvolatile memory 10c is also provided in the control unit 10.
- the preset volume maximum value Vh and the volume change width maximum value Vcl are stored in the nonvolatile memory 10c. Further, various kinds of information extracted from a broadcast signal to be described later are stored in the nonvolatile memory 10c.
- FIG. 2 is a block diagram showing a configuration of a broadcast receiving apparatus as an example of a volume control device according to an embodiment.
- the antenna 21 receives the height BS broadcast and the high-bandwidth CS broadcast (hereinafter referred to as high-bandwidth broadcast) and supplies the high-bandwidth broadcast signal to the input terminal 2a of the broadcast receiving apparatus 1.
- the height wideband broadcast signal input to the input terminal 2a is supplied to the tuner section Tu.
- the tuner section Tu has a plurality of tuners Tu1, Tu2, ... (hereinafter collectively referred to as tuner Tun).
- the tuner Tun is controlled by the control unit 10, and selects a broadcast signal of a desired channel from the respective high-bandwidth broadcast signals input, and outputs the selected signal to the signal processing unit 3.
- the signal processing unit 3 demodulates the input broadcast signal.
- each broadcast signal is input via an antenna
- the transmission path of the broadcast signal is not particularly limited, and may be an IP broadcast received via a network line without an antenna, via a cable television broadcast network. Received broadcast.
- a TLV (Type Length Value) stream can be obtained by demodulation by the signal processing unit 3.
- the signal processing unit 3 separates the IP packets included in the TLV stream, extracts video data, audio data, subtitle data, character superimposition (text), application data, and ECM (Entitlement Control Message) included in the IP packet. : Authorization Control Message), EMM (Entitlement Management Message) various messages.
- the signal processing unit 3 extracts a transport stream (a transport stream multiplexed according to the MPEG-2 Systems standard) by a demodulation process with respect to a current broadcast signal such as a terrestrial digital broadcast signal.
- the signal processing unit 3 extracts video data and audio data from the transport stream, and extracts various messages including ECM and EMM.
- a CAS module 15 is provided in the broadcast receiving apparatus 1.
- the CAS module 15 is used to define the control of the reception mode.
- the CAS module 15 may be mounted on a chip embedded in the broadcast receiving device 1 and may be configured by a substrate in which a CAS module is embedded, and the substrate may be inserted into a narrow not shown in the broadcast receiving device 1 .
- a slot is implemented to limit the reception function.
- the CAS module 15 also has a function of a limited reception function of a high-bandwidth broadcast and a limited reception function of a current broadcast, or a limited reception function of only a high-bandwidth broadcast, and the limited reception function of the current broadcast is adopted as an IC. The case where the CAS card of the card 23 is implemented.
- the signal processing unit 3 performs predetermined signal processing on the extracted video and audio of each broadcast method, and obtains video data and audio data.
- the signal processing unit 3 supplies video data and audio data to the graphics processing unit 4 or the audio processing unit 7, and supplies other data to the control unit 10.
- the OSD (On-screen display) signal generation unit 5 generates image information (OSD signal) for overprint display based on the information supplied from the control unit 10 and supplies it to the graphics processing unit 4.
- the graphics processing unit 4 manages the digitized video data from the signal processing unit 3 and the GUI (Graphical User Interface)-based window drawing from the OSD signal generating unit 5, and superimposes the data based on the image data-based image.
- the broadcast, the image of the GUI, and the like are sent to the image processing unit 6.
- various data for acquiring a program, electronic program guide (EPG) information, program attribute information, subtitle information, and the like are supplied to the graphics processing unit 4, and the graphics processing unit 4 can perform the above-described The image generation processing of the EPG, the subtitle, and the like is displayed by the input information.
- EPG electronic program guide
- the video processing unit 6 is controlled by the control unit 10 to convert the digital video signal from the graphics processing unit 4 into a format (number of pixels, frame rate, scanning method) that can be displayed on the video display unit 8, or to arbitrarily adjust the display color. It is output to the image display unit 8. Thereby, the video can be displayed on the display screen of the video display unit 8.
- the sound processing unit 7 converts the digitized sound data from the signal processing unit 3 into an analog sound signal that can be played by the speaker 9, and then outputs it to the speaker 9 to play the sound. It should be noted that when the broadcast receiving device is a set top box or the like, it is not necessary to include the video display unit 8 and the speaker 9.
- a card interface (I/F) 13, a communication I/F 16, a USB I/F 17, and an HDMI I/F 18 are provided in the broadcast receiving apparatus 1.
- the card I/F 13 enables data transmission and reception between the IC card 23 attached to the card holder 14 and the control unit 10.
- a CAS card hereinafter referred to as a B-CAS card
- the card I/F 13 transmits and receives data related to descrambling of the current broadcast.
- the communication I/F 16 is an interface corresponding to a predetermined communication path such as a telephone line or a LAN.
- the terminal 2b is connected to the command server 1a so that data can be transmitted and received via the command server 1a between the external device and the control unit 10.
- An operation signal such as a volume change can be input from the command server 1a via the communication I/F 16, and the communication I/F 16 can output the received operation signal to the control unit 10.
- the control unit 10 as the volume control unit controls the respective portions of the broadcast receiving device 1 based on the operation signal input from the command server 1a via the communication I/F 16. For example, when the operation of increasing the volume is performed, the control unit 10 generates an appropriate volume change instruction signal and outputs it to the sound processing unit 7 after verifying the validity of the input volume change operation. The sound processing unit 7 performs volume change processing based on the volume change instruction signal.
- the HDMI I/F 18 is an interface corresponding to the HDMI (registered trademark) standard, and can be connected to an external device (not shown) via the terminal 2d, and can receive an HDMI (registered trademark) signal from the external device and output it to the control unit 10.
- an HDMI (registered trademark) signal from the control unit 10 is output to the external device via the terminal 2d.
- the USBI/F 17 is an interface corresponding to the USB standard, and is connected to a USB-compatible device such as the HDD 24 via the terminal 2c, so that data can be transmitted and received between the USB-compatible device and the control unit 10.
- the control unit 10 can supply image data to the HDD 24 via the USB I/F 17 and the terminal 2c for recording.
- control unit 10 can also transmit a signal of the broadcast program. Recorded in each of these recording sections.
- control unit 10 is configured such that the descrambled program data (video and audio data) acquired by the signal processing unit 3 can be supplied to the USB I/F 17 via the USB I/F 17 when the user specifies the recording of the broadcast program. HDD24, making it record. It is to be noted that the control unit 10 is configured to be able to perform recording of a broadcast program even in a standby state (function standby state).
- the broadcast receiving device 1 is provided with an operation unit 11 and a receiving unit 12.
- the operation unit 11 is constituted by a switch, a key, a button, and the like (not shown), and outputs an operation signal based on a user operation to the control unit 10.
- the receiving unit 12 receives an operation signal based on a user operation from the remote controller 22, and outputs the received operation signal to the control unit 10.
- the control unit 10 controls each part of the broadcast receiving device 1 based on an operation signal from the operation unit 11 and the receiving unit 12. For example, when an operation of designating a channel number is designated by the up/down key, the one-touch dial key, or the like by the remote controller 22, each tuner Tun performs a channel selection operation based on an operation signal from the receiving unit 12.
- a power switch (not shown) for turning on/off the power of the broadcast receiving device 1 is disposed in the operation unit 11 and the remote controller 22.
- the broadcast receiving device 1 is provided with a power supply circuit (not shown) for supplying power to each unit, and the control unit 10 is configured to control the power supply state to each part of the broadcast receiving device 1 by operating a switch of the power supply. , thus set to standby.
- the standby state at least the graphics processing unit 4, the OSD signal generating unit 5, the video processing unit 6, the audio processing unit 7, the video display unit 8, and the speaker 9 of FIG. 2 are stopped, even if the control unit 10 is When the action is taken, the user cannot watch the broadcast program.
- the standby state means an output stop state of video and audio data.
- the control unit 10 can realize various functions as described below in the standby state.
- the control unit 10 has a power supply that supplies power to a circuit necessary for receiving the download content information based on the notification information in the standby state, downloads the downloaded content at the same time, and cuts off the power supply when the download ends. control function.
- the control unit 10 has a power supply control function that allows the download content to be downloaded and the power supply to be turned off when the download is completed, even when the power switch is turned off to the standby state.
- the control unit 10 can also operate in the same manner as the above-described energization control process for the EMM acquisition.
- the control unit 10 can acquire EPG information using the channel search in the standby state.
- the control unit 10 can receive the TMCC in the standby state, and monitors the emergency notification descriptor (MH) of the MPT in the reception channel even when the start control bit of the TMCC is 1, so that the emergency alarm broadcast (EWS) can be received and processed. .
- MH emergency notification descriptor
- the control unit 10 is capable of executing various reservation operations (program reservations, etc.) of the recipient.
- the command server 1a receives an operation signal input from the outside via the network line 400 and input to the broadcast receiving apparatus 1.
- the instruction content, parameters, and the like of the received operation signal are interpreted, converted into a format that can be processed by the broadcast receiving apparatus 1, and output.
- the operation signal output from the command server 1a is input to the control unit 10 of the broadcast receiving device 1 via the terminal 2b and the communication I/F 16.
- the speaker 200 with a microphone is a speaker having an auxiliary function of a network connection function and a sound operation. After the voice command issued by the user is input to the speaker 200 with the microphone, the sound command signal is output to the voice recognition server 300.
- the microphone-equipped speaker 200 can output not only the voice command from the user to the voice recognition server 300 but also the voice input from the voice recognition server 300.
- the voice recognition server 300 moves toward the command server 1a via the network line 400.
- the operation instruction content is output to the broadcast receiving apparatus 1 as an operation instruction signal.
- Fig. 3 is a flow chart showing an example of a method of setting the maximum value of the volume and the maximum value of the volume change width.
- the user turns on the power of the broadcast receiving device 1, and displays a setting menu screen for setting the maximum value of the volume and the maximum value of the volume change width on the display screen of the video display unit 8 using the remote controller 22 or the like (S1).
- the volume maximum value Vh should be selected from a value within a range from 0 to the maximum volume that the broadcast receiving apparatus 1 can output. For example, when the volume can be set to 0 to 100, the volume maximum value Vh is selected (or input) to a value of 100 or less. Further, the volume change width maximum value Vcl is also selected from a value within a range from 0 to the maximum volume that the broadcast receiving apparatus 1 can output.
- the default volume maximum value Vh and the volume change width maximum value Vcl are previously recorded in the nonvolatile memory 10c when the product is shipped, if the user does not perform the volume maximum value and the volume change amount is the largest.
- the default values can be used for the value settings.
- the maximum volume that can be set by the broadcast receiving device 1 is set to the volume maximum value Vh, and the maximum value can be set within the range in which the volume can be set.
- FIG. 4 is a flowchart illustrating an example of a voice control method according to the embodiment.
- the user inputs the content to the speaker 200 with the microphone in order to increase the volume of the broadcast receiving apparatus 1 (S11). For example, the user issues a voice command such as "increasing the volume of the television" to the speaker 200 with the microphone.
- the speaker 200 with a microphone outputs the input voice command as sound data to the voice recognition server 300 (S12).
- the voice recognition server 300 analyzes the input voice data and analyzes the user's intention (S13). Specifically, the operation content is discriminated by the analysis of the sound data, and the device as the operation target is judged. For example, when the sound data of "increasing the volume of the television" is input, it is determined that the operation content is the volume increase, and it is determined that the device to be operated is the broadcast receiving apparatus 1.
- the voice recognition server 300 outputs the analysis result to the command server 1a via the network line 400 (S14).
- the output destination of the analysis result is selected based on the operation target device determined in S13.
- the instruction server 1a to which the operation signal is input is selected to output the analysis result.
- the instruction server 1a recognizes the content of the operation, the parameters, and the operation target device based on the analysis result input from the voice recognition server 300.
- the operation target device is the broadcast receiving device 1
- the operation content is the volume increase
- an operation signal in a form that can be processed in the broadcast receiving device 1 as the volume control device is generated and output (S15).
- the operation signal output from the command server 1a is input to the control unit 10 of the broadcast receiving device 1 via the terminal 2b and the communication I/F 16.
- the control unit 10 checks the validity when the volume is controlled based on the operation signal input from the command server 1a (S16). Specifically, when the content is an instruction to increase the volume, it is determined whether or not the changed volume Vo exceeds the set volume maximum value Vh. Further, it is also determined whether or not the volume change amount Vc from the current volume Vp exceeds the maximum value Vcl of the set change width.
- S16 the specific inspection procedure in S16 will be described using FIG.
- FIG. 5 is a flowchart for explaining an example of a method of determining whether or not the volume can be changed.
- the current volume value Vp stored in the nonvolatile memory 10c is acquired (S21).
- the operation signal output from the command server 1a to the broadcast receiving apparatus 1 also includes the change in S15.
- the volume value Vo When the changed volume value Vo is input (S22, YES), the changed volume value Vo and the set volume maximum value Vh are compared (S23).
- the changed volume value Vo is not included in the operation signal output from the command server 1a to the broadcast receiving apparatus 1 (No in S22)
- the volume change amount Vc is also included in the operation signal output from the command server 1a to the broadcast receiving apparatus 1 in S15.
- the volume change amount Vc is input (Yes in S24), the current volume value Vp is added to the volume change amount Vc, and the changed volume value Vo is calculated (S26).
- the volume change amount Vc is not input (No in S24)
- the current volume value Vp is added to the preset volume change setting amount Vcd, and the changed volume value Vo is calculated (S25).
- the volume change setting amount Vcd is a value (predetermined value) used for the volume control when only the instruction to change the volume is input and the change condition is not input.
- the volume maximum value Vh and the volume change width maximum value Vcl are stored in advance in the nonvolatile memory 10c.
- the volume change range is compared with the set volume change width maximum value Vcl (S27). It should be noted that the magnitude of the volume change is calculated by subtracting the current volume value Vp from the changed volume value Vo.
- the volume change amount is larger than the set volume change width maximum value Vcl (S27, YES)
- the series of inspection steps is ended.
- the control unit 10 When the result of the check is OK (S17, YES), the control unit 10 outputs a control signal to the sound processing unit 7 in accordance with an instruction input from the command server 1a to change the volume of the speaker 9 to the change. After the volume value Vo. The sound processing unit 7 changes the volume in accordance with the control signal (S18), and ends the series of processes of the voice control.
- control unit 10 resets the volume value Vo after the change (S19). The specific procedure of resetting the changed volume value Vo will be described using FIG.
- FIG. 6 is a flowchart for explaining an example of a method of resetting the volume after the change.
- the value obtained by subtracting the current volume value Vp from the changed volume value Vo set at the time point that is, the volume change range is compared with the volume change width maximum value Vcl (S31).
- the volume change range is equal to or less than the volume change width maximum value Vcl (No in S31)
- the changed volume value Vo is compared with the set volume maximum value Vh (S33).
- the volume change range is larger than the volume change width maximum value Vcl (S31, YES)
- the volume obtained by adding the current volume value Vp and the volume change width maximum value Vcl is reset to the changed value.
- the volume value is Vo (S32).
- the changed volume value Vo is compared with the set volume maximum value Vh (S33).
- the control unit 10 outputs a control signal to the sound processing unit 7 to change the volume of the speaker 9 to the changed volume value Vo.
- the sound processing unit 7 changes the volume in accordance with the control signal (S19). Then, the user is notified of the content indicating that the content is restricted to be executed (S20), and the series of processing of the voice control is ended.
- the notification of the content indicating that the content is restricted to be executed is output from the broadcast receiving apparatus 1 via the instruction server 1a, the network line 400, and the voice recognition server 300 from the speaker 200 with the microphone.
- the manner of notification is not limited to sound.
- the volume maximum value Vh and the volume change range maximum value Vcl are set in advance in the audio device.
- the volume is changed in accordance with the input operation instruction content.
- the volume is changed within a predetermined range based on the volume maximum value Vh and the volume change width maximum value Vcl.
- the above-mentioned content is a determination as to whether or not the content of the operation instruction by the control unit 10 of the broadcast receiving device 1 as the audio device is appropriate, and when the content of the operation instruction is determined to be inappropriate, the operation may be performed by an instruction.
- the server 1a is coming.
- an operation signal is input to the command server 1a, and the operation target device is the broadcast receiving device 1.
- the command server 1a When the operation content is recognized as the volume increase, the command server 1a outputs the volume maximum value Vh and the volume change range to the broadcast receiving device 1. The maximum value Vcl and an indication of the current volume value Vp.
- the control unit 10 of the broadcast receiving device 1 When receiving the instruction, the control unit 10 of the broadcast receiving device 1 outputs the values stored in the nonvolatile memory 10c to the command server 1a.
- the instruction server 1a judges whether or not the operation instruction content is appropriate using these input values. If the determination is not appropriate, the volume value Vo after the change is reset, and an operation signal is input to the broadcast receiving device 1.
- the above two servers are provided with a voice recognition server having a function of analyzing a voice command input from a speaker with a microphone, analyzing a text data to analyze a user's intention, and an instruction server for giving an operation instruction to the audio device.
- the device transmits the operation content based on the voice instruction from the user.
- these functions may be performed by one server, or may be configured such that three or more servers share the functions.
- command server 1a may be configured inside the volume control device 1.
- FIG. 7 is a flowchart illustrating another example of the voice control method according to the embodiment.
- the broadcast receiving apparatus 1 When the user inputs a voice command to increase the volume of the broadcast receiving device 1 to the speaker 200 with the microphone, the user proceeds to the validity of the check operation instruction content (a series of steps from S11 to S17 in FIG. 4), and the determination check is performed. If the result is inappropriate (S17, NO), the broadcast receiving apparatus 1 notifies the user that the result of the check is "inappropriate” and asks whether or not the volume can be changed (S41). As a method of notification, in S20 of FIG. 4, similarly to the method of notifying that the change is controlled, the command server 1a, the network line 400, and the voice recognition server 300 are notified by sound from the speaker 200 with a microphone.
- the user inputs the operation instruction again by voice according to the inquiry of whether or not the volume can be changed.
- the voice command input from the user is again subjected to predetermined analysis by the voice recognition server 300, and is input as an operation signal to the command server 1a via the network line 400.
- the command server 1a converts the operation signal into a predetermined format and inputs it to the broadcast receiving apparatus 1.
- the control unit 10 outputs a control signal to the sound processing unit 7 in accordance with an instruction input from the command server 1a to change the volume of the speaker 9 in accordance with an operation instruction input from the user.
- the sound processing unit 7 changes the volume in accordance with the control signal (S42), and ends the series of processes of the voice control.
- volume maximum value Vh volume change width maximum value Vcl
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Circuit For Audible Band Transducer (AREA)
- Control Of Amplification And Gain Control (AREA)
Abstract
Description
本实施方式涉及声控装置以及声控方法。The present embodiment relates to a voice control device and a voice control method.
近年来,在将各式各样的设备连接到网络而推进IoT(Internet of Things:物联网)化的过程中,各个设备联合动作的情形增加。尤其是,在具有搭载AI功能的无线通信连接功能与声音操作的辅助功能的扬声器(下面表述为AI扬声器)已得到实用化之后,使用AI扬声器对经由网络连接的设备进行操作的情况不断增加。In recent years, in the process of connecting various devices to the network and promoting the Internet of Things (IoT), the situation in which each device operates in tandem has increased. In particular, after a speaker having an AI function-equipped wireless communication connection function and an auxiliary function for sound operation (hereinafter referred to as an AI speaker) has been put into practical use, the use of an AI speaker to operate a device connected via a network has been increasing.
例如,当使用AI扬声器时,设备的电源的接通/断开、音量调整等使用遥控方法等进行的操作能够以免提的方式远程操作,因此对于用户来说,极大地提高了便利性。For example, when the AI speaker is used, the operation using the remote control method or the like of the power-on/off, volume adjustment, and the like of the device can be remotely operated in a hands-free manner, and thus the convenience is greatly improved for the user.
但是,另一方面,由于在从AI扬声器等其它设备进行远程控制的情况下,用户实际上大多情况下不需要看着被控对象的设备进行操作,因此用户有时会进行误操作(输入错误的指示)。另外,由于用户对AI扬声器的误操作、指示传递路径中途的故障等,有可能会进行非用户本意的动作。However, on the other hand, in the case of remote control from other devices such as AI speakers, the user does not actually need to look at the device of the controlled object for operation, and thus the user sometimes performs an erroneous operation (incorrect input) Instructions). In addition, due to the user's erroneous operation of the AI speaker, a failure in the middle of the instruction transmission path, etc., it is possible to perform an operation that is not intended by the user.
例如,在想要通过AI扬声器等其它设备的声音识别功能控制内置于广播接收装置等中的音频设备(扬声器)的音量的情况下,会存在因声音的误识别等而导致音量变得非常大的问题。For example, when it is desired to control the volume of an audio device (speaker) built in a broadcast receiving device or the like by a voice recognition function of another device such as an AI speaker, there is a case where the volume becomes extremely large due to erroneous recognition of the sound or the like. The problem.
现有技术文献Prior art literature
专利文献Patent literature
专利文献1:日本特开2017-175397号公报Patent Document 1: Japanese Laid-Open Patent Publication No. 2017-175397
发明内容Summary of the invention
本发明的目的在于,提供一种在通过远程操作进行音量调整时能够抑制音量变更为非意愿的大小而减少对用户造成的刺激的声控装置以及声控方法。An object of the present invention is to provide a voice control device and a voice control method capable of suppressing a change in volume to an unintended size while reducing volume by remote operation.
本发明的声控装置具有:指令服务器,对从外部输入的音量控制的指令进行解析,并向作为被控对象的声控设备输出操作信号;以及音量控制部,基于所述操作信号确定变更音量值,并将所述声控设备的当前音量值与所述变更音量值进行比较,在所述变更音量值大于已设定的最大音量值、或者所述变更音量值与所述当前音量值的差分大于已设定的最大音量变更量的情况下,所述音量控制部对所述变更音量值进行重新设定。The voice control device of the present invention includes: an instruction server that analyzes an instruction of volume control input from the outside, and outputs an operation signal to a voice control device that is a controlled object; and a volume control unit that determines to change the volume value based on the operation signal, And comparing the current volume value of the voice control device with the changed volume value, where the changed volume value is greater than the set maximum volume value, or the difference between the changed volume value and the current volume value is greater than In the case of the set maximum volume change amount, the volume control unit resets the changed volume value.
图1是表示采用本发明涉及的声控装置的声控系统的一例结构概略图。Fig. 1 is a schematic view showing an example of a configuration of a sound control system using a sound control device according to the present invention.
图2是表示本发明涉及的声控设备的一例的广播接收装置的结构图。FIG. 2 is a block diagram showing a broadcast receiving apparatus which is an example of a voice control device according to the present invention.
图3是说明音量最大值以及音量变更幅度最大值的一例设定方法的流程图。Fig. 3 is a flow chart showing an example of a method of setting the maximum value of the volume and the maximum value of the volume change range.
图4是说明本发明所涉及的一例声控方法的流程图。4 is a flow chart for explaining an example of a voice control method according to the present invention.
图5是说明判断能否变更音量的一例方法流程图。Fig. 5 is a flow chart showing an example of a method for determining whether or not the volume can be changed.
图6是说明对变更后的音量进行重新设定的一例方法流程图。Fig. 6 is a flow chart showing an example of a method of resetting the volume after the change.
图7是说明本发明涉及的声控方法的其它一例流程图。Fig. 7 is a flow chart showing another example of the voice control method according to the present invention.
以下,参照附图对实施方式进行说明。Hereinafter, embodiments will be described with reference to the drawings.
图1是表示采用本发明所涉及的声控装置的声控系统的一例结构的概略图。声控装置100具有声控设备1与指令服务器1a。另外,带麦克风的扬声器200经由网络线路400、声音识别服务器300与指令服务器1a连接。Fig. 1 is a schematic view showing an example of a configuration of a sound control system using a sound control device according to the present invention. The
声控设备1能根据来自外部的控制指示调整内置的音频设备的音量。以下,作为声控设备1的一例,对广播接收装置进行说明。作为声控设备的广播接收装置1具有控制部10。控制部10由采用了CPU等的处理器构成,既可以按照存储于ROM10a的程序使用RAM10b进行动作来控制各部,也可以利用硬件的电子电路实现一部分或者全部的功能。另外,在控制部10中也设有非易失性存储器10c。在非易失性存储器10c中存储有预先设定的音量最大值Vh与音量变更幅度最大值Vcl。另外,在非易失性存储器10c中存储有从后述的广播信号中提取的各种信息。图2是表示作为实施方式涉及的音量控制设备的一例的广播接收装置的结构框图。The
天线21接收高度BS广播以及高度宽带CS广播(下面将它们称作高度宽带广播)并向广播接收装置1的输入端子2a供给高度宽带广播信号。输入到输入端子2a的高度宽带广播信号被供给至调谐部Tu。The
调谐部Tu具有多个调谐器Tu1、Tu2、……(下面将它们总称为调谐器Tun)。调谐器Tun由控制部10控制,从分别输入的高度宽带广播信号 中选择希望的频道的广播信号,并将所选择的信号向信号处理部3输出。信号处理部3对输入的广播信号进行解调。The tuner section Tu has a plurality of tuners Tu1, Tu2, ... (hereinafter collectively referred to as tuner Tun). The tuner Tun is controlled by the
在图2的例子中仅示出一根与高度宽带广播对应的天线21,但也可以构成为设置与地面数字广播、BS广播、CS广播对应的天线,将来自各天线的信号供给至调谐部Tu。在该情况下,信号处理部3与各广播的调制方式对应地进行解调。由此,在信号处理部3中获得各广播信号的流。需要注意的是,在图2中示出各广播信号经由天线输入的例子,但广播信号的传输路径没有特别限定,也可以是不经由天线而经由网络线路接收的IP广播、经由有线电视广播网接收的广播。In the example of FIG. 2, only one
例如,对于高度宽带广播信号,通过信号处理部3的解调,可得到TLV(Type Length Value:类型长度值)流。信号处理部3分离包含在TLV流中的IP包,提取包含在IP包中的影像数据、声音数据、字幕数据、字符叠加(文字ス一パ一)、应用程序数据以及包含ECM(Entitlement Control Message:授权控制消息)、EMM(Entitlement Management Message:授权管理消息)的各种消息。For example, for the highly wideband broadcast signal, a TLV (Type Length Value) stream can be obtained by demodulation by the
另外,信号处理部3对于地面数字广播信号等现行广播信号,通过解调处理提取传送流(根据MPEG-2Systems标准而被多重化的传送流)。信号处理部3从传送流中提取影像数据以及声音数据,并提取包含ECM、EMM的各种消息。Further, the
在广播接收装置1中设有CAS模块15。CAS模块15用于限定接收方式的控制。CAS模块15有时也会构成为搭载在内置于广播接收装置1的芯片上,另外,也考虑由嵌入有CAS模块的基板构成,通过将该基板插入设于广播接收装置1的未图示的狭槽而实现限定接收功能的情况。另外,CAS模块15也存在具有高度宽带广播的限定接收功能与现行广播的 限定接收功能这两功能的情况,或者仅具备高度宽带广播的限定接收功能,而现行广播的限定接收功能则通过作为IC卡23的CAS卡来实现的情况。A
信号处理部3在对各广播方式的被提取的影像以及声音进行了解码处理之后进行规定的信号处理,从而获得影像数据以及声音数据。信号处理部3分别向图形处理部4或者声音处理部7提供影像数据以及声音数据,并向控制部10提供其它的数据。The
OSD(On-screen display)信号生成部5基于从控制部10提供的信息,生成用于叠印显示的图像信息(OSD信号)并提供给图形处理部4。图形处理部4对来自信号处理部3的数字化的影像数据与来自OSD信号生成部5的基于GUI(Graphical User Interface:图形用户界面)的窗口描绘进行管理,在基于影像数据的图像上叠加基于数据广播、GUI的图像等,并将叠加后的图像送至影像处理部6。The OSD (On-screen display)
例如,除了影像数据之外,还向图形处理部4提供用于获取节目的各种数据、电子节目指南(EPG)信息、节目属性信息、字幕信息等,图形处理部4能够进行为了能够基于这些输入的信息而显示EPG、字幕等的图像生成处理。For example, in addition to the video data, various data for acquiring a program, electronic program guide (EPG) information, program attribute information, subtitle information, and the like are supplied to the
影像处理部6由控制部10控制,将来自图形处理部4的数字影像信号转换为能够在影像显示部8显示的格式(像素数、帧频、扫描方式)、或任意地调整显示色,并向影像显示部8输出。由此,能够在影像显示部8的显示画面上显示影像。The
另外,声音处理部7在将来自信号处理部3的数字化的声音数据转换为能通过扬声器9播放的模拟声音信号之后,向扬声器9输出而播放声音。 需要注意的是,在广播接收装置为机顶盒等的情况下,不需要具备影像显示部8、扬声器9。Further, the
在广播接收装置1中设有卡接口(I/F)13、通信I/F 16、USBI/F 17以及HDMII/F 18。卡I/F 13使安装于卡保持架14的IC卡23与控制部10之间能够进行数据的收发。例如,在采用与现行数字广播对应的CAS卡(下面称作B-CAS卡)作为IC卡23的情况下,卡I/F 13收发与现行广播的解扰相关的数据。A card interface (I/F) 13, a communication I/
通信I/F 16是与电话线路、LAN等规定的通信路径对应的接口。经由端子2b与指令服务器1a连接,使得外部设备与控制部10之间能够经由指令服务器1a进行数据的收发。能够经由通信I/F 16从指令服务器1a输入音量变更等的操作信号,通信I/F 16能够将接收到的操作信号向控制部10输出。The communication I/
作为音量控制部的控制部10基于经由通信I/F 16由指令服务器1a输入的操作信号,控制广播接收装置1的各个部分。例如,在进行了增大音量的操作的情况下,在控制部10中,在验证了输入的音量变更操作的妥当性之后,生成适当的音量变更指示信号并向声音处理部7输出。声音处理部7根据音量变更指示信号进行音量变更处理。The
另外,HDMI I/F 18是与HDMI(注册商标)标准对应的接口,能够经由端子2d与未图示的外部设备连接而接收来自外部设备的HDMI(注册商标)信号并向控制部10输出,或者经由端子2d向外部设备输出来自控制部10的HDMI(注册商标)信号。In addition, the HDMI I/
USBI/F 17是与USB标准对应的接口,经由端子2c与HDD24等USB对应设备连接,使得USB对应设备与控制部10之间能够进行数据的收发。 例如,在HDD24连接到了端子2c的情况下,控制部10能够经由USBI/F 17以及端子2c向HDD24提供影像数据,使其记录。The USBI/
需要注意的是,在通过内置于广播接收装置1的未图示的HDD、光盘驱动、SD(注册商标)卡读写器等构成记录部的情况下,控制部10也能够将广播节目的信号记录于这些各记录部。In the case where the recording unit is configured by an HDD, an optical disk drive, an SD (registered trademark) card reader/writer or the like (not shown) built in the
例如,控制部10的构成应使得在用户指定了广播节目的录像的情况下,能够将由信号处理部3获取到的解扰后的节目数据(影像以及声音数据)经由USB I/F 17提供给HDD24,使其进行记录。需要注意的是,控制部10构成为在待机状态(功能待机状态)下也能够执行广播节目的记录。For example, the
在广播接收装置1中设有操作部11以及接收部12。操作部11由未图示的开关、键、按钮等构成,向控制部10输出基于用户操作的操作信号。另外,接收部12接收从遥控器22发出的基于用户操作的操作信号,并将接收到的操作信号向控制部10输出。控制部10基于来自操作部11以及接收部12的操作信号,控制广播接收装置1的各个部分。例如,在借助遥控器22进行了通过上移/下移键、单触选台键等指定接收频道号的操作的情况下,各调谐器Tun根据来自接收部12的操作信号进行选台动作。The
在操作部11以及遥控器22中配设有用于操作广播接收装置1的电源的接通/断开的未图示的电源开关。在广播接收装置1中设有用于向各部供给电源的未图示的电源电路,控制部10的构成为:通过操作电源的开关,控制部10能够控制向广播接收装置1各部分的电源供给状态,从而设置为待机状态。A power switch (not shown) for turning on/off the power of the
需要注意的是,在待机状态下,至少图2的图形处理部4、OSD信号生成部5、影像处理部6、声音处理部7、影像显示部8以及扬声器9停止动作,即便控制部10正在进行动作,用户也无法收看广播电视节目。而 且,在将本实施方式应用于接收高度宽带广播而输出影像以及声音的解码器的情况下,待机状态意指影像以及声音数据的输出停止状态。It should be noted that, in the standby state, at least the
控制部10在待机状态下,能够如下所述地实现各种功能。例如,控制部10具有即便在待机状态下也会基于通知信息利用自身的计时器向下载内容信息的接收所需的电路供给电源、同时对下载内容进行下载并在下载结束时切断其电源的电源控制功能。另外,控制部10具有即便在进行电源开关的断开操作而转移至待机状态的情况下也保持必要的电路的通电状态而对下载内容进行下载并在下载结束时切断电源的电源控制功能。The
控制部10为了EMM获取,也能够与上述的通电控制过程同样地进行动作。The
控制部10在待机状态下能够使用频道搜索获取EPG信息。The
控制部10在待机状态下能够接收TMCC,在TMCC的起动控制位为1的情况下也监视处于接收频道的MPT的紧急通知描述符(MH),从而能够对紧急报警广播(EWS)进行接收处理。The
控制部10能够执行接收者的各种预约操作(节目预约等)。The
指令服务器1a接收经由网络线路400从外部输入的、向广播接收装置1输入的操作信号。对接收到的操作信号的指示内容、参数等进行解释,在将其转换为能够在广播接收装置1处理的格式并输出。从指令服务器1a输出的操作信号经由端子2b、通信I/F 16输入广播接收装置1的控制部10。The command server 1a receives an operation signal input from the outside via the
带麦克风的扬声器200是具有网络连接功能与声音操作的辅助功能的扬声器。当用户发出的声音指令输入到所述带麦克风的扬声器200之后,将该声音指令信号向声音识别服务器300输出。所述带麦克风的扬声器 200不仅能够将来自用户的声音指令输出给声音识别服务器300,而且还能够对从声音识别服务器300输入的数据进行声音输出。The
声音识别服务器300对从带麦克风的扬声器200输入的声音指令进行语言解析,识别声音指令想要进行的操作内容与操作对象。例如,在用户向带麦克风的扬声器200输入声音指令“提高电视的音量”时,声音识别服务器300通过语言解析识别出操作内容为“提高音量”、操作对象为“电视(=广播接收装置1)”。然后,将操作内容作为操作指示信号对操作对象的装置或者控制操作对象的装置的设备输出。The
例如,在图1所示的情况下,在从带麦克风的扬声器200输入的对广播接收装置1进行的操作指示作为声音指令的情况下,声音识别服务器300经由网络线路400朝着指令服务器1a而将该操作指示内容作为操作指示信号向广播接收装置1输出。For example, in the case shown in FIG. 1, in the case where an operation instruction to the
接下来,对本实施方式中的声控方法进行说明。作为用于接收来自外部的远程操作的预先作业,首先,在广播接收装置1中设定音量最大值Vh、音量变更幅度最大值Vcl。对于该设定作业,使用图3进行说明。图3是一例说明音量最大值以及音量变更幅度最大值设定方法的流程图。Next, the voice control method in the present embodiment will be described. As a pre-work for receiving a remote operation from the outside, first, the
首先,用户将广播接收装置1的电源接通,使用遥控器22等使影像显示部8的显示画面上显示用于设定音量最大值以及音量变更幅度最大值的设定菜单画面(S1)。First, the user turns on the power of the
接下来,使用遥控器22等输入音量最大值Vh以及音量变更幅度最大值Vcl的值(S2)。音量最大值Vh应从0到该广播接收装置1能够输出的最大音量为止的范围内的值中进行选择。例如,在能够设定至音量0~100的情况下,音量最大值Vh选择(或者输入)100以下的值。另外,音量 变更幅度最大值Vcl也从0到该广播接收装置1能够输出的最大音量为止的范围内的值中进行选择。Next, the value of the volume maximum value Vh and the volume change width maximum value Vcl (S2) is input using the
最后,通过按下在画面上显示的保存按钮等,从而将在S2中设定的值写入非易失性存储器10c(S3)。Finally, the value set in S2 is written in the
需要注意的是,在产品出厂时等预先将默认的音量最大值Vh以及音量变更幅度最大值Vcl记录于非易失性存储器10c中的情况下,若用户没有进行音量最大值以及音量变更幅度最大值的设定,则能够使用这些默认的值。It is to be noted that, in the case where the default volume maximum value Vh and the volume change width maximum value Vcl are previously recorded in the
另外,在未设定默认的值、且用户也未进行设定的情况下,将广播接收装置1能够设定的最大音量设为音量最大值Vh,在能够设定音量的范围内,将最大音量减去最小音量得到的值设为音量变更幅度最大值Vcl。例如,在能够设定的音量范围为0~100的情况下,设音量最大值Vh=100、音量变更幅度最大值Vcl=100(=100-0)。Further, when the default value is not set and the user does not perform the setting, the maximum volume that can be set by the
接下来,对本实施方式中的声控方法进行说明。如图4所示,对根据用户的声音指令,通过远程操作提高广播接收装置1的音量的方法进行说明。图4是说明本实施方式涉及的一例声控方法的流程图。Next, the voice control method in the present embodiment will be described. As shown in FIG. 4, a method of increasing the volume of the
首先,用户将内容为提高广播接收装置1的音量的指示以声音形式输入带麦克风的扬声器200(S11)。例如用户向带麦克风的扬声器200发出“提高电视的音量”等声音指令。带麦克风的扬声器200将输入的声音指令作为声音数据向声音识别服务器300输出(S12)。First, the user inputs the content to the
声音识别服务器300对输入的声音数据进行解析,分析用户的意图(S13)。具体来说,通过声音数据的解析,辨别操作内容,并且判断作为操作对象的设备。例如,在输入了“提高电视的音量”的声音数据的情况 下,辨别操作内容是提高音量,并判定作为操作对象的设备是广播接收装置1。The
接下来,声音识别服务器300将分析结果经由网络线路400向指令服务器1a输出(S14)。根据在S13中判定的操作对象设备,选择分析结果的输出目的地。在操作对象为广播接收装置1的情况下,选择向其输入操作信号的指令服务器1a输出分析结果。Next, the
指令服务器1a基于从声音识别服务器300输入的分析结果,识别操作的内容、参数、操作对象设备。当其识别出操作对象设备为广播接收装置1、操作内容为提高音量时,生成并输出可在作为音量控制设备的广播接收装置1中处理的形式的操作信号(S15)。从指令服务器1a输出的操作信号经由端子2b、通信I/F 16输入广播接收装置1的控制部10。The instruction server 1a recognizes the content of the operation, the parameters, and the operation target device based on the analysis result input from the
在广播接收装置1中,于控制部10中检查基于从指令服务器1a输入的操作信号控制了音量时的妥当性(S16)。具体来说,在输入了内容为提高音量的指示的情况下,判断变更后的音量Vo是否超过所设定的音量最大值Vh。另外,也判断从当前的音量Vp起的音量变更量Vc是否超过所设定的变更幅度的最大值Vcl。使用图5对S16中的具体的检查步骤进行说明。In the
图5是对判断能否变更音量的方法的一例进行说明的流程图。首先,获取保存于非易失性存储器10c中的当前的音量值Vp(S21)。接下来,判断是否从指令服务器1a输入了变更后的音量值Vo(S22)。在图4的S11中用户指定具体的音量并向带麦克风的扬声器200输入了内容为提高音量的指示的情况下,在S15中从指令服务器1a输出到广播接收装置1的操作信号中也包含变更后的音量值Vo。在输入了变更后的音量值Vo的情况下(S22,是),比较变更后的音量值Vo与所设定的音量最大值Vh(S23)。FIG. 5 is a flowchart for explaining an example of a method of determining whether or not the volume can be changed. First, the current volume value Vp stored in the
另一方面,在从指令服务器1a输出到广播接收装置1的操作信号中未包含变更后的音量值Vo的情况下(S22,否),判断是否从指令服务器1a输入了音量变更量Vc(S24)。在图4的S11中用户指定具体的音量变更量并向带麦克风的扬声器200输入了内容为提高音量的指示的情况、例如输入了“将电视的音量提高10”等声音指示的情况下,在S15中从指令服务器1a输出至广播接收装置1的操作信号中也包含音量变更量Vc。On the other hand, when the changed volume value Vo is not included in the operation signal output from the command server 1a to the broadcast receiving apparatus 1 (No in S22), it is determined whether or not the volume change amount Vc is input from the command server 1a (S24). ). In S11 of FIG. 4, when the user specifies a specific volume change amount and inputs an instruction to increase the volume to the microphone-equipped
在输入了音量变更量Vc的情况下(S24,是),将当前的音量值Vp与音量变更量Vc相加,计算变更后的音量值Vo(S26)。另一方面,在未输入音量变更量Vc的情况下(S24,否),将当前的音量值Vp与预先设定的音量变更设定量Vcd相加,计算变更后的音量值Vo(S25)。需要注意的是,音量变更设定量Vcd是在仅输入变更音量的指示而未输入变更条件的情况下用于音量控制的值(规定值)。与音量最大值Vh、音量变更幅度最大值Vcl同样地预先存储于非易失性存储器10c中。When the volume change amount Vc is input (Yes in S24), the current volume value Vp is added to the volume change amount Vc, and the changed volume value Vo is calculated (S26). On the other hand, when the volume change amount Vc is not input (No in S24), the current volume value Vp is added to the preset volume change setting amount Vcd, and the changed volume value Vo is calculated (S25). . It is to be noted that the volume change setting amount Vcd is a value (predetermined value) used for the volume control when only the instruction to change the volume is input and the change condition is not input. The volume maximum value Vh and the volume change width maximum value Vcl are stored in advance in the
若在S25或者S26中算出变更后的音量值Vo,则接着比较变更后的音量值Vo与所设定的音量最大值Vh(S23)。在变更后的音量值Vo大于音量最大值Vh的情况下(S23,是),判定检查结果为不能改变(S29),并结束一系列的检查步骤。When the changed volume value Vo is calculated in S25 or S26, the changed volume value Vo and the set volume maximum value Vh are compared (S23). When the changed volume value Vo is larger than the volume maximum value Vh (S23, YES), it is determined that the inspection result is unchangeable (S29), and the series of inspection steps is ended.
另一方面,在变更后的音量值Vo为音量最大值Vh以下的情况下(S23,否),比较音量的变更幅度与所设定的音量变更幅度最大值Vcl(S27)。需要注意的是,音量的变更幅度是通过从变更后的音量值Vo中减去当前的音量值Vp来计算的。在音量的变更幅度大于所设定的音量变更幅度最大值Vcl的情况下(S27,是),判定检查结果为不能改变(S29),并结束一系列的检查步骤。On the other hand, when the changed volume value Vo is equal to or less than the volume maximum value Vh (No in S23), the volume change range is compared with the set volume change width maximum value Vcl (S27). It should be noted that the magnitude of the volume change is calculated by subtracting the current volume value Vp from the changed volume value Vo. When the volume change amount is larger than the set volume change width maximum value Vcl (S27, YES), it is determined that the check result is unchangeable (S29), and the series of inspection steps is ended.
另一方面,在音量的变更幅度为所设定的音量变更幅度最大值Vcl以下的情况下(S27,否),判定检查结果为OK(S28),并结束一系列的检查步骤。On the other hand, when the volume change range is equal to or smaller than the set volume change width maximum value Vcl (No in S27), it is determined that the check result is OK (S28), and the series of inspection steps is ended.
返回图4的步骤,在检查结果为OK的情况下(S17,是),控制部10按照从指令服务器1a输入的指示,向声音处理部7输出控制信号,以将扬声器9的音量变更为变更后的音量值Vo。声音处理部7按照控制信号变更音量(S18),结束声控的一系列的处理。When the result of the check is OK (S17, YES), the
另一方面,在检查结果为不能改变的情况下(S17,否),控制部10重新设定变更后的音量值Vo(S19)。使用图6对重新设定变更后的音量值Vo的具体步骤进行说明。On the other hand, if the result of the check is not changeable (No in S17), the
图6是对重新设定变更后的音量的一例方法进行说明的流程图。首先,将从在该时间点已设定的变更后的音量值Vo中减去当前的音量值Vp而得的值、即音量的变更幅度与音量变更幅度最大值Vcl进行比较(S31)。在音量的变更幅度为音量变更幅度最大值Vcl以下的情况下(S31,否),将变更后的音量值Vo与所设定的音量最大值Vh进行比较(S33)。FIG. 6 is a flowchart for explaining an example of a method of resetting the volume after the change. First, the value obtained by subtracting the current volume value Vp from the changed volume value Vo set at the time point, that is, the volume change range is compared with the volume change width maximum value Vcl (S31). When the volume change range is equal to or less than the volume change width maximum value Vcl (No in S31), the changed volume value Vo is compared with the set volume maximum value Vh (S33).
另一方面,在音量的变更幅度大于音量变更幅度最大值Vcl的情况下(S31,是),将当前的音量值Vp与音量变更幅度最大值Vcl相加而得的音量重新设定为变更后的音量值Vo(S32)。然后,将变更后的音量值Vo与所设定的音量最大值Vh进行比较(S33)。On the other hand, when the volume change range is larger than the volume change width maximum value Vcl (S31, YES), the volume obtained by adding the current volume value Vp and the volume change width maximum value Vcl is reset to the changed value. The volume value is Vo (S32). Then, the changed volume value Vo is compared with the set volume maximum value Vh (S33).
在变更后的音量值Vo为所设定的音量最大值Vh以下的情况下(S33,否),确定变更后的音量值Vo,并结束一系列的音量重设定步骤。当变更后的音量值Vo大于所设定的音量最大值Vh的情况下(S33,是),将音量最大值Vh重新设定为变更后的音量值Vo(S34),并结束一系列的音量重设定步骤。When the changed volume value Vo is equal to or smaller than the set volume maximum value Vh (No in S33), the changed volume value Vo is determined, and a series of volume resetting steps are ended. When the changed volume value Vo is larger than the set volume maximum value Vh (S33, YES), the volume maximum value Vh is reset to the changed volume value Vo (S34), and the series of volume ends. Reset the steps.
返回图4的步骤,在重新设定了变更后的音量值Vo之后,控制部10向声音处理部7输出控制信号,以便将扬声器9的音量变更为变更后的音量值Vo。声音处理部7按照控制信号来变更音量(S19)。然后,向用户通知操作指示内容受限地被执行的内容(S20),并结束声控的一系列的处理。Returning to the step of FIG. 4, after the changed volume value Vo is reset, the
操作指示内容受限地被执行的内容的通知是从广播接收装置1经由指令服务器1a、网络线路400、声音识别服务器300从带麦克风的扬声器200输出。The notification of the content indicating that the content is restricted to be executed is output from the
例如,在广播接收装置1的当前的音量为19、音量最大值Vh为40时,在用户向带麦克风的扬声器200输入“将电视的音量提高30”的情况下,根据音量最大值Vh的限制条件,变更后的音量并非是用户希望的49而是40。这样,在没有将音量变更为用户意图的状态的情况下,经由带麦克风的扬声器200以声音向用户进行通知。需要注意的是,通知的方式不限于声音,例如,既可以在广播接收装置1等的显示画面上作为消息进行显示,也可以采用其它方式对用户进行通知。另外,依赖于声音的通知不仅可以通过被输入操作内容的设备(=带麦克风的扬声器200)来进行,例如也可以通过广播接收装置1的扬声器9来进行。For example, when the current volume of the
这样,根据本实施方式,在通过远程控制从带麦克风的扬声器等其它设备对音响设备操作指示提高音量的内容的情况下,在音频设备中预先设定音量最大值Vh以及音量变更幅度最大值Vcl,通过将被音量输入的操作指示内容与这些设定值进行比对,由此判断操作指示内容是否妥当。在判定为“妥当”的情况下,按照所输入的操作指示内容变更音量。另一方面,在判定为“不妥当”的情况下,基于音量最大值Vh以及音量变更幅度最大值Vcl,在规定的范围内变更音量。这样就能够防止从音频装置中突然输出大的音量,能够减少对用户的刺激。As described above, according to the present embodiment, in the case where the content of the audio device is instructed to increase the volume by remote control from another device such as a speaker with a microphone, the volume maximum value Vh and the volume change range maximum value Vcl are set in advance in the audio device. By comparing the contents of the operation instruction input by the volume with these set values, it is judged whether or not the operation instruction content is appropriate. When it is judged as "appropriate", the volume is changed in accordance with the input operation instruction content. On the other hand, when it is judged as "inappropriate", the volume is changed within a predetermined range based on the volume maximum value Vh and the volume change width maximum value Vcl. This makes it possible to prevent a sudden output of a large volume from the audio device, and it is possible to reduce irritation to the user.
此外,上述内容是由作为音响设备的广播接收装置1的控制部10进行的操作指示内容是否妥当的判断以及在判定操作指示内容不妥当的情况下进行代替操作指示,但这些操作也可以由指令服务器1a来进行。In addition, the above-mentioned content is a determination as to whether or not the content of the operation instruction by the
例如,向指令服务器1a输入操作信号,操作对象设备为广播接收装置1,当识别出操作内容为提高音量时,从指令服务器1a对广播接收装置1输出使其发送音量最大值Vh、音量变更幅度最大值Vcl以及当前的音量值Vp的指示。广播接收装置1的控制部10在接收到该指示时向指令服务器1a输出保存于非易失性存储器10c中的这些值。指令服务器1a使用输入的这些值,判断操作指示内容是否妥当。在判定不妥当的情况下,重新设定变更后的音量值Vo,并向广播接收装置1输入操作信号。For example, an operation signal is input to the command server 1a, and the operation target device is the
另外,上面使用具有对从带麦克风的扬声器输入的声音指令进行解析,解析文本数据来分析用户的意图的功能的声音识别服务器、以及向音频装置进行操作指示的指令服务器这两台服务器,向音频装置传递基于来自用户的声音指示的操作内容,但也可以由1台服务器来进行这些功能,或者可以构成为使3台以上的多台服务器分担进行这些功能。Further, the above two servers are provided with a voice recognition server having a function of analyzing a voice command input from a speaker with a microphone, analyzing a text data to analyze a user's intention, and an instruction server for giving an operation instruction to the audio device. The device transmits the operation content based on the voice instruction from the user. However, these functions may be performed by one server, or may be configured such that three or more servers share the functions.
另外,也可以将指令服务器1a构成于音量控制设备1的内部。Alternatively, the command server 1a may be configured inside the
进一步地,上面在判定操作指示内容为“不妥当”的情况下,在声控设备中重新设定为妥当的操作指示内容而进行了音量变更,但也可以向用户询问能否操作。图7是说明本实施方式所涉及的声控方法的其它一例的流程图。Further, in the case where the determination operation instruction content is "inappropriate", the volume change is performed by resetting the sound operation device to the appropriate operation instruction content, but the user may be inquired as to whether or not the operation is possible. FIG. 7 is a flowchart illustrating another example of the voice control method according to the embodiment.
用户向带麦克风的扬声器200输入了内容为提高广播接收装置1的音量的声音指令后,实施至检查操作指示内容的妥当性(从图4的S11至S17的一系列的步骤),在判定检查结果为不妥当的情况下(S17,否),广播接收装置1向用户通知检查结果为“不妥当”,询问能否变更音量 (S41)。作为通知的方法,希望在图4的S20中,与通知变更受到控制这一内容的方法同样地经由指令服务器1a、网络线路400、声音识别服务器300从带麦克风的扬声器200中以声音进行通知。When the user inputs a voice command to increase the volume of the
用户根据能否变更音量的询问,再次以声音输入操作指示。从用户输入的声音指令再次通过声音识别服务器300进行规定的分析,并作为操作信号经由网络线路400输入指令服务器1a。指令服务器1a将操作信号转换为规定的格式而输入广播接收装置1。控制部10按照从指令服务器1a输入的指示,向声音处理部7输出控制信号,以按照从用户输入的操作指示变更扬声器9的音量。声音处理部7按照控制信号变更音量(S42),并结束声控的一系列的处理。The user inputs the operation instruction again by voice according to the inquiry of whether or not the volume can be changed. The voice command input from the user is again subjected to predetermined analysis by the
这样,在判定最开始的操作指示内容不妥当的情况下,向用户再次进行询问,从而在用户想要超过设定值(音量最大值Vh、音量变更幅度最大值Vcl)变更音量的情况下也能够应对,能够更灵活地进行音量调整操作。When it is determined that the content of the initial operation instruction is not appropriate, the user is again inquired, and when the user wants to change the volume by exceeding the set value (volume maximum value Vh, volume change width maximum value Vcl) It can cope with and can perform volume adjustment operations more flexibly.
本发明不限于上述的实施例,可在不偏离发明的宗旨的范围内进行各种变更、应用是不言而喻的。The present invention is not limited to the above-described embodiments, and various changes and applications can be made without departing from the spirit and scope of the invention.
Claims (10)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201980018951.2A CN112243525A (en) | 2018-03-15 | 2019-03-14 | Voice control device and voice control method |
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2018-048338 | 2018-03-15 | ||
| JP2018048338A JP6947356B2 (en) | 2018-03-15 | 2018-03-15 | Acoustic control device and acoustic control method |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2019174622A1 true WO2019174622A1 (en) | 2019-09-19 |
Family
ID=67908527
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2019/078193 Ceased WO2019174622A1 (en) | 2018-03-15 | 2019-03-14 | Voice control device and voice control method |
Country Status (3)
| Country | Link |
|---|---|
| JP (1) | JP6947356B2 (en) |
| CN (1) | CN112243525A (en) |
| WO (1) | WO2019174622A1 (en) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2022029906A1 (en) * | 2020-08-05 | 2022-02-10 | 三菱電機株式会社 | Control device, control system, and control method |
| JP7766498B2 (en) * | 2022-01-18 | 2025-11-10 | 日本放送協会 | Volume control device, device, control system, and program |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20060018492A1 (en) * | 2004-07-23 | 2006-01-26 | Inventec Corporation | Sound control system and method |
| CN102436821A (en) * | 2011-12-02 | 2012-05-02 | 海能达通信股份有限公司 | Method and equipment for self-adaptive adjustment of sound effect |
| CN106375594A (en) * | 2016-10-25 | 2017-02-01 | 乐视控股(北京)有限公司 | Method and device for adjusting equipment, and electronic equipment |
| CN107484000A (en) * | 2017-09-29 | 2017-12-15 | 北京奇艺世纪科技有限公司 | A kind of volume adjusting method of terminal, device and voice remote controller |
| CN107657954A (en) * | 2017-10-27 | 2018-02-02 | 成都常明信息技术有限公司 | A kind of Intelligent volume speech robot people |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP4282614B2 (en) * | 2005-01-27 | 2009-06-24 | 株式会社東芝 | Amplifying device, communication terminal device and volume control method |
| JP2008061062A (en) * | 2006-09-01 | 2008-03-13 | Sharp Corp | Video / audio output device |
| JP2008141721A (en) * | 2006-11-06 | 2008-06-19 | Matsushita Electric Ind Co Ltd | Broadcast receiving terminal |
| JP5061774B2 (en) * | 2007-08-02 | 2012-10-31 | ソニー株式会社 | Video signal generator |
| JP5434372B2 (en) * | 2009-08-26 | 2014-03-05 | ヤマハ株式会社 | Volume control device |
| CN103137126A (en) * | 2011-11-30 | 2013-06-05 | 北京德信互动网络技术有限公司 | Intelligent electronic device based on voice control and voice control method |
| CN103209370A (en) * | 2012-01-16 | 2013-07-17 | 联想(北京)有限公司 | Electronic equipment and method for adjusting file sound parameters output by sound playing device |
| CN103731617A (en) * | 2013-12-03 | 2014-04-16 | 乐视致新电子科技(天津)有限公司 | Volume adjustment method and device of smart television |
-
2018
- 2018-03-15 JP JP2018048338A patent/JP6947356B2/en active Active
-
2019
- 2019-03-14 CN CN201980018951.2A patent/CN112243525A/en active Pending
- 2019-03-14 WO PCT/CN2019/078193 patent/WO2019174622A1/en not_active Ceased
Patent Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20060018492A1 (en) * | 2004-07-23 | 2006-01-26 | Inventec Corporation | Sound control system and method |
| CN102436821A (en) * | 2011-12-02 | 2012-05-02 | 海能达通信股份有限公司 | Method and equipment for self-adaptive adjustment of sound effect |
| CN106375594A (en) * | 2016-10-25 | 2017-02-01 | 乐视控股(北京)有限公司 | Method and device for adjusting equipment, and electronic equipment |
| CN107484000A (en) * | 2017-09-29 | 2017-12-15 | 北京奇艺世纪科技有限公司 | A kind of volume adjusting method of terminal, device and voice remote controller |
| CN107657954A (en) * | 2017-10-27 | 2018-02-02 | 成都常明信息技术有限公司 | A kind of Intelligent volume speech robot people |
Also Published As
| Publication number | Publication date |
|---|---|
| CN112243525A (en) | 2021-01-19 |
| JP2019161546A (en) | 2019-09-19 |
| JP6947356B2 (en) | 2021-10-13 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN102630383B (en) | Display device, control method for said display device | |
| US7975285B2 (en) | Broadcast receiver and output control method thereof | |
| US12348825B2 (en) | Apparatus, systems and methods for pre-tuning a second tuner in anticipation of a channel surfing activity | |
| KR100651894B1 (en) | Imaging Device and Control Method | |
| JP5362834B2 (en) | Display device, program, and computer-readable storage medium storing program | |
| US20080089534A1 (en) | Video playing apparatus and method of controlling volume in video playing apparatus | |
| US20110181789A1 (en) | Volume adjustment device and volume adjustment method | |
| JP2009077192A (en) | Receiving device and image output control method of receiving device | |
| WO2019174622A1 (en) | Voice control device and voice control method | |
| JP2010081639A (en) | Method and apparatus for controlling video signal processing apparatus | |
| US20020154246A1 (en) | Method and apparatus for control of auxiliary video information display | |
| KR20060035079A (en) | Method and apparatus for preprocessing service information in digital cable broadcasting | |
| US20150024732A1 (en) | Electronic device and method for controlling the same | |
| KR20100072681A (en) | Apparatus and method for image displaying in image display device | |
| JP5166570B2 (en) | Electronic device and video processing method | |
| JP2002344840A (en) | Broadcast receiver provided with broadcast language display function | |
| JP6923177B2 (en) | Broadcast receiving device and broadcasting receiving method | |
| JP2019161547A (en) | Electronic apparatus and power supply state setting method | |
| KR100731357B1 (en) | Image quality adjusting method and image processing device performing the same | |
| KR20220163644A (en) | Signal processing device and method thereof | |
| KR20010076448A (en) | The audio signal controlling method of digital television | |
| CN1988618A (en) | Broadcast receiving apparatus capable of manual fine tuning and method thereof | |
| JP2011130365A (en) | Video display device, video display method, television receiver, program, and recording medium | |
| JP2019186833A (en) | Broadcast receiver and broadcast reception method | |
| KR20080074628A (en) | EPP information display device and method of digital TV |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 19767903 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| NENP | Non-entry into the national phase |
Ref country code: JP |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 19767903 Country of ref document: EP Kind code of ref document: A1 |