WO2015096504A1 - Voice command correcting method, correcting system and electronic device - Google Patents
Voice command correcting method, correcting system and electronic device Download PDFInfo
- Publication number
- WO2015096504A1 WO2015096504A1 PCT/CN2014/085567 CN2014085567W WO2015096504A1 WO 2015096504 A1 WO2015096504 A1 WO 2015096504A1 CN 2014085567 W CN2014085567 W CN 2014085567W WO 2015096504 A1 WO2015096504 A1 WO 2015096504A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- voice
- voice command
- information
- command
- corrected
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
Definitions
- the present invention relates to the field of electronic technologies, and in particular, to a method for correcting voice commands, a correction system, and an electronic device. Background technique
- a human-computer interaction method in which voice command operation is very common is also a very natural way of human-computer interaction.
- user terminal devices such as smart phones, tablets, iPads, etc.
- voice command operations on user terminal devices has become a trend.
- the electronic device When receiving the input voice information, the electronic device extracts the voice feature information according to the input voice information, and then searches for a voice command corresponding to the extracted voice feature information in the voice feature database, and executes the voice command to complete the voice. Command operation.
- the voice feature library is the key data that distinguishes different users and distinguishes different commands, it is the personalized data of the user's "exclusive", and is also affected by many factors such as hardware conditions and usage environment of the user terminal equipment. Therefore, after the user terminal device is replaced or the voice feature library is upgraded, the voice feature information of some or all voice commands of the voice feature library may be deviated from the corresponding voice feature information of the device set, so that when the voice command is operated, there will be The case of identifying inaccurate or incapable of recognizing the input voice information.
- the present invention provides a voice command correction method, a correction system, and an electronic device, which solves the voice feature information of some or all voice commands of the voice feature library after the user terminal device is replaced or the voice feature library is upgraded in the prior art.
- the corresponding voice feature information of the device set may be deviated, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized.
- a first aspect of the embodiments of the present invention provides a method for correcting a voice command, where the method includes: obtaining a voice command that needs to be corrected in a voice feature database; prompting a user to input voice information corresponding to the voice command; The voice information input by the user; obtaining the voice feature information according to the collected voice information; updating the voice feature information of the voice command to be corrected according to the obtained voice feature information, to correct the voice command.
- the obtaining the voice command that needs to be corrected in the voice feature database includes:
- the corrected voice command is a voice command that needs to be corrected each time the voice feature library is received.
- the obtaining a voice command that needs to be corrected in the voice feature library includes:
- the updated voice command and its voice feature information are received, and the updated voice command is determined to be a voice command that needs to be corrected.
- the updating the voice command to be corrected according to the obtained voice feature information is specifically: replacing the obtained voice feature information with the voice feature information of the voice command that needs to be corrected.
- the fourth aspect is In an implementation manner, the prompting user inputs voice information corresponding to the voice command, specifically:
- a second aspect of the embodiments of the present invention provides a method for correcting a voice command, where the method includes: obtaining a voice command set that needs to be corrected in a voice feature database, where the voice command set includes M voice commands, and the M is greater than An integer equal to 2;
- N N of the pre-corrected and corrected voice feature information according to the corrected N first voice commands a deviation value, determining a manner of correcting the second voice command, correcting the second voice command according to the determined correction mode, where the second voice command is the (N+1)th to the Mth Any of the voice commands.
- the determining, according to the N deviation values of the pre-corrected and corrected speech feature information, the N first voice commands that have been corrected are determined to be the second voice command.
- the way to correct including:
- the change rule is searched according to the N deviation values
- determining to update the voice feature information of the second voice command according to the change rule to correct the second voice command If the change rule is found, determining to update the voice feature information of the second voice command according to the change rule to correct the second voice command; otherwise, determining to perform the second voice command: prompting user input
- the voice information corresponding to the second voice command is used for the voice information
- the voice information input by the user obtains the voice feature information according to the collected voice information, and updates the voice feature information of the second voice command according to the obtained voice feature information to correct the second voice command.
- the N, before and after the correction, based on the N first voice commands that have been corrected Determining the manner of correcting the second voice command further includes: if the N deviation values are less than the first threshold, discarding the correction of the second voice command.
- the N of the pre-corrected and corrected voice feature information according to the N first voice commands that have been corrected And determining, by the deviation value, the manner of correcting the second voice command, further comprising: if the N deviation values are greater than the second threshold, re-determining the voice command set that needs to be corrected in the voice feature database .
- a third aspect of the embodiments of the present invention provides a voice command correction system, where the system includes: a determining unit, configured to obtain a voice command in a voice feature library that needs to be corrected; and a prompting unit, configured to prompt a user to input the voice a voice information corresponding to the command; a set unit for collecting voice information input by the user; an obtaining unit, configured to obtain voice feature information according to the collected voice information; and a correcting unit, configured to use the obtained voice feature information
- the voice feature information of the voice command to be corrected is updated to correct the voice command.
- the determining unit is specifically configured to: receive the voice feature database; according to a specific voice command in the voice feature library preset by the system, or according to the voice The frequency of use of the voice command in the feature library, or the voice command selected in the voice feature library according to the input information of the user determines a voice command to be corrected, and the specific voice command is a voice that needs to be corrected after receiving the voice feature library each time. command.
- the determining unit is specifically configured to:
- the correcting unit is specifically configured to: replace the obtained voice feature information with the The voice feature information of the voice command that needs to be corrected.
- the prompting unit is specifically configured to: according to the voice
- the voice feature information of the command generates voice information and plays, or plays a machine sound corresponding to the voice command; or displays text or image information corresponding to the voice command.
- a fourth aspect of the embodiments of the present invention provides a voice command correction system, where the system includes: an obtaining unit, configured to obtain a voice command set that needs to be corrected in a voice feature database, where the voice command set includes M voice commands.
- the M is an integer greater than or equal to 2;
- a first correcting unit configured to perform, for each of the first to Nth voice commands in the voice command set: prompting the user to input voice information corresponding to the first voice command, and collecting the user input And the voice information is obtained according to the collected voice information, and the sound feature information of the first voice command in the voice feature database is updated according to the obtained voice feature information to correct the first voice command, N is an integer smaller than the M is greater than 0, and the first voice command is any one of the first to Nth voice commands;
- a second correcting unit configured to perform, for each of the N+1th to Mth voice commands in the voice command set: before and after correction according to the N first voice commands that have been corrected And determining, by the N deviation values of the following voice feature information, a manner of correcting the second voice command, and correcting the second voice command according to the determined correction mode, where the second voice command is the N+1 to any one of the M voice commands.
- the second calibration unit is specifically configured to:
- the change rule is searched according to the N deviation values
- determining to update the voice feature information of the second voice command according to the change rule to correct the second voice command If the change rule is found, determining to update the voice feature information of the second voice command according to the change rule to correct the second voice command; otherwise, determining the second voice Command execution: prompting a user to input voice information corresponding to the second voice command, collecting voice information input by the user, obtaining voice feature information according to the collected voice information, and updating the voice information according to the obtained voice feature information Two voice command information of the voice command to correct the second voice command.
- the second correcting unit is further configured to: if the N deviation values are smaller than the first threshold, The correction of the second voice command is then abandoned.
- the determining unit is further configured to: if the N deviation values are greater than the second threshold, Re-determining the set of voice commands in the speech feature library that need to be corrected.
- a fifth aspect of the embodiments of the present invention provides an electronic device, where the electronic device includes: a memory, configured to store a voice feature database, the voice feature library includes a plurality of voice commands, and a processor, configured to obtain a voice feature library.
- a voice command that needs to be corrected obtains voice feature information according to the collected voice information, and updates voice feature information of the voice command to be corrected according to the obtained voice feature information to correct the voice command;
- the processor when the electronic device receives the voice feature database, the processor is specifically configured to: according to a specific voice command in the voice feature library preset by the system Or determining a voice command to be corrected according to a frequency of use of the voice command in the voice feature library, or a voice command selected in the voice feature library according to input information of the user, where the specific voice command is a voice feature received each time. Voice commands that need to be corrected after the library.
- the processor when the electronic device receives the second voice feature database, the processor is further configured to: receive the updated voice command and the voice feature thereof Information, and determining the updated voice command as a voice command that needs to be corrected.
- the processor is specifically configured to: obtain the obtained voice feature information The voice feature information of the voice command to be corrected is replaced.
- the output device is specifically a voice player or a display screen; the voice player is configured to generate voice information according to the voice feature information of the voice command and play, or play the machine sound corresponding to the voice command; the display screen is used to display the text corresponding to the voice command or Image information.
- a sixth aspect of the embodiments of the present invention provides an electronic device, where the electronic device includes: a memory, configured to store a voice feature database, the voice feature library includes a plurality of voice commands, and a processor, configured to obtain the voice feature
- the set of voice commands to be corrected in the library is executed for each of the first to Nth voice commands in the voice command set: the voice feature information is obtained according to the collected voice information, and is updated according to the obtained voice feature information.
- the first voice command is any one of the first to Nth voice commands
- the second voice command is any one of the N+1th to Mth voice commands;
- an output device configured to prompt the user to input voice information corresponding to the first voice command
- input means configured to collect voice information input by the user.
- the processor is specifically configured to:
- the change rule is searched according to the N deviation values
- determining to update the first voice feature information of the second voice command according to the change rule to correct the second voice command determining to perform the second voice command: prompting
- the user inputs voice information corresponding to the second voice command, collects voice information input by the user, obtains voice feature information according to the collected voice information, and obtains voice feature information according to the obtained voice information.
- the obtained voice feature information updates the voice feature information of the second voice command to correct the second voice command.
- the processor is further configured to: if the N deviation values are smaller than the first threshold, The correction of the second voice command is then abandoned.
- the processor is further configured to: if the N deviation values are greater than the second threshold, Then, the voice command set that needs to be corrected in the voice feature library is re-determined.
- the method for correcting the voice command, the correction system, and the electronic device by prompting the user to input voice information corresponding to the voice command, receiving voice information input by the user according to the prompt, acquiring voice feature information according to the voice information, and using the voice feature information Updating the original voice feature information of the voice command to be corrected; or, when the number of voice commands to be corrected is M, the first N voice commands of the M voice commands to be corrected use the input voice
- the voice feature information of the information replaces the corresponding voice feature information to correct the first N voice commands, and the back (MN) voice commands pass the original voice feature information of the corrected first N voice commands and the input voice feature information.
- the deviation value is selected to be corrected, so that the latter (MN) voice commands are corrected, so that the language feature information of the voice command can be adapted to the user terminal device after replacement or the voice feature library is upgraded, and the prior art is solved.
- the voice feature information of some or all voice commands of the voice feature library may be deviated from the corresponding voice feature information of the device set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized.
- FIG. 1 is a flowchart of a method for correcting a voice command according to a first preferred embodiment of the present invention
- FIG. 2 is a flowchart of a method for correcting a voice command according to a second preferred embodiment of the present application
- 4 is a structural block diagram of a system for correcting a voice command according to a fourth preferred embodiment of the present invention
- FIG. 5 is a schematic structural diagram of an electronic device according to a fifth preferred embodiment of the present application;
- FIG. 6 is a schematic structural diagram of an electronic device according to a sixth preferred embodiment of the present application. detailed description
- the embodiment of the present application solves the voice of some or all voice commands of the voice feature library after the user terminal device is replaced or the voice feature database is upgraded in the prior art by providing a voice command correction method, a correction system, and an electronic device. There is a deviation between the feature information and the corresponding voice feature information of the device set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized.
- Executing part or all of the voice commands in the voice feature library prompting the user to input voice information corresponding to the voice command; collecting voice information input by the user; and obtaining voice feature information according to the collected voice information; And updating the voice feature information of the voice command to be corrected according to the obtained voice feature information to correct the voice command.
- the voice command is corrected, so that the language feature information of the voice command can be adapted to the user terminal device after the replacement or the voice feature database is upgraded, and the user equipment device is replaced or voiced in the prior art.
- the voice feature information of some or all of the voice commands of the voice feature library may be deviated from the corresponding voice feature information of the device set, so that there is a technique of identifying the inaccurate or failing to recognize the input voice information during the voice command operation. problem.
- the method for correcting the voice command includes the following steps:
- Step 110 Obtain a voice command in the voice feature library that needs to be corrected.
- the voice feature library generally includes a plurality of voice commands, voice feature information corresponding to the plurality of voice commands, and further, may further include text information corresponding to the plurality of commands.
- the voice command in the voice feature library is usually a control instruction name, and when the feature value of the voice information collected by the electronic device through the audio input device matches the corresponding feature information of a voice command in the voice feature library, The electronic device performs a corresponding control operation according to the name of the control instruction corresponding to the voice command.
- the voice feature information that is, the voice feature value, is a feature value extracted from the voice information generated when the specific voice input voice command is recognized, and the voice feature values in the voice information generated when different user voices input the same voice command may be different.
- Step 120 Prompt the user to input voice information corresponding to the voice command.
- the first one playing the voice information corresponding to the voice command through an audio output device (such as an audio, a headphone, etc.) of the electronic device, such as voice feature information according to the voice command.
- an audio output device such as an audio, a headphone, etc.
- Generating voice information and playing, or directly playing the machine sound corresponding to the voice command for example, when the voice command corresponds to opening the camera control command, generating voice information according to the voice feature information or according to The machine sound corresponding to the voice command plays the voice message with "turn on the camera”.
- the second type displaying text or image information corresponding to the voice command, for example, when the voice command corresponds to opening a camera control instruction, displaying text information with the words "open camera” through a display screen or the like.
- Step 130 Collect voice information input by the user.
- the voice when the user repeats according to the prompt can be collected by an audio input device (such as a microphone or the like) of the electronic device.
- an audio input device such as a microphone or the like
- the voice input has a voice message with "turn on the camera”.
- the voice message with "turn on the camera” is collected by an input device such as a microphone.
- Step 140 Acquire voice feature information according to the collected voice information, that is, extract voice feature values of the voice information according to the collected voice information.
- the speech feature information can be obtained by extracting the Mel Frequency Cepstrum Coefficient (MFCC) in the collected speech information:
- MFCC Mel Frequency Cepstrum Coefficient
- N is the number of points of the Fourier transform of the collected speech information
- M is the number of filters of the filter bank used later
- S(m) is the logarithmic energy output by each filter bank.
- Different systems may use different order MFCC coefficients for speech feature extraction and recognition.
- the MFCC coefficient is a feature information parameter that is widely used at present. Extracting the Meir cepstral coefficient in the voice information of the collected set is the way to obtain the speech feature information of the present application, and can also be decomposed by empirical mode decomposition.
- the (EMD) technique or the like extracts speech feature information, and does not limit the use of MFCC coefficients.
- Step 150 Update voice feature information of the voice command to be corrected according to the obtained voice feature information to correct the voice command.
- the original voice feature information is directly replaced with the obtained voice feature information, and the original voice feature information is not retained, that is, the obtained voice feature information is replaced by the need.
- the voice feature information of the corrected voice command secondly, adjusting the parameter value of the voice feature information of the voice command to be corrected according to the deviation value of the obtained voice feature information and the voice feature information of the voice command to be corrected,
- the voice feature information of the voice command is made applicable.
- a voice command to be corrected is obtained in the voice feature library, and the voice command is assumed to correspond to a control instruction of the control device to perform the camera operation;
- the eighth output has a voice message with "turn on the camera", or a prompt for text message; after the user receives the above prompt, the voice input has a voice message with "turn on the camera", and the band is collected by an input device such as a microphone.
- the voice information of "turning on the camera” is extracted; the voice feature information is extracted according to the voice information collected; finally, the voice feature information is updated with the voice feature information, for example, the original voice feature information is replaced, and the correction is completed.
- the method for correcting the voice command receives the voice command that needs to be corrected, receives the voice information corresponding to the voice command input by the user, acquires the voice feature information according to the voice information, and updates the voice feature information to the corresponding original voice.
- Feature information thereby making the language of the voice command
- the feature information can be adapted to the voice feature information of some or all voice commands of the voice feature library after the user terminal device is replaced or the voice feature library is upgraded in the prior art after the user terminal device is replaced or the voice feature database is upgraded. There may be deviations from the corresponding voice feature information of the device set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized.
- the voice command to be corrected may be one of the voice feature library or a plurality of commands.
- the voice command may be executed for each of the plurality of voice commands. Steps 120, 130, 140, and 150, or performing the following steps one by one for the plurality of voice commands, that is, after the first voice command is corrected, performing the second voice command correction, and then performing the third voice command correction. .. ..., until the calibration is complete.
- step 110 how to obtain the voice command in the voice feature library is required to be corrected in the following two manners:
- this method can be used after the user terminal device is replaced, that is, when a voice feature library is newly installed on a terminal device.
- the method specifically includes the following steps:
- the corrected voice command is a voice command that needs to be corrected each time the voice feature library is received.
- the method is specifically:
- the updated voice command and its voice feature information are received, and the updated voice command is determined to be a voice command that needs to be corrected.
- the upgrade voice command set may include voice feature information corresponding to the original voice command in the voice feature library, or may be a newly added voice command and corresponding voice feature information. Therefore, it is determined that the voice command to be corrected may be the original voice command, or may be a newly added voice command.
- the method for correcting the voice command is configured to prompt the user to input the voice information corresponding to the voice command, receive the voice information input by the user according to the prompt, acquire the voice feature information according to the voice information, and update the voice feature information to the voice command that needs to be corrected.
- the voice feature information is such that the language feature information of the voice command can be adapted to the user terminal device after the replacement or the voice feature library is upgraded, and the voice feature database is solved after the user terminal device is replaced or the voice feature database is upgraded in the prior art.
- the voice feature information of some or all of the voice commands may deviate from the corresponding voice feature information of the device set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized.
- the present application further provides a method for correcting a voice command, as shown in FIG. 2, which is a flowchart of a method for correcting a voice command according to a second preferred embodiment of the present application.
- the method for correcting the voice command includes the following steps:
- Step 210 Obtain a voice command set that needs to be corrected in the voice feature database, where the voice command set includes M voice commands, and the M is an integer greater than or equal to 2.
- the voice feature library generally includes a plurality of voice commands, voice feature information corresponding to the plurality of voice commands, and further may further include text information corresponding to the plurality of commands.
- the voice command in the voice feature library is usually a control instruction name, and when the feature value of the voice information collected by the electronic device through the audio input device matches the corresponding feature information of a voice command in the voice feature library, The electronic device performs a corresponding control operation according to the name of the control instruction corresponding to the voice command.
- the voice feature information that is, the voice feature value, is a feature value extracted from the voice information generated when the specific voice input voice command is recognized, and the voice feature values in the voice information generated when different user voices input the same voice command may be different.
- Step 220 Perform, for each voice command in the first to Nth voice commands in the voice command set, prompting the user to input voice information corresponding to the first voice command, and collecting voice information input by the user, Acquiring the voice feature information according to the collected voice information, and updating the voice feature information of the first voice command in the voice feature database according to the obtained voice feature information to correct the first voice command, where the N is smaller than An integer whose M is greater than 0, the first voice command It is any one of the first to Nth voice commands.
- the voice feature information can be obtained by extracting the Mel Frequency Cepstrum Coefficient (MFCC) in the collected voice information:
- MFCC Mel Frequency Cepstrum Coefficient
- N is the number of points of the Fourier transform of the collected speech information
- M is the number of filters of the filter bank used later
- S(m) is the logarithmic energy output by each filter bank.
- Different systems may use different order MFCC coefficients for speech feature extraction and recognition.
- the MFCC coefficient is a feature information parameter that is widely used at present. Extracting the Meir cepstral coefficient in the voice information of the collected set is the way to obtain the speech feature information of the present application, and can also be decomposed by empirical mode decomposition.
- the (EMD) technique or the like extracts speech feature information, and does not limit the use of MFCC coefficients.
- Step 230 Perform, for each of the N+1th to Mth voice commands in the voice command set: before and after the correction of the N first voice commands that have been corrected And determining, by the N deviation values of the information, a manner of correcting the second voice command, and correcting the second voice command according to the determined correction manner, where the second voice command is the (N+1)th Go to any of the M voice commands.
- Each of the N deviation values may obtain the voice feature information by using the MFCC coefficient, and compare with the voice feature information of the corresponding first voice command to obtain a deviation value, where the deviation value may be the obtained voice feature information and
- the ratio of the voice feature information of the corresponding first voice command may also be the difference between the obtained voice feature information and the voice feature information of the corresponding first voice command.
- the method for correcting the voice command is to correct the N by updating the corresponding voice feature information by using the voice feature information of the input voice information from the first to the Nth voice commands of the M voice commands that need to be corrected. a voice command, and correcting the N+1th to Mth voice commands based on the N deviation values of the pre-corrected and corrected voice feature information according to the corrected N first voice commands, thereby completing the M
- the correction of the voice command enables the language feature information of the voice command in the voice feature library to be adapted to the user terminal device after the replacement or the voice feature library is upgraded, and solves the problem that the user terminal device is replaced or the voice feature in the prior art.
- the voice feature information of some or all voice commands of the voice signature database may be deviated from the corresponding voice feature information of the device set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized. .
- this method can be used after the user terminal device is replaced, that is, when a voice feature library is newly installed on a terminal device.
- the method specifically includes the following steps:
- the corrected voice command is a voice command that needs to be corrected each time the voice feature library is received.
- the method includes the following steps:
- the upgrade voice command set may include voice feature information corresponding to the original voice command in the voice feature library, or may be a newly added voice command and corresponding voice feature information. Therefore, the changed voice command may be the original voice command or a newly added voice command.
- the following method is used to determine a manner in which each of the N+1th to Mth voice commands in the voice command set is corrected:
- the change rule is searched according to the N deviation values
- determining to update the voice feature information of the second voice command according to the change rule to correct the second voice command If the change rule is found, determining to update the voice feature information of the second voice command according to the change rule to correct the second voice command; otherwise, determining to perform the second voice command: prompting user input And the voice information corresponding to the second voice command, the voice information input by the user, the voice feature information obtained according to the collected voice information, and the voice feature of the second voice command according to the obtained voice feature information Information to correct the second voice command;
- the voice command set of the voice feature database that needs to be corrected for the voice feature information is re-determined.
- the following method is used to determine the manner in which the first voice feature information of each of the (M-N) voice commands in the voice command set is corrected.
- the first threshold is 5%
- the second threshold is 20%
- 4 is said to be equal to 3.
- the first deviation value is 5.9%
- the second deviation value is 6%
- the third deviation value is 6%. Since the difference between the three deviation values is only 0.1, three There is a variation law of the deviation value, that is, the difference between the first voice feature information and the required voice feature information is 6%, and the system can issue a prompt, such as displaying, by the display unit, "whether the (MN) voice needs to be adjusted according to the deviation value of 6%.
- the first voice feature information of each voice command in the command when the user selects "yes", Then, the first voice feature information of each of the (MN) voice commands is corrected according to a deviation value of 6%.
- the third deviation value is 40%, because two of the three deviation values are greater than 20%, so Determining, in the voice feature library, a set of voice commands that need to be corrected for the first voice feature information, such as all voice commands that are not corrected by the voice feature library are listed as voice commands that need to correct the first voice feature information.
- the above-mentioned voice command correction method 200 corrects the first N voices by replacing the first and last N voice commands of the M voice commands that need to be corrected with the voice feature information of the input voice information by replacing the corresponding voice feature information.
- the following (MN) voice commands select a correction mode by using the original voice feature information of the corrected N voice commands and the deviation value of the input voice feature information, thereby correcting the latter (MN) voice commands. Therefore, the language feature information of the voice command can be adapted to the user terminal device after the replacement or the voice feature library is upgraded, and the part or all of the voice feature database after the user terminal device is replaced or the voice feature library is upgraded in the prior art is solved.
- the voice feature information of the voice command may be deviated from the corresponding voice feature information of the device set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized.
- the present application further provides a voice command correction system 300, as shown in FIG. 3, which is a structural block diagram of a voice command correction system 300 according to a third preferred embodiment of the present application.
- the correction system 300 includes: a determining unit 310, a prompting unit 320, a collecting unit 330, an obtaining unit 340, and a correcting unit 350.
- a determining unit 310 configured to obtain a voice command in the voice feature library that needs to be corrected
- the prompting unit 320 is configured to prompt the user to input voice information corresponding to the voice command
- the collecting unit 330 is configured to collect voice information input by the user
- the obtaining unit 340 is configured to obtain the voice feature information according to the collected voice information
- the correcting unit 350 is configured to update the voice feature information of the voice command to be corrected according to the obtained voice feature information to correct the voice command.
- the determining unit 310 is specifically configured to: receive the voice feature library; according to a specific voice command in the voice feature library preset by the system, or according to a voice command in the voice feature library
- the voice command to be corrected is determined using a frequency, or a voice command selected in the voice feature library according to the user's input information, and the specific voice command is a voice command that needs to be corrected each time the voice feature library is received.
- the determining unit 310 is specifically configured to: receive an updated voice command and voice feature information, and determine the updated voice command as a voice command that needs to be corrected.
- the correcting unit 350 is specifically configured to: replace the obtained voice feature information with the voice feature information of the voice command that needs to be corrected.
- the prompting unit 320 is specifically configured to: generate voice information according to the voice feature information of the voice command and play, or play a machine sound corresponding to the voice command; or display a text corresponding to the voice command Or image information.
- the above-mentioned voice command correction system 300 prompts the user to input the voice information corresponding to the first voice command through the prompting unit 320, and collects the voice information input by the user according to the prompt by the collecting unit 330, and the obtaining unit 340 according to the voice information. Acquiring the voice feature information, and the voice feature information is updated by the correcting unit 350 to the voice feature information of the voice command to be corrected, so that the language feature information of the voice command can be adapted after the user terminal device is replaced or the voice feature library is upgraded.
- the voice feature information of some or all of the voice commands of the voice feature library may be deviated from the corresponding voice feature information of the device set, thereby operating in the voice command.
- FIG. 4 is a structural block diagram of a voice command correction system 400 according to a fourth preferred embodiment of the present application.
- Place The correction system 400 includes an obtaining unit 410, a first correcting unit 420, and a second correcting unit 430.
- the obtaining unit 410 is configured to obtain a voice command set that needs to be corrected in the voice feature database, where the voice command set includes M voice commands, where the M is an integer greater than or equal to 2;
- the first correcting unit 420 is configured to perform, for each voice command, the first to the Nth voice commands in the voice command set: prompting the user to input voice information corresponding to the first voice command, and collecting the user Inputting voice information, obtaining voice feature information according to the collected voice information, updating sound feature information of the first voice command in the voice feature database according to the obtained voice feature information, to correct the first voice command, N is an integer smaller than the M is greater than 0, and the first voice command is any one of the first to Nth voice commands.
- a second correcting unit 430 configured to perform, for each of the N+1th to Mth voice commands in the voice command set: according to the corrected N first voice commands before and after the correction And determining, by the N deviation values of the corrected voice feature information, a manner of correcting the second voice command, and correcting the second voice command according to the determined correction mode, where the second voice command is Any one of the N+1th to Mth voice commands.
- the second correcting unit 430 is specifically configured to:
- the change rule is searched according to the N deviation values
- determining to update the voice feature information of the second voice command according to the change rule to correct the second voice command If the change rule is found, determining to update the voice feature information of the second voice command according to the change rule to correct the second voice command; otherwise, determining to perform the second voice command: prompting user input And the voice information corresponding to the second voice command, the voice information input by the user, the voice feature information obtained according to the collected voice information, and the voice feature of the second voice command according to the obtained voice feature information Information to correct the second voice command.
- the second correcting unit 430 is further configured to: if the N deviation values are less than the first threshold, discard the correction of the second voice command.
- the second correcting unit 430 is further configured to: if the N deviation values are greater than the second threshold, re-determine the voice command set in the voice feature database that needs to be corrected.
- the above-mentioned voice command correction system 400 updates the voice feature information of the input voice information to the corresponding voice feature by using the first to Nth voice commands of the M voice commands to be corrected by the first correction unit 420.
- the information is used to correct the N voice commands, and the second correction unit 430 determines the correction mode of the (N)th to Mth voice commands according to the N deviation values, and corrects the (MN) voice commands.
- the language feature information of the voice command can be adapted to the user terminal device after the replacement or the voice feature library is upgraded, and the part or all of the voice feature database after the user terminal device is replaced or the voice feature library is upgraded in the prior art is solved.
- the voice feature information of the voice command may be deviated from the corresponding voice feature information of the device set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized.
- the present application further provides an electronic device 500.
- FIG. 5 it is a schematic structural diagram of an electronic device 500 according to a fifth preferred embodiment of the present application.
- the electronic device 500 includes a memory 510, a processor 520, an output device 530, and an input device 540.
- the memory 510 is configured to store a voice feature library, where the voice feature library includes a plurality of voice commands.
- the memory 510 is a memory device in the terminal system for storing programs and data.
- the memory 510 can also be an external memory such as a magnetic medium or an optical disk, or an internal memory.
- the processor 520 is configured to obtain a voice command that needs to be corrected in the voice feature database, obtain voice feature information according to the collected voice information, and update voice feature information of the voice command that needs to be corrected according to the voice feature information, to Correct the voice command.
- the output device 530 is configured to prompt the user to input voice information corresponding to the voice command.
- the output device 530 expresses data in the form of numbers, characters, images, sounds, etc., such as a display, a voice output system such as a speaker, and the like.
- the input device 540 is configured to collect voice information input by the user.
- the input device 540 can be a device such as a microphone or a microphone.
- the processor 520 when the electronic device 500 receives the voice feature database, the processor 520 is specifically configured to: according to a specific voice command in the voice feature library preset by the system, or according to The frequency of use of the voice command in the voice feature library, or according to the user's The voice command selected by the input information in the voice feature library determines a voice command to be corrected, and the specific voice command is a voice command that needs to be corrected each time the voice feature library is received.
- the processor 520 is further configured to: receive an updated voice command and voice feature information thereof, and determine the updated voice command as a voice command that needs to be corrected.
- the processor 520 is specifically configured to: replace the obtained voice feature information with the voice feature information of the voice command that needs to be corrected.
- the output device 530 is specifically a voice player or a display screen; the voice player is configured to generate voice information according to the voice feature information of the voice command, and play, or play the machine corresponding to the voice command.
- the display screen is used to display text or image information corresponding to the voice command.
- the electronic device 500 is configured to prompt the user to input voice information corresponding to the voice command through the output device 530, and receive voice information input by the user according to the prompt by the input device 540, and acquire the voice feature information according to the voice information by the processor 520.
- the voice feature information is updated to the voice feature information that needs to be corrected, so that the language feature information of the voice command can be adapted to the user terminal device after the replacement or the voice feature database is upgraded, and the prior art is replaced after the user terminal device is replaced.
- the voice signature database is upgraded, the voice feature information of some or all of the voice commands of the voice feature library may be deviated from the corresponding voice feature information of the device set, so that the voice information may be inaccurately recognized or the input voice information may not be recognized during the voice command operation.
- FIG. 6 is a schematic structural diagram of an electronic device 600 according to a sixth preferred embodiment of the present application.
- the electronic device 600 includes: a memory 610, a processor 620, an output device 630, and an input device 640.
- the memory 610 is configured to store a voice feature library, where the voice feature library includes a plurality of voice commands.
- the memory 610 is a memory device in the terminal system for storing programs and data.
- the memory 610 can also be an external memory such as a magnetic medium or an optical disk, or an internal memory.
- the processor 620 is configured to obtain a voice command set that needs to be corrected in the voice feature database, and execute each voice command in the first to Nth voice commands in the voice command set: according to the collected words Acquiring information to obtain voice feature information, updating tone feature information of the first voice command in the voice feature library according to the obtained voice feature information to correct the first voice command; and N+ in the voice command set Performing each of the 1 to M voice commands: determining the second voice based on the N deviation values of the corrected and corrected voice feature information of the N first voice commands that have been corrected The command is used to perform the correction, and the second voice command is corrected according to the determined correction mode, where M is an integer greater than or equal to 2, and the N is an integer smaller than the M greater than 0, the first voice The command is any one of the first to Nth voice commands, and the second voice command is any one of the N+1th to Mth voice commands;
- the output device 630 is configured to prompt the user to input voice information corresponding to the first voice command.
- the output device 630 expresses data in the form of numbers, characters, images, sounds, etc., such as a display, a voice output system such as a speaker, and the like.
- the input device 640 is configured to collect voice information input by the user.
- the input device 640 can be a device such as a microphone or a microphone.
- the processor 620 is specifically configured to: if the N deviation values are between the first threshold and the second threshold, and the first threshold is less than the second threshold, Finding a change rule according to the N deviation values;
- determining to update the first voice feature information of the second voice command according to the change rule to correct the second voice command If the change rule is found, determining to update the first voice feature information of the second voice command according to the change rule to correct the second voice command; otherwise, determining to perform the second voice command: prompting
- the user inputs voice information corresponding to the second voice command, collects voice information input by the user, obtains voice feature information according to the collected voice information, and updates the second voice command according to the obtained voice feature information. Voice feature information to correct the second voice command.
- the processor 620 is further configured to: if the N deviation values are less than the first threshold, discard the correction of the second voice command.
- the processor 620 is further configured to: if the N deviation values are greater than the second threshold, re-determine the voice command set in the voice feature database that needs to be corrected.
- the electronic device 600 is configured to prompt the user to input voice information corresponding to the first voice command through the output device 630, and collect voice information input by the user according to the prompt by the input device 640, and extract the voice in the voice information by using the processor 620.
- Feature information updating the extracted voice feature information to the voice feature information of the first voice command, thereby completing correction of the first to Nth voice commands of the M voice commands, and then passing the processor 620 And selecting, according to the N first deviation commands that have been corrected according to the N deviation values of the pre-corrected and corrected speech feature information, a manner of correcting the (MN) voice commands in the M voice commands, Correcting the (MN) voice commands according to the correction manner, so that the language feature information of the voice command can be adapted to the user terminal device after the replacement or the voice feature library is upgraded, and the prior art in the user terminal is solved.
- the voice feature information and settings of some or all voice commands of the voice signature database are set. There is a deviation in the corresponding voice feature information of the backup set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- User Interface Of Digital Computer (AREA)
- Telephonic Communication Services (AREA)
Abstract
Description
一种语音命令的校正方法、 校正系统和电子设备 Method for correcting voice command, correction system and electronic device
本申请要求在 2013 年 12 月 23 日提交中国专利局、 申请号为 201310719786.8、 发明名称为 "一种语音命令的校正方法、 校正系统和电子设 备" 的中国专利申请的优先权, 其全部内容通过引用结合在本申请中。 This application claims priority to Chinese Patent Application No. 201310719786.8, entitled "A Voice Command Correction Method, Calibration System, and Electronic Equipment", filed on December 23, 2013, the entire contents of which are hereby incorporated by reference. The citations are incorporated herein by reference.
技术领域 Technical field
本发明涉及电子技术领域, 尤其涉及一种语音命令的校正方法、 校正系 统和电子设备。 背景技术 The present invention relates to the field of electronic technologies, and in particular, to a method for correcting voice commands, a correction system, and an electronic device. Background technique
语音命令操作方式非常普遍的一种人机交互方式, 也是一种非常自然的 人人机交互方式。 随着近年来用户终端设备(如智能手机、 平板电脑、 iPad 等) 的处理能力越来越强大, 在用户终端设备上应用语音命令操作成为一种 趋势。 A human-computer interaction method in which voice command operation is very common is also a very natural way of human-computer interaction. With the increasing processing power of user terminal devices (such as smart phones, tablets, iPads, etc.) in recent years, the application of voice command operations on user terminal devices has become a trend.
电子设备在接收到输入的语音信息时, 根据输入的语音信息提取语音特 征信息, 然后在语音特征库中查找与所述提取的语音特征信息对应的语音命 令, 执行所述语音命令, 从而完成语音命令操作。 When receiving the input voice information, the electronic device extracts the voice feature information according to the input voice information, and then searches for a voice command corresponding to the extracted voice feature information in the voice feature database, and executes the voice command to complete the voice. Command operation.
因为, 语音特征库是区分不同用户、 区分不同命令的关键数据, 是用户 "专属" 的个性化数据, 同时也受到用户终端设备的硬件条件、 使用环境等 多个因素的影响。 因此, 在用户终端设备更换后或者语音特征库升级后, 语 音特征库的部分或者全部语音命令的语音特征信息与设备釆集的对应语音特 征信息会存在偏差, 从而在语音命令操作时, 会存在识别不准确或者不能识 别输入的语音信息的情况。 发明内容 Because the voice feature library is the key data that distinguishes different users and distinguishes different commands, it is the personalized data of the user's "exclusive", and is also affected by many factors such as hardware conditions and usage environment of the user terminal equipment. Therefore, after the user terminal device is replaced or the voice feature library is upgraded, the voice feature information of some or all voice commands of the voice feature library may be deviated from the corresponding voice feature information of the device set, so that when the voice command is operated, there will be The case of identifying inaccurate or incapable of recognizing the input voice information. Summary of the invention
本申请提供一种语音命令的校正方法、 校正系统和电子设备, 解决现有 了技术中在用户终端设备更换后或者语音特征库升级后, 语音特征库的部分 或者全部语音命令的语音特征信息与设备釆集的对应语音特征信息会存在偏 差, 从而在语音命令操作时存在识别不准确或者不能识别输入的语音信息的 技术问题。 The present invention provides a voice command correction method, a correction system, and an electronic device, which solves the voice feature information of some or all voice commands of the voice feature library after the user terminal device is replaced or the voice feature library is upgraded in the prior art. The corresponding voice feature information of the device set may be deviated, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized.
本发明实施例第一方面提供了一种语音命令的校正方法, 所述方法包括: 获得语音特征库中需要校正的语音命令; 提示用户输入与所述语音命令对应 的语音信息; 釆集所述用户输入的语音信息; 根据釆集到的语音信息获得语 音特征信息; 根据获得的语音特征信息更新所述需要校正的语音命令的语音 特征信息, 以校正所述语音命令。 A first aspect of the embodiments of the present invention provides a method for correcting a voice command, where the method includes: obtaining a voice command that needs to be corrected in a voice feature database; prompting a user to input voice information corresponding to the voice command; The voice information input by the user; obtaining the voice feature information according to the collected voice information; updating the voice feature information of the voice command to be corrected according to the obtained voice feature information, to correct the voice command.
在第一方面第一种可能的实现方式中, 所述获得语音特征库中需要校正 的语音命令, 具体包括: In a first possible implementation manner of the first aspect, the obtaining the voice command that needs to be corrected in the voice feature database includes:
接收所述语音特征库; Receiving the voice feature library;
根据系统预先设定的所述语音特征库中的特定语音命令, 或者根据所述 语音特征库中语音命令的使用频率, 或者按照用户的输入信息在所述语音特 征库中选择的语音命令确定需要校正的语音命令, 所述特定语音命令为每次 接收语音特征库后需要校正的语音命令。 Determining a need according to a specific voice command in the voice feature library preset by the system, or according to a frequency of use of the voice command in the voice feature library, or a voice command selected in the voice feature library according to input information of the user The corrected voice command is a voice command that needs to be corrected each time the voice feature library is received.
结合第一方面, 在第一方面第二种可能的实现方式中, 所述获得语音特 征库中需要校正的语音命令, 具体包括: With reference to the first aspect, in a second possible implementation manner of the first aspect, the obtaining a voice command that needs to be corrected in the voice feature library includes:
接收更新的语音命令及其语音特征信息, 将所述更新的语音命令确定为 需要校正的语音命令。 The updated voice command and its voice feature information are received, and the updated voice command is determined to be a voice command that needs to be corrected.
结合第一方面、 第一方面的第一或第二种可能的实现方式, 在第一方面 第三种可能的实现方式中, 所述根据获得的语音特征信息更新所述需要校正 的语音命令的语音特征信息, 具体为: 将获得的语音特征信息替换所述需要 校正的语音命令的语音特征信息。 With reference to the first aspect, the first or the second possible implementation manner of the first aspect, in a third possible implementation manner of the first aspect, the updating the voice command to be corrected according to the obtained voice feature information The voice feature information is specifically: replacing the obtained voice feature information with the voice feature information of the voice command that needs to be corrected.
结合第一方面第一到第三种任一可能的实现方式, 在第一方面第四种可 能的实现方式中, 所述提示用户输入与所述语音命令对应的语音信息, 具体 为: In combination with any of the first to third possible implementations of the first aspect, the fourth aspect is In an implementation manner, the prompting user inputs voice information corresponding to the voice command, specifically:
根据所述语音命令的语音特征信息生成语音信息并播放; 或者 Generating voice information according to voice feature information of the voice command and playing; or
播放所述语音命令对应的机器音; 或者 Playing the machine sound corresponding to the voice command; or
显示所述语音命令对应的文字或图像信息。 Display text or image information corresponding to the voice command.
本发明实施例第二方面提供了一种语音命令的校正方法, 所述方法包括: 获得语音特征库中需要校正的语音命令集合, 所述语音命令集合包括 M 个语音命令, 所述 M为大于等于 2的整数; A second aspect of the embodiments of the present invention provides a method for correcting a voice command, where the method includes: obtaining a voice command set that needs to be corrected in a voice feature database, where the voice command set includes M voice commands, and the M is greater than An integer equal to 2;
对语音命令集合中的第一个到第 N个语音命令中每个语音命令执行: 提 示用户输入与所述第一语音命令对应的语音信息、 釆集所述用户输入的语音 信息、 根据釆集到的语音信息获得语音特征信息、 根据获得的语音特征信息 更新更新所述语音特征库中所述第一语音命令的语音特征信息以校正所述第 一语音命令, 所述 N为小于所述 M大于 0的整数, 所述第一语音命令为所述 第一个到第 N个语音命令中的任意一个语音命令; Performing on each of the first to Nth voice commands in the voice command set: prompting the user to input voice information corresponding to the first voice command, collecting voice information input by the user, according to the set Obtaining voice feature information, updating and updating voice feature information of the first voice command in the voice feature database according to the obtained voice feature information to correct the first voice command, where N is smaller than the M An integer greater than 0, the first voice command being any one of the first to Nth voice commands;
对所述语音命令集合中的第 N+1个到第 M个语音命令中的每个语音命令 执行: 根据已校正完成的 N个第一语音命令在校正前和校正后的语音特征信 息的 N个偏差值, 确定对所述第二语音命令进行校正的方式, 根据确定出的 校正方式对所述第二语音命令进行校正, 所述第二语音命令为所述第 N+1个 到第 M个语音命令中的任意一个语音命令。 Performing on each of the N+1th to Mth voice commands in the voice command set: N of the pre-corrected and corrected voice feature information according to the corrected N first voice commands a deviation value, determining a manner of correcting the second voice command, correcting the second voice command according to the determined correction mode, where the second voice command is the (N+1)th to the Mth Any of the voice commands.
在第二方面第一种可能的实现方式中, 所述根据已校正完成的 N个第一 语音命令在校正前和校正后的语音特征信息的 N个偏差值, 确定对所述第二 语音命令进行校正的方式, 包括: In a first possible implementation manner of the second aspect, the determining, according to the N deviation values of the pre-corrected and corrected speech feature information, the N first voice commands that have been corrected are determined to be the second voice command. The way to correct, including:
若所述 N个偏差值在第一阔值与第二阔值之间, 所述第一阔值小于所述 第二阔值, 则根据所述 N个偏差值查找变化规律; If the N deviation values are between the first threshold and the second threshold, and the first threshold is less than the second threshold, the change rule is searched according to the N deviation values;
若查找到变化规律, 则确定根据所述变化规律对所述第二语音命令的语 音特征信息进行更新以校正所述第二语音命令; 否则, 确定对所述第二语音 命令执行: 提示用户输入与所述第二语音命令对应的语音信息、 釆集所述用 户输入的语音信息、 根据釆集到的语音信息获得语音特征信息、 根据获得的 语音特征信息更新所述第二语音命令的语音特征信息, 以校正所述第二语音 命令。 If the change rule is found, determining to update the voice feature information of the second voice command according to the change rule to correct the second voice command; otherwise, determining to perform the second voice command: prompting user input The voice information corresponding to the second voice command is used for the voice information The voice information input by the user obtains the voice feature information according to the collected voice information, and updates the voice feature information of the second voice command according to the obtained voice feature information to correct the second voice command.
结合第二方面第一种可能实现的方式, 在第二方面第二种可能的实现方 式中, 所述根据已校正完成的 N个第一语音命令在校正前和校正后的语音特 征信息的 N个偏差值, 确定对所述第二语音命令进行校正的方式, 还包括: 若所述 N个偏差值小于所述第一阔值, 则放弃对所述第二语音命令的校正。 With reference to the first possible implementation manner of the second aspect, in a second possible implementation manner of the second aspect, the N, before and after the correction, based on the N first voice commands that have been corrected Determining the manner of correcting the second voice command further includes: if the N deviation values are less than the first threshold, discarding the correction of the second voice command.
结合第二方面第一种可能实现的方式, 在第二方面第三种可能实现的方 式中, 所述根据已校正完成的 N个第一语音命令在校正前和校正后的语音特 征信息的 N个偏差值, 确定对所述第二语音命令进行校正的方式, 还包括: 若所述 N个偏差值大于所述第二阔值, 则重新确定所述语音特征库中需要校 正的语音命令集合。 With reference to the first possible implementation manner of the second aspect, in a third possible implementation manner of the second aspect, the N of the pre-corrected and corrected voice feature information according to the N first voice commands that have been corrected And determining, by the deviation value, the manner of correcting the second voice command, further comprising: if the N deviation values are greater than the second threshold, re-determining the voice command set that needs to be corrected in the voice feature database .
本发明实施例第三方面提供了一种语音命令的校正系统, 所述系统包括: 确定单元, 用于获得语音特征库中需要校正的语音命令; 提示单元, 用于提 示用户输入与所述语音命令对应的语音信息; 釆集单元, 用于釆集所述用户 输入的语音信息; 获得单元, 用于根据釆集到的语音信息获得语音特征信息; 校正单元, 用于根据获得的语音特征信息更新所述需要校正的语音命令的语 音特征信息, 以校正所述语音命令。 A third aspect of the embodiments of the present invention provides a voice command correction system, where the system includes: a determining unit, configured to obtain a voice command in a voice feature library that needs to be corrected; and a prompting unit, configured to prompt a user to input the voice a voice information corresponding to the command; a set unit for collecting voice information input by the user; an obtaining unit, configured to obtain voice feature information according to the collected voice information; and a correcting unit, configured to use the obtained voice feature information The voice feature information of the voice command to be corrected is updated to correct the voice command.
在第三方面第一种可能的实现方式中, 所述确定单元具体用于: 接收所 述语音特征库; 根据系统预先设定的所述语音特征库中的特定语音命令, 或 者根据所述语音特征库中语音命令的使用频率, 或者按照用户的输入信息在 所述语音特征库中选择的语音命令确定需要校正的语音命令, 所述特定语音 命令为每次接收语音特征库后需要校正的语音命令。 In a first possible implementation manner of the third aspect, the determining unit is specifically configured to: receive the voice feature database; according to a specific voice command in the voice feature library preset by the system, or according to the voice The frequency of use of the voice command in the feature library, or the voice command selected in the voice feature library according to the input information of the user determines a voice command to be corrected, and the specific voice command is a voice that needs to be corrected after receiving the voice feature library each time. command.
结合第三方面, 在第三方面第二种可能的实现方式中, 所述确定单元具 体用于: With reference to the third aspect, in a second possible implementation manner of the third aspect, the determining unit is specifically configured to:
接收更新的语音命令及其语音特征信息, 将所述更新的语音命令确定为 需要校正的语音命令。 结合第三方面、 第三方面的第一种或第二种可能实现的方式, 在第三方 面第三种可能实现的方式中, 所述校正单元具体用于: 将获得的语音特征信 息替换所述需要校正的语音命令的语音特征信息。 Receiving the updated voice command and its voice feature information, and determining the updated voice command as a voice command that needs to be corrected. With reference to the third aspect, the first or the second possible implementation manner of the third aspect, in the third possible implementation manner of the third aspect, the correcting unit is specifically configured to: replace the obtained voice feature information with the The voice feature information of the voice command that needs to be corrected.
结合第三方面、 第三方面的第一种、 第二种或第三种可能实现的方式, 在第三方面第四种可能实现的方式中, 所述提示单元具体用于: 根据所述语 音命令的语音特征信息生成语音信息并播放, 或者播放所述语音命令对应的 机器音; 或者显示所述语音命令对应的文字或图像信息。 With reference to the third aspect, the first, the second or the third possible implementation manner of the third aspect, in a fourth possible implementation manner of the third aspect, the prompting unit is specifically configured to: according to the voice The voice feature information of the command generates voice information and plays, or plays a machine sound corresponding to the voice command; or displays text or image information corresponding to the voice command.
本发明实施例第四方面提供了一种语音命令的校正系统, 所述系统包括: 获得单元, 用于获得语音特征库中需要校正的语音命令集合, 所述语音 命令集合包括 M个语音命令, 所述 M为大于等于 2的整数; A fourth aspect of the embodiments of the present invention provides a voice command correction system, where the system includes: an obtaining unit, configured to obtain a voice command set that needs to be corrected in a voice feature database, where the voice command set includes M voice commands. The M is an integer greater than or equal to 2;
第一校正单元, 用于对语音命令集合中的第一个到第 N个语音命令中每 个语音命令执行: 提示用户输入与所述第一语音命令对应的语音信息、 釆集 所述用户输入的语音信息、 根据釆集到的语音信息获得语音特征信息、 根据 获得的语音特征信息更新所述语音特征库中所述第一语音命令的音特征信息 以校正所述第一语音命令, 所述 N为小于所述 M大于 0的整数, 所述第一语 音命令为所述第一个到第 N个语音命令中的任意一个语音命令; a first correcting unit, configured to perform, for each of the first to Nth voice commands in the voice command set: prompting the user to input voice information corresponding to the first voice command, and collecting the user input And the voice information is obtained according to the collected voice information, and the sound feature information of the first voice command in the voice feature database is updated according to the obtained voice feature information to correct the first voice command, N is an integer smaller than the M is greater than 0, and the first voice command is any one of the first to Nth voice commands;
第二校正单元,用于对所述语音命令集合中的第 N+1个到第 M个语音命 令中的每个语音命令执行: 根据已校正完成的 N个第一语音命令在校正前和 校正后的语音特征信息的 N个偏差值, 确定对所述第二语音命令进行校正的 方式, 根据确定出的校正方式对所述第二语音命令进行校正, 所述第二语音 命令为所述第 N+1个到第 M个语音命令中的任意一个语音命令。 a second correcting unit, configured to perform, for each of the N+1th to Mth voice commands in the voice command set: before and after correction according to the N first voice commands that have been corrected And determining, by the N deviation values of the following voice feature information, a manner of correcting the second voice command, and correcting the second voice command according to the determined correction mode, where the second voice command is the N+1 to any one of the M voice commands.
结合第四方面, 在第四方面第一种可能的实现方式中, 所述第二校正单 元具体用于: With reference to the fourth aspect, in a first possible implementation manner of the fourth aspect, the second calibration unit is specifically configured to:
若所述 N个偏差值在第一阔值与第二阔值之间, 所述第一阔值小于所述 第二阔值, 则根据所述 N个偏差值查找变化规律; If the N deviation values are between the first threshold and the second threshold, and the first threshold is less than the second threshold, the change rule is searched according to the N deviation values;
若查找到变化规律, 则确定根据所述变化规律对所述第二语音命令的语 音特征信息进行更新以校正所述第二语音命令; 否则, 确定对所述第二语音 命令执行: 提示用户输入与所述第二语音命令对应的语音信息、 釆集所述用 户输入的语音信息、 根据釆集到的语音信息获得语音特征信息、 根据获得的 语音特征信息更新所述第二语音命令的语音特征信息, 以校正所述第二语音 命令。 If the change rule is found, determining to update the voice feature information of the second voice command according to the change rule to correct the second voice command; otherwise, determining the second voice Command execution: prompting a user to input voice information corresponding to the second voice command, collecting voice information input by the user, obtaining voice feature information according to the collected voice information, and updating the voice information according to the obtained voice feature information Two voice command information of the voice command to correct the second voice command.
结合第四方面的第一种可能的实现方式, 在第四方面第二种可能的实现 方式中, 述第二校正单元还用于: 若所述 N个偏差值小于所述第一阔值, 则 放弃对所述第二语音命令的校正。 With reference to the first possible implementation manner of the fourth aspect, in a second possible implementation manner of the fourth aspect, the second correcting unit is further configured to: if the N deviation values are smaller than the first threshold, The correction of the second voice command is then abandoned.
结合第四方面的第一种可能的实现方式, 在第四方面第三种可能的实现 方式中, 所述确定单元还用于: 若所述 N个偏差值大于所述第二阔值, 则重 新确定所述语音特征库中需要校正的语音命令集合。 With the first possible implementation of the fourth aspect, in a third possible implementation manner of the fourth aspect, the determining unit is further configured to: if the N deviation values are greater than the second threshold, Re-determining the set of voice commands in the speech feature library that need to be corrected.
本发明实施例第五方面提供了一种电子设备, 所述电子设备包括: 存储 器, 用于存储语音特征库, 所述语音特征库包括多个语音命令; 处理器, 用 于获得语音特征库中需要校正的语音命令, 根据釆集到的语音信息获得语音 特征信息, 并根据所述获得的语音特征信息更新所述需要校正的语音命令的 语音特征信息, 以校正所述语音命令; 输出装置, 用于提示用户输入与所述 语音命令对应的语音信息; 输入装置, 用于釆集所述用户输入的语音信息。 A fifth aspect of the embodiments of the present invention provides an electronic device, where the electronic device includes: a memory, configured to store a voice feature database, the voice feature library includes a plurality of voice commands, and a processor, configured to obtain a voice feature library. a voice command that needs to be corrected, obtains voice feature information according to the collected voice information, and updates voice feature information of the voice command to be corrected according to the obtained voice feature information to correct the voice command; And a method for prompting a user to input voice information corresponding to the voice command; and inputting means, configured to collect voice information input by the user.
在第五方面第一种可能的实现方式中, 所述电子设备接收到所述语音特 征库时, 所述处理器具体用于: 根据系统预先设定的所述语音特征库中的特 定语音命令, 或者根据所述语音特征库中语音命令的使用频率, 或者按照用 户的输入信息在所述语音特征库中选择的语音命令确定需要校正的语音命 令, 所述特定语音命令为每次接收语音特征库后需要校正的语音命令。 In a first possible implementation manner of the fifth aspect, when the electronic device receives the voice feature database, the processor is specifically configured to: according to a specific voice command in the voice feature library preset by the system Or determining a voice command to be corrected according to a frequency of use of the voice command in the voice feature library, or a voice command selected in the voice feature library according to input information of the user, where the specific voice command is a voice feature received each time. Voice commands that need to be corrected after the library.
结合第五方面, 在第五方面的第二种可能的实现方式中, 所述电子设备 在接收到第二语音特征库时, 所述处理器还用于: 接收更新的语音命令及其 语音特征信息, 并将所述更新的语音命令确定为需要校正的语音命令。 With reference to the fifth aspect, in a second possible implementation manner of the fifth aspect, when the electronic device receives the second voice feature database, the processor is further configured to: receive the updated voice command and the voice feature thereof Information, and determining the updated voice command as a voice command that needs to be corrected.
结合第五方面、 第五方面的第一种或第二种可能的实现方式中, 在第五 方面的第三种可能的实现方式中, 所述处理器具体用于: 将获得的语音特征 信息替换所述需要校正的语音命令的语音特征信息。 结合第五方面、 第五方面的第一种、 第二种或第三种可能的实现方式中, 在第五方面的第四种可能的实现方式中, 所述输出装置具体为语音播放器或 者显示屏; 所述语音播放器用于根据所述语音命令的语音特征信息生成语音 信息并播放, 或播放所述语音命令对应的机器音; 所述显示屏用于显示所述 语音命令对应的文字或图像信息。 With reference to the fifth aspect, the first or the second possible implementation manner of the fifth aspect, in a third possible implementation manner of the fifth aspect, the processor is specifically configured to: obtain the obtained voice feature information The voice feature information of the voice command to be corrected is replaced. With reference to the fifth aspect, the first, the second or the third possible implementation manner of the fifth aspect, in a fourth possible implementation manner of the fifth aspect, the output device is specifically a voice player or a display screen; the voice player is configured to generate voice information according to the voice feature information of the voice command and play, or play the machine sound corresponding to the voice command; the display screen is used to display the text corresponding to the voice command or Image information.
本发明实施例第六方面提供了一种电子设备, 所述电子设备包括: 存储器, 用于存储语音特征库, 所述语音特征库包括多个语音命令; 处理器, 用于获得所述语音特征库中需要校正的语音命令集合, 对语音 命令集合中的第一个到第 N个语音命令中每个语音命令执行: 根据釆集到的 语音信息获得语音特征信息、 根据获得的语音特征信息更新所述语音特征库 中所述第一语音命令的语音特征信息以校正所述第一语音命令; 对所述语音 命令集合中的第 N+1个到第 M个语音命令中的每个语音命令执行:根据已校 正完成的 N个第一语音命令在校正前和校正后的语音特征信息的 N个偏差 值, 确定对所述第二语音命令进行校正的方式, 根据确定出的校正方式对所 述第二语音命令进行校正, 所述 M为大于等于 2的整数, 所述 N为小于所述 M大于 0的整数, 所述第一语音命令为所述第一个到第 N个语音命令中的任 意一个语音命令,所述第二语音命令为所述第 N+1个到第 M个语音命令中的 任意一个语音命令; A sixth aspect of the embodiments of the present invention provides an electronic device, where the electronic device includes: a memory, configured to store a voice feature database, the voice feature library includes a plurality of voice commands, and a processor, configured to obtain the voice feature The set of voice commands to be corrected in the library is executed for each of the first to Nth voice commands in the voice command set: the voice feature information is obtained according to the collected voice information, and is updated according to the obtained voice feature information. The voice feature information of the first voice command in the voice feature library to correct the first voice command; each voice command in the N+1th to Mth voice commands in the voice command set Executing: determining, according to the N deviation values of the pre-corrected and corrected speech feature information, the N first deviations of the corrected N first voice commands, and correcting the second voice command according to the determined correction mode Corrected by the second voice command, where M is an integer greater than or equal to 2, and the N is an integer smaller than the M greater than 0, The first voice command is any one of the first to Nth voice commands, and the second voice command is any one of the N+1th to Mth voice commands;
输出装置, 用于提示用户输入与所述第一语音命令对应的语音信息; 输入装置, 用于釆集所述用户输入的语音信息。 And an output device, configured to prompt the user to input voice information corresponding to the first voice command; and input means, configured to collect voice information input by the user.
在第六方面第一种可能的实现方式中, 所述处理器具体用于: In a first possible implementation manner of the sixth aspect, the processor is specifically configured to:
若所述 N个偏差值在第一阔值与第二阔值之间, 所述第一阔值小于所述 第二阔值, 则根据所述 N个偏差值查找变化规律; If the N deviation values are between the first threshold and the second threshold, and the first threshold is less than the second threshold, the change rule is searched according to the N deviation values;
若查找到变化规律, 则确定根据所述变化规律对所述第二语音命令的第 一语音特征信息进行更新以校正所述第二语音命令; 否则, 确定对所述第二 语音命令执行: 提示用户输入与所述第二语音命令对应的语音信息、 釆集所 述用户输入的语音信息、 根据釆集到的语音信息获得语音特征信息、 根据获 得的语音特征信息更新所述第二语音命令的语音特征信息, 以校正所述第二 语音命令。 If the change rule is found, determining to update the first voice feature information of the second voice command according to the change rule to correct the second voice command; otherwise, determining to perform the second voice command: prompting The user inputs voice information corresponding to the second voice command, collects voice information input by the user, obtains voice feature information according to the collected voice information, and obtains voice feature information according to the obtained voice information. The obtained voice feature information updates the voice feature information of the second voice command to correct the second voice command.
结合第六方面第一种可能的实现方式中, 在第六方面第二种可能的实现 方式中, 所述处理器具体还用于: 若所述 N个偏差值小于所述第一阔值, 则 放弃对所述第二语音命令的校正。 With reference to the first possible implementation manner of the sixth aspect, in a second possible implementation manner of the sixth aspect, the processor is further configured to: if the N deviation values are smaller than the first threshold, The correction of the second voice command is then abandoned.
结合第六方面第一种可能的实现方式中, 在第六方面第三种可能的实现 方式中, 所述处理器具体还用于: 若所述 N个偏差值大于所述第二阔值, 则 重新确定所述语音特征库中需要校正的语音命令集合。 With reference to the first possible implementation manner of the sixth aspect, in a third possible implementation manner of the sixth aspect, the processor is further configured to: if the N deviation values are greater than the second threshold, Then, the voice command set that needs to be corrected in the voice feature library is re-determined.
本申请有益效果如下: The beneficial effects of the application are as follows:
上述语音命令的校正方法、 校正系统和电子设备, 通过提示用户输入与 所述语音命令对应的语音信息, 接收用户根据提示输入的语音信息, 根据该 语音信息获取语音特征信息, 将该语音特征信息更新所述需要校正的语音命 令的原语音特征信息; 或者, 在所述需要校正的语音命令为 M个时, 所述 M 个需要校正的语音命令的中前 N个语音命令釆用输入的语音信息的语音特征 信息替换对应的语音特征信息以校正所述前 N个语音命令, 所述后 (M-N ) 个语音命令通过已校正的前 N个语音命令的原语音特征信息和输入的语音特 征信息的偏差值选择校正方式, 从而对所述后 (M-N )个语音命令进行校正, 从而使得语音命令的语言特征信息能够适应于用户终端设备更换后或者语音 特征库升级后, 解决了现有技术中在用户终端设备更换后或者语音特征库升 级后, 语音特征库的部分或者全部语音命令的语音特征信息与设备釆集的对 应语音特征信息会存在偏差, 从而在语音命令操作时存在识别不准确或者不 能识别输入的语音信息的技术问题。 附图说明 The method for correcting the voice command, the correction system, and the electronic device, by prompting the user to input voice information corresponding to the voice command, receiving voice information input by the user according to the prompt, acquiring voice feature information according to the voice information, and using the voice feature information Updating the original voice feature information of the voice command to be corrected; or, when the number of voice commands to be corrected is M, the first N voice commands of the M voice commands to be corrected use the input voice The voice feature information of the information replaces the corresponding voice feature information to correct the first N voice commands, and the back (MN) voice commands pass the original voice feature information of the corrected first N voice commands and the input voice feature information. The deviation value is selected to be corrected, so that the latter (MN) voice commands are corrected, so that the language feature information of the voice command can be adapted to the user terminal device after replacement or the voice feature library is upgraded, and the prior art is solved. After the user terminal device is replaced or the voice signature database is upgraded After that, the voice feature information of some or all voice commands of the voice feature library may be deviated from the corresponding voice feature information of the device set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized. DRAWINGS
图 1为本申请第一较佳实施方式语音命令的校正方法的方法流程图; 图 2为本申请第二较佳实施方式语音命令的校正方法的方法流程图; 图 3为本申请第三较佳实施方式语音命令的校正系统的结构框图; 图 4为本申请第四较佳实施方式语音命令的校正系统的结构框图; 图 5为本申请第五较佳实施方式电子设备的结构示意图; 1 is a flowchart of a method for correcting a voice command according to a first preferred embodiment of the present invention; FIG. 2 is a flowchart of a method for correcting a voice command according to a second preferred embodiment of the present application; A structural block diagram of a correction system for a voice command of a preferred embodiment; 4 is a structural block diagram of a system for correcting a voice command according to a fourth preferred embodiment of the present invention; FIG. 5 is a schematic structural diagram of an electronic device according to a fifth preferred embodiment of the present application;
图 6为本申请第六较佳实施方式电子设备的结构示意图。 具体实施方式 6 is a schematic structural diagram of an electronic device according to a sixth preferred embodiment of the present application. detailed description
本申请实施例通过提供一种语音命令的校正方法、 校正系统和电子设备 , 解决了现有技术中在用户终端设备更换后或者语音特征库升级后, 语音特征 库的部分或者全部语音命令的语音特征信息与设备釆集的对应语音特征信息 会存在偏差, 从而在语音命令操作时存在识别不准确或者不能识别输入的语 音信息的技术问题。 The embodiment of the present application solves the voice of some or all voice commands of the voice feature library after the user terminal device is replaced or the voice feature database is upgraded in the prior art by providing a voice command correction method, a correction system, and an electronic device. There is a deviation between the feature information and the corresponding voice feature information of the device set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized.
本申请实施例中的技术方案为解决上述问题, 总体思路如下: The technical solution in the embodiment of the present application is to solve the above problem, and the general idea is as follows:
对所述语音特征库中的部分或者全部语音命令执行: 提示用户输入与所 述语音命令对应的语音信息; 釆集所述用户输入的语音信息; 根据釆集到的 语音信息获得语音特征信息; 根据获得的语音特征信息更新所述需要校正的 语音命令的语音特征信息, 以校正所述语音命令。 Executing part or all of the voice commands in the voice feature library: prompting the user to input voice information corresponding to the voice command; collecting voice information input by the user; and obtaining voice feature information according to the collected voice information; And updating the voice feature information of the voice command to be corrected according to the obtained voice feature information to correct the voice command.
通过上述技术方案, 从而对所述语音命令进行校正, 使得语音命令的语 言特征信息能够适应于用户终端设备更换后或者语音特征库升级后, 解决了 现有技术中在用户终端设备更换后或者语音特征库升级后, 语音特征库的部 分或者全部语音命令的语音特征信息与设备釆集的对应语音特征信息会存在 偏差, 从而在语音命令操作时存在识别不准确或者不能识别输入的语音信息 的技术问题。 Through the foregoing technical solution, the voice command is corrected, so that the language feature information of the voice command can be adapted to the user terminal device after the replacement or the voice feature database is upgraded, and the user equipment device is replaced or voiced in the prior art. After the feature library is upgraded, the voice feature information of some or all of the voice commands of the voice feature library may be deviated from the corresponding voice feature information of the device set, so that there is a technique of identifying the inaccurate or failing to recognize the input voice information during the voice command operation. problem.
为了更好的理解上述技术方案, 下面将结合说明书附图以及具体的实施 方式对上述技术方案进行详细的说明。 In order to better understand the above technical solutions, the above technical solutions will be described in detail below in conjunction with the drawings and specific embodiments.
实施例一 Embodiment 1
如图 1 所示, 为本申请第一较佳实施方式语音命令的校正方法的方法流 程图。 所述语音命令的校正方法包括以下步骤: As shown in FIG. 1, a flow chart of a method for correcting a voice command according to a first preferred embodiment of the present application is shown. The method for correcting the voice command includes the following steps:
步骤 110, 获得语音特征库中需要校正的语音命令。 具体地, 语音特征库通常包括多个语音命令、 与所述多个语音命令对应 的语音特征信息, 进一步的还可包括与所述多个命令对应的文字信息。 语音 特征库中的语音命令通常为控制指令名称, 当电子设备通过音频输入装置釆 集到的语音信息的特征值与语音特征库中的某个语音命令的对应的特征信息 匹配时, 则所述电子设备会根据所述语音命令对应的控制指令名称执行相应 的控制操作。 语音特征信息, 即语音特征值, 是根据识别特定语音输入语音 命令时产生的语音信息中提取的特征值, 不同的使用者语音输入同一语音命 令时产生的语音信息中的语音特征值可能不同。 Step 110: Obtain a voice command in the voice feature library that needs to be corrected. Specifically, the voice feature library generally includes a plurality of voice commands, voice feature information corresponding to the plurality of voice commands, and further, may further include text information corresponding to the plurality of commands. The voice command in the voice feature library is usually a control instruction name, and when the feature value of the voice information collected by the electronic device through the audio input device matches the corresponding feature information of a voice command in the voice feature library, The electronic device performs a corresponding control operation according to the name of the control instruction corresponding to the voice command. The voice feature information, that is, the voice feature value, is a feature value extracted from the voice information generated when the specific voice input voice command is recognized, and the voice feature values in the voice information generated when different user voices input the same voice command may be different.
步骤 120, 提示用户输入与所述语音命令对应的语音信息。 Step 120: Prompt the user to input voice information corresponding to the voice command.
具体地, 提示的方式可以有多种, 第一种: 通过电子设备的音频输出装 置 (如音响、 耳机等)播放与所述语音命令对应的语音信息, 如根据所述语 音命令的语音特征信息生成语音信息并播放, 或, 直接播放所述语音命令对 应的机器音, 举例说明, 在所述语音命令对应于打开相机控制指令时, 则通 过根据语音特征信息生成语音信息播放或者根据与所述语音命令对应的机器 音播放带有 "打开相机" 的语音信息。 第二种: 显示所述语音命令对应的文 字或图像信息, 如, 在所述语音命令对应于打开相机控制指令时, 则通过显 示屏等显示带有 "打开相机" 字样的文字信息。 Specifically, there may be multiple ways of prompting, the first one: playing the voice information corresponding to the voice command through an audio output device (such as an audio, a headphone, etc.) of the electronic device, such as voice feature information according to the voice command. Generating voice information and playing, or directly playing the machine sound corresponding to the voice command, for example, when the voice command corresponds to opening the camera control command, generating voice information according to the voice feature information or according to The machine sound corresponding to the voice command plays the voice message with "turn on the camera". The second type: displaying text or image information corresponding to the voice command, for example, when the voice command corresponds to opening a camera control instruction, displaying text information with the words "open camera" through a display screen or the like.
步骤 130, 釆集所述用户输入的语音信息。 Step 130: Collect voice information input by the user.
具体的, 可通过电子设备的音频输入装置 (如话筒等)釆集用户根据所 述提示进行复述时的语音。 Specifically, the voice when the user repeats according to the prompt can be collected by an audio input device (such as a microphone or the like) of the electronic device.
以下继续以所述语音命令对应与打开相机控制指令进行说明。 在使用者 收到上述提示后, 语音输入带有 "打开相机" 的语音信息。 通过输入设备如 麦克风等釆集该带有 "打开相机" 的语音信息。 The following description continues with the voice command corresponding and the camera control command being turned on. After the user receives the above prompt, the voice input has a voice message with "turn on the camera". The voice message with "turn on the camera" is collected by an input device such as a microphone.
步骤 140, 根据釆集到的语音信息获得语音特征信息, 即, 根据釆集到的 语音信息, 提取该语音信息的语音特征值。 Step 140: Acquire voice feature information according to the collected voice information, that is, extract voice feature values of the voice information according to the collected voice information.
语音特征信息可以通过提取釆集的语音信息中的美尔倒谱系数 (MFCC, Mel Frequency Cepstrum Coefficient)获得: C(n) = ^(m) cos ~ 10≤n < M The speech feature information can be obtained by extracting the Mel Frequency Cepstrum Coefficient (MFCC) in the collected speech information: C(n) = ^(m) cos ~ 10≤n < M
M ) M )
其中, N为对釆集到的语音信息进行傅立叶变换的点数, M为其后使用 的滤波器组的滤波器个数, S(m)为每个滤波器组输出的对数能量。 不同的系 统可能釆用不同阶数的 MFCC系数进行语音特征的提取和识别。 Where N is the number of points of the Fourier transform of the collected speech information, M is the number of filters of the filter bank used later, and S(m) is the logarithmic energy output by each filter bank. Different systems may use different order MFCC coefficients for speech feature extraction and recognition.
MFCC 系数是目前应用广泛的特征信息参数, 提取釆集的语音信息中的 美尔倒谱系数提取语音特征信息是本申请的一种获取语音特征信息的方式, 还可以通过釆用经验模态分解 (EMD)技术等提取语音特征信息,并不限定使用 MFCC系数。 The MFCC coefficient is a feature information parameter that is widely used at present. Extracting the Meir cepstral coefficient in the voice information of the collected set is the way to obtain the speech feature information of the present application, and can also be decomposed by empirical mode decomposition. The (EMD) technique or the like extracts speech feature information, and does not limit the use of MFCC coefficients.
步骤 150,根据获得的语音特征信息更新所述需要校正的语音命令的语音 特征信息, 以校正所述语音命令。 Step 150: Update voice feature information of the voice command to be corrected according to the obtained voice feature information to correct the voice command.
更新的方式至少有以下两种: 第一, 直接将所述原语音特征信息替换为 获得的语音特征信息, 不再保留所述原语音特征信息, 即, 将获得的语音特 征信息替换所述需要校正的语音命令的语音特征信息; 第二, 根据获得的语 音特征信息与所述需要校正的语音命令的语音特征信息的偏差值, 调整所述 需要校正的语音命令的语音特征信息的参数值, 使得所述语音命令的语音特 征信息能够适用。 There are at least the following two ways to update: First, the original voice feature information is directly replaced with the obtained voice feature information, and the original voice feature information is not retained, that is, the obtained voice feature information is replaced by the need. The voice feature information of the corrected voice command; secondly, adjusting the parameter value of the voice feature information of the voice command to be corrected according to the deviation value of the obtained voice feature information and the voice feature information of the voice command to be corrected, The voice feature information of the voice command is made applicable.
以下通过举例说明该语音命令的校正方法的具体实施过程, 首先在所述 语音特征库中获得一个需要校正的语音命令, 假设该语音命令对应于控制设 备执行打开相机操作的控制指令; 然后通过喇八等输出带有 "打开相机" 的 语音信息, 或者文字信息的提示; 在使用者收到上述提示后, 语音输入带有 "打开相机" 的语音信息, 通过输入设备如麦克风等釆集该带有 "打开相机" 的语音信息; 根据釆集到的语音信息提取语音特征信息; 最后, 釆用该语音 特征信息更新原语音特征信息, 如替换原语音特征信息, 至此, 校正完成。 The following is a specific implementation process of the method for correcting the voice command. First, a voice command to be corrected is obtained in the voice feature library, and the voice command is assumed to correspond to a control instruction of the control device to perform the camera operation; The eighth output has a voice message with "turn on the camera", or a prompt for text message; after the user receives the above prompt, the voice input has a voice message with "turn on the camera", and the band is collected by an input device such as a microphone. The voice information of "turning on the camera" is extracted; the voice feature information is extracted according to the voice information collected; finally, the voice feature information is updated with the voice feature information, for example, the original voice feature information is replaced, and the correction is completed.
上述语音命令的校正方法, 通过提示所述需要校正的语音命令, 接收使 用者输入的对应所述语音命令的语音信息, 根据该语音信息获取语音特征信 息, 将该语音特征信息更新对应的原语音特征信息, 从而使得语音命令的语 言特征信息能够适应于用户终端设备更换后或者语音特征库升级后, 解决了 现有技术中在用户终端设备更换后或者语音特征库升级后, 语音特征库的部 分或者全部语音命令的语音特征信息与设备釆集的对应语音特征信息会存在 偏差, 从而在语音命令操作时存在识别不准确或者不能识别输入的语音信息 的技术问题。 The method for correcting the voice command receives the voice command that needs to be corrected, receives the voice information corresponding to the voice command input by the user, acquires the voice feature information according to the voice information, and updates the voice feature information to the corresponding original voice. Feature information, thereby making the language of the voice command The feature information can be adapted to the voice feature information of some or all voice commands of the voice feature library after the user terminal device is replaced or the voice feature library is upgraded in the prior art after the user terminal device is replaced or the voice feature database is upgraded. There may be deviations from the corresponding voice feature information of the device set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized.
需要校正的语音命令可以为所述语音特征库中的一个命令, 也可以为多 个命令, 在所述语音命令为多个时, 可对所述多个语音命令中的每个语音命 令执行以下步骤 120、 130、 140和 150, 或者, 对所述多个语音命令逐一执行 以下步骤, 即第一个语音命令校正后, 进行第二个语音命令校正, 再进行第 三个语音命令校正, ... ..., 直到校正完成。 The voice command to be corrected may be one of the voice feature library or a plurality of commands. When the voice command is multiple, the following voice commands may be executed for each of the plurality of voice commands. Steps 120, 130, 140, and 150, or performing the following steps one by one for the plurality of voice commands, that is, after the first voice command is corrected, performing the second voice command correction, and then performing the third voice command correction. .. ..., until the calibration is complete.
具体地, 对于步骤 110 中, 如何获取所述获得所述语音特征库中需要校 正语音命令, 有以下两种方式: Specifically, in step 110, how to obtain the voice command in the voice feature library is required to be corrected in the following two manners:
第一种, 在所述用户终端设备更换后, 也就是说在一个终端设备上新安 装一个语音特征库时, 可釆用该方式。 该方式具体包括以下步骤: First, this method can be used after the user terminal device is replaced, that is, when a voice feature library is newly installed on a terminal device. The method specifically includes the following steps:
接收所述语音特征库; Receiving the voice feature library;
根据系统预先设定的所述语音特征库中的特定语音命令, 或者根据所述 语音特征库中语音命令的使用频率, 或者按照用户的输入信息在所述语音特 征库中选择的语音命令确定需要校正的语音命令, 所述特定语音命令为每次 接收语音特征库后需要校正的语音命令。 Determining a need according to a specific voice command in the voice feature library preset by the system, or according to a frequency of use of the voice command in the voice feature library, or a voice command selected in the voice feature library according to input information of the user The corrected voice command is a voice command that needs to be corrected each time the voice feature library is received.
第二种, 在对所述语音特征库进行数据升级时, 可以釆用该方式, 该方 式具体为: Secondly, when the data is upgraded to the voice feature database, the method can be used. The method is specifically:
接收更新的语音命令及其语音特征信息, 将所述更新的语音命令确定为 需要校正的语音命令。 The updated voice command and its voice feature information are received, and the updated voice command is determined to be a voice command that needs to be corrected.
升级语音命令集合可以包括对所述语音特征库内原语音命令对应的语音 特征信息等, 也可以为新增加的语音命令及其对应的语音特征信息等。 因此, 确定需要校正的语音命令可能是原有的语音命令, 也可能是新增加的语音命 上述语音命令的校正方法, 通过提示用户输入与所述语音命令对应的语 音信息, 接收用户根据提示输入的语音信息, 根据该语音信息获取语音特征 信息, 将该语音特征信息更新需要校正的语音命令的语音特征信息, 从而使 得语音命令的语言特征信息能够适应于用户终端设备更换后或者语音特征库 升级后, 解决了现有技术中在用户终端设备更换后或者语音特征库升级后, 语音特征库的部分或者全部语音命令的语音特征信息与设备釆集的对应语音 特征信息会存在偏差, 从而在语音命令操作时存在识别不准确或者不能识别 输入的语音信息的技术问题。 The upgrade voice command set may include voice feature information corresponding to the original voice command in the voice feature library, or may be a newly added voice command and corresponding voice feature information. Therefore, it is determined that the voice command to be corrected may be the original voice command, or may be a newly added voice command. The method for correcting the voice command is configured to prompt the user to input the voice information corresponding to the voice command, receive the voice information input by the user according to the prompt, acquire the voice feature information according to the voice information, and update the voice feature information to the voice command that needs to be corrected. The voice feature information is such that the language feature information of the voice command can be adapted to the user terminal device after the replacement or the voice feature library is upgraded, and the voice feature database is solved after the user terminal device is replaced or the voice feature database is upgraded in the prior art. The voice feature information of some or all of the voice commands may deviate from the corresponding voice feature information of the device set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized.
实施例二 Embodiment 2
基于同样的发明构思, 本申请还提供了一种语音命令的校正方法, 如图 2 所示, 为本申请第二较佳实施方式语音命令的校正方法的方法流程图。 所述 语音命令的校正方法包括以下步骤: Based on the same inventive concept, the present application further provides a method for correcting a voice command, as shown in FIG. 2, which is a flowchart of a method for correcting a voice command according to a second preferred embodiment of the present application. The method for correcting the voice command includes the following steps:
步骤 210, 获得所述语音特征库中需要校正的语音命令集合, 所述语音命 令集合包括 M个语音命令, 所述 M为大于等于 2的整数。 Step 210: Obtain a voice command set that needs to be corrected in the voice feature database, where the voice command set includes M voice commands, and the M is an integer greater than or equal to 2.
具体地, 语音特征库通常包括多个语音命令、 与所述多个语音命令对应 的语音特征信息, 进一步的还可包括与所述多个命令对应的文字信息。 语音 特征库中的语音命令通常为控制指令名称, 当电子设备通过音频输入装置釆 集到的语音信息的特征值与语音特征库中的某个语音命令的对应的特征信息 匹配时, 则所述电子设备会根据所述语音命令对应的控制指令名称执行相应 的控制操作。 语音特征信息, 即语音特征值, 是根据识别特定语音输入语音 命令时产生的语音信息中提取的特征值, 不同的使用者语音输入同一语音命 令时产生的语音信息中的语音特征值可能不同。 Specifically, the voice feature library generally includes a plurality of voice commands, voice feature information corresponding to the plurality of voice commands, and further may further include text information corresponding to the plurality of commands. The voice command in the voice feature library is usually a control instruction name, and when the feature value of the voice information collected by the electronic device through the audio input device matches the corresponding feature information of a voice command in the voice feature library, The electronic device performs a corresponding control operation according to the name of the control instruction corresponding to the voice command. The voice feature information, that is, the voice feature value, is a feature value extracted from the voice information generated when the specific voice input voice command is recognized, and the voice feature values in the voice information generated when different user voices input the same voice command may be different.
步骤 220, 对语音命令集合中的第一个到第 N个语音命令中每个语音命 令执行: 提示用户输入与所述第一语音命令对应的语音信息、 釆集所述用户 输入的语音信息、 根据釆集到的语音信息获得语音特征信息、 根据获得的语 音特征信息更新所述语音特征库中所述第一语音命令的语音特征信息以校正 所述第一语音命令, 所述 N为小于所述 M大于 0的整数, 所述第一语音命令 为所述第一个到第 N个语音命令中的任意一个语音命令。 Step 220: Perform, for each voice command in the first to Nth voice commands in the voice command set, prompting the user to input voice information corresponding to the first voice command, and collecting voice information input by the user, Acquiring the voice feature information according to the collected voice information, and updating the voice feature information of the first voice command in the voice feature database according to the obtained voice feature information to correct the first voice command, where the N is smaller than An integer whose M is greater than 0, the first voice command It is any one of the first to Nth voice commands.
对第一个到第 N个语音命令中每个语音命令执行: 提示用户输入与所述 第一语音命令对应的语音信息、 釆集所述用户输入的语音信息、 根据釆集到 的语音信息获得语音特征信息、 根据获得的语音特征信息更新所述语音特征 库中所述第一语音命令的语音特征信息以校正所述第一语音命令, 可以通过 以下方式实现:对所述第一个到第 N个语音命令中的每个语音命令同时执行, 或者, 对所述第一个到第 N个语音命令逐一执行, 即先对第一个语音命令中 的进行校正后, 再对第二个语音命令进行校正, ... ..., 直到对第 N个语音命 令校正完成为止。 Performing for each of the first to Nth voice commands: prompting the user to input voice information corresponding to the first voice command, collecting voice information input by the user, and obtaining voice information according to the voice information collected by the user And the voice feature information is updated, and the voice feature information of the first voice command in the voice feature database is updated according to the obtained voice feature information to correct the first voice command, which can be implemented by: performing the first to the first Each of the N voice commands is simultaneously executed, or the first to the Nth voice commands are performed one by one, that is, the first voice command is corrected first, and then the second voice is corrected. The command is corrected, ... until the correction of the Nth voice command is completed.
另外, 语音特征信息可以通过提取釆集的语音信息中的美尔倒谱系数 (MFCC , Mel Frequency Cepstrum Coefficient)获得: In addition, the voice feature information can be obtained by extracting the Mel Frequency Cepstrum Coefficient (MFCC) in the collected voice information:
C(n) = ^(m) cos ~ 10≤n < M C(n) = ^(m) cos ~ 10≤n < M
M ) M )
其中, N为对釆集到的语音信息进行傅立叶变换的点数, M为其后使用 的滤波器组的滤波器个数, S(m)为每个滤波器组输出的对数能量。 不同的系 统可能釆用不同阶数的 MFCC系数进行语音特征的提取和识别。 Where N is the number of points of the Fourier transform of the collected speech information, M is the number of filters of the filter bank used later, and S(m) is the logarithmic energy output by each filter bank. Different systems may use different order MFCC coefficients for speech feature extraction and recognition.
MFCC 系数是目前应用广泛的特征信息参数, 提取釆集的语音信息中的 美尔倒谱系数提取语音特征信息是本申请的一种获取语音特征信息的方式, 还可以通过釆用经验模态分解 (EMD)技术等提取语音特征信息,并不限定使用 MFCC系数。 The MFCC coefficient is a feature information parameter that is widely used at present. Extracting the Meir cepstral coefficient in the voice information of the collected set is the way to obtain the speech feature information of the present application, and can also be decomposed by empirical mode decomposition. The (EMD) technique or the like extracts speech feature information, and does not limit the use of MFCC coefficients.
步骤 230, 对所述语音命令集合中的第 N+1个到第 M个语音命令中的每 个语音命令执行: 根据已校正完成的 N个第一语音命令在校正前和校正后的 语音特征信息的 N个偏差值, 确定对所述第二语音命令进行校正的方式, 根 据确定出的校正方式对所述第二语音命令进行校正, 所述第二语音命令为所 述第 N+1个到第 M个语音命令中的任意一个语音命令。 Step 230: Perform, for each of the N+1th to Mth voice commands in the voice command set: before and after the correction of the N first voice commands that have been corrected And determining, by the N deviation values of the information, a manner of correcting the second voice command, and correcting the second voice command according to the determined correction manner, where the second voice command is the (N+1)th Go to any of the M voice commands.
也就是说, 在所述 N个偏差值满足不同条件时, 对所述(M-N )个语音 命令中的每个语音命令校正的方式不同, 具体参见后续描述。 N个偏差值中的每个偏差值可以通过上述 MFCC系数获得语音特征信息, 与对应的第一语音命令的语音特征信息进行比较, 获取偏差值, 其中, 偏差 值可以是获得的语音特征信息与对应的第一语音命令的语音特征信息的比 值, 也可以是获得的语音特征信息与对应的第一语音命令的语音特征信息的 差值。 That is to say, when the N deviation values satisfy different conditions, the manner of correcting each of the (MN) voice commands is different, and refer to the subsequent description. Each of the N deviation values may obtain the voice feature information by using the MFCC coefficient, and compare with the voice feature information of the corresponding first voice command to obtain a deviation value, where the deviation value may be the obtained voice feature information and The ratio of the voice feature information of the corresponding first voice command may also be the difference between the obtained voice feature information and the voice feature information of the corresponding first voice command.
上述语音命令的校正方法, 通过将所述 M个需要校正的语音命令的中第 一个到第 N个语音命令釆用输入的语音信息的语音特征信息更新对应的语音 特征信息以校正所述 N个语音命令, 再根据已校正完成的 N个第一语音命令 在校正前和校正后的语音特征信息的 N个偏差值校正第 N+1个到第 M个语音 命令, 从而完成对所述 M个语音命令的校正, 使得所述语音特征库内的语音 命令的语言特征信息能够适应于用户终端设备更换后或者语音特征库升级 后, 解决了现有技术中在用户终端设备更换后或者语音特征库升级后, 语音 特征库的部分或者全部语音命令的语音特征信息与设备釆集的对应语音特征 信息会存在偏差, 从而在语音命令操作时存在识别不准确或者不能识别输入 的语音信息的技术问题。 The method for correcting the voice command is to correct the N by updating the corresponding voice feature information by using the voice feature information of the input voice information from the first to the Nth voice commands of the M voice commands that need to be corrected. a voice command, and correcting the N+1th to Mth voice commands based on the N deviation values of the pre-corrected and corrected voice feature information according to the corrected N first voice commands, thereby completing the M The correction of the voice command enables the language feature information of the voice command in the voice feature library to be adapted to the user terminal device after the replacement or the voice feature library is upgraded, and solves the problem that the user terminal device is replaced or the voice feature in the prior art. After the library is upgraded, the voice feature information of some or all voice commands of the voice signature database may be deviated from the corresponding voice feature information of the device set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized. .
具体地, 对于如何获取所述获得所述语音特征库中需要校正第一语音特 征信息的语音命令, 有以下两种方式: Specifically, there are two ways for obtaining the voice command for obtaining the voice feature database that needs to correct the first voice feature information:
第一种, 在所述用户终端设备更换后, 也就是说在一个终端设备上新安 装一个语音特征库时, 可釆用该方式。 该方式具体包括以下步骤: First, this method can be used after the user terminal device is replaced, that is, when a voice feature library is newly installed on a terminal device. The method specifically includes the following steps:
接收所述语音特征库; Receiving the voice feature library;
根据系统预先设定的所述语音特征库中的特定语音命令, 或者根据所述 语音特征库中语音命令的使用频率, 或者按照用户的输入信息在所述语音特 征库中选择的语音命令确定需要校正的语音命令, 所述特定语音命令为每次 接收语音特征库后需要校正的语音命令。 Determining a need according to a specific voice command in the voice feature library preset by the system, or according to a frequency of use of the voice command in the voice feature library, or a voice command selected in the voice feature library according to input information of the user The corrected voice command is a voice command that needs to be corrected each time the voice feature library is received.
第二种, 在对所述语音特征库进行数据升级时, 可以釆用该方式, 该方 式具体包括以下步骤: Secondly, when the data is upgraded, the method may be used. The method includes the following steps:
接收更新的语音命令及其语音特征信息, 将所述更新的语音命令确定为 需要校正的语音命令。 Receiving an updated voice command and its voice feature information, determining the updated voice command as Voice commands that need to be corrected.
升级语音命令集合可以包括对所述语音特征库内原语音命令对应的语音 特征信息等, 也可以为新增加的语音命令及其对应的语音特征信息等。 因此, 发生变化的语音命令可能是原有的语音命令, 也可能是新增加的语音命令。 The upgrade voice command set may include voice feature information corresponding to the original voice command in the voice feature library, or may be a newly added voice command and corresponding voice feature information. Therefore, the changed voice command may be the original voice command or a newly added voice command.
具体地, 在所述 N个偏差值不同时, 釆用以下方法确定所述语音命令集 合中的第 N+1个到第 M个语音命令中的每个语音命令进行校正的方式: Specifically, when the N deviation values are different, the following method is used to determine a manner in which each of the N+1th to Mth voice commands in the voice command set is corrected:
若所述 N个偏差值在第一阔值与第二阔值之间, 所述第一阔值小于所述 第二阔值, 则根据所述 N个偏差值查找变化规律; If the N deviation values are between the first threshold and the second threshold, and the first threshold is less than the second threshold, the change rule is searched according to the N deviation values;
若查找到变化规律, 则确定根据所述变化规律对所述第二语音命令的语 音特征信息进行更新以校正所述第二语音命令; 否则, 确定对所述第二语音 命令执行: 提示用户输入与所述第二语音命令对应的语音信息、 釆集所述用 户输入的语音信息、 根据釆集到的语音信息获得语音特征信息、 根据获得的 语音特征信息更新所述第二语音命令的语音特征信息, 以校正所述第二语音 命令; If the change rule is found, determining to update the voice feature information of the second voice command according to the change rule to correct the second voice command; otherwise, determining to perform the second voice command: prompting user input And the voice information corresponding to the second voice command, the voice information input by the user, the voice feature information obtained according to the collected voice information, and the voice feature of the second voice command according to the obtained voice feature information Information to correct the second voice command;
若所述 N个偏差值小于所述第一阔值, 则放弃对所述第二语音命令的校 正 If the N deviation values are less than the first threshold, discarding the correction of the second voice command
若所述 N个偏差值大于所述第二阔值, 则重新确定所述语音特征库中需 要校正语音特征信息的语音命令集合。 And if the N deviation values are greater than the second threshold, the voice command set of the voice feature database that needs to be corrected for the voice feature information is re-determined.
以下通过举例说明, 在所述 N个偏差值不同时, 釆用以下方法确定所述 语音命令集合中的 (M-N )个语音命令中的每个语音命令的第一语音特征信 息进行校正的方式。 4叚设所述第一阔值为 5%, 第二阔值为 20%, 4叚设所述 N 等于 3。 Hereinafter, by way of example, when the N deviation values are different, the following method is used to determine the manner in which the first voice feature information of each of the (M-N) voice commands in the voice command set is corrected. 4, the first threshold is 5%, the second threshold is 20%, and 4 is said to be equal to 3.
在第一种情况时, 第一偏差值为 5.9%, 第二偏差值为 6%, 第三偏差值 为 6%, 因三个偏差值之间的差值范围仅为 0.1 , 则认为三个偏差值存在变化 规律, 即第一语音特征信息与需要的语音特征信息之间相差 6%, 系统可以发 出提示, 如通过显示单元显示 "是否需要按照偏差值 6%调整所述(M-N )个 语音命令中的每个语音命令的第一语音特征信息", 当使用者选择 "是" 时, 则对所述(M-N ) 个语音命令中的每个语音命令的第一语音特征信息按照偏 差值为 6%进行校正。 In the first case, the first deviation value is 5.9%, the second deviation value is 6%, and the third deviation value is 6%. Since the difference between the three deviation values is only 0.1, three There is a variation law of the deviation value, that is, the difference between the first voice feature information and the required voice feature information is 6%, and the system can issue a prompt, such as displaying, by the display unit, "whether the (MN) voice needs to be adjusted according to the deviation value of 6%. The first voice feature information of each voice command in the command", when the user selects "yes", Then, the first voice feature information of each of the (MN) voice commands is corrected according to a deviation value of 6%.
在第二种情况时, 若第一偏差值为 1%, 第二偏差值为 2%, 第三偏差值 为 4%, 因三个偏差值均小于 5%, 因此, 放弃对所述(M-N )个语音命令中 的每个语音命令的校正。 In the second case, if the first deviation value is 1%, the second deviation value is 2%, and the third deviation value is 4%, since the three deviation values are all less than 5%, therefore, the pair is discarded (MN Correction of each voice command in a voice command.
在第三种情况时, 若第一偏差值为 15%, 第二偏差值为 30%, 第三偏差 值为 40%, 因三个偏差值中两个偏差值大于 20%, 因此, 则重新确定所述语 音特征库中需要校正第一语音特征信息的语音命令集合, 如将所述语音特征 库的未校正的所有语音命令均列为需要校正第一语音特征信息的语音命令。 In the third case, if the first deviation value is 15%, the second deviation value is 30%, and the third deviation value is 40%, because two of the three deviation values are greater than 20%, so Determining, in the voice feature library, a set of voice commands that need to be corrected for the first voice feature information, such as all voice commands that are not corrected by the voice feature library are listed as voice commands that need to correct the first voice feature information.
上述语音命令的校正方法 200, 通过将所述 M个需要校正的语音命令的 中前 N个语音命令釆用输入的语音信息的语音特征信息更换对应的语音特征 信息以校正所述前 N个语音命令, 所述后 (M-N )个语音命令通过已校正的 N个语音命令的原语音特征信息和输入的语音特征信息的偏差值选择校正方 式, 从而对所述后 (M-N )个语音命令进行校正, 从而使得语音命令的语言 特征信息能够适应于用户终端设备更换后或者语音特征库升级后, 解决了现 有技术中在用户终端设备更换后或者语音特征库升级后, 语音特征库的部分 或者全部语音命令的语音特征信息与设备釆集的对应语音特征信息会存在偏 差, 从而在语音命令操作时存在识别不准确或者不能识别输入的语音信息的 技术问题。 The above-mentioned voice command correction method 200 corrects the first N voices by replacing the first and last N voice commands of the M voice commands that need to be corrected with the voice feature information of the input voice information by replacing the corresponding voice feature information. And the following (MN) voice commands select a correction mode by using the original voice feature information of the corrected N voice commands and the deviation value of the input voice feature information, thereby correcting the latter (MN) voice commands. Therefore, the language feature information of the voice command can be adapted to the user terminal device after the replacement or the voice feature library is upgraded, and the part or all of the voice feature database after the user terminal device is replaced or the voice feature library is upgraded in the prior art is solved. The voice feature information of the voice command may be deviated from the corresponding voice feature information of the device set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized.
实施例三 Embodiment 3
基于同样的发明构思, 本申请还提供了一种语音命令的校正系统 300, 如 图 3所示, 为本申请第三较佳实施方式语音命令的校正系统 300的结构框图。 所述校正系统 300包括: 确定单元 310、 提示单元 320、 釆集单元 330、 获得 单元 340和校正单元 350。 Based on the same inventive concept, the present application further provides a voice command correction system 300, as shown in FIG. 3, which is a structural block diagram of a voice command correction system 300 according to a third preferred embodiment of the present application. The correction system 300 includes: a determining unit 310, a prompting unit 320, a collecting unit 330, an obtaining unit 340, and a correcting unit 350.
确定单元 310 , 用于获得语音特征库中需要校正的语音命令; a determining unit 310, configured to obtain a voice command in the voice feature library that needs to be corrected;
提示单元 320 , 用于提示用户输入与所述语音命令对应的语音信息; 釆集单元 330 , 用于釆集所述用户输入的语音信息; 获得单元 340 , 用于根据釆集到的语音信息获得语音特征信息; The prompting unit 320 is configured to prompt the user to input voice information corresponding to the voice command, and the collecting unit 330 is configured to collect voice information input by the user; The obtaining unit 340 is configured to obtain the voice feature information according to the collected voice information;
校正单元 350 ,用于根据获得的语音特征信息更新所述需要校正的语音命 令的语音特征信息, 以校正所述语音命令。 The correcting unit 350 is configured to update the voice feature information of the voice command to be corrected according to the obtained voice feature information to correct the voice command.
在具体实施方式中, 所述确定单元 310具体用于: 接收所述语音特征库; 根据系统预先设定的所述语音特征库中的特定语音命令, 或者根据所述语音 特征库中语音命令的使用频率, 或者按照用户的输入信息在所述语音特征库 中选择的语音命令确定需要校正的语音命令, 所述特定语音命令为每次接收 语音特征库后需要校正的语音命令。 In a specific implementation, the determining unit 310 is specifically configured to: receive the voice feature library; according to a specific voice command in the voice feature library preset by the system, or according to a voice command in the voice feature library The voice command to be corrected is determined using a frequency, or a voice command selected in the voice feature library according to the user's input information, and the specific voice command is a voice command that needs to be corrected each time the voice feature library is received.
在具体实施方式中, 所述确定单元 310具体用于: 接收更新的语音命令 及其语音特征信息, 将所述更新的语音命令确定为需要校正的语音命令。 In a specific implementation, the determining unit 310 is specifically configured to: receive an updated voice command and voice feature information, and determine the updated voice command as a voice command that needs to be corrected.
在具体实施方式中, 所述校正单元 350具体用于: 将获得的语音特征信 息替换所述需要校正的语音命令的语音特征信息。 In a specific implementation, the correcting unit 350 is specifically configured to: replace the obtained voice feature information with the voice feature information of the voice command that needs to be corrected.
在具体实施方式中, 所述提示单元 320具体用于: 根据所述语音命令的 语音特征信息生成语音信息并播放, 或者播放所述语音命令对应的机器音; 或者显示所述语音命令对应的文字或图像信息。 In a specific implementation manner, the prompting unit 320 is specifically configured to: generate voice information according to the voice feature information of the voice command and play, or play a machine sound corresponding to the voice command; or display a text corresponding to the voice command Or image information.
上述语音命令的校正系统 300 ,通过提示单元 320提示用户输入与所述第 一语音命令对应的语音信息, 通过釆集单元 330釆集用户根据提示输入的语 音信息, 通过获得单元 340根据该语音信息获取语音特征信息, 通过校正单 元 350将该语音特征信息更新所述需要校正的语音命令的语音特征信息, 从 而使得语音命令的语言特征信息能够适应于用户终端设备更换后或者语音特 征库升级后, 解决了现有技术中在用户终端设备更换后或者语音特征库升级 后, 语音特征库的部分或者全部语音命令的语音特征信息与设备釆集的对应 语音特征信息会存在偏差, 从而在语音命令操作时存在识别不准确或者不能 识别输入的语音信,包、的技术问题。 The above-mentioned voice command correction system 300 prompts the user to input the voice information corresponding to the first voice command through the prompting unit 320, and collects the voice information input by the user according to the prompt by the collecting unit 330, and the obtaining unit 340 according to the voice information. Acquiring the voice feature information, and the voice feature information is updated by the correcting unit 350 to the voice feature information of the voice command to be corrected, so that the language feature information of the voice command can be adapted after the user terminal device is replaced or the voice feature library is upgraded. After the replacement of the user terminal device or the upgrade of the voice feature library in the prior art, the voice feature information of some or all of the voice commands of the voice feature library may be deviated from the corresponding voice feature information of the device set, thereby operating in the voice command. There are technical problems in the identification of inaccurate or unrecognizable voice messages, packets.
实施例四 Embodiment 4
基于同样的发明构思, 本申请还提供了一种语音命令的校正系统, 如图 4 所示, 为本申请第四较佳实施方式语音命令的校正系统 400 的结构框图。 所 述校正系统 400包括:获得单元 410、第一校正单元 420和第二校正单元 430。 获得单元 410, 用于获得语音特征库中需要校正的语音命令集合, 所述语 音命令集合包括 M个语音命令, 所述 M为大于等于 2的整数; Based on the same inventive concept, the present application further provides a voice command correction system, as shown in FIG. 4, which is a structural block diagram of a voice command correction system 400 according to a fourth preferred embodiment of the present application. Place The correction system 400 includes an obtaining unit 410, a first correcting unit 420, and a second correcting unit 430. The obtaining unit 410 is configured to obtain a voice command set that needs to be corrected in the voice feature database, where the voice command set includes M voice commands, where the M is an integer greater than or equal to 2;
第一校正单元 420, 用于对语音命令集合中的第一个到第 N个语音命令 中每个语音命令执行: 提示用户输入与所述第一语音命令对应的语音信息、 釆集所述用户输入的语音信息、 根据釆集到的语音信息获得语音特征信息、 根据获得的语音特征信息更新所述语音特征库中所述第一语音命令的音特征 信息以校正所述第一语音命令, 所述 N为小于所述 M大于 0的整数, 所述第 一语音命令为所述第一个到第 N个语音命令中的任意一个语音命令。 The first correcting unit 420 is configured to perform, for each voice command, the first to the Nth voice commands in the voice command set: prompting the user to input voice information corresponding to the first voice command, and collecting the user Inputting voice information, obtaining voice feature information according to the collected voice information, updating sound feature information of the first voice command in the voice feature database according to the obtained voice feature information, to correct the first voice command, N is an integer smaller than the M is greater than 0, and the first voice command is any one of the first to Nth voice commands.
第二校正单元 430, 用于对所述语音命令集合中的第 N+1个到第 M个语 音命令中的每个语音命令执行: 根据已校正完成的 N个第一语音命令在校正 前和校正后的语音特征信息的 N个偏差值, 确定对所述第二语音命令进行校 正的方式, 根据确定出的校正方式对所述第二语音命令进行校正, 所述第二 语音命令为所述第 N+1个到第 M个语音命令中的任意一个语音命令。 a second correcting unit 430, configured to perform, for each of the N+1th to Mth voice commands in the voice command set: according to the corrected N first voice commands before and after the correction And determining, by the N deviation values of the corrected voice feature information, a manner of correcting the second voice command, and correcting the second voice command according to the determined correction mode, where the second voice command is Any one of the N+1th to Mth voice commands.
在具体实施方式中, 所述第二校正单元 430具体用于: In a specific implementation, the second correcting unit 430 is specifically configured to:
若所述 N个偏差值在第一阔值与第二阔值之间, 所述第一阔值小于所述 第二阔值, 则根据所述 N个偏差值查找变化规律; If the N deviation values are between the first threshold and the second threshold, and the first threshold is less than the second threshold, the change rule is searched according to the N deviation values;
若查找到变化规律, 则确定根据所述变化规律对所述第二语音命令的语 音特征信息进行更新以校正所述第二语音命令; 否则, 确定对所述第二语音 命令执行: 提示用户输入与所述第二语音命令对应的语音信息、 釆集所述用 户输入的语音信息、 根据釆集到的语音信息获得语音特征信息、 根据获得的 语音特征信息更新所述第二语音命令的语音特征信息, 以校正所述第二语音 命令。 If the change rule is found, determining to update the voice feature information of the second voice command according to the change rule to correct the second voice command; otherwise, determining to perform the second voice command: prompting user input And the voice information corresponding to the second voice command, the voice information input by the user, the voice feature information obtained according to the collected voice information, and the voice feature of the second voice command according to the obtained voice feature information Information to correct the second voice command.
在具体实施方式中, 所述第二校正单元 430还用于: 若所述 N个偏差值 小于所述第一阔值, 则放弃对所述第二语音命令的校正。 In a specific implementation, the second correcting unit 430 is further configured to: if the N deviation values are less than the first threshold, discard the correction of the second voice command.
在具体实施方式中, 所述第二校正单元 430还用于: 若所述 N个偏差值 大于所述第二阔值, 则重新确定所述语音特征库中需要校正的语音命令集合。 上述语音命令的校正系统 400, 通过第一校正单元 420将所述 M个需要 校正的语音命令的中第一个到第 N个语音命令釆用输入的语音信息的语音特 征信息更新对应的语音特征信息以校正所述 N个语音命令, 通过第二校正单 元 430根据所述 N个偏差值确定第 N+1个到第 M个语音命令的校正方式,对 所述(M-N )个语音命令进行校正, 从而使得语音命令的语言特征信息能够 适应于用户终端设备更换后或者语音特征库升级后, 解决了现有技术中在用 户终端设备更换后或者语音特征库升级后, 语音特征库的部分或者全部语音 命令的语音特征信息与设备釆集的对应语音特征信息会存在偏差, 从而在语 音命令操作时存在识别不准确或者不能识别输入的语音信息的技术问题。 In a specific implementation, the second correcting unit 430 is further configured to: if the N deviation values are greater than the second threshold, re-determine the voice command set in the voice feature database that needs to be corrected. The above-mentioned voice command correction system 400 updates the voice feature information of the input voice information to the corresponding voice feature by using the first to Nth voice commands of the M voice commands to be corrected by the first correction unit 420. The information is used to correct the N voice commands, and the second correction unit 430 determines the correction mode of the (N)th to Mth voice commands according to the N deviation values, and corrects the (MN) voice commands. Therefore, the language feature information of the voice command can be adapted to the user terminal device after the replacement or the voice feature library is upgraded, and the part or all of the voice feature database after the user terminal device is replaced or the voice feature library is upgraded in the prior art is solved. The voice feature information of the voice command may be deviated from the corresponding voice feature information of the device set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized.
实施例五 Embodiment 5
基于同样的发明构思, 本申请还提供了一种电子设备 500, 如图 5所示, 为本申请第五较佳实施方式电子设备 500 的结构示意图。 所述电子设备 500 包括: 存储器 510、 处理器 520、 输出装置 530和输入装置 540。 Based on the same inventive concept, the present application further provides an electronic device 500. As shown in FIG. 5, it is a schematic structural diagram of an electronic device 500 according to a fifth preferred embodiment of the present application. The electronic device 500 includes a memory 510, a processor 520, an output device 530, and an input device 540.
存储器 510, 用于存储语音特征库, 所述语音特征库包括多个语音命令; 存储器 510是终端系统中的记忆设备, 用来存放程序和数据。 存储器 510也 可以为外部存储器如磁性介质或光盘, 也可以为内部存储器。 The memory 510 is configured to store a voice feature library, where the voice feature library includes a plurality of voice commands. The memory 510 is a memory device in the terminal system for storing programs and data. The memory 510 can also be an external memory such as a magnetic medium or an optical disk, or an internal memory.
处理器 520, 用于获得语音特征库中需要校正的语音命令,根据釆集到的 语音信息获得语音特征信息, 并根据所述语音特征信息更新所述需要校正的 语音命令的语音特征信息, 以校正所述语音命令。 The processor 520 is configured to obtain a voice command that needs to be corrected in the voice feature database, obtain voice feature information according to the collected voice information, and update voice feature information of the voice command that needs to be corrected according to the voice feature information, to Correct the voice command.
输出装置 530 , 用于提示用户输入与所述语音命令对应的语音信息。 输出 设备 530是将数据以数字、 字符、 图像、 声音等形式表示出来, 如显示器、 语音输出系统如喇叭等。 The output device 530 is configured to prompt the user to input voice information corresponding to the voice command. The output device 530 expresses data in the form of numbers, characters, images, sounds, etc., such as a display, a voice output system such as a speaker, and the like.
输入装置 540 , 用于釆集所述用户输入的语音信息。输入装置 540可以为 麦克风、 话筒等装置。 The input device 540 is configured to collect voice information input by the user. The input device 540 can be a device such as a microphone or a microphone.
在具体实施方式中, 在所述电子设备 500在接收到所述语音特征库时, 所述处理器 520具体用于: 根据系统预先设定的所述语音特征库中的特定语 音命令, 或者根据所述语音特征库中语音命令的使用频率, 或者按照用户的 输入信息在所述语音特征库中选择的语音命令确定需要校正的语音命令, 所 述特定语音命令为每次接收语音特征库后需要校正的语音命令。 In a specific implementation manner, when the electronic device 500 receives the voice feature database, the processor 520 is specifically configured to: according to a specific voice command in the voice feature library preset by the system, or according to The frequency of use of the voice command in the voice feature library, or according to the user's The voice command selected by the input information in the voice feature library determines a voice command to be corrected, and the specific voice command is a voice command that needs to be corrected each time the voice feature library is received.
在具体实施方式中, 所述处理器 520还用于: 接收更新的语音命令及其 语音特征信息, 并将所述更新的语音命令确定为需要校正的语音命令。 In a specific implementation, the processor 520 is further configured to: receive an updated voice command and voice feature information thereof, and determine the updated voice command as a voice command that needs to be corrected.
在具体实施方式中, 所述处理器 520具体用于: 将获得的语音特征信息 替换所述需要校正的语音命令的语音特征信息。 In a specific implementation, the processor 520 is specifically configured to: replace the obtained voice feature information with the voice feature information of the voice command that needs to be corrected.
在具体实施方式中, 所述输出装置 530具体为语音播放器或者显示屏; 所述语音播放器用于根据所述语音命令的语音特征信息生成语音信息并播 放, 或播放所述语音命令对应的机器音; 所述显示屏用于显示所述语音命令 对应的文字或图像信息。 In a specific implementation manner, the output device 530 is specifically a voice player or a display screen; the voice player is configured to generate voice information according to the voice feature information of the voice command, and play, or play the machine corresponding to the voice command. The display screen is used to display text or image information corresponding to the voice command.
上述电子设备 500,通过所述输出装置 530提示用户输入与所述语音命令 对应的语音信息, 通过输入装置 540接收用户根据提示输入的语音信息, 通 过处理器 520根据该语音信息获取语音特征信息, 将该语音特征信息更新所 述需要校正的语音特征信息, 从而使得语音命令的语言特征信息能够适应于 用户终端设备更换后或者语音特征库升级后, 解决了现有技术中在用户终端 设备更换后或者语音特征库升级后, 语音特征库的部分或者全部语音命令的 语音特征信息与设备釆集的对应语音特征信息会存在偏差, 从而在语音命令 操作时存在识别不准确或者不能识别输入的语音信息的技术问题。 The electronic device 500 is configured to prompt the user to input voice information corresponding to the voice command through the output device 530, and receive voice information input by the user according to the prompt by the input device 540, and acquire the voice feature information according to the voice information by the processor 520. The voice feature information is updated to the voice feature information that needs to be corrected, so that the language feature information of the voice command can be adapted to the user terminal device after the replacement or the voice feature database is upgraded, and the prior art is replaced after the user terminal device is replaced. After the voice signature database is upgraded, the voice feature information of some or all of the voice commands of the voice feature library may be deviated from the corresponding voice feature information of the device set, so that the voice information may be inaccurately recognized or the input voice information may not be recognized during the voice command operation. Technical problem.
实施例六 Embodiment 6
基于同样的发明构思, 本申请还提供了一种电子设备 600, 如图 6所示, 为本申请第六较佳实施方式电子设备 600 的结构示意图。 所述电子设备 600 包括: 存储器 610、 处理器 620、 输出装置 630、 输入装置 640。 Based on the same inventive concept, the present application further provides an electronic device 600. As shown in FIG. 6, FIG. 6 is a schematic structural diagram of an electronic device 600 according to a sixth preferred embodiment of the present application. The electronic device 600 includes: a memory 610, a processor 620, an output device 630, and an input device 640.
存储器 610, 用于存储语音特征库, 所述语音特征库包括多个语音命令。 存储器 610是终端系统中的记忆设备, 用来存放程序和数据。 存储器 610也 可以为外部存储器如磁性介质或光盘, 也可以为内部存储器。 The memory 610 is configured to store a voice feature library, where the voice feature library includes a plurality of voice commands. The memory 610 is a memory device in the terminal system for storing programs and data. The memory 610 can also be an external memory such as a magnetic medium or an optical disk, or an internal memory.
处理器 620, 用于获得语音特征库中需要校正的语音命令集合,对语音命 令集合中的第一个到第 N个语音命令中每个语音命令执行: 根据釆集到的语 音信息获得语音特征信息、 根据获得的语音特征信息更新所述语音特征库中 所述第一语音命令的音特征信息以校正所述第一语音命令; 对所述语音命令 集合中的第 N+1个到第 M个语音命令中的每个语音命令执行:根据已校正完 成的 N个第一语音命令在校正前和校正后的语音特征信息的 N个偏差值, 确 定对所述第二语音命令进行校正的方式, 根据确定出的校正方式对所述第二 语音命令进行校正, 所述 M为大于等于 2的整数, 所述 N为小于所述 M大 于 0的整数, 所述第一语音命令为所述第一个到第 N个语音命令中的任意一 个语音命令,所述第二语音命令为所述第 N+1个到第 M个语音命令中的任意 一个语音命令; The processor 620 is configured to obtain a voice command set that needs to be corrected in the voice feature database, and execute each voice command in the first to Nth voice commands in the voice command set: according to the collected words Acquiring information to obtain voice feature information, updating tone feature information of the first voice command in the voice feature library according to the obtained voice feature information to correct the first voice command; and N+ in the voice command set Performing each of the 1 to M voice commands: determining the second voice based on the N deviation values of the corrected and corrected voice feature information of the N first voice commands that have been corrected The command is used to perform the correction, and the second voice command is corrected according to the determined correction mode, where M is an integer greater than or equal to 2, and the N is an integer smaller than the M greater than 0, the first voice The command is any one of the first to Nth voice commands, and the second voice command is any one of the N+1th to Mth voice commands;
输出装置 630, 用于提示用户输入与所述第一语音命令对应的语音信息。 输出设备 630是将数据以数字、 字符、 图像、 声音等形式表示出来, 如显示 器、 语音输出系统如喇叭等。 The output device 630 is configured to prompt the user to input voice information corresponding to the first voice command. The output device 630 expresses data in the form of numbers, characters, images, sounds, etc., such as a display, a voice output system such as a speaker, and the like.
输入装置 640, 用于釆集所述用户输入的语音信息。输入装置 640可以为 麦克风、 话筒等装置。 The input device 640 is configured to collect voice information input by the user. The input device 640 can be a device such as a microphone or a microphone.
在具体实施方式中, 所述处理器 620具体用于: 若所述 N个偏差值在第 一阔值与第二阔值之间, 所述第一阔值小于所述第二阔值, 则根据所述 N个 偏差值查找变化规律; In a specific implementation, the processor 620 is specifically configured to: if the N deviation values are between the first threshold and the second threshold, and the first threshold is less than the second threshold, Finding a change rule according to the N deviation values;
若查找到变化规律, 则确定根据所述变化规律对所述第二语音命令的第 一语音特征信息进行更新以校正所述第二语音命令; 否则, 确定对所述第二 语音命令执行: 提示用户输入与所述第二语音命令对应的语音信息、 釆集所 述用户输入的语音信息、 根据釆集到的语音信息获得语音特征信息、 根据获 得的语音特征信息更新所述第二语音命令的语音特征信息, 以校正所述第二 语音命令。 If the change rule is found, determining to update the first voice feature information of the second voice command according to the change rule to correct the second voice command; otherwise, determining to perform the second voice command: prompting The user inputs voice information corresponding to the second voice command, collects voice information input by the user, obtains voice feature information according to the collected voice information, and updates the second voice command according to the obtained voice feature information. Voice feature information to correct the second voice command.
在具体实施方式中, 所述处理器 620具体还用于: 若所述 N个偏差值小 于所述第一阔值, 则放弃对所述第二语音命令的校正。 In a specific implementation, the processor 620 is further configured to: if the N deviation values are less than the first threshold, discard the correction of the second voice command.
在具体实施方式中, 所述处理器 620具体还用于: 若所述 N个偏差值大 于所述第二阔值, 则重新确定所述语音特征库中需要校正的语音命令集合。 上述电子设备 600,通过输出装置 630提示用户输入与所述第一语音命令 对应的语音信息, 通过输入装置 640釆集用户根据提示输入的语音信息, 通 过处理器 620提取所述语音信息中的语音特征信息, 将提取的语音特征信息 更新所述第一语音命令的语音特征信息, 从而完成所述 M个语音命令中的第 一个到第 N个语音命令的校正, 再通过所述处理器 620, 跟据所述根据已校 正完成的 N个第一语音命令在校正前和校正后的语音特征信息的 N个偏差值 选择对所述 M个语音命令中 (M-N )个语音命令的校正方式, 跟据所述校正 方式对所述(M-N )个语音命令进行校正, 从而使得语音命令的语言特征信 息能够适应于用户终端设备更换后或者语音特征库升级后, 解决了现有技术 中在用户终端设备更换后或者语音特征库升级后, 语音特征库的部分或者全 部语音命令的语音特征信息与设备釆集的对应语音特征信息会存在偏差, 从 而在语音命令操作时存在识别不准确或者不能识别输入的语音信息的技术问 题。 In a specific implementation, the processor 620 is further configured to: if the N deviation values are greater than the second threshold, re-determine the voice command set in the voice feature database that needs to be corrected. The electronic device 600 is configured to prompt the user to input voice information corresponding to the first voice command through the output device 630, and collect voice information input by the user according to the prompt by the input device 640, and extract the voice in the voice information by using the processor 620. Feature information, updating the extracted voice feature information to the voice feature information of the first voice command, thereby completing correction of the first to Nth voice commands of the M voice commands, and then passing the processor 620 And selecting, according to the N first deviation commands that have been corrected according to the N deviation values of the pre-corrected and corrected speech feature information, a manner of correcting the (MN) voice commands in the M voice commands, Correcting the (MN) voice commands according to the correction manner, so that the language feature information of the voice command can be adapted to the user terminal device after the replacement or the voice feature library is upgraded, and the prior art in the user terminal is solved. After the equipment is replaced or after the voice signature database is upgraded, the voice feature information and settings of some or all voice commands of the voice signature database are set. There is a deviation in the corresponding voice feature information of the backup set, so that there is a technical problem that the voice command operation is inaccurate or the input voice information cannot be recognized.
尽管已描述了本发明的优选实施例, 但本领域内的技术人员一旦得知了 基本创造性概念, 则可对这些实施例作出另外的变更和修改。 所以, 所附权 利要求意欲解释为包括优选实施例以及落入本发明范围的所有变更和修改。 发明的精神和范围。 这样, 倘若本发明的这些修改和变型属于本发明权利要 求及其等同技术的范围之内, 则本发明也意图包含这些改动和变型在内。 Although the preferred embodiment of the invention has been described, it will be apparent to those skilled in the < Therefore, the appended claims are intended to be interpreted as including the preferred embodiments and the modifications and modifications The spirit and scope of the invention. Thus, it is intended that the present invention cover the modifications and variations of the inventions
Claims
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201310719786.8 | 2013-12-23 | ||
| CN201310719786.8A CN103699359B (en) | 2013-12-23 | 2013-12-23 | A kind of bearing calibration of voice command, correction system and electronic equipment |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| WO2015096504A1 true WO2015096504A1 (en) | 2015-07-02 |
Family
ID=50360900
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/CN2014/085567 Ceased WO2015096504A1 (en) | 2013-12-23 | 2014-08-29 | Voice command correcting method, correcting system and electronic device |
Country Status (2)
| Country | Link |
|---|---|
| CN (1) | CN103699359B (en) |
| WO (1) | WO2015096504A1 (en) |
Families Citing this family (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103699359B (en) * | 2013-12-23 | 2017-12-29 | 华为技术有限公司 | A kind of bearing calibration of voice command, correction system and electronic equipment |
| CN108053674A (en) * | 2018-01-16 | 2018-05-18 | 湖州华科信息咨询有限公司 | A kind of method and apparatus for being used for traffic lights fault cues and repair |
| CN118588061B (en) * | 2024-08-02 | 2024-11-05 | 深圳唯创知音电子有限公司 | Equipment control method, system, equipment and storage medium based on voice recognition |
Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6513005B1 (en) * | 1999-07-27 | 2003-01-28 | International Business Machines Corporation | Method for correcting error characters in results of speech recognition and speech recognition system using the same |
| CN101029829A (en) * | 2006-02-27 | 2007-09-05 | 环达电脑(上海)有限公司 | Method for updating navigation speech |
| US20090326938A1 (en) * | 2008-05-28 | 2009-12-31 | Nokia Corporation | Multiword text correction |
| US20120303368A1 (en) * | 2011-05-27 | 2012-11-29 | Ting Ma | Number-assistant voice input system, number-assistant voice input method for voice input system and number-assistant voice correcting method for voice input system |
| CN103207769A (en) * | 2012-01-16 | 2013-07-17 | 联想(北京)有限公司 | Method and user equipment for voice amending |
| CN103366741A (en) * | 2012-03-31 | 2013-10-23 | 盛乐信息技术(上海)有限公司 | Voice input error correction method and system |
| CN103699359A (en) * | 2013-12-23 | 2014-04-02 | 华为技术有限公司 | Correction method, correction system for voice command and electronic device |
-
2013
- 2013-12-23 CN CN201310719786.8A patent/CN103699359B/en not_active Expired - Fee Related
-
2014
- 2014-08-29 WO PCT/CN2014/085567 patent/WO2015096504A1/en not_active Ceased
Patent Citations (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6513005B1 (en) * | 1999-07-27 | 2003-01-28 | International Business Machines Corporation | Method for correcting error characters in results of speech recognition and speech recognition system using the same |
| CN101029829A (en) * | 2006-02-27 | 2007-09-05 | 环达电脑(上海)有限公司 | Method for updating navigation speech |
| US20090326938A1 (en) * | 2008-05-28 | 2009-12-31 | Nokia Corporation | Multiword text correction |
| US20120303368A1 (en) * | 2011-05-27 | 2012-11-29 | Ting Ma | Number-assistant voice input system, number-assistant voice input method for voice input system and number-assistant voice correcting method for voice input system |
| CN103207769A (en) * | 2012-01-16 | 2013-07-17 | 联想(北京)有限公司 | Method and user equipment for voice amending |
| CN103366741A (en) * | 2012-03-31 | 2013-10-23 | 盛乐信息技术(上海)有限公司 | Voice input error correction method and system |
| CN103699359A (en) * | 2013-12-23 | 2014-04-02 | 华为技术有限公司 | Correction method, correction system for voice command and electronic device |
Also Published As
| Publication number | Publication date |
|---|---|
| CN103699359B (en) | 2017-12-29 |
| CN103699359A (en) | 2014-04-02 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US10134390B2 (en) | Electronic device and voice recognition method thereof | |
| WO2017066424A1 (en) | Method and apparatus for initiating an operation using voice data | |
| CN107644638A (en) | Audio recognition method, device, terminal and computer-readable recording medium | |
| CN107146631B (en) | Music identification method, note identification model establishment method, device and electronic equipment | |
| US9196253B2 (en) | Information processing apparatus for associating speaker identification information to speech data | |
| CN113113040B (en) | Audio processing method and device, terminal and storage medium | |
| CN106796785A (en) | Sample sound for producing sound detection model is verified | |
| CN111091811B (en) | Method and device for processing voice training data and storage medium | |
| CN116229987B (en) | Campus voice recognition method, device and storage medium | |
| US20200135211A1 (en) | Information processing method, information processing device, and recording medium | |
| CN110400567A (en) | Registered voiceprint dynamic update method and computer storage medium | |
| CN113870873A (en) | Intelligent tuning method, device, medium and computing device based on tone color | |
| CN106910499A (en) | The control method and device of application program | |
| US10224029B2 (en) | Method for using voiceprint identification to operate voice recognition and electronic device thereof | |
| WO2015096504A1 (en) | Voice command correcting method, correcting system and electronic device | |
| WO2017049475A1 (en) | Information processing method and smart wristband | |
| JP2018045127A (en) | Computer program for speech recognition, speech recognition apparatus and speech recognition method | |
| US20170270923A1 (en) | Voice processing device and voice processing method | |
| CN114067807A (en) | Audio data processing method, device and electronic device | |
| CN113707183B (en) | Audio processing method and device in video | |
| EP3149727A1 (en) | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system | |
| WO2016045468A1 (en) | Voice input control method and apparatus, and terminal | |
| WO2019041871A1 (en) | Voice object recognition method and device | |
| CN112992167A (en) | Audio signal processing method and device and electronic equipment | |
| JP2017161581A (en) | Voice recognition device, voice recognition program |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 14873752 Country of ref document: EP Kind code of ref document: A1 |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| 122 | Ep: pct application non-entry in european phase |
Ref document number: 14873752 Country of ref document: EP Kind code of ref document: A1 |