CN110059059A

CN110059059A - Batch screening technique, device, computer equipment and the storage medium of voice messaging

Info

Publication number: CN110059059A
Application number: CN201910197526.6A
Authority: CN
Inventors: 王涛
Original assignee: Ping An Technology Shenzhen Co Ltd
Current assignee: Ping An Technology Shenzhen Co Ltd
Priority date: 2019-03-15
Filing date: 2019-03-15
Publication date: 2019-07-26
Anticipated expiration: 2039-03-15
Also published as: CN110059059B

Abstract

The embodiment of the invention discloses batch screening technique, device, computer equipment and the storage mediums of a kind of voice messaging, if wherein obtaining the import folders address of the file where preset training set the method includes receiving message processing directives；Preset threshold and preset first export folders address, the second export folders address are determined according to the message processing directives；The import folders address is read to obtain all voice messagings to be processed；Preset voice screening script is called to extract the characteristic information of each voice messaging to be processed respectively；All characteristic informations are successively read to judge whether it matches with preset threshold；If so, storing voice messaging to be measured corresponding to this feature information into the second export folders corresponding to the second export folders address to be used for batch signatures.The present invention can efficiently and accurately realize the unified screening to multiple voice messagings to be processed in training set, and reduce the mistake of screening process.

Description

Batch screening technique, device, computer equipment and the storage medium of voice messaging

Technical field

The present invention relates to data processing field more particularly to batch screening technique, device, the computers of a kind of voice messaging Equipment and storage medium.

Background technique

It usually requires to collect or acquire a large amount of voice messagings from various channels in speech recognition project, and utilizes these languages Message breath is trained neural network as the training sample in training set, to obtain accordingly for carrying out the language of feature The identification model of sound identification.And it is accurate in order to ensure the smooth and acquired identification model of the training process of neural network Property, it usually needs the pre-processing before being trained to acquired voice messaging, such as the screening of effective voice messaging, and it is real Now need progressive alternate that could complete the pretreatment work of a large amount of voice messaging, but the process factor of iteration processing It is big according to amount, it is very easy to operation error occur, causes the problem of voice messaging screening inaccuracy.

Summary of the invention

The embodiment of the present invention provides batch screening technique, device, computer equipment and the storage medium of a kind of voice messaging, It can efficiently and accurately realize the unified screening to multiple voice messagings to be processed in training set, and reduce the mistake of screening process Accidentally.

In a first aspect, the embodiment of the invention provides a kind of batch screening techniques of voice messaging, this method comprises:

If receiving message processing directives, the address of the file where preset training set is obtained, and the address is made For import folders address, the training set includes multiple voice messagings to be processed；

Preset threshold and preset first export folders address, the second output are determined according to the message processing directives Folder address, wherein first export folders address is the address that the first export folders is saved, and described first is defeated File includes multiple readable text files out, and second export folders address is the ground that the second export folders is saved Location；

The import folders address is read to obtain all voice messagings to be processed；

Call preset voice screening script to extract the characteristic information of each voice messaging to be processed respectively, and will be each The characteristic information of voice messaging to be processed is respectively written into different readable text files；

The characteristic information in all readable text files in first export folders is successively read to judge State whether the characteristic information in readable text file matches with preset threshold；

It is if the characteristic information in the readable text file matches with preset threshold, the readable text file institute is right The voice messaging to be measured answered is stored into second export folders for batch signatures.

Second aspect, the embodiment of the invention also provides a kind of batch screening plant of voice messaging, which includes using In the unit for executing the above method.

The third aspect, the embodiment of the invention also provides a kind of computer equipments comprising memory and processor, it is described Computer program is stored on memory, the processor realizes the above method when executing the computer program.

Fourth aspect, the embodiment of the invention also provides a kind of computer readable storage medium, the storage medium storage There is computer program, the computer program can realize the above method when being executed by a processor.

The embodiment of the invention provides a kind of batch screening technique of voice messaging, device, computer equipment and storages to be situated between Matter.Wherein, which comprises if receiving message processing directives, the address of the file where preset training set is obtained, And using the address as import folders address, the training set includes multiple voice messagings to be processed；At the information Reason, which instructs, determines preset threshold and preset first export folders address, the second export folders address；It reads described defeated Enter folder address to obtain all voice messagings to be processed；Call preset voice screening script with extract respectively each to The characteristic information of voice messaging is handled, and the characteristic information of each voice messaging to be processed is respectively written into different readable texts In file；It is described to judge to be successively read the characteristic information in all readable text files in first export folders Whether the characteristic information in readable text file matches with preset threshold；If characteristic information in the readable text file with Preset threshold matches, then stores voice messaging to be measured corresponding to the readable text file to second export folders In be used for batch signatures.The embodiment of the present invention can efficiently and accurately be realized in training set by above-mentioned batch processing The unified screening of multiple voice messagings to be processed, and the mistake of screening process is reduced, in order to accurately realize neural network Training.

Detailed description of the invention

Technical solution in order to illustrate the embodiments of the present invention more clearly, below will be to needed in embodiment description Attached drawing is briefly described, it should be apparent that, drawings in the following description are some embodiments of the invention, general for this field For logical technical staff, without creative efforts, it is also possible to obtain other drawings based on these drawings.

Fig. 1 is a kind of flow diagram of the batch screening technique of voice messaging provided in an embodiment of the present invention；

Fig. 2 is a kind of sub-process schematic diagram of the batch screening technique of voice messaging provided in an embodiment of the present invention；

Fig. 3 is a kind of sub-process schematic diagram of the batch screening technique of voice messaging provided in an embodiment of the present invention；

Fig. 4 be another embodiment of the present invention provides a kind of voice messaging batch screening technique flow diagram；

Fig. 5 is a kind of schematic block diagram of the batch screening plant of voice messaging provided in an embodiment of the present invention；

Fig. 6 is a kind of signal of the information determination unit of the batch screening plant of voice messaging provided in an embodiment of the present invention Property block diagram；

Fig. 7 is a kind of signal of the information judging unit of the batch screening plant of voice messaging provided in an embodiment of the present invention Property block diagram；

Fig. 8 be another embodiment of the present invention provides a kind of voice messaging batch screening plant schematic block diagram；

Fig. 9 is a kind of computer equipment structure composition schematic diagram provided in an embodiment of the present invention.

Specific embodiment

Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that described embodiments are some of the embodiments of the present invention, instead of all the embodiments.Based on this hair Embodiment in bright, every other implementation obtained by those of ordinary skill in the art without making creative efforts Example, shall fall within the protection scope of the present invention.

It should be appreciated that ought use in this specification and in the appended claims, term " includes " and "comprising" instruction Described feature, entirety, step, operation, the presence of element and/or component, but one or more of the other feature, whole is not precluded Body, step, operation, the presence or addition of element, component and/or its set.

It is also understood that mesh of the term used in this description of the invention merely for the sake of description specific embodiment And be not intended to limit the present invention.As description of the invention and it is used in the attached claims, unless on Other situations are hereafter clearly indicated, otherwise " one " of singular, "one" and "the" are intended to include plural form.

Referring to Fig. 1, Fig. 1 is a kind of exemplary flow of the batch screening technique of voice messaging provided by the embodiments of the present application Figure.The batch screening technique of the voice messaging is applied in management server.The management server management server passes through training Before collection is trained neural network, batch pretreatment is carried out to the voice messaging to be processed in the training set got, such as The voice messaging to be processed of damage, too short voice messaging to be processed are rejected from training set, it can by above-mentioned batch processing It efficiently and accurately realizes the unified screening to multiple voice messagings to be processed in training set, and reduces the mistake of screening process, In order to accurately realize the training of neural network.As shown in Figure 1, the step of this method includes step S101~S104.

Step S101 obtains the address of the file where preset training set, and will if receiving message processing directives As import folders address, the training set includes multiple voice messagings to be processed for the address.

In the present embodiment, it in order to be trained neural network to obtaining corresponding speech recognition modeling, needs pair Voice messaging to be processed in the training set got carries out the pretreatment of batch, meets wanting for trained neural network to reach It asks, improves the precision for the speech recognition modeling that training obtains.And training set can be it is pre-set, it can from each energy It enough carries out collection voice messaging in the application program of voice messaging acquisition to be stored, can also be through different recording personnel Recording is carried out to obtain voice messaging, the voice messaging being stored in training set at this time is voice messaging to be processed.When Management server receives Client-initiated message processing directives, then then obtaining the file where pre-set training set Address, and using the address as import folders address, in order to which user is accurately located import folders, i.e. input file Training set is stored in folder.

Step S102, with determining preset threshold and preset first export folders according to the message processing directives Location, the second export folders address, wherein first export folders address is the ground that the first export folders is saved Location, first export folders include multiple readable text files, and second export folders address is the second output text The address that part double-layered quilt saves.

It in the present embodiment, may include pre-set preset threshold and preset in the message processing directives One export folders address, the second export folders address, in order to which management server is after receiving message processing directives, By analyzing the message processing directives to obtain parameter needed for audio screening process, these parameters may include above-mentioned Preset threshold and preset first export folders address, the second export folders address.Meanwhile first output file The address that folder address is saved as the first export folders is corresponding first export folders, first output file Folder may include having multiple readable text files, and readable text file herein can be the empty text file of document name, It is also possible to prestore the text file of enough memory spaces, i.e., the storage that readable File can be used for carrying out data is protected It stays.Second export folders address is the address that the second export folders is saved, and is corresponding second export folders.

Wherein, audio is screened and is mainly sieved according to the characteristic information of of voice messaging to be processed itself Choosing, therefore need to preset preset threshold, whether the characteristic information that voice messaging to be processed is defined by preset threshold meets It is required that and the satisfactory voice messaging to be processed is stored to the second output text corresponding to the second export folders address In part folder.For example, preset threshold can be the relevant threshold values of audio duration with voice messaging, adopting with voice messaging can be The relevant threshold values of number of samples can also be the relevant threshold values with the zoom factor of voice messaging, be also possible to voice messaging Relevant threshold values of maximum amplitude value etc..In addition, the first export folders corresponding to the first export folders address can be used for Store intermediate file.

In one embodiment, as shown in Fig. 2, the step S102 may include step S201~S202.

Step S201 parses the message processing directives to obtain corresponding presupposed information.

Wherein, the message processing directives include the pre-set much information of user, in order to which management server exists After obtaining the message processing directives, corresponding audio screening is carried out according to pre-set much information.

Step S202 determines preset threshold and preset first export folders address, according to the presupposed information Two export folders addresses.

Wherein, in order to realize the accurate screening of voice messaging, management server can be determined according to the presupposed information exist The parameter needed in audio screening process, such as preset threshold and preset first export folders address, the second output file Press from both sides address.For example, the preset threshold can be the relevant preset duration threshold values with the audio duration of voice messaging, can also be The relevant default sampling number threshold values with the sampled point of voice messaging.

Step S103 reads the import folders address to obtain all voice messagings to be processed.

In the present embodiment, management server can read the import folders address, and according to the import folders Address determines corresponding import folders, so that all voice messagings to be processed in corresponding import folders are obtained, with Convenient for carrying out batch processing to all voice messagings to be processed.

Step S104 calls preset voice screening script to extract the feature letter of each voice messaging to be processed respectively Breath, and the characteristic information of each voice messaging to be processed is respectively written into different readable text files.

In the present embodiment, preset format conversion script refers to and pre-set can be screened to voice messaging Script, such as preset voice screening script can be SOX script, can also be other for carrying out audio screening certainly Script, program or function etc..It, can batch after management server executes the preset audio screening script of calling by Python Extract the characteristic information of each voice messaging to be processed in ground.It may include wherein voice messaging about the characteristic information of voice messaging The information such as audio duration, sampling number, zoom factor and maximum amplitude value.For the ease of being carried out to voice messaging to be processed Specific analysis, the characteristic information of each voice messaging to be processed can be stored into a corresponding readable text file into Row record, under normal circumstances, the corresponding different readable text file of different voice messagings to be processed.As optional, The readable text file can be TXT file, naturally it is also possible to be other text files convenient for read-write, such as word file.

In addition, for the ease of unified management, all readable text files can be stored in preset first output file It presss from both sides in the first export folders corresponding to address, in order to which management server is called the operation such as extraction as needed.

Step S105, the characteristic information being successively read in all readable text files in first export folders To judge whether the characteristic information in the readable text file matches with preset threshold.

In the present embodiment, management server can be successively read all machine readable texts in the first export folders herein Characteristic information in part, and acquired characteristic information is gone to match with preset threshold, so that it is determined that meeting preset threshold The voice messaging for the requirement defined.

In one embodiment, as shown in figure 3, the preset threshold includes preset duration threshold values, the characteristic information includes Audio duration, the step S105 may include step S301~S303.

Step S301, when the audio being successively read in all readable text files in first export folders It is long.

Wherein, management server can be successively from the sound read in all readable text files in the first export folders Frequency duration, each corresponding voice messaging to be processed of readable text file, therefore knowing that management server is extracted should be The audio duration of each voice messaging to be processed.

Step S302, judges whether the audio duration in the readable text file is greater than or equal to preset duration threshold values.

Wherein, when the audio duration of voice messaging is less than preset duration threshold values, the voice messaging may be indicated in training Good effect can not be played during neural network, to guarantee training result, when can remain larger than or be equal to default The voice messaging of long threshold values.So when need that batch is gone to judge that audio duration in the readable text file is greater than or equal to it is pre- If duration threshold values.The preset duration threshold values can carry out equipment according to the actual demand situation of user, in the present embodiment, not It limits.

Step S303 determines if the audio duration in the readable text file is greater than or equal to preset duration threshold values Characteristic information in the readable text file matches with preset threshold.

Wherein, when the audio duration in readable text file is greater than or equal to preset duration threshold values, then then can be determined that Characteristic information in the readable text file is matched with preset threshold, then shows that the readable text file is corresponding at this time Voice messaging to be measured be effective voice messaging.

In addition, the step S105 may include:

Step S303a can described in judgement if the audio duration in the readable text file is less than preset duration threshold values The characteristic information read in text file is not matched that with preset threshold.When the audio duration in the readable text file is less than in advance If duration threshold values when, need to screen out voice messaging to be processed corresponding to the readable text file.

As further embodiment, can also include: before the step S303

Step S304 is successively read if the audio duration in the readable text file is greater than or equal to preset duration threshold values Take the sampling number in all readable text files in first export folders.

Wherein, after the audio duration of voice messaging to be measured meets certain require, in order to further determine voice to be measured Whether information is effective information, it is also necessary to analyze from sampling number voice messaging to be measured, therefore need successively to obtain Take the sampling number in all readable text files.

Step S305 judges that the sampling number in the readable text file is greater than or equal to default sampling number.

Wherein, in order to ensure voice messaging to be measured is relatively sharp in playing process, the voice to be measured of selection is needed at this time The sampling number of information needs to be greater than or equal to default sampling number, which can carry out according to the demand of user Corresponding setting, in the present embodiment and without limitation.

Specifically, executing institute if the sampling number in the readable text file is greater than or equal to default sampling number State the step of characteristic information determined in the readable text file matches with preset threshold, i.e. execution step S303.Wherein, If being greater than or equal to default sampling number using points in the readable text file, show that the readable text file institute is right The voice messaging to be processed answered is effective voice messaging, therefore can be determined that the characteristic information in the readable text file and pre- What if threshold values matched.

In addition, executing the judgement institute if the sampling number in the readable text file is less than default sampling number The step of characteristic information and preset threshold in readable text file do not match that is stated, S303a is thened follow the steps.Wherein, work as institute When stating the audio duration in readable text file less than preset duration threshold values, then show that voice messaging to be measured at this time is not to be inconsistent Requirement is closed, needs to screen out voice messaging to be processed corresponding to the readable text file.

Step S106, if the characteristic information in the readable text file matches with preset threshold, by the machine readable text Voice messaging to be measured corresponding to this document is stored into second export folders for batch signatures.

In the present embodiment, to call the voice messaging for having carried out audio screening convenient for management server, basis is needed Second export folders address determines the position of the second export folders, and characteristic information obtained from being screened in batches and pre- If the voice messaging to be measured that threshold values matches is stored into the second export folders.

As further embodiment, the message processing directives include the 4th export folders address, and the method is also May include:

Step S107, it is if the characteristic information in the readable text file is not matched that with preset threshold, this is readable Voice messaging to be measured corresponding to text file is stored to the 4th output file corresponding to the 4th export folders address In folder.

In the present embodiment, if the characteristic information in the readable text file is not matched that with preset threshold, show Voice messaging to be processed corresponding to the readable text file be it is undesirable, can be according to described for the ease of management Four export folders addresses determine its corresponding 4th export folders, and by voice to be measured corresponding to the readable text file Information is stored into the 4th export folders.

To sum up, the present embodiment can be realized efficiently and accurately by above-mentioned batch processing to multiple wait locate in training set The unified screening of voice messaging is managed, and reduces the mistake of screening process, in order to accurately realize the training of neural network.

Referring to Fig. 4, Fig. 4 be another embodiment of the present invention provides a kind of voice messaging batch screening technique signal Flow chart.As shown in figure 4, the step of this method includes step S401~S404.Wherein with the step S101- in above-described embodiment The relevant explanation of S106 similar step and it is described in detail that details are not described herein, the following detailed description of to be increased in the present embodiment The step of adding.

Step S401 obtains the address of the file where preset training set, and will if receiving message processing directives As import folders address, the training set includes multiple voice messagings to be processed for the address.

Step S402, with determining preset threshold and preset first export folders according to the message processing directives Location, the second export folders address, wherein first export folders address is the ground that the first export folders is saved Location, first export folders include multiple readable text files, and second export folders address is the second output text The address that part double-layered quilt saves.

Step S403 reads the import folders address to obtain all voice messagings to be processed.

Step S403a, the feature letter being successively read in all readable text files in first export folders Cease the audio format to determine each voice messaging to be processed respectively.

In the present embodiment, in order to further assure that management server preferably call preset voice screening script with The characteristic information of each voice messaging to be processed of batch extracting, it is also necessary to carry out the audio format of each audio-frequency information to be processed Unified conversion process.Wherein, audio format refer to specifically can be may include AIFF, MPEG, MP3, MIDI, WMA, The formats such as FLAC, APE, AMR, WAV, management server are all readable in first export folders by being successively read Characteristic information in text file can determine the audio format of each voice messaging to be processed in batches.

Step S403b is kept described to be processed if the audio format of the voice messaging to be processed is preset audio format The audio format of voice messaging is constant.

Wherein, when the audio format of the voice messaging to be processed is preset audio format, then show at this time to be processed Voice messaging does not need to be handled, it can keeps the audio format of the voice messaging to be processed constant.For example, when default Audio format is WAV format, and when the audio format of voice messaging to be processed is also WAV format, then it does not need to carry out audio lattice Formula conversion.

Step S403c, if the audio format of the voice messaging to be processed is not preset audio format, according to preset sound The audio format of the voice messaging to be processed is converted to preset audio format by frequency format transformation rule.

It wherein, is that basis is needed to preset when the audio format of voice messaging to be processed is not preset audio format Audio format transformation rule audio format conversion is carried out to it.For example, the preset audio format transformation rule can be Audio format conversion is carried out to voice messaging to be processed by Ffmpeg script.When preset audio format is WAV format, and wait locate When the audio format for managing voice messaging is also MP3 format, the audio format by the voice to be processed is needed to convert.

Step S404 calls preset voice screening script to extract the feature letter of each voice messaging to be processed respectively Breath, and the characteristic information of each voice messaging to be processed is respectively written into different readable text files.

Step S405, the characteristic information being successively read in all readable text files in first export folders To judge whether the characteristic information in the readable text file matches with preset threshold.

Step S406, if the characteristic information in the readable text file matches with preset threshold, by the machine readable text Voice messaging to be measured corresponding to this document is stored into second export folders for batch signatures.

As further embodiment, the message processing directives include preset third export folders address, described Method can with the following steps are included:

Step S407, the characteristic information being successively read in all readable text files in first export folders To judge whether the type of the characteristic information in each readable text file matches with the type of preset characteristic information respectively.

Wherein, according to the type of the type of the characteristic information of voice messaging to be processed and preset characteristic information whether Match, so that it may judge whether it is effective voice messaging, if such as characteristic information need include voice messaging audio duration, Sampling number, zoom factor and maximum amplitude value, and the preservation in readable text file only only has audio duration and adopts Number of samples, then then the corresponding voice messaging to be processed of the readable text part is invalid information.And in readable text file Save include audio duration, sampling number, zoom factor and maximum amplitude value, then show the readable text part it is corresponding to Processing voice messaging is effective information.

Step S408, if the type of the characteristic information in the readable text file and the type of preset characteristic information are not Match, determine that voice messaging to be measured is invalid voice information corresponding to the readable text file, and by the voice to be measured Information is stored into third export folders corresponding to third export folders address.

Wherein, in order to further distinguish the property of voice messaging to be measured, it can will be determined as the to be measured of invalid voice information Voice messaging is stored into third export folders corresponding to third export folders address.

Step S409, it is if the characteristic information in the readable text file is not matched that with preset threshold, this is readable Voice messaging to be measured corresponding to text file is stored to the 4th output file corresponding to the 4th export folders address In folder.

Those having ordinary skill in the art is understood that realize all or part of the process in above-described embodiment method, is that can lead to Computer program is crossed to instruct relevant hardware and complete, the program can be stored in a computer-readable storage medium In, the program is when being executed, it may include such as the process of the embodiment of above-mentioned each method.Wherein, the storage medium can be magnetic Dish, CD, read-only memory (Read-Only Memory, ROM) etc..

Referring to Fig. 5, a kind of corresponding above-mentioned batch screening technique of voice messaging, the embodiment of the present invention also propose a kind of language The batch screening plant of message breath, the device 100 include: address acquisition unit 101, information determination unit 102, information reading list Member 103, feature extraction unit 104, information judging unit 105 and the first storage unit 106.

The address acquisition unit 101, if obtaining the text where preset training set for receiving message processing directives The address of part folder, and using the address as import folders address, the training set includes multiple voice messagings to be processed.

The information determination unit 102, for determining preset threshold and preset according to the message processing directives One export folders address, the second export folders address, wherein first export folders address is the first output file The address that double-layered quilt saves, first export folders includes multiple readable text files, second export folders address The address being saved for the second export folders.

In one embodiment, as shown in fig. 6, the information determination unit 102 may include instruction resolution unit 201 and Information extraction unit 202.

Described instruction resolution unit 201, for parsing the message processing directives to obtain corresponding presupposed information.

The information extraction unit 202, for determining preset threshold and preset first defeated according to the presupposed information Folder address, the second export folders address out.

The Information reading unit 103, for reading the import folders address to obtain all voices to be processed Information.

The feature extraction unit 104, for calling preset voice screening script to extract each language to be processed respectively The characteristic information of message breath, and the characteristic information of each voice messaging to be processed is respectively written into different readable text files In.

The information judging unit 105, all readable texts for being successively read in first export folders Characteristic information in file is to judge whether the characteristic information in the readable text file matches with preset threshold.

In one embodiment, as shown in fig. 7, the preset threshold includes preset duration threshold values, the characteristic information includes Audio duration, the information judging unit 105 may include duration reading unit 301, duration judging unit 302 and first sentence Order member 303.

The duration reading unit 301, all readable texts for being successively read in first export folders Audio duration in file.

The duration judging unit 302, for judging whether the audio duration in the readable text file is greater than or waits In preset duration threshold values.

First judging unit 303, if being greater than or equal to for the audio duration in the readable text file default Duration threshold values then determines that the characteristic information in the readable text file matches with preset threshold.

In addition, the information judging unit 105 may include:

Second judging unit 303a, if being less than preset duration threshold values for the audio duration in the readable text file, Then determine that the characteristic information in the readable text file is not matched that with preset threshold.Sound in the readable text file When frequency duration is less than preset duration threshold values, need to sieve voice messaging to be processed corresponding to the readable text file It removes.

As further embodiment, can also include: before first judging unit 303

Numerical value reading unit 304, if being greater than or equal to preset duration for the audio duration in the readable text file Threshold values, the sampling number being successively read in all readable text files in first export folders.

Numerical value judging unit 305, for judging that the sampling number in the readable text file is greater than or equal to default adopt Number of samples.

Specifically, in one embodiment, if first judging unit 303 is also used to adopting in the readable text file Number of samples is greater than or equal to default sampling number, determines characteristic information and preset threshold phase in the readable text file Match.Wherein, if being greater than or equal to default sampling number using points in the readable text file, show the readable text Voice messaging to be processed corresponding to file is effective voice messaging, therefore can be determined that the feature in the readable text file What information and preset threshold matched.

In addition, in one embodiment, if the second judging unit 303a is also used to adopting in the readable text file Number of samples is less than default sampling number, determines that the characteristic information in the readable text file is not matched that with preset threshold.Its In, when the audio duration in the readable text file is less than preset duration threshold values, then show voice to be measured letter at this time Breath be it is undesirable, need to screen out voice messaging to be processed corresponding to the readable text file.

First storage unit 106, if for characteristic information and preset threshold phase in the readable text file Match, then stores voice messaging to be measured corresponding to the readable text file into second export folders to be used for batch Output.

As further embodiment, the message processing directives include the 4th export folders address, described device 100 Can also include:

Second storage unit 107, if the characteristic information in the readable text file is not matched that with preset threshold, Then voice messaging to be measured corresponding to the readable text file is stored to corresponding to the 4th export folders address In four export folders.

Referring to Fig. 8, a kind of corresponding above-mentioned batch screening technique of voice messaging, another embodiment of the present invention also propose one The batch screening plant of kind voice messaging, the device 400 include: address acquisition unit 401, information determination unit 402, information reading Take unit 403, format determination unit 403a, format holding unit 403b, format conversion unit 403c, feature extraction unit 404, Information judging unit 405 and the first storage unit 406.

Address acquisition unit 401, if obtaining the file where preset training set for receiving message processing directives Address, and using the address as import folders address, the training set includes multiple voice messagings to be processed.

Information determination unit 402, for determining preset threshold and preset first defeated according to the message processing directives Folder address, the second export folders address out, wherein first export folders address is the first output file double-layered quilt The address of preservation, first export folders include multiple readable text files, and second export folders address is the The address that two export folders are saved.

Information reading unit 403, for reading the import folders address to obtain all voice messagings to be processed.

Format determination unit 403a, all readable text files for being successively read in first export folders In characteristic information to determine the audio format of each voice messaging to be processed respectively.

Format holding unit 403b is protected if the audio format for the voice messaging to be processed is preset audio format The audio format for holding the voice messaging to be processed is constant.

Format conversion unit 403c, if the audio format for the voice messaging to be processed is not preset audio format, The audio format of the voice messaging to be processed is converted into preset audio format according to preset audio format transformation rule.

Feature extraction unit 404 extracts each voice letter to be processed for calling preset voice screening script respectively The characteristic information of breath, and the characteristic information of each voice messaging to be processed is respectively written into different readable text files.

Information judging unit 405, all readable text files for being successively read in first export folders In characteristic information to judge whether the characteristic information in the readable text file matches with preset threshold.

First storage unit 406, if the characteristic information in the readable text file matches with preset threshold, Voice messaging to be measured corresponding to the readable text file is stored into second export folders to be used for batch signatures.

As further embodiment, the message processing directives include preset third export folders address, described Device 400 may further include the following units:

Type judging unit 407, all readable text files for being successively read in first export folders In characteristic information with judge respectively the characteristic information in each readable text file type whether with preset characteristic information Type match.

Third judging unit 408, if type and preset feature for the characteristic information in the readable text file The type of information do not match that, determines that voice messaging to be measured is invalid voice information corresponding to the readable text file, and The voice messaging information to be measured is stored into third export folders corresponding to third export folders address.

As further embodiment, the message processing directives include the 4th export folders address, described device 400 Can also include:

Second storage unit 409, if the characteristic information in the readable text file is not matched that with preset threshold, Then voice messaging to be measured corresponding to the readable text file is stored to corresponding to the 4th export folders address In four export folders.

It should be noted that it is apparent to those skilled in the art that, the batch sieve of above-mentioned voice messaging The specific implementation process of screening device 100 and each unit, can be with reference to the corresponding description in preceding method embodiment, for description Convenienct and succinct, details are not described herein.

As seen from the above, in hardware realization, the above address acquisition unit 101, information determination unit 102, information are read Unit 103, feature extraction unit 104, information judging unit 105 and first storage unit 106 etc. can be interior in the form of hardware Be embedded in or the device reported a case to the security authorities independently of life insurance in, depositing for the batch screening plant of voice messaging can also be stored in a software form In reservoir, the corresponding operation of above each unit is executed so that processor calls.The processor can be central processing unit (CPU), microprocessor, single-chip microcontroller etc..

The batch screening plant of above-mentioned voice messaging can be implemented as a kind of form of computer program, and computer program can To be run in computer equipment as shown in Figure 9.

Fig. 9 is a kind of structure composition schematic diagram of computer equipment of the present invention.The equipment can be server, wherein clothes Business device can be independent server, be also possible to the server cluster of multiple server compositions.

Referring to Fig. 9, which includes processor 502, memory, the memory connected by system bus 501 Reservoir 504 and network interface 505, wherein memory may include non-volatile memory medium 503 and built-in storage 504.

The non-volatile memory medium 503 can storage program area 5031 and computer program 5032, the computer program 5032 are performed, and processor 502 may make to execute a kind of batch screening technique of voice messaging.

The processor 502 supports the operation of entire computer equipment 500 for providing calculating and control ability.

The built-in storage 504 provides environment for the operation of the computer program 5032 in non-volatile memory medium 503, should When computer program 5032 is executed by processor 502, processor 502 may make to execute a kind of batch screening side of voice messaging Method.

The network interface 505 is used to carry out network communication with other equipment.It will be understood by those skilled in the art that in Fig. 9 The structure shown, only the block diagram of part-structure relevant to application scheme, does not constitute and is applied to application scheme The restriction of computer equipment 500 thereon, specific computer equipment 500 may include more more or fewer than as shown in the figure Component perhaps combines certain components or with different component layouts.

Wherein, the processor 502 is for running computer program 5032 stored in memory, to realize institute as above Step in the batch screening technique for the voice messaging stated.

It should be appreciated that in the embodiment of the present application, processor 502 can be central processing unit (Central Processing Unit, CPU), which can also be other general processors, digital signal processor (Digital Signal Processor, DSP), specific integrated circuit (Application Specific Integrated Circuit, ASIC), ready-made programmable gate array (Field-Programmable Gate Array, FPGA) or other programmable logic Device, discrete gate or transistor logic, discrete hardware components etc..Wherein, general processor can be microprocessor or Person's processor is also possible to any conventional processor etc..

Those of ordinary skill in the art will appreciate that be realize above-described embodiment method in all or part of the process, It is that relevant hardware can be instructed to complete by computer program.The computer program can be stored in a storage medium, The storage medium is computer readable storage medium.The computer program is held by least one processor in the computer system Row, to realize the process step of the embodiment of the above method.

Therefore, the present invention also provides a kind of storage mediums.The storage medium can be computer readable storage medium.This is deposited Storage media is stored with computer program, which makes processor execute voice letter as described above when being executed by processor Step in the batch screening technique of breath.

The storage medium can be USB flash disk, mobile hard disk, read-only memory (Read-Only Memory, ROM), magnetic disk Or the various computer readable storage mediums that can store program code such as CD.

Those of ordinary skill in the art may be aware that list described in conjunction with the examples disclosed in the embodiments of the present disclosure Member and algorithm steps, can be realized with electronic hardware, computer software, or a combination of the two, in order to clearly demonstrate hardware With the interchangeability of software, each exemplary composition and step are generally described according to function in the above description.This A little functions are implemented in hardware or software actually, the specific application and design constraint depending on technical solution.Specially Industry technical staff can use different methods to achieve the described function each specific application, but this realization is not It is considered as beyond the scope of this invention.

In several embodiments provided by the present invention, it should be understood that disclosed device and method can pass through it Its mode is realized.For example, the apparatus embodiments described above are merely exemplary.For example, the division of each unit, only Only a kind of logical function partition, there may be another division manner in actual implementation.Such as multiple units or components can be tied Another system is closed or is desirably integrated into, or some features can be ignored or not executed.

The steps in the embodiment of the present invention can be sequentially adjusted, merged and deleted according to actual needs.This hair Unit in bright embodiment device can be combined, divided and deleted according to actual needs.In addition, in each implementation of the present invention Each functional unit in example can integrate in one processing unit, is also possible to each unit and physically exists alone, can also be with It is that two or more units are integrated in one unit.

If the integrated unit is realized in the form of SFU software functional unit and when sold or used as an independent product, It can store in one storage medium.Based on this understanding, technical solution of the present invention is substantially in other words to existing skill The all or part of part or the technical solution that art contributes can be embodied in the form of software products, the meter Calculation machine software product is stored in a storage medium, including some instructions are used so that a computer equipment (can be a People's computer, terminal or network equipment etc.) it performs all or part of the steps of the method described in the various embodiments of the present invention.

The above description is merely a specific embodiment, but scope of protection of the present invention is not limited thereto, any Those familiar with the art in the technical scope disclosed by the present invention, can readily occur in various equivalent modifications or replace It changes, these modifications or substitutions should be covered by the protection scope of the present invention.Therefore, protection scope of the present invention should be with right It is required that protection scope subject to.

Claims

1. a kind of batch screening technique of voice messaging, which is characterized in that the described method includes:

If receiving message processing directives, the address of the file where preset training set is obtained, and using the address as defeated Enter folder address, the training set includes multiple voice messagings to be processed；

Preset threshold and preset first export folders address, the second output file are determined according to the message processing directives Press from both sides address, wherein first export folders address is the address that the first export folders is saved, the first output text Part folder includes multiple readable text files, and second export folders address is the address that the second export folders is saved；

Call preset voice screening script to extract the characteristic information of each voice messaging to be processed respectively, and will be each wait locate The characteristic information of reason voice messaging is respectively written into different readable text files；

Be successively read the characteristic information in all readable text files in first export folders with judge it is described can Whether the characteristic information read in text file matches with preset threshold；

It, will be corresponding to the readable text file if the characteristic information in the readable text file matches with preset threshold Voice messaging to be measured is stored into second export folders for batch signatures.

2. the method as described in claim 1, which is characterized in that it is described according to the message processing directives determine preset threshold with And the step of preset first export folders address, the second export folders address, comprising:

The message processing directives are parsed to obtain corresponding presupposed information；

Preset threshold and preset first export folders address, the second export folders are determined according to the presupposed information Location.

3. the method as described in claim 1, which is characterized in that the preset threshold includes preset duration threshold values, the feature Information includes audio duration, the feature in all readable text files being successively read in first export folders Information is the step of whether characteristic information in the readable text file matches with preset threshold judged, comprising:

The audio duration being successively read in all readable text files in first export folders；

Judge whether the audio duration in the readable text file is greater than or equal to preset duration threshold values；

If the audio duration in the readable text file is greater than or equal to preset duration threshold values, determine the machine readable text herein Characteristic information in part matches with preset threshold.

4. method as claimed in claim 3, which is characterized in that the characteristic information determined in the readable text file with Before the step of preset threshold matches, comprising:

If the audio duration in the readable text file is greater than or equal to preset duration threshold values, it is successively read first output The sampling number in all readable text files in file；

Judge that the sampling number in the readable text file is greater than or equal to default sampling number；

If the sampling number in the readable text file is greater than or equal to default sampling number, determine the machine readable text herein Characteristic information in part matches with preset threshold.

5. the method as described in claim 1, which is characterized in that described to call preset voice screening script every to extract respectively The characteristic information of a voice messaging to be processed, and the characteristic information of each voice messaging to be processed is respectively written into different readable Before step in text file, comprising:

It is every to determine respectively to be successively read the characteristic information in all readable text files in first export folders The audio format of a voice messaging to be processed；

If the audio format of the voice messaging to be processed is preset audio format, the audio of the voice messaging to be processed is kept Format is constant；

If the audio format of the voice messaging to be processed is not preset audio format, according to preset audio format transformation rule The audio format of the voice messaging to be processed is converted into preset audio format.

6. method as claimed in claim 5, which is characterized in that the message processing directives include preset third output file Address is pressed from both sides, the method also includes:

It is every to judge respectively to be successively read the characteristic information in all readable text files in first export folders Whether the type of the characteristic information in a readable text file matches with the type of preset characteristic information；

If the type of the characteristic information in the readable text file and the type of preset characteristic information do not match that, institute is determined Stating voice messaging to be measured corresponding to readable text file is invalid voice information, and by the voice messaging information to be measured store to In third export folders corresponding to third export folders address.

7. the method as described in claim 1, which is characterized in that the message processing directives are including the 4th export folders Location, the method also includes:

It, will be corresponding to the readable text file if the characteristic information in the readable text file is not matched that with preset threshold Voice messaging to be measured store into the 4th export folders corresponding to the 4th export folders address.

8. a kind of batch screening plant of voice messaging, which is characterized in that including for executing such as any one of claim 1-7 institute State the unit of method.

9. a kind of computer equipment, which is characterized in that the computer equipment includes memory and processor, on the memory It is stored with computer program, the processor is realized as described in any one of claim 1-7 when executing the computer program Method.

10. a kind of computer readable storage medium, which is characterized in that the storage medium is stored with computer program, the meter Calculation machine program makes the processor execute such as method of any of claims 1-7 when being executed by processor.