[go: up one dir, main page]

WO2020211004A1 - Procédé et dispositif de traitement de signal audio, et support de stockage - Google Patents

Procédé et dispositif de traitement de signal audio, et support de stockage Download PDF

Info

Publication number
WO2020211004A1
WO2020211004A1 PCT/CN2019/083006 CN2019083006W WO2020211004A1 WO 2020211004 A1 WO2020211004 A1 WO 2020211004A1 CN 2019083006 W CN2019083006 W CN 2019083006W WO 2020211004 A1 WO2020211004 A1 WO 2020211004A1
Authority
WO
WIPO (PCT)
Prior art keywords
frequency domain
audio signal
domain data
data
channels
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2019/083006
Other languages
English (en)
Chinese (zh)
Inventor
莫品西
吴晟
边云锋
薛政
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SZ DJI Technology Co Ltd
Original Assignee
SZ DJI Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SZ DJI Technology Co Ltd filed Critical SZ DJI Technology Co Ltd
Priority to CN201980005583.8A priority Critical patent/CN111345047A/zh
Priority to PCT/CN2019/083006 priority patent/WO2020211004A1/fr
Publication of WO2020211004A1 publication Critical patent/WO2020211004A1/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones

Definitions

  • the embodiments of the present invention relate to the field of signal processing, and in particular, to an audio signal processing method, device, and storage medium.
  • the dynamic range of a microphone refers to the ratio of the maximum sound and the minimum sound without distortion that the microphone can record.
  • the dynamic range of an ordinary microphone generally does not exceed 100dBA.
  • circuit noise and quantization noise usually there are slight circuit noise and quantization noise in the signal, and the effective dynamic range will be further compressed.
  • recording requires a higher dynamic range, and ordinary microphones cannot meet the demand.
  • the dynamic range of an ordinary microphone is still limited, and it is difficult to record both large and small signal sounds at the same time. If it meets the requirements for small voice recording, loud voice overload will occur; if it meets loud voice recording, then There will be a low signal-to-noise ratio problem for small sounds.
  • the second method two or more microphones covering different dynamic ranges are required to record, and finally only one sound signal is synthesized. The increase in the number of microphones and the selection, adjustment, and testing of multiple microphones significantly increase the cost.
  • the embodiments of the present invention provide an audio signal processing method, device, and storage medium, so as to effectively increase the dynamic range of the recording system.
  • the first aspect of the embodiments of the present invention is to provide an audio signal processing method, including:
  • a plurality of preprocessing circuits are used to process the analog audio signal to be processed to obtain multiple digital audio signals.
  • Each of the plurality of preprocessing circuits includes an amplifier and an analog-to-digital converter. The analog gains of the amplifiers are different;
  • the frequency domain fusion data is converted into a time domain audio signal, and an output audio signal is obtained according to the time domain audio signal.
  • the second aspect of the embodiments of the present invention is to provide an audio signal processing device, including: a memory and a processor;
  • the memory is used to store program codes
  • the processor calls the program code, and when the program code is executed, is used to perform the following operations:
  • a plurality of preprocessing circuits are used to process the analog audio signal to be processed to obtain multiple digital audio signals.
  • Each of the plurality of preprocessing circuits includes an amplifier and an analog-to-digital converter. The analog gains of the amplifiers are different;
  • the frequency domain fusion data is converted into a time domain audio signal, and an output audio signal is obtained according to the time domain audio signal.
  • the third aspect of the embodiments of the present invention is to provide a recording system, including:
  • Microphone used to collect analog audio signals
  • the fourth aspect of the embodiments of the present invention is to provide a computer-readable storage medium having a computer program stored thereon, and the computer program is executed by a processor to implement the method described in the first aspect.
  • the audio signal processing method, device and storage medium provided in this embodiment use multiple preprocessing circuits to process analog audio signals to be processed to obtain multiple digital audio signals, wherein each of the multiple preprocessing circuits is preprocessed
  • the processing circuit includes an amplifier and an analog-to-digital converter, and the analog gains of the amplifiers of each pre-processing circuit are different; frequency domain conversion is performed on the multiple channels of digital audio signals to obtain multiple channels of frequency domain data; One or at least two channels of target frequency domain data among the channels of frequency domain data determine frequency domain fusion data; the frequency domain fusion data is converted into a time domain audio signal, and an output audio signal is obtained according to the time domain audio signal.
  • the method of this embodiment can effectively improve the dynamic range of the recording system, has high sensitivity, and can reduce the noise floor and meet the requirement of high signal-to-noise ratio.
  • FIG. 1 is a flowchart of an audio signal processing method provided by an embodiment of the present invention
  • FIG. 2 is a schematic diagram of multiple preprocessing circuits in parallel provided by an embodiment of the present invention
  • FIG. 3 is a flowchart of an audio signal processing method provided by another embodiment of the present invention.
  • FIG. 4 is a flowchart of an audio signal processing method provided by another embodiment of the present invention.
  • FIG. 5 is a flowchart of an audio signal processing method provided by another embodiment of the present invention.
  • FIG. 6 is a flowchart of an audio signal processing method provided by another embodiment of the present invention.
  • FIG. 7 is a flowchart of an audio signal processing method provided by another embodiment of the present invention.
  • FIG. 8 is a flowchart of an audio signal processing method provided by another embodiment of the present invention.
  • Figure 9 is a graph of the mapping function of the nonlinear mapping
  • FIG. 10 is a flowchart of an audio signal processing method provided by another embodiment of the present invention.
  • Figure 11a is a time-domain signal diagram of the digital audio signal of the first frame output by the first preprocessing circuit
  • 11b is a time-domain signal diagram of the digital audio signal of the first frame output by the second preprocessing circuit
  • FIG. 12a is a time-frequency spectrum diagram of the digital audio signal of the first frame output by the first preprocessing circuit
  • Figure 12b is a time-frequency spectrum diagram of the digital audio signal of the first frame output by the second preprocessing circuit
  • Figure 13a is a time-domain signal diagram of the output audio signal of the lth frame
  • Fig. 13b is a time-frequency spectrum diagram of the output audio signal of the first frame
  • Fig. 14 is a structural diagram of an audio signal processing device provided by an embodiment of the present invention.
  • a component When a component is considered to be "connected" to another component, it can be directly connected to another component or there may be a centered component at the same time.
  • FIG. 1 is a flowchart of an audio signal processing method provided by an embodiment of the present invention. As shown in FIG. 1, the audio signal processing method in this embodiment may include:
  • Step S101 Use multiple preprocessing circuits to process analog audio signals to be processed to obtain multiple digital audio signals, wherein each of the multiple preprocessing circuits includes an amplifier and an analog-to-digital converter, and each preprocessing circuit The analog gains of the amplifiers of the processing circuit are different.
  • each preprocessing circuit includes an amplifier and an analog-to-digital converter.
  • the multiple preprocessing circuits include at least two preprocessing circuits. Processing circuit; where the amplifier can be used to amplify the power of the analog audio signal, the amplification factor is usually expressed by gain, the analog gain of the amplifier of each preprocessing circuit in this embodiment is different, and further, the two adjacent analog gains The dynamic range of the pre-processing circuit at least partially overlaps; and the analog-to-digital converter can be used to convert analog audio signals into digital audio signals for subsequent signal processing.
  • the analog audio signal x to be processed can be collected by a microphone, and then input into a plurality of parallel preprocessing circuits, respectively, through power amplification of different gains, and converted into a digital audio signal x 1 ,...x i ...,x I , where I is the number of preprocessing circuits, that is, the input of each preprocessing circuit is the same analog audio signal to be processed, and the output of each preprocessing circuit is digital audio
  • I is the number of preprocessing circuits, that is, the input of each preprocessing circuit is the same analog audio signal to be processed, and the output of each preprocessing circuit is digital audio
  • the signals are different, resulting in multiple digital audio signals.
  • the analog audio signal to be processed can be intercepted into different segments with a predetermined duration as a frame signal, or a predetermined number of sampled data can be used as a frame signal after analog-to-digital conversion.
  • the subsequent audio signal processing procedures are all It can be processed in units of one frame of signal. In order to ensure the continuity of the signal, there may be a certain overlap between adjacent frame signals, that is, the tail of the previous frame signal and the head of the next frame signal have an overlap amount, thereby establishing the correlation between adjacent frames.
  • Step S102 Perform frequency domain conversion on the multiple channels of digital audio signals to obtain multiple channels of frequency domain data.
  • frequency domain conversion is performed on each digital audio signal in the multiple digital audio signals, so as to obtain frequency domain data corresponding to each digital audio signal.
  • the multiple channels of digital audio signals are subjected to frequency domain conversion to obtain multiple channels of frequency domain data, and the fusion of multiple channels of frequency domain data in the frequency domain can be realized.
  • the frequency domain conversion method may adopt Fourier transform (such as discrete Fourier transform), Laplace transform, Z transform, etc. The specific frequency domain conversion process will not be repeated here.
  • Step S103 Determine frequency domain fusion data according to one or at least two channels of target frequency domain data among the multiple channels of frequency domain data.
  • each pre-processing circuit since each pre-processing circuit amplifies the same analog audio signal to be processed with different gains, each audio signal has a different maximum and minimum value, and the maximum value of the audio signal with a larger gain And the minimum value are relatively large, and the maximum and minimum values of the audio signal with a smaller gain are relatively small, and the dynamic range is the ratio of the maximum and minimum values of the audio signal without distortion.
  • the louder sound can be adjusted by A pre-processing circuit with a relatively large gain is provided, and a smaller sound can be provided by a pre-processing circuit with a relatively small gain, so that the dynamic system of the recording system can be improved, with higher sensitivity and lower noise at the same time.
  • the size of the sound can be measured by the energy feature information of the audio signal, such as the sound pressure level of the analog audio signal or digital audio signal, or the amplitude of the analog audio signal or digital audio signal.
  • Step S104 Convert the frequency domain fusion data into a time domain audio signal, and obtain an output audio signal according to the time domain audio signal.
  • the frequency domain fusion data can be converted into a time domain audio signal.
  • the conversion method can adopt the inverse transformation of the Fourier transform (such as the discrete Fourier transform), the Lap The inverse transform of the Lass transform, the inverse transform of the Z transform, etc., will not be repeated here.
  • the output audio signal can be obtained according to the time domain audio signal.
  • operations such as compression and noise reduction can be performed on the time domain audio signal.
  • the process of obtaining the output audio signal according to the time domain audio signal also needs to splice the signals of each frame to establish the correlation between adjacent frames. Specifically, the time domain audio signal of the current frame and the time domain audio signal of the previous frame can be superimposed.
  • the audio signal processing method of this embodiment uses multiple preprocessing circuits to process analog audio signals to be processed to obtain multiple digital audio signals, wherein each of the multiple preprocessing circuits includes an amplifier and an analog audio signal. Digital converter, and the analog gains of the amplifiers of each pre-processing circuit are different; frequency domain conversion is performed on the multiple channels of digital audio signals to obtain multiple channels of frequency domain data; according to the multiple channels of frequency domain data One or at least two channels of target frequency domain data determine frequency domain fusion data; convert the frequency domain fusion data into a time domain audio signal, and obtain an output audio signal according to the time domain audio signal.
  • the method of this embodiment can effectively improve the dynamic range of the recording system, has high sensitivity, and can reduce the noise floor and meet the requirement of high signal-to-noise ratio.
  • the audio signal processing method further includes:
  • the energy characteristic information of the analog audio signal to be processed may be the sound pressure level of the analog audio signal or the amplitude of the analog audio signal, etc., and specifically may be the maximum, minimum, or instantaneous amplitude of the analog audio signal.
  • the intermediate value may also be the maximum, minimum or intermediate value of the average amplitude of the analog audio signal in a short time (preset duration).
  • the step S103 of determining frequency domain fusion data according to one or at least two channels of frequency domain data in the multiple channels of frequency domain data includes:
  • Step S201 Determine one or at least two channels of target frequency domain data from the multiple channels of frequency domain data according to the energy characteristic information
  • Step S202 Determine frequency domain fusion data according to the one or at least two channels of target frequency domain data.
  • one or at least two channels of target frequency domain data to be fused can be determined from multiple channels of frequency domain data according to the energy characteristic information of the analog audio signal to be processed, for example, according to the simulation to be processed
  • the size of the energy feature information of the audio signal determines the number of target frequency domain data. For example, the greater the energy feature information, the greater the number of target frequency domain data; in addition, the reference energy feature parameters of the processing circuit can also be preset based on the energy feature information Compare and then determine one or at least two channels of target frequency domain data to be fused.
  • Each of the multiple preset processing circuits corresponds to a different reference energy characteristic parameter.
  • the reference energy characteristic parameter is included by the preprocessing circuit Determined by the analog gain of the amplifier circuit, the reference energy characteristic parameter and the energy characteristic information belong to the same parameter, that is, the sound pressure level or amplitude of the digital audio signal output by the preprocessing circuit, which can be in the case of no distortion
  • the maximum, minimum or intermediate value of the instantaneous amplitude of the digital audio signal, or the maximum, minimum or intermediate value of the average amplitude of the digital audio signal in a short time (preset duration) when the preprocessing circuit includes the amplifier circuit The larger the analog gain of, the larger the corresponding reference energy characteristic parameter.
  • the determining one or at least two channels of target frequency domain data from the multiple channels of frequency domain data according to the energy characteristic information includes:
  • the first target frequency domain data and the second target frequency domain data are determined from the multiple channels of frequency domain data according to the energy feature information and multiple reference energy feature parameters, wherein the multiple reference energy feature parameters are based on the multiple frequency domain data.
  • the analog gain of the amplifier circuit included in each preprocessing circuit is determined.
  • At least two channels of target frequency domain data are determined from multiple channels of frequency domain data, where the first target frequency domain data may be only one channel of target frequency domain data, and of course, it may also be more than one channel of target frequency domain data; Similarly, the second target frequency domain data may also be only one channel of target frequency domain data or more than one channel of target frequency domain data.
  • determining the first target frequency domain data and the second target frequency domain data from the multiple channels of frequency domain data according to the energy characteristic information may specifically include:
  • Step S301 Determine a first reference energy characteristic parameter and a second reference energy characteristic parameter adjacent to the energy characteristic information from a plurality of reference energy characteristic parameters;
  • Step S302 Determine first target frequency domain data and second target frequency domain data from the multiple channels of frequency domain data according to the first reference energy characteristic parameter and the second reference energy characteristic parameter;
  • the first target frequency domain data and the second target frequency domain data are obtained by performing the frequency domain conversion on the first digital audio signal and the second digital audio signal in the multi-channel digital audio signal, respectively.
  • a digital audio signal and a second digital audio signal are respectively obtained from the analog audio signal to be processed by a first preprocessing circuit and a second preprocessing circuit corresponding to the first reference energy characteristic parameter and the second reference energy characteristic parameter .
  • Step S303 Determine frequency domain fusion data according to the first target frequency domain data and the second target frequency domain data.
  • the reference energy characteristic parameters (indicated by L 1 , L 2 , ..., L I , where I is the number of preset processing circuits) of multiple preset processing circuits are selected and the analog audio to be processed
  • the energy feature information of the signal (represented by L c ) is adjacent to the first reference energy feature parameter (represented by Li ′ , where 1 ⁇ i′ ⁇ I-1) and the second reference energy feature parameter (represented by L i′+ 1 means), that is, L c is between L i′ and L i′+1 , where the first reference energy characteristic parameter L i′ corresponds to the first preprocessing circuit, and the first digital audio output by the first preprocessing circuit
  • the frequency domain data obtained by frequency domain conversion of the signal is the first target frequency domain data
  • the second reference energy characteristic parameter Li′+1 corresponds to the second preprocessing circuit
  • the frequency domain data obtained by frequency domain conversion of the audio signal is the second target frequency domain data, and the frequency domain fusion data can be obtained
  • the present embodiment may be a second reference parameter L i and L i the energy characteristic parameters determining a first reference energy characteristics' + after 1 to L i 'and L i'-1 (this time need i'> 1)
  • the frequency domain data obtained by the frequency domain conversion of the digital audio signal output by the corresponding preprocessing circuit is the first target frequency domain data, which can also be L i′+1 and L i′+2 (in this case, i′ ⁇ I -1)
  • the frequency domain data obtained by frequency domain conversion of the digital audio signal output by the corresponding preprocessing circuit is the second target frequency domain data.
  • the first target frequency domain data and the second target frequency domain data may also include For more channels of frequency domain data, we will not give an example here.
  • the determining one or at least two channels of target frequency domain data from the multiple channels of frequency domain data according to the energy characteristic information includes:
  • Step S401 When the energy feature information is less than the smallest third reference energy feature parameter among the multiple reference energy feature parameters, determine from the multiple channels of frequency domain data according to the third reference energy feature parameter Third target frequency domain data;
  • the third target frequency domain data is obtained by performing the frequency domain conversion on a third digital audio signal in the multi-channel digital audio signal, and the third digital audio signal is corresponding to a third reference energy characteristic parameter Obtained by the third preprocessing circuit from the analog audio signal to be processed;
  • Step S402 Acquire the frequency domain fusion data according to the third target frequency domain data.
  • the smallest third reference energy characteristic parameter L 1 is greater than that of the analog audio signal to be processed
  • the energy feature information L c that is, L c is less than L 1
  • the third reference energy feature parameter L 1 corresponds to the third digital audio signal output by the third preprocessing circuit and the frequency domain data obtained by frequency domain conversion is the first Three target frequency domain data, and then the frequency domain fusion data can be obtained according to the third target frequency domain data.
  • the present third embodiment with reference to the energy parameters of L 1 may be smaller than in the embodiment determined to be processed analog audio signal energy characteristic information L c, the digital audio signals L 1 and L 2 corresponding to an output of the preprocessing circuit frequency-
  • the frequency domain data obtained by the domain conversion is the third target frequency domain data.
  • the third target frequency domain data may also include more channels of frequency domain data, and no examples are given here.
  • the fusion of the multiple channels of frequency domain data to obtain frequency domain fusion data further includes:
  • Step S501 When the energy feature information is greater than the largest fourth reference energy feature parameter among the multiple reference energy feature parameters, determine from the multiple channels of frequency domain data according to the fourth reference energy feature parameter The fourth target frequency domain data;
  • the fourth target frequency domain data is obtained by performing the frequency domain conversion on a fourth digital audio signal in the multi-channel digital audio signal, and the fourth digital audio signal is corresponding to a fourth reference energy characteristic parameter Obtained by the fourth preprocessing circuit from the analog audio signal to be processed;
  • Step S502 Acquire the frequency domain fusion data according to the fourth target frequency domain data.
  • the largest fourth reference energy characteristic parameter L I is smaller than that of the analog audio signal to be processed
  • the energy feature information L c that is, L c is greater than L I
  • the fourth reference energy feature parameter L I corresponds to the fourth digital audio signal output by the fourth preprocessing circuit and the frequency domain data obtained by frequency domain conversion is the first Four target frequency domain data, and the frequency domain fusion data can be obtained according to the fourth target frequency domain data.
  • the digital audio signal output by the preprocessing circuit corresponding to L I and L I-1 is the fourth target frequency domain data.
  • the fourth target frequency domain data may also include more channels of frequency domain data respectively, and no examples are given here.
  • the energy feature information L of the analog audio signal to be processed may be first c is compared with the reference energy characteristic parameters of multiple preset processing circuits, if L c is less than (or equal to) the third reference energy characteristic parameter L 1 , then steps 401-402 are executed; if L c is greater than (or equal to) the first For the fourth reference energy characteristic parameter L I , steps 501 to 502 are executed; if L c is between the adjacent third reference energy characteristic parameter L 1 and the fourth reference energy characteristic parameter L I , then steps 301-303 are executed.
  • the determining frequency domain fusion data according to the one or at least two channels of target frequency domain data includes:
  • the frequency domain fusion data is obtained by performing a spectrum superposition operation on the frequency domain data.
  • the acquiring frequency domain fusion data according to the one or more channels of frequency domain data includes:
  • performing the superposition operation can set weights for the first target frequency domain data and the second target frequency domain data, by superposing the first target frequency domain data and the second target frequency domain data with different weights. , You can get frequency domain fusion data with different dynamic ranges.
  • each of the plurality of preset processing circuits corresponds to a different reference energy characteristic parameter
  • the plurality of reference energy characteristic parameters are determined according to the analog gain of the amplifying circuit included in the plurality of preprocessing circuits , wherein the weights corresponding to the first target frequency domain data and the second target frequency domain data are based on the first preprocessing circuit and the first preprocessing circuit corresponding to the first target frequency domain data among the plurality of preset processing circuits
  • the reference energy characteristic parameter of the second preprocessing circuit corresponding to the second target frequency domain data is determined.
  • the weights corresponding to the first target frequency domain data and the second target frequency domain data are determined by the reference energy characteristic parameters of the corresponding preprocessing circuit. More specifically, the weights corresponding to the first target frequency domain data a first reference energy characteristic parameters L i and a second target frequency domain data corresponding to the second reference feature parameter L i + energy magnitude relation between the energy characteristic information L c 1 and the analog audio signal to be processed is determined, wherein the reference energy characteristic parameters of the analog audio signal closely approximates the energy characteristic information L c, the larger the weight, e.g., closer to L i L c, then the analog audio signal is closer to a digital audio signal L i corresponds to the pre-processing circuit, it is necessary to increase
  • the weight of the first target frequency domain data corresponding to L i .
  • the weight a 1 of the first target frequency domain data and the weight a 2 of the second target frequency domain data can be determined by the following formula:
  • the determining frequency domain fusion data according to one or at least two channels of target frequency domain data among the multiple channels of frequency domain data includes:
  • Step S601 Perform compression processing on the one or at least two channels of target frequency domain data according to the compression coefficient corresponding to each of the one or at least two channels of target frequency domain data;
  • Step S602 Acquire frequency domain fusion data according to one or more channels of frequency domain data after the compression processing.
  • the output of the recording system has a digital quantization range, that is, there are restrictions on the maximum amplitude and minimum amplitude of the audio signal, the maximum amplitude of the audio signal cannot be greater than the maximum threshold, and the minimum amplitude of the audio signal The value cannot be less than the minimum threshold, so when acquiring the frequency domain fusion data, it is necessary to compress the target frequency domain data to prevent the fusion frequency domain fusion data from exceeding the digital quantization range.
  • the step of compressing the target frequency domain data can be performed simultaneously with the above-mentioned superposition operation, or it can be completed before the superposition operation.
  • the compression processing is linear compression processing.
  • the compression coefficient corresponding to each channel of frequency domain data of the one or more channels of frequency domain data is determined according to the analog gain of the amplifier included in the preprocessing circuit corresponding to each channel of frequency domain data.
  • the compression coefficient corresponding to any channel of frequency domain data may be the product of the channel equalization parameter and the scaling factor of the corresponding preprocessing circuit.
  • a certain preprocessing circuit is used as a reference preprocessing circuit, and the The channel equalization parameter is the ratio of the analog gain of the preprocessing circuit to the analog gain of the reference preprocessing circuit, and the scaling factor is obtained according to the size of the digital audio signal output by the preprocessing circuit.
  • the compression coefficient corresponding to any channel of frequency domain data can be obtained by the following formula:
  • G i′ is the analog gain of the preprocessing circuit
  • G ref is the reference preprocessing
  • is the scaling factor, used to scale the frequency domain data.
  • steps S601-602 can be executed only when the frequency domain fusion data exceeds the digital quantization range, and it can also be judged before step S601 whether the frequency domain fusion data has the possibility of exceeding the digital quantization range. When the range is possible, steps S601-602 are executed.
  • the time domain audio signal is the current frame time domain audio signal
  • the step S104 in the foregoing embodiment of obtaining the output audio signal according to the time domain audio signal includes:
  • the output audio signal is determined according to the time-domain fusion audio signal of the current frame.
  • the above-mentioned audio signal processing procedures are performed in units of one frame signal.
  • there may be a certain overlap between adjacent frame signals that is, the tail of the previous frame signal and the next
  • the header of the frame signal has an overlap amount, thereby establishing correlation between adjacent frames. Therefore, after converting the frequency domain fusion data of the current frame into the time domain audio signal of the current frame in S104, the overlapped part of the time domain audio signal of the current frame and the time domain audio signal of the previous frame can be overlapped and superimposed.
  • the non-overlapping part of the time domain audio signal and the previous frame of time domain audio signal is not superimposed to obtain the current frame time domain fused audio signal, and the output audio signal can be determined according to the current frame time domain fused audio signal.
  • the determining the output audio signal according to the time-domain fusion audio signal of the current frame may specifically include:
  • Step S701 Perform compression processing on the time-domain fused audio signal according to the current frame according to a preset compression coefficient
  • Step S702 Determine the output audio signal according to the time-domain fusion audio signal of the current frame after the compression processing.
  • the time-domain fusion audio signal of the current frame may be subjected to dynamic range compression, where the preset compression coefficient may be derived from a preset nonlinear function, that is, the current frame
  • the amplitude of the time-domain fusion audio signal is non-linearly mapped according to the non-linear function, so that the smaller signal remains unchanged or amplified, and the larger signal is compressed to the digital quantization range.
  • a typical nonlinear function is shown in the curve in Figure 9.
  • is input as the abscissa to the nonlinear function, and the ordinate function value of the nonlinear function is the compression
  • f(
  • the performing frequency domain conversion on the multiple channels of digital audio signals to obtain multiple channels of frequency domain data includes:
  • an analysis window function is used to perform windowing processing on each channel of digital audio signals.
  • the analysis window function includes but is not limited to sine window function, rectangular window function, triangular window function, Hanning window function (Hanning), Gaussian window function. Window function (Gaussian), Blackman window function (Blackman), Chebyshev window function (Chebyshev), Hamming window function (Hamming), Flat top window function (Flap Top), Kaiser window function (Kaiser).
  • windowing is not required, which is equivalent to adding a rectangular window.
  • the obtaining the output audio signal according to the time domain audio signal includes:
  • the output audio signal is determined according to the time domain audio signal after the windowing process.
  • a synthesis window function is used to perform windowing processing on the time domain audio signal.
  • the synthesis window function includes, but is not limited to, a sine window function, a rectangular window function, a triangular window function, a Hanning window function (Hanning), and a Gaussian window. Function (Gaussian), Blackman window function (Blackman), Chebyshev window function (Chebyshev), Hamming window function (Hamming), flat top window function (Flap Top), Kaiser window function (Kaiser).
  • Windowing is not required, which is equivalent to adding a rectangular window.
  • FIG. 10 is a flowchart of an audio signal processing method provided by another embodiment of the present invention. As shown in FIG. 10, on the basis of the foregoing embodiment, the method in this embodiment may include:
  • Step S801 Use multiple pre-processing circuits to process analog audio signals to be processed to obtain multiple digital audio signals.
  • each of the plurality of preprocessing circuits includes an amplifier and an analog-to-digital converter, and the analog gains of the amplifiers of each preprocessing circuit are different.
  • G i 1, 2, ..., I
  • I is the number of preprocessing circuits.
  • Step S802 Extract N sampling points every M sampling points of each digital audio signal in the multi-channel digital audio signal, and use them as a frame of digital audio signal.
  • the digital audio signal of the first frame of the i-th preprocessing circuit is denoted as:
  • N is the frame length
  • M is the frame shift
  • 0 ⁇ M ⁇ N that is, the last NM sampling points of the l-1th frame are the same as the first NM sampling points of the lth frame, 0.005f s ⁇ N ⁇ f s
  • N is a power of 2.
  • Step S803 Perform windowing processing on the multiple channels of digital audio signals.
  • the first frame of digital audio signal x i (t) l of the i-th preprocessing circuit is windowed, and the windowed digital audio signal is:
  • h ana (t) is the N-point analysis window function.
  • Window functions can include but are not limited to sine window, rectangular window function, triangular window function, Hanning window, Gaussian window function, Black Blackman window function (Blackman), Chebyshev window function (Chebyshev), Hamming window function (Hamming), flat top window function (Flap Top), Kaiser window function (Kaiser) and so on.
  • Step S804 Perform frequency domain conversion according to the windowed multiple channels of digital audio signals to obtain multiple channels of frequency domain data.
  • n is the discrete spectrum sequence
  • e is the natural constant
  • It is an imaginary unit.
  • fast Fourier transform can be used to accelerate calculation.
  • Step S805 Determine frequency domain fusion data according to one or at least two channels of target frequency domain data among the multiple channels of frequency domain data.
  • multiple channels of frequency domain data can be spectrum fused, and the spectrum signals of multiple channels of frequency domain data can be used for linear superposition with a predetermined weight, and the superposition logic is an analog audio signal with a better dynamic range covering the current frame , The higher the weight.
  • the frequency domain fusion data is X syn (n) l :
  • ⁇ i is the weight corresponding to the frequency domain data of the i-th preprocessing circuit, and ⁇ i ⁇ 0.
  • a 1 and a 2 are the weights of the frequency domain data Xi ′ (n) l and Xi ′+1 (n) l , respectively, which can be determined by the following formula:
  • the frequency domain data in order to prevent the final output audio signal from exceeding the digital quantization range of the recording system, can be linearly compressed in the superposition operation, that is, each frequency domain data is multiplied by the corresponding compression coefficient, where G x ( L i′ ) is the compression coefficient corresponding to the frequency domain data X i′ (n) i , which can be determined by the following formula:
  • G i′ is the analog gain of the preprocessing circuit
  • G ref is the reference preprocessing
  • is the scaling factor, which is used to scale the frequency domain data of the i'th preprocessing circuit.
  • each compression factor can be set to 1 in the above formula.
  • Step S806 Convert the frequency domain fusion data into a time domain audio signal.
  • the first frame of frequency domain fusion data X syn (n) l is processed by the inverse transform of the discrete Fourier transform to obtain the first frame of time domain audio signal x'syn (t) l :
  • the inverse transform of the fast Fourier transform can be used to accelerate the calculation.
  • Step S807 Perform windowing processing on the time domain audio signal.
  • the time domain audio signal x'syn (t) l of the l- th frame is windowed to obtain a windowed time domain audio signal
  • h syn (t) is an N-point synthesis window function.
  • Commonly used synthesis window functions can include but are not limited to sine window, rectangular window function, triangular window function, Hanning window, Gaussian window function, Blackman window function (Blackman), Chebyshev window function (Chebyshev), Hamming window function (Hamming), flat top window function (Flap Top), Kaiser window function (Kaiser) and so on.
  • Step S808 Perform superposition processing on the current frame time domain audio signal and the historical frame time domain audio signal obtained before the current frame time domain audio signal to obtain the current frame time domain fusion audio signal.
  • the overlapped part of the time domain audio signal of the current frame and the time domain audio signal of the previous frame may be overlapped and superimposed to obtain the time domain fusion audio signal of the current frame:
  • Step S809 Perform compression processing on the time-domain fusion audio signal according to the current frame; determine the output audio signal according to the time-domain fusion audio signal of the current frame after the compression processing.
  • step S805 if the dynamic range compression of the frequency domain data is not performed in step S805, or after the dynamic range compression of the frequency domain data in step S805, the amplitude of x syn (t) l still exceeds the digital quantization range, It is necessary to perform dynamic range compression in the time domain to meet the needs of digital quantization.
  • the dynamic range compression is equivalent to the nonlinear mapping of the signal amplitude in the time domain.
  • the basic principle is that the small signal remains unchanged or appropriately amplified, and the large signal is compressed to the digital quantization range L max , where L max is the maximum quantization. Amplitude.
  • a typical mapping function as shown in the curve in Fig. 9 can be selected, and the input abscissa is the current frame time-domain fusion audio signal
  • Output the compressed and processed current frame time-domain fused audio signal y(n) l corresponding to the M points overlapped in the l-th frame by the following formula:
  • y(n) l sign(x syn (t) l )f(
  • ) n 0,1,...,M-1
  • sign( ⁇ ) is a function for judging the sign, a positive number is 1, a negative number is -1, and a zero is 0.
  • step S809 may not be performed.
  • the dynamic range compression in the frequency domain is linear compression, which does not change the timbre of the sound
  • the dynamic range compression in the time domain is non-linear compression, which will cause certain timbre distortion.
  • the current frame time-domain fusion audio signal y(n) l after the compression processing corresponding to the M point overlapped in the l-th frame is obtained, which can be used as an output audio signal, which can be recorded and saved or played in real time.
  • the audio signal processing method of this embodiment uses multiple preprocessing circuits to process analog audio signals to be processed to obtain multiple digital audio signals, wherein each of the multiple preprocessing circuits includes an amplifier and an analog audio signal. Digital converter, and the analog gains of the amplifiers of each pre-processing circuit are different; frequency domain conversion is performed on the multiple channels of digital audio signals to obtain multiple channels of frequency domain data; according to the multiple channels of frequency domain data One or at least two channels of target frequency domain data determine frequency domain fusion data; convert the frequency domain fusion data into a time domain audio signal, and obtain an output audio signal according to the time domain audio signal.
  • the method of this embodiment can effectively improve the dynamic range of the recording system, has high sensitivity, and can reduce the noise floor, meet the requirements of high signal-to-noise ratio, and can simultaneously take into account the high signal-to-noise ratio recording of small voices and loud voices
  • the non-overload and distortion-free recording meets the recording needs of high dynamic range.
  • the test environment is an ordinary quiet room.
  • the first 3 seconds of the sound source is the voice signal, and the last 1 second is the tapping sound near the microphone.
  • the frame shift M 1024.
  • Both the analysis window and the synthesis window are sine windows:
  • the digital audio signal (time domain signal) of the first frame output by the first preprocessing circuit and the second preprocessing circuit is shown in Figure 11a and Figure 11b, respectively.
  • the output of the first preprocessing circuit and the second preprocessing circuit are
  • the digital audio signal (time spectrum) of the first frame is shown in Figure 12a and Figure 12b, respectively.
  • the time-domain signal and time-frequency spectrum of the output audio signal are shown in Figs. 13a and 13b, respectively.
  • the speech segment signal is basically the same as the digital audio signal of the first frame output by the high gain first preprocessing circuit, which meets the requirements of high signal-to-noise ratio;
  • the percussion segment retains the low gain second pre-processing Processing the frequency spectrum of the digital audio signal of the first frame of the output of the circuit, and dynamically adjusting its amplitude; and a continuous smooth transition between the small sound and the loud sound. Therefore, the method of this embodiment can simultaneously take into account the high signal-to-noise ratio recording of small sounds and the non-overload and distortion-free recording of loud sounds, meeting the recording requirements of high dynamic range.
  • FIG. 14 is a structural diagram of an audio signal processing device provided by an embodiment of the present invention.
  • the audio signal processing device 90 includes a memory 92 and a processor 91.
  • the memory 92 is used to store program codes
  • the processor 91 calls the program code, and when the program code is executed, is used to perform the following operations:
  • a plurality of preprocessing circuits are used to process the analog audio signal to be processed to obtain multiple digital audio signals.
  • Each of the plurality of preprocessing circuits includes an amplifier and an analog-to-digital converter. The analog gains of the amplifiers are different;
  • the frequency domain fusion data is converted into a time domain audio signal, and an output audio signal is obtained according to the time domain audio signal.
  • the processor 91 is configured to: obtain energy feature information of the analog audio signal to be processed;
  • the determining frequency domain fusion data according to one or at least two channels of frequency domain data among the multiple channels of frequency domain data includes:
  • each of the plurality of preset processing circuits corresponds to a different reference energy characteristic parameter
  • the processor 91 determines one or at least two channels of target frequency domain data from the multiple channels of frequency domain data according to the energy characteristic information, the processor 91 is configured to:
  • the first target frequency domain data and the second target frequency domain data are determined from the multiple channels of frequency domain data according to the energy feature information and multiple reference energy feature parameters, wherein the multiple reference energy feature parameters are based on the multiple frequency domain data.
  • the analog gain of the amplifier circuit included in each preprocessing circuit is determined.
  • the processing The device 91 is configured as:
  • the first target frequency domain data and the second target frequency domain data are determined from the multiple channels of frequency domain data according to the first reference energy characteristic parameter and the second reference energy characteristic parameter, where the first target frequency domain data and the second target
  • the frequency domain data is obtained by performing the frequency domain conversion on the first digital audio signal and the second digital audio signal in the multi-channel digital audio signal, and the first digital audio signal and the second digital audio signal are respectively
  • the analog audio signal to be processed is obtained by the first preprocessing circuit and the second preprocessing circuit corresponding to the first reference energy characteristic parameter and the second reference energy characteristic parameter.
  • the processor 91 determines one or at least two channels of target frequency domain data from the multiple channels of frequency domain data according to the energy characteristic information, the processor 91 is configured for:
  • a third target is determined from the multi-channel frequency domain data according to the third reference energy characteristic parameter Frequency domain data, wherein the third target frequency domain data is obtained by performing the frequency domain conversion on a third digital audio signal in the multi-channel digital audio signal, and the third digital audio signal is obtained from a third reference energy
  • the third preprocessing circuit corresponding to the characteristic parameter is obtained from the analog audio signal to be processed;
  • the processor 91 when the processor 91 fuses the multiple channels of frequency domain data to obtain frequency domain fusion data, the processor 91 is further configured to:
  • a fourth target is determined from the multiple channels of frequency domain data according to the fourth reference energy feature parameter Frequency domain data, where the fourth target frequency domain data is obtained by performing the frequency domain conversion on a fourth digital audio signal in the multiple digital audio signals, and the fourth digital audio signal is obtained by a fourth reference energy Obtained by the fourth preprocessing circuit corresponding to the characteristic parameter from the analog audio signal to be processed;
  • the processor 91 determines frequency domain fusion data according to the one or at least two channels of target frequency domain data
  • the processor 91 is configured to:
  • the one or more channels of frequency domain data include first target frequency domain data and second target frequency domain data;
  • the processor 91 When the processor 91 obtains frequency domain fusion data according to the one or more channels of frequency domain data, the processor 91 is configured to:
  • each of the multiple preset processing circuits corresponds to a different reference energy characteristic parameter
  • the multiple reference energy characteristic parameters are based on the fact that the multiple preprocessing circuits include
  • the analog gain of the amplification circuit is determined, wherein the weights corresponding to the first target frequency domain data and the second target frequency domain data are based on the weights corresponding to the first target frequency domain data in the plurality of preset processing circuits
  • the reference energy characteristic parameters of the first preprocessing circuit and the second preprocessing circuit corresponding to the second target frequency domain data are determined.
  • the processor 91 determines frequency domain fusion data according to one or at least two channels of target frequency domain data among the multiple channels of frequency domain data
  • the processor 91 is configured to :
  • the compression processing is linear compression processing.
  • the compression coefficient corresponding to each channel of frequency domain data of the one or more channels of frequency domain data is determined according to the analog gain of the amplifier included in the preprocessing circuit corresponding to each channel of frequency domain data of.
  • the time domain audio signal is the current frame time domain audio signal, wherein, when the processor 91 obtains the output audio signal according to the time domain audio signal, the processor 91 Is configured as:
  • the output audio signal is determined according to the time-domain fusion audio signal of the current frame.
  • the processor 91 determines the output audio signal according to the time-domain fusion audio signal of the current frame, the processor 91 is configured to:
  • the output audio signal is determined according to the time-domain fusion audio signal of the current frame after the compression processing.
  • the processor 91 when the processor 91 performs frequency domain conversion on the multiple channels of digital audio signals to obtain multiple channels of frequency domain data, the processor 91 is configured to:
  • the processor 91 when the processor 91 obtains an output audio signal according to the time domain audio signal, the processor 91 is configured to:
  • the output audio signal is determined according to the time domain audio signal after the windowing process.
  • the audio signal processing device uses multiple preprocessing circuits to process analog audio signals to be processed to obtain multiple digital audio signals, where each of the multiple preprocessing circuits includes an amplifier and The analog-to-digital converter, and the analog gains of the amplifiers of each preprocessing circuit are different; frequency domain conversion is performed on the multiple channels of digital audio signals to obtain multiple channels of frequency domain data; according to the multiple channels of frequency domain data Determine the frequency domain fusion data from one or at least two channels of target frequency domain data; convert the frequency domain fusion data into a time domain audio signal, and obtain an output audio signal according to the time domain audio signal.
  • This embodiment can effectively improve the dynamic range of the recording system, has high sensitivity, can reduce the noise floor, meet the requirements of high signal-to-noise ratio, and can simultaneously take into account the high signal-to-noise ratio recording of small sounds and the absence of loud sounds. Overloading and distortion-free recording meets the recording needs of high dynamic range.
  • An embodiment of the present invention provides a recording system.
  • the recording system includes: a microphone for collecting analog audio signals; and an audio processing device 90 as described in the foregoing embodiment.
  • this embodiment also provides a computer-readable storage medium on which a computer program is stored, and the computer program is executed by a processor to implement the audio processing method described in the foregoing embodiment.
  • the disclosed device and method may be implemented in other ways.
  • the device embodiments described above are only illustrative.
  • the division of the units is only a logical function division, and there may be other divisions in actual implementation, for example, multiple units or components can be combined or It can be integrated into another system, or some features can be ignored or not implemented.
  • the displayed or discussed mutual coupling or direct coupling or communication connection may be indirect coupling or communication connection through some interfaces, devices or units, and may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or they may be distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the objectives of the solutions of the embodiments.
  • the functional units in the various embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated unit may be implemented in the form of hardware, or may be implemented in the form of hardware plus software functional units.
  • the above-mentioned integrated unit implemented in the form of a software functional unit may be stored in a computer readable storage medium.
  • the above-mentioned software functional unit is stored in a storage medium and includes several instructions to make a computer device (which may be a personal computer, a server, or a network device, etc.) or a processor execute the method described in the various embodiments of the present invention. Part of the steps.
  • the aforementioned storage media include: U disk, mobile hard disk, read-only memory (Read-Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disk or optical disk and other media that can store program code .

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Stereophonic System (AREA)

Abstract

Des modes de réalisation de la présente invention concernent un procédé et un dispositif de traitement de signal audio et un support de stockage. Le procédé consiste à : traiter un signal audio analogique à traiter en utilisant de multiples circuits de prétraitement, de façon à obtenir de multiples signaux audio numériques, chacun des multiples circuits de prétraitement comprenant un amplificateur et un convertisseur analogique-numérique, et les gains analogiques des amplificateurs des divers circuits de prétraitement étant différents les uns des autres ; effectuer une conversion dans le domaine fréquentiel sur les multiples signaux audio numériques de façon à obtenir de multiples éléments de données dans le domaine fréquentiel ; déterminer des données de fusion dans le domaine fréquentiel selon un ou au moins deux éléments de données dans le domaine fréquentiel cible parmi les multiples éléments de données dans le domaine fréquentiel ; et convertir les données de fusion dans le domaine fréquentiel en un signal audio dans le domaine temporel, et obtenir un signal audio de sortie conformément au signal audio dans le domaine temporel. Le procédé dans les modes de réalisation peut améliorer efficacement la plage dynamique d'un système d'enregistrement et assurer une sensibilité élevée, tout en permettant une réduction du bruit de fond et l'obtention du rapport signal/bruit élevé exigé.
PCT/CN2019/083006 2019-04-17 2019-04-17 Procédé et dispositif de traitement de signal audio, et support de stockage Ceased WO2020211004A1 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201980005583.8A CN111345047A (zh) 2019-04-17 2019-04-17 音频信号处理方法、设备及存储介质
PCT/CN2019/083006 WO2020211004A1 (fr) 2019-04-17 2019-04-17 Procédé et dispositif de traitement de signal audio, et support de stockage

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2019/083006 WO2020211004A1 (fr) 2019-04-17 2019-04-17 Procédé et dispositif de traitement de signal audio, et support de stockage

Publications (1)

Publication Number Publication Date
WO2020211004A1 true WO2020211004A1 (fr) 2020-10-22

Family

ID=71186612

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/083006 Ceased WO2020211004A1 (fr) 2019-04-17 2019-04-17 Procédé et dispositif de traitement de signal audio, et support de stockage

Country Status (2)

Country Link
CN (1) CN111345047A (fr)
WO (1) WO2020211004A1 (fr)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114812789B (zh) * 2021-01-28 2024-03-08 漳州立达信光电子科技有限公司 一种声音侦测方法、装置及设备
CN113079440B (zh) * 2021-03-22 2022-12-06 Oppo广东移动通信有限公司 音频信号的处理方法、装置、终端及存储介质
CN113823314B (zh) * 2021-08-12 2022-10-28 北京荣耀终端有限公司 语音处理方法和电子设备
CN113643719A (zh) * 2021-08-26 2021-11-12 Oppo广东移动通信有限公司 音频信号处理方法、装置、存储介质及终端设备
CN119200872A (zh) * 2023-06-27 2024-12-27 华为技术有限公司 检测方法和装置
CN117761393B (zh) * 2024-02-22 2024-05-07 南京派格测控科技有限公司 一种时域信号的获取方法及装置

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104066036A (zh) * 2014-06-19 2014-09-24 华为技术有限公司 拾音装置及拾音方法
US20170033754A1 (en) * 2015-07-29 2017-02-02 Invensense, Inc. Multipath digital microphones
CN108683978A (zh) * 2018-05-30 2018-10-19 中南大学 一种基于频域自适应均衡的音频处理方法

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2566658A1 (fr) * 1984-06-28 1986-01-03 Inst Nat Sante Rech Med Prothese auditive multivoie
US8085954B2 (en) * 2005-07-21 2011-12-27 Freescale Semiconductor, Inc. Microphone amplification arrangement and integrated circuit therefor
RU2472306C2 (ru) * 2007-09-26 2013-01-10 Фраунхофер-Гезелльшафт цур Фёрдерунг дер ангевандтен Форшунг Е.Ф. Устройство и способ для извлечения сигнала окружающей среды в устройстве и способ получения весовых коэффициентов для извлечения сигнала окружающей среды
US9380381B2 (en) * 2014-03-18 2016-06-28 Infineon Technologies Ag Microphone package and method for providing a microphone package
CN104954020B (zh) * 2014-03-28 2018-07-24 意法半导体股份有限公司 多通道换能器设备和其操作方法
US9609451B2 (en) * 2015-02-12 2017-03-28 Dts, Inc. Multi-rate system for audio processing
CN106162432A (zh) * 2015-04-03 2016-11-23 吴法功 一种音频处理器装置及其声音补偿架构处理实现方法
CN107895580B (zh) * 2016-09-30 2021-06-01 华为技术有限公司 一种音频信号的重建方法和装置

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104066036A (zh) * 2014-06-19 2014-09-24 华为技术有限公司 拾音装置及拾音方法
US20170033754A1 (en) * 2015-07-29 2017-02-02 Invensense, Inc. Multipath digital microphones
CN108683978A (zh) * 2018-05-30 2018-10-19 中南大学 一种基于频域自适应均衡的音频处理方法

Also Published As

Publication number Publication date
CN111345047A (zh) 2020-06-26

Similar Documents

Publication Publication Date Title
WO2020211004A1 (fr) Procédé et dispositif de traitement de signal audio, et support de stockage
US9210506B1 (en) FFT bin based signal limiting
CN103460716B (zh) 用于音频信号处理的方法与装置
US10109288B2 (en) Dynamic range and peak control in audio using nonlinear filters
CN105164918B (zh) 具有动态阈值的频带压缩
JP5984943B2 (ja) 聴覚装置における安定性と音声の聴き取り易さの改善
US9311933B2 (en) Method of processing a voice segment and hearing aid
WO2010089976A1 (fr) Prothèse auditive
TWI657435B (zh) 音訊處理裝置及方法
WO2020211017A1 (fr) Procédé et dispositif de traitement de signal audio, et support de stockage
JP4249729B2 (ja) 自動利得制御方法、自動利得制御装置、自動利得制御プログラム及びこれを記録した記録媒体
CN115348507A (zh) 脉冲噪声抑制方法、系统、可读存储介质及计算机设备
CN115623384A (zh) 音频动态范围控制方法及装置、存储介质、音频设备
US10841680B2 (en) Microphone and method for processing audio signals
CN114094964B (zh) 一种动态范围控制方法及装置、电子设备
CN115437599A (zh) 音频播放装置及其音频播放方法、存储介质
EP3513573A1 (fr) Procédé, appareil et programme informatique de traitement de signaux audio
CN115623385A (zh) 音频动态范围控制电路及音频设备
US20210329387A1 (en) Systems and methods for a hearing assistive device
Girisha et al. Implementation of novel algorithm for auditory compensation in hearing aids using STFT algorithm
WO2025264909A1 (fr) Annulation d'écho
CN116964665A (zh) 提高去混响的感知质量
CN120853599A (zh) 信号处理方法、装置及电子设备
WO2023249957A1 (fr) Amélioration de la parole et suppression des interférences
Girisha et al. IMPLEMENTATION OF NOVEL ALGORITHM FOR AUDITORY COMPENSATION IN HEARING AIDS USING STFT

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19925363

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19925363

Country of ref document: EP

Kind code of ref document: A1