EP1538867B1 - Handsfree system for use in a vehicle - Google Patents
Handsfree system for use in a vehicle Download PDFInfo
- Publication number
- EP1538867B1 EP1538867B1 EP20030022273 EP03022273A EP1538867B1 EP 1538867 B1 EP1538867 B1 EP 1538867B1 EP 20030022273 EP20030022273 EP 20030022273 EP 03022273 A EP03022273 A EP 03022273A EP 1538867 B1 EP1538867 B1 EP 1538867B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- filter
- signal
- adaptive
- microphones
- beamformer
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 230000003044 adaptive effect Effects 0.000 claims description 104
- 238000012545 processing Methods 0.000 claims description 32
- 238000000034 method Methods 0.000 claims description 25
- 238000001914 filtration Methods 0.000 claims description 19
- 230000004044 response Effects 0.000 claims description 19
- 230000001629 suppression Effects 0.000 claims description 9
- 230000002087 whitening effect Effects 0.000 claims description 9
- 230000009467 reduction Effects 0.000 claims description 8
- 230000000694 effects Effects 0.000 claims description 3
- 238000004590 computer program Methods 0.000 claims description 2
- 230000006870 function Effects 0.000 description 30
- 238000012546 transfer Methods 0.000 description 30
- 239000011159 matrix material Substances 0.000 description 21
- 230000003595 spectral effect Effects 0.000 description 14
- 230000000903 blocking effect Effects 0.000 description 13
- 238000005070 sampling Methods 0.000 description 12
- 238000010586 diagram Methods 0.000 description 11
- 230000000875 corresponding effect Effects 0.000 description 10
- 238000013461 design Methods 0.000 description 10
- 238000003491 array Methods 0.000 description 9
- 230000002829 reductive effect Effects 0.000 description 7
- 238000001228 spectrum Methods 0.000 description 7
- 238000007493 shaping process Methods 0.000 description 6
- 230000002950 deficient Effects 0.000 description 5
- 230000035945 sensitivity Effects 0.000 description 5
- 230000003247 decreasing effect Effects 0.000 description 4
- 238000013459 approach Methods 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 230000001934 delay Effects 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 238000009826 distribution Methods 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000012935 Averaging Methods 0.000 description 1
- 239000008186 active pharmaceutical agent Substances 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 238000005311 autocorrelation function Methods 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000005314 correlation function Methods 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 238000002592 echocardiography Methods 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 229920001296 polysiloxane Polymers 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
- H04R3/005—Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/20—Arrangements for obtaining desired frequency or directional characteristics
- H04R1/32—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
- H04R1/40—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
- H04R1/406—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers microphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2430/00—Signal processing covered by H04R, not provided for in its groups
- H04R2430/20—Processing of the output signals of the acoustic transducers of an array for obtaining a desired directivity characteristic
- H04R2430/23—Direction finding using a sum-delay beam-former
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2499/00—Aspects covered by H04R or H04S not otherwise provided for in their subgroups
- H04R2499/10—General applications
- H04R2499/13—Acoustic transducers and sound field adaptation in vehicles
Definitions
- the invention is directed to a handsfree system for use in a vehicle comprising a microphone array with at least two microphones and a signal processing means.
- handsfree systems are used more and more since they provide increased comfort and reduce the risk of an accident as the driver is distracted only marginally. Because of that, in many countries, handsfree devices are even required by law.
- WO 03/015464 discloses such a handsfree system for use in a vehicle.
- a handsfree system comprises a microphone that can be fastened to a user such as the driver.
- a beamformer processes signals emanating from a microphone array to obtain a combined signal.
- a beamformer comprises a beamsteering means being responsible for time delay compensation of the different microphones and a summing means.
- beamforming In its simplest form (Delay-and-Sum beamformer), beamforming only comprises delay compensation and summing of the compensated signals. Beamforming allows to provide a specific directivity pattern for a microphone array.
- a beamformer can be implemented as digital system with a plurality of digital filter using, for example, digital signal processors (DSP).
- DSP digital signal processors
- a beamformer can be configured as an adaptive or a non-adaptive beamformer.
- Adaptive means that relevant parameters such as filter coefficients can be re-calculated during use of the system in order to adapt the beamformer to changing conditions.
- the system parameters are determined once by calibrating the beamformer and, then, kept unchanged.
- the beamforming in principle, can be performed in the time domain or in the frequency domain.
- a handsfree system in accordance with the invention shows an excellent acoustic performance in a vehicular environment. Due to the beamformer, an improved directivity is obtained and, furthermore, speech signals are enhanced and ambient noise is reduced.
- the adaptive post-filter (responsible for filtering a signal after the beamforming) further reduces the noise in the signal.
- the adaptive post-filter can be a filter in the time domain. If the post-filtering is performed in the time domain, the delay time is reduced and the implementation is simplified.
- the adaptive post-filter is a Wiener filter. It turns out that a Wiener filter is particularly suitable for filtering in a car environment.
- the adaptive post-filter is a linear-phase filter.
- the adaptive post-filter is a linear-phase Wiener filter.
- the signal processing means can further comprise at least two adaptive filters having an input connected to the output of the beamsteering means and an output connected to the adaptive post-filter, wherein the at least two adaptive filters are configured to determine adaptive filter parameters for the adaptive post-filter.
- background filters are provided for adaptively estimating the filter parameters for the adaptive post-filter.
- an adaptive filter can be provided having an input connected to the output of the beamsteering means.
- adaptive filter parameters can be determined for the adaptive post-filter.
- the actual filter parameters of the post-filter can be given, for example, by the filter parameters determined by one of the adaptive filters or the mean of the filter parameters determined by several different adaptive filters.
- an input of each of the at least two adaptive filters can be further connected to the output of the beamformer. This allows for an adaption of the respective filter parameters directly with respect to the beamformed signal.
- the signal processing means can further comprise a pre-emphasis filter, in particular, comprising a pre-whitening filter, having an input connected to an output of the adaptive post-filter and/or a pre-emphasis filter, in particular, comprising a pre-whitening filter, having an input connected to the output of the beamsteering means and an output connected to the at least two adaptive filters.
- a pre-emphasis filter in particular, comprising a pre-whitening filter, having an input connected to an output of the adaptive post-filter and/or a pre-emphasis filter, in particular, comprising a pre-whitening filter, having an input connected to the output of the beamsteering means and an output connected to the at least two adaptive filters.
- the pre-emphasis filter can comprise a pre-whitening filter.
- a pre-whitening filter whitens the spectral distribution of a signal.
- the filter coefficients of such a pre-whitening filter can be determined using a linear predictive coding (LPC) analysis, for example, via an adaptive lattice predictor (ALP) algorithm.
- LPC linear predictive coding
- ALP adaptive lattice predictor
- the signal processing means can further comprise an inverse filter, particularly a warped inverse filter. These filters are especially useful to adjust the microphone transfer function and to match the microphones of the array in this way.
- the beamformer can comprise at least one inverse filter, in particular, having an output for providing an inversely filtered signal to a summing means.
- matched microphones on the basis of silicone or paired microphones may be used.
- the susceptibility of microphone arrays often increases with decreasing frequency. Due to this, a higher matching precision is preferred for low frequencies compared to high frequencies. A frequency depending adjustment of the microphone transfer functions with the use of warped filters reduces the required memory compared to the case of conventional FIR filters.
- each inverse filter can be an approximate inverse of a non-minimum phase filter. This results in an inverse filter which is both stable and has no phase error.
- an inverse filter may be combined with another filter of the handsfree system, for example, a filter of the beamformer.
- another filter of the handsfree system for example, a filter of the beamformer.
- the signal processing means of the above handsfree systems can comprise a non-adaptive post-filter having an input connected to an output of the adaptive post-filter.
- the non-adaptive post-filter may directly follow the adaptive post-filter.
- Such a filter is used to compensate for the ambient acoustics of a speaker.
- the non-adaptive post-filter may have the form of an inverse room filter.
- the signal processing means may further comprise an adaptive noise canceller (ANC), for electrical ANC implementations.
- ANC adaptive noise canceller
- the ANC can be connected to a non-acoustic sensor to determine a noise signal, for example, by using the tachometer of the vehicle.
- the ANC advantageously, can have an output connected to the input of the beamformer and/or of the adaptive post-filter.
- the signal processing means of the previously described handsfree systems can comprise an acoustic echo canceller AEC.
- the AEC can comprise an echo shaping filter. In this way, a frequency selected echo attenuation may be obtained.
- the AEC can have an output connected to the input of the beamformer and/or of the adaptive post-filter.
- the beamformer can be a non-adaptive beamformer.
- the computing power during operation of the system is reduced.
- the beamformer may be a superdirective beamformer which further improves the acoustic performance.
- the beamformer may be a regularized superdirective beamformer using a finite regularization parameter ⁇ .
- the regularization parameter usually enters the equation for computing the filter coefficients or, alternatively, is inserted into the cross-power spectrum matrix or the coherence matrix.
- the regularized superdirective beamformer has reduced noise and is less sensitive to an imperfect matching of the microphones.
- the finite regularization parameter ⁇ may depend on the frequency. This achieves an improved gain of the array compared to a regularized superdirective beamformer with fixed regularization parameter ⁇ .
- each superdirective filter may result from an iterative design based on a predetermined maximum susceptibility. This enables an optimal adjustment of the microphones, particularly with respect to the transfer function and the position of each microphone.
- the maximum susceptibility may be determined as a function of the error in the transfer characteristic of the microphones, the error in the microphone positions and a predetermined (required) maximum deviation in the directional diagram of the microphone array.
- the time-invariant impulse response of the filters will be determined iteratively only once; there is no adaption of the filter coefficients during operation.
- each superdirective filter can be a filter in the time domain. Filtering in the frequency domain is a possible alternative, however, requiring to perform a Fourier transform (FFT) and an inverse Fourier transform (IFFT), thus, increasing the required memory.
- FFT Fourier transform
- IFFT inverse Fourier transform
- the beamformer may have the structure of a generalized sidelobe canceller (GSC).
- GSC generalized sidelobe canceller
- the beamformer can be a minimum variance distortionless response (MVDR) beamformer.
- MVDR minimum variance distortionless response
- the microphone array can comprise at least two microphones being arranged in endfire orientation with respect to a first position.
- An array in endfire orientation has a better directivity and is less sensitive to a mismatched propagation or delay time compensation.
- the first position can be the location of the drivers head, for example.
- the microphone array can comprise at least two microphones being arranged in endfire orientation with respect to a second position.
- the handsfree system of the invention has a good directivity in two directions. Speech signals coming from two different positions, for example, from the driver and the front seat passenger, can both be recorded in good quality.
- the signal processing means may comprise at least two beamformers.
- a first beamformer may be used for signals from a first position and a second beamformer may be used for signals from a second position.
- the handsfree system may further comprise a voice activity detector (VAD) and/or a switch control means.
- VAD voice activity detector
- the switch control and the VAD are used to determine how to combine the output of the at least two beamformers.
- the handsfree system can comprise a residual echo suppression (RES) means and/or a dynamic volume control (DVC).
- a RES means serves for suppression of residual echoes, in particular, being present in the signal resulting from the adaptive post filter.
- a residual echo suppression means can comprise an input connected to the output of the adaptive post filter.
- a RES means can comprise an input for receiving a far end signal.
- a DVC is intended for dynamically adapting the output volume of a far end signal depending on the ambient noise level being present in the vehicle.
- the at least two microphones in the first endfire orientation (endfire orientation with respect to a first position) and the at least two microphones in the second endfire orientation (endfire orientation with respect to a second position) can have a microphone in common.
- a microphone array consisting of only three microphones provides an excellent directivity for use in a vehicular environment.
- the microphone array may comprise at least two subarrays.
- Each subarray of microphones may be optimized for a specific frequency band yielding an improved overall directivity.
- At least two subarrays may have at least one microphone in common.
- the above handsfree systems may comprise a frame wherein each microphone of the microphone array is arranged in a predetermined, in particular fixed, position in or on the frame. This ensures that after manufacture of the frame with the microphone, the relative positions of the microphones are known. Such an array can be easily mounted in a vehicular cabin.
- At least one microphone may be a directional microphone.
- the use of directional microphones improves the array gain.
- At least one directional microphone may have a cardioid characteristic. This further improves the array gain. More preferred, the cardioid characteristic is a hyper-cardioid characteristic.
- At least one directional microphone may be a differential microphone.
- the differential microphone may be a first order differential microphone.
- the invention is further directed to a vehicle, particularly a car, comprising any of the above-described handsfree systems.
- the invention is also directed to the use of any of the previously described handsfree systems in a vehicle, in particular, a car.
- the invention provides a method for noise reduction in a vehicular handsfree system according to claim 21.
- This method results in an excellent acoustic performance of a handsfree system in a vehicular environment.
- the adaptive filtering can be performed in the time domain. In this way, particularly the delay time is reduced.
- the method can further comprise
- adaptively filtering can further comprise receiving a signal resulting from the beamformed signal by at least one adaptive filter and wherein processing the beamsteered signal can comprise determining adaptive filter parameters using the at least one beamsteered signal and the signal resulting from the beamformed signal.
- an adaptive filter for each beamsteered signal, can be provided for determining adaptive filter parameters using the beamsteered signal and the signal resulting from the beamformed signal.
- receiving at least one beamsteered signal by at least one of the at least two adaptive filters can comprise processing the at least one beamsteered signal by a pre-emphasis filter, in particular, comprising a pre-whitening filter.
- the above methods can further comprise processing a signal resulting from the microphone array by an inverse filter, in particular, a warped inverse filter.
- the methods can further comprise non-adaptively filtering a signal resulting from the adaptively filtered signal and/or processing a signal resulting from the adaptively filtered signal by a pre-emphasis filter.
- the above method can further comprise processing a signal resulting from the microphone array, particularly resulting from the beamformed signal, by an adaptive noise canceller (ANC) and/or an acoustic echo canceller (AEC) and/or a residual echo suppression (RES) means.
- ANC adaptive noise canceller
- AEC acoustic echo canceller
- RES residual echo suppression
- the input signals can be processed by a non-adaptive and/or superdirective and/or minimum variance distortionless response (MVDR) beamformer.
- MVDR minimum variance distortionless response
- the invention also provides a computer program product comprising one or more computer readable media having computer-executable instructions for performing the steps of the above described methods.
- FIG. 1 An example of the handsfree system in accordance with the present invention is shown in Fig. 1 .
- the general structure will be shortly described, and, then, the different components will be explained in more detail.
- the dotted lines encasing some elements simply serve for better understanding of the figures without necessarily implying any actual combination or separation of different elements.
- the main components of the system are a microphone array, a beamformer and an adaptive post-filter in the time domain.
- the microphone array 101 in this example, comprises four microphones 102. Each microphone 102 yields an output signal x i [ k ].
- the microphone signals may be filtered by an optional high-pass filter 103.
- a beamformer may be a conventional delay and sum beamformer. However, in the present example, a preferred superdirective beamformer is shown.
- a beamformer comprises beamsteering means 104 and filters 105.
- the output signals of the beamformer may be passed through optional inverse filters 106 and, then, are summed by summing means 107 to yield a resulting beamformed signal x [ k ].
- This signal is passed through an adaptive post-filter 108 in the time domain which may be followed by an optional non-adaptive post-filter 109 and/or by an optional pre-emphasis filter (not shown).
- the adaption of the post-filter 108 is performed using a set of Wiener filters 110.
- the input signals of the Wiener filters 110 comprise, on the one hand, the individual signals resulting from the different microphones and, on the other hand, the summed signal x [ k ].
- the microphone signals are taken after the beamsteering.
- the beamformer comprises further (superdirective) filters 105 as in the present case, it is also possible to take the microphone signals after this additional filtering.
- the microphone signals are passed through an optional pre-emphasis filter 111.
- a microphone signal x [ k ] is the sum of the speech signal s [ k ] and the noise n [ k ].
- S ( ⁇ , ⁇ ) and X ( ⁇ , ⁇ ) are short-time spectra that may be determined, for example, with the help of DFT filter banks or an FFT.
- ⁇ is the time index and ⁇ the frequency index
- E ⁇ ⁇ . ⁇ represents the short-time average that may be obtained, for example, with the help of a first order IIR filter.
- the short-time auto power spectral density of the speech signal in the numerator of the above equation is to be estimated in a suitable way.
- Appropriate estimation methods include spectral subtraction (estimating the auto power spectral density of the noise), minimum mean square error short-time spectral amplitude (MMSE STSA) estimator or MMSE log-SA estimator or a speech pause detector, for example.
- the estimated short-time auto power spectral density of the noise signal may be used to estimate the absolute value of the most probable Fourier coefficient (using, for example, a spectral subtraction algorithm or an MMSE log-SA estimator) and to reconstruct the absolute value of the spectrum of the speech signal.
- the coherence or the cross power spectral density of the noise signals received by the microphones is vanishing.
- the adaption of the post-filter w ( k,i ) - k being the time index and i denoting the coefficient within the impulse response - is performed in the time domain, for example, with the help of the LMS algorithm.
- the form of the other three Wiener filters is obtained by a cyclic permutation of the indices.
- Wiener filters 110 are not restricted to a particular number of Wiener filters 110. Furthermore, not every Wiener filter 110 is always to be used to determine the adaptive post-filter 108. For example, one may use only the Wiener filter which uses the microphone signal of the microphone proximal to the source of speech.
- the filter coefficients of a linear-phase post-filter (with length L) can be obtained. Accordingly, the linear-phase post-filter has twice the length of one of the background filters 110 (with length L/2). Such a linear-phase filter only modifies the amplitude spectrum of the input signal of the filter without a frequency dependent distortion of the phase spectrum.
- the performance of the filter can be further improved by smoothing its frequency response. This can be achieved by weighting the filter coefficients with a window function.
- the inverse filters 106 serve to compensate for the acoustic transfer function of the path between the source of speech and the microphones.
- FIG 2 the structure of a superdirective beamformer is shown.
- the beamformer shown in this figure performs the filtering in the frequency domain, in contrast to the case of Figure 1 . If a beamformer in the frequency domain were used in Figure 1 , an inverse Fourier transform is to be performed on the signals before passing the signals to the Wiener filters 110 or the pre-emphasis filter 111.
- the microphone array consists of M microphones 102, each yielding a signal x i (t).
- the signals x i (t) are transferred to the frequency domain by fast Fourier transform (FFT) means 201, resulting in a signal X i ( ⁇ ).
- FFT fast Fourier transform
- the beamforming consists of a beamsteering and a filtering. The beamsteering is responsible for the propagation time compensation.
- p ref denotes the position of a reference microphone
- p n the position of microphone n
- q the position
- the signals are filtered by superdirective filters 203 that are filters in the frequency domain.
- the filtered signals are summed yielding a signal Y ( ⁇ ).
- IFFT inverse fast Fourier transform
- the above described design rule for computing the optimal filter coefficients A i ( ⁇ ) for a homogenous diffuse noise field is based on the assumption that the microphones are perfectly matched, i.e. point-like microphones having exactly the same transfer function.
- a so-called regularized filter design may be used to adjust the filter coefficients.
- a scalar (the regularization parameter ⁇ ) is added at the main diagonal of the cross-correlation matrix.
- the directional diagram or response pattern ( ⁇ ( ⁇ , ⁇ ) of a microphone array characterizes the sensitivity of the array as a function of the direction of incidence ⁇ for different frequencies .
- a measure to describe the directivity of an array is the so-called gain that does not depend on the angle of incidence ⁇ .
- the gain is defined as the sensitivity of the array in the main direction of incidence with respect to the sensitivity for omnidirectional incidence.
- the Front-To-Back-Ratio indicates the sensitivity in front receiving direction compared to the back.
- the white noise gain (WNG) describes the ability of the array to suppress uncorrelated noise, for example, the inherent noise of the microphones.
- the inverse of the white noise gain is the susceptibility K ( ⁇ ),
- the susceptibility K ( ⁇ ) describes the array's sensitivity to defective parameters. It is often preferred that the susceptibility K ( ⁇ ) of the array filters A i ( ⁇ ) does not exceed an upper bound K max ( ⁇ ) .
- the selection of this upper bound may be dependent on the relative error ⁇ 2 ( ⁇ , ⁇ ) of the microphones and, for example, on requirements regarding the directional diagram ⁇ ( ⁇ , ⁇ ).
- the relative error ⁇ 2 ( ⁇ , ⁇ ) in general, is the sum of the mean square error of the transfer properties of all microphones ⁇ 2 ( ⁇ , ⁇ ) and the Gaussian error with zero mean of the microphone positions ⁇ 2 ( ⁇ ).
- the error in the microphone transfer functions ⁇ ( ⁇ ) has a higher influence on the maximum susceptibility K max ( ⁇ ) and, thus, also on the maximum possible gain G ( ⁇ ) than the error ⁇ 2 ( ⁇ ) in the microphone positions.
- the defective transfer functions are mainly responsible for the limitation of the maximum susceptibility.
- a higher mechanical precision to reduce the position deviations of the microphones is only sensible up to a certain point since the microphones usually are modeled as being point-like, which is not true in reality.
- the microphones usually are modeled as being point-like, which is not true in reality.
- ⁇ 2 ( ⁇ ) 1% which is quite realistic.
- the error ⁇ ( ⁇ ) can be derived from the frequency depending deviations of the microphone transfer functions.
- inverse filters may be used to adjust the individual microphone transfer functions to a reference transfer function.
- a reference transfer function can be the transfer function of one microphone out of the array or, for example, the mean of all measured transfer functions.
- M being the number of microphones
- the transfer functions are not minimal phase, thus, a direct inversion would yield instable filters.
- the approximate inversion with the help of an FXLMS (filtered X least mean square) or the FXNLMS (filtered X normalized least mean square) algorithm will be described.
- the inverse filters may be coupled with the superdirective filters A i ( ⁇ ) such that, in the end, only one filter per viewing direction and microphone is to be implemented.
- the FXLMS or the FXNLMS algorithm is described with reference to Figure 3 .
- the update of the filter coefficients of w[ n ] is performed iteratively, i.e. at each time step n , whereby the filter coefficient w[ n ] are computed such that the instantaneous squared error e 2 [ n ] is minimized.
- the susceptibility increases with decreasing frequency.
- the FIR filters for example, are to be very long in order to obtain a sufficient frequency resolution in the desired frequency range. This means that the expenditure, in particular, regarding the memory, increases rapidly.
- the computing time does not impose a severe limitation.
- a suitable frequency depending adaption of the transfer functions can be achieved by using short WFIR filters (warped filters).
- a realization of the beamforming filters in the time domain - as in the system of Figure 1 - is described with reference to Figure 4 .
- Signals are recorded by microphones 102.
- a near field beamsteering 104 is performed using gain factors ⁇ k 401 to compensate for the amplitude differences and time delays ⁇ k 402 to compensate for the propagation time differences of the microphone signals x k [ i ]
- the realization of the superdirective beamforming is achieved using the filters (preferably, FIR filters) ⁇ k ( i ) indicated by reference sign 403.
- the impulse responses ⁇ 1 ( i ),..., ⁇ M ( i ) can be determined as follows:
- the microphone signals are directly processed using the beamsteering 104 in the time domain.
- the beamsteering 104 is followed by the FIR filtering 403. After summing the filtered signals, a resulting enhanced signal y [ k ] is obtained.
- ⁇ max d mic ⁇ f a c .
- the sampling frequency or the distance between the microphones can be chosen much higher than in the broad-side case, thus, resulting in an improved beamforming.
- the maximum microphone distance that can be chosen depends not only on the lower limiting frequency for the optimization of the directional characteristic, but also on the number of microphones and on the distance of the microphone array to the speaker. In general, the larger the number of microphones, the smaller their maximum distance in order to optimize the Signal-To-Noise-Ratio (SNR).
- a further improvement of the directivity, and, thus, of the gain, can be achieved by using unidirectional microphones instead of omnidirectional ones; this will be discussed in more detail below.
- Figures 5A and 5B show preferred arrangements of microphone arrays in a vehicle.
- the distance between the microphone array and the speaker should be as small as possible.
- each speaker 501 may have its own microphone array comprising at least two microphones 102.
- the microphone arrays may be provided at different locations, for example, within the headliner, dashboard, pillar, headrest, steering wheel, compartment door, visor or (driving) mirror.
- An arrangement within the roof is also a preferred possibility that is, however, not suitable for the case of a cabriolet. Both microphone arrays for each speaker are in endfire orientation.
- one microphone array is used for two neighboring speakers.
- directional microphones in particular, having a cardioid characteristic, may be used.
- the microphone array may be mounted within the mirror.
- Such a linear microphone array may be used for both the driver and the front seat passenger. A costly mounting of the microphones in the roof can be avoided.
- the array may be mounted in one piece, which ensures a high mechanical precision. Due to the adjustment of the mirror, the array would always be correctly oriented.
- Figure 6A shows a top view on a (driving) mirror 601 of a car with three microphones in two alternative arrangements.
- two microphones 602 and 603 are located in the center of the mirror in endfire orientation with respect to the driver and, preferably, have a distance d mic of about 5cm between each other.
- the microphones 603 and 604 are in endfire orientation with respect to the front seat passenger and have a distance of about 10cm between each other. Since the microphone 603 is used for both arrays, a cheap handsfree system can be provided.
- All three microphones may be directional microphones, preferably having a cardioid characteristic, for example, a hypercardioid characteristic.
- microphones 602 and 604 are directional microphones, whereas microphone 603 is an omnidirectional microphone which further reduces the costs. If all three microphones are directional microphones, preferably, microphones 602 and 603 are directed towards the driver.
- the front seat passenger beamformer Due to the larger distance between microphones 603 and 604 than between microphones 602 and 603, the front seat passenger beamformer has a better SNR at low frequencies.
- the microphone array for the driver consists of microphones 602' and 603' located at the left side of the mirror. In this case, the distance between this microphone array and the driver would be increased, thus, decreasing the performance. On the other hand, the distance between microphone 603' and 604 would be about 20cm, which yields a better gain for the front seat passenger at low frequencies.
- FIG. 6B A variant of two microphone arrays with improved precision is shown in Figure 6B .
- all microphones may be directional microphones, microphones 602 and 603 being directed to the driver, microphones 604 and 605 being directed to a front seat passenger.
- the microphone array for the front seat passenger comprises the three microphones 603, 604 and 605, which increases the gain considerably.
- subarray 701 with d mic 5 cm is used for the frequency band of 1400 - 3400 Hz
- subarray 702 with d mic 10 cm for the frequency band of 700 - 1400 Hz
- subarray 703 with d mic 20 cm for the band of frequencies smaller than 700 Hz.
- a lower limit of this frequency band may be imposed, for example, by the lowest frequency of the telephone band (the frequencies used in telephone applications) which, presently, is 300 Hz in most cases.
- the superdirective beamformer is designed as general sidelobe canceller (GSC).
- GSC general sidelobe canceller
- FIG. 8 Such a superdirective beamformer in GSC structure is shown in Figure 8 .
- the GSC structure is to be implemented in the frequency domain, thus, an FFT 201 is applied to the incoming signals x k ( t ) from microphones 102.
- a time alignment using phase factors e j ⁇ k has to be performed (in this figure, a far field beamsteering is shown).
- an inverse Fourier transform is to be performed before passing the signal to the Wiener filters 110 or the pre-emphasis filter 111.
- X denotes a vector comprising all time aligned input signals X i ( ⁇ ).
- a c is a vector comprising all frequency independent filter transfer functions A i that are necessary to observe the constraints in viewing direction; H is the vector of the transfer functions performing the actual superdirectivity; and B is the so-called blocking matrix projecting the input signals in X onto the "noise plane".
- the signal Y DS ( ⁇ ) denotes the output signal of the delay and sum beamformer, Y BM ( ⁇ ) the resulting output signal of the blocking branch, Y SD ( ⁇ ) the output signal of the superdirective beamformer x i ( t ), and X i ( ⁇ ) the input signals in the time and frequency domain that are not yet time aligned, and Y i ( ⁇ ) the output signals of the blocking matrix that ideally should block completely the desired or useful signal within the input signals.
- the signals Y i ( ⁇ ) ideally only comprise the noise signals.
- a GSC structure In addition to the superdirective output signal, a GSC structure also yields a delay and sum beamformer signal and a blocking output signal.
- a blocking matrix should have the following properties:
- ⁇ nn ( ⁇ ) may be replaced by the time aligned coherence matrix of the diffuse noise field, as discussed above.
- a regularization and the iterative design with predetermined susceptibility may be performed in the same way as above.
- ⁇ ⁇ ⁇ 0 ⁇ 1 2 ⁇ ⁇ - ⁇ ⁇ ⁇ 0 + ⁇ ⁇ 0 - ⁇ + 2 ⁇ ⁇ ⁇ e j 2 ⁇ ⁇ fd ij ⁇ cos ⁇ c ⁇ d ⁇ ⁇ e j 2 ⁇ ⁇ fd ij ⁇ cos ⁇ 0 c , i , j ⁇ 1 ...
- This method may also be generalized to the three-dimensional case. Then, in addition to the parameter ⁇ being responsible for the azimuth, a further parameter ⁇ is to be introduced for the elevation angle. This yields an analog equation for the coherence of the homogeneous diffuse 3D noise field.
- a superdirective beamformer based on an isotropic noise field is particularly useful for a handsfree system which is to be installed later in a vehicle. This is the case, for example, if the handsfree system is installed in the vehicle by the user itself.
- an MVDR beamformer may be relevant if there are specific noise sources at fixed relative positions or directions with respect to the position of the microphone array.
- the handsfree system can be adapted to a particular vehicular cabin by adjusting the beamformer such that its zeros point into the direction of specific noise sources.
- a noise source may be formed by a loudspeaker or a fan.
- a handsfree system with MVDR beamformer is already installed during manufacture of the vehicle.
- the typical distribution of noise or noise sources in a particular vehicular cabin can be determined by performing corresponding noise measurements under appropriate conditions (e.g., driving noise with and/or without loudspeaker and/or fan noise).
- the measured data are used for the design of the beamformer. It is to be noted that also in this case, no further adaption is performed during operation of the handsfree system.
- the corresponding superdirective filter coefficients can also be determined theoretically.
- FIG. 10 shows a superdirective beamformer with directional microphones 1001.
- each directional microphone 1001 is depicted by its equivalent circuit diagram.
- d DMA denotes the (virtual) distance of the two omnidirectional microphones composing the first order pressure gradient microphone in the circuit diagram.
- T is the (acoustic) delay line fixing the characteristic of the directional microphone and
- EQ TP is the equalizing low path filter yielding a frequency independent transfer behavior in viewing direction.
- circuits and filters may be realized purely mechanically by taking an appropriate mechanical directional microphone. Again, the distance between the directional microphones is d mic .
- the whole beamforming is performed in the time domain.
- a near field beamsteering 104 is applied to the signals x n [ i ] coming from the microphones and being filtered by the equalizing filter EQ TP .
- the gain factors ⁇ n compensate for the amplitude differences and the delays ⁇ n for the transit time differences of the signals.
- the FIR filters ⁇ n [ i ] realize the superdirectivity in the time domain.
- Mechanical pressure gradient microphones have a high quality and yield, in particular, using a hypercardioid characteristic, an excellent array gain.
- the use of directional microphones results in an excellent Front-to-Back-Ratio as well.
- FIG. 11 An example for another preferred embodiment of a handsfree system is shown in Fig. 11 .
- the system shown in this figure differs from the system of Fig. 1 in that an adaptive noise canceller (ANC) system 1101 is provided between the microphone array 101 and the high-pass filters 103.
- ANC adaptive noise canceller
- the ANC system is particularly useful to reduce motor harmonics in the signal.
- a wanted signal source 1201 particularly corresponding to a speaker, and a noise source 1202 are shown.
- the signal entering a microphone 102 which is part of the microphone array is the sum of a wanted signal s [ k ] and a noise signal n 0 [ k ].
- a noise sensor 1203 is present which is to provide a pure noise signal n 1 [ k ].
- the reference sensor 1203 is a microphone; in this case, such a microphone should be arranged at a place where no or almost no wanted signal is to be recorded.
- the output signal y [ k ] of the adaptive filter 1204 is subtracted from the output signal of the microphone 102.
- the input signal n 1 [ k ] for the adaptive filter 1204 and the output signal or error signal e [ k ] serve for adaption of the noise canceller.
- the noise reduction of the adaptive noise canceller depends only on the coherence of the signals of the microphone 102 and the reference sensor 1204; this coherence function in turn is depending on the distance between microphone and reference sensor.
- the reference sensor is not an acoustic sensor.
- One possibility is to couple an electrical sensor with the tachometer or speed counter which is usually present in a vehicle. After determining the interrelationship between the tachometer signal and the motor noise, the latter may be subtracted from the microphone signal via the adaptive noise canceller. Such an embodiment is shown in Fig. 11 where a tachometer 1102 is coupled to the ANC 1101.
- the ANC need not be placed directly behind the microphone array 101.
- the ANC may be used to filter the output signal x[k] of the superdirective beamformer.
- the ANC is to be placed between the summing circuit 107 and the adaptive post-filter 108.
- a further noise reduction with the help of an ANC system can be achieved by using additional - acoustical or non-acoustical - noise sensors.
- a corresponding embodiment is shown in Fig. 13 .
- the ANC system 1304 is used particularly to suppress signals coming from a loudspeaker 1301, for example, emitting a far end signal 1302.
- the ANC system is able to create a so-called area of silence around the noise sensor or noise sensors. If the microphone array 101 is located in the vicinity of the near speaker, the whole array 101 or one of its microphones 102 may be used as noise sensor. Alternatively, one or more acoustical (1203) and/or non-acoustical (1102) noise sensors are to be installed.
- an acoustical ANC can provide a noise reduction for both the near and far end speaker.
- an additional acoustic echo canceller (AEC) system 1303 may also be provided.
- AEC acoustic echo canceller
- each microphone 102 is provided with an individual AEC filter.
- an AEC filter may be placed between the summing circuit and the adaptive post-filter.
- an ANC may be placed between the summing circuit and the AEC.
- the AEC system used for this invention comprises a conventional AEC filter and an integrated frequency selected echo attenuation which acts as a residual echo suppression (RES) algorithm.
- RES residual echo suppression
- FIG. 14 A preferred embodiment of such a system is shown in Fig. 14 . It comprises a conventional AEC filter 1303 that filters the far end signal 1302. The adaption of the conventional AEC filter 1303 is performed using the signal 1401, e.g. the output signal of an electrical ANC or of one of the microphones.
- an echo shaping means 1402 is provided.
- This echo shaping means has the form of an adaptive FIR filter with coefficient vector H [ k ] that filters the compensated signal e [ k ].
- the coefficient vector H [ k ] of the adaptive filter is taken in each sampling step from another adaptive FIR filter with coefficient vector H 1[ k ].
- the filter with coefficient vector H 1[ k ] is a linear-phase filter of low order.
- the echo shaping means further comprises a delay element T H 1 .
- the resulting signal z [ k ] depends on the time varying factor ⁇ [ k ].
- the output signal y [ k ] of the ANC system is dominant ( ⁇ [ k ] close to 1) at the input of the adaptive filter with coefficient vector H 1[ k ]. Then, the adaptive filter with coefficient vector H 1[ k ] can reduce the error signal E ⁇ [ k ] only by suppressing the signal of the far end speaker. In this case, the near speaker and local noise signals will not be attenuated by the echo shaping means. It is to be noted that the echo shaping algorithm is frequency selective.
- the system shown in Fig. 15 is able to process speech signals from two different positions (for example, from the driver and the front seat passenger in a car).
- the microphone array 101 has a directional diagram with two preferred directions. For example, directional microphones may be used and/or the microphones may be arranged in a suitable way.
- One or several ANC or AEC filters can provide an estimation of the noise level present in the microphone signals that may be used in the dynamic volume control (DVC) 1501 to vary the volume of the far end speech signal 1511 in dependence of the noise level.
- the system comprises a beamformer for both wanted signal sources each comprising a beamsteering means 1502 and 1504 and beamformer filters 1503 and 1505. Following each beamformer, adaptive post-filters 1506 and 1507 are arranged which, in turn, are directly connected to non-adaptive post-filters 1508 and 1509.
- the output signals of the non-adaptive post-filters are fed to a unit 1510 comprising two voice activity detectors and a switch control that generates a weighting factor A for combining both signals.
- a unit 1510 comprising two voice activity detectors and a switch control that generates a weighting factor A for combining both signals.
- each of the signals s s 1 ⁇ k [ k ] and s s 2 ⁇ k [ k ] can be processed by a low-pass filter.
- the contained noise signal or its level is estimated using, for example, a minimum statistics.
- the noise signal level is subtracted from the corresponding filtered signal level.
- the resulting signal levels are compared to a threshold value. Depending on this comparison of both signal levels, the weighting factor A is determined.
- a possible weighting can be determined as follows. If both signal levels are larger than the threshold value, both signals are equally weighted. If one of the signal levels is larger than the threshold value and the other is smaller than the threshold value, the larger signal is weighted by a factor of 1 and the other is fully suppressed (weighting factor 0). If both signal levels are smaller than the threshold value, the signal stemming from the direction of the driver's seat is weighted by a factor of 1 and the other signal is fully suppressed.
- the combined signal may be subject to an additional post processing 1512, for example, a residual echo suppression (RES).
- RES residual echo suppression
- the combined signal is weighted by a spectral short time gain in the frequency domain, wherein the gain factor depends on the spectrum of the far end speech signal.
Landscapes
- Health & Medical Sciences (AREA)
- Otolaryngology (AREA)
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- General Health & Medical Sciences (AREA)
- Circuit For Audible Band Transducer (AREA)
- Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)
Description
- The invention is directed to a handsfree system for use in a vehicle comprising a microphone array with at least two microphones and a signal processing means.
- For making telephone calls in a car, handsfree systems are used more and more since they provide increased comfort and reduce the risk of an accident as the driver is distracted only marginally. Because of that, in many countries, handsfree devices are even required by law.
-
discloses such a handsfree system for use in a vehicle.WO 03/015464 - Usually, a handsfree system comprises a microphone that can be fastened to a user such as the driver.
- Due to the relatively large distance between the speaker's mouth and the microphone, many handsfree devices today suffer from the drawback of a poor speech quality. This is particularly due to the fact that in a car, usually a large ambient noise is present interfering with the speech signal. The noise stems from different sources such as the motor, wind, or car radio.
- However, common methods for noise reduction are often costly to implement and require a large amount of memory and computing power. In particular, a signal processed by conventional noise reduction systems has a relatively large delay time which makes these systems unsuitable for real time applications, i.e. telephone applications.
- It is, therefore, the problem underlying the invention to overcome the above drawbacks and provide a handsfree system for use in a vehicle with improved speech quality.
- This problem is solved by a handsfree system according to
claim 1. - In the context of this invention, the term "connected" also includes the case that a filter or another signal processing means is provided along the signal path between two devices or means. A beamformer processes signals emanating from a microphone array to obtain a combined signal. A beamformer comprises a beamsteering means being responsible for time delay compensation of the different microphones and a summing means. In its simplest form (Delay-and-Sum beamformer), beamforming only comprises delay compensation and summing of the compensated signals. Beamforming allows to provide a specific directivity pattern for a microphone array. Usually, a beamformer can be implemented as digital system with a plurality of digital filter using, for example, digital signal processors (DSP). A beamformer can be configured as an adaptive or a non-adaptive beamformer. Adaptive means that relevant parameters such as filter coefficients can be re-calculated during use of the system in order to adapt the beamformer to changing conditions. In the non-adaptive case, the system parameters are determined once by calibrating the beamformer and, then, kept unchanged. In both cases of a non-adaptive and an adaptive beamformer, the beamforming, in principle, can be performed in the time domain or in the frequency domain.
- A handsfree system in accordance with the invention shows an excellent acoustic performance in a vehicular environment. Due to the beamformer, an improved directivity is obtained and, furthermore, speech signals are enhanced and ambient noise is reduced. The adaptive post-filter (responsible for filtering a signal after the beamforming) further reduces the noise in the signal.
- According to a preferred embodiment, the adaptive post-filter can be a filter in the time domain. If the post-filtering is performed in the time domain, the delay time is reduced and the implementation is simplified.
- The adaptive post-filter is a Wiener filter. It turns out that a Wiener filter is particularly suitable for filtering in a car environment.
- In order to reduce spectral distortions of the filtered signal, the adaptive post-filter is a linear-phase filter. In particular, the adaptive post-filter is a linear-phase Wiener filter.
- According to a preferred embodiment, the signal processing means can further comprise at least two adaptive filters having an input connected to the output of the beamsteering means and an output connected to the adaptive post-filter, wherein the at least two adaptive filters are configured to determine adaptive filter parameters for the adaptive post-filter.
- In this way, background filters are provided for adaptively estimating the filter parameters for the adaptive post-filter.
- Preferably, for each of the at least two microphones, an adaptive filter can be provided having an input connected to the output of the beamsteering means. Thus, for each output of the beamsteering signal corresponding to a microphone, adaptive filter parameters can be determined for the adaptive post-filter. The actual filter parameters of the post-filter can be given, for example, by the filter parameters determined by one of the adaptive filters or the mean of the filter parameters determined by several different adaptive filters.
- Advantageously, an input of each of the at least two adaptive filters can be further connected to the output of the beamformer. This allows for an adaption of the respective filter parameters directly with respect to the beamformed signal.
- According to a preferred embodiment, the signal processing means can further comprise a pre-emphasis filter, in particular, comprising a pre-whitening filter, having an input connected to an output of the adaptive post-filter and/or a pre-emphasis filter, in particular, comprising a pre-whitening filter, having an input connected to the output of the beamsteering means and an output connected to the at least two adaptive filters.
- Such a pre-emphasis filter, on the one hand, emphasizes high frequencies and, on the other hand, attenuates low frequencies which is particularly useful to reduce low frequency correlated noise. Preferably, the pre-emphasis filter can comprise a pre-whitening filter. A pre-whitening filter whitens the spectral distribution of a signal. The filter coefficients of such a pre-whitening filter can be determined using a linear predictive coding (LPC) analysis, for example, via an adaptive lattice predictor (ALP) algorithm.
- According to a preferred embodiment of the above handsfree systems, the signal processing means can further comprise an inverse filter, particularly a warped inverse filter. These filters are especially useful to adjust the microphone transfer function and to match the microphones of the array in this way. Preferably, the beamformer can comprise at least one inverse filter, in particular, having an output for providing an inversely filtered signal to a summing means.
- In order to overcome the matching problem, alternatively or additionally, matched microphones on the basis of silicone or paired microphones may be used.
- The susceptibility of microphone arrays often increases with decreasing frequency. Due to this, a higher matching precision is preferred for low frequencies compared to high frequencies. A frequency depending adjustment of the microphone transfer functions with the use of warped filters reduces the required memory compared to the case of conventional FIR filters.
- Preferably, each inverse filter can be an approximate inverse of a non-minimum phase filter. This results in an inverse filter which is both stable and has no phase error.
- According to a preferred embodiment, an inverse filter may be combined with another filter of the handsfree system, for example, a filter of the beamformer. Such a combination in one filter results in a simplified implementation.
- Preferably, the signal processing means of the above handsfree systems can comprise a non-adaptive post-filter having an input connected to an output of the adaptive post-filter. The non-adaptive post-filter may directly follow the adaptive post-filter. Such a filter is used to compensate for the ambient acoustics of a speaker. Thus, the non-adaptive post-filter may have the form of an inverse room filter.
- In order to further reduce low frequency noise, according to a preferred embodiment, the signal processing means may further comprise an adaptive noise canceller (ANC), for electrical ANC implementations.
- Preferably, the ANC can be connected to a non-acoustic sensor to determine a noise signal, for example, by using the tachometer of the vehicle. The ANC, advantageously, can have an output connected to the input of the beamformer and/or of the adaptive post-filter.
- For a further improvement of the speech signal quality, the signal processing means of the previously described handsfree systems can comprise an acoustic echo canceller AEC. Preferably, the AEC can comprise an echo shaping filter. In this way, a frequency selected echo attenuation may be obtained. As in the case of an ANC, the AEC can have an output connected to the input of the beamformer and/or of the adaptive post-filter.
- According to a preferred embodiment of all previously described handsfree systems, the beamformer can be a non-adaptive beamformer. By using a non-adaptive beamformer with fixed filters, the computing power during operation of the system is reduced.
- Preferably, the beamformer may be a superdirective beamformer which further improves the acoustic performance.
- Advantageously, the beamformer may be a regularized superdirective beamformer using a finite regularization parameter µ. The regularization parameter usually enters the equation for computing the filter coefficients or, alternatively, is inserted into the cross-power spectrum matrix or the coherence matrix. In contrast to the maximum superdirective beamformer (µ = 0), the regularized superdirective beamformer has reduced noise and is less sensitive to an imperfect matching of the microphones.
- The finite regularization parameter µ, preferably, may depend on the frequency. This achieves an improved gain of the array compared to a regularized superdirective beamformer with fixed regularization parameter µ. According to a preferred embodiment, each superdirective filter may result from an iterative design based on a predetermined maximum susceptibility. This enables an optimal adjustment of the microphones, particularly with respect to the transfer function and the position of each microphone.
- By using a predetermined maximum susceptibility, defective parameters of the microphone array can be taken into account to further improve the gain. The maximum susceptibility may be determined as a function of the error in the transfer characteristic of the microphones, the error in the microphone positions and a predetermined (required) maximum deviation in the directional diagram of the microphone array. The time-invariant impulse response of the filters will be determined iteratively only once; there is no adaption of the filter coefficients during operation.
- According to a preferred embodiment, each superdirective filter can be a filter in the time domain. Filtering in the frequency domain is a possible alternative, however, requiring to perform a Fourier transform (FFT) and an inverse Fourier transform (IFFT), thus, increasing the required memory.
- Advantageously, the beamformer may have the structure of a generalized sidelobe canceller (GSC). In this way, at least one filter can be saved. The implementation in the GSC structure, however, is only possible in the frequency domain.
- In order to obtain an optimal adaption of the handsfree system to a particular noise situation, according to a preferred embodiment, the beamformer can be a minimum variance distortionless response (MVDR) beamformer.
- According to a preferred embodiment, the microphone array can comprise at least two microphones being arranged in endfire orientation with respect to a first position. An array in endfire orientation has a better directivity and is less sensitive to a mismatched propagation or delay time compensation. The first position can be the location of the drivers head, for example.
- Preferably, the microphone array can comprise at least two microphones being arranged in endfire orientation with respect to a second position. Thus, the handsfree system of the invention has a good directivity in two directions. Speech signals coming from two different positions, for example, from the driver and the front seat passenger, can both be recorded in good quality.
- According to a preferred embodiment, the signal processing means may comprise at least two beamformers. A first beamformer may be used for signals from a first position and a second beamformer may be used for signals from a second position. In this case, advantageously, the handsfree system may further comprise a voice activity detector (VAD) and/or a switch control means. The switch control and the VAD are used to determine how to combine the output of the at least two beamformers.
- Advantageously, the handsfree system can comprise a residual echo suppression (RES) means and/or a dynamic volume control (DVC). A RES means serves for suppression of residual echoes, in particular, being present in the signal resulting from the adaptive post filter. Thus, a residual echo suppression means can comprise an input connected to the output of the adaptive post filter. Furthermore, a RES means can comprise an input for receiving a far end signal. A DVC is intended for dynamically adapting the output volume of a far end signal depending on the ambient noise level being present in the vehicle.
- According to a preferred embodiment, the at least two microphones in the first endfire orientation (endfire orientation with respect to a first position) and the at least two microphones in the second endfire orientation (endfire orientation with respect to a second position) can have a microphone in common. In this way, already a microphone array consisting of only three microphones provides an excellent directivity for use in a vehicular environment.
- According to a preferred embodiment of all previously discussed handsfree systems, the microphone array may comprise at least two subarrays. Each subarray of microphones may be optimized for a specific frequency band yielding an improved overall directivity.
- To decrease the total number of microphones, preferably, at least two subarrays may have at least one microphone in common.
- According to a preferred embodiment, the above handsfree systems may comprise a frame wherein each microphone of the microphone array is arranged in a predetermined, in particular fixed, position in or on the frame. This ensures that after manufacture of the frame with the microphone, the relative positions of the microphones are known. Such an array can be easily mounted in a vehicular cabin.
- According to a preferred embodiment, at least one microphone may be a directional microphone. The use of directional microphones improves the array gain.
- Preferably, at least one directional microphone may have a cardioid characteristic. This further improves the array gain. More preferred, the cardioid characteristic is a hyper-cardioid characteristic.
- Advantageously, at least one directional microphone may be a differential microphone. This results in a microphone array with excellent directivity and small dimensions, in particular, the differential microphone may be a first order differential microphone.
- The invention is further directed to a vehicle, particularly a car, comprising any of the above-described handsfree systems.
- The invention is also directed to the use of any of the previously described handsfree systems in a vehicle, in particular, a car.
- Furthermore, the invention provides a method for noise reduction in a vehicular handsfree system according to claim 21.
- This method results in an excellent acoustic performance of a handsfree system in a vehicular environment.
- According to a preferred embodiment, the adaptive filtering can be performed in the time domain. In this way, particularly the delay time is reduced.
- Preferably, the method can further comprise
- providing at least two adaptive filters, particularly Wiener filters, wherein
- beam processing the input signals by a beamformer forming comprises beamsteering the input signals for providing beamsteered signals corresponding to one of the at least two microphones and summing the signals, and
- adaptively filtering comprises receiving and processing at least one beamsteered signal by at least one of the at least two adaptive filters to determine adaptive filter parameters for the adaptive post filter.
- According to a preferred embodiment, adaptively filtering can further comprise receiving a signal resulting from the beamformed signal by at least one adaptive filter and wherein processing the beamsteered signal can comprise determining adaptive filter parameters using the at least one beamsteered signal and the signal resulting from the beamformed signal.
- Preferably, for each beamsteered signal, an adaptive filter can be provided for determining adaptive filter parameters using the beamsteered signal and the signal resulting from the beamformed signal.
- In order to reduce low frequency correlated noise, receiving at least one beamsteered signal by at least one of the at least two adaptive filters can comprise processing the at least one beamsteered signal by a pre-emphasis filter, in particular, comprising a pre-whitening filter.
- According to an advantageous embodiment, the above methods can further comprise processing a signal resulting from the microphone array by an inverse filter, in particular, a warped inverse filter.
- Preferably, the methods can further comprise non-adaptively filtering a signal resulting from the adaptively filtered signal and/or processing a signal resulting from the adaptively filtered signal by a pre-emphasis filter.
- The above method, advantageously, can further comprise processing a signal resulting from the microphone array, particularly resulting from the beamformed signal, by an adaptive noise canceller (ANC) and/or an acoustic echo canceller (AEC) and/or a residual echo suppression (RES) means.
- According to a preferred embodiment, the input signals can be processed by a non-adaptive and/or superdirective and/or minimum variance distortionless response (MVDR) beamformer.
- The invention also provides a computer program product comprising one or more computer readable media having computer-executable instructions for performing the steps of the above described methods.
- Additional features and advantages will be described with reference to the examples illustrated in the drawings:
- Fig. 1
- illustrates the structure of a handsfree system according to the invention with an adaptive post-filter in the time domain;
- Fig. 2
- shows the structure of a beamformer in the frequency domain;
- Fig. 3
- illustrates an FXLMS algorithm;
- Fig. 4
- shows the structure of a beamformer in the time domain;
- Figs. 5A, 5B
- illustrate preferred embodiments of arrangements of the microphone array in a vehicle;
- Figs. 6A, 6B
- illustrate preferred embodiments of arrangements of a microphone array in a mirror;
- Fig. 7
- shows a microphone array consisting of three subarray;
- Fig. 8
- illustrates a superdirective beamformer in a GSC structure;
- Fig. 9
- illustrates a microphone array with two microphones in a noise field with a noise free sector;
- Fig. 10
- shows the structure of a superdirective beamformer comprising four first order gradient microphones;
- Fig. 11
- illustrates the structure of a handsfree system with an electrical ANC;
- Fig. 12
- shows the structure of an ANC;
- Fig. 13
- shows the structure of an embodiment of a handsfree system according to the invention with an ANC and AEC;
- Fig. 14
- illustrates the structure of an AEC; and
- Fig. 15
- shows another embodiment of a handsfree system according to the invention.
- An example of the handsfree system in accordance with the present invention is shown in
Fig. 1 . In the following, first, the general structure will be shortly described, and, then, the different components will be explained in more detail. In the figures, it is to be noted that the dotted lines encasing some elements simply serve for better understanding of the figures without necessarily implying any actual combination or separation of different elements. - The main components of the system are a microphone array, a beamformer and an adaptive post-filter in the time domain. The
microphone array 101, in this example, comprises fourmicrophones 102. Eachmicrophone 102 yields an output signal xi[k]. The microphone signals may be filtered by an optional high-pass filter 103. - Then, the signals are passed to a beamformer. This beamformer may be a conventional delay and sum beamformer. However, in the present example, a preferred superdirective beamformer is shown. Such a beamformer comprises beamsteering means 104 and filters 105. The output signals of the beamformer may be passed through optional
inverse filters 106 and, then, are summed by summingmeans 107 to yield a resulting beamformed signal x[k]. - This signal is passed through an
adaptive post-filter 108 in the time domain which may be followed by an optionalnon-adaptive post-filter 109 and/or by an optional pre-emphasis filter (not shown). The adaption of the post-filter 108 is performed using a set of Wiener filters 110. The input signals of the Wiener filters 110 comprise, on the one hand, the individual signals resulting from the different microphones and, on the other hand, the summed signal x[k]. In the present example, the microphone signals are taken after the beamsteering. However, if the beamformer comprises further (superdirective) filters 105 as in the present case, it is also possible to take the microphone signals after this additional filtering. Before being presented to the Wiener filters, the microphone signals are passed through anoptional pre-emphasis filter 111. - In the following, the functioning of a Wiener filter will be explained. A microphone signal x[k] is the sum of the speech signal s[k] and the noise n[k]. The microphone signal will be filtered by an impulse response w(i) to obtain a noise reduced signal s̃[k]. It is the aim to minimize the mean square error between the undisturbed speech signal s[k] and the output signal s̃[k]:
- In other words, the partial derivative of the mean square error with respect to the coefficients of the impulse response has to vanish yielding the Wiener-Hopf equation:
wherein rxx (l) and rsx (l) are the auto-correlation function and the cross-correlation function of the microphone signal and the undisturbed speech signal. One may assume that the speech signal and the noise are statistically independent, i.e. rsx (l) = r ss(l), thus, -
- In order to obtain a time variant filter, the power spectral densities in the above equation may be replaced by the corresponding short-time estimated values that may be obtained, for example, by a recursive averaging:
wherein S(κ,ν) and X(κ,ν) are short-time spectra that may be determined, for example, with the help of DFT filter banks or an FFT. Here, κ is the time index and ν the frequency index; E̅{.} represents the short-time average that may be obtained, for example, with the help of a first order IIR filter. - The short-time auto power spectral density of the speech signal in the numerator of the above equation is to be estimated in a suitable way. Appropriate estimation methods include spectral subtraction (estimating the auto power spectral density of the noise), minimum mean square error short-time spectral amplitude (MMSE STSA) estimator or MMSE log-SA estimator or a speech pause detector, for example.
- It is also possible to estimate the short-time auto power spectral density of the noise signal with the help of the coherence between two or more microphones. In a second step, the estimated short-time auto power spectral density of the noise signal may be used to estimate the absolute value of the most probable Fourier coefficient (using, for example, a spectral subtraction algorithm or an MMSE log-SA estimator) and to reconstruct the absolute value of the spectrum of the speech signal. For the multi-channel noise reduction, one estimates that the coherence or the cross power spectral density of the noise signals received by the microphones is vanishing. In the case of two microphones, for example, the microphone signal has the form:
wherein h 1(i) and h 2(i) are the impulse responses representing the acoustic transfer between the source of speech and the microphones. Both parts of the speech signal filtered in this way are superimposed with the uncorrelated noise signals n 1[k] and n 2[k]. -
-
- In
Fig. 1 , the adaption of the post-filter w(k,i) - k being the time index and i denoting the coefficient within the impulse response - is performed in the time domain, for example, with the help of the LMS algorithm. The background Wiener filters w 1(k, i),...,w 4(k, i) are minimize the error signals e 1[k],...,e 4[k] such that, for example, the filter w 4(k, i) tends towards the frequency response wherein - The form of the other three Wiener filters is obtained by a cyclic permutation of the indices.
- It is to be understood that the system is not restricted to a particular number of Wiener filters 110. Furthermore, not every
Wiener filter 110 is always to be used to determine theadaptive post-filter 108. For example, one may use only the Wiener filter which uses the microphone signal of the microphone proximal to the source of speech. -
-
- Using this symmetry condition, the filter coefficients of a linear-phase post-filter (with length L) can be obtained. Accordingly, the linear-phase post-filter has twice the length of one of the background filters 110 (with length L/2). Such a linear-phase filter only modifies the amplitude spectrum of the input signal of the filter without a frequency dependent distortion of the phase spectrum.
- The performance of the filter can be further improved by smoothing its frequency response. This can be achieved by weighting the filter coefficients with a window function.
- The
inverse filters 106 serve to compensate for the acoustic transfer function of the path between the source of speech and the microphones. - In
Figure 2 , the structure of a superdirective beamformer is shown. The beamformer shown in this figure performs the filtering in the frequency domain, in contrast to the case ofFigure 1 . If a beamformer in the frequency domain were used inFigure 1 , an inverse Fourier transform is to be performed on the signals before passing the signals to the Wiener filters 110 or thepre-emphasis filter 111. - In
Figure 2 , the microphone array consists ofM microphones 102, each yielding a signal xi(t). The signals xi(t) are transferred to the frequency domain by fast Fourier transform (FFT) means 201, resulting in a signal Xi(ω). In general, the beamforming consists of a beamsteering and a filtering. The beamsteering is responsible for the propagation time compensation. The beamsteering is performed by a steering vector with and wherein pref denotes the position of a reference microphone, pn the position of microphone n, q the position of the source of sound (for example, the speaker), f the frequency and c the velocity of sound. In the far field, one has - According to a rule of thumb, one has the far field situation if the source of the useful signal is more than twice as far from the microphone array as the maximum dimension of the array. In
Figure 2 , a far field beamformer is shown since only a phase factor ejωτk denoted byreference sign 202 is applied to the signals Xk (ω). - After the beamsteering, the signals are filtered by
superdirective filters 203 that are filters in the frequency domain. The filtered signals are summed yielding a signal Y(ω). After an inverse fast Fourier transform (IFFT) bymeans 204, the resulting signal y[k] is obtained. -
-
-
- The above described design rule for computing the optimal filter coefficients Ai (ω) for a homogenous diffuse noise field is based on the assumption that the microphones are perfectly matched, i.e. point-like microphones having exactly the same transfer function. In practice, therefore, a so-called regularized filter design may be used to adjust the filter coefficients. To achieve this, a scalar (the regularization parameter µ) is added at the main diagonal of the cross-correlation matrix. In a slightly modified version, all elements of the coherence matrix not on the main diagonal are divided by (1 + µ):
- Alternatively, the regularization parameter µ may be introduced into the equation for computing the filter coefficients:
wherein I is the unity matrix. For convenience, in the following, the second approach where the regularization parameter is part of the filter equation will be discussed in more detail. It is to be understood, however, that the first approach is equally suitable. - Before discussing the superdirective beamformer in more detail, some characteristic quantities of a microphone array are to be defined. The directional diagram or response pattern (Ψ(ω,Θ) of a microphone array characterizes the sensitivity of the array as a function of the direction of incidence Θ for different frequencies .
- A measure to describe the directivity of an array is the so-called gain that does not depend on the angle of incidence Θ. The gain is defined as the sensitivity of the array in the main direction of incidence with respect to the sensitivity for omnidirectional incidence.
- The Front-To-Back-Ratio (FBR) indicates the sensitivity in front receiving direction compared to the back.
- The white noise gain (WNG) describes the ability of the array to suppress uncorrelated noise, for example, the inherent noise of the microphones. The inverse of the white noise gain is the susceptibility K(ω),
- The susceptibility K(ω) describes the array's sensitivity to defective parameters. It is often preferred that the susceptibility K(ω) of the array filters Ai (ω) does not exceed an upper bound K max(ω). The selection of this upper bound may be dependent on the relative error Δ2(ω, Θ) of the microphones and, for example, on requirements regarding the directional diagram Ψ(ω, Θ). The relative error Δ2(ω,Θ), in general, is the sum of the mean square error of the transfer properties of all microphones ε 2(ω,Θ) and the Gaussian error with zero mean of the microphone positions δ2(ω).
-
- It is to be noted that in many cases the dependence on the angle Θ can be neglected.
- In practice, the error in the microphone transfer functions ε(ω) has a higher influence on the maximum susceptibility K max(ω) and, thus, also on the maximum possible gain G(ω) than the error δ 2 (ω) in the microphone positions. In other words, the defective transfer functions are mainly responsible for the limitation of the maximum susceptibility.
- A higher mechanical precision to reduce the position deviations of the microphones is only sensible up to a certain point since the microphones usually are modeled as being point-like, which is not true in reality. Thus, one can fix the positioning errors δ 2(ω) to a specific value, even if a higher mechanical precision could be achieved. For example, one can take δ2 (ω)=1% which is quite realistic. The error ε(ω) can be derived from the frequency depending deviations of the microphone transfer functions.
- To compensate the above-mentioned errors, inverse filters may be used to adjust the individual microphone transfer functions to a reference transfer function. Such a reference transfer function can be the transfer function of one microphone out of the array or, for example, the mean of all measured transfer functions. In case of the first possibility, only M -1 inverse filters (M being the number of microphones) are to be computed and implemented.
- In general, the transfer functions are not minimal phase, thus, a direct inversion would yield instable filters. Usually, one inverts only the minimum phase part of the transfer function (resulting in a phase error) or one inverts the ideal (non-minimum phase) filter only approximately. In the following, the approximate inversion with the help of an FXLMS (filtered X least mean square) or the FXNLMS (filtered X normalized least mean square) algorithm will be described.
- After computing of the inverse filters, they may be coupled with the superdirective filters Ai (ω) such that, in the end, only one filter per viewing direction and microphone is to be implemented.
- The FXLMS or the FXNLMS algorithm is described with reference to
Figure 3 . The error signal e[n] at time n is calculated according to with the input signal vector wherein L denotes the filter length of the inverse filter W(z). The filter coefficient vector of the inverse filter has the form the filter coefficient vector of the reference transfer function P(z) and the filter coefficient vector of the n-th microphone transfer function s(z) - The update of the filter coefficients of w[n] is performed iteratively, i.e. at each time step n , whereby the filter coefficient w[n] are computed such that the instantaneous squared error e2 [n] is minimized. This can be achieved, for example, by using the LMS algorithm:
or by using the NLMS algorithm wherein µ characterizes the adaption steps and denotes the input signal vector filtered by S(z). - In general, the susceptibility increases with decreasing frequency. Thus, it is preferred to adjust the microphone transfer functions depending on frequency, in particular, with a high precision for low frequencies. To achieve a high precision of the inverse filters, the FIR filters, for example, are to be very long in order to obtain a sufficient frequency resolution in the desired frequency range. This means that the expenditure, in particular, regarding the memory, increases rapidly. When using a reduced sampling frequency of, for example, fa = 8kHz, the computing time does not impose a severe limitation. A suitable frequency depending adaption of the transfer functions can be achieved by using short WFIR filters (warped filters).
- One possible iterative method to design the filters Ai (ω) with predetermined susceptibility goes as follows:
- 1. Set µ(ω) = 1.
- 2. Determine the transfer functions of the filters Ai (ω) and the resulting susceptibilities K(ω) according to the equations:
and - 3. If the susceptibility K(ω) is larger than the maximum susceptibility (K(ω) > (Kmax (ω)), increase µ in the following step, otherwise, decrease µ.
- 4. Repeat steps 2 and 3 until the susceptibility K(ω) is sufficiently close to the predetermined value K max(ω). The iteration is to break off if µ becomes smaller than a lower limit of, for example, µmin =10-8 . Such a termination criterion is mainly necessary for high frequencies f ≥cl(2dmic ).
- Of course, there are other possibilities to compute the filters Ai (ω). For example, one can use a fixed parameter µ for all frequencies. This simplifies the computation of the filter coefficients. It is to be noted that the above iterative method is not used for a real time adaption of the filter coefficients during operation.
- A realization of the beamforming filters in the time domain - as in the system of
Figure 1 - is described with reference toFigure 4 . Signals are recorded bymicrophones 102. Anear field beamsteering 104 is performed using gain factors ν k 401 to compensate for the amplitude differences and time delays τ k 402 to compensate for the propagation time differences of the microphone signals xk [i] The realization of the superdirective beamforming is achieved using the filters (preferably, FIR filters) αk (i) indicated byreference sign 403. - The impulse responses α1(i),...,αM (i) can be determined as follows:
- 1. Determine the frequency responses Ai (ω) according to the above equation.
- 2. To obtain real valued impulse responses α1(i),...,αM (i), chose the frequency responses above half of the sampling frequency to
with ω A denoting the sampling angular frequency. - 3. Transfer these frequency responses to the time domain using an IFFT yielding the desired FIR filter coefficients α1(i),...,αM (i).
- 4. Applying a window function, for example, a Hamming window, to the FIR filter coefficients a 1 (i),...,αM (i).
- As can be seen in
Figure 4 , in contrast to the beamforming in the frequency domain as described above, the microphone signals are directly processed using thebeamsteering 104 in the time domain. Thebeamsteering 104 is followed by theFIR filtering 403. After summing the filtered signals, a resulting enhanced signal y[k] is obtained. -
- The higher the sampling frequency fa or the higher the distance between adjacent microphones, the more transit time Δmax (in taps of delay) is to be compensated for. The number of taps increases also if the distance between speaker and microphone arrays is decreased. In the near field, more transit time is to be compensated for than in the far field. It turns out that an array in endfire orientation is less sensitive to a defective transit time compensation Δmax than an array in broad-side orientation.
- In a vehicle, the average distance between the speaker, in particular, its head, and the array is about 50cm. Due to a movement of the head, this distance can change by about +/- 20cm. If a transit time error of 1 tap is acceptable, the distance between the microphones in broad-side orientation with a sampling frequency of fa = 8kHz should be smaller than about dmic_max (broad -side)≅ 5cm. With the same conditions, the maximum distance between the microphones in endfire orientation may be about dmic_max (endfire) ≅ 20cm.
- On the other hand, having a distance between the microphones of about 5cm, it turns out that a sampling frequency of fa = 16kHz provides excellent results for an endfire orientation whereas in broad-side orientation, only a sampling frequency of f a = 8kHz can be used without adaptive beamsteering. In other words, in endfire orientation, the sampling frequency or the distance between the microphones can be chosen much higher than in the broad-side case, thus, resulting in an improved beamforming.
- In this context, it is to be pointed out that the larger the distance between the microphones, the sharper the beam, in particular, for low frequencies. A sharper beam at low frequencies increases the gain in this range which is important for vehicles where the noise is mostly a low frequency noise.
-
- A violation of this sampling theorem has the consequence that at higher frequencies, large grating lobes appear. These grating lobes, however, are very narrow and deteriorate the gain only slightly. The maximum microphone distance that can be chosen depends not only on the lower limiting frequency for the optimization of the directional characteristic, but also on the number of microphones and on the distance of the microphone array to the speaker. In general, the larger the number of microphones, the smaller their maximum distance in order to optimize the Signal-To-Noise-Ratio (SNR). For a distance between array and speaker of 50cm, the microphone distance, preferably, is about dmic = 40cm with two microphones (M = 2) and about dmic = 20cm for M = 4.
- A further improvement of the directivity, and, thus, of the gain, can be achieved by using unidirectional microphones instead of omnidirectional ones; this will be discussed in more detail below.
-
Figures 5A and5B show preferred arrangements of microphone arrays in a vehicle. In general, the distance between the microphone array and the speaker should be as small as possible. - According to a first embodiment (
Figure 5A ), eachspeaker 501 may have its own microphone array comprising at least twomicrophones 102. The microphone arrays may be provided at different locations, for example, within the headliner, dashboard, pillar, headrest, steering wheel, compartment door, visor or (driving) mirror. An arrangement within the roof is also a preferred possibility that is, however, not suitable for the case of a cabriolet. Both microphone arrays for each speaker are in endfire orientation. - In an alternative embodiment (
Figure 5B ), one microphone array is used for two neighboring speakers. In both embodiments, preferably, directional microphones, in particular, having a cardioid characteristic, may be used. - In the embodiment of
Figure 5B , the microphone array may be mounted within the mirror. Such a linear microphone array may be used for both the driver and the front seat passenger. A costly mounting of the microphones in the roof can be avoided. Furthermore, the array may be mounted in one piece, which ensures a high mechanical precision. Due to the adjustment of the mirror, the array would always be correctly oriented. -
Figure 6A shows a top view on a (driving)mirror 601 of a car with three microphones in two alternative arrangements. According to the first alternative, two 602 and 603 are located in the center of the mirror in endfire orientation with respect to the driver and, preferably, have a distance dmic of about 5cm between each other. Themicrophones 603 and 604 are in endfire orientation with respect to the front seat passenger and have a distance of about 10cm between each other. Since themicrophones microphone 603 is used for both arrays, a cheap handsfree system can be provided. - All three microphones may be directional microphones, preferably having a cardioid characteristic, for example, a hypercardioid characteristic. Alternatively,
602 and 604 are directional microphones, whereasmicrophones microphone 603 is an omnidirectional microphone which further reduces the costs. If all three microphones are directional microphones, preferably, 602 and 603 are directed towards the driver.microphones - Due to the larger distance between
603 and 604 than betweenmicrophones 602 and 603, the front seat passenger beamformer has a better SNR at low frequencies.microphones - According to an alternative embodiment, the microphone array for the driver consists of microphones 602' and 603' located at the left side of the mirror. In this case, the distance between this microphone array and the driver would be increased, thus, decreasing the performance. On the other hand, the distance between
microphone 603' and 604 would be about 20cm, which yields a better gain for the front seat passenger at low frequencies. - A variant of two microphone arrays with improved precision is shown in
Figure 6B . Also in this case, all microphones may be directional microphones, 602 and 603 being directed to the driver,microphones 604 and 605 being directed to a front seat passenger. In this example, the microphone array for the front seat passenger comprises the threemicrophones 603, 604 and 605, which increases the gain considerably.microphones - It is to be noted that these arrangements are only examples that may be varied by changing the position and number of the microphones. In particular, an arrangement may be optimized with regard to a specific vehicular cabin.
-
Figure 7 illustrates a microphone array comprising three 701, 702, and 703, each subarray consisting of five microphones. Within eachsubarrays 701, 702, and 703, the microphones are equidistantly arranged. In thesubarray total array 704, the distances are no longer equal. As can be seen in this figure, some microphones are used for different arrays, therefore, for the total array, only 9 microphones and not 3.5 = 15 microphones are necessary. - In this figure, it is further indicated that the different subarrays are used for different frequency ranges. The resulting directional diagram is then built up of the directional diagrams of each subarray for the respective frequency range. For the special case of
Figure 7 ,subarray 701 with dmic = 5cm is used for the frequency band of 1400 - 3400 Hz,subarray 702 with dmic = 10cm for the frequency band of 700 - 1400 Hz, andsubarray 703 with dmic = 20cm for the band of frequencies smaller than 700 Hz. A lower limit of this frequency band may be imposed, for example, by the lowest frequency of the telephone band (the frequencies used in telephone applications) which, presently, is 300 Hz in most cases. - An improved directional characteristic may be obtained if the superdirective beamformer is designed as general sidelobe canceller (GSC). In this structure, at least one filter can be saved. Such a superdirective beamformer in GSC structure is shown in
Figure 8 . The GSC structure is to be implemented in the frequency domain, thus, anFFT 201 is applied to the incoming signals xk (t) frommicrophones 102. Before the general sidelobe cancelling, a time alignment using phase factors ejωτk has to be performed (in this figure, a far field beamsteering is shown). If a GSC beamformer is used in the handsfree system ofFigure 1 , for example, again, an inverse Fourier transform is to be performed before passing the signal to the Wiener filters 110 or thepre-emphasis filter 111. - In
Figure 8 , X denotes a vector comprising all time aligned input signals Xi (ω). A c is a vector comprising all frequency independent filter transfer functions Ai that are necessary to observe the constraints in viewing direction; H is the vector of the transfer functions performing the actual superdirectivity; and B is the so-called blocking matrix projecting the input signals in X onto the "noise plane". The signal YDS (ω) denotes the output signal of the delay and sum beamformer, YBM (ω) the resulting output signal of the blocking branch, YSD(ω) the output signal of the superdirective beamformer xi (t), and Xi (ω) the input signals in the time and frequency domain that are not yet time aligned, and Yi (ω) the output signals of the blocking matrix that ideally should block completely the desired or useful signal within the input signals. The signals Yi (ω) ideally only comprise the noise signals. - In addition to the superdirective output signal, a GSC structure also yields a delay and sum beamformer signal and a blocking output signal. The number of filters that can be saved using the GSC depends on the choice of the blocking matrix. Usually, a Walsh-Hadamard blocking matrix is preferred instead of a Griffiths-Jim blocking matrix since more filters can be saved with a Walsh-Hadamard blocking matrix. Unfortunately, the Walsh-Hadamard blocking matrix can only be given for arrays consisting of M = 2n microphones.
- In principle, a blocking matrix should have the following properties:
- 1. It is a (M -1)·M -Matrix.
- 2. The sum of the values within one row vanishes.
- 3. The matrix is of rank M -1.
-
-
-
- The computation of the filter coefficients of a superdirective beamformer in GSC structure is slightly different compared to the conventional superdirective beamformer. The transfer functions Hi (ω) are to be computed as
wherein B is the blocking matrix and Φnn (ω) the matrix of the cross-correlation power spectrum of the noise. In the case of a homogenous noise field, Φ nn (ω) may be replaced by the time aligned coherence matrix of the diffuse noise field, as discussed above. - A regularization and the iterative design with predetermined susceptibility may be performed in the same way as above.
- All previously discussed filter designs only assume that the noise field is homogenous and diffuse. These designs may be generalized by excluding a region around the main receiving direction Θ0 when determining the homogenous noise field. In this way, mainly the Front-To-Back-Ratio may be optimized. This is illustrated in
Figure 9 withmicrophones 102 where a sector of +/-δ is excluded. The computing of the two-dimensional diffuse (cylindrically isotropic), homogenous noise field can be performed using the new design parameter δ: - This method may also be generalized to the three-dimensional case. Then, in addition to the parameter δ being responsible for the azimuth, a further parameter ρ is to be introduced for the elevation angle. This yields an analog equation for the coherence of the homogeneous diffuse 3D noise field.
- A superdirective beamformer based on an isotropic noise field is particularly useful for a handsfree system which is to be installed later in a vehicle. This is the case, for example, if the handsfree system is installed in the vehicle by the user itself. On the other hand, an MVDR beamformer may be relevant if there are specific noise sources at fixed relative positions or directions with respect to the position of the microphone array. In this case, the handsfree system can be adapted to a particular vehicular cabin by adjusting the beamformer such that its zeros point into the direction of specific noise sources. For example, such a noise source may be formed by a loudspeaker or a fan. Preferably, a handsfree system with MVDR beamformer is already installed during manufacture of the vehicle.
- The typical distribution of noise or noise sources in a particular vehicular cabin can be determined by performing corresponding noise measurements under appropriate conditions (e.g., driving noise with and/or without loudspeaker and/or fan noise). The measured data are used for the design of the beamformer. It is to be noted that also in this case, no further adaption is performed during operation of the handsfree system.
- Alternatively, if the relative position of a noise source is known, the corresponding superdirective filter coefficients can also be determined theoretically.
- As already stated above, the use of directional microphones further improves the signal enhancement.
Figure 10 shows a superdirective beamformer withdirectional microphones 1001. In this figure, eachdirectional microphone 1001 is depicted by its equivalent circuit diagram. In these circuit diagrams, dDMA denotes the (virtual) distance of the two omnidirectional microphones composing the first order pressure gradient microphone in the circuit diagram. T is the (acoustic) delay line fixing the characteristic of the directional microphone and EQTP is the equalizing low path filter yielding a frequency independent transfer behavior in viewing direction. - In practice, these circuits and filters may be realized purely mechanically by taking an appropriate mechanical directional microphone. Again, the distance between the directional microphones is dmic . In
Figure 10 , the whole beamforming is performed in the time domain. Anear field beamsteering 104 is applied to the signals xn [i] coming from the microphones and being filtered by the equalizing filter EQTP . The gain factors ν n compensate for the amplitude differences and the delays τ n for the transit time differences of the signals. The FIR filters α n [i] realize the superdirectivity in the time domain. - Mechanical pressure gradient microphones have a high quality and yield, in particular, using a hypercardioid characteristic, an excellent array gain. The use of directional microphones results in an excellent Front-to-Back-Ratio as well.
- An example for another preferred embodiment of a handsfree system is shown in
Fig. 11 . The system shown in this figure differs from the system ofFig. 1 in that an adaptive noise canceller (ANC)system 1101 is provided between themicrophone array 101 and the high-pass filters 103. The ANC system is particularly useful to reduce motor harmonics in the signal. - The structure of an adaptive noise canceller is shown in
Fig. 12 . In this figure, a wantedsignal source 1201, particularly corresponding to a speaker, and anoise source 1202 are shown. The signal entering amicrophone 102 which is part of the microphone array is the sum of a wanted signal s[k] and a noise signal n 0[k]. In addition, anoise sensor 1203 is present which is to provide a pure noise signal n 1[k]. InFig. 12 , thereference sensor 1203 is a microphone; in this case, such a microphone should be arranged at a place where no or almost no wanted signal is to be recorded. The output signal y[k] of theadaptive filter 1204 is subtracted from the output signal of themicrophone 102. The input signal n 1[k] for theadaptive filter 1204 and the output signal or error signal e[k] serve for adaption of the noise canceller. The noise reduction of the adaptive noise canceller depends only on the coherence of the signals of themicrophone 102 and thereference sensor 1204; this coherence function in turn is depending on the distance between microphone and reference sensor. - According to a preferred alternative, the reference sensor is not an acoustic sensor. One possibility is to couple an electrical sensor with the tachometer or speed counter which is usually present in a vehicle. After determining the interrelationship between the tachometer signal and the motor noise, the latter may be subtracted from the microphone signal via the adaptive noise canceller. Such an embodiment is shown in
Fig. 11 where atachometer 1102 is coupled to theANC 1101. - The ANC need not be placed directly behind the
microphone array 101. According to a preferred alternative, the ANC may be used to filter the output signal x[k] of the superdirective beamformer. In this case, the ANC is to be placed between the summingcircuit 107 and theadaptive post-filter 108. - A further noise reduction with the help of an ANC system can be achieved by using additional - acoustical or non-acoustical - noise sensors. A corresponding embodiment is shown in
Fig. 13 . In this embodiment, theANC system 1304 is used particularly to suppress signals coming from aloudspeaker 1301, for example, emitting afar end signal 1302. The ANC system is able to create a so-called area of silence around the noise sensor or noise sensors. If themicrophone array 101 is located in the vicinity of the near speaker, thewhole array 101 or one of itsmicrophones 102 may be used as noise sensor. Alternatively, one or more acoustical (1203) and/or non-acoustical (1102) noise sensors are to be installed. Ideally, an acoustical ANC can provide a noise reduction for both the near and far end speaker. - As can be seen in
Fig. 13 , an additional acoustic echo canceller (AEC)system 1303 may also be provided. Such an AEC system is optional, but allows a suppression of reverberation. Preferably, eachmicrophone 102 is provided with an individual AEC filter. Alternatively, an AEC filter may be placed between the summing circuit and the adaptive post-filter. In such an alternative embodiment, an ANC may be placed between the summing circuit and the AEC. - Advantageously, the AEC system used for this invention comprises a conventional AEC filter and an integrated frequency selected echo attenuation which acts as a residual echo suppression (RES) algorithm. A preferred embodiment of such a system is shown in
Fig. 14 . It comprises aconventional AEC filter 1303 that filters thefar end signal 1302. The adaption of theconventional AEC filter 1303 is performed using thesignal 1401, e.g. the output signal of an electrical ANC or of one of the microphones. Furthermore, an echo shaping means 1402 is provided. This echo shaping means has the form of an adaptive FIR filter with coefficient vector H[k] that filters the compensated signal e[k]. The coefficient vector H[k] of the adaptive filter is taken in each sampling step from another adaptive FIR filter with coefficient vector H1[k]. Preferably, the filter with coefficient vector H1[k] is a linear-phase filter of low order. The echo shaping means further comprises a delay element T H1. The adaption algorithm is based on the power difference between y[k] and e[k] by filtering according to wherein the compensated signal e[k] is the reference signal. The resulting signal z[k] depends on the time varying factor α[k]. In case the far end speaker is active, the output signal y[k] of the ANC system is dominant (α[k] close to 1) at the input of the adaptive filter with coefficient vector H1[k]. Then, the adaptive filter with coefficient vector H1[k] can reduce the error signal E̅[k] only by suppressing the signal of the far end speaker. In this case, the near speaker and local noise signals will not be attenuated by the echo shaping means. It is to be noted that the echo shaping algorithm is frequency selective. - The system shown in
Fig. 15 is able to process speech signals from two different positions (for example, from the driver and the front seat passenger in a car). Themicrophone array 101 has a directional diagram with two preferred directions. For example, directional microphones may be used and/or the microphones may be arranged in a suitable way. One or several ANC or AEC filters can provide an estimation of the noise level present in the microphone signals that may be used in the dynamic volume control (DVC) 1501 to vary the volume of the farend speech signal 1511 in dependence of the noise level. The system comprises a beamformer for both wanted signal sources each comprising a beamsteering means 1502 and 1504 and 1503 and 1505. Following each beamformer,beamformer filters 1506 and 1507 are arranged which, in turn, are directly connected toadaptive post-filters 1508 and 1509.non-adaptive post-filters - The output signals of the non-adaptive post-filters are fed to a
unit 1510 comprising two voice activity detectors and a switch control that generates a weighting factor A for combining both signals. According to a possible example for such aunit 1510, each of the signals s [k] ands [k] can be processed by a low-pass filter. Then, for each of the filtered signals, the contained noise signal or its level is estimated using, for example, a minimum statistics. The noise signal level is subtracted from the corresponding filtered signal level. The resulting signal levels are compared to a threshold value. Depending on this comparison of both signal levels, the weighting factor A is determined. For example, a possible weighting can be determined as follows. If both signal levels are larger than the threshold value, both signals are equally weighted. If one of the signal levels is larger than the threshold value and the other is smaller than the threshold value, the larger signal is weighted by a factor of 1 and the other is fully suppressed (weighting factor 0). If both signal levels are smaller than the threshold value, the signal stemming from the direction of the driver's seat is weighted by a factor of 1 and the other signal is fully suppressed. - The combined signal may be subject to an
additional post processing 1512, for example, a residual echo suppression (RES). For a RES, the combined signal is weighted by a spectral short time gain in the frequency domain, wherein the gain factor depends on the spectrum of the far end speech signal.
Claims (31)
- Handsfree system for use in a vehicle comprising a microphone array (101) with at least two microphones (102), a signal processing means, and an adaptive post-filter (108),
wherein the signal processing means comprises a beamformer having an input connected to the at least two microphones and an output connected to the input of the adaptive post-filter, the beamformer comprising a beamsteering means and a summing means,
wherein the adaptive post-filter is a linear-phase Wiener filter. - Handsfree system according to claim 1, wherein the adaptive post-filter is a filter in the time domain.
- Handsfree system according to one of the preceding claims, wherein the signal processing means further comprises at least two adaptive filters (110) having an input connected to the output of the beamsteering means (104) and an output connected to the adaptive post-filter, wherein the at least two adaptive filters are configured to determine adaptive filter parameters for the adaptive post-filter.
- Handsfree system according to claim 3, wherein for each of the at least two microphones, an adaptive filter is provided having an input connected to the output of the beamsteering means.
- Handsfree system according to claim 3 or 4 , wherein an input of each of the at least two adaptive filters is further connected to the output of the beamformer.
- Handsfree system according to one of the preceding claims, wherein the signal processing means further comprises a pre-emphasis filter (111), in particular, comprising a pre-whitening filter, having an input connected to an output of the adaptive post-filter and/or a pre-emphasis filter, in particular, comprising a pre-whitening filter, having an input connected to the output of the beamsteering means and an output connected to the at least two adaptive filters.
- Handsfree system according to one of the preceding claims, wherein the signal processing means further comprises an inverse filter (106), particularly a warped inverse filter.
- Handsfree system according to one of the preceding claims, wherein the signal processing means further comprises a non-adaptive post-filter (109) having an input connected to an output of the adaptive post-filter.
- Handsfree system according to one of the preceding claims, wherein the signal processing means further comprises an adaptive noise canceller (ANC) (1101) and/or an acoustic echo canceller (AEC) (1303).
- Handsfree system according to one of the preceding claims, wherein the beamformer is a non-adaptive beamformer and/or a superdirective beamformer and/or a minimum variance distortionless response (MVDR) beamformer.
- Handsfree system according to one of the preceding claims, wherein the microphone array comprises at least two microphones being arranged in endfire orientation with respect to a first position.
- Handsfree system according to claim 11, wherein the microphone array comprises at least two microphones being arranged in endfire orientation with respect to a second position.
- Handsfree system according to claim 12, wherein the at least two microphones in the first endfire orientation and the at least two microphones in the second endfire orientation have a microphone in common.
- Handsfree system according to one of the preceding claims, wherein the signal processing means comprises at least two beamformers.
- Handsfree system according to claim 14, further comprising a voice activity detector (VAD) and/or a switch control means (1510).
- Handsfree system according to one of the preceding claims, further comprising a residual echo suppression (RES) means and/or a dynamic volume control (DVC) (1510).
- Handsfree system according to one of the preceding claims, wherein the microphone array comprises at least two subarrays (701, 702, 703).
- Handsfree system according to one of the preceding claims, further comprising a frame wherein each microphone of the microphone array is arranged in a predetermined, in particular fixed, position in or on the frame.
- Handsfree system according to one of the preceding claims, wherein at least one microphone is a directional microphone (1001), in particular, having a cardioid characteristic and/or being a differential microphone.
- Vehicle comprising a handsfree system according to one of the preceding claims.
- Method for noise reduction in a vehicular handsfree system, comprising
receiving input signals resulting from a microphone array with at least two microphones,
processing the input signals by a beamformer to provide a beamformed signal, and
adaptively filtering a signal resulting from the beamformed signal by an adaptive post-filter,
wherein the adaptive post-filter is a linear-phase Wiener filter. - Method according to claim 21, wherein the adaptive filtering is performed in the time domain.
- Method according to claim 21 or 22, further comprising
providing at least two adaptive filters, particularly Wiener filters, wherein
processing the input signals by a beamformer comprises beamsteering the input signals for providing beamsteered signals corresponding to one of the at least two microphones and summing the signals, and
adaptively filtering comprises receiving and processing at least one beamsteered signal by at least one of the at least two adaptive filters to determine adaptive filter parameters for the adaptive post filter. - Method according to claim 23, wherein adaptively filtering further comprises receiving a signal resulting from the beamformed signal by at least one adaptive filter and wherein processing the beamsteered signal comprises determining adaptive filter parameters using the at least one beamsteered signal and the signal resulting from the beamformed signal.
- Method according to claim 23 or 24, wherein for each beamsteered signal, an adaptive filter is provided for determining adaptive filter parameters using the beamsteered signal and the signal resulting from the beamformed signal.
- Method according to one of the claims 23 - 25, wherein receiving at least one beamsteered signal by at least one of the at least two adaptive filters comprises processing the at least one beamsteered signal by a pre-emphasis filter, in particular, comprising a pre-whitening filter.
- Method according to one of the claims 21 - 26, further comprising processing a signal resulting from the microphone array by an inverse filter, in particular, a warped inverse filter.
- Method according to one of the claims 21 - 27, further comprising non-adaptively filtering a signal resulting from the adaptively filtered signal and/or processing a signal resulting from the adaptively filtered signal by a pre-emphasis filter.
- Method according to one of the claims 21 - 28, further comprising processing a signal resulting from the microphone array, particularly resulting from the beamformed signal, by an adaptive noise canceller (ANC) and/or an acoustic echo canceller (AEC) and/or a residual echo suppression (RES) means.
- Method according to one of the claims 21 - 29, wherein the input signals are processed by a non-adaptive and/or superdirective and/or minimum variance distortionless response (MVDR) beamformer.
- Computer program product comprising one or more computer readable media having computer-executable instructions for performing the steps of the method according to claims 21 - 30.
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP20030022273 EP1538867B1 (en) | 2003-06-30 | 2003-10-01 | Handsfree system for use in a vehicle |
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP03014846.4A EP1524879B1 (en) | 2003-06-30 | 2003-06-30 | Handsfree system for use in a vehicle |
| EP03014846 | 2003-06-30 | ||
| EP20030022273 EP1538867B1 (en) | 2003-06-30 | 2003-10-01 | Handsfree system for use in a vehicle |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| EP1538867A1 EP1538867A1 (en) | 2005-06-08 |
| EP1538867B1 true EP1538867B1 (en) | 2012-07-18 |
Family
ID=34466327
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| EP20030022273 Expired - Lifetime EP1538867B1 (en) | 2003-06-30 | 2003-10-01 | Handsfree system for use in a vehicle |
Country Status (1)
| Country | Link |
|---|---|
| EP (1) | EP1538867B1 (en) |
Cited By (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2017099728A1 (en) * | 2015-12-08 | 2017-06-15 | Nuance Communications, Inc. | System and method for suppression of non-linear acoustic echoes |
| RU2641319C2 (en) * | 2012-12-21 | 2018-01-17 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Filter and method for informed spatial filtration using multiple numerical evaluations of arrival direction |
| EP3230981B1 (en) | 2014-12-12 | 2020-05-06 | Nuance Communications, Inc. | System and method for speech enhancement using a coherent to diffuse sound ratio |
Families Citing this family (17)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| GB0405790D0 (en) * | 2004-03-15 | 2004-04-21 | Mitel Networks Corp | Universal microphone array stand |
| WO2007026827A1 (en) * | 2005-09-02 | 2007-03-08 | Japan Advanced Institute Of Science And Technology | Post filter for microphone array |
| US7970123B2 (en) * | 2005-10-20 | 2011-06-28 | Mitel Networks Corporation | Adaptive coupling equalization in beamforming-based communication systems |
| US7626889B2 (en) | 2007-04-06 | 2009-12-01 | Microsoft Corporation | Sensor array post-filter for tracking spatial distributions of signals and noise |
| US8861756B2 (en) | 2010-09-24 | 2014-10-14 | LI Creative Technologies, Inc. | Microphone array system |
| US8818800B2 (en) | 2011-07-29 | 2014-08-26 | 2236008 Ontario Inc. | Off-axis audio suppressions in an automobile cabin |
| US8903722B2 (en) | 2011-08-29 | 2014-12-02 | Intel Mobile Communications GmbH | Noise reduction for dual-microphone communication devices |
| US9768829B2 (en) | 2012-05-11 | 2017-09-19 | Intel Deutschland Gmbh | Methods for processing audio signals and circuit arrangements therefor |
| US9635457B2 (en) | 2014-03-26 | 2017-04-25 | Sennheiser Electronic Gmbh & Co. Kg | Audio processing unit and method of processing an audio signal |
| DE102015203600B4 (en) | 2014-08-22 | 2021-10-21 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | FIR filter coefficient calculation for beamforming filters |
| US9906859B1 (en) | 2016-09-30 | 2018-02-27 | Bose Corporation | Noise estimation for dynamic sound adjustment |
| DE102016013042A1 (en) | 2016-11-02 | 2018-05-03 | Audi Ag | Microphone system for a motor vehicle with dynamic directional characteristics |
| US20190348056A1 (en) * | 2017-01-04 | 2019-11-14 | Harman Becker Automotive Systems Gmbh | Far field sound capturing |
| DE112018002744T5 (en) * | 2017-05-29 | 2020-02-20 | Harman Becker Automotive Systems Gmbh | sound detection |
| DE102018110759A1 (en) * | 2018-05-04 | 2019-11-07 | Sennheiser Electronic Gmbh & Co. Kg | microphone array |
| US11295718B2 (en) | 2018-11-02 | 2022-04-05 | Bose Corporation | Ambient volume control in open audio device |
| CN114743533B (en) * | 2022-03-09 | 2025-04-18 | 中科上声(苏州)电子有限公司 | Vehicle noise reduction method, device and storage medium for broadband noise |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE4330143A1 (en) * | 1993-09-07 | 1995-03-16 | Philips Patentverwaltung | Arrangement for signal processing of acoustic input signals |
| US5715319A (en) * | 1996-05-30 | 1998-02-03 | Picturetel Corporation | Method and apparatus for steerable and endfire superdirective microphone arrays with reduced analog-to-digital converter and computational requirements |
| FR2808391B1 (en) * | 2000-04-28 | 2002-06-07 | France Telecom | RECEPTION SYSTEM FOR MULTI-SENSOR ANTENNA |
| CA2354858A1 (en) * | 2001-08-08 | 2003-02-08 | Dspfactory Ltd. | Subband directional audio signal processing using an oversampled filterbank |
-
2003
- 2003-10-01 EP EP20030022273 patent/EP1538867B1/en not_active Expired - Lifetime
Cited By (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| RU2641319C2 (en) * | 2012-12-21 | 2018-01-17 | Фраунхофер-Гезелльшафт Цур Фердерунг Дер Ангевандтен Форшунг Е.Ф. | Filter and method for informed spatial filtration using multiple numerical evaluations of arrival direction |
| EP3230981B1 (en) | 2014-12-12 | 2020-05-06 | Nuance Communications, Inc. | System and method for speech enhancement using a coherent to diffuse sound ratio |
| WO2017099728A1 (en) * | 2015-12-08 | 2017-06-15 | Nuance Communications, Inc. | System and method for suppression of non-linear acoustic echoes |
| US10477031B2 (en) | 2015-12-08 | 2019-11-12 | Nuance Communications, Inc. | System and method for suppression of non-linear acoustic echoes |
Also Published As
| Publication number | Publication date |
|---|---|
| EP1538867A1 (en) | 2005-06-08 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP1538867B1 (en) | Handsfree system for use in a vehicle | |
| EP1524879B1 (en) | Handsfree system for use in a vehicle | |
| EP3190587B1 (en) | Noise estimation for use with noise reduction and echo cancellation in personal communication | |
| CN105590631B (en) | Signal processing method and device | |
| US9338547B2 (en) | Method for denoising an acoustic signal for a multi-microphone audio device operating in a noisy environment | |
| EP1994788B1 (en) | Noise-reducing directional microphone array | |
| JP5913340B2 (en) | Multi-beam acoustic system | |
| EP3542547B1 (en) | Adaptive beamforming | |
| Yousefian et al. | A dual-microphone speech enhancement algorithm based on the coherence function | |
| EP2848007B1 (en) | Noise-reducing directional microphone array | |
| US9860634B2 (en) | Headset with end-firing microphone array and automatic calibration of end-firing array | |
| US20210098014A1 (en) | Noise elimination device and noise elimination method | |
| Bitzer et al. | Multi-microphone noise reduction techniques as front-end devices for speech recognition | |
| Grimm et al. | Wind noise reduction for a closely spaced microphone array in a car environment | |
| JP5405130B2 (en) | Sound reproducing apparatus and sound reproducing method | |
| Stenzel et al. | A multichannel Wiener filter with partial equalization for distributed microphones | |
| Buck et al. | A compact microphone array system with spatial post-filtering for automotive applications | |
| Jinzai et al. | Wavelength proportional arrangement of virtual microphones based on interpolation/extrapolation for underdetermined speech enhancement | |
| Yermeche | Subband beamforming for speech enhancement in hands-free communication | |
| Freudenberger et al. | A two-microphone diversity system and its application for hands-free car kits | |
| Freudenberger et al. | Spectral combining for microphone diversity systems | |
| Segawa et al. | Applying virtual microphones to triangular microphone array in in-car communication | |
| Nguyen et al. | A Study Of Dual Microphone Array For Speech Enhancement In Noisy Environment | |
| Krini et al. | Adaptive Beamforming for Microphone Arrays on Seat Belts | |
| Krini et al. | A Practical Beamformer-Postfilter System for Microphone Arrays on Seat Belts |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
| AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR |
|
| AX | Request for extension of the european patent |
Extension state: AL LT LV MK |
|
| RIN1 | Information on inventor provided before grant (corrected) |
Inventor name: CHRISTOPH, MARKUS |
|
| 17P | Request for examination filed |
Effective date: 20051208 |
|
| AKX | Designation fees paid |
Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR |
|
| 17Q | First examination report despatched |
Effective date: 20090722 |
|
| GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
| RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: NUANCE COMMUNICATIONS, INC. |
|
| GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
| GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
| AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR |
|
| REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D |
|
| REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
| REG | Reference to a national code |
Ref country code: AT Ref legal event code: REF Ref document number: 567292 Country of ref document: AT Kind code of ref document: T Effective date: 20120815 Ref country code: IE Ref legal event code: FG4D |
|
| REG | Reference to a national code |
Ref country code: DE Ref legal event code: R096 Ref document number: 60341562 Country of ref document: DE Effective date: 20120913 |
|
| REG | Reference to a national code |
Ref country code: NL Ref legal event code: VDEP Effective date: 20120718 |
|
| REG | Reference to a national code |
Ref country code: AT Ref legal event code: MK05 Ref document number: 567292 Country of ref document: AT Kind code of ref document: T Effective date: 20120718 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: AT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120718 Ref country code: FI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120718 Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120718 Ref country code: BE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120718 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20121019 Ref country code: PT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20121119 Ref country code: SE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120718 Ref country code: SI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120718 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: NL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120718 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: EE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120718 Ref country code: RO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120718 Ref country code: DK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120718 Ref country code: ES Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20121029 Ref country code: CZ Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120718 |
|
| PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
| STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: IT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120718 Ref country code: MC Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20121031 Ref country code: SK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120718 |
|
| REG | Reference to a national code |
Ref country code: CH Ref legal event code: PL |
|
| 26N | No opposition filed |
Effective date: 20130419 |
|
| REG | Reference to a national code |
Ref country code: IE Ref legal event code: MM4A |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CH Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20121031 Ref country code: LI Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20121031 Ref country code: BG Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20121018 Ref country code: IE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20121001 |
|
| REG | Reference to a national code |
Ref country code: DE Ref legal event code: R097 Ref document number: 60341562 Country of ref document: DE Effective date: 20130419 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: TR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20120718 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: LU Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20121001 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: HU Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20031001 |
|
| REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 14 |
|
| REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 15 |
|
| REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 16 |
|
| PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: HR Payment date: 20181127 Year of fee payment: 6 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: FR Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20191031 |
|
| PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: GB Payment date: 20220818 Year of fee payment: 20 |
|
| PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: DE Payment date: 20220816 Year of fee payment: 20 |
|
| REG | Reference to a national code |
Ref country code: DE Ref legal event code: R071 Ref document number: 60341562 Country of ref document: DE |
|
| REG | Reference to a national code |
Ref country code: GB Ref legal event code: PE20 Expiry date: 20230930 |
|
| PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION Effective date: 20230930 |