[go: up one dir, main page]

WO2008148048A2 - Configuration de microphone double (dmc) de codage de la parole évoluée - Google Patents

Configuration de microphone double (dmc) de codage de la parole évoluée Download PDF

Info

Publication number
WO2008148048A2
WO2008148048A2 PCT/US2008/064773 US2008064773W WO2008148048A2 WO 2008148048 A2 WO2008148048 A2 WO 2008148048A2 US 2008064773 W US2008064773 W US 2008064773W WO 2008148048 A2 WO2008148048 A2 WO 2008148048A2
Authority
WO
WIPO (PCT)
Prior art keywords
microphone
distance
vector
approximately
range
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2008/064773
Other languages
English (en)
Other versions
WO2008148048A3 (fr
Inventor
Gregory C. Burnett
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AliphCom LLC
Original Assignee
AliphCom LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by AliphCom LLC filed Critical AliphCom LLC
Publication of WO2008148048A2 publication Critical patent/WO2008148048A2/fr
Publication of WO2008148048A3 publication Critical patent/WO2008148048A3/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/005Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02165Two microphones, one receiving mainly the noise signal and the other one mainly the speech signal
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2410/00Microphones
    • H04R2410/05Noise reduction with a separate noise microphone

Definitions

  • the disclosure herein relates generally to communication systems.
  • this disclosure relates to microphone configurations for use in communication systems.
  • the noisy environment makes voice communication between human speakers difficult.
  • the communication between speakers is especially difficult when the speakers are communicating via microphones coupled to electronic devices (e.g , communication radios, cellular telephones, etc.).
  • Figure IA is a top view of a dual microphone configuration (DMC), under an embodiment.
  • DMC dual microphone configuration
  • Figure IB is a side view of the DMC, under an embodiment.
  • Figure 1C is a front view of the DMC, under an embodiment.
  • Figure ID shows dimensions of a microphone of the DMC, under an embodiment.
  • FIG. 2 is a block diagram showing microphone designation in the DMC, under an embodiment.
  • Figure 3 is a system including the DMC coupled or connected to components of an adaptive filter system, under an embodiment.
  • Figure 4 is a system including the DMC coupled or connected to components of an adaptive filter system that includes a VAD, under an embodiment.
  • the microphone array for use in ultra-high acoustical noise environments
  • the microphone array referred to herein as a dual microphone configuration (DMC)
  • DMC dual microphone configuration
  • the microphone array can be used along with an adaptive noise removal program such as the Pathfinder system to remove a significant portion of the noise from a desired speech signal.
  • the Pathfinder system which is available from Aliph, San Francisco, California, is described in detail in the Related Applications.
  • devoicing means the loss of desired speech energy in dB.
  • voice activity detection means the detection of voiced and unvoiced speech.
  • G2 means gradient microphone 2.
  • Mel means the microphone that captures the most speech.
  • Mic2 means the microphone that captures the least speech.
  • Figure IA is a top view of a dual microphone configuration (DMC) 100, under an embodiment
  • Figure IB is a side view of the DMC 100, under an embodiment
  • Figure 1C is a front view of the DMC 100, under an embodiment.
  • Figure ID shows dimensions of a microphone of the DMC 100, under an embodiment.
  • the DMC provides a configuration in which both microphones respond to noise with the same sensitivity, but the microphone closest to the speaker's mouth has a higher sensitivity to speech. All dimensions shown on Figures IA- ID are in millimeters (mm) unless otherwise stated herein.
  • the DMC of an embodiment includes a housing 101 (also referred to as a boom 101) having two receptacles, The receptacles receive and hold two microphones Gl and G2.
  • the boom is generally connected or coupled to a device that can be worn by a speaker, for example, a headset or earpiece (not shown) that positions or holds the microphones in the vicinity of the speaker's mouth.
  • the microphones of an embodiment are Gentex 3207-5 microphones, but the embodiment is not limited to these microphones.
  • the array of an embodiment places a first directional microphone Gl (e.g., Micl, the "speech" microphone) in the position normally occupied by a close-talk (gradient) microphone.
  • Gl e.g., Micl, the "speech" microphone
  • the position of Micl is generally directly or nearly directly in front of the speaker's lips and only a few millimeters from the lips; as an example, the close-talk microphone Micl is a distance in a range of approximately 0 to 10 mm from the speaker's lips.
  • the microphone Micl has a first vector normal to a front of the microphone, and the first vector is approximately parallel with an axis defined by, in this embodiment, the boom, and the axis is oriented in a direction toward a mouth of the speaker.
  • a second directional microphone G2 (e.g., Mic2, the "noise” microphone) is placed a distance behind Micl .
  • the distance of an embodiment is in a range of a few centimeters behind Micl .
  • the distance between the microphones is in a range of approximately 1 millimeter (mm) to 30 mm.
  • the distance between the microphones is in a range of approximately 30 mm to 50 mm. Due to the proximity effect, the speech will be significantly stronger in Micl than in Mic2, but the noise response should be about the same, greatly facilitating the noise removal process.
  • An open area 104 separates the first and second microphones, but the embodiment is not so limited.
  • the open area 104 of an embodiment comprises air.
  • a vent 106 (not shown on figure) is optionally placed in proximity to the first microphone
  • the second microphone Mic2 is connected to the boom, and has a second vector normal to the front of the second microphone Mic2.
  • the second vector forms an angle relative to the first vector of the first microphone Micl.
  • the angle between the first vector and the second vector is approximately zero (0) degrees.
  • the angle between the first vector and the second vector is in a range of approximately zero (0) degrees to 45 degrees.
  • the first vector is separated from the second vector by a vector distance in a range of approximately zero (0) to 15 mm.
  • the microphones G1/G2 of an embodiment are parallel to each other and separated by a distance D of approximately 20.8 mm apart
  • the microphones Gl /G2 are not required to be parallel to each other and are not required to separated by this exact spacing. Any distance D may separate the two microphones with the understanding that the smaller the distance D, the better the denoising, but the more devoicing A larger separation D between the microphones can lead to poorer denoising, but less devoicing.
  • the spacing D of 20.8 mm was determined to provide good denoising performance and acceptable devoicing.
  • Optimum performance was observed when the noise microphone Mic2 is parallel to Micl and on the same axis as Micl, but the embodiment is not so limited
  • the DMC 100 is symmetric and is used in the same configuration or manner as a single close-talk microphone If one of the gradient microphones is designated as Gl and the other as G2, then either microphone can be placed closest to the mouth and designated as Micl .
  • the other gradient microphone then assumes the role of Mic2 Either microphone may fulfill each role, as the proximity effect is used to determine which microphone is Micl and which microphone is Mic2 (e.g., Micl is the microphone in which speech is much louder than in Mic2).
  • Figure 2 is a block diagram showing microphone designation 200 in the DMC, under an embodiment.
  • the noise response of the microphones being approximately equal facilitates Micl identification. Both configurations are used to suppress the noise, and the microphone that has the highest residual energy is used to output the speech. This functions because the speech will not be removed nearly as well as the noise, so the correct configuration will be the one with the highest energy residual. Thus, if in a given time period the total energy in a first microphone exceeds, by a given threshold, the energy in the second microphone, then the first microphone is assumed to be nearest the speaker's mouth.
  • the microphone designation 200 receives a first signal A from the DMC having microphone Gl designated as Micl and microphone G2 designated as Mic2.
  • the microphone designation 200 also receives a second signal B from the DMC having microphone G2 designated as Micl and microphone Gl designated as Mic2
  • the energy of signal A is compared with the energy of signal B
  • microphone Gl is designated as Micl
  • microphone G2 is designated as Mic2 for all further operations of the DMC.
  • the energy of signal B is higher than the energy of signal A
  • microphone G2 is designated as Micl and microphone Gl is designated as Mic2 for all further operations of the DMC.
  • the DMC 100 of an embodiment is coupled or connected to one or more remote devices In this system configuration, the DMC 100 outputs signals to the remote devices.
  • the remote devices include, but are not limited to, at least one of cellular telephones, satellite telephones, portable telephones, wireline telephones, Internet telephones, wireless transceivers, wireless communication radios, personal digital assistants (PDAs), personal computers (PCs), headset devices, head-worn devices, and earpieces.
  • the DMC 100 of an embodiment can be a component or subsystem integrated with a host device.
  • the DMC outputs signals to components or subsystems of the host device.
  • the host device includes, but is not limited to, at least one of cellular telephones, satellite telephones, portable telephones, wireline telephones, Internet telephones, wireless transceivers, wireless communication radios, personal digital assistants (PDAs), personal computers (PCs), headset devices, head-worn devices, and earpieces.
  • the DMC can be coupled or connected to be a component of a system that includes an adaptive filter system
  • Figure 3 is a system 300 including the DMC 100 coupled or connected to components of an adaptive filter system 330, under an embodiment.
  • a single noise source 320 and a direct path to the microphones Micl and Mic2 are assumed.
  • An operational description of the noise removal of an embodiment is provided using a single speech source 310 and a single noise source 320, but is not so limited.
  • the system 300 uses two microphones which, in an embodiment, represent the DMC (e.g , microphones Gl and G2, or Micl and Mic2) described herein with reference to Figures 1 and 2.
  • Micl is designated as a "speech” microphone and Mic2 is designated as a "noise” microphone, but the DMC is not so limited
  • the speech microphone Micl is assumed to capture mostly speech with some noise, while Mic2 captures mostly noise with some speech.
  • the data from the speech source 310 to Micl is denoted by s(n), where s(n) is a discrete sample of the analog signal from the source 310.
  • the data from the speech source 310 to Mic2 is denoted by s 2 (n).
  • the data from the noise source 320 to Mic2 is denoted by n(n)
  • the data from the noise source 320 to Micl is denoted by n 2 (n).
  • the data from Micl to noise removal element 330 is denoted by Hi 1 (n)
  • the data from Mic2 to noise removal element 330 is denoted by m 2 (n)
  • the transfer function from the speech source 310 to Mic2 is denoted by H 2 (Z)
  • the transfer function from the noise source 320 to Micl is denoted by H 1 (Z)
  • the DMC 100 can be used with the Pathfinder system as the adaptive filter system 330 of system 300.
  • any adaptive filter or noise removal algorithm can be used with the DMC in one or more various alternative embodiments or configurations.
  • the Pathfinder system generally provides adaptive noise cancellation by combining the two microphone signals (e.g., Micl, Mic2) by filtering and summing in the time domain
  • the adaptive filter uses the signal received from the far microphone (e.g., Mic2) to remove noise from the speech received from the near microphone (e.g., Micl), which relies on a slowly varying linear transfer function between the two microphones for sources of noise
  • output of Micl is a first channel
  • output of Mic2 is a second channel
  • Tests using system 300 in a configuration including the DMC 100 described herein along with the Pathfinder noise suppression system have yielded signal-to- noise (SNR) improvements from 20 to 30 dB in extremely high noise environments (105+ dBA) in a range of frequencies of approximately 100 Hz to 3900 Hz
  • SNR signal-to- noise
  • the system 300 of an embodiment including the adaptive filter system 330 and the DMC 100 can be coupled or connected to one or more remote devices.
  • the system 300 outputs signals to the remote devices.
  • the remote devices include, but are not limited to, at least one of cellular telephones, satellite telephones, portable telephones, wireline telephones, Internet telephones, wireless transceivers, wireless communication radios, personal digital assistants (PDAs), personal computers (PCs), headset devices, head-worn devices, and earpieces.
  • the adaptive filter system 330 can be a component of the DMC 100 or the remote device.
  • the system 300 of an embodiment including the adaptive filter system 330 and the DMC 100 can be a component or subsystem integrated with a host device.
  • the system 300 outputs signals to components or subsystems of the host device.
  • the host device includes, but is not limited to, at least one of cellular telephones, satellite telephones, portable telephones, wireline telephones, Internet telephones, wireless transceivers, wireless communication radios, personal digital assistants (PDAs), personal computers (PCs), headset devices, head- worn devices, and earpieces
  • the DMC can also be coupled or connected as a component of a system that includes an adaptive filter system and a VAD.
  • Figure 4 is a system 400 including the DMC 100 coupled or connected to components of an adaptive filter system 330 that includes a VAD 440, under an embodiment.
  • a single noise source 320 and a direct path to the microphones Micl and Mic2 are assumed
  • An operational description of the noise removal of an embodiment is provided using a single speech source 310 and a single noise source 320, but is not so limited.
  • the system 300 uses two microphones which, in an embodiment, represent the DMC 100 described herein with reference to Figures 1 and 2. In this example, Micl is designated as a "speech" microphone and Mic2 is designated as a "noise” microphone.
  • the speech microphone Micl is assumed to capture mostly speech with some noise, while Mic2 captures mostly noise with some speech.
  • the data from the speech source 310 to Micl is denoted by s(n), where s(n) is a discrete sample of the analog signal from the source 310
  • the data from the speech source 310 to Mic2 is denoted by s 2 (n)
  • the data from the noise source 320 to Mi c2 is denoted by n(n)
  • the data from the noise source 320 to Micl is denoted by n 2 (n).
  • the data from Micl to noise removal element 330 is denoted by mi(n)
  • the data from Mic2 to noise removal element 330 is denoted by m 2 (n).
  • the noise removal element 330 optionally receives a signal from a voice activity detection (VAD) element 440.
  • VAD voice activity detection
  • the VAD 340 uses physiological and/or acoustic information to determine when a speaker is speaking.
  • the VAD can include at least one of an accelerometer, at least one conventional acoustic microphone, a skin surface microphone in physical contact with skin of a user, a human tissue vibration detector, a radio frequency (RF) vibration and/or motion detector/device, an electroglottograph, an ultrasound device, an acoustic microphone that is being used to detect acoustic frequency signals that correspond to the user's speech directly from the skin of the user (anywhere on the body), an airflow detector, and a laser vibration detector to name a few.
  • RF radio frequency
  • the strong proximity effect of the DMC 100 of an embodiment allows a simple acoustic-only VAD to be used using Micl of the DMC 100 to generate the VAD 440 data in system 300
  • the VAD data in system 300 may also be generated using information from both Micl and Mic2 of DMC 100.
  • the output of the noise removal system 330 may be used to generate VAD information.
  • a non-acoustic speech vibration detector such as the Aliph Radio Vibrometer (ARV) (available from Aliph, San Francisco, California) is recommended as a substitute or supplement to the acoustic VAD 440.
  • This microphone configuration will however work with any VAD signal, or the VAD may be set to zero with only minor disruption of the denoised speech. This is because there is much more speech in Micl than in Mic2, a key to good performance
  • the DMC 100 can be used with the Pathfinder system as the adaptive filter system of system 400.
  • any adaptive filter or noise removal algorithm and any VAD can be used with the DMC 100 in one or more various alternative embodiments or configurations
  • the system 400 including the adaptive filter system 330, the VAD 440, and the DMC 100 of an embodiment can be coupled or connected to one or more remote devices In this system configuration, the system 400 outputs signals to the remote devices.
  • the remote devices include, but are not limited to, at least one of cellular telephones, satellite telephones, portable telephones, wireline telephones, Internet telephones, wireless transceivers, wireless communication radios, personal digital assistants (PDAs), personal computers (PCs), headset devices, head-worn devices, and earpieces
  • PDAs personal digital assistants
  • PCs personal computers
  • headset devices head-worn devices, and earpieces
  • the adaptive filter system 330 can be a component of the DMC 100 or the remote device
  • the VAD 440 can be a component of the adaptive filter system 330, the DMC 100 or the remote device
  • the system 400 of an embodiment including the adaptive filter system 330, the VAD 440, and the DMC 100 can be a component or subsystem integrated with a host device.
  • the system 400 outputs signals to components or subsystems of the host device.
  • the host device includes, but is not limited to, at least one of cellular telephones, satellite telephones, portable telephones, wireline telephones, Internet telephones, wireless transceivers, wireless communication radios, personal digital assistants (PDAs), personal computers (PCs), headset devices, head-worn devices, and earpieces.
  • the DMC can be a component of a single system, multiple systems, and/or geographically separate systems.
  • the DMC can also be a subcomponent or subsystem of a single system, multiple systems, and/or geographically separate systems
  • the DMC can be coupled to one or more other components (not shown) of a host system or a system coupled to the host system.
  • One or more components of the DMC and/or a corresponding system or application to which the DMC is coupled or connected includes and/or runs under and/or in association with a processing system.
  • the processing system includes any collection of processor-based devices or computing devices operating together, or components of processing systems or devices, as is known in the art.
  • the processing system can include one or more of a portable computer, portable communication device operating in a communication network, and/or a network server.
  • the portable computer can be any of a number and/or combination of devices selected from among personal computers, cellular telephones, personal digital assistants, portable computing devices, and portable communication devices, but is not so limited
  • the processing system can include components within a larger computer system.
  • the processing system of an embodiment includes at least one processor and at least one memory device or subsystem.
  • the processing system can also include or be coupled to at least one database.
  • the term "processor” as generally used herein refers to any logic processing unit, such as one or more central processing units (CPUs), digital signal processors (DSPs), application-specific integrated circuits (ASIC), etc.
  • the processor and memory can be monolithically integrated onto a single chip, distributed among a number of chips or components, and/or provided by some combination of algorithms
  • the methods described herein can be implemented in one or more of software algorithm(s), programs, firmware, hardware, components, circuitry, in any combination.
  • Communication paths couple the components and include any medium for communicating or transferring files among the components.
  • the communication paths include wireless connections, wired connections, and hybrid wireless/wired connections.
  • the communication paths also include couplings or connections to networks including local area networks (LANs), metropolitan area networks (MANs), wide area networks (WANs), proprietary networks, interoffice or backend networks, and the Internet
  • LANs local area networks
  • MANs metropolitan area networks
  • WANs wide area networks
  • proprietary networks interoffice or backend networks
  • the Internet and the Internet
  • the communication paths include removable fixed mediums like floppy disks, hard disk drives, and CD-ROM disks, as well as flash RAM, Universal Serial Bus (USB) connections, RS-232 connections, telephone lines, buses, and electronic mail messages.
  • USB Universal Serial Bus
  • Embodiments of the DMC and corresponding systems and methods described herein include a device comprising: a boom having two receptacles that define an axis; a first microphone connected to the boom, the first microphone having a first vector normal to a front of the first microphone, the first vector approximately parallel with the axis; and a second microphone connected to the boom and positioned a first distance from the first microphone, the second microphone having a second vector normal to a front of the second microphone, wherein the second vector forms an angle relative to the first vector.
  • the angle of an embodiment is approximately zero (0) degrees
  • the angle of an embodiment is in a range of approximately zero (0) degrees to 45 degrees
  • the first vector of an embodiment is separated from the second vector by a vector distance in a range of approximately zero (0) to 15 mm
  • the first distance of an embodiment is in a range of approximately 1 millimeter (mm) to 30 mm.
  • the first distance of an embodiment is in a range of approximately 30 mm to 50 mm.
  • the second distance of an embodiment is in a range of approximately 0 to 10 mm.
  • the first distance of an embodiment is in a range of approximately 1 mm to 30 mm, wherein the first microphone of an embodiment is positioned a second distance from a mouth of a speaker wearing the boom, the second distance of an embodiment in a range of approximately 0 to 10 mm
  • the axis of an embodiment is oriented in a direction toward a mouth of a user.
  • a space between the first microphone and the second microphone of an embodiment is air.
  • Embodiments of the DMC and corresponding systems and methods described herein include a device comprising: a headset including at least one loudspeaker, wherein the headset attaches to a region of a human head; and a microphone array connected to the headset, the microphone array including a first microphone and a second microphone, the first microphone having a first vector normal to a front of the first microphone, the first vector defining an axis, and the second microphone positioned a first distance from the first microphone, the second microphone having a second vector normal to a front of the second microphone, wherein the second vector forms an angle relative to the first vector.
  • the angle of an embodiment is approximately zero (0) degrees
  • the angle of an embodiment is in a range of approximately zero (0) degrees to 45 degrees.
  • the first vector of an embodiment is separated from the second vector by a vector distance in a range of approximately zero (0) to 15 mm
  • the first distance of an embodiment is in a range of approximately 1 millimeter (mm) to 30 mm.
  • the first distance of an embodiment is in a range of approximately 30 mm to 50 mm
  • the first microphone of an embodiment is positioned a second distance from a mouth of a human wearing the headset.
  • the second distance of an embodiment is in a range of approximately 0 to 10 mm
  • the first distance of an embodiment is in a range of approximately 1 mm to 30 mm, wherein the first microphone of an embodiment is positioned a second distance from a mouth of a human wearing the headset, the second distance of an embodiment in a range of approximately 0 to 10 mm
  • the device of an embodiment comprises a voice activity detector (VAD) connected to the headset, the VAD generating voice activity signals.
  • VAD voice activity detector
  • the first microphone of an embodiment generates the voice activity signals.
  • the device of an embodiment comprises an adaptive noise removal application coupled to the headset.
  • the adaptive noise removal application of an embodiment receives acoustic signals from the microphone array and generating an output signal, wherein the output signal is a denoised acoustic signal
  • the device of an embodiment comprises a communication channel coupled to the headset.
  • the communication channel of an embodiment comprises at least one of a wireless channel, a wired channel, and a hybrid wireless/wired channel.
  • the device of an embodiment comprises a communication device coupled to the headset via the channel.
  • the communication device of an embodiment comprises one or more of cellular telephones, satellite telephones, portable telephones, wireline telephones, Internet telephones, wireless transceivers, wireless communication radios, personal digital assistants (PDAs), and personal computers (PCs).
  • PDAs personal digital assistants
  • PCs personal computers
  • Embodiments of the DMC and corresponding systems and methods described herein include a device comprising a first microphone having a first vector normal to a front of the first microphone, the first vector approximately parallel with an axis oriented in a direction toward a mouth of a speaker, wherein the first microphone is positioned a first distance from the mouth; and a second microphone positioned a second distance from the first microphone, the second microphone having a second vector normal to a front of the second microphone, wherein the second vector forms an angle relative to the first vector, wherein the angle is in a range of approximately zero (0) degrees to 45 degrees.
  • the second distance of an embodiment is in a range of approximately 1 millimeter (mm) to 30 mm.
  • the second distance of an embodiment is in a range of approximately 30 mm to 50 mm.
  • the first distance of an embodiment is in a range of approximately 0 to 10 mm
  • the device of an embodiment comprises an adaptive noise removal application coupled to the first microphone and the second microphone.
  • the adaptive noise removal application of an embodiment receives acoustic signals from the first microphone and the second microphone and generates an output signal, wherein the output signal is a denoised acoustic signal.
  • the device of an embodiment comprises a voice activity detector (VAD) coupled to the adaptive noise removal application, the VAD generating voice activity signals.
  • VAD voice activity detector
  • Embodiments of the DMC and corresponding systems and methods described herein include a system comprising: a first microphone having a first vector normal to a front of the first microphone, the first vector approximately parallel with an axis oiiented in a direction toward a mouth of a speaker, wherein the first microphone is positioned a first distance from the mouth, a second microphone positioned a second distance from the first microphone, the second microphone having a second vector normal to a front of the second microphone, wherein the second vector forms an angle relative to the first vector, wherein the angle is in a range of approximately zero (0) degrees to 45 degrees; and an adaptive noise removal application receiving acoustic signals from the first microphone and the second microphone and generating an output signal, wherein the output signal is a denoised acoustic signal.
  • aspects of the DMC and corresponding systems and methods described herein may be implemented as functionality programmed into any of a variety of circuitry, including programmable logic devices (PLDs), such as field programmable gate arrays (FPGAs), programmable array logic (PAL) devices, electrically programmable logic and memory devices and standard cell-based devices, as well as application specific integrated circuits (ASICs)
  • PLDs programmable logic devices
  • FPGAs field programmable gate arrays
  • PAL programmable array logic
  • ASICs application specific integrated circuits
  • microcontrollers with memory such as electronically erasable programmable read only memory (EEPROM)
  • embedded microprocessors firmware, software, etc
  • aspects of the DMC and corresponding systems and methods may be embodied in microprocessors having software-based circuit emulation, discrete logic (sequential and combinatorial), custom devices, fuzzy (neural) logic, quantum devices, and hybrids of any of the above device types.
  • MOSFET metal-oxide semiconductor field-effect transistor
  • CMOS complementary metal-oxide semiconductor
  • ECL emitter-coupled logic
  • polymer technologies e.g., silicon-conjugated polymer and metal-conjugated polymer -metal structures
  • mixed analog and digital etc.
  • any system, method, and/or other components disclosed herein may be described using computer aided design tools and expressed (or represented), as data and/or instructions embodied in various computer-readable media, in terms of their behavioral, register transfer, logic component, transistor, layout geometries, and/or other characteristics.
  • Computer-readable media in which such formatted data and/or instructions may be embodied include, but are not limited to, non-volatile storage media in various forms (e g , optical, magnetic or semiconductor storage media) and carrier waves that may be used to transfer such formatted data and/or instructions through wireless, optical, or wired signaling media or any combination thereof.
  • Examples of transfers of such formatted data and/or instructions by carrier waves include, but are not limited to, transfers (uploads, downloads, e-mail, etc ) over the Internet and/or other computer networks via one or more data transfer protocols (e g , HTTP, FTP, SMTP, etc )
  • data transfer protocols e g , HTTP, FTP, SMTP, etc
  • a processing entity e.g., one or more processors

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • General Health & Medical Sciences (AREA)
  • Quality & Reliability (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Soundproofing, Sound Blocking, And Sound Damping (AREA)
  • Obtaining Desirable Characteristics In Audible-Bandwidth Transducers (AREA)

Abstract

L'invention concerne un réseau de microphone à utiliser dans des environnements de bruits acoustiques très élevés. Le réseau de microphone comprend deux microphones de diaphonie directionnels. Les deux microphones sont séparés d'une courte distance si bien qu'un microphone capte plus de paroles que l'autre. Le réseau de microphone peut être utilisé conjointement avec un programme d'élimination du bruit adaptatif pour éliminer une partie significative du bruit provenant d'un signal de parole d'intérêt.
PCT/US2008/064773 2007-05-23 2008-05-23 Configuration de microphone double (dmc) de codage de la parole évoluée Ceased WO2008148048A2 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US93163707P 2007-05-23 2007-05-23
US60/931,637 2007-05-23

Publications (2)

Publication Number Publication Date
WO2008148048A2 true WO2008148048A2 (fr) 2008-12-04
WO2008148048A3 WO2008148048A3 (fr) 2009-07-30

Family

ID=40075751

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2008/064773 Ceased WO2008148048A2 (fr) 2007-05-23 2008-05-23 Configuration de microphone double (dmc) de codage de la parole évoluée

Country Status (2)

Country Link
US (3) US8625816B2 (fr)
WO (1) WO2008148048A2 (fr)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8625816B2 (en) * 2007-05-23 2014-01-07 Aliphcom Advanced speech encoding dual microphone configuration (DMC)
WO2012069020A1 (fr) * 2010-11-25 2012-05-31 歌尔声学股份有限公司 Procédé et dispositif d'amélioration de la qualité de la parole, et casque de communication avec réduction du bruit
TWI412023B (zh) * 2010-12-14 2013-10-11 Univ Nat Chiao Tung 可消除噪音且增進語音品質之麥克風陣列架構及其方法
JP5813562B2 (ja) * 2012-04-16 2015-11-17 株式会社豊田中央研究所 操作支援装置、操作システム、操作支援方法、及びプログラム
US9516442B1 (en) * 2012-09-28 2016-12-06 Apple Inc. Detecting the positions of earbuds and use of these positions for selecting the optimum microphones in a headset
US9648419B2 (en) 2014-11-12 2017-05-09 Motorola Solutions, Inc. Apparatus and method for coordinating use of different microphones in a communication device
WO2017016587A1 (fr) 2015-07-27 2017-02-02 Sonova Ag Ensemble microphone clipsable
KR102814684B1 (ko) * 2016-08-26 2025-05-29 삼성전자주식회사 외부 기기를 제어하는 휴대 기기 및 이의 오디오 신호 처리 방법

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4412097A (en) * 1980-01-28 1983-10-25 Victor Company Of Japan, Ltd. Variable-directivity microphone device
US5732143A (en) * 1992-10-29 1998-03-24 Andrea Electronics Corp. Noise cancellation apparatus
US7146013B1 (en) * 1999-04-28 2006-12-05 Alpine Electronics, Inc. Microphone system
JP2000312395A (ja) * 1999-04-28 2000-11-07 Alpine Electronics Inc マイクロホンシステム
US20010028718A1 (en) * 2000-02-17 2001-10-11 Audia Technology, Inc. Null adaptation in multi-microphone directional system
US7561700B1 (en) * 2000-05-11 2009-07-14 Plantronics, Inc. Auto-adjust noise canceling microphone with position sensor
TW200305854A (en) * 2002-03-27 2003-11-01 Aliphcom Inc Microphone and voice activity detection (VAD) configurations for use with communication system
US7983720B2 (en) * 2004-12-22 2011-07-19 Broadcom Corporation Wireless telephone with adaptive microphone array
US8625816B2 (en) * 2007-05-23 2014-01-07 Aliphcom Advanced speech encoding dual microphone configuration (DMC)

Also Published As

Publication number Publication date
US20140192998A1 (en) 2014-07-10
US11638092B2 (en) 2023-04-25
US8625816B2 (en) 2014-01-07
WO2008148048A3 (fr) 2009-07-30
US20220394381A1 (en) 2022-12-08
US20090003622A1 (en) 2009-01-01

Similar Documents

Publication Publication Date Title
US20220394381A1 (en) Advanced speech encoding dual microphone configuration (dmc)
US20240129660A1 (en) Forming virtual microphone arrays using dual omnidirectional microphone array (doma)
US9099094B2 (en) Microphone array with rear venting
US8467543B2 (en) Microphone and voice activity detection (VAD) configurations for use with communication systems
US8254617B2 (en) Microphone array with rear venting
US8280072B2 (en) Microphone array with rear venting
US8477961B2 (en) Microphone array with rear venting
CA2798512A1 (fr) Capteur de vibration et systeme de detection d'activite vocale (vads) acoustique a utiliser avec des systemes electroniques
EP2567377A1 (fr) Composant de suppression/remplacement du vent à utiliser avec des systèmes électroniques
US8682018B2 (en) Microphone array with rear venting
US20140372113A1 (en) Microphone and voice activity detection (vad) configurations for use with communication systems
US20220417652A1 (en) Microphone array with rear venting
HK1145751A (en) Dual omnidirectional microphone array

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08756246

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 08756246

Country of ref document: EP

Kind code of ref document: A2