[go: up one dir, main page]

WO2017003958A1 - Sélection automatique d'un microphone dans une caméra de sport - Google Patents

Sélection automatique d'un microphone dans une caméra de sport Download PDF

Info

Publication number
WO2017003958A1
WO2017003958A1 PCT/US2016/039679 US2016039679W WO2017003958A1 WO 2017003958 A1 WO2017003958 A1 WO 2017003958A1 US 2016039679 W US2016039679 W US 2016039679W WO 2017003958 A1 WO2017003958 A1 WO 2017003958A1
Authority
WO
WIPO (PCT)
Prior art keywords
microphone
audio signal
correlation
metric
responsive
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/US2016/039679
Other languages
English (en)
Inventor
Zhinian Jing
Erich Tisch
Ke Li
Paul Beckmann
Joyce ROSENBAUM
Magnus Hansson
Evan L. COONS
Alexander Wroblewski
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
GoPro Inc
Original Assignee
GoPro Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US15/083,262 external-priority patent/US9706088B2/en
Application filed by GoPro Inc filed Critical GoPro Inc
Publication of WO2017003958A1 publication Critical patent/WO2017003958A1/fr
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R5/00Stereophonic arrangements
    • H04R5/027Spatial or constructional arrangements of microphones, e.g. in dummy heads
    • GPHYSICS
    • G03PHOTOGRAPHY; CINEMATOGRAPHY; ANALOGOUS TECHNIQUES USING WAVES OTHER THAN OPTICAL WAVES; ELECTROGRAPHY; HOLOGRAPHY
    • G03BAPPARATUS OR ARRANGEMENTS FOR TAKING PHOTOGRAPHS OR FOR PROJECTING OR VIEWING THEM; APPARATUS OR ARRANGEMENTS EMPLOYING ANALOGOUS TECHNIQUES USING WAVES OTHER THAN OPTICAL WAVES; ACCESSORIES THEREFOR
    • G03B17/00Details of cameras or camera bodies; Accessories therefor
    • G03B17/02Bodies
    • G03B17/08Waterproof bodies or housings
    • GPHYSICS
    • G03PHOTOGRAPHY; CINEMATOGRAPHY; ANALOGOUS TECHNIQUES USING WAVES OTHER THAN OPTICAL WAVES; ELECTROGRAPHY; HOLOGRAPHY
    • G03BAPPARATUS OR ARRANGEMENTS FOR TAKING PHOTOGRAPHS OR FOR PROJECTING OR VIEWING THEM; APPARATUS OR ARRANGEMENTS EMPLOYING ANALOGOUS TECHNIQUES USING WAVES OTHER THAN OPTICAL WAVES; ACCESSORIES THEREFOR
    • G03B31/00Associated working of cameras or projectors with sound-recording or sound-reproducing means
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/16Constructional details or arrangements
    • G06F1/1613Constructional details or arrangements for portable computers
    • G06F1/163Wearable computers, e.g. on a belt
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/16Constructional details or arrangements
    • G06F1/1613Constructional details or arrangements for portable computers
    • G06F1/1633Constructional details or arrangements of portable computers not specific to the type of enclosures covered by groups G06F1/1615 - G06F1/1626
    • G06F1/1656Details related to functional adaptations of the enclosure, e.g. to provide protection against EMI, shock, water, or to host detachable peripherals like a mouse or removable expansions units like PCMCIA cards, or to provide access to internal components for maintenance or to removable storage supports like CDs or DVDs, or to mechanically mount accessories
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R29/00Monitoring arrangements; Testing arrangements
    • H04R29/004Monitoring arrangements; Testing arrangements for microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2499/00Aspects covered by H04R or H04S not otherwise provided for in their subgroups
    • H04R2499/10General applications
    • H04R2499/11Transducers incorporated or for use in hand-held devices, e.g. mobile phones, PDA's, camera's

Definitions

  • This disclosure relates to audio capture, and more specifically, to the selecting between multiple available microphones in an audio capture system.
  • FIG. 1 is a block diagram illustrating an example embodiment of an audio capture system.
  • FIG. 2 is a flowchart illustrating a first embodiment of a process for selecting between audio signals from different microphones in an audio capture system with multiple microphones.
  • FIG. 3 is a flowchart illustrating a second embodiment of a process for selecting between audio signals from different microphones in an audio capture system with multiple microphones.
  • FIG. 4 is a flowchart illustrating an embodiment of a process for detecting a wet microphone condition.
  • FIG. 5 is a flowchart illustrating an embodiment of a process for selecting a subset of microphones out of a group of microphones.
  • FIG. 6A is first perspective view of an example camera system.
  • FIG. 6B is second perspective view of an example camera system.
  • FIG. 7 illustrates an example of a drainage enhancement feature for an enhanced microphone in a camera system.
  • an output audio signal is generated in an audio capture system having multiple microphones including at least a first microphone and a second microphone.
  • the first microphone includes a drainage enhancement feature structured to drain liquid more quickly than the second microphone lacking the drainage enhancement feature.
  • a first audio signal is received from the first microphone representing ambient audio captured by the first microphone during a time interval.
  • a second audio signal is received from the second microphone representing ambient audio captured by the second microphone during the time interval.
  • a correlation metric is determined between the first audio signal and the second audio signal representing a similarity between the first audio signal and the second audio signal. Responsive to the correlation metric exceeding a predefined threshold, the first audio signal is outputted for the time interval.
  • a first noise metric is determined for the first audio signal and a second noise metric is determined for the second audio signal. Responsive to the sum of the first noise metric and a bias value being less than the second noise metric, the first audio signal is output for the time interval. Responsive to the sum of the first noise metric and the bias value being greater than the second noise metric, the second audio signal is output for the time interval.
  • an output audio signal is generated in an audio capture system having multiple microphones including at least a first microphone and a second microphone.
  • the first microphone includes a drainage enhancement feature structured to drain liquid more quickly than the second microphone lacking the drainage enhancement feature.
  • a first audio signal is received from the first microphone representing ambient audio captured by the first microphone during a time interval.
  • a second audio signal is received from the second microphone representing ambient audio captured by the second microphone during the time interval.
  • a correlation metric is determined between the first audio signal and the second audio signal representing a similarity between the first audio signal and the second audio signal. Responsive to the correlation metric exceeding a first predefined threshold, the first audio signal is output for the time interval.
  • the microphones Responsive to the correlation metric not exceeding the first predefined threshold, it is determined whether the microphones are submerged in liquid. If the microphones are not submerged, it is determined whether the first microphone is wet. If the first microphone is wet, the second microphone signal is output for the time interval. Responsive to determining that first microphone is not wet or that the microphones are submerged, a first noise metric is determined for the first audio signal and a second noise metric is determined for the second audio signal. Responsive to the sum of the first noise metric and a bias value being less than the second noise metric, the first audio signal is output for the time interval. Responsive to the sum of the first noise metric and the bias value being greater than the second noise metric, the second audio signal is output for the time interval.
  • a method determines if a first microphone is wet in an camera system having a first microphone and a second microphone, where the first microphone is positioned in a recess of an inner side of a face of the camera, where the recess is coupled to a channel coupled to a lower drain below the channel to drain water from the recess away from the microphone via the channel, and where the second microphone is positioned away from the channel and the drain.
  • a first average signal level of the first audio signal and a second average signal level of the second audio signal are determined over a predefined time interval.
  • a ratio of the first average signal level to the second average signal level is determined.
  • a camera comprises a lens assembly, a substantially cubic camera housing, a first microphone, a lower drain, an upper drain, a channel, and a second microphone.
  • the lens assembly directs light received through a lens window to an image sensor.
  • the substantially cubic camera housing encloses the lens assembly and comprises a bottom face, left face, right face, back face, top face, and front face.
  • the first microphone is integrated with the front face of the camera and positioned within a recess on an interior facing portion of the front face.
  • the lower drain is below the first microphone and comprises an opening in the substantially cubic camera housing near the front face. The lower drain allows water that collects in the recess housing the first microphone to drain.
  • the upper drain is above the first microphone and comprises an opening in the substantially cubic housing near the front face.
  • the upper drain allows air to enter the recess as the water drains.
  • the channel through the interior facing portion of the front face couples the recess to the lower drain.
  • the second microphone is integrated with a rear portion of the substantially cubic camera housing.
  • an audio capture system comprises a substantially cubic housing including a bottom face, left face, right face, back face, top face, and front face.
  • a first microphone is integrated with the front face of the audio capture system and positioned within a recess on an interior facing portion of the front face.
  • a lower drain below the first microphone comprises an opening in the substantially cubic housing near the front face to allow water that collects in the recess housing the first microphone to drain.
  • An upper drain above the first microphone comprises an opening in the substantially cubic housing near the front face to allow air to enter the recess as the water drains.
  • a channel through the interior facing portion of the front face couples the recess to the lower drain.
  • a second microphone is integrated with a rear portion of the substantially cubic housing.
  • FIG. 1 illustrates an example of an audio capture system 100 including multiple microphones.
  • the audio capture system 100 includes at least one "enhanced” microphone 110, at least one "reference” microphone 120, a microphone selection controller 130, and an audio encoder 140.
  • the enhanced microphone 110 includes a drainage enhancement feature to enable water to drain from the microphone more quickly than the reference microphone 120.
  • the drainage enhancement feature may be accomplished utilizing gravity and/or surface tension forces.
  • the drainage enhancement feature may be implemented using an inner surface energy coating or particular hole dimensions, shapes, density, patterns, or interior curvature or a combination of features that affect that drainage profile of the enhanced microphone 1 10.
  • the reference microphone 120 includes a physical barrier between the splashing water and a waterproof membrane over the microphone to mitigate the impulses from splashing water.
  • the barrier comprises a plastic barrier that absorbs some of the water impact impulse.
  • an air buffer may exist between the barrier and the waterproof membrane over the microphone.
  • a porting structure traps a buffer layer of water on the outside of a waterproof membrane over the microphone, thus creating a protective layer that blocks splashing water from directly impacting the waterproof membrane.
  • both the enhanced microphone 110 and reference microphone 120 capture ambient audio 105 and pass the captured audio to the microphone selection controller 130.
  • the audio captured by the enhanced microphone 110 and the reference microphone 120 may have varying audio characteristics due to the different structural features of the microphones 1 10, 120.
  • the enhanced microphone 110 will have more spectral artifacts both in open air and when operating under water due to the drainage enhancement feature.
  • the enhanced microphone 110 may have degraded signal-to-noise in windy conditions due to the drainage enhancement feature.
  • the enhanced microphone 1 10 will generally have better signal-to-noise ratio performance out of water in non-windy conditions relative to the reference microphone 120. Therefore, a different selection between the enhanced microphone 110 and the reference microphone 120 may be desirable under different audio capture conditions.
  • the microphone selection controller 130 processes the audio captured from the enhanced microphone 1 10 and the reference microphone 120 and selects, based on the audio characteristics, which of the audio signals to pass to the audio encoder 140. In one embodiment, the microphone selection controller 130 operates on a block-by -block basis. In this embodiment, for each time interval, the microphone selection controller 130 receives a first block of audio data from the enhanced microphone and a second block of audio data from the reference microphone 120, each corresponding to ambient audio 105 captured by the respective microphones 1 10, 120 during the same time interval. The microphone selection controller 130 processes the pair of blocks to determine which block to pass the audio encoder 140.
  • the microphone selection controller 130 generally operates to select the enhanced microphone 110 directly after transitioning out of water since the enhanced microphone 1 10 tends to drain the water faster and has better out of water audio quality. Furthermore, the microphone selection controller 130 generally operates to select the reference microphone 120 when in the water and when transitioning between air and water because it better mitigates the unnatural impulses caused by splashing water.
  • the audio encoder 140 encodes the blocks of audio received from the microphone selection controller 130 to generate an encoded audio signal 145.
  • the microphone selection control 130 and/or the audio encoder 140 are implemented as a processor and a non-transitory computer -readable storage medium storing instructions that when executed by the processor carry out the functions attributed to the microphone selection controller 130 and/or audio encoder 140 described herein.
  • the microphone selection controller 130 and audio encoder 140 may be implemented using a common processor or separate processors.
  • the microphone selection controller 130 and/or audio encoder 140 may be implemented in hardware, (e.g., with an FPGA or ASIC), firmware, or a combination of hardware, firmware and software.
  • the audio capture system 100 is implemented within a camera system such as the camera 500 described below with respect to FIG. 5. Such a camera may use the encoded audio 145 captured by the audio capture system 100 as an audio channel for video captured by the camera. Thus, the audio capture system 100 may capture audio in a manner that is concurrent and synchronized with corresponding frames of video.
  • FIG. 2 is a flowchart illustrating an embodiment of a process for selecting between an enhanced microphone 1 10 and a reference microphone 120.
  • a correlation metric is determined 202 between signal levels of audio blocks captured by the enhanced microphone 1 10 and reference microphone 120 respectively.
  • the correlation metric represents a similarity between a first audio signal captured from the enhanced microphone 1 10 during a time interval and a second audio signal captured from the reference microphone 120 during the same time interval.
  • the signals will be well-correlated in the absence of wind noise, but will be poorly correlated when wind noise is present.
  • the correlation metric may operate as a wind detector.
  • the correlation metric comprises a value from 0 to 1 where a correlation metric of 1 represents a situation where there is no wind, and a correlation metric of 0 means that the captured audio is entirely wind noise.
  • the correlation metric is determined using a correlation function that includes a regularization term ⁇ to handle low level signals.
  • the correlation function is given by:
  • X max(0, ⁇ (L[n] + ⁇ ) * (R[n] + y) ) (1)
  • (*) represents a scalar multiplication
  • N is the block size
  • L[n] and R[n] are the samples from the enhanced microphone and reference microphone respectively.
  • the max operator constrains the correlation metric Xto be in the range 0 and +1. In one embodiment, the correlation metric is calculated over a predefined spectral range (e.g., 600-1200 Hz).
  • the correlation metric is updated at a frequency based on the audio sample rate and sample block size. For example, if a 32kHz sampling rate is used with a block size of 1024 samples, the correlation metric may be updated approximately every 32 milliseconds. In one embodiment, the correlation metric is smoothed over time.
  • the correlation metric is compared 204 to a predefined threshold.
  • the predefined threshold may changes between two or more predefined thresholds depending on the previous state (e.g., whether the reference microphone or enhanced microphone was selected) to include a hysteresis effect. For example, if for the previously processed block, the correlation metric exceeded the predefined threshold (e.g., a predefined threshold of 0.8) indicating that low wind noise detected, then the predefined threshold is set lower for the current block (e.g. 0.7). If for the previously processed block, the correlation metric did not exceed the predefined threshold (e.g., a predefined threshold of 0.8), indicating that high wind noise was detected, then the predefined threshold for the current block is set higher (e.g., to 0.8).
  • the predefined threshold e.g., a predefined threshold of 0.8
  • the enhanced microphone 1 10 is selected because it typically has better signal-to-noise ratio. If the correlation metric does not exceed 204 the predefined threshold, noise metrics are determined for the audio signals captured by the enhanced microphone 110 and the reference microphone 120. Under some conditions, it may be reasonably presumed that both microphones 1 10, 120 pick up the desired (noiseless) signal at approximately, the same level and if one of the microphones is slightly blocked, then the correlation metric will still be relatively high indicating that there is low wind. Furthermore, it may be reasonably presumed that noise from the effects of wind or water is local to each microphone and that the noise will not destructively cancel out the signal.
  • the microphone that is louder during a low correlation condition is determined to be the microphone that has the noise.
  • the noise metrics simply comprise root-mean-squared amplitude levels of the enhanced and reference microphones over a predefined time period.
  • the predefined time period may include a sliding time window that includes the currently processed block and a fixed number of blocks prior to the current block (e.g., an approximately 4 second window).
  • a recursive-based RMS value is used (e.g., with a time constant of approximately 4 seconds).
  • the noise metric is based on equalized amplitude levels of the microphones.
  • the equalization levels are set so that the microphones have similar amplitude characteristics under normal conditions (e.g., non-windy and non-watery conditions).
  • the noise metric is measured across substantially the entire audible band (e.g., between 20 Hz and 16kHz).
  • the microphone selection controller 130 selects 212 the enhanced microphone. If the sum of the noise metric for the enhanced microphone 110 and a bias value is less than the noise metric for the reference microphone 120, then the microphone selection controller 130 selects 212 the enhanced microphone. On the other hand, if the sum of the noise metric for the enhanced microphone 1 10 and the bias value is not less than (e.g., greater than) the noise metric for the reference microphone 120, then the microphone selection controller 130 selects 212 the reference microphone 120.
  • the bias value may comprise either a positive or negative offset that is dynamically adjusted based on the correlation metric. For example, if the correlation metric is below a lower threshold (e.g., 0.4), then a first bias value is used which may be a positive bias value (e.g., l OdB). If the correlation metric is above an upper threshold (e.g., 0.8), then a second bias value is used which may be a negative bias value (e.g., -6dB). If the correlation metric is between the lower threshold (e.g., 0.4) and the upper threshold (e.g., 0.8), the bias value is a linear function of the correlation metric X. For example, in one embodiment, the bias value is given by:
  • biasi is the first bias value used when the correlation metric Xis below the lower threshold TII L and bias 2 is the second bias value used when the correlation metric Xis above the upper threshold Thy.
  • a hysteresis component is additionally included in the bias value.
  • the bias value is adjusted up or down depending on whether the reference microphone 120 or the enhanced microphone 110 was selected for the previous block, so as to avoid switching between the microphones 1 10, 120 too frequently. For example, in one embodiment, if the enhanced microphone 110 was selected for the previous block, an additional hysteresis bias (e.g., 5 db) is subtracted from the bias value to make it more likely that the enhanced microphone 1 10 will be selected again as shown in the equation below:
  • biasn is the hysteresis bias
  • the additional hysteresis bias (e.g,. 5 dB) is added to the bias value to make it more likely that the reference microphone is selected again as shown in the equation below:
  • the bias value takes into account that not all wind level is created equal. It is possible to have wind that is softer, but generates more perceptive noise, than a louder wind. With high amounts of wind (low correlation metric), the enhanced microphone 1 10 tends to generate more perceptive noise than the reference microphone 120 during high wind condition due to the drainage enhancement feature. Thus, the bias value is used to penalize the enhanced microphone 1 10 for low correlation metrics.
  • FIG. 3 is a flowchart illustrating another embodiment of a process for selecting between an enhanced microphone 1 10 and a reference microphone 120.
  • a correlation metric is determined 302 between signal levels of audio blocks captured by the enhanced microphone 1 10 and reference microphone 120 respectively. If the correlation metric exceeds 304 a predefined threshold, then the enhanced microphone 1 10 is selected because it typically has better signal-to-noise ratio. If the correlation metric does not exceed 304 the threshold, it is determined 306 if the microphones are submerged in liquid (e.g. , water).
  • the predefined threshold may be determined in the same manner described above.
  • a water submersion sensor may be used to determine if the microphones are submerged.
  • an image analysis may be performed to detect features
  • detecting color loss may be indicative of the camera being submerged because it causes exponential loss of light intensity depending on wavelength.
  • crinkle patterns may be present in the image when the camera is submerged because the water surface can form small concave and convex lenses that create patches of light and dark. Additionally, light reflecting off particles in the water creates scatter and diffusion that can be detected to determine if the camera is submerged.
  • water pressure on the microphone's waterproof membrane may be detected because the waterproof membrane will deflect under external water pressure. This causes increased tension which shifts the waterproof membrane' s resonance higher from its nominal value and can be detected in the microphone signal.
  • the deflection of the waterproof membrane will results in a positive pressure on and deflection of the microphone membrane which could manifest itself as a shift in microphone bias.
  • a sensor could be placed near the waterproof membrane to detect an increase in shear force caused by deflection of the waterproof membrane that is indicative of the microphone being submerged.
  • the microphones are not submerged, then it is determined 316 whether the enhanced microphone 110 is wet (e.g., not sufficiently drained after being removed from water).
  • the wet microphone condition can be detected by observing spectral response changes over a predefined frequency range (e.g., 2kHz - 4kHz) or by detecting the sound pattern known to be associated with a wet microphone as compared to a drained microphone.
  • the spectral features associated with a wet (undrained) microphone can be found through empirical means. In general, when a microphone membrane is wet, higher frequency sounds are attenuated because the extra weight of the water on the membrane reduces the vibration of the membrane.
  • the water generally acts as a low pass filter.
  • An example of a process for detecting wet microphones is described in FIG. 4 below.
  • spectral changes can be monitored based on the measured known drain time constant differences between the microphone geometries. If the enhanced microphone 110 is wet (e.g., not sufficiently drained), then the reference microphone 120 is selected 320. Otherwise, if the microphones are submerged or if the enhanced microphone 110 is not wet, then noise metrics are determined 310 for the audio blocks captured by the enhanced microphone 110 and the reference microphone 120. The noise metrics may be determined in the same manner as described above in FIG. 2.
  • the microphone selection controller 130 selects 314 the enhanced microphone. If the sum of the noise metric for the enhanced microphone 110 and the bias value is not less than the noise metric for the reference microphone 120, then the microphone selection controller 130 selects 320 the reference microphone 120.
  • the bias value may be determined based on equations (2) - (4) described above.
  • FIG. 4 is a flowchart illustrating an embodiment of a process for detecting a wet microphone.
  • water on a microphone has a transfer function approximating a low pass filter.
  • the amount of attenuation and the cutoff frequency of the wet microphone transfer function is dependent on how much water is on the microphone. Particularly, the more water on the microphone membrane, the greater the attenuation and the lower the cutoff frequency. This phenomenon is due to the added mass of the water on the microphone membrane dampening the movement of the membrane.
  • root-mean- squared (RMS) signal levels of the audio blocks captured by the enhanced microphone 1 10 and reference microphone 120 are calculated 402 across a predefined frequency range (e.g., 2kHz - 4kHz).
  • a smoothing filter may be applied 404 to smooth the a ratio of the enhanced microphone RMS signal level to the reference microphone RMS signal level over time. If it is determined 406 that the ratio of the enhanced microphone RMS signal level to the reference microphone RMS signal level is above a predefined threshold, then the wet microphone is not detected 412. Otherwise, if it is determined 406 that the ratio of the RMS signal levels is not above the predefined threshold, it is determined 408 if wind is present since the presence of wind can result in similar RMS ratios.
  • the presence of wind can be determined based on, for example, a detection signal from a wind detector that determines the presence of wind based on a correlation metric as described above.
  • wind noise threshold is met (i.e., the correlation metric is less than a predefined threshold)
  • the wet microphone is not detected 412. Otherwise, if the wind noise threshold is not met (i.e., the correlation metric is greater than a predefined threshold), then the wet microphone condition is detected 410.
  • the selection algorithm described above may be applied to a group of enhanced microphones 1 10 and group of reference microphones 120 instead of a single enhanced microphone 1 10 and single reference microphone 120.
  • the enhanced microphone signal and reference microphone signal inputted to the processes above may comprise, for example, an average of all of the enhanced microphones and the reference microphones respectively. Then the processes described above select either the enhanced microphone group or the reference group.
  • a separate selection algorithm may be applied to select an audio block from one of the microphones in the selected group to provide to the audio encoder 140 (e.g., the signal with the lowest noise).
  • a process selects a subset of microphones out of a group of microphones that may include reference microphones or enhanced microphones.
  • FIG. 5 illustrates an embodiment of a process performed by the microphone selection controller 130 for choosing N microphones out of a group of M microphones. Audio signals are received 502 from each of the microphones in the group. Adverse conditions such as wind (e.g., low correlation value) or wet microphone (e.g., using the process of FIG. 4) are detected 504 if present. If no adverse conditions (e.g., wind, water, etc.) are detected, the microphone selection controller 130 selects 506 N microphones in the group of M microphones that are pre-identified as being preferred microphones.
  • wind e.g., low correlation value
  • wet microphone e.g., using the process of FIG.
  • the RMS levels of each of the M microphones are measured 508 and a bias value is added to each microphone.
  • the bias value is determined based on the bias equations (2) - (4) described above.
  • the bias value for each microphone may be different depending on the configuration of each microphone.
  • the bias function can be a function of the correlation metric, the RMS values of all other microphones and the determination of whether or not the microphone is under water. Then, the N microphones having the lowest sums of their respective bias values and RMS levels are selected 510.
  • the microphone selection controller 130 picks the N microphones having the smallest cost value of J and where Ji is a cost value associated with the z ' th microphone, is the correlation metric, R, is the RMS value of the z ' th microphone, and f, is a predefined cost function.
  • g(X) is the piecewise linear function described in the bias equations above
  • f ⁇ is the cost function for the enhanced microphone 1 10
  • / 2 is the cost function for the reference microphone 120.
  • a hysteresis bias may also be included as described above, except with potentially different thresholds, depending on the configuration.
  • FIGs. 6A-6B illustrate perspective views of an example camera 600 in which the audio capture system 100 may be integrated.
  • the camera 600 comprises at least one cross- section having four approximately equal length sides in a two-dimensional plane. Although the cross-section is substantially square, the corners of the cross-section may be rounded in some embodiments (e.g., a rounded square or squircle).
  • the exterior of the square camera 600 includes 6 surfaces (i.e. a front face, a left face, a right face, a back face, a top face, and a bottom face). In the illustrated embodiment, the exterior surfaces substantially conform to a rectangular cuboid, which may have rounded or unrounded corners.
  • all camera surfaces may also have a substantially square (or rounded square) profile, making the square camera 600 substantially cubic.
  • only two of the six faces e.g., the front face 610 and back face 640
  • the other faces may be other shapes, such as rectangles.
  • the camera 600 can have a small form factor (e.g. a height of 2 cm to 9 cm, a width of 2 cm to 9 cm, and a depth of 2 cm to 9 cm) and is made of a rigid material such as plastic, rubber, aluminum, steel, fiberglass, or a combination of materials.
  • the camera 600 may have a different form factor.
  • the camera 600 includes a camera lens window 602 surrounded by a front face perimeter portion 608 on a front face 610, an interface button 604 and a display 614 on a top face 620, an I/O door 606 on a side face 630, and a back door 612 on a back face 640.
  • the camera lens window 602 comprises a transparent or substantially transparent material (e.g., glass or plastic) that enables light to pass through to an internal lens assembly.
  • the camera lens window 602 is substantially flat (as opposed to a convex lens window found in many conventional cameras).
  • the front face 610 of the camera 600 furthermore comprises a front face perimeter portion 608 that surrounds the lens window 602.
  • the front face perimeter portion 608 comprises a set of screws to secure the front face perimeter portion 608 to the remainder of the housing of the camera 600 and to hold the lens window 602 in place.
  • the interface button 604 provides a user interface that when activated enables a user to control various functions of the camera 600. For example, pressing the button 604 may control the camera to power on or power off, take pictures or record video, save a photo, adjust camera settings, or perform any other action relevant to recording or storing digital media.
  • the interface button 604 may perform different functions depending on the type of interaction (e.g., short press, long press, single tap, double tap, triple tap, etc.) In alternative embodiments, these functions may also be controlled by other types of interfaces such as a knob, a switch, a dial, a touchscreen, voice control, etc.
  • the camera 600 may have more than one interface button 604 or other controls.
  • the display 614 comprises, for example, a light emitting diode (LED) display, a liquid crystal display (LCD) or other type of display for displaying various types of information such as camera status and menus.
  • LED light emitting diode
  • LCD liquid crystal display
  • the interface button 604, display 606, and/or other interface features may be located elsewhere on the camera 600.
  • the I/O door 606 provides a protective cover for various input/output ports of the camera 600.
  • the camera 600 includes a Universal Serial Bus (USB) port and/or a High-Definition Media Interface (HDMI) port, and a memory card slot accessible behind the I/O door 606.
  • USB Universal Serial Bus
  • HDMI High-Definition Media Interface
  • additional or different input/output ports may be available behind the I/O door 606 or elsewhere on the camera 600.
  • the back door 612 provides a protective cover that when removed enables access to internal components of the camera 600.
  • a removable battery is accessible via the back door 612.
  • the camera 600 described herein includes features other than those described below.
  • the square camera 600 can include additional buttons or different interface features such as a speakers and/or various input/output ports.
  • the reference microphone 110 is integrated with or near the back door 612 of the camera 600 such that it is positioned near the rear of the camera 600, and the enhanced microphone is integrated with the front face 610 of the camera 600 such that it is positioned near the front of the camera 600.
  • FIG. 7 illustrates an example of a front face perimeter portion 608 of a camera 600 with an integrated drain enhancement feature in the form of a channel 702 between a recess 704 where the enhanced microphone 110 (not shown) is positioned, and one or more drains (e.g,. an upper drain structure 708 and a lower drain structure 706, each of which may comprise a single drain or multiple drains) to enable liquid to drain.
  • Drains e.g,. an upper drain structure 708 and a lower drain structure 706, each of which may comprise a single drain or multiple drains
  • Microphone ports 710 provide openings to let sound reach the microphone(s) housed in recess 704.
  • the upper drain structure 708 is positioned above the channel 702 and the lower drain structure 706 is positioned below the channel 702.
  • the lower drain structure 706 is generally much larger than the upper drain structure 708.
  • the entire channel 702 generally fills with water.
  • the large mass of water in the channel 702 flows out through the lower drain structure 706 through the force of gravity. This pulls air in through upper drain structure 708 and clears water from the recess 704, the upper drain structure 708, and/or the microphone ports 710, thus allowing the microphone to resume normal acoustic performance.
  • Coupled along with its derivatives.
  • the term “coupled” as used herein is not necessarily limited to two or more elements being in direct physical or electrical contact. Rather, the term
  • Coupled may also encompass two or more elements are not in direct contact with each other, but yet still co-operate or interact with each other, or are structured to provide a drainage path between the elements.
  • any reference to "one embodiment” or “an embodiment” means that a particular element, feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment.
  • the appearances of the phrase “in one embodiment” in various places in the specification are not necessarily all referring to the same embodiment.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computer Hardware Design (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

Un système de capture audio pour caméra de sport comprend au moins un microphone « évolué » et au moins un microphone de « référence ». Le microphone évolué présente une caractéristique d'amélioration de l'évacuation permettant une évacuation de l'eau plus rapide dans le microphone évolué que dans le microphone de référence. Un dispositif de commande de sélection de microphone procède à une sélection entre les microphones sur la base d'un algorithme de sélection de microphone afin de permettre des conditions de qualité élevée dans lesquelles la caméra de sport se trouve alternativement dans et hors de l'eau lors d'activités telles que le surf, le ski nautique et la natation, ou dans d'autres environnements humides.
PCT/US2016/039679 2015-07-02 2016-06-27 Sélection automatique d'un microphone dans une caméra de sport Ceased WO2017003958A1 (fr)

Applications Claiming Priority (10)

Application Number Priority Date Filing Date Title
US201562188450P 2015-07-02 2015-07-02
US62/188,450 2015-07-02
US15/083,262 US9706088B2 (en) 2015-07-02 2016-03-28 Automatic microphone selection in a sports camera
US15/083,264 2016-03-28
US15/083,267 2016-03-28
US15/083,264 US9661195B2 (en) 2015-07-02 2016-03-28 Automatic microphone selection in a sports camera based on wet microphone determination
US15/083,262 2016-03-28
US15/083,267 US9787884B2 (en) 2015-07-02 2016-03-28 Drainage channel for sports camera
US15/083,266 2016-03-28
US15/083,266 US9769364B2 (en) 2015-07-02 2016-03-28 Automatically determining a wet microphone condition in a sports camera

Publications (1)

Publication Number Publication Date
WO2017003958A1 true WO2017003958A1 (fr) 2017-01-05

Family

ID=56411910

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2016/039679 Ceased WO2017003958A1 (fr) 2015-07-02 2016-06-27 Sélection automatique d'un microphone dans une caméra de sport

Country Status (1)

Country Link
WO (1) WO2017003958A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021257620A1 (fr) * 2020-06-15 2021-12-23 Axon Enterprise, Inc. Audio directionnel adaptatif pour dispositifs audio habitroniques

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6292213B1 (en) * 1997-03-30 2001-09-18 Michael J. Jones Micro video camera usage and usage monitoring
US20130282369A1 (en) * 2012-04-23 2013-10-24 Qualcomm Incorporated Systems and methods for audio signal processing
US20140185853A1 (en) * 2012-12-27 2014-07-03 Panasonic Corporation Waterproof microphone device

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6292213B1 (en) * 1997-03-30 2001-09-18 Michael J. Jones Micro video camera usage and usage monitoring
US20130282369A1 (en) * 2012-04-23 2013-10-24 Qualcomm Incorporated Systems and methods for audio signal processing
US20140185853A1 (en) * 2012-12-27 2014-07-03 Panasonic Corporation Waterproof microphone device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2021257620A1 (fr) * 2020-06-15 2021-12-23 Axon Enterprise, Inc. Audio directionnel adaptatif pour dispositifs audio habitroniques

Similar Documents

Publication Publication Date Title
US10771660B2 (en) Automatically determining a wet microphone condition in a camera
US12014116B2 (en) Generating an audio signal from multiple microphones based on uncorrelated noise detection
US12088894B2 (en) Drainage channels for use in a camera
KR102313894B1 (ko) 바람 잡음 검출을 위한 방법 및 장치
JP5594133B2 (ja) 音声信号処理装置、音声信号処理方法及びプログラム
JP2005110127A (ja) 風雑音検出装置及びそれを有するビデオカメラ装置
US9826134B2 (en) Imaging apparatus having a microphone and directivity control
CN103688307A (zh) 音频信号处理装置、成像装置、音频信号处理方法、程序和记录介质
JP2006222618A (ja) カメラ装置、カメラ制御プログラム及び記録音声制御方法
WO2017003958A1 (fr) Sélection automatique d'un microphone dans une caméra de sport
US9872006B2 (en) Audio signal level estimation in cameras
Terano et al. Sound capture from rolling-shuttered visual camera based on edge detection
JPH05119794A (ja) 収音装置
WO2012066265A4 (fr) Systeme de correction de spectre destine notamment a une salle de spectacle
JP2010081395A (ja) 電子機器

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 16738940

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 16738940

Country of ref document: EP

Kind code of ref document: A1