US20040131196A1 - Sound processing - Google Patents
Sound processing Download PDFInfo
- Publication number
- US20040131196A1 US20040131196A1 US10/475,282 US47528204A US2004131196A1 US 20040131196 A1 US20040131196 A1 US 20040131196A1 US 47528204 A US47528204 A US 47528204A US 2004131196 A1 US2004131196 A1 US 2004131196A1
- Authority
- US
- United States
- Prior art keywords
- sounding object
- sound
- sounding
- impulse
- impulse response
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S1/00—Two-channel systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/20—Arrangements for obtaining desired frequency or directional characteristics
- H04R1/32—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
- H04R1/40—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
- H04R1/406—Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers microphones
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Definitions
- This invention relates to sound processing and is concerned particularly although not exclusively with methods and processors for encoding radiation characteristics of sounding bodies.
- the sound source is given a limited directional variability, for instance, having a cone of directions where the sound changes character so as to appear to be facing towards or away from the listener's position.
- These forms of sounding body synthesis are very limited in their ability to provide realistic sound images especially as, in general, there is little or no provision for the effects of source-listener distance.
- Proper modelling of radiation characteristics over the whole surface is also important when generating the early reflections for a reverberation unit, since the reflections will, in most cases, not be of the part of the sounding object facing the listener.
- Some improvement may be made by means of a simplified model of the radiation pattern of the object. This may be coded using spherical harmonics in a manner analogous to the coding of soundfields. This allows the object to be rotated so that it maybe oriented correctly to the listening position, but it does not allow the effects of variation of the sound at the listening position with distance to be simulated appropriately. Tis variation is due to changes in the impulse response at the listener's position The impulse response changes with differing distances in two ways. Tis is illustrated in FIG. 1 of the accompanying drawings, which shows impulse responses at points spaced from a sounding object.
- the impulse response is, for simplicity, shown as being provided by three points, A, B and C on the sounding object (although in reality all points on the surface would contribute) and for two listener positions, P and Q. Both the position of the impulses in time and the differences in their amplitudes change with distance. Note that, as the distance increases between the object and the listener, the extra distance contribution of the displacement away from the origin along the Y axis decreases leading eventually; in the far field, to the situation where only distances along the X axis count.
- Preferred embodiments of the present invention aim to provide systems in which further characteristics of a sounding body are encoded using spherical harmonics in such a way as to allow simulation of both the radiation pattern of the sounding body and the effects of source-listener distance.
- This use of spherical harmonics permits the sounding object to be realistically portrayed without imposing heavy computational loads.
- a method of sound processing comprising the step of encoding by spherical harmonics the spatial radiation characteristics of a sounding object.
- a sound processor arranged to encode by spherical harmonics the spatial radiation characteristics of a sounding object.
- Said encoding may include generating impulse responses of the sounding object.
- Said impulse responses maybe measured or calculated.
- a microphone may be spaced from the sounding object and used to measure said impulse responses.
- Shape data may be input to represent the shape of the sounding object, from which data said impulse responses are calculated.
- Said shape data may be derived from the time of arrival of a first sound at each microphone of an array of microphones placed around the sounding object.
- Said shape data maybe synthesised.
- the shape of the sounding object maybe traced.
- Sound processing methods or sound processors as above may provide for manipulating the spatial characteristics of the sounding object prior to embedding the object in a final soundfield.
- Manipulating the spatial characteristics of the sounding object may include transforming the apparent orientation of the sounding object with respect to a listener.
- Manipulating the spatial characteristics of the sounding object may include transforming the apparent distance of the sounding object from to a listener.
- Sound processing methods or sound processors as above may generate a final impulse response to represent the spatial radiation characteristics of the sounding object and apply said final impulse response to a sound source.
- Sound processing methods and sound processors as above may include any one or more of the features disclosed in this specification.
- FIGS. 2 to 6 of the accompanying diagrammatic drawings in which:
- FIG. 2 is a flowchart to illustrate one example of an encoding process in accordance with one example of the invention
- FIG. 3 shows a non-distance weighted impulse response for a zero order spherical harmonic
- FIG. 4 shows a non-distance weighted impulse response for a first order spherical harmonic
- FIG. 5 illustrates an array of microphones for measuring the shape of a sounding object
- FIG. 6 illustrates use of a microphone, placed far away from a sounding body, to measure an impulse response of correct modified form.
- the shape of a sounding object is encoded in such a way as to allow easy calculation of the impulse response at the listening point.
- the shape is decomposed in step 105 into a weighted sum of spherical harmonics, comprising at least the order 0 components and such higher orders as are deemed necessary.
- the weights are stored individually.
- the spherical harmonics may take the same names as in ambisonic B format, such that W and X, Y, Z are the order zero harmonic and the three order one harmonics, respectively.
- Each shape as defined by the individual spherical harmonics is also used to calculate an impulse response for that spherical harmonic, in step 106 .
- impulse responses are of a modified form where the impulse consists of sums of equally weighted components, so each time point can only take integer values for the size of the impulse at that point.
- Each point on the shape that has the same delay as another contributes a unit amount to the corresponding time point in the final non-distance weighted impulse response.
- the length of the impulse response is determined by the overall size of the sounding body.
- the shape may be synthesised according to the wishes of the user, using any suitable means, such as a Computer Aided Design Package, or by direct input of shape data, as in step 102 .
- the shape of a real object for instance a piano or an aeroplane, can be traced, as in step 101 .
- step 107 the process allows the apparent orientation and distance of the sounding object to be varied.
- step 108 the sounding object is first oriented in the acoustic scene in accordance with its relationship to the listener, for instance by applying rotational transforms such as an angular rotation to the left by an angle of ⁇ from the centre front coupled with a tilt by an angle a from the horizontal which requires the following transformation
- step 109 a weighted sum of the spherical harmonic coded impulse responses may be produced, corresponding to the non-distance weighted impulse response required for the relationship of the sounding object to the listening position.
- the form of these non-distance weighted impulse responses is shown in FIG. 3, which displays the zeroeth spherical harmonic and in FIG. 4, which shows one of the first order spherical harmonics.
- the effects of distance on the amplitude of each impulse can then be applied in step 110 by weighting the value of the impulse at each time point according to the inverse square law, derived by using the formula
- Ts is the tune of appearance of the first component in the impulse response and Tc that of the current component. This produces the final impulse response, the accuracy of whose match to reality can be chosen, in accordance with the computing power available and the quality of effect desired, by varying the number and maximum order of spherical harmonics used.
- any sound, recorded or synthesised may be processed by using the impulse response so generated, via means such as convolution in step 111 , so as to apply the appropriate frequency domain corrections such tat it will sound as if it was emitted by the sounding object at the desired distance and orientation from the listening body. Further processing by the already known ambisonic panning processes, or by any other form of sound spatialization, will yield a final image of the desired nature, in step 112 .
- the surface shape of the object can be determined by normal measurement means and the weighting of the spherical harmonics encoding the shape maybe derived by means of a suitable Fourier series analysis in step 105 , yielding the following formulae for the weights of each spherical harmonic component:
- the shape of the sounding object may be measured using an array of microphones such as is illustrated in FIG. 5, where the time of arrival of the first sound at each microphone can be used to determine the distance to the nearest point to that microphone.
- FIG. 6 illustrates a further option of this example of the invention, whereby a microphone, if placed far enough away from the sounding body, may be used to measure an impulse response of the correct modified form, as in step 103 .
- a microphone if placed far enough away from the sounding body, may be used to measure an impulse response of the correct modified form, as in step 103 .
- Measurement of a sufficient number of these inpulse responses over an appropriate grid of measurement points enables a spherical harmonic encoded form to be derived in step 104 , via a process of approximation similar to that discussed above.
- step 113 another similar process of spherical harmonic coding can be used to define the distribution of radiation characteristics across the surface of the sphere. This may be accomplished in step 113 by means such as providing different filtering functions to model bright or dull sounding areas of the surface. This is important in, for instance, modelling speech, where the spectral content of the speech varies, depending on whether the person speaking is facing the listener or not.
- the use of spherical harmonic encoding for the variations of these filtering functions over the surface of the object means that they may be oriented correctly in step 114 , in a manner similar to that used for the impulse responses, prior to being applied to the sound instep 115 .
- the apparent size of the object may be varied by varying the length of the impulse response. This may be accomplished either by recalculating the basic impulse response or otherwise. In one example, this is done by placing the impulse response in a look-up table and using computing means to vary the rate at which values are read out. By either discarding unwanted values when the new impulse response is shorter than the original or, in the case where the new impulse response is longer than the original, by calculating new intermediate values, either by interpolation from adjacent values or otherwise, the length of the impulse response and hence the size of the object can be controlled.
- the effect on the impulse response of the distance between the sounding body and the listener being such that the effect of the distance along the Y-axis becomes significant can be incorporated.
- the time axis may be warped to model the extra delay imposed by the point's distance from the Y-axis.
- a typical warping factor is represented by that for the zero order spherical harmonic
- n is the number of the sample and all points are expressed in terms of multiples of the size of the object.
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Stereophonic System (AREA)
- Sink And Installation For Waste Water (AREA)
- Saccharide Compounds (AREA)
- Massaging Devices (AREA)
- Transducers For Ultrasonic Waves (AREA)
Abstract
The spatial radiation characteristics of a sounding object are encoded by spherical harmonics. The shape is decomposed (105) into a weighted sum of spherical harmonics, comprising at least the order 0 components and such higher orders as are deemed necessary. The weights are stored individually. Each shape as defined by the individual spherical harmonics is also used to calculate an impulse response for that spherical harmonic (106). These impulse responses are of a modified form where the impulse consists of sums of equally weighted components, so each time point can only take integer values for the size of the impulse at that point. The modified impulse responses are transformed into spherical harmonic form (107), after which the apparent orientation and distance of the sounding object may be varied. Any sound may be processed by using the impulse response so generated (111).
Description
- This invention relates to sound processing and is concerned particularly although not exclusively with methods and processors for encoding radiation characteristics of sounding bodies.
- Systems for recording and reproducing sounds capable of retaining the spatial characteristics of an original soundfield have been known for many years. For instance, the ambisonic surround sound system uses spherical harmonics to encode the direction of sound sources win a three dimensional soundfield. Recently, this form of representation of a soundfield has been extended from the original, four channel, first order version to include second and possible higher order spherical harmonics necessary to attain higher precision and a wider useful audience area. However, even first order, four channel soundfields, recorded from real acoustic scenes using a suitable microphone, capture well the complex extended nature of real sound radiating bodies. On the other hand, even within ambisonic systems, when soundfields have to be synthesised, for instance, when constructing an artificial sound image for a film soundtrack or a computer game, the ability to portray sound sources as extended objects has been limited by available technology. As a result, this portrayal has largely been limited to either idealised point sources or to sources having a very simplified impression of being “larger than a point source”. Typically, this enlargement has, in ambisonic systems, been implemented either by simply exaggerating the non-directional zeroeth order spherical harmonic or by phase shifted based ‘spreader’ controls. In some other systems, for instance Microsoft's DirectSound, the sound source is given a limited directional variability, for instance, having a cone of directions where the sound changes character so as to appear to be facing towards or away from the listener's position. These forms of sounding body synthesis are very limited in their ability to provide realistic sound images especially as, in general, there is little or no provision for the effects of source-listener distance. Proper modelling of radiation characteristics over the whole surface is also important when generating the early reflections for a reverberation unit, since the reflections will, in most cases, not be of the part of the sounding object facing the listener.
- On the other hand, it should be noted that within full acoustic simulation systems, the contributions of sounds arriving at the listening position from all points on a sounding object can be calculated by solving the wave equations for each source-listener path or by other suitable means, and this can provide fully realistic sound images. This approach, however, imposes heavy computational loadings on systems, which can be inconvenient when there is restricted available computing power or when realtime operation is desired.
- Some improvement may be made by means of a simplified model of the radiation pattern of the object. This may be coded using spherical harmonics in a manner analogous to the coding of soundfields. This allows the object to be rotated so that it maybe oriented correctly to the listening position, but it does not allow the effects of variation of the sound at the listening position with distance to be simulated appropriately. Tis variation is due to changes in the impulse response at the listener's position The impulse response changes with differing distances in two ways. Tis is illustrated in FIG. 1 of the accompanying drawings, which shows impulse responses at points spaced from a sounding object.
- In FIG. 1, the impulse response is, for simplicity, shown as being provided by three points, A, B and C on the sounding object (although in reality all points on the surface would contribute) and for two listener positions, P and Q. Both the position of the impulses in time and the differences in their amplitudes change with distance. Note that, as the distance increases between the object and the listener, the extra distance contribution of the displacement away from the origin along the Y axis decreases leading eventually; in the far field, to the situation where only distances along the X axis count.
- Preferred embodiments of the present invention aim to provide systems in which further characteristics of a sounding body are encoded using spherical harmonics in such a way as to allow simulation of both the radiation pattern of the sounding body and the effects of source-listener distance. This use of spherical harmonics permits the sounding object to be realistically portrayed without imposing heavy computational loads.
- More generally, according to one aspect of the present invention, there is provided a method of sound processing, comprising the step of encoding by spherical harmonics the spatial radiation characteristics of a sounding object.
- According to another aspect of the present invention, there is provided a sound processor arranged to encode by spherical harmonics the spatial radiation characteristics of a sounding object.
- Said encoding may include generating impulse responses of the sounding object.
- Said impulse responses maybe measured or calculated.
- A microphone may be spaced from the sounding object and used to measure said impulse responses.
- Shape data may be input to represent the shape of the sounding object, from which data said impulse responses are calculated.
- Said shape data may be derived from the time of arrival of a first sound at each microphone of an array of microphones placed around the sounding object.
- Said shape data maybe synthesised.
- The shape of the sounding object maybe traced.
- Sound processing methods or sound processors as above may provide for manipulating the spatial characteristics of the sounding object prior to embedding the object in a final soundfield.
- Manipulating the spatial characteristics of the sounding object may include transforming the apparent orientation of the sounding object with respect to a listener.
- Manipulating the spatial characteristics of the sounding object may include transforming the apparent distance of the sounding object from to a listener.
- Sound processing methods or sound processors as above may generate a final impulse response to represent the spatial radiation characteristics of the sounding object and apply said final impulse response to a sound source.
- Sound processing methods and sound processors as above may include any one or more of the features disclosed in this specification.
- For a better understanding of the invention, and to show how embodiments of the same may be carried into effect, reference will now be made, by way of example, to FIGS. 2 to 6 of the accompanying diagrammatic drawings, in which:
- FIG. 2 is a flowchart to illustrate one example of an encoding process in accordance with one example of the invention;
- FIG. 3 shows a non-distance weighted impulse response for a zero order spherical harmonic;
- FIG. 4 shows a non-distance weighted impulse response for a first order spherical harmonic;
- FIG. 5 illustrates an array of microphones for measuring the shape of a sounding object; and
- FIG. 6 illustrates use of a microphone, placed far away from a sounding body, to measure an impulse response of correct modified form.
- Referring now to FIG. 2, in one example of the invention, the shape of a sounding object is encoded in such a way as to allow easy calculation of the impulse response at the listening point. The shape is decomposed in
step 105 into a weighted sum of spherical harmonics, comprising at least theorder 0 components and such higher orders as are deemed necessary. The weights are stored individually. The spherical harmonics may take the same names as in ambisonic B format, such that W and X, Y, Z are the order zero harmonic and the three order one harmonics, respectively. Each shape as defined by the individual spherical harmonics is also used to calculate an impulse response for that spherical harmonic, instep 106. These impulse responses are of a modified form where the impulse consists of sums of equally weighted components, so each time point can only take integer values for the size of the impulse at that point. Each point on the shape that has the same delay as another contributes a unit amount to the corresponding time point in the final non-distance weighted impulse response. The length of the impulse response is determined by the overall size of the sounding body. The shape may be synthesised according to the wishes of the user, using any suitable means, such as a Computer Aided Design Package, or by direct input of shape data, as instep 102. Alternatively, the shape of a real object, for instance a piano or an aeroplane, can be traced, as instep 101. - Once the modified impulse responses have been computed, or measured, and transformed into spherical harmonic form in
step 107, which we call ‘O’ format, the process allows the apparent orientation and distance of the sounding object to be varied. Instep 108, the sounding object is first oriented in the acoustic scene in accordance with its relationship to the listener, for instance by applying rotational transforms such as an angular rotation to the left by an angle of β from the centre front coupled with a tilt by an angle a from the horizontal which requires the following transformation - W′=W
- X′=X*cos βY*sin β
- Y′=X*sin β*cos a+Y*cos β*cos a−Z*sin a
- Z′=X*sin β*sin a+Y*cos β*sin a+Z*cos a
- where W′, X′, Y′, Z′ form the rotated and tilted spherical harmonics describing the reoriented sounding object Following this transformation, in
step 109, a weighted sum of the spherical harmonic coded impulse responses may be produced, corresponding to the non-distance weighted impulse response required for the relationship of the sounding object to the listening position. The form of these non-distance weighted impulse responses is shown in FIG. 3, which displays the zeroeth spherical harmonic and in FIG. 4, which shows one of the first order spherical harmonics. The effects of distance on the amplitude of each impulse can then be applied instep 110 by weighting the value of the impulse at each time point according to the inverse square law, derived by using the formula - (Ts/Tc)2
- where Ts is the tune of appearance of the first component in the impulse response and Tc that of the current component. This produces the final impulse response, the accuracy of whose match to reality can be chosen, in accordance with the computing power available and the quality of effect desired, by varying the number and maximum order of spherical harmonics used.
- Following computation of the final impulse response, any sound, recorded or synthesised, may be processed by using the impulse response so generated, via means such as convolution in
step 111, so as to apply the appropriate frequency domain corrections such tat it will sound as if it was emitted by the sounding object at the desired distance and orientation from the listening body. Further processing by the already known ambisonic panning processes, or by any other form of sound spatialization, will yield a final image of the desired nature, instep 112. - It will be understood that the surface shape of the object can be determined by normal measurement means and the weighting of the spherical harmonics encoding the shape maybe derived by means of a suitable Fourier series analysis in
step 105, yielding the following formulae for the weights of each spherical harmonic component: -
- Other forms of approximation may be adopted appropriate to the distribution of convenient measurement points. The shape of the sounding object may be measured using an array of microphones such as is illustrated in FIG. 5, where the time of arrival of the first sound at each microphone can be used to determine the distance to the nearest point to that microphone.
- FIG. 6 illustrates a further option of this example of the invention, whereby a microphone, if placed far enough away from the sounding body, may be used to measure an impulse response of the correct modified form, as in
step 103. This results when the angels subtended by all points on the surface away from the microphone's axis are so small that there is an insignificant extra time difference between points on the microphone axis and those off it. Measurement of a sufficient number of these inpulse responses over an appropriate grid of measurement points enables a spherical harmonic encoded form to be derived instep 104, via a process of approximation similar to that discussed above. - In another option of this exile of the invention, another similar process of spherical harmonic coding can be used to define the distribution of radiation characteristics across the surface of the sphere. This may be accomplished in
step 113 by means such as providing different filtering functions to model bright or dull sounding areas of the surface. This is important in, for instance, modelling speech, where the spectral content of the speech varies, depending on whether the person speaking is facing the listener or not. The use of spherical harmonic encoding for the variations of these filtering functions over the surface of the object means that they may be oriented correctly instep 114, in a manner similar to that used for the impulse responses, prior to being applied to thesound instep 115. - In a further option of this example of the invention, the apparent size of the object may be varied by varying the length of the impulse response. This may be accomplished either by recalculating the basic impulse response or otherwise. In one example, this is done by placing the impulse response in a look-up table and using computing means to vary the rate at which values are read out. By either discarding unwanted values when the new impulse response is shorter than the original or, in the case where the new impulse response is longer than the original, by calculating new intermediate values, either by interpolation from adjacent values or otherwise, the length of the impulse response and hence the size of the object can be controlled.
- By a similar means, the effect on the impulse response of the distance between the sounding body and the listener being such that the effect of the distance along the Y-axis becomes significant can be incorporated. In this case, the time axis may be warped to model the extra delay imposed by the point's distance from the Y-axis. A typical warping factor is represented by that for the zero order spherical harmonic
- {square root}{square root over ((sin(cos−1(n)))2+(1−n)2)}
- where n is the number of the sample and all points are expressed in terms of multiples of the size of the object. By a similar means, or otherwise, the effect of sound diffusion from areas of the sounding object facing away from the listener or otherwise obstructed from having a direct path to the listening position may be modelled, such that sounds of some wavelengths are delayed more than others, as is well known from the study of acoustics.
- The above-described and illustrated examples of the invention enable the construction of more realistic sound objects for use within synthesised ambisonic soundfields, whilst maintaining the simplicity and ease of use of ambisonics.
- The above-described and illustrated examples of using spherical harmonics allow sound objects to be manipulated spatially at low computational cost, with processing effects such as rotation, tilt, tumbling, etc., prior to embedding the sound object in a final soundfield. After embedding, only normal manipulations of the soundfield as a whole would normally be possible. The order of the format of the sound object prior to embedding does not have to match that of the soundfield it is eventually embedded in, since it may be passed through a matrix akin to that used for speaker decoding prior to being added, and only the output of the matrix need be of matching order. Tis means that high order descriptions of sound objects can be embedded in standard low order soundfields, allowing very rich acoustic behavior to be implemented without necessarily impacting on the final channel numbers and hence the storage required.
- In this specification, the verb “comprise” has its normal dictionary meaning, to denote non-exclusive inclusion. That is, use of the word “comprise” (or any of its derivatives) to include one feature or more, does not exclude the possibility of also including further features.
- All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and/or all of the steps of any method or process so disclosed, may be combined in any combination, except combinations where at least some of such features and/or steps are mutually exclusive.
- Each feature disclosed in this specification (including any accompanying claims, abstract and drawings), may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise. Thus, unless expressly stated otherwise, each feature disclosed is one example only of a generic series of equivalent or similar features.
- The invention is not restricted to the details of the foregoing embodiment(s). The invention extends to any novel one, or any novel combination, of the features disclosed in this specification (including any accompanying claims, abstract and drawings), or to any novel one, or any novel combination, of the steps of any method or process so disclosed.
Claims (17)
1. A method of sound processing, comprising the step of encoding by spherical harmonics the spatial radiation characteristics of a sounding object.
2. A method according to claim 1 , wherein said encoding step includes generating impulse responses of the sounding object.
3. A method according to claim 2 , wherein said impulse responses are measured.
4. A method according to claim 3 , wherein a microphone is spaced from the sounding object and used to measure said impulse responses.
5. A method according to claim 2 , wherein said impulse responses are calculated.
6. A method according to claim 5 , including the step of inputting shape data representing the shape of the sounding object, from which data said impulse responses are calculated.
7. A method according to claim 6 , including the step of deriving said shape data from the time of arrival of a first sound at each microphone of an array of microphones placed around the sounding object.
8. A method according to claim 6 , including the step of synthesising said shape data.
9. A method according to claim 6 , including the step of tracing the shape of the sounding object.
10. A method according to any of the preceding claims, including the step of manipulating the spatial characteristics of the sounding object prior to embedding the object in a final soundfield.
11. A method according to claim 10 , wherein said step of manipulating the spatial characteristics of the sounding object includes transforming the apparent orientation of the sounding object with respect to a listener.
12. A method according to claim 10 or 11, wherein said step of manipulating the spatial characteristics of the sounding object includes transforming the apparent distance of the sounding object from to a listener.
13. A method according to any of the preceding claims, including the step of generating a final impulse response to represent the spatial radiation characteristics of the sounding object and applying said final impulse response to a sound source.
14. A method of sound processing, the method being substantially as hereinbefore described with reference to the accompanying drawings.
15. A sound processor arranged to encode by spherical harmonics the spatial radiation characteristics of a sounding object.
16. A sound processor according to claim 14 and arranged to carry out a method according to any of claims 1 to 14 .
17. A sound processor substantially as hereinbefore described with reference to the accompanying drawings.
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| GB0109498A GB2379147B (en) | 2001-04-18 | 2001-04-18 | Sound processing |
| GB0109498.6 | 2001-04-18 | ||
| PCT/GB2002/001719 WO2002085068A2 (en) | 2001-04-18 | 2002-04-18 | Sound processing |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| US20040131196A1 true US20040131196A1 (en) | 2004-07-08 |
Family
ID=9912972
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US10/475,282 Abandoned US20040131196A1 (en) | 2001-04-18 | 2002-04-18 | Sound processing |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US20040131196A1 (en) |
| EP (1) | EP1380189B1 (en) |
| AT (1) | ATE276637T1 (en) |
| DE (1) | DE60201267T2 (en) |
| GB (1) | GB2379147B (en) |
| WO (1) | WO2002085068A2 (en) |
Cited By (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20140358557A1 (en) * | 2013-05-29 | 2014-12-04 | Qualcomm Incorporated | Performing positional analysis to code spherical harmonic coefficients |
| US20150154971A1 (en) * | 2012-07-16 | 2015-06-04 | Thomson Licensing | Method and apparatus for encoding multi-channel hoa audio signals for noise reduction, and method and apparatus for decoding multi-channel hoa audio signals for noise reduction |
| US9489955B2 (en) | 2014-01-30 | 2016-11-08 | Qualcomm Incorporated | Indicating frame parameter reusability for coding vectors |
| US9495968B2 (en) | 2013-05-29 | 2016-11-15 | Qualcomm Incorporated | Identifying sources from which higher order ambisonic audio data is generated |
| US9620137B2 (en) | 2014-05-16 | 2017-04-11 | Qualcomm Incorporated | Determining between scalar and vector quantization in higher order ambisonic coefficients |
| US9641834B2 (en) | 2013-03-29 | 2017-05-02 | Qualcomm Incorporated | RTP payload format designs |
| US9747910B2 (en) | 2014-09-26 | 2017-08-29 | Qualcomm Incorporated | Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework |
| US9852737B2 (en) | 2014-05-16 | 2017-12-26 | Qualcomm Incorporated | Coding vectors decomposed from higher-order ambisonics audio signals |
| US9922656B2 (en) | 2014-01-30 | 2018-03-20 | Qualcomm Incorporated | Transitioning of ambient higher-order ambisonic coefficients |
| US10770087B2 (en) | 2014-05-16 | 2020-09-08 | Qualcomm Incorporated | Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals |
| CN112005556A (en) * | 2018-02-22 | 2020-11-27 | 诺莫诺股份有限公司 | Positioning sound source |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| GB2476747B (en) | 2009-02-04 | 2011-12-21 | Richard Furse | Sound system |
| ITUD20090183A1 (en) * | 2009-10-14 | 2011-04-15 | Advanced Ind Design In Acous Tic Srl | PROBE FOR VECTOR UNDERWATER MEASUREMENTS OF SOUND FIELD |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5555306A (en) * | 1991-04-04 | 1996-09-10 | Trifield Productions Limited | Audio signal processor providing simulated source distance control |
| US6713539B2 (en) * | 2000-08-18 | 2004-03-30 | China Petroleum & Chemical Corporation | Storage-stable modified asphalt composition and its preparation process |
| US6904152B1 (en) * | 1997-09-24 | 2005-06-07 | Sonic Solutions | Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics in three dimensions |
Family Cites Families (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| GB9211756D0 (en) * | 1992-06-03 | 1992-07-15 | Gerzon Michael A | Stereophonic directional dispersion method |
| GB2352152B (en) * | 1998-03-31 | 2003-03-26 | Lake Technology Ltd | Formulation of complex room impulse responses from 3-D audio information |
| GB2343347B (en) * | 1998-06-20 | 2002-12-31 | Central Research Lab Ltd | A method of synthesising an audio signal |
| WO2000019415A2 (en) * | 1998-09-25 | 2000-04-06 | Creative Technology Ltd. | Method and apparatus for three-dimensional audio display |
| EP1158486A1 (en) * | 2000-05-18 | 2001-11-28 | TC Electronic A/S | Method of processing a signal |
| US6239348B1 (en) * | 1999-09-10 | 2001-05-29 | Randall B. Metcalf | Sound system and method for creating a sound event based on a modeled sound field |
| WO2001082651A1 (en) * | 2000-04-19 | 2001-11-01 | Sonic Solutions | Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics in three dimensions |
-
2001
- 2001-04-18 GB GB0109498A patent/GB2379147B/en not_active Expired - Fee Related
-
2002
- 2002-04-18 EP EP02720211A patent/EP1380189B1/en not_active Expired - Lifetime
- 2002-04-18 WO PCT/GB2002/001719 patent/WO2002085068A2/en not_active Ceased
- 2002-04-18 US US10/475,282 patent/US20040131196A1/en not_active Abandoned
- 2002-04-18 DE DE60201267T patent/DE60201267T2/en not_active Expired - Fee Related
- 2002-04-18 AT AT02720211T patent/ATE276637T1/en not_active IP Right Cessation
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5555306A (en) * | 1991-04-04 | 1996-09-10 | Trifield Productions Limited | Audio signal processor providing simulated source distance control |
| US6904152B1 (en) * | 1997-09-24 | 2005-06-07 | Sonic Solutions | Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics in three dimensions |
| US6713539B2 (en) * | 2000-08-18 | 2004-03-30 | China Petroleum & Chemical Corporation | Storage-stable modified asphalt composition and its preparation process |
Cited By (37)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| TWI602444B (en) * | 2012-07-16 | 2017-10-11 | 杜比國際公司 | Method and apparatus for encoding multi-channel hoa audio signals for noise reduction, and method and apparatus for decoding multi-channel hoa audio signals for noise reduction |
| US20150154971A1 (en) * | 2012-07-16 | 2015-06-04 | Thomson Licensing | Method and apparatus for encoding multi-channel hoa audio signals for noise reduction, and method and apparatus for decoding multi-channel hoa audio signals for noise reduction |
| US9460728B2 (en) * | 2012-07-16 | 2016-10-04 | Dolby Laboratories Licensing Corporation | Method and apparatus for encoding multi-channel HOA audio signals for noise reduction, and method and apparatus for decoding multi-channel HOA audio signals for noise reduction |
| TWI691214B (en) * | 2012-07-16 | 2020-04-11 | 瑞典商杜比國際公司 | Method and apparatus for decoding higher order ambisonics (hoa) audio signals and computer readable medium thereof |
| US10614821B2 (en) | 2012-07-16 | 2020-04-07 | Dolby Laboratories Licensing Corporation | Methods and apparatus for encoding and decoding multi-channel HOA audio signals |
| TWI674009B (en) * | 2012-07-16 | 2019-10-01 | 杜比國際公司 | Method and apparatus for decoding encoded hoa audio signals |
| US10304469B2 (en) | 2012-07-16 | 2019-05-28 | Dolby Laboratories Licensing Corporation | Methods and apparatus for encoding and decoding multi-channel HOA audio signals |
| US9837087B2 (en) | 2012-07-16 | 2017-12-05 | Dolby Laboratories Licensing Corporation | Method and apparatus for encoding multi-channel HOA audio signals for noise reduction, and method and apparatus for decoding multi-channel HOA audio signals for noise reduction |
| US9641834B2 (en) | 2013-03-29 | 2017-05-02 | Qualcomm Incorporated | RTP payload format designs |
| US9749768B2 (en) | 2013-05-29 | 2017-08-29 | Qualcomm Incorporated | Extracting decomposed representations of a sound field based on a first configuration mode |
| US9774977B2 (en) | 2013-05-29 | 2017-09-26 | Qualcomm Incorporated | Extracting decomposed representations of a sound field based on a second configuration mode |
| US9716959B2 (en) | 2013-05-29 | 2017-07-25 | Qualcomm Incorporated | Compensating for error in decomposed representations of sound fields |
| US11146903B2 (en) | 2013-05-29 | 2021-10-12 | Qualcomm Incorporated | Compression of decomposed representations of a sound field |
| US9466305B2 (en) * | 2013-05-29 | 2016-10-11 | Qualcomm Incorporated | Performing positional analysis to code spherical harmonic coefficients |
| US20140358557A1 (en) * | 2013-05-29 | 2014-12-04 | Qualcomm Incorporated | Performing positional analysis to code spherical harmonic coefficients |
| US11962990B2 (en) | 2013-05-29 | 2024-04-16 | Qualcomm Incorporated | Reordering of foreground audio objects in the ambisonics domain |
| US9763019B2 (en) | 2013-05-29 | 2017-09-12 | Qualcomm Incorporated | Analysis of decomposed representations of a sound field |
| US9883312B2 (en) | 2013-05-29 | 2018-01-30 | Qualcomm Incorporated | Transformed higher order ambisonics audio data |
| US9769586B2 (en) | 2013-05-29 | 2017-09-19 | Qualcomm Incorporated | Performing order reduction with respect to higher order ambisonic coefficients |
| US10499176B2 (en) | 2013-05-29 | 2019-12-03 | Qualcomm Incorporated | Identifying codebooks to use when coding spatial components of a sound field |
| US9495968B2 (en) | 2013-05-29 | 2016-11-15 | Qualcomm Incorporated | Identifying sources from which higher order ambisonic audio data is generated |
| US9502044B2 (en) | 2013-05-29 | 2016-11-22 | Qualcomm Incorporated | Compression of decomposed representations of a sound field |
| US9854377B2 (en) | 2013-05-29 | 2017-12-26 | Qualcomm Incorporated | Interpolation for decomposed representations of a sound field |
| US9980074B2 (en) | 2013-05-29 | 2018-05-22 | Qualcomm Incorporated | Quantization step sizes for compression of spatial components of a sound field |
| US9489955B2 (en) | 2014-01-30 | 2016-11-08 | Qualcomm Incorporated | Indicating frame parameter reusability for coding vectors |
| US9922656B2 (en) | 2014-01-30 | 2018-03-20 | Qualcomm Incorporated | Transitioning of ambient higher-order ambisonic coefficients |
| US9502045B2 (en) | 2014-01-30 | 2016-11-22 | Qualcomm Incorporated | Coding independent frames of ambient higher-order ambisonic coefficients |
| US9754600B2 (en) | 2014-01-30 | 2017-09-05 | Qualcomm Incorporated | Reuse of index of huffman codebook for coding vectors |
| US9747912B2 (en) | 2014-01-30 | 2017-08-29 | Qualcomm Incorporated | Reuse of syntax element indicating quantization mode used in compressing vectors |
| US9747911B2 (en) | 2014-01-30 | 2017-08-29 | Qualcomm Incorporated | Reuse of syntax element indicating vector quantization codebook used in compressing vectors |
| US9653086B2 (en) | 2014-01-30 | 2017-05-16 | Qualcomm Incorporated | Coding numbers of code vectors for independent frames of higher-order ambisonic coefficients |
| US9852737B2 (en) | 2014-05-16 | 2017-12-26 | Qualcomm Incorporated | Coding vectors decomposed from higher-order ambisonics audio signals |
| US9620137B2 (en) | 2014-05-16 | 2017-04-11 | Qualcomm Incorporated | Determining between scalar and vector quantization in higher order ambisonic coefficients |
| US10770087B2 (en) | 2014-05-16 | 2020-09-08 | Qualcomm Incorporated | Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals |
| US9747910B2 (en) | 2014-09-26 | 2017-08-29 | Qualcomm Incorporated | Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework |
| CN112005556A (en) * | 2018-02-22 | 2020-11-27 | 诺莫诺股份有限公司 | Positioning sound source |
| US11388512B2 (en) * | 2018-02-22 | 2022-07-12 | Nomono As | Positioning sound sources |
Also Published As
| Publication number | Publication date |
|---|---|
| EP1380189B1 (en) | 2004-09-15 |
| GB0109498D0 (en) | 2001-06-06 |
| GB2379147B (en) | 2003-10-22 |
| DE60201267T2 (en) | 2005-09-22 |
| DE60201267D1 (en) | 2004-10-21 |
| ATE276637T1 (en) | 2004-10-15 |
| WO2002085068A2 (en) | 2002-10-24 |
| WO2002085068A3 (en) | 2003-04-24 |
| WO2002085068A9 (en) | 2002-11-28 |
| GB2379147A (en) | 2003-02-26 |
| EP1380189A2 (en) | 2004-01-14 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Zotter | Analysis and synthesis of sound-radiation with spherical arrays | |
| Mehra et al. | Source and listener directivity for interactive wave-based sound propagation | |
| US20040131196A1 (en) | Sound processing | |
| US5452360A (en) | Sound field control device and method for controlling a sound field | |
| US20150326966A1 (en) | Methods, systems, and computer readable media for source and listener directivity for interactive wave-based sound propagation | |
| US20090310802A1 (en) | Virtual sound source positioning | |
| US10911885B1 (en) | Augmented reality virtual audio source enhancement | |
| JPH08272380A (en) | Method and device for reproducing virtual three-dimensional spatial sound | |
| Ahrens et al. | Computation of spherical harmonic representations of source directivity based on the finite-distance signature | |
| Ackermann et al. | Comparative evaluation of interpolation methods for the directivity of musical instruments | |
| Bodon | Development, evaluation, and validation of a high-resolution directivity measurement system for played musical instruments | |
| Mechel | Room acoustical fields | |
| Rungta et al. | Syncopation: Interactive synthesis-coupled sound propagation | |
| Pelzer et al. | Auralization of a virtual orchestra using directivities of measured symphonic instruments | |
| EP4205103B1 (en) | Method for generating a reverberation audio signal | |
| Ackermann et al. | Musical instruments as dynamic sound sources | |
| Pérez Carrillo et al. | Method for measuring violin sound radiation based on bowed glissandi and its application to sound synthesis | |
| Svensson | Modelling acoustic spaces for audio virtual reality | |
| Maestre et al. | State-space modeling of sound source directivity: An experimental study of the violin and the clarinet | |
| Malham | Spherical harmonic coding of sound objects-the ambisonic'O'format | |
| Kronland-Martinet et al. | Real-time perceptual simulation of moving sources: application to the Leslie cabinet and 3D sound immersion | |
| Plessas | Rigid sphere microphone arrays for spatial recording and holography | |
| US20090110202A1 (en) | Method of processing a signal | |
| Melchior | Investigations on spatial sound design based on measured room impulse responses | |
| Scavone | Modeling wind instrument sound radiation using digital waveguides |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| AS | Assignment |
Owner name: YORK, UNIVERSITY OF, UNITED KINGDOM Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MALHAM, DAVID GEORGE;REEL/FRAME:014976/0696 Effective date: 20031016 Owner name: YORK, UNIVERSITY OF, UNITED KINGDOM Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:DEMEESTER, JOSEPH;DE SMEDT, STEFAAN;SANDERS, NIEK;REEL/FRAME:014976/0674;SIGNING DATES FROM 20031118 TO 20031205 |
|
| STCB | Information on status: application discontinuation |
Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION |