[go: up one dir, main page]

US20040131196A1 - Sound processing - Google Patents

Sound processing Download PDF

Info

Publication number
US20040131196A1
US20040131196A1 US10/475,282 US47528204A US2004131196A1 US 20040131196 A1 US20040131196 A1 US 20040131196A1 US 47528204 A US47528204 A US 47528204A US 2004131196 A1 US2004131196 A1 US 2004131196A1
Authority
US
United States
Prior art keywords
sounding object
sound
sounding
impulse
impulse response
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US10/475,282
Inventor
David Malham
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of York
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Assigned to YORK, UNIVERSITY OF reassignment YORK, UNIVERSITY OF ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SANDERS, NIEK, DEMEESTER, JOSEPH, DE SMEDT, STEFAAN
Assigned to YORK, UNIVERSITY OF reassignment YORK, UNIVERSITY OF ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MALHAM, DAVID GEORGE
Publication of US20040131196A1 publication Critical patent/US20040131196A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/40Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
    • H04R1/406Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11Application of ambisonics in stereophonic audio systems

Definitions

  • This invention relates to sound processing and is concerned particularly although not exclusively with methods and processors for encoding radiation characteristics of sounding bodies.
  • the sound source is given a limited directional variability, for instance, having a cone of directions where the sound changes character so as to appear to be facing towards or away from the listener's position.
  • These forms of sounding body synthesis are very limited in their ability to provide realistic sound images especially as, in general, there is little or no provision for the effects of source-listener distance.
  • Proper modelling of radiation characteristics over the whole surface is also important when generating the early reflections for a reverberation unit, since the reflections will, in most cases, not be of the part of the sounding object facing the listener.
  • Some improvement may be made by means of a simplified model of the radiation pattern of the object. This may be coded using spherical harmonics in a manner analogous to the coding of soundfields. This allows the object to be rotated so that it maybe oriented correctly to the listening position, but it does not allow the effects of variation of the sound at the listening position with distance to be simulated appropriately. Tis variation is due to changes in the impulse response at the listener's position The impulse response changes with differing distances in two ways. Tis is illustrated in FIG. 1 of the accompanying drawings, which shows impulse responses at points spaced from a sounding object.
  • the impulse response is, for simplicity, shown as being provided by three points, A, B and C on the sounding object (although in reality all points on the surface would contribute) and for two listener positions, P and Q. Both the position of the impulses in time and the differences in their amplitudes change with distance. Note that, as the distance increases between the object and the listener, the extra distance contribution of the displacement away from the origin along the Y axis decreases leading eventually; in the far field, to the situation where only distances along the X axis count.
  • Preferred embodiments of the present invention aim to provide systems in which further characteristics of a sounding body are encoded using spherical harmonics in such a way as to allow simulation of both the radiation pattern of the sounding body and the effects of source-listener distance.
  • This use of spherical harmonics permits the sounding object to be realistically portrayed without imposing heavy computational loads.
  • a method of sound processing comprising the step of encoding by spherical harmonics the spatial radiation characteristics of a sounding object.
  • a sound processor arranged to encode by spherical harmonics the spatial radiation characteristics of a sounding object.
  • Said encoding may include generating impulse responses of the sounding object.
  • Said impulse responses maybe measured or calculated.
  • a microphone may be spaced from the sounding object and used to measure said impulse responses.
  • Shape data may be input to represent the shape of the sounding object, from which data said impulse responses are calculated.
  • Said shape data may be derived from the time of arrival of a first sound at each microphone of an array of microphones placed around the sounding object.
  • Said shape data maybe synthesised.
  • the shape of the sounding object maybe traced.
  • Sound processing methods or sound processors as above may provide for manipulating the spatial characteristics of the sounding object prior to embedding the object in a final soundfield.
  • Manipulating the spatial characteristics of the sounding object may include transforming the apparent orientation of the sounding object with respect to a listener.
  • Manipulating the spatial characteristics of the sounding object may include transforming the apparent distance of the sounding object from to a listener.
  • Sound processing methods or sound processors as above may generate a final impulse response to represent the spatial radiation characteristics of the sounding object and apply said final impulse response to a sound source.
  • Sound processing methods and sound processors as above may include any one or more of the features disclosed in this specification.
  • FIGS. 2 to 6 of the accompanying diagrammatic drawings in which:
  • FIG. 2 is a flowchart to illustrate one example of an encoding process in accordance with one example of the invention
  • FIG. 3 shows a non-distance weighted impulse response for a zero order spherical harmonic
  • FIG. 4 shows a non-distance weighted impulse response for a first order spherical harmonic
  • FIG. 5 illustrates an array of microphones for measuring the shape of a sounding object
  • FIG. 6 illustrates use of a microphone, placed far away from a sounding body, to measure an impulse response of correct modified form.
  • the shape of a sounding object is encoded in such a way as to allow easy calculation of the impulse response at the listening point.
  • the shape is decomposed in step 105 into a weighted sum of spherical harmonics, comprising at least the order 0 components and such higher orders as are deemed necessary.
  • the weights are stored individually.
  • the spherical harmonics may take the same names as in ambisonic B format, such that W and X, Y, Z are the order zero harmonic and the three order one harmonics, respectively.
  • Each shape as defined by the individual spherical harmonics is also used to calculate an impulse response for that spherical harmonic, in step 106 .
  • impulse responses are of a modified form where the impulse consists of sums of equally weighted components, so each time point can only take integer values for the size of the impulse at that point.
  • Each point on the shape that has the same delay as another contributes a unit amount to the corresponding time point in the final non-distance weighted impulse response.
  • the length of the impulse response is determined by the overall size of the sounding body.
  • the shape may be synthesised according to the wishes of the user, using any suitable means, such as a Computer Aided Design Package, or by direct input of shape data, as in step 102 .
  • the shape of a real object for instance a piano or an aeroplane, can be traced, as in step 101 .
  • step 107 the process allows the apparent orientation and distance of the sounding object to be varied.
  • step 108 the sounding object is first oriented in the acoustic scene in accordance with its relationship to the listener, for instance by applying rotational transforms such as an angular rotation to the left by an angle of ⁇ from the centre front coupled with a tilt by an angle a from the horizontal which requires the following transformation
  • step 109 a weighted sum of the spherical harmonic coded impulse responses may be produced, corresponding to the non-distance weighted impulse response required for the relationship of the sounding object to the listening position.
  • the form of these non-distance weighted impulse responses is shown in FIG. 3, which displays the zeroeth spherical harmonic and in FIG. 4, which shows one of the first order spherical harmonics.
  • the effects of distance on the amplitude of each impulse can then be applied in step 110 by weighting the value of the impulse at each time point according to the inverse square law, derived by using the formula
  • Ts is the tune of appearance of the first component in the impulse response and Tc that of the current component. This produces the final impulse response, the accuracy of whose match to reality can be chosen, in accordance with the computing power available and the quality of effect desired, by varying the number and maximum order of spherical harmonics used.
  • any sound, recorded or synthesised may be processed by using the impulse response so generated, via means such as convolution in step 111 , so as to apply the appropriate frequency domain corrections such tat it will sound as if it was emitted by the sounding object at the desired distance and orientation from the listening body. Further processing by the already known ambisonic panning processes, or by any other form of sound spatialization, will yield a final image of the desired nature, in step 112 .
  • the surface shape of the object can be determined by normal measurement means and the weighting of the spherical harmonics encoding the shape maybe derived by means of a suitable Fourier series analysis in step 105 , yielding the following formulae for the weights of each spherical harmonic component:
  • the shape of the sounding object may be measured using an array of microphones such as is illustrated in FIG. 5, where the time of arrival of the first sound at each microphone can be used to determine the distance to the nearest point to that microphone.
  • FIG. 6 illustrates a further option of this example of the invention, whereby a microphone, if placed far enough away from the sounding body, may be used to measure an impulse response of the correct modified form, as in step 103 .
  • a microphone if placed far enough away from the sounding body, may be used to measure an impulse response of the correct modified form, as in step 103 .
  • Measurement of a sufficient number of these inpulse responses over an appropriate grid of measurement points enables a spherical harmonic encoded form to be derived in step 104 , via a process of approximation similar to that discussed above.
  • step 113 another similar process of spherical harmonic coding can be used to define the distribution of radiation characteristics across the surface of the sphere. This may be accomplished in step 113 by means such as providing different filtering functions to model bright or dull sounding areas of the surface. This is important in, for instance, modelling speech, where the spectral content of the speech varies, depending on whether the person speaking is facing the listener or not.
  • the use of spherical harmonic encoding for the variations of these filtering functions over the surface of the object means that they may be oriented correctly in step 114 , in a manner similar to that used for the impulse responses, prior to being applied to the sound instep 115 .
  • the apparent size of the object may be varied by varying the length of the impulse response. This may be accomplished either by recalculating the basic impulse response or otherwise. In one example, this is done by placing the impulse response in a look-up table and using computing means to vary the rate at which values are read out. By either discarding unwanted values when the new impulse response is shorter than the original or, in the case where the new impulse response is longer than the original, by calculating new intermediate values, either by interpolation from adjacent values or otherwise, the length of the impulse response and hence the size of the object can be controlled.
  • the effect on the impulse response of the distance between the sounding body and the listener being such that the effect of the distance along the Y-axis becomes significant can be incorporated.
  • the time axis may be warped to model the extra delay imposed by the point's distance from the Y-axis.
  • a typical warping factor is represented by that for the zero order spherical harmonic
  • n is the number of the sample and all points are expressed in terms of multiples of the size of the object.

Landscapes

  • Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Stereophonic System (AREA)
  • Sink And Installation For Waste Water (AREA)
  • Saccharide Compounds (AREA)
  • Massaging Devices (AREA)
  • Transducers For Ultrasonic Waves (AREA)

Abstract

The spatial radiation characteristics of a sounding object are encoded by spherical harmonics. The shape is decomposed (105) into a weighted sum of spherical harmonics, comprising at least the order 0 components and such higher orders as are deemed necessary. The weights are stored individually. Each shape as defined by the individual spherical harmonics is also used to calculate an impulse response for that spherical harmonic (106). These impulse responses are of a modified form where the impulse consists of sums of equally weighted components, so each time point can only take integer values for the size of the impulse at that point. The modified impulse responses are transformed into spherical harmonic form (107), after which the apparent orientation and distance of the sounding object may be varied. Any sound may be processed by using the impulse response so generated (111).

Description

  • This invention relates to sound processing and is concerned particularly although not exclusively with methods and processors for encoding radiation characteristics of sounding bodies. [0001]
  • Systems for recording and reproducing sounds capable of retaining the spatial characteristics of an original soundfield have been known for many years. For instance, the ambisonic surround sound system uses spherical harmonics to encode the direction of sound sources win a three dimensional soundfield. Recently, this form of representation of a soundfield has been extended from the original, four channel, first order version to include second and possible higher order spherical harmonics necessary to attain higher precision and a wider useful audience area. However, even first order, four channel soundfields, recorded from real acoustic scenes using a suitable microphone, capture well the complex extended nature of real sound radiating bodies. On the other hand, even within ambisonic systems, when soundfields have to be synthesised, for instance, when constructing an artificial sound image for a film soundtrack or a computer game, the ability to portray sound sources as extended objects has been limited by available technology. As a result, this portrayal has largely been limited to either idealised point sources or to sources having a very simplified impression of being “larger than a point source”. Typically, this enlargement has, in ambisonic systems, been implemented either by simply exaggerating the non-directional zeroeth order spherical harmonic or by phase shifted based ‘spreader’ controls. In some other systems, for instance Microsoft's DirectSound, the sound source is given a limited directional variability, for instance, having a cone of directions where the sound changes character so as to appear to be facing towards or away from the listener's position. These forms of sounding body synthesis are very limited in their ability to provide realistic sound images especially as, in general, there is little or no provision for the effects of source-listener distance. Proper modelling of radiation characteristics over the whole surface is also important when generating the early reflections for a reverberation unit, since the reflections will, in most cases, not be of the part of the sounding object facing the listener. [0002]
  • On the other hand, it should be noted that within full acoustic simulation systems, the contributions of sounds arriving at the listening position from all points on a sounding object can be calculated by solving the wave equations for each source-listener path or by other suitable means, and this can provide fully realistic sound images. This approach, however, imposes heavy computational loadings on systems, which can be inconvenient when there is restricted available computing power or when realtime operation is desired. [0003]
  • Some improvement may be made by means of a simplified model of the radiation pattern of the object. This may be coded using spherical harmonics in a manner analogous to the coding of soundfields. This allows the object to be rotated so that it maybe oriented correctly to the listening position, but it does not allow the effects of variation of the sound at the listening position with distance to be simulated appropriately. Tis variation is due to changes in the impulse response at the listener's position The impulse response changes with differing distances in two ways. Tis is illustrated in FIG. 1 of the accompanying drawings, which shows impulse responses at points spaced from a sounding object. [0004]
  • In FIG. 1, the impulse response is, for simplicity, shown as being provided by three points, A, B and C on the sounding object (although in reality all points on the surface would contribute) and for two listener positions, P and Q. Both the position of the impulses in time and the differences in their amplitudes change with distance. Note that, as the distance increases between the object and the listener, the extra distance contribution of the displacement away from the origin along the Y axis decreases leading eventually; in the far field, to the situation where only distances along the X axis count. [0005]
  • Preferred embodiments of the present invention aim to provide systems in which further characteristics of a sounding body are encoded using spherical harmonics in such a way as to allow simulation of both the radiation pattern of the sounding body and the effects of source-listener distance. This use of spherical harmonics permits the sounding object to be realistically portrayed without imposing heavy computational loads. [0006]
  • More generally, according to one aspect of the present invention, there is provided a method of sound processing, comprising the step of encoding by spherical harmonics the spatial radiation characteristics of a sounding object. [0007]
  • According to another aspect of the present invention, there is provided a sound processor arranged to encode by spherical harmonics the spatial radiation characteristics of a sounding object. [0008]
  • Said encoding may include generating impulse responses of the sounding object. [0009]
  • Said impulse responses maybe measured or calculated. [0010]
  • A microphone may be spaced from the sounding object and used to measure said impulse responses. [0011]
  • Shape data may be input to represent the shape of the sounding object, from which data said impulse responses are calculated. [0012]
  • Said shape data may be derived from the time of arrival of a first sound at each microphone of an array of microphones placed around the sounding object. [0013]
  • Said shape data maybe synthesised. [0014]
  • The shape of the sounding object maybe traced. [0015]
  • Sound processing methods or sound processors as above may provide for manipulating the spatial characteristics of the sounding object prior to embedding the object in a final soundfield. [0016]
  • Manipulating the spatial characteristics of the sounding object may include transforming the apparent orientation of the sounding object with respect to a listener. [0017]
  • Manipulating the spatial characteristics of the sounding object may include transforming the apparent distance of the sounding object from to a listener. [0018]
  • Sound processing methods or sound processors as above may generate a final impulse response to represent the spatial radiation characteristics of the sounding object and apply said final impulse response to a sound source. [0019]
  • Sound processing methods and sound processors as above may include any one or more of the features disclosed in this specification.[0020]
  • For a better understanding of the invention, and to show how embodiments of the same may be carried into effect, reference will now be made, by way of example, to FIGS. [0021] 2 to 6 of the accompanying diagrammatic drawings, in which:
  • FIG. 2 is a flowchart to illustrate one example of an encoding process in accordance with one example of the invention; [0022]
  • FIG. 3 shows a non-distance weighted impulse response for a zero order spherical harmonic; [0023]
  • FIG. 4 shows a non-distance weighted impulse response for a first order spherical harmonic; [0024]
  • FIG. 5 illustrates an array of microphones for measuring the shape of a sounding object; and [0025]
  • FIG. 6 illustrates use of a microphone, placed far away from a sounding body, to measure an impulse response of correct modified form.[0026]
  • Referring now to FIG. 2, in one example of the invention, the shape of a sounding object is encoded in such a way as to allow easy calculation of the impulse response at the listening point. The shape is decomposed in [0027] step 105 into a weighted sum of spherical harmonics, comprising at least the order 0 components and such higher orders as are deemed necessary. The weights are stored individually. The spherical harmonics may take the same names as in ambisonic B format, such that W and X, Y, Z are the order zero harmonic and the three order one harmonics, respectively. Each shape as defined by the individual spherical harmonics is also used to calculate an impulse response for that spherical harmonic, in step 106. These impulse responses are of a modified form where the impulse consists of sums of equally weighted components, so each time point can only take integer values for the size of the impulse at that point. Each point on the shape that has the same delay as another contributes a unit amount to the corresponding time point in the final non-distance weighted impulse response. The length of the impulse response is determined by the overall size of the sounding body. The shape may be synthesised according to the wishes of the user, using any suitable means, such as a Computer Aided Design Package, or by direct input of shape data, as in step 102. Alternatively, the shape of a real object, for instance a piano or an aeroplane, can be traced, as in step 101.
  • Once the modified impulse responses have been computed, or measured, and transformed into spherical harmonic form in [0028] step 107, which we call ‘O’ format, the process allows the apparent orientation and distance of the sounding object to be varied. In step 108, the sounding object is first oriented in the acoustic scene in accordance with its relationship to the listener, for instance by applying rotational transforms such as an angular rotation to the left by an angle of β from the centre front coupled with a tilt by an angle a from the horizontal which requires the following transformation
  • W′=W
  • X′=X*cos βY*sin β
  • Y′=X*sin β*cos a+Y*cos β*cos a−Z*sin a
  • Z′=X*sin β*sin a+Y*cos β*sin a+Z*cos a
  • where W′, X′, Y′, Z′ form the rotated and tilted spherical harmonics describing the reoriented sounding object Following this transformation, in [0029] step 109, a weighted sum of the spherical harmonic coded impulse responses may be produced, corresponding to the non-distance weighted impulse response required for the relationship of the sounding object to the listening position. The form of these non-distance weighted impulse responses is shown in FIG. 3, which displays the zeroeth spherical harmonic and in FIG. 4, which shows one of the first order spherical harmonics. The effects of distance on the amplitude of each impulse can then be applied in step 110 by weighting the value of the impulse at each time point according to the inverse square law, derived by using the formula
  • (Ts/Tc)2
  • where Ts is the tune of appearance of the first component in the impulse response and Tc that of the current component. This produces the final impulse response, the accuracy of whose match to reality can be chosen, in accordance with the computing power available and the quality of effect desired, by varying the number and maximum order of spherical harmonics used. [0030]
  • Following computation of the final impulse response, any sound, recorded or synthesised, may be processed by using the impulse response so generated, via means such as convolution in [0031] step 111, so as to apply the appropriate frequency domain corrections such tat it will sound as if it was emitted by the sounding object at the desired distance and orientation from the listening body. Further processing by the already known ambisonic panning processes, or by any other form of sound spatialization, will yield a final image of the desired nature, in step 112.
  • It will be understood that the surface shape of the object can be determined by normal measurement means and the weighting of the spherical harmonics encoding the shape maybe derived by means of a suitable Fourier series analysis in [0032] step 105, yielding the following formulae for the weights of each spherical harmonic component: P mn = φ = 0 π θ = 0 2 π f ( θ , φ ) p m m ( θ , φ ) sin φ φ θ , 0 m n Q mn = φ = 0 π θ = 0 2 π f ( θ , φ ) q mn ( θ , φ ) sin φ φ θ , 1 m n
    Figure US20040131196A1-20040708-M00001
  • Since the measurements will, in general be taken on a discrete grid of N points, we may approximate this using a formula such as: [0033] φ = 0 π θ = 0 2 π f ( θ , φ ) S m n ( θ , φ ) sin φ φ θ i = 1 N f ( θ i , φ i ) S mn ( θ i , φ i )
    Figure US20040131196A1-20040708-M00002
  • Other forms of approximation may be adopted appropriate to the distribution of convenient measurement points. The shape of the sounding object may be measured using an array of microphones such as is illustrated in FIG. 5, where the time of arrival of the first sound at each microphone can be used to determine the distance to the nearest point to that microphone. [0034]
  • FIG. 6 illustrates a further option of this example of the invention, whereby a microphone, if placed far enough away from the sounding body, may be used to measure an impulse response of the correct modified form, as in [0035] step 103. This results when the angels subtended by all points on the surface away from the microphone's axis are so small that there is an insignificant extra time difference between points on the microphone axis and those off it. Measurement of a sufficient number of these inpulse responses over an appropriate grid of measurement points enables a spherical harmonic encoded form to be derived in step 104, via a process of approximation similar to that discussed above.
  • In another option of this exile of the invention, another similar process of spherical harmonic coding can be used to define the distribution of radiation characteristics across the surface of the sphere. This may be accomplished in [0036] step 113 by means such as providing different filtering functions to model bright or dull sounding areas of the surface. This is important in, for instance, modelling speech, where the spectral content of the speech varies, depending on whether the person speaking is facing the listener or not. The use of spherical harmonic encoding for the variations of these filtering functions over the surface of the object means that they may be oriented correctly in step 114, in a manner similar to that used for the impulse responses, prior to being applied to the sound instep 115.
  • In a further option of this example of the invention, the apparent size of the object may be varied by varying the length of the impulse response. This may be accomplished either by recalculating the basic impulse response or otherwise. In one example, this is done by placing the impulse response in a look-up table and using computing means to vary the rate at which values are read out. By either discarding unwanted values when the new impulse response is shorter than the original or, in the case where the new impulse response is longer than the original, by calculating new intermediate values, either by interpolation from adjacent values or otherwise, the length of the impulse response and hence the size of the object can be controlled. [0037]
  • By a similar means, the effect on the impulse response of the distance between the sounding body and the listener being such that the effect of the distance along the Y-axis becomes significant can be incorporated. In this case, the time axis may be warped to model the extra delay imposed by the point's distance from the Y-axis. A typical warping factor is represented by that for the zero order spherical harmonic[0038]
  • {square root}{square root over ((sin(cos−1(n)))2+(1−n)2)}
  • where n is the number of the sample and all points are expressed in terms of multiples of the size of the object. By a similar means, or otherwise, the effect of sound diffusion from areas of the sounding object facing away from the listener or otherwise obstructed from having a direct path to the listening position may be modelled, such that sounds of some wavelengths are delayed more than others, as is well known from the study of acoustics. [0039]
  • The above-described and illustrated examples of the invention enable the construction of more realistic sound objects for use within synthesised ambisonic soundfields, whilst maintaining the simplicity and ease of use of ambisonics. [0040]
  • The above-described and illustrated examples of using spherical harmonics allow sound objects to be manipulated spatially at low computational cost, with processing effects such as rotation, tilt, tumbling, etc., prior to embedding the sound object in a final soundfield. After embedding, only normal manipulations of the soundfield as a whole would normally be possible. The order of the format of the sound object prior to embedding does not have to match that of the soundfield it is eventually embedded in, since it may be passed through a matrix akin to that used for speaker decoding prior to being added, and only the output of the matrix need be of matching order. Tis means that high order descriptions of sound objects can be embedded in standard low order soundfields, allowing very rich acoustic behavior to be implemented without necessarily impacting on the final channel numbers and hence the storage required. [0041]
  • In this specification, the verb “comprise” has its normal dictionary meaning, to denote non-exclusive inclusion. That is, use of the word “comprise” (or any of its derivatives) to include one feature or more, does not exclude the possibility of also including further features. [0042]
  • All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and/or all of the steps of any method or process so disclosed, may be combined in any combination, except combinations where at least some of such features and/or steps are mutually exclusive. [0043]
  • Each feature disclosed in this specification (including any accompanying claims, abstract and drawings), may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise. Thus, unless expressly stated otherwise, each feature disclosed is one example only of a generic series of equivalent or similar features. [0044]
  • The invention is not restricted to the details of the foregoing embodiment(s). The invention extends to any novel one, or any novel combination, of the features disclosed in this specification (including any accompanying claims, abstract and drawings), or to any novel one, or any novel combination, of the steps of any method or process so disclosed. [0045]

Claims (17)

1. A method of sound processing, comprising the step of encoding by spherical harmonics the spatial radiation characteristics of a sounding object.
2. A method according to claim 1, wherein said encoding step includes generating impulse responses of the sounding object.
3. A method according to claim 2, wherein said impulse responses are measured.
4. A method according to claim 3, wherein a microphone is spaced from the sounding object and used to measure said impulse responses.
5. A method according to claim 2, wherein said impulse responses are calculated.
6. A method according to claim 5, including the step of inputting shape data representing the shape of the sounding object, from which data said impulse responses are calculated.
7. A method according to claim 6, including the step of deriving said shape data from the time of arrival of a first sound at each microphone of an array of microphones placed around the sounding object.
8. A method according to claim 6, including the step of synthesising said shape data.
9. A method according to claim 6, including the step of tracing the shape of the sounding object.
10. A method according to any of the preceding claims, including the step of manipulating the spatial characteristics of the sounding object prior to embedding the object in a final soundfield.
11. A method according to claim 10, wherein said step of manipulating the spatial characteristics of the sounding object includes transforming the apparent orientation of the sounding object with respect to a listener.
12. A method according to claim 10 or 11, wherein said step of manipulating the spatial characteristics of the sounding object includes transforming the apparent distance of the sounding object from to a listener.
13. A method according to any of the preceding claims, including the step of generating a final impulse response to represent the spatial radiation characteristics of the sounding object and applying said final impulse response to a sound source.
14. A method of sound processing, the method being substantially as hereinbefore described with reference to the accompanying drawings.
15. A sound processor arranged to encode by spherical harmonics the spatial radiation characteristics of a sounding object.
16. A sound processor according to claim 14 and arranged to carry out a method according to any of claims 1 to 14.
17. A sound processor substantially as hereinbefore described with reference to the accompanying drawings.
US10/475,282 2001-04-18 2002-04-18 Sound processing Abandoned US20040131196A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
GB0109498A GB2379147B (en) 2001-04-18 2001-04-18 Sound processing
GB0109498.6 2001-04-18
PCT/GB2002/001719 WO2002085068A2 (en) 2001-04-18 2002-04-18 Sound processing

Publications (1)

Publication Number Publication Date
US20040131196A1 true US20040131196A1 (en) 2004-07-08

Family

ID=9912972

Family Applications (1)

Application Number Title Priority Date Filing Date
US10/475,282 Abandoned US20040131196A1 (en) 2001-04-18 2002-04-18 Sound processing

Country Status (6)

Country Link
US (1) US20040131196A1 (en)
EP (1) EP1380189B1 (en)
AT (1) ATE276637T1 (en)
DE (1) DE60201267T2 (en)
GB (1) GB2379147B (en)
WO (1) WO2002085068A2 (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140358557A1 (en) * 2013-05-29 2014-12-04 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US20150154971A1 (en) * 2012-07-16 2015-06-04 Thomson Licensing Method and apparatus for encoding multi-channel hoa audio signals for noise reduction, and method and apparatus for decoding multi-channel hoa audio signals for noise reduction
US9489955B2 (en) 2014-01-30 2016-11-08 Qualcomm Incorporated Indicating frame parameter reusability for coding vectors
US9495968B2 (en) 2013-05-29 2016-11-15 Qualcomm Incorporated Identifying sources from which higher order ambisonic audio data is generated
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US9641834B2 (en) 2013-03-29 2017-05-02 Qualcomm Incorporated RTP payload format designs
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
CN112005556A (en) * 2018-02-22 2020-11-27 诺莫诺股份有限公司 Positioning sound source

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2476747B (en) 2009-02-04 2011-12-21 Richard Furse Sound system
ITUD20090183A1 (en) * 2009-10-14 2011-04-15 Advanced Ind Design In Acous Tic Srl PROBE FOR VECTOR UNDERWATER MEASUREMENTS OF SOUND FIELD

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5555306A (en) * 1991-04-04 1996-09-10 Trifield Productions Limited Audio signal processor providing simulated source distance control
US6713539B2 (en) * 2000-08-18 2004-03-30 China Petroleum & Chemical Corporation Storage-stable modified asphalt composition and its preparation process
US6904152B1 (en) * 1997-09-24 2005-06-07 Sonic Solutions Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics in three dimensions

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB9211756D0 (en) * 1992-06-03 1992-07-15 Gerzon Michael A Stereophonic directional dispersion method
GB2352152B (en) * 1998-03-31 2003-03-26 Lake Technology Ltd Formulation of complex room impulse responses from 3-D audio information
GB2343347B (en) * 1998-06-20 2002-12-31 Central Research Lab Ltd A method of synthesising an audio signal
WO2000019415A2 (en) * 1998-09-25 2000-04-06 Creative Technology Ltd. Method and apparatus for three-dimensional audio display
EP1158486A1 (en) * 2000-05-18 2001-11-28 TC Electronic A/S Method of processing a signal
US6239348B1 (en) * 1999-09-10 2001-05-29 Randall B. Metcalf Sound system and method for creating a sound event based on a modeled sound field
WO2001082651A1 (en) * 2000-04-19 2001-11-01 Sonic Solutions Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics in three dimensions

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5555306A (en) * 1991-04-04 1996-09-10 Trifield Productions Limited Audio signal processor providing simulated source distance control
US6904152B1 (en) * 1997-09-24 2005-06-07 Sonic Solutions Multi-channel surround sound mastering and reproduction techniques that preserve spatial harmonics in three dimensions
US6713539B2 (en) * 2000-08-18 2004-03-30 China Petroleum & Chemical Corporation Storage-stable modified asphalt composition and its preparation process

Cited By (37)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI602444B (en) * 2012-07-16 2017-10-11 杜比國際公司 Method and apparatus for encoding multi-channel hoa audio signals for noise reduction, and method and apparatus for decoding multi-channel hoa audio signals for noise reduction
US20150154971A1 (en) * 2012-07-16 2015-06-04 Thomson Licensing Method and apparatus for encoding multi-channel hoa audio signals for noise reduction, and method and apparatus for decoding multi-channel hoa audio signals for noise reduction
US9460728B2 (en) * 2012-07-16 2016-10-04 Dolby Laboratories Licensing Corporation Method and apparatus for encoding multi-channel HOA audio signals for noise reduction, and method and apparatus for decoding multi-channel HOA audio signals for noise reduction
TWI691214B (en) * 2012-07-16 2020-04-11 瑞典商杜比國際公司 Method and apparatus for decoding higher order ambisonics (hoa) audio signals and computer readable medium thereof
US10614821B2 (en) 2012-07-16 2020-04-07 Dolby Laboratories Licensing Corporation Methods and apparatus for encoding and decoding multi-channel HOA audio signals
TWI674009B (en) * 2012-07-16 2019-10-01 杜比國際公司 Method and apparatus for decoding encoded hoa audio signals
US10304469B2 (en) 2012-07-16 2019-05-28 Dolby Laboratories Licensing Corporation Methods and apparatus for encoding and decoding multi-channel HOA audio signals
US9837087B2 (en) 2012-07-16 2017-12-05 Dolby Laboratories Licensing Corporation Method and apparatus for encoding multi-channel HOA audio signals for noise reduction, and method and apparatus for decoding multi-channel HOA audio signals for noise reduction
US9641834B2 (en) 2013-03-29 2017-05-02 Qualcomm Incorporated RTP payload format designs
US9749768B2 (en) 2013-05-29 2017-08-29 Qualcomm Incorporated Extracting decomposed representations of a sound field based on a first configuration mode
US9774977B2 (en) 2013-05-29 2017-09-26 Qualcomm Incorporated Extracting decomposed representations of a sound field based on a second configuration mode
US9716959B2 (en) 2013-05-29 2017-07-25 Qualcomm Incorporated Compensating for error in decomposed representations of sound fields
US11146903B2 (en) 2013-05-29 2021-10-12 Qualcomm Incorporated Compression of decomposed representations of a sound field
US9466305B2 (en) * 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US20140358557A1 (en) * 2013-05-29 2014-12-04 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US11962990B2 (en) 2013-05-29 2024-04-16 Qualcomm Incorporated Reordering of foreground audio objects in the ambisonics domain
US9763019B2 (en) 2013-05-29 2017-09-12 Qualcomm Incorporated Analysis of decomposed representations of a sound field
US9883312B2 (en) 2013-05-29 2018-01-30 Qualcomm Incorporated Transformed higher order ambisonics audio data
US9769586B2 (en) 2013-05-29 2017-09-19 Qualcomm Incorporated Performing order reduction with respect to higher order ambisonic coefficients
US10499176B2 (en) 2013-05-29 2019-12-03 Qualcomm Incorporated Identifying codebooks to use when coding spatial components of a sound field
US9495968B2 (en) 2013-05-29 2016-11-15 Qualcomm Incorporated Identifying sources from which higher order ambisonic audio data is generated
US9502044B2 (en) 2013-05-29 2016-11-22 Qualcomm Incorporated Compression of decomposed representations of a sound field
US9854377B2 (en) 2013-05-29 2017-12-26 Qualcomm Incorporated Interpolation for decomposed representations of a sound field
US9980074B2 (en) 2013-05-29 2018-05-22 Qualcomm Incorporated Quantization step sizes for compression of spatial components of a sound field
US9489955B2 (en) 2014-01-30 2016-11-08 Qualcomm Incorporated Indicating frame parameter reusability for coding vectors
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US9502045B2 (en) 2014-01-30 2016-11-22 Qualcomm Incorporated Coding independent frames of ambient higher-order ambisonic coefficients
US9754600B2 (en) 2014-01-30 2017-09-05 Qualcomm Incorporated Reuse of index of huffman codebook for coding vectors
US9747912B2 (en) 2014-01-30 2017-08-29 Qualcomm Incorporated Reuse of syntax element indicating quantization mode used in compressing vectors
US9747911B2 (en) 2014-01-30 2017-08-29 Qualcomm Incorporated Reuse of syntax element indicating vector quantization codebook used in compressing vectors
US9653086B2 (en) 2014-01-30 2017-05-16 Qualcomm Incorporated Coding numbers of code vectors for independent frames of higher-order ambisonic coefficients
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
CN112005556A (en) * 2018-02-22 2020-11-27 诺莫诺股份有限公司 Positioning sound source
US11388512B2 (en) * 2018-02-22 2022-07-12 Nomono As Positioning sound sources

Also Published As

Publication number Publication date
EP1380189B1 (en) 2004-09-15
GB0109498D0 (en) 2001-06-06
GB2379147B (en) 2003-10-22
DE60201267T2 (en) 2005-09-22
DE60201267D1 (en) 2004-10-21
ATE276637T1 (en) 2004-10-15
WO2002085068A2 (en) 2002-10-24
WO2002085068A3 (en) 2003-04-24
WO2002085068A9 (en) 2002-11-28
GB2379147A (en) 2003-02-26
EP1380189A2 (en) 2004-01-14

Similar Documents

Publication Publication Date Title
Zotter Analysis and synthesis of sound-radiation with spherical arrays
Mehra et al. Source and listener directivity for interactive wave-based sound propagation
US20040131196A1 (en) Sound processing
US5452360A (en) Sound field control device and method for controlling a sound field
US20150326966A1 (en) Methods, systems, and computer readable media for source and listener directivity for interactive wave-based sound propagation
US20090310802A1 (en) Virtual sound source positioning
US10911885B1 (en) Augmented reality virtual audio source enhancement
JPH08272380A (en) Method and device for reproducing virtual three-dimensional spatial sound
Ahrens et al. Computation of spherical harmonic representations of source directivity based on the finite-distance signature
Ackermann et al. Comparative evaluation of interpolation methods for the directivity of musical instruments
Bodon Development, evaluation, and validation of a high-resolution directivity measurement system for played musical instruments
Mechel Room acoustical fields
Rungta et al. Syncopation: Interactive synthesis-coupled sound propagation
Pelzer et al. Auralization of a virtual orchestra using directivities of measured symphonic instruments
EP4205103B1 (en) Method for generating a reverberation audio signal
Ackermann et al. Musical instruments as dynamic sound sources
Pérez Carrillo et al. Method for measuring violin sound radiation based on bowed glissandi and its application to sound synthesis
Svensson Modelling acoustic spaces for audio virtual reality
Maestre et al. State-space modeling of sound source directivity: An experimental study of the violin and the clarinet
Malham Spherical harmonic coding of sound objects-the ambisonic'O'format
Kronland-Martinet et al. Real-time perceptual simulation of moving sources: application to the Leslie cabinet and 3D sound immersion
Plessas Rigid sphere microphone arrays for spatial recording and holography
US20090110202A1 (en) Method of processing a signal
Melchior Investigations on spatial sound design based on measured room impulse responses
Scavone Modeling wind instrument sound radiation using digital waveguides

Legal Events

Date Code Title Description
AS Assignment

Owner name: YORK, UNIVERSITY OF, UNITED KINGDOM

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MALHAM, DAVID GEORGE;REEL/FRAME:014976/0696

Effective date: 20031016

Owner name: YORK, UNIVERSITY OF, UNITED KINGDOM

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:DEMEESTER, JOSEPH;DE SMEDT, STEFAAN;SANDERS, NIEK;REEL/FRAME:014976/0674;SIGNING DATES FROM 20031118 TO 20031205

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION