(19)
(11)EP 2 457 384 B1

(12)EUROPEAN PATENT SPECIFICATION

(45)Mention of the grant of the patent:
09.09.2020 Bulletin 2020/37

(21)Application number: 10745004.1

(22)Date of filing:  22.07.2010
(51)International Patent Classification (IPC): 
H04R 3/00(2006.01)
(86)International application number:
PCT/IB2010/053335
(87)International publication number:
WO 2011/010292 (27.01.2011 Gazette  2011/04)

(54)

AUDIO BEAMFORMING

AUDIO-STRAHLFORMUNG

FORMATION DE FAISCEAU AUDIO


(84)Designated Contracting States:
AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK SM TR

(30)Priority: 24.07.2009 EP 09166297

(43)Date of publication of application:
30.05.2012 Bulletin 2012/22

(73)Proprietor: MediaTek Inc.
Hsin-Chu 300 (TW)

(72)Inventor:
  • DERKX, Rene, Martinus, Maria
    NL-5656 AE Eindhoven (NL)

(74)Representative: Hoefer & Partner Patentanwälte mbB 
Pilgersheimer Straße 20
81543 München
81543 München (DE)


(56)References cited: : 
EP-A2- 1 455 552
US-A1- 2006 140 417
WO-A1-2009/034524
  
  • DERKX, R.: "Adaptive Azimuthal Null-Steering for a First-order Microphone Response", PROCEEDINGS OF THE 12TH INTERNATIONAL WORKSHOP ON ACOUSTIC ECHO AND NOISE CONTROL, 30 August 2010 (2010-08-30), - 2 September 2010 (2010-09-02), XP002608523, Tel-Aviv, Israel
  • DERKX, R.: "Optimal Azimuthal Steering of a First-order Superdirectional Microphone Response", PROCEEDINGS OF THE 11TH INTERNATIONAL WORKSHOP ON ACOUSTIC ECHO AND NOISE CONTROL, 14 September 2008 (2008-09-14), - 17 September 2008 (2008-09-17), XP002608524, Seattle, WA, USA
  • NAGATA N., ABE M.: "Two-Channel Adaptive Microphone Array with Target Tracking", ELECTRONICS AND COMMUNICATIONS IN JAPAN, PART 3, vol. 83, no. 12, 2000, pages 19-24, XP002608525,
  
Note: Within nine months from the publication of the mention of the grant of the European patent, any person may give notice to the European Patent Office of opposition to the European patent granted. Notice of opposition shall be filed in a written reasoned statement. It shall not be deemed to have been filed until the opposition fee has been paid. (Art. 99(1) European Patent Convention).


Description

FIELD OF THE INVENTION



[0001] The invention relates to audio beamforming and in particular, but not exclusively, to audio beamforming using microphone arrays substantially smaller than the wavelength of the audio signals being beamformed.

BACKGROUND OF THE INVENTION



[0002] Advanced processing of audio signals has become increasingly important in many areas including e.g. telecommunication, content distribution etc. For example, in some applications, such as hands-free communication and voice control systems, complex processing of inputs from a plurality of microphones has been used to provide a configurable directional sensitivity for a microphone array comprising the microphones. Specifically, the processing of signals from a microphone array can generate an audio beam with a direction that can be changed simply by changing the characteristics of the combination of the individual microphone signals.

[0003] Typically, beam form algorithms seek to attenuate interferers while providing a high gain for a desired sound source. For example, a beamforming algorithm can be controlled to provide a strong attenuation (preferably a null) in the direction of a signal received from a main interferer.

[0004] For practical reasons it is desirable that the microphone array is relatively small. However, when the wavelength of the sound of interest is much larger than the size of the array, many beamforming algorithms, such as additive delay-and-sum beamforming algorithms, are not able to provide sufficient directivity as the beamwidth deteriorates substantially for such wavelengths.

[0005] One approach for achieving an improved directivity is to apply so called superdirective beamforming techniques. Such superdirective beamforming techniques are based on filters with asymmetrical filter coefficients and the approach essentially corresponds to subtraction of signals or determining spatial derivatives of the sound pressure field. However, although this may improve the directivity, it is also known that this is achieved at the expense of robustness, such as increased sensitivity to white (sensor) noise and an increased sensitivity to mismatches in microphones characteristics.

[0006] In the article "Optimal Azimuthal Steering of a First-order Superdirectional Microphone Response" by R.M.M. Derkx, International Workshop on Acoustic Echo and Noise Control, September 2008, Seattle, a system is analyzed which generates Eigenbeams for a two dimensional microphone array. The Eigenbeams are then combined to maximize the attenuation of a single point interference source. In particular, a null is located in the direction of a single point interferer while maintaining a suitable gain for the desired direction.

[0007] However, although this approach provides improved performance in many scenarios, it provides non optimal performance in some practical scenarios. It also tends to require relatively complex and resource demanding processing.

[0008] Hence, an improved approach for audio beamforming would be advantageous and in particular an approach allowing improved adaptation to current conditions and audio environment, increased flexibility, facilitated implementation, improved performance for different operating scenarios and/or improved performance would be advantageous.

SUMMARY OF THE INVENTION



[0009] Accordingly, the Invention seeks to preferably mitigate, alleviate or eliminate one or more of the above mentioned disadvantages singly or in any combination.

[0010] According to an aspect of the invention there is provided an audio beamforming apparatus in accordance with claim 1.

[0011] The invention may allow improved performance. In particular, an improved and/or facilitated adaptation to a current audio environment can be achieved. The invention may allow a beamforming approach which provides high performance for both directional point interference cancellation and for diffuse noise attenuation. The approach is particularly suitable for, and may provide particularly advantageous performance for, systems wherein the wavelength of the audio signals may be substantially larger than the size of the microphone array.

[0012] The invention may allow low complexity implementation and/or operation. The approach may be suitable for providing improved directivity and may in particular be suitable for scenarios wherein the size of the microphone array is much smaller than a wavelength of interest.

[0013] In many embodiments and scenarios, the approach may allow a null to be directed towards a single point interference while substantially reducing diffuse noise. In particular, the approach may in many scenarios allow a reduction of a single point interference corresponding to or better than many prior art interference reduction techniques, while at the same time providing improved diffuse noise.

[0014] The approach may in many scenarios allow a low complexity yet highly efficient and advantageous beam steering based on low complexity parallel local minima extraction. In many embodiments, the approach may ensure that at least one of the identified local minima is also a global minimum and thus may allow an efficient estimation of the angle of interference.

[0015] The reference beams may be non-adaptive and may be independent of the captured signals and/or the audio conditions. The reference beams may be constant and may be generated by a constant/ non-adaptive combination of the signals from the at least three microphones. The reference beams may specifically be Eigenbeams or orthogonal beams.

[0016] The first angle interval and the second angle interval may be disjoint intervals and may be adjacent intervals. The first and second angle intervals may together cover the entire 360° interval.

[0017] The interfering sound source may be an assumed interfering sound source. A direction estimate for a sound source may be generated independently of whether the sound source is present or not. Thus, even if no interfering point source is detected, the estimation circuit may generate the direction estimate from the microphone signals under the assumption that an interfering sound source is present.

[0018] In accordance with an optional feature of the invention, the estimation circuit is arranged to select the direction estimate as one of the first angle and the second angle in response to a gradient of a power measure of the output signal as a function of the direction estimate for an angle separating the first angle interval and the second angle interval.

[0019] This may provide a particularly efficient and low complexity determination of the direction estimate. The angle may be any angle between the first angle interval and the second angle interval including the end points of one or both of the angle intervals.

[0020] In accordance with an optional feature of the invention, the first angle interval comprises angles from 0 to π and the second angle interval comprises angles from π to 2π.

[0021] This may provide particularly advantageous performance and may in particular allow adaptation for all possible directions of the interfering sound source.

[0022] In accordance with an optional feature of the invention, the estimation circuit is arranged to select the direction estimate as one of the first angle and the second angle in response to a gradient of a power measure of the output signal as a function of the direction estimate for an angle of π.

[0023] This may provide a particularly efficient and low complexity determination of the direction estimate.

[0024] In accordance with an optional feature of the invention, the combining circuit comprises a sidelobe canceller.

[0025] This may provide particularly advantageous performance and/or practical implementation.

[0026] In accordance with an optional feature of the invention, the sidelobe canceller is arranged to generate the output signal as a weighted combination of at least a primary signal, a first noise reference signal and a second noise reference signal.

[0027] This may provide particularly advantageous performance and/or practical implementation. The primary signal may correspond to a beam adapted in the direction of the desired sound source and each of the reference signals may correspond to beams adapted to cancel/ reduce noise. The noise reference signals may specifically have notches in the direction of the desired sound source.

[0028] In accordance with an optional feature of the invention, the combining circuit is arranged to calculate weights for the first and second noise reference signals in response to the direction estimate and a maximization of the directivity cost measure.

[0029] This may provide a particularly advantageous performance and/or low complexity implementation. In particular, the weights may be determined as a function of the direction estimate wherein the function is selected to maximize the directivity cost measure.

[0030] In accordance with an optional feature of the invention, the estimation circuit is arranged to determine at least one of the first and second angles by a gradient search applied to a sidelobe canceller corresponding to the sidelobe canceller of the combining circuit and having an angle input variable.

[0031] This may provide a particularly advantageous performance and/or low complexity implementation. In particular, a gradient search may provide a highly efficient approach for identifying potential minima that may optimize the beamforming operation. An efficient and low complexity adaptation of the beamforming may be achieved which can reduce both diffuse noise and reduce/cancel a single point interference.

[0032] In many embodiments both the first and single angle are determined by a gradient search. The gradient search may be performed using a sidelobe canceller operation which is identical to the sidelobe canceller operation used to generate the output signal but with a value of the angle input variable that may be different than the phase value (the direction estimate) used to generate the output signal (thus which can be varied independently).

[0033] In some embodiments, a gradient search may be applied in parallel in the two angle intervals using parallel sidelobe canceller operations with independent angle input variables. The output signal of the combining circuit may be selected as the signal of the parallel sidelobe canceller corresponding to the selected angle of the first and second angles.

[0034] In some embodiments, a sidelobe canceller corresponding to the sidelobe canceller of the combining circuit may be used to determine a gradient of a power measure of the output signal for a given angle (specifically π) and the selection between the first and second angle may be in response to the gradient.

[0035] In accordance with an optional feature of the invention, an update value for the angle input variable is determined as a function of an output signal of the sidelobe canceller for a current phase value of the angle input variable, and a first and second noise reference signal of the sidelobe canceller for the current phase value.

[0036] This may provide particularly advantageous performance and/or facilitated implementation and/or operation.

[0037] In accordance with an optional feature of the invention, the first and second noise reference signals are weighted as a function of the current phase value.

[0038] This may provide particularly advantageous performance and/or facilitated implementation or operation.

[0039] In accordance with an optional feature of the invention, the estimation circuit is arranged to determine a power estimate for at least one of the first and second noise reference signals and to perform a normalization of the update value as a function of the power estimate.

[0040] This may provide particularly advantageous performance and/or facilitated implementation and/or operation.

[0041] In accordance with an optional feature of the invention, the at least two-dimensional microphone array comprises at least four microphones and the apparatus comprises a circuit for combining signals from at least two of the at least four microphones prior to generating the reference beams.

[0042] This may provide particularly advantageous performance and/or facilitated implementation and/or operation. In particular, it may provide improved noise performance in many scenarios.

[0043] In accordance with an optional feature of the invention, the apparatus of further comprises the at least two-dimensional microphone array, the at least two-dimensional microphone array comprising directional microphones having a maximum response in a direction outwardly of a perimeter of the at least two-dimensional microphone array.

[0044] This may provide particularly advantageous performance and/or facilitated implementation and/or operation.

[0045] According to an aspect of the invention there is provided a method of audio beamforming in accordance with claim 14.

[0046] These and other aspects, features and advantages of the invention will be apparent from and elucidated with reference to the embodiment(s) described hereinafter.

BRIEF DESCRIPTION OF THE DRAWINGS



[0047] Embodiments of the invention will be described, by way of example only, with reference to the drawings, in which

Fig. 1 illustrates an example of a system for capturing audio with an adaptable directional characteristic in accordance with some embodiments of the invention;

Fig. 2 illustrates an example of a microphone configuration for a microphone array;

Fig. 3 illustrates an example of Eigenbeams generated by the system of Fig. 1;

Fig. 4 illustrates an example of a sidelobe canceller used in the system of Fig. 1;

Fig. 5 illustrates an example of a cost function for adapting the system of Fig. 1;

Fig. 6 illustrates an example of local minima for the cost function of Fig. 5; and

Fig. 7 illustrates an example of local maxima for the cost function of Fig. 5

Fig. 8 illustrates an example of a method for capturing audio with an adaptable directional characteristic in accordance with some embodiments of the invention.


DETAILED DESCRIPTION OF SOME EMBODIMENTS OF THE INVENTION



[0048] Fig. 1 illustrates an example of a system for capturing audio with an adaptable directional characteristic. The system processes signals from a plurality of microphones to generate a suitable desired beam pattern. The processing is specifically adapted such that the generated output signal has substantially improved noise and interference characteristics. The system provides for a joint improvement in both single point interference and diffuse noise performance. The system is furthermore suitable for use in scenarios wherein the wavelength of the signals is substantially longer than the dimensions of the microphone array, i.e. than the distances between the microphones.

[0049] The system processes the received microphone signals to generate a set of constant non-adaptable reference beams. These reference beams are then adaptively combined to generate a desired beam pattern. The combination is adapted such that the resulting beam form is adapted to cancel or substantially attenuate an assumed single point interference source while at the same time minimizing or reducing the impact of diffuse noise.

[0050] The system provides an efficient adaptive beamforming where the main lobe can be steered towards the direction of a desired sound source while adapting the directional pattern such that a point interferer from another angle is effectively rejected and a substantially optimal rejection of diffuse (isotropic) noise is achieved. The system of Fig. 1 specifically includes an adaptive null-steering scheme with multiple gradient-estimates for adjusting the directivity pattern in such a way that this effective rejection of noise and interference can be achieved automatically.

[0051] The system of Fig. 1 comprises a microphone array 101 which is a two-dimensional microphone array. The microphone array 101 comprises at least three microphones which are not arranged in a single one dimensional line. In most embodiments, the shortest distance from one microphone to a line going through two other microphones is at least a fifth of the distance between these two microphones.

[0052] In the specific example, the microphone array 101 comprises three microphones which are spaced uniformly on a circle as illustrated in Fig. 2.

[0053] Thus, in the example a circular array of at least three (omni- or uni-directional) sensors in a planar geometry is used. It will be appreciated that in other embodiments, other arrangements of the microphones may be used. It will also be appreciated that for embodiments wherein more than three microphones are used, these may possibly be arranged in a non-planar geometry, i.e. the microphone array may be a three dimensional microphone array. However, the following description will focus on a three microphone equidistant circular array arranged in the azimuth plane.

[0054] The microphone array 101 is coupled to a receiving circuit 103 which receives the microphone signals. In the example of Fig. 1, the receiving circuit 103 is arranged to amplify, filter and digitize the microphone signals as is well known to the skilled person.

[0055] The receiving circuit 103 is coupled to a reference processor 105 which is arranged to generate at least three reference beams from the microphone signals. The reference beams are constant beams that are not adapted but are generated by a fixed combination of the digitized microphone signals from the receiving circuit 103. In the example of Fig. 1, three orthogonal Eigenbeams are generated by the reference processor 105.

[0056] In the example, the three microphones of the microphone array are directional microphones and are specifically uni-directional cardioid microphones which are arranged such that the main gain is pointing outwardly from the perimeter formed by joining the positions of the microphones (and thus outwardly of the circle of the circular array in the specific example). The use of uni-directional cardioid microphones provides an advantage in that the sensitivity to sensor noise and sensor-mismatches is greatly reduced. However, it will be appreciated that in other scenarios other microphone types may be used, such as omnidirectional microphones.

[0057] Denoting the responses of the three cardioid microphones as respectively



and

and ignoring any uncorrelated sensor-noise, the i'th cardioid microphone response is ideally given by:

with

where θ and Φ are the standard spherical coordinate angles: elevation and azimuth, c is the speed of sound and xi and yi are the x and y coordinates of the i'th microphone.

[0058] Using:

and

with r the radius of the circle we can write:



[0059] From the three cardioid microphones, the three orthogonal Eigenbeams can be determined from:



[0060] For wavelengths larger than the size of the array, the responses of the Eigenbeams are frequency invariant and ideally equal to:







[0061] The directivity patterns of these Eigenbeams are illustrated in Fig. 3.

[0062] The zero'th-order Eigenbeam Em represents the monopole response corresponding to a sphere whereas the other Eigenbeams represent first order Eigenbeams corresponding to double spheres as illustrated in Fig. 3. Thus, the two first order Eigenbeams are orthogonal dipoles.

[0063] The resulting signals from each of the three Eigenbeams are fed to a beamform circuit 107 which proceeds to adaptively combine these signals to provide a desired beam pattern.

[0064] Specifically, by suitable combining the first order Eigenbeams, a dipole can be steered to any angle ϕs. E.g. a weighted summation of the orthogonal diagonals can be generated:

where ϕs represents the desired angle for the resulting dipole.

[0065] The steered and scaled superdirectional microphone response can then be constructed by combining the steered dipole with the monopole, e.g. as:

where α ≤ 1 is a parameter for controlling the directional pattern of the first-order response and S is an arbitrary scaling factor (that can also have negative values).

[0066] Thus, the beamform circuit 107 can generate a suitable beam pattern by a suitable combination of the reference Eigenbeams. The beamform circuit 107 is arranged to generate a nominal (e.g. unity) gain in the direction of a desired speaker coming from an arbitrary azimuthal angle Φ=ϕs. The direction of the desired speaker is assumed to be known by the beamform circuit 107. It will be appreciated that any suitable way of determining a desired direction may be used without detracting from the invention. For example, a fixed direction may be used or e.g. a tracking algorithm for a desired speaker or sound source may be used. It will be appreciated that many different algorithms for determining a desired sound source direction will be known to the skilled person.

[0067] The beamform circuit 107 is furthermore arranged to adapt the beam such that the sensitivity to diffuse noise is minimized and a notch is generated in an estimated direction of an assumed interfering point source. The system of Fig. 1 is specifically arranged to adapt the combination of the reference Eigenbeams such that the nominal gain is provided in the desired direction, a notch is generated in the direction estimated to correspond to a point source interference and with a minimization of the diffuse noise under these constraints. This is achieved by a highly efficient adaptation algorithm which will be described in the following.

[0068] The beamform circuit 107 is specifically coupled to an estimation circuit 109 which determines an estimate for the direction to an assumed point source interference. Based on the estimated direction, the beamform circuit 107 generates combination parameters for the combination of the Eigenbeams such that a notch (typically a null) is generated in the estimated direction. However, the combination of three Eigenbeams provides sufficient degrees of freedom to allow a range of solutions to the constraint of providing a nominal gain in a desired direction and a notch in an interference direction. In the system, this additional degree of freedom is used to improve the diffuse noise performance. This is specifically achieved by the combination parameters being selected to maximize a directivity cost measure where the directivity cost measure is indicative of a ratio between a power/energy gain in the first direction and an average power/energy gain. Specifically, the directivity cost measure may be indicative of the gain in the desired direction relative to an average gain of the resulting beam where the averaging is over all angles in the azimuth plane (i.e. from 0-2π) or from all directions in the three dimensions. Thus, the directivity cost measure is a function which indicates the attenuation of homogenous spatially diffuse noise (i.e. the same noise level in all direction) provided by the beam pattern.

[0069] The estimation circuit 109 is specifically arranged to determine the estimated angle of an interference point by searching for local minima of a power measure for the output signal. Thus, the estimation circuit 109 seeks to minimize the power of the output signal as this will correspond to the lowest noise/interference. In some embodiments, the estimation may only be performed when the desired sound source is inactive (e.g. when a desired speaker is not speaking) but it will be appreciated that this is not necessary for the minimization of the power of the output signal to be an indication of optimal noise/interference operation (specifically the presence of the desired signal may introduce an offset to the power measure but will not change the position of the minimum).

[0070] The estimation circuit 109 determines at least two local minima by searching in at least two angle intervals. The two angle intervals are typically disjoint, although in some embodiments some overlap may occur. The local minima are determined in the different angle intervals by a parallel processing based on different angles. Specifically, the estimation circuit 109 may copy the operation of the beamform circuit 107 and evaluate the resulting output signal for different angles in the different angle intervals. The estimation circuit 109 may then select one of the angles that have been found to correspond to a local minima for the output signals and the selected angle is then used as the estimate for the assumed single point interference source. The selected angle is then fed to the beamform circuit 107 which proceeds to perform the combination such that a nominal gain is provided in the direction of the desired source and a notch is provided in the estimated direction of the main single point interference. Furthermore, the combination uses weights that are selected to further minimize the diffuse noise. This constraint is imposed by the weights being selected to maximize a directivity cost measure.

[0071] The estimation operation and adaptation is independent of the actual noise and interference conditions and specifically is independent of whether a significant single point interferer or diffuse noise is present or not. However, the approach results in very efficient performance across a wide variety of scenarios including scenarios with a dominant single point interference and no diffuse noise as well as scenarios with no single point interference but substantial diffuse noise. Indeed, the approach and underlying assumptions result in an operation that not only adapts to the specific characteristics of the noise and single point interference characteristics but also adapts to the type of noise/interference scenario that is experienced. This also reduces complexity and facilitates operation as there is no need to adapt the algorithm to the type of audio environment being experienced. This also provides increased flexibility and a wider application of the approach.

[0072] In the following, a specific example of the system of Fig. 1 will be described. In the example, the beamform circuit 107 implements a sidelobe canceller and the local minima are determined using a gradient search within each angle interval. Once the direction to the assumed single point interference has been estimated, combination parameters in terms of the weights applied to the noise reference signals are determined under the constraint that the directivity cost measure is maximized.

[0073] Fig. 4 illustrates an example of a generalized sidelobe canceller used in the system of Fig. 1. The two dipole reference beams are first combined to generate two dipoles which are angled in the desired directions. The resulting dipoles are then combined with the monopole to generate a primary signal which corresponds to a beam directed towards the desired audio source.

[0074] The primary response may be given by

where



and



[0075] The primary signal thus corresponds to the desired audio signal but also comprises signals from undesired directions. The impact of these sidelobes is reduced by generation of noise reference signals which are weighted and subtracted from the primary signal to generate the output signal.

[0076] Thus, the sidelobe canceller generates the noise reference signals given by

where B is a blocking matrix given by:



[0077] It is noted that the noise-references are respectively a cardioid and a dipole response, with a null steered towards the primary signal at azimuth ϕs and elevation θ=π/2.

[0078] The two noise reference signals are then weighted by weights w1 and w2 before being subtracted from the primary signal to provide the output signal. Thus, the overall beam-pattern from the sidelobe canceller is given by:



[0079] The beamform circuit 107 is arranged to generate a nominal gain, in the following a unity gain, in a desired angle ϕs and a notch, specifically a zero, in the direction ϕn of an assumed single point interference determined by the estimation circuit 109.

[0080] With a unity gain in the direction of cps, the weights required to steer a zero towards the angle ϕn can be calculated by solving the equation:

where ϕ=ϕsn.

[0081] Solving the equation yields:

or alternatively:



[0082] As can be seen, the constraints of the unity gain and the direction of the zero do not uniquely define the required weights but provide an extra degree of freedom.

[0083] In the system, this degree of freedom is used to optimize diffuse noise performance. In particular, the noise reference weights are selected such that a directivity cost measure is maximized.

[0084] A suitable directivity cost measure is given by:



[0085] Thus, the directivity cost measure represents a ratio between the gain in the desired direction and the overall (power) gain averaged over the entire sphere. It will be appreciated that in other embodiments, the gain averaging may e.g. only be in a two-dimensional plane such as the azimuth plane.

[0086] For the response given by

this can be shown to correspond to:



[0087] Inserting the output signal response given by:

and inserting the value of w1 as given by:

in the directivity cost measure, and differentiating with respect to w2 and setting the result to zero allows for the minima of the directivity cost measure with respect to w2 to be determined. Thus, the value of w2 for which the directivity cost measure is maximized and thus the diffuse noise sensitivity is minimized can be determined. This specifically yields:



[0088] With this value of w2, we can also compute w1 as:



[0089] Thus, w1 and w2 can be calculated such that a unity gain is provided in the desired direction, a zero is formed in the direction of an assumed interferer and the diffuse noise attenuation is maximized under these constraints.

[0090] Thus, once the estimation circuit 109 has determined a suitable angle estimate for the assumed point source interferer, the derived equations can be used to calculate suitable weights that will also maximize the directivity cost measure and thus optimize the diffuse noise performance.

[0091] Rather than using the previously derived equation for w1, a value compensated for the effect of the design parameter can be used:



[0092] It can be shown that w2 can then be derived from:



[0093] Thus, the output signal y[k] is given by:

with





where is the estimate of the angle of the assumed undesired interferer and ϕs is the angle of the desired audio source.

[0094] The estimation circuit 109 proceeds to determine the direction estimate by minimizing a power measure for the output signal in different angle intervals.

[0095] Specifically, the estimation circuit 109 seeks to maximize the cost function given by:

where denotes the expected value.

[0096] Fig. 5 illustrates some examples of this cost function for a scenario wherein there is a single point interferer at the direction of ϕ equal to 1, 2 and 3 radians respectively (i.e. the angle difference ϕ between the desired direction and the direction between an actual interferer is 1, 2 and 3 radians respectively). The cost function is shown as a function of the estimated direction, i.e. as a function of the steering of the null performed by the weights of the reference signals. Fig. 6 illustrate the cost function in the presence of noise which either may be spherical (coming from all directions) or cylindrical (coming from all directions in a two-dimensional plane). The situation for spherical noise is shown by a full line and the situation for cylindrical noise is shown by the dashed line.

[0097] Some observations can be made from Fig. 5. Firstly, it is clear that in all situations, a notch (and specifically a null) exists for the right estimate, i.e. when ϕ̂ = ϕ. However, it is clear that whereas this null is indeed a local minimum for the cost function, it is not the only local minimum. In particular, in some cases local minima are found which do not correspond to a null and in some cases other nulls exist. Thus, it can be seen that merely determining the angle estimate by finding local minima is not a sufficient approach.

[0098] This can further be seen in Fig. 6 which illustrates the local minima of the cost function for different directions ϕ of an actual point source interferer. Again, it can be seen that there is a local minimum for the correct value (i.e. a minimum exists ϕ̂ = ϕ for as indicated by the diagonal line). However, in addition it can be seen that at least one and possibly two other local minima exist. For example, for an interferer at an angle of 1 radian, a cost function minimum exists at the right value of 1 radian but also at the wrong value of around 3.8 radians. Furthermore, for an interferer angle between around 2 radians to 4.3 radians, two wrong local minima exist.

[0099] However, a further observation is that the correct minimum is always the only local minimum in the phase interval from either 0 to π or from π to 2π. Thus, for an interferer angle within the interval of [0;π], the only local minimum in the interval of [0;π] is the correct value. Similarly, for an interferer angle within the interval of [π;2π], the only local minimum in the interval of [π;2π] is the correct value.

[0100] This realization is exploited in the system of Fig. 1. Specifically, the estimation circuit 109 is arranged to determine a local minimum in the angle interval of [0;π] and a local minimum in the angle interval of [π;2π]. Thus, the estimation circuit 109 determines two angles for which the cost function corresponding to the power of the output signal is minimized. This approach ensures that one of the determined local minima will correspond to the correct estimated angle.

[0101] The estimation circuit 109 then proceeds to select one of the two estimated values as the estimated angle that is used to control the beamforming by the beamform circuit 107. Thus, one of the local minima is selected and used to calculate the weights for the noise reference signals using the equations that also optimize diffuse noise performance.

[0102] It will be appreciated that different criteria for selecting between the determined local minima may be used. For example, in some embodiments, a simple beamforming may be applied to the microphone signals such that a beam is formed in each of the two directions in order to measure the interference level in those directions. The direction having the highest level is then selected as it corresponds to the most dominant interference.

[0103] However, in the specific example, the selection of the correct local minima is based on the gradient of the cost function at a specific angle which separates the two angle intervals (i.e. it is inbetween the two angle intervals and may specifically be an endpoint of one or both of the intervals). In the specific example, the gradient at π is determined and is used to select the appropriate local minimum. Specifically, if the cost function has a negative gradient for ϕ̂ = π then the local minimum in the interval of [0;π] is selected and otherwise the local minimum in the interval of [π;2π] is selected. Indeed, it has been found that such a selection provides a very reliable indication of the correct local minima and thus provides a low complexity but efficient selection approach.

[0104] Intuitively, it can be understood as follows. The directional beam pattern for ϕ̂ = π yields a cardioid response with only a single null. The gradient of the cost function for ϕ̂ = π therefore yields the direction toward the true value of ϕ. If the gradient is positive, the true value of ϕ lies in the interval [π;2π]. If the gradient is negative, the true value of ϕ lies in the interval [0;π].

[0105] It should also be noted that in some embodiments, all the local minima of the function may be determined and separated into the two angle intervals. Indeed, in such an embodiment, the detection of two local minima in one interval may automatically lead to the selection of the other minimum (i.e. the one in the other angle interval). This approach is based on the realization that (as illustrated in Fig. 6), the correct local minimum will be the only local minimum in the angle interval. It will also be appreciated that this leads to the conclusion that it is not necessary to identify more than one minimum in each angle interval as any non-identified local minima will inherently not be the correct minimum as it is in an angle interval with more than one minimum.

[0106] In the system of Fig. 1, the determination of the local minima is performed by performing a gradient search in each angle interval.

[0107] Thus, the estimation circuit 109 performs a sidelobe cancelling operation corresponding to that of the beamform circuit 107 while using an input angle value that is constantly updated and biased in the direction that will reduce the cost function. This approach will result in the angle variable ending in a local minimum.

[0108] Specifically, a steepest descent update equation for ϕ̂ can be derived by stepping in the direction opposite to the surface of the cost function with respect to ϕ̂:

with a gradient given by:

and where µ is the update step-size with 0 <µ< 1.

[0109] In practice, the mean value is not available and therefore an instantaneous estimate of the gradient is used:

by inserting the previously derived formulas for y[k] and performing the derivation, this can be shown to lead to:

where







[0110] Thus, the update value for the angle input variable of the gradient search is a function of an output signal of the sidelobe canceller and of the first and second noise reference signals.

[0111] In the above example, the update value is dependent on the power of the noise references. In order to compensate for this, the estimation circuit 109 may determine a power estimate for one or both of the noise reference signals and normalize the update value accordingly.

[0112] Accordingly, the following update equation for the gradient search may be used:

where ε is a small value to prevent zero division and is the power estimate of the i'th noise reference signal. This can specifically be calculated by a recursive averaging:

where β is a suitable design parameter.

[0113] Thus, for each angle interval, the estimation circuit 109 operates a sidelobe canceller applied to the same signals as the sidelobe canceller of the beamform circuit 107. However, the sidelobe cancellers are operated based on an input angle variable which corresponds to a current estimate of the angle to the assumed point source interferer. The input angle variable is continuously updated using the gradient search approach such that it will converge on the local minimum in the angle interval.

[0114] The estimation circuit 109 then selects between the current values of the input angle variables and uses this result as the estimated angle for the assumed point source interferer. The selection is based on the gradient of the cost function for an input variable of π. The estimation circuit 109 may specifically determine this by operating a further sidelobe canceller process on the input signals but with a fixed angle value of π. Specifically, the estimation circuit 109 may continuously evaluate the update value:

for ϕ̂=π. The derived values may be averaged over time and the sign of the averaged value (i.e. the gradient of the cost function at π) is then used to select which of the angles determined by the gradient searches is used.

[0115] It will be appreciated that whereas the previous discussion illustrated the principle by referring to the use of four sidelobe cancellers (one for the beamform circuit 107, one for each gradient search, and one for determining the gradient at π), this is merely used to illustrate the principle. Indeed, in many embodiments, the same sidelobe canceller may be implemented, e.g. as a subroutine, and used for the different purposes and with different input angles.

[0116] It will also be appreciated that typically, the beamform circuit 107 will not repeat a sidelobe canceller operation for the estimated angle but will directly use the output signal calculated for the selected angle when performing the estimation.

[0117] In the example, the gradient search is arranged to re-initialize the gradient search if the value of the angle input variable moves out of the corresponding angle interval. Specifically, a re-initialization of the gradient search may be performed if the two gradient searches reach a scenario wherein the both have angle values in the same angle interval. For example, if during the gradient search in the [0;π] interval, the updated angle value moves into the [π;2π] interval such that both gradient searches have current values within this interval, the gradient search is re-initialized. The re-initialization is specifically performed by resetting the value of the input angle variable of one of the two gradient searches to an initial value. The initial value may for example be a fixed value such as the midpoint in the interval (i.e. π/2 and 3π/2).

[0118] From Fig. 6, we can see that the relevant quadrant for the gradient search in the interval [0;π] is the lower-left quadrant. For the gradient search in the interval [π;2π], the upper-right quadrant is relevant.

[0119] Next looking at the lower-left quadrant for the gradient search in the interval [0;π], we can see from Fig. 6 and Fig. 7 (showing respectively the minima and maxima of the cost function) that a re-initialization within this interval would only lead to a correct convergence ϕ̂ = ϕ in case the re-initialization would be done in the range [0;η], where η≈2.55.

[0120] When the re-initialization would be larger than η, there is a risk that the gradient search again ends up in the wrong quadrant, i.e. the upper-left quadrant. Especially when ϕ is equal or close to zero, it is mandatory that re-initialization is done in the range [0;η], where η≈2.55.

[0121] Hence, for the re-initialization, it is safe to choose a mid-point in the interval [0;η], (i.e. η / 2), where η≈2.55.

[0122] A specific example of the approach that may be used is illustrated in Fig. 8. In step 801, the parameter values are initialized.

[0123] Step 801 is followed by step 803 wherein it is ensured that ϕ̂1[k] is smaller than ϕ̂2[k] (if not the two variable values are simply swapped).

[0124] Step 803 is followed by step 805 wherein it is determined if the two gradient searches have resulted in angle values, ϕ̂1 [k], ϕ̂2[k], in the same angle interval. If so, the appropriate value is re-initialized to ensure there is one angle in each angle interval.

[0125] Step 805 is followed by step 807 wherein the weights for the noise reference signals, the resulting output signal and the cost function gradients are calculated.

[0126] Step 807 is followed by step 809 wherein the new values for the angle input variables, ϕ̂1[k], ϕ̂2[k], of the gradient searches are calculated. Furthermore, the filtered cost function gradient at π is calculated.

[0127] Step 809 is followed by step 811 wherein the appropriate angle value is selected based on the filtered cost function gradient at π.

[0128] Step 811 is followed by step 813 wherein the power estimates for the noise reference signals used in the update value determination are updated.

[0129] After step 813 the method returns to step 803 to process the next sample.

[0130] A pseudo-code of an algorithm corresponding to Fig. 1 may be represented as:



[0131] In the specific example, the number of microphones in the microphone array 101 corresponded to the number of reference beams (i.e. three). However, in some embodiments, the microphone array may comprise more microphones than reference beams.

[0132] Specifically the microphone array 101 may comprise at least four microphones. The system may still only generate three reference beams and may specifically be arranged to combine signals from at least two microphones prior to generating the reference beams. Thus, the reference processor 105 may still only receive three input signals and generate three reference beams from these. However, at least one of these input signals may be generated by combining (and specifically averaging or adding (e.g. by a weighted summation)) the signals from at least two microphones. Such an approach may provide improved noise performance in many scenarios as the level of uncorrelated noise may be averaged. Furthermore, using more microphones on a particular area, has the advantage that spatial aliasing will occur at a higher frequency.

[0133] It will be appreciated that the above description for clarity has described embodiments of the invention with reference to different functional circuits, units and processors. However, it will be apparent that any suitable distribution of functionality between different functional circuits, units or processors may be used without detracting from the invention. For example, functionality illustrated to be performed by separate processors or controllers may be performed by the same processor or controllers. Hence, references to specific functional units or circuits are only to be seen as references to suitable means for providing the described functionality rather than indicative of a strict logical or physical structure or organization.

[0134] The invention can be implemented in any suitable form including hardware, software, firmware or any combination of these. The invention may optionally be implemented at least partly as computer software running on one or more data processors and/or digital signal processors. The elements and components of an embodiment of the invention may be physically, functionally and logically implemented in any suitable way. Indeed the functionality may be implemented in a single unit, in a plurality of units or as part of other functional units. As such, the invention may be implemented in a single unit or may be physically and functionally distributed between different units, circuits and processors.


Claims

1. An audio beamforming apparatus comprising:

a receiving circuit (103) for receiving signals from an at least two-dimensional microphone array (101) comprising at least three microphones;

a reference circuit (105) for generating at least three reference beam signals from the microphone signals received from the receiving circuit;

a combining circuit (107) for generating an output signal corresponding to a desired beam pattern by combining the reference beam signals in response to a first direction of a desired sound source and a direction estimate of an interfering sound source;

an estimation circuit (109) configured to generate said direction estimate by:

determining a first angle corresponding to a local minimum of a power measure of the output signal in a first angle interval,

the power measure being an angular function of the direction estimate; determining a second angle corresponding to a local minimum of the power measure of the output signal in a second angle interval; and

determining said direction estimate as an angle selected from a set of angles corresponding to local minima of the power measure of the output signal, the set of angles comprising at least the first angle and the second angle; and

wherein the combining circuit (107) is arranged to determine combination parameters for the combining of the reference beam signals to provide a notch in the desired beam pattern at an angle corresponding to the direction estimate and a maximization of a directivity cost measure, the directivity cost measure being indicative of a ratio between a gain in the first direction and overall power gain averaged over all directions.


 
2. The apparatus of claim 1 wherein the estimation circuit (109) is arranged to determine said direction estimate as an angle selected from one of the first angle and the second angle in response to a gradient of the power measure of the output signal for an angle value separating the first angle interval and the second angle interval.
 
3. The apparatus of claim 1 wherein the first angle interval comprises angles from 0 to π and the second angle interval comprises angles from π to 2π.
 
4. The apparatus of claim 3 wherein the estimation circuit (109) is arranged to determine said direction estimate as an angle selected from one of the first angle and the second angle in response to a gradient of the power measure of the output signal for an angle value of π.
 
5. The apparatus of claim 1 wherein the combining circuit (107) comprises a sidelobe canceller.
 
6. The apparatus of claim 5 wherein the sidelobe canceller is arranged to generate the output signal as a weighted combination of at least a primary signal, a first noise reference signal and a second noise reference signal, the primary signal, the first and second noise reference signals being combinations of the at least three reference beam signals.
 
7. The apparatus of claim 6 wherein the combining circuit (107) is arranged to calculate weights for the first and second noise reference signals in response to the direction estimate and the maximization of the directivity cost measure.
 
8. The apparatus of claim 5 wherein the estimation circuit (109) is arranged to determine at least one of the first and second angles by a gradient search applied to a sidelobe canceller corresponding to the sidelobe canceller of the combining circuit and having an angle variable and a cost function being a power measure of the output signal of the sidelobe canceller.
 
9. The apparatus of claim 8 wherein an update value for the angle variable is determined as a function of the output signal of the sidelobe canceller for a current value of the angle variable, and a first and second noise reference signals of the sidelobe canceller for the current value of the angle variable.
 
10. The apparatus of claim 9 wherein the first and second noise reference signals are weighted as a function of the current value of the angle variable.
 
11. The apparatus of claim 9 wherein the estimation circuit (109) is arranged to determine a power estimate for at least one of the first and second noise reference signals of the sidelobe canceller and to perform a normalization of the update value as a function of the power estimate.
 
12. The apparatus of claim 1 wherein the at least two-dimensional microphone array comprises at least four microphones and the apparatus comprises a circuit for combining signals from at least two of the at least four microphones prior to generating the reference beam signals.
 
13. The apparatus of claim 1 further comprising the at least two-dimensional microphone array (101), the at least two-dimensional microphone array (101) comprising directional microphones having a maximum response in a direction outwardly of a perimeter of the at least two-dimensional microphone array.
 
14. A method of audio beamforming comprising:

receiving signals from an at least two-dimensional microphone array comprising at least three microphones;

generating at least three reference beam signals from the microphone signals of the received signals;

generating an output signal corresponding to a desired beam pattern by combining the reference beam signals in response to a first direction of a desired sound source and a direction estimate of an interfering sound source;

generating the direction estimate by:

determining a first angle corresponding to a local minimum of a power measure of the output signal in a first angle interval, the power measure being an angular function of the direction estimate;

determining a second angle corresponding to a local minimum of the power measure of the output signal in a second angle interval; and

determining said direction estimate as an angle selected from a set of angles corresponding to local minima of the power measure of the output signal, the set of angles comprising at least the first angle and the second angle;

wherein combining of the reference beam signals comprises determining combination parameters for the combining of the reference beam signals to provide a notch in the desired beam pattern at an angle corresponding to the direction estimate and a maximization of a directivity cost measure, the directivity cost measure being indicative of a ratio between a gain in the first direction and an overall power gain averaged over all directions.


 
15. A computer program product comprising a computer program enabling a processor to carry out the method of claim 14.
 


Ansprüche

1. Audio-Strahlformungsvorrichtung, umfassend:

eine Empfangsschaltung (103), um Signale von einer mindestens zweidimensionalen Mikrofonanordnung (101) mit mindestens drei Mikrofonen zu empfangen;

eine Referenzschaltung (105), um mindestens drei Referenzstrahlsignale aus den von der Empfangsschaltung empfangenen Mikrofonsignalen zu generieren;

eine Kombinationsschaltung (107), um, im Ansprechen auf eine erste Richtung einer gewünschten Schallquelle und eine Richtungsschätzung einer interferierenden Schallquelle, ein Ausgangssignal durch Kombination der Referenzstrahlsignale zu generieren, das zu einem gewünschten Strahlmuster korrespondiert;

eine Schätzschaltung (109), die konfiguriert ist, die Richtungsschätzung zu generieren durch:

bestimmen eines ersten Winkels, korrespondierend zu einem lokalen Minimum eines Leistungsmaßes des Ausgangssignals in einem ersten Winkelintervall, wobei das Leistungsmaß eine Winkelfunktion der Richtungsschätzung ist;

bestimmen eines zweiten Winkels, korrespondierend zu einem lokalen Minimum des Leistungsmaßes des Ausgangssignals in einem zweiten Winkelintervall; und

bestimmen der Richtungsschätzung als einen Winkel, der aus einem Satz von Winkeln ausgewählt wird, die zu lokalen Minima des Leistungsmaßes des Ausgangssignals korrespondieren, wobei der Winkelsatz mindestens den ersten Winkel und den zweiten Winkel umfasst; und

wobei die Kombinationsschaltung (107) eingerichtet ist, um Kombinationsparameter für die Kombination der Referenzstrahlsignale zu bestimmen, um eine Kerbe in dem gewünschten Strahlmuster bei einem Winkel bereitzustellen, der zu der Richtungsschätzung und einer Maximierung eines Richtungskostenmaßes korrespondiert, wobei das Richtungskostenmaß ein Verhältnis zwischen einer Verstärkung in der ersten Richtung und einer Gesamtleistungsverstärkung, gemittelt über alle Richtungen, anzeigt.


 
2. Vorrichtung nach Anspruch 1, wobei die Schätzschaltung (109) eingerichtet ist, im Ansprechen auf einen Gradienten des Leistungsmesswerts des Ausgangssignals für einen Winkelwert, der das erste Winkelintervall und das zweite Winkelintervall trennt, die Richtungsschätzung als einen Winkel zu bestimmen, der aus dem ersten Winkel und zweiten Winkel ausgewählt wird.
 
3. Vorrichtung nach Anspruch 1, wobei das erste Winkelintervall Winkel von 0 bis π und das zweite Winkelintervall Winkel von π bis 2 π umfasst.
 
4. Vorrichtung nach Anspruch 3, wobei die Schätzschaltung (109) eingerichtet ist, im Ansprechen auf einen Gradienten des Leistungsmesswerts des Ausgangssignals für einen Winkelwert von π, die Richtungsschätzung als einen Winkel zu bestimmen, der aus dem ersten Winkel und zweiten Winkel ausgewählt wird.
 
5. Vorrichtung nach Anspruch 1, wobei die Kombinationsschaltung (107) einen Nebenkeulenauslöscher umfasst.
 
6. Vorrichtung nach Anspruch 5, wobei der Nebenkeulenauslöscher eingerichtet ist, das Ausgangssignal als eine gewichtete Kombination aus mindestens einem Primärsignal, einem ersten Rauschreferenzsignal und einem zweiten Rauschreferenzsignal zu erzeugen, wobei das Primärsignal, das erste und das zweite Rauschreferenzsignal Kombinationen der mindestens drei Referenzstrahlsignale sind.
 
7. Vorrichtung nach Anspruch 6, wobei die Kombinationsschaltung (107) eingerichtet ist, im Ansprechen auf die Richtungsschätzung und die Maximierung des Richtkostenmaßes Gewichte für das erste und das zweite Rauschreferenzsignal zu berechnen.
 
8. Vorrichtung nach Anspruch 5, wobei die Schätzschaltung (109) eingerichtet ist, den ersten und/oder den zweiten Winkel durch eine Gradientensuche zu bestimmen, die auf einen Nebenkeulenauslöscher angewandt wird, der zu dem Nebenkeulenauslöscher der Kombinationsschaltung korrespondiert und eine Winkelvariable und eine Kostenfunktion hat, die ein Leistungsmaß des Ausgangssignals des Nebenkeulenauslöschers ist.
 
9. Vorrichtung nach Anspruch 8, wobei ein Aktualisierungswert für die Winkelvariable als eine Funktion des Ausgangssignals des Nebenkeulenauslöschers für einen aktuellen Wert der Winkelvariablen und eines ersten und zweiten Rauschreferenzsignals des Nebenkeulenauslöschers für den aktuellen Wert der Winkelvariablen bestimmt wird.
 
10. Vorrichtung nach Anspruch 9, wobei die ersten und zweiten Rauschreferenzsignale als Funktion des aktuellen Wertes der Winkelvariablen gewichtet werden.
 
11. Vorrichtung nach Anspruch 9, wobei die Schätzschaltung (109) eingerichtet ist, einen Leistungsschätzwert für das erste und/oder das zweite Rauschreferenzsignal des Nebenkeulenauslöschers zu bestimmen und eine Normalisierung des Aktualisierungswertes als Funktion des Leistungsschätzwertes durchzuführen.
 
12. Vorrichtung nach Anspruch 1, wobei die mindestens zweidimensionale Mikrofonanordnung mindestens vier Mikrofone umfasst und die Vorrichtung eine Schaltung zur Kombination von Signalen von mindestens zwei der mindestens vier Mikrofone vor der Erzeugung der Referenzstrahlsignale umfasst.
 
13. Vorrichtung nach Anspruch 1, weiter umfassend die mindestens zweidimensionale Mikrofonanordnung (101), wobei die mindestens zweidimensionale Mikrofonanordnung (101) Richtmikrofone mit einem maximalen Ansprechverhalten in einer Richtung außerhalb eines Perimeters der mindestens zweidimensionalen Mikrofonanordnung umfasst.
 
14. Verfahren zur Audio-Strahlformung, umfassend:

empfangen von Signalen von einer mindestens zweidimensionalen Mikrofonanordnung mit mindestens drei Mikrofonen zu empfangen;

generieren von mindestens drei Referenzstrahlsignalen aus den Mikrofonsignalen der Empfangssignale;

generieren, im Ansprechen auf eine erste Richtung einer gewünschten Schallquelle und eine Richtungsschätzung einer interferierenden Schallquelle, eines Ausgangssignals durch Kombination der Referenzstrahlsignale, das zu einem gewünschten Strahlmuster korrespondiert;

generieren der Richtungsschätzung durch:

bestimmen eines ersten Winkels, korrespondierend zu einem lokalen Minimum eines Leistungsmaßes des Ausgangssignals in einem ersten Winkelintervall, wobei das Leistungsmaß eine Winkelfunktion der Richtungsschätzung ist;

bestimmen eines zweiten Winkels, korrespondierend zu einem lokalen Minimum des Leistungsmaßes des Ausgangssignals in einem zweiten Winkelintervall; und

bestimmen der Richtungsschätzung als einen Winkel, der aus einem Satz von Winkeln ausgewählt wird, die zu lokalen Minima des Leistungsmaßes des Ausgangssignals korrespondieren, wobei der Winkelsatz mindestens den ersten Winkel und den zweiten Winkel umfasst; und

wobei die Kombination der Referenzstrahlsignale ein Bestimmen von Kombinationsparametern für die Kombination der Referenzstrahlsignale umfasst, um eine Kerbe in dem gewünschten Strahlmuster bei einem Winkel bereitzustellen, der zu der Richtungsschätzung und einer Maximierung eines Richtungskostenmaßes korrespondiert, wobei das Richtungskostenmaß ein Verhältnis zwischen einer Verstärkung in der ersten Richtung und einer Gesamtleistungsverstärkung, gemittelt über alle Richtungen, anzeigt.


 
15. Computerprogrammprodukt, das ein Computerprogramm umfasst, das einen Prozessor in die Lage versetzt, das Verfahren nach Anspruch 14 durchzuführen.
 


Revendications

1. Appareil de formation de faisceau audio comprenant :

un circuit de réception (103) pour recevoir des signaux d'un réseau de microphones au moins bidimensionnel (101) comprenant au moins trois microphones ;

un circuit de référence (105) pour générer au moins trois signaux de faisceau de référence à partir des signaux de microphone reçus du circuit de réception ;

un circuit de combinaison (107) pour générer un signal de sortie correspondant à un diagramme de faisceau souhaité en combinant les signaux de faisceau de référence en réponse à une première direction d'une source sonore souhaitée et une estimation de direction d'une source sonore d'interférence ;

un circuit d'estimation (109) configuré pour générer ladite estimation de direction par :

la détermination d'un premier angle correspondant à un minimum local d'une mesure de puissance du signal de sortie dans un premier intervalle d'angle,

la mesure de puissance étant une fonction angulaire de l'estimation de direction ;

la détermination d'un deuxième angle correspondant à un minimum local de la mesure de puissance du signal de sortie dans un deuxième intervalle d'angle ; et

la détermination de ladite estimation de direction en tant qu'angle choisi parmi un ensemble d'angles correspondant à des minimums locaux de la mesure de puissance du signal de sortie, l'ensemble d'angles comprenant au moins le premier angle et le deuxième angle ; et

dans lequel le circuit de combinaison (107) est agencé pour déterminer des paramètres de combinaison pour la combinaison des signaux de faisceau de référence pour fournir une encoche dans le diagramme de faisceau souhaité à un angle correspondant à l'estimation de direction et une maximisation d'une mesure de coût de directivité, la mesure de coût de directivité indiquant un rapport entre un gain dans la première direction et un gain en puissance global moyenné dans toutes les directions.


 
2. Appareil de la revendication 1, dans lequel le circuit d'estimation (109) est agencé pour déterminer ladite estimation de direction en tant qu'angle choisi parmi l'un du premier angle et du deuxième angle en réponse à un gradient de la mesure de puissance du signal de sortie pour une valeur d'angle séparant le premier intervalle d'angle et le deuxième intervalle d'angle.
 
3. Appareil de la revendication 1, dans lequel le premier intervalle d'angle comprend des angles allant de 0 à π et le deuxième intervalle d'angle comprend des angles allant de π à 2π.
 
4. Appareil de la revendication 3, dans lequel le circuit d'estimation (109) est agencé pour déterminer ladite estimation de direction en tant qu'angle choisi parmi l'un du premier angle et du deuxième angle en réponse à un gradient de la mesure de puissance du signal de sortie pour une valeur d'angle de π.
 
5. Appareil de la revendication 1, dans lequel le circuit de combinaison (107) comprend un suppresseur de lobes latéraux.
 
6. Appareil de la revendication 5, dans lequel le suppresseur de lobes latéraux est agencé pour générer le signal de sortie en tant que combinaison pondérée d'au moins un signal primaire, un premier signal de référence de bruit et un deuxième signal de référence de bruit, le signal primaire, les premier et deuxième signaux de référence de bruit étant des combinaisons des au moins trois signaux de faisceau de référence.
 
7. Appareil de la revendication 6, dans lequel le circuit de combinaison (107) est agencé pour calculer des poids pour les premier et deuxième signaux de référence de bruit en réponse à l'estimation de direction et la maximisation de la mesure de coût de directivité.
 
8. Appareil de la revendication 5, dans lequel le circuit d'estimation (109) est agencé pour déterminer au moins l'un des premier et deuxième angles par une recherche de gradient appliquée à un suppresseur de lobes latéraux correspondant au suppresseur de lobes latéraux du circuit de combinaison et ayant une variable d'angle et une fonction de coût étant une mesure de puissance du signal de sortie du suppresseur de lobes latéraux.
 
9. Appareil de la revendication 8, dans lequel une valeur de mise à jour pour la variable d'angle est déterminée en fonction du signal de sortie du suppresseur de lobes latéraux pour une valeur actuelle de la variable d'angle, et des premier et deuxième signaux de référence de bruit du suppresseur de lobes latéraux pour la valeur actuelle de la variable d'angle.
 
10. Appareil de la revendication 9, dans lequel les premier et deuxième signaux de référence de bruit sont pondérés en fonction de la valeur actuelle de la variable d'angle.
 
11. Appareil de la revendication 9, dans lequel le circuit d'estimation (109) est agencé pour déterminer une estimation de puissance pour au moins l'un des premier et deuxième signaux de référence de bruit du suppresseur de lobes latéraux et pour effectuer une normalisation de la valeur de mise à jour en fonction de l'estimation de puissance.
 
12. Appareil de la revendication 1, dans lequel le réseau de microphones au moins bidimensionnel comprend au moins quatre microphones et l'appareil comprend un circuit pour combiner des signaux d'au moins deux des au moins quatre microphones avant de générer les signaux de faisceau de référence.
 
13. Appareil de la revendication 1, comprenant en outre le réseau de microphones au moins bidimensionnel (101), le réseau de microphones au moins bidimensionnel (101) comprenant des microphones directionnels ayant une réponse maximale dans une direction vers l'extérieur d'un périmètre du réseau de microphones au moins bidimensionnel.
 
14. Procédé de formation de faisceau audio comprenant le fait :

de recevoir des signaux d'un réseau de microphones au moins bidimensionnel comprenant au moins trois microphones ;

de générer au moins trois signaux de faisceau de référence à partir des signaux de microphone des signaux reçus ;

de générer un signal de sortie correspondant à un diagramme de faisceau souhaité en combinant les signaux de faisceau de référence en réponse à une première direction d'une source sonore souhaitée et une estimation de direction d'une source sonore d'interférence ;

de générer l'estimation de direction par :

la détermination d'un premier angle correspondant à un minimum local d'une mesure de puissance du signal de sortie dans un premier intervalle d'angle, la mesure de puissance étant une fonction angulaire de l'estimation de direction ;

la détermination d'un deuxième angle correspondant à un minimum local de la mesure de puissance du signal de sortie dans un deuxième intervalle d'angle ; et

la détermination de ladite estimation de direction en tant qu'angle choisi parmi un ensemble d'angles correspondant à des minimums locaux de la mesure de puissance du signal de sortie, l'ensemble d'angles comprenant au moins le premier angle et le deuxième angle ;

dans lequel la combinaison des signaux de faisceau de référence comprend la détermination de paramètres de combinaison pour la combinaison des signaux de faisceau de référence pour fournir une encoche dans le diagramme de faisceau souhaité à un angle correspondant à l'estimation de direction et une maximisation d'une mesure de coût de directivité, la mesure de coût de directivité indiquant un rapport entre un gain dans la première direction et un gain en puissance global moyenné dans toutes les directions.


 
15. Produit de programme informatique comprenant un programme informatique permettant à un processeur de réaliser le procédé de la revendication 14.
 




Drawing





























Cited references

REFERENCES CITED IN THE DESCRIPTION



This list of references cited by the applicant is for the reader's convenience only. It does not form part of the European patent document. Even though great care has been taken in compiling the references, errors or omissions cannot be excluded and the EPO disclaims all liability in this regard.

Non-patent literature cited in the description