BACKGROUND OF THE INVENTION
[0001] The invention relates to a method for the reduction or elimination of an eventual
noise contribution to an audio signal, which is picked up by sensor means. The invention
further relates to a device for reduction or elimination of an eventual noise contribution
to an audio signal, which is picked up by sensor means.
[0002] The invention will for practical reasons be explained in connection with a hearing
aid. The invention is however not limited to this field as it may be implemented in
other technical fields where an audio signal is picked up by sensor means.
[0003] Although modern hearing aids have come a long way in providing amplification well
suited to the hearing impaired user, many hearing impaired people still have problems
understanding speech in noise. The issue of improving the speech intelligibility by
improving the ratio between the desired speech signal and the noise has already been
addressed in many ways, mostly based on directionality.
[0004] As of the present day the commercially available directional devices are either an
acoustical two-port design as disclosed in US patent no. 5524056, an electrical combination
of the outputs of two omni-directional microphones as disclosed in J. Malsano & W.
Hottinger, "A method for electronically beam forming acoustical signals and acoustical
sensorapparatus", European Patent EP 0 820 210 A2, or an electrical combination of
several microphones into a single highly directional device to be used physically
externally to the hearing aid as disclosed in W. Soede, A.J. Berkhout & F.A. Bilsen,
1993, "Development of a directional hearing instrument based on array technology",
J. Acoust. Soc. Am. 94(2), p. 785-798.
[0005] These previously known devices are thus characterised by a certain amount of directionality
pointing in the forward direction with respect to the user, and characterised by a
time-invariant processing of the individual ingoing sensor signals. Although the use
of directionality to some extent gives improved speech intelligibility, the directionality
is in many situations non-satisfactory and there is a requirement of further improving
speech inteligibility.
[0006] Directionality is also used in a different group of proposed devices, which make
use of adaptive noise cancelling principles as disclosed in B. Widrow, J. Glover,
J. McCool, J. Kaunitz, C. Williams, R. Hern, J. Zeidler, E. Dong & R. Goodlin, 1975,
"Adaptive noise cancelling: Principles and applications", IEEE Proceedings 63, p.
1692-1716. Thus, a number of microphones, which may be mounted either in hearing aids
at each ear of the user, on a headband or in a single hearing aid shell, are combined
to form estimates of the interfering noise from which the target signal is removed.
A signal containing as much target signal as possible is also formed, and these signals
are then used as inputs to an adaptive noise canceller. These devices work on the
assumption that the target speech signal impinges from a certain predetermined direction,
and are characterised by a directionality and a processing of the individual ingoing
sensor signals, which is adaptively varying with time. Further, these devices are
characterised by the use of the adaptive noise canceller, as disclosed in P.M. Zurek,
J.E. Greenberg & P.M. Peterson, 1990, "Adaptive beamforming for noise reduction".
United States Patent 4,956,867; P.V.F. Clough & N.A. Lobo, European Patent 0084892;
and J. Vanden Berghe & J. Wouters, 1998, "An adaptive noise canceller for hearing
aids using two nearby microphones", J. Acoust. Soc. Am. 103(6), p. 3621-3626.
[0007] Although the use of adaptive noise cancelling based on directionality to some extend
gives improved speech intelligibility, the directionality is in many situations non-satisfactory
in this respect, as it is the case in connection with the use of directionality alone.
[0008] Recently, several researchers have published a new principle, which is called independent
component analysis (ICA):
1. J. Herault & C. Jutten, "Space or Time adaptive signal processing by neural network
models", Neural Networks for Computing, AIP Conference Proceedings 151, Snowbird,
Utah, pp. 207-211, 1986.
2. A.J. Bell, United States Patent 5,706,402.
3. A.J. Bell & T.J. Sejnowski, "An information-maximisation approach to blind separation
and blind deconvolution", Technical Report no. INC-9501, February 1995, Institute
for Neural Computation, UCSD, San Diego, CA 92093-0523.
4. T.W. Lee, M. Girolami, A.J. Bell & T.J. Sejnowski, "A unifying information-theoretic
framework for independent component analysis", Computers & Mathematics with applications,
1998 in press.
5. T.W. Lee, A.J. Bell & R. Orglmeister, "Blind source separation of real world signals",
Proceedings ICNN, USA, 1997.
6. T.W. Lee, A.J. Bell & R. H. Lambert, "Blind separation of delayed and convolved
sources", Advances in Neural Information Processing Systems 9, 1997 MIT Press, Camvridge
MA pp. 758-764.
7. P. Smaragdis, 1997, "Efficient blind separation of convolved sound mixtures", IEEE
Workshop on Applications of Signal Processing to Audio and Acoustics, Mohonk Mountain
House, New Paltz, New York.
8. K. Torkkola, "Blind separation of delayed sources based on information maximation",
Proceedings of the IEEE international conference on acoustic, speech and signal processing,
May 7-10 1996, GA, USA.
9. K. Torkkola, "Blind separation of convolved ources based on information maximization",
IEEE workshop on neural network for signal processing, Kyoto, Japan, Sept 4-6, 1996.
10.J.F. Cardoso, "Equivariant adaptive source separation", IEEE Transactions on Signal
Processing 44(12), 1996.
11.S. Amari, A. Cichocki, H. H. Yang, "A new learning algorithm for blind signal separation",
Advances in neural information Processing systems 8. MIT Press, 1996.
[0009] The objective of ICA is to recover the underlying independent source signals given
only sensor observations that are linear mixtures of the original source signals.
The only assumption of ICA is that the original source signals are statistically independent,
otherwise the statistics of the source signals and the mixing of these into the sensor
signals may be unknown. In contrast to correlation-based transformations such as Principal
Component Analysis (PCA, I.T. Jolliffe, Principal Component Analysis, 1986, Springer
Verlag), which decorrelates signals according to 2
nd-order statistics, ICA also reduces higher- order statistical dependencies, in terms
of maximising joint output entropy, in order to extract statistically independent
signal components.
[0010] In the linear blind signal separation problem,
N signals,
are mixed so that an array of
N sensors picks up a set of signals
each of which has been mixed, delayed and filtered as follows
where
Dij are entries in a matrix of delays and
aij are the
M - tap filter coefficients between the
jth source and the
ith sensor. The problem is to invert this scrambling without knowledge of it, thus
recovering the original signals
s(t) given only the
x(t) signals. Finding this inverse scrambling is a challenging task since no informations
are provided about the mixing nor the signals (hence the term blind separation). The
type of architecture chosen for inverting the scrambling is important and can be made
in numerous ways. An accurate architecture for inverting a M-tap filter is an infinite
impulse response (IIR) filter with M coefficients. However, IIR filters are limited
to have poles inside the unit circle, which imply that a stable filter only exists
for a minimum phase system. FIR filters may be used to approximate the inverse solution.
Thus the inverse scrambling is performed according to
which has filters,
wij, and delays
dij, which supposedly reproduce, at the output
u(
t) , the original uncorrupted source signals,
s(
t) , apart from a scaling factor for each signal and a permutation of signals.
[0011] Several algorithms have been proposed for the blind separation of linear mixtures.
Bell and Sejnowski (3) proposed to learn the separating process by minimising the
mutual information between components of
y(
t) =
g(
u(
t)), where
g is a non-linear function approximating the cumulative probability density function
of the sources. They showed that for positively kurtotic signals (like speech) minimising
the mutual information between components of
y(
t) is equal to maximising the entropy of
y(
t), which can be written as
where
fy(
y) denotes the probability density function of
y(
t). Denoting the determinant of the Jacobian of the whole unmixing process by |
J| ,
fy(
y) can be written as
fx(
x)/|
J| (the Jacobian is a matrix with entries of ∂
yi/∂
xj). Maximising the entropy of the output leads to maximising
which in turn can be developed into a stochastic gradient ascent rule using instances
of
x(
t) and
y(
t), instead of using the expectation. Thus
where
g(
u) = 1/(1 +
e-u) is used to approximate the cdf.
[0012] The algorithm can be made more efficient and independent of the conditioning of the
mixing process (matrix) by using the so-called natural gradient instead of the absolute
gradient, see Amari (11).
[0013] One particular proposed application of ICA is within electroencephalographic (EEG)
recording of scalp potentials in humans and related brain activity measurements.
[0014] The application of the ICA to hearing aids has also been mentioned in H. Sahlin,
"Blind signal separation by second order statistics", Ph.D. thesis, Chalmers University
of Technology, Technical Report no. 345, Sweden, 1998, but it has so far never been
suggested how the implementation of this technique could be realised in connection
with audio systems in order to improve speech intelligibility.
[0015] Based on this prior art the objective of the present invention is to provide a method
and a device for reducing noise in an audio signal comprising both noise and target
signal, which method and device has an increased functionality and reliability compared
with the prior art within the audio field of technology.
SUMMARY OF THE INVENTION
[0016] This is achieved by means of a method comprising the steps of:
providing at least two input signals, each having different contents of target signal
and noise;
processing the two input signals;
the processing comprising use of an independent component analysis or a similar technique
based on the differences of the at least two input signals, hereby determining whether
statistical dependent signal elements are present and removing at least part of unwanted
signal elements;
outputting a part of the audio signal.
[0017] The target signal is defined as the signal coming from in front of the device performing
the method, whereas other signals are considered as noise signals. In case of no specific
well-defined source signal and cases laying outside the assumed set-up, the noise
elimination will disappear meaning that the signal processing strategy will pass the
input signals unaltered to the outputs.
[0018] In a preferred embodiment the method comprises at least two input signals, which
are picked up at least at two mutually distanced locations. Hereby one signal contains
the target signal with a higher signal-to-noise ratio than the other input signal.
In this case the sensor may be identical.
[0019] In a further preferred embodiment the method comprises at least two input signals,
which are based on differences in the directionality, preferably a directional and
an omni-directional sensor. Hereby it becomes possible to determine the origin of
the respective signal components based on the differences of the directionality applied
when sensing the two input signals.
[0020] The different embodiments show that the differences in signal-to-noise ratio is based
on the actual situation of use, which means that it is the mutual differences in signal-to-noise
ratio in relation to a desired target signal, which is relevant, and not the signal-to-noise
ratio of the sensor means itself.
[0021] In a further preferred embodiment two or more output signals are produced and where
a possibility exists for switching between the two or more output signals or combinations
of these. Alternatively an automatic switching between the two or more output signals
according to a predetermined scheme is provided. Hereby it becomes possible to make
a choice of the actual influence of the separation method on the noise canceling in
the actual situation or at least have a change according to the actual choice of the
predetermined scheme, which may be adapted for switching at different listening environments.
[0022] The objective of the invention is further achieved by means of a device comprising:
at least two input channels;
signal processing means in connection with the input channels, which provides input
signals to the signal processing means;
a receiver in connection with the signal processing means;
the signal processing means being adapted to process the signals by means of an independent
component analysis method or a similar method based on differences of signal-to-noise
ratios of the input signals in relation to a desired target signal, the processing
comprising determining whether statistical dependent signal elements are present and
removing at least part of the unwanted signal elements, thereby enhancing other parts
of the audio signal.
[0023] In a preferred embodiment the device comprises two sensors, preferably microphones,
having different signal-to-noise ratio (S/N-ratio). One of the sensors is chosen as
the target sensor. Hereby it becomes possible to separate the respective signal components
based on the noise content of the further sensor.
[0024] In a preferred embodiment the device comprises a directional microphone and an omni-directional
microphone. Hereby the directional microphone will preferably contain the desired
output signal based on the position of the user facing the desired audio signal source.
[0025] In a preferred embodiment the two microphones are mutually distanced. Hereby it becomes
possible to determine the origin of the respective signal components based on a time
delay in reception of the two input signals. This makes it possible to make a beamforming
of the input signals hereby possibly adding directionality to at least one of the
inputs. Other ways of beamforming may be used in this connection.
[0026] In a preferred embodiment two or more output signals are produced and means are provided
for switching between the two or more output signals or combinations of these. Alternatively
means are provided for automatic switching between the two or more output signals
according to a predetermined scheme.
[0027] Due to the fact that most hearing impaired have a hearing disorder which makes it
even more difficult than for normal hearing persons to separate a target signal from
the noise, which is often present in a speech situation, the invention is particularly
relevant in connection with the technical field of hearing aids.
[0028] The invention therefor further relates to a hearing aid comprising: at least two
microphones for audio signal input; signal processing means in connection with the
microphones; an amplifier in connection with the signal processing means; a receiver
in connection with the amplifier for outputting a signal from the amplifier; the signal
processing means being adapted to process the signals by means of an independent component
analysis method or a similar method based on the input from the at least two microphones,
the processing comprising determining whether statistical dependent signal elements
are present and removing at least part of the unwanted signal elements, thereby enhancing
other parts of the audio signal.
[0029] The hearing aid according to the invention may further comprise the features set
forth above, either separate or in combination.
[0030] Other fields of relevant use of the invention may be telecommunication or audio systems.
In such systems the input and output may be connected to antennas or similar transmission
and receiving means or may comprise microphones as input means as in the case of a
hearing aid. Other elements of such systems may be standard elements, as these are
not influenced by the signal processing according to the invention.
[0031] The invention will be described more detailed with reference to the accompanying
drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
[0032]
- Fig. 1
- is a diagram showing the principles of the invention;
- Fig. 2
- is a schematic diagram showing the principles of an implemented version of the invention;
- Fig. 3
- is a schematic diagram showing the principles of an implemented version of the invention;
- Fig. 4
- is a schematic diagram showing the implemented version of the invention as shown in
fig. 2 further implemented in a hearing aid.
DESCRIPTION OF THE PREFERRED EMBODIMENT
[0033] The fundamental principle of the invention is schematically shown in fig. 1. The
invention is basically a system, e.g. a hearing aid, with two or more sensors and
a calculation unit. The calculation unit carries out the separation of the target
and noise signals, by using the independence of the mixed signals according to ICA
or a similar method comprising the basics of the ICA. The sensors are arranged so
that one is positioned to receive sound primarily from a target direction in front,
whereas the others have arbitrary characteristics that do not specifically favour
the target direction. Hereby, it is possible to use the technique of independent component
analysis to separate the desired signal, which impinges from the target direction,
from the disturbing noise signals, which impinge from any other directions.
[0034] To illustrate the invention, an example is given of a system implementing signal
processing as described above. Fig. 2 schematically shows the signal processing system.
The system comprises a directional and an omni-directional microphone, and a digital
signal processing unit implementing the signal separation algorithm. Using the directional
microphone gives the target direction from in front of the user, whereas the omni-directional
microphone gives a signal equally representing all signals around the head of the
user.
[0035] A particularly important property of the independent component analysis is that it
separates convolved and delayed source signals, where each independent source signal
is defined as a signal which appears in the same way within each mixing process. Another
important characteristic about the independent component analysis is that knowledge
about the ratio of the source signals within the mixed signals can be used for classifying
the separated signals. If for instance one source signal appears with a significantly
better signal to noise ratio in one of the sensor signals, this information can be
used to ensure that this source signal always will appear in a fixed output. Within
the present invention these two characteristics combined with an appropriate placement
of at least two sensors are exploited to eliminate signals not coming from in front
of the user of the device.
[0036] From fig. 3 an embodiment appears, which comprises the features of the embodiment
of fig. 2, but where the signal processor produces two output signals. By means of
switching means one of the two output signals may be selected for further processing,
e.g. amplification, or for output.
[0037] From fig. 4 an embodiment appears as a hearing aid according to the invention. The
essential components of the hearing aid comprise two microphones, preferably a directional
microphone and an omni-directional microphone, and an A/D converter connected to each
of the microphones. The A/D converters are connected to a digital signal processor,
which is adapted to perform the ICA method on the incoming signals. The signal from
the signal processor is then lead to an amplifier and from this through a D/A converter
to a receiver for performing the output of the processed signal. The devices of the
figs. is in a usual manner powered by means of usual power sources, such as batteries.
1. A method for reduction of noise in an audio signal containing noise and a target signal,
the method comprising:
providing at least two input signals, each having different contents of the target
signal and the noise, hereby providing different input signals;
processing the input signals by means of an independent component analysis based on
the differences of the content of the at least two input signals, hereby determining
whether statistical dependent signal elements are present and removing at least part
of the unwanted signal elements;
outputting a part of the audio signal.
2. A method according to claim 1, wherein the at least two input signals are picked up
at least at two mutually distanced locations;
3. A method according to claim 1 or 2, wherein two or more output signals are produced
and where a possibility exists for switching between the two or more output signals
or combinations of these.
4. A method according to any of the claims 1-3, wherein two or more output signals are
produced and where an automatic switching between the two or more output signals according
to a predetermined scheme is provided.
5. A device for use in reducing noise in an audio signal containing noise and a target
signal, the device comprising:
at least two input channels;
signal processing means in connection with the input channels, which provides input
signals to the signal processing means;
a receiver in connection with the signal processing means;
the signal processing means being adapted to process the signals by means of an independent
component analysis method based on differences of signal-to-noise ratios of the inputs
signals in relation to a desired target signal, the processing comprising determining
whether statistical dependent signal elements are present and removing at least part
of the unwanted signal elements, thereby enhancing other parts of the audio signal.
6. A device according to claim 5, wherein the device comprises at least two microphones
having a mutually different signal-to-noise ratio in relation to a desired target
signal, hereby being able to provide different input signals for the respective input
channels.
7. A device according to claim 5 or 6, wherein the device comprises at least a directional
microphone and an omni-directional microphone.
8. A device according to claim 5, 6 or 7, wherein two or more output signals are produced
and where means are provided for switching between the two or more output signals
or combinations of these.
9. A method according to claim 5, 6 or 7, wherein two or more output signals are produced
and where automatic switching means are provided for switching between the two or
more output signals according to a predetermined scheme.
10. A hearing aid comprising at least two microphones each having a different signal-to-noise
ratio in relation to a desired target signal; signal processing means in connection
with the at least two microphones; an amplifier in connection with the signal processing
means; a receiver in connection with the amplifier for outputting a signal from the
amplifier; the signal processing means being adapted to process the signals by means
of an independent component analysis method based on the input from the two microphones
having a mutually different signal-to-noise ratio in relation to a desired target
signal, the processing comprising determining whether statistical dependent signal
elements are present and removing at least part of the unwanted signal elements, thereby
enhancing other parts of the audio signal.
11. A hearing aid according to claim 12, wherein the hearing aid comprises a directional
microphone and an omni-directional microphone.