Claim of Priority
[0001] The present application is a continuation-in-part (CIP) of and claims the benefit
of priority under 35 U.S.C. § 120 to
U.S. Application Serial No. 12/474,881, filed May 29,2009, and titled COMPRESSION AND MIXING FOR HEARING ASSISTANCE DEVICES, which claims the
benefit of priority under 35 U.S.C. § 119(e) to
U.S. Provisional Patent Application Serial No. 61/058,101, filed on June 2, 2008, the benefit of priority of each of which is claimed hereby, and each of which are
incorporated by reference herein in its entirety
Field of the Invention
[0002] This patent application pertains to apparatus and processes for compression of spaced
sources for hearing assistance devices.
Background
[0003] Hearing assistance devices, such as hearing aids, include electronic instruments
worn in or around the ear that compensate for hearing losses by amplifying and processing
sound. The electronic circuitry of the device is contained within a housing that is
commonly either placed in the external ear canal and/or behind the ear. Transducers
for converting sound to an electrical signal and vice-versa may be integrated into
the housing or external to it.
[0004] Whether due to a conduction deficit or sensorineural damage, hearing loss in most
patients occurs non-uniformly over the audio frequency range, most commonly at high
frequencies. Hearing aids may be designed to compensate for such hearing deficits
by amplifying received sound in a frequency-specific manner, thus acting as a kind
of acoustic equalizer that compensates for the abnormal frequency response of the
impaired ear. Adjusting a hearing aid's frequency specific amplification characteristics
to achieve a desired level of compensation for an individual patient is referred to
as fitting the hearing aid. One common way of fitting a hearing aid is to measure
hearing loss, apply a fitting algorithm, and fine-tune the hearing aid parameters.
[0005] Hearing assistance devices also use a dynamic range adjustment, called dynamic range
compression, which controls the level of sound sent to the ear of the patient to normalize
the loudness of sound in specific frequency regions. The gain that is provided at
a given frequency is controlled by the level of sound in that frequency region (the
amount of frequency specificity is determined by the filters in the multiband compression
design). When properly used, compression adjusts the level of a sound at a given frequency
such that its loudness is similar to that for a normal hearing person without a hearing
aid. There are other fitting philosophies, but they all prescribe a certain gain for
a certain input level at each frequency. It is well known that the application of
the prescribed gain for a given input level is affected by time constants of the compressor.
What is less well understood is that the prescription can break down when there are
two or more simultaneous sounds in the same frequency region. The two sounds may be
at two different levels, and therefore each should receive different gain for each
to be perceived at their own necessary loudness. Because only one gain value can be
prescribed by the hearing aid, however, at most one sound can receive the appropriate
gain, providing the second sound with the less than desired sound level and resulting
loudness.
[0006] This phenomenon is illustrated in the following fgures. FIG. 1 shows the levels of
two different sounds out of a filter centered at 1 kHz-in this example, the two sounds
are two different speech samples. The samples are overlaid on FIG. 1 and one is in
a thick dark line 1 and the second is in a thin line 2.
[0007] FIG. 2 shows the gains that would be applied to those two different sounds at 1 kHz
if they were to be presented to a hypothetical multiband dynamic range compressor.
Notice that the ideal gain for each speech sample is different. Again, the samples
from the thick dark line 1 are shown in comparison to those of the thin line 2.
[0008] FIG. 3 shows the two gains from FIG. 1 represented by the thick dark line 1 and the
thin line 2, but with a line of intermediate thickness 3 which shows the gain that
is applied when the two sounds are mixed together before being sent to the multiband
compressor. Notice that when the two sounds are mixed together, neither receives the
exact gain that should be prescribed for each separately; in fact, there are times
when the gain should be high for one speech sample, but it is low because the gain
is controlled by the level of the mix of the two sounds, not the level of each sound
individually. This can cause artificial envelope fluctuations in each sound, described
as comodulation or cross modulation by Stone and Moore (
Stone, M. A., and Moore, B. C. (2008). "Effects of spectro-temporal modulation changes
produced by multi-channel compression on intelligibility in a competing-speech task,"
J Acoust Soc Am 123, 1063-1076.)
[0009] This could be particularly problematic with music and other acoustic sound mixes
such as the soundtrack to a Dolby 5.1 movie, where signals of significantly different
levels are mixed together with the goal of provided a specific aural experience. If
the mix is sent to a compressor and improper gains are applied to the different sounds,
then the auditory experience is negatively affected and is not the experience intended
by the produce of the sound. In the case of music, the gain for each musical instrument
is not correct, and the gain to one instrument might be quite different than it would
be if the instrument were played in isolation. The impact is three-fold: the loudness
of that instrument is not normal for the hearing aid listener (it may be too soft,
for example), distortion to the temporal envelope of that instrument can occur, and
interaural-level difference (ILD) cues for sound source localization and segregation
can be distorted, making the perceived auditory image of that instrument fluctuate
in a way that was not in the original recording..
[0010] Another example is when the accompanying instrumental tracks in a movie soundtrack
have substantial energy then compression can overly reduce the overall level and distort
the ILD of the simultaneous vocal tracks, diminishing the ability of the wearer to
enjoy the mix of instrumental and vocal sound and even to hear and understand the
vocal track. Thus, there is a need in the art for improved compression and mixing
systems for hearing assistance devices.
Summary
[0011] This application relates to a system for compression for hearing assistance devices
by application of compression to individual sound sources before applying a head-related
transfer function (HRTF) to prevent compression and cross modulation of ILD cues for
localization, according to one example. Variations of the present system use coordinated
compression to left and right individual source signals after application of HRTFs.
This Summary is an overview of some of the teachings of the present application and
is not intended to be an exclusive or exhaustive treatment of the present subject
matter. Further details about the present subject matter are found in the detailed
description and the appended claims. The scope of the present invention is defined
by the appended claims and their legal equivalents.
Brief Description of the Drawings
[0012] FIG. 1 shows the levels of two different sounds out of a filter centered at 1 kHz.
[0013] FIG. 2 shows the gains that would be applied to those two different sounds of FIG.
1 at 1 kHz if they were to be presented to a hypothetical multiband dynamic range
compressor.
[0014] FIG. 3 shows the two gains from FIG. 1 represented by the thick line and the thinner
line, but with a line of intermediate thickness which shows the gain that is applied
when the two sounds are mixed together before being sent to the multiband compressor.
[0015] FIG. 4 illustrates a system for processing left and right stereo signals from a plurality
of sound sources in order to produce mixed left and right sound output signals that
can be used by left and right hearing assistance devices.
[0016] FIG. 5 illustrates a system for processing left and right stereo signals from a plurality
of sound sources by applying compression before mixing to produce mixed left and right
sound output signals that can be used by left and right hearing assistance devices
according to one embodiment of the present subject matter.
[0017] FIG. 6 shows one embodiment of a signal processor that includes a surround sound
synthesizer for producing the surround sound signals from the left and right stereo
signals where compression is applied the surround sound signals before mixing to produce
mixed left and right sound output signals that can be used by left and right hearing
assistance devices according to one embodiment of the present subject matter.
[0018] FIG. 7 shows an embodiment where a stereo music signal is processed to separate the
center signal from the left-dominant and right-dominant signals in order to compress
the center signal separately from the left-dominant and right-dominant signals, according
to one embodiment of the present subject matter.
[0019] FIG. 8 shows an embodiment for separating sounds into component sound sources and
compressing each individual sound source before being remixed into the original number
of channels, according to one embodiment of the present subject matter.
[0020] FIG. 9A shows a modification of FIG. 6 above, including independently compressing
individual sound sources before applying HRTFs to prevent compression and cross modulation
of TLDs, according to one embodiment of the present subject matter.
[0021] FIG. 9B shows a modification of FIG. 6 above, including applying coordinated compression
to individual left and right signal sources after applying HRTFs to prevent compression
and cross modulation of ILDs, according to one embodiment of the present subject matter.
[0022] FIG. 10 shows a system with separate source signals and separate microphone signals.
[0023] FIG. 11A shows a modification of FIG. 10 above, including independently compressing
estimated source signals before applying HRTFs to prevent compression and cross modulation
of ILDs, according to one embodiment of the present subject matter.
[0024] FIG. 11B shows a modifcation of FIG. 10 above, including applying coordinated compression
to individual left and right signal sources after applying HRTFs to prevent compression
and cross modulation of ILDs, according to one embodiment of the present subject matter.
Detailed Description
[0025] The following detailed description of the present invention refers to subject matter
in the accompanying drawings which show, by way of illustration, specific aspects
and embodiments in which the present subject matter may be practiced. These embodiments
are described in sufficient detail to enable those skilled in the art to practice
the present subject matter. References to "an", "one", or "various" embodiments in
this disclosure are not necessarily to the same embodiment, and such references contemplate
more than one embodiment. The following detailed description is, therefore, not to
be taken in a limiting sense, and the scope is defined only by the appended claims,
along with the full scope of legal equivalents to which such claims are entitled.
[0026] FIG. 4 illustrates a system for processing left and right stereo signals from a plurality
of sound sources in order to produce mixed left and right sound output signals that
can be used by left and right hearing assistance devices. The figure shows separate
left 410 and right 420 channels where a plurality of left sound sources 1L, 2L, ...
, NL are mixed by mixer 411 to make a composite signal that is compressed using compressor
412 to produce the left output signal LO. FIG. 4 also shows in the right channel 420
a plurality of right sound sources 1R, 2R, ... , NR that are mixed by mixer 421 to
make a composite right signal that is compressed by compressor 422 to produce a right
signal RO. It is understood that the separate sound sources can be right and left
tracks of individual instruments. It is also possible that the tracks include vocals
or other sounds. The system provides compression after the mixing which can result
in over-attenuation of desired sounds, which is an undesired side effect of the signal
processing. For example, if track 1 included bass guitar and track 2 included a lead
guitar, it is possible that the louder instrument would dominate the signal strength
in the channel at any given time and may result in over-attenuation of the weaker
signal when compression is applied to the composite signal. Furthermore, because left
and right signals are compressed independently, level difference between the left
and right output signals LO and RO are compressed, i.e., ILD cues are reduced.
[0027] FIG. 5 illustrates a system for processing left and right stereo signals from a plurality
of sound sources by applying compression before mixing to produce mixed left and right
sound output signals that can be used by left and right hearing assistance devices,
according to one embodiment of the present subject matter. This embodiment applies
compression (512 for the left channel 510 and 522 for the right channel 520) to each
signal independently to assist in preserving the ability to mix each signal accordingly
(using mixers 510 and 521, respectively). This approach allows each sound source 1L,
2L, ... , NL and 1R, 2R, ... , NL to be added to the composite signal as desired.
It is understood that to provide a plurality of sound sources two or more sound sources
are input into the mixer. These may be right and left components of an instrumental
input, vocal input, or other sound input. Level difference between the left and right
output signals LO and RO are compressed, i.e., ILD cues are reduced, because left
and right signals are compressed independently.
[0028] FIG. 6 shows one embodiment of a signal processor that includes a surround sound
synthesizer for producing the surround sound signals from the left and right stereo
signals where compression is applied the surround sound signals before mixing to produce
mixed left and right sound output signals that can be used by left and right hearing
assistance devices according to one embodiment of the present subject matter. A surround
sound synthesizer 601 receives a right stereo signal SR and a left stereo signal SL
and converts the signals into LS, L, C, R, and RS signals. In various embodiments,
the HRTFs are not used and the signal passes from the surround sound synthesizer 601
to the compression stages 610R and 610L before being sent to the mixers 611 R and
611L. In various embodiments, the signals are processed by right and left head-related
transfer functions (HRTFs) 608R and 608L. The resulting signals are then sent through
compression stages 510R and 610L before being sent through mixers 611R and 611L. The
resulting outputs RO and LO are used by the hearing assistance device to provide stereo
sound reception. Level difference between the left and right output signals LO and
RO are compressed, i.e., ILD cues are reduced, because left and right signals are
compressed independently. It is understood that other surround sound systems may be
employed without departing from the scope of the present subject matter. For example,
surround sound systems include, but are not limited to Dolby 5.1, 6.1, and 7.1 systems,
and the application ofHRTFs is optional. Thus, the examples provided herein are intended
to be demonstrative and not limiting, exclusive, or exhaustive.
[0029] One advantage of the system of FIG. 6 is that the center channel, which frequently
is dominated by vocals, can be separated compressed from the other channels, which
are largely dominated by the music. Such compression and mixing avoids cross modulation
of gain. In various embodiments, the level of compression is commensurate with that
found in hearing assistance devices, such as hearing aids. Other levels of compression
are possible without departing from the scope of the present subject matter.
[0030] Independent dynamic-range compression of stereo signals diminishes ILD cues for localization.
Particularly in complex situations with multiple sources, preserving ILD cues is beneficial.
The present subject matter preserves source-specific ILD cues by preventing compression
and cross modulation of the ILDs, while providing optimal gain for each individual
source.
[0031] FIG. 9A shows a modification of FIG. 6 above, including independently compressing
individual sound sources before applying HRTFs to prevent compression and cross modulation
of ILDs, according to one embodiment of the present subject matter. The present subject
matter applies compression 609 separately to each individual source in a mixture,
without compressing its ILD. The five surround sound signals LS, L, C, R, and RS represent
estimates of individual source signals. These estimated source signals are first independently
compressed which prevents cross modulation between the waveforms of these sources.
Then, head-related transfer functions (HRTFs) are applied to the compressed source
signals. Since the HRTFs are applied after compression, the ILDs are neither compressed
nor cross modulated.
[0032] An alternate embodiment is shown in FIG. 9B including applying coordinated compression
to individual left and right signal sources after applying HRTFs to prevent compression
and cross modulation of ILDs, according to one embodiment of the present subject matter,
Here, coordinated compression, i.e., identical gain, is applied to the left and right
individual source signals after application of the HRTFs. Coordinated compression
prevents compression and cross modulation of the ILDs. This holds true for simple
coordinated compression of the mixture as well. However, here optimal gain is applied
to each individual source. High-level sources, for example, will not suppress the
gain for low-level sources.
[0033] FIG. 10 shows a system with separate source signals and separate microphone signals.
Here, s1 and s2 represent the true separate source signals and x1 and x2 two (hearing-aid)
microphone signals. Blind source separation (BSS) provides the estimates y1 and y2
of these source signals. Application of the (room) transfer functions H
nmtilde yields separate stereo output signals for the two sources. Signals "z1 left" and
"z1 right," for example, represent the estimated left and right microphone signals
for the single source s1. FIG. 11A shows a modification of FIG. 10 above, representing
one embodiment of the present subject matter. Compression 1109 is applied separately
to the estimated source signals y1 and y2, before application of the transfer functions
H
nmtilde. Similar to the embodiment in FIG. 9A, this yields ILDs that are neither compressed
nor cross modulated. The embodiment shown in FIG. 11B applies coordinated compression
1109 to the stereo source signals, similar to the embodiment in FIG. 9B. Again, ILDs
are preserved while optimal gain is applied to each of the sources. FIG. 7 shows one
embodiment for separating a stereo signal into three channels for a more source-specific
compression. Often in music, the signal for the singer is equally applied to both
the left and right channel, centering the perceptual image of the singer. Consider
the simple example of a stereo music signal with a singer S that is equally in the
left and right channel, instrument A that is predominantly in the left channel, and
instrument B that is predominantly in the right channel. Then, the left L and right
R channels can be described as:


[0035] Thus, one can compress the (L+R)/2 mix to the compressor so that the gain is primarily
that for the singer. To get a signal that is primarily instrument A and one that is
primarily instrument B:


[0036] After CS, CL and CR have been individually compressed, they are mixed together to
create a stereo channel again:


[0037] FIG. 7 is one example of how to combine the original channels before compression
and how to mix the post-compressed signals back into a stereo signal, but other approaches
exist. FIG. 7 shows the left (A+S) signal 701 and the right (B+S) signal 702 applied
to multipliers (which multiply by ½) and summed by summers to create the CA, CB, and
2CS signals. The CS signal is obtained using multiplier 705. The CA, CB and CS signals
are compressed by compressors 706, 708, and 707, respectively, and summed by summers
710 and 712. The resulting outputs are multiplied by 2/3 by multipliers 714 and 715
to provide the compressed left and compressed right signals, as shown in FIG. 7. It
is understood that this is one example of how to process the signals and that other
variations are possible without departing from the scope of the present subject matter.
Thus, the system set forth in FIG. 7 is intended to be demonstrative and not exhaustive
or exclusive.
[0038] FIG. 8 represents a general way of isolating a stereo signal into individual components
that can then be separately compressed and recombined to create a stereo signal. There
are known ways of taking a stereo signal and extracting the center channel in a more
complex way than shown in FIG. 8 (e.g.,
U.S. Pat. No. 6,405,163, and
U.S. Patent Application Publication Number 2007/0076902). Techniques can also be applied to monaural signals to separate the signal into
individual instruments. With either approach, the sounds are separated into individual
sound source signals, and each source is compressed; the individually compressed sources
are then combined to create either the monaural or stereo signal for listening by
the hearing impaired listener.
[0039] Left stereo signal 801 and right stereo signal 802 are sent through a process 803
that separates individual sound sources. Each source is sent to a compressor 804 and
then mixed with mixer 806 to provide left 807 and right 808 stereo signals according
to one embodiment of the present subject matter.
[0040] It is understood that the present subject matter can be embodied in a number of different
applications. In applications involving mixing of music to generate hearing assistance
device-compatible stereo signals, the mixing can be performed in a computer programmed
to mix the tracks and perform compression as set forth herein. In various embodiments,
the mixing is done in a fitting system. Such fitting systems include, but are not
limited to, the fitting systems set forth in
U.S. Patent Application Ser. No. 11/935,935, filed Nov. 6, 2007, and entitled: SIMULATED SURROUND SOUND HEARING AID FITTING SYSTEM, the entire specification
of which is hereby incorporated by reference in its entirety.
[0041] In various embodiments, the mixing is done using the processor of the hearing assistance
device. In cases where such devices are hearing aids, that processing can be done
by the digital signal processor of the hearing aid or by another set of logic programmed
to perform the mixing function provided herein. Other applications and processes are
possible without departing from the scope of the present subject matter.
[0042] It is understood that in various embodiments, the apparatus and processes set forth
herein may be embodied in digital hardware, analog hardware, and/or combinations thereof.
It is also understood that in various embodiments, the apparatus and processes set
forth herein may be embodied in hardware, software, firmware, and/or combinations
thereof.
[0043] This application is intended to cover adaptations and variations of the present subject
matter. It is to be understood that the above description is intended to be illustrative,
and not restrictive. The scope of the present subject matter should be determined
with reference to the appended claim, along with the full scope of legal equivalents
to which the claims are entitled.
1. An apparatus for processing sound for a hearing assistance device placed at a wearer's
ear, the apparatus comprising:
a receiver adapted to receive signals from a sound environment;
a processor connected to the receiver, the processor adapted to process received signals
to isolate individual sound source components, the processor further adapted to apply
a head-related transfer function to the individual sound source components;
a compressor connected to the processor, the compressor adapted to independently compress
each of the individual sound source components prior to the application of the head-related
transfer function;
a mixer connected to the compressor, the mixer adapted to mix the compressed sound
source components to produce a mixed output signal; and
a speaker connected to the mixer, the speaker integrated with the hearing assistance
device and adapted to output the mixed output signal at the wearer's ear.
2. The apparatus of claim 1, wherein the compressor is adapted to apply coordinated compression
to the individual sound components after application of the head-related transfer
function.
3. The apparatus of claim 1 or claim 2, wherein the head related transfer function is
applied at an individual angle of reception for each of the individual sound components.
4. The apparatus of any of the preceding claims, wherein the receiver is adapted to receive
sound signals having a stereo right (SR) and a stereo left (SL) sound signal.
5. The apparatus of claim 4, wherein the processor is adapted to process the SR and SL
signals to produce left surround (LS), left (L), center (C), right (R) and right surround
(RS) signals.
6. The apparatus of claim 5, wherein the processor is further adapted to generate a processed
version for each of the LS, L, C, R, and RS signals by application of a head-related
transfer function at an individual angle of reception for each of the LS, L, C, R,
and RS signals.
7. The apparatus of claim 6, wherein the compressor is adapted to compress the processed
version for each of the LS, L, C, R, and RS signals.
8. The apparatus of claim 7, wherein the mixer is adapted to mix the compressed and processed
version of the LS, L, C, R, and RS signals to produce one or both of a right output
signal (RO) and a left output signal (LO).
9. The apparatus of claim 8, wherein the hearing assistance device includes a right hearing
assistance device including a right speaker and a left hearing assistance device including
a left speaker, and wherein the RO signal is adapted to be used by the right speaker
the LO signal is adapted to be used by the left speaker.
10. The apparatus of any of the preceding claims, wherein the processor includes a synthesizer.
11. The apparatus of claim 10, wherein the synthesizer includes a surround sound synthesizer.
12. A method, comprising:
receiving stereo surround signals from a sound environment;
processing the received signals to isolate individual sound source components;
independently compressing the individual sound source components to prevent cross
modulation between waveforms of the source components; and
after compressing the components, applying a head-related transfer function to the
compressed individual sound source components.
13. The method of claim 12, wherein receiving stereo surround signals includes receiving
at least one left sound signal and at least one right sound signal.
14. The method of claim 12 or claim 13, wherein processing the received signals to isolate
components includes processing to isolate voice and instrument components from musical
signals.
15. The method of any of claim 12 through claim 14, wherein applying the head related
transfer function includes applying the transfer function at an individual angle of
reception for each of the individual sound components.