Technical Field
[0001] The present disclosure relates to the field of presentation of audio content and,
in particular, to the presentation of a subset of the audio from a selection of loudspeakers.
The present disclosure also relates to associated apparatus, methods and computer
programs.
Background
[0002] Telecommunication or telephony systems are being developed that provide for more
than monophonic capture and presentation of audio. The presentation of such audio
may require careful consideration to ensure the telecommunication is clear and effective.
[0003] The listing or discussion of a prior-published document or any background in this
specification should not necessarily be taken as an acknowledgement that the document
or background is part of the state of the art or is common general knowledge. One
or more aspects/examples of the present disclosure may or may not address one or more
of the background issues.
Summary
[0004] In an aspect of the disclosure there is provided an apparatus, the apparatus comprising
means configured to:
receive first position information indicative of at least the location of a first
user with respect to a presentation space; and
receive second position information indicative of at least the location of a second
user with respect to the presentation space;
provide for presentation, using a first selection of loudspeakers comprising at least
two loudspeakers having known locations in the presentation space, of secondary audio
of first audio content to the first user based on the first position information being
indicative of the first user being in the presentation space and the second position
information being indicative of the second user being outside the presentation space;
and
provide for presentation of secondary audio of the first audio content using a second
selection of the loudspeakers and the secondary audio of the second audio content
using a third selection of the loudspeakers, the second selection comprising fewer
loudspeakers than the first selection, and the second and third selections comprising
non-overlapping selections of the loudspeakers based on the location of the first
user and the second user in the presentation space determined from the first position
information and the second position information, wherein the first audio content and
the second audio content each comprise primary audio and the secondary audio, the
secondary audio being different to the primary audio and comprising ambient audio.
[0005] In one or more examples, the primary audio comprises voice audio comprising audio
determined to be generated by a voice of one or more remote users, such as for telecommunication
with the first/second user, and the secondary audio comprises ambient audio comprising
audio other than that determined to be generated by the voice of the one or more remote
users. In one or more examples, the primary audio comprises spatial audio that includes
directional information such that, when presented, it is to be perceived as originating
from a direction or range of directions in accordance with the directional information
and the secondary audio comprises at least one of audio without said directional information
and spatial audio with said directional information that defines a range of directions
from which the audio should be perceived greater than a threshold range of directions.
In one or more examples, the first and/or second audio content comprises telecommunication
audio content comprising audio content provided for the purpose of telecommunication,
which may be via a traditional telecommunication network or provided by a voice over
IP or any other packet-based or circuit switched telephony service.
[0006] In one or more embodiments, the apparatus includes means configured to provide for
presentation of the primary audio of the first audio content and the primary audio
of the second audio content from at least the speaker of the at least two speakers
that satisfies a criterion, the criterion including at least in part the following
qualifiers:
the speaker that is the closest speaker to the respective user that is closer to the
respective user than it is to the other respective user; and
the primary audio of the first audio content and the primary audio of the second audio
content are presented from different speakers.
[0007] In one or more examples, the apparatus includes means configured to provide for presentation
of the primary audio of the first audio content and the primary audio of the second
audio content from at least the loudspeaker of the at least two loudspeakers that
satisfies a criterion, the criterion including at least in part the following qualifiers:
the loudspeaker that is the closest loudspeaker to the respective user that is closer
to the respective user than it is to the other respective user; and
the primary audio of the first audio content and the primary audio of the second audio
content are presented from different loudspeakers.
[0008] In one or more embodiments, the second selection is determined based, at least in
part, on which of the at least two loudspeakers are closer to the first user than
the second user and the third selection is determined based, at least in part, on
which of the at least two loudspeakers are closer to the second user than the first
user, wherein the second and third selections are non-overlapping.
[0009] In one or more embodiments, the first position information is also indicative of
the orientation of the first user in the presentation space and the second position
information is also indicative of the orientation of the second user in the presentation
space; and
wherein the loudspeakers that form part of the second selection are determined based
on a criterion, the criterion including at least in part the following qualifiers:
which of the at least two loudspeakers are closer to the first user; and
which of the at least two loudspeakers the first user is facing; and
wherein the loudspeakers that form part of the third selection are determined based
on a criterion, the criterion including at least in part the following qualifiers:
which of the at least two loudspeakers are closer to the second user; and
which of the at least two loudspeakers the second user is facing.
[0010] In one or more embodiments, the apparatus includes means configured to change the
loudspeakers that form the second selection and the loudspeakers that form the third
selection based on the respective first position information and the respective second
position information being indicative of movement of the location of one or both of
the first user and the second user in the presentation space.
[0011] In one or more embodiments, said change in the loudspeakers that form the second
selection and the loudspeakers that form the third selection is further based on the
elapse of a predetermined amount of time since any previous change in the loudspeakers
that form the second selection and the loudspeakers that form the third selection.
[0012] In one or more embodiments, the means are configured to cease presentation of the
secondary audio of one or both of the first audio content and the second audio content
and maintain the presentation of the primary audio of both the first audio content
and the second audio content, in response to the first position information and the
second position information being indicative of the first user and the second user
having moved to within a predetermined distance of one another.
[0013] In one or more embodiments, said means are configured to, in response to the first
position information and the second position information being indicative of the first
user and the second user having moved to being within a predetermined distance of
one another,
provide for presentation of the secondary audio of the first audio content from the
loudspeaker of the at least two loudspeakers that is the closest loudspeaker to the
first user that is closer to the first user than it is to the second user in preference
to the second selection, and
provide for presentation of the secondary audio of the second audio content from the
loudspeaker of the at least two loudspeakers that is the closest loudspeaker to the
second user that is closer to the second user than it is to the first user in preference
to the third selection, and
maintain presentation of the primary audio of both the first audio content and the
second audio content.
[0014] In one or more embodiments, the first audio content and the second audio content
comprise audio from respective first and second remote persons in telecommunication
with the first user and second user respectively, and said means are configured to
provide for presentation of the secondary audio of the first audio content using a
fourth selection of the loudspeakers and presentation of the secondary audio of the
second audio content using a fifth selection of the loudspeakers, the fourth and fifth
selections comprising overlapping selections of the loudspeakers, in response to user
input received from one or both of the first user and the second user.
[0015] In one or more examples, the fourth and fifth selections are formed of the same selection
of speakers. In one or more examples, the fourth and fifth selections comprise all
of the at least two speakers in the presentation space.
[0016] In one or more embodiments, the means are configured to provide for presentation
of one of the first audio content and second audio content with a sixth selection
of the loudspeakers, the sixth selection comprising a greater number of loudspeakers
than the second selection and a greater number of loudspeakers than the third selection,
in response to an end of the other of the first audio content and second audio content
being reached such that there is no further audio content to present.
[0017] In one or more embodiments, the primary audio is provided for presentation using
a selection of the at least two loudspeakers based on primary-audio-criteria, wherein
the primary-audio-criteria is different to criterion used to determine one or more
of the first selection, the second selection and the third selection of loudspeakers
of the at least two loudspeakers.
[0018] In one or more embodiments, the means are configured to provide for the simultaneous
presentation of the first audio content and the second audio content to at least three
loudspeakers at known locations in the presentation space.
[0019] In one or more embodiments, one or both of the first audio content and the second
audio content comprises audio-type information which designates which audio of the
respective first and second audio content comprises the primary audio and which audio
of the respective first and second audio content comprises the secondary audio.
[0020] In one or more examples, the apparatus is configured to provide for determination
of which audio of one or both of the first audio content and second audio content
is the primary audio, and which is the secondary audio based on audio analysis.
[0021] In a further aspect there is provided a method, the method comprising:
receiving first position information indicative of at least the location of a first
user with respect to a presentation space; and
receiving second position information indicative of at least the location of a second
user with respect to the presentation space;
providing for presentation, using a first selection of loudspeakers comprising at
least two loudspeakers having known locations in the presentation space, of secondary
audio of first audio content to the first user based on the first position information
being indicative of the first user being in the presentation space and the second
position information being indicative of the second user being outside the presentation
space; and
providing for presentation of secondary audio of the first audio content using a second
selection of the loudspeakers and the secondary audio of the second audio content
using a third selection of the loudspeakers, the second selection comprising fewer
loudspeakers than the first selection, and the second and third selections comprising
non-overlapping selections of the loudspeakers based on the location of the first
user and the second user in the presentation space determined from the first position
information and the second position information, wherein the first audio content and
the second audio content each comprise primary audio and the secondary audio, the
secondary audio being different to the primary audio and comprising ambient audio.
[0022] In a further aspect there is provided a computer readable medium comprising computer
program code stored thereon, the computer readable medium and computer program code
being configured to, when run on at least one processor, perform the method of:
receiving first position information indicative of at least the location of a first
user with respect to a presentation space; and
receiving second position information indicative of at least the location of a second
user with respect to the presentation space;
providing for presentation, using a first selection of loudspeakers comprising at
least two loudspeakers having known locations in the presentation space, of secondary
audio of first audio content to the first user based on the first position information
being indicative of the first user being in the presentation space and the second
position information being indicative of the second user being outside the presentation
space; and
providing for presentation of secondary audio of the first audio content using a second
selection of the loudspeakers and the secondary audio of the second audio content
using a third selection of the loudspeakers, the second selection comprising fewer
loudspeakers than the first selection, and the second and third selections comprising
non-overlapping selections of the loudspeakers based on the location of the first
user and the second user in the presentation space determined from the first position
information and the second position information, wherein the first audio content and
the second audio content each comprise primary audio and the secondary audio, the
secondary audio being different to the primary audio and comprising ambient audio.
[0023] In a further example aspect there is provided an apparatus comprising:
at least one processor; and
at least one memory including computer program code,
the at least one memory and the computer program code configured to, with the at least
one processor, cause the apparatus to perform at least the following:
receive first position information indicative of at least the location of a first
user with respect to a presentation space; and
receive second position information indicative of at least the location of a second
user with respect to the presentation space;
provide for presentation, using a first selection of loudspeakers comprising at least
two loudspeakers having known locations in the presentation space, of secondary audio
of first audio content to the first user based on the first position information being
indicative of the first user being in the presentation space and the second position
information being indicative of the second user being outside the presentation space;
and
provide for presentation of secondary audio of the first audio content using a second
selection of the loudspeakers and the secondary audio of the second audio content
using a third selection of the loudspeakers, the second selection comprising fewer
loudspeakers than the first selection, and the second and third selections comprising
non-overlapping selections of the loudspeakers based on the location of the first
user and the second user in the presentation space determined from the first position
information and the second position information, wherein the first audio content and
the second audio content each comprise primary audio and the secondary audio, the
secondary audio being different to the primary audio and comprising ambient audio.
[0024] The present disclosure includes one or more corresponding aspects, examples or features
in isolation or in various combinations whether or not specifically stated (including
claimed) in that combination or in isolation. Corresponding means and corresponding
functional units (e.g., function enabler, speaker selector, amplifier, display device)
for performing one or more of the discussed functions are also within the present
disclosure.
[0025] Corresponding computer programs for implementing one or more of the methods disclosed
are also within the present disclosure and encompassed by one or more of the described
examples.
[0026] The above summary is intended to be merely exemplary and non-limiting.
Brief Description of the Figures
[0027] A description is now given, by way of example only, with reference to the accompanying
drawings, in which:
figure 1 illustrates an example apparatus for providing for presentation of audio
in a presentation space;
figure 2 shows an example presentation space with a first user therein;
figure 3 shows the example presentation space with the first user and a second user
therein;
figure 4 shows the example presentation space with the first user and the second user
having moved to new locations therein;
figure 5 shows a first example of the presentation of audio when the first user and
the second user move close to one another;
figure 6 shows a second example of the presentation of audio when the first user and
the second user move close to one another;
figure 7 shows an example of the presentation of audio when the first user and second
user have provided an input to join separate telecommunication calls they were engaged
in;
figure 8 shows a second example of the presentation of audio when the first user and
second user have provided an input to join separate telecommunication calls they were
engaged in;
figure 9 shows a first stage of an example method of determining which speakers are
within a speaker selection for a user;
figure 10 shows a second stage of an example method of determining which speakers
are within a speaker selection for a user;
figure 11 shows a third stage of an example method of determining which speakers are
within a speaker selection for a user;
figure 12 shows a fourth stage of an example method of determining which speakers
are within a speaker selection for a user;
figure 13 shows a fifth stage of an example method of determining which speakers are
within a speaker selection for a user;
figure 14 shows a flowchart illustrating an example method; and
figure 15 shows a computer readable medium.
Description of Example Aspects
[0028] Telecommunication or telephony systems are being developed that provide for more
than monophonic capture and monophonic presentation of audio. Immersive telephony
systems are being developed, such as by the 3rd Generation Partnership Project (3GPP),
that will enable a new level of immersion in telephony services. Such systems may
provide for transmission of and presentation of immersive, spatial audio content.
This may enable receiving and sending of an enveloping sound scene from/to the telecommunication
call participants. Thus, when a remote user or person calls to a first user, the first
user can experience the audio environment around the remote user as if he/she was
physically located at the location of the remote user and vice versa.
[0029] In one or more examples, the audio provided as part of said telecommunication may
be categorised as primary audio and secondary audio. It will be appreciated that while
the examples herein relate to the provision of audio in the field of telecommunication,
the principles may be applied to other fields of audio presentation.
[0030] In one or more examples, the primary audio may comprise voice audio comprising audio
determined to be generated by a voice of one or more remote users in telecommunication
with a first user. The "voice" primary audio may be categorised at the point of capture
or at the point of play back using audio analysis techniques, or by a server or any
other entity involved in said telecommunication. The secondary audio may, in one or
more examples, comprise ambient audio comprising audio other than that determined
to be generated by the voice of one or more remote persons. The primary and secondary
audio may be presented differently.
[0031] In one or more examples, the primary audio may comprise spatial audio content that
includes directional information such that, when presented, it is perceived as originating
from one or more directions in accordance with the directional information. In one
or more examples, the secondary audio may comprise ambient audio comprising audio
without said directional information or without a direction of arrival distinguishable
above a threshold level. Again, the primary and secondary audio may be presented differently.
[0032] With primary audio that comprises spatial audio content, the direction from which
audio was received at the location of the remote user may be reproduced when presenting
the audio to the first user (or any other user) by use of spatial audio presentation.
[0033] Spatial audio comprises audio presented in such a way to a user that it is perceived
to originate from a particular location, as if the source of the audio was located
at that particular location. Spatial audio content comprises audio for presentation
as spatial audio and, as such, typically comprises audio having directional information
(either explicitly specified as, for example, metadata or inherently present in the
way the audio is captured), such that the spatial audio content can be presented such
that its component audio is perceived to originate from one or more points or one
or more directions in accordance with the directional information.
[0034] In one or more examples, non-spatial audio content may be presented as spatial audio.
Thus, "conventional" monophonic or stereophonic audio may be provided for presentation
such that it will be perceived to originate from a particular location or direction.
One or more of the embodiments described herein may present spatial audio based on
spatial audio content or non-spatial audio content.
[0035] The spatial positioning of the spatial audio may be provided by 3D audio effects,
such as those that utilise a head related transfer function to create a spatial audio
space (aligned with a real-world space in the case of augmented reality) in which
audio can be positioned for presentation to a user. Spatial audio may be presented
by headphones by using head-related-transfer-function (HRTF) filtering techniques
or, for loudspeakers, by using vector-base-amplitude panning techniques to position
the perceived aural origin of the audio content. In other embodiments Ambisonics audio
presentation may be used to present spatial audio.
[0036] Spatial audio may use one or more of volume differences, timing differences and pitch
differences between audible presentation to each of a user's ears to create the perception
that the origin of the audio is at a particular location or in a particular direction
in space. The perceived distance to the perceived origin of the audio may be rendered
by controlling the amount of reverberation and gain to indicate closeness or distance
from the perceived source of the spatial audio.
[0037] In one or more examples, an apparatus 100 may be provided that is configured to provide
for presentation of secondary audio to one or more users.
[0038] Figure 1 shows an example overview of a system 110 in which such an apparatus 100
may be used. The system may include means to receive audio content, such as a first
audio input 101 and second audio input 102. It will be appreciated that the system
110 may include further audio inputs. The audio inputs 101, 102 may receive audio
from one or more telecommunication devices (not shown). The telecommunication devices
may comprise mobile telephones or any other telecommunication equipment. In one or
more other examples, the audio inputs 101, 102 receive audio from other sources, in
the field of telecommunication or not.
[0039] The apparatus 100 may be configured to provide for presentation of the audio content
from the first audio input 101 and the second audio input 102 to a plurality of loud
speakers 103, 104, 105, 106. The speakers 103-106 may be arranged in a presentation
space 107, such as a room or part of a room.
[0040] The apparatus 100 may comprise or be connected to a processor 108 and a memory 109
and may be configured to execute computer program code. The apparatus 100 may have
only one processor 108 and one memory 109 but it will be appreciated that other embodiments
may utilise more than one processor and/or more than one memory (e.g. same or different
processor/memory types). Further, the apparatus 100 may be an Application Specific
Integrated Circuit (ASIC).
[0041] The processor may be a general purpose processor dedicated to executing/processing
information received from other components, such as received from the audio inputs
101, 102, or a user position information sensor 111 in accordance with instructions
stored in the form of computer program code in the memory. The output signalling generated
by such operations of the processor is provided onwards to further components, such
as to the speakers 103-106 or an amplifier or other audio presentation equipment (not
shown) to provide the audio to the speakers 103-106.
[0042] The user position information sensor 111 may comprise one or more cameras for tracking
the position of the one or more users in the presentation space 107. In one or more
other examples, the sensor 111 may be configured to track the position of tags, such
as RFID tags, worn by the one or more users. In one or more examples, the sensor 111
may be embodied as a plurality of sensors arranged at least within the presentation
space 107. It will be appreciated that there are many ways the position and, optionally,
the orientation, of one or more users may be tracked. In one or more embodiments,
the sensor 111 may be associated with circuitry for performing the tracking of users
and providing position information to the apparatus 100. In one or more examples,
the apparatus 100 may use data from the sensor to track the one or more users and
generate the position information.
[0043] The memory 109 (not necessarily a single memory unit) is a computer readable medium
(solid state memory in this example, but may be other types of memory such as a hard
drive, ROM, RAM, Flash or the like) that stores computer program code. This computer
program code stores instructions that are executable by the processor, when the program
code is run on the processor. The internal connections between the memory and the
processor can be understood, in one or more example embodiments, to provide an active
coupling between the processor and the memory to allow the processor to access the
computer program code stored on the memory.
[0044] In this example, the respective processors and memories are electrically connected
to one another internally to allow for electrical communication between the respective
components. In this example, the components are all located proximate to one another
so as to be formed together as an ASIC, in other words, so as to be integrated together
as a single chip/circuit that can be installed into an electronic device. In some
examples one or more or all of the components may be located separately from one another.
[0045] Example figures 2 to 8 below show various positions of a first user 201 and a second
user 202 inside the presentation space 107 (and outside the presentation space in
the case of example figure 2).
[0046] In this and one or more examples, the apparatus 100 may be configured to receive
first audio content, such as at first audio input 101, intended for presentation to
the first user 201 and, at the same or later time, second audio content, such as at
second audio input 102, for presentation to the second user 202 (shown in figures
3 to 8). The first audio content may comprise audio content from a remote user with
which the first user 201 is in telecommunication. The second audio content may comprise
audio content from a different remote user with which the second user 202 is in telecommunication.
In other examples, the first and second audio content may comprise audio content outside
the field of telecommunication, such as an audiobook. In one or more other examples,
the apparatus 100 may not receive the audio content itself but receives information
indicative that it is available for presentation by audio presentation equipment.
The apparatus 100 may then provide for control of the audio presentation equipment.
[0047] As described above, the first audio content and the second audio content may be categorised
into primary audio content and secondary audio content.
[0048] The example apparatus 100 is configured to receive first position information indicative
of at least the location of the first user 201 with respect to the presentation space
107 and, optionally, the orientation of the first user 201. The first position information
may be received from the sensor 111 or from a different device.
[0049] The example apparatus 100 is configured to receive second position information indicative
of at least the location of the second user 202 with respect to the presentation space
107 and, optionally, the orientation of the second user 202. The second position information
may be received from the sensor 111 or from a different device.
[0050] In this example, the apparatus 100 is configured to provide for the simultaneous
presentation of the first audio content and the second audio content to the four loudspeakers
comprising the first speaker 103, the second speaker 104, third speaker 105, and fourth
speaker 106. The simultaneous presentation may be such that the first and second audio
content is presented at the same time but not necessarily from the same speaker(s),
as will be described in more detail below. The apparatus 100 may, in one or more examples,
be configured to provide an audio signal for each loudspeaker 103-106. In one or more
examples, the apparatus 100 may be configured to provide one or more control signals
to audio presentation equipment to control how the audio presentation equipment mixes
the first and second audio content to each speaker 103-106.
[0051] The apparatus 100 may be provided with information regarding the location of the
first through fourth loudspeakers 103-106 in the presentation space 107. This known
location information may be pre-set or may be determined by the apparatus 100 prior
to or at the time of determining selection(s) of loudspeakers to which to present
the audio content. It will be appreciated that the location of the loudspeakers are
known to the apparatus at the time the second and third speaker selections are determined
but it may be inconsequential how far in advance the speakers locations are determined
or provided to the apparatus.
[0052] Example figure 2 shows the first user 201 in the presentation space 107. The apparatus
100 or a different apparatus, such as the audio presentation equipment, may provide
for presentation of the secondary audio using a first selection of speakers 103-106.
In this example, the first selection comprises all of the speakers 103-106 available
in the presentation space 107. The circle 203 illustrates that the secondary audio
of the first audio content is presented from all of the speakers and therefore the
ambient audio of the first audio content can be heard to surround the first user 201,
due to the speakers being arranged to surround the first user 201.
[0053] The criteria for selecting which loudspeakers comprise the first selection is based,
at least in part, on the first position information being indicative of the first
user 201 being in the presentation space 107 and the second position information being
indicative of the second user (not present in figure 2) being outside the presentation
space 107. The second position information may be considered to be indicative of the
second user being outside the presentation space 107 by one or more of specifying
a location for the second user that is outside the presentation space, not specifying
a location at all, or the absence of second position information at the time of presentation
of the first audio content. In this example, the criteria may comprise use all available
speakers when only one user requires secondary audio to be presented to them in the
presentation space 107. It will be appreciated that in one or more other examples,
the criteria may be different. For example, where the presentation space 107 is large
and many speakers are provided therein, the criteria for the first selection may comprise
present the secondary audio from all speakers within a predetermined distance of the
first user 201. Other criteria may be used to determine the speakers of the first
selection.
[0054] The apparatus 100 may include means configured to provide for the presentation of
the primary audio of the first audio content from one or more of the speakers 102
-106, such as from the loudspeaker or loudspeakers that are closest to the first user
201 (e.g. first speaker 103 or a subset comprising first and second speakers 103,
104). In one or more examples, the primary audio is provided for presentation from
the speaker or speakers that most directly faces the first user, such as third and
fourth speakers 105, 106. In other examples, the primary audio may comprise spatial
audio and therefore, the direction from which the first user 201 perceives the audio
may be based on the directional information of the primary audio of the first audio
content. Thus, the speakers used may be determined based on the spatial audio presentation
of the primary audio. It will be appreciated that various primary-audio-criteria may
be used to determine the speakers used for presentation of the primary audio.
[0055] Example figure 3 shows the presentation space 107 including both the first user 201
and the second user 202. The second user 202 may start a telecommunication call and
therefore require the presentation of the associated second audio content by the speakers
103-106 or may be conducting a telecommunication call and enter the presentation space
107. Accordingly, the apparatus 100 is configured to select which of the loudspeakers
103-106 to use to render the ambient secondary audio of the first audio content to
the first user 201 and which of the speakers 103-106 to use to render the ambient
secondary audio of the second audio content to the second user 202. The rendering
of the secondary audio for both the first and second users 201, 202 from all of the
speakers may be confusing and distracting. Accordingly, the selection of speakers
is preferably required to provide an impression of the ambient scene created by the
secondary audio but should not be distracting for the other of the first and second
user 201, 202.
[0056] In example figure 3, the apparatus 101 is configured to provide for presentation
of the secondary audio of the first audio content for the first user 201 using a second
selection of the speakers 103-106 rather than the first selection shown in figure
2. The apparatus 101 is also configured to provide for presentation of the secondary
audio of the second audio content for the second user 202 using a third selection
of the speakers 103-106.
[0057] In this example, the second selection, represented by oval 301, comprises the first
and second speakers 103, 104. In this example, the third selection, represented by
the second oval 302, comprises the third and fourth speakers 105, 106. Thus, the secondary
audio of the first audio content is only presented by a subset of the four speakers
and the second audio of the second audio content is only presented by a subset of
the four speakers 103-106.
[0058] In particular, the second selection comprises a fewer number of loudspeakers (e.g.
two) than the first selection (e.g. four). Thus, as the ambient secondary audio of
the first audio content can no longer be presented from all speakers, because the
presentation space 107 is shared by a plurality of users who require the presentation
of audio content, a compromise may be required.
[0059] Further, in one or more examples, the second selection and third selection comprise
non-overlapping selections of the speakers 103-106. Accordingly, speakers used to
present the secondary audio of the first audio content are not used to present the
secondary audio of the second audio content.
[0060] Still further, the second and third selections may be based on the location of the
first and second users 201, 202 in the presentation space 107 determined from the
first position information and the second position information.
[0061] The criterion for selecting the loudspeakers for the second and third selections
may comprise determining which non-overlapping subset of speakers are closer to the
first user 201 and which speakers are closer to the second user 202. In one or more
examples, the number of speakers assigned to each of the first and second selection
may be equal or close to equal in the event an odd number of speakers are used.
[0062] In one or more examples, the second selection may be determined prior to the third
selection or vice versa. In one or more examples, the third selection may comprise
the remaining speakers after determining the speakers that make up the second selection
or vice versa. In one or more examples, the user who first initiated the telecommunication
may have their speaker selection determined first.
[0063] In one or more examples, the criteria for determining the second and third selections
may comprise determining, for each speaker, which user 201, 202 is closer thereto
and assigning the speaker to the second selection if the first user 201 is closer
and assigning the speaker to the third selection if the second user 202 is closer.
An example strategy for determination of speaker selections is provided below with
reference to figures 9 to 13. However, in principle, various qualifiers or conditions
may be used to determine which speakers are assigned to which speaker selection based
on the location of the users. In one or more examples, a first qualifier may be to
identify the closest speaker or speakers to the respective users. If this does not
yield an independent or partially independent selection of speakers then further qualifiers
may be applied. In one or more examples, the speaker(s) that is closest to one of
the users than the other is assigned to the speaker selection for that user and the
other user is assigned the closest or next closest speaker(s) that is not part of
the other selection. In one or more examples, the apparatus 100 may continue to assess
each of the other speakers to determine which selection it should be part of.
[0064] In one or more examples, the first position information is also indicative of the
orientation of the first user 201 in the presentation space 107. In one or more examples,
the second position information may be indicative of the orientation of the second
user 202 in the presentation space 107. Accordingly, the apparatus 100 may be configured
to determine which of the speakers 103-106 the first and second users 201, 202 are
facing. Thus, the second selection and the third selection may be determined based
on a combination of which of the at least two speakers are closer to the first user
201 and the second user 202 and which of the at least two speakers the first user
201 and the second user 202 are facing. It will be appreciated that the assignment
of the speakers to the second and third selections may comprise one or more criteria
that needs to be met in terms of one or more of location and orientation in the presentation
space 107 and which user 201, 202 is closer to particular speakers than the other
user.
[0065] The apparatus 100 may include means configured to provide for the presentation of
the primary audio of the first audio content and the primary audio of the second audio
content when the first and second position information is indicative of both the first
and second users being within the presentation space. In one or more examples, the
criteria for selecting which of the one or more speakers 103 -106, from which to present
the primary audio of the first audio content is different to the criteria used for
selecting which of the one or more speakers 103 -106, from which to present the secondary
audio of the first audio content. Likewise, in one or more examples, the criteria
for selecting which of the one or more speakers 103 -106, from which to present the
primary audio of the second audio content is different to the criteria used for selecting
which of the one or more speakers 102 -106, from which to present the secondary audio
of the second audio content. In terms of the presentation of the primary audio, the
same primary-audio-criteria as described for Figure 2 may be used.
[0066] As mentioned in relation to figure 1 and 2, the apparatus 100 may be configured to
provide for presentation of the primary audio from the speaker or speakers that are
closest to the respective user or the speaker or speakers that most directly face
the respective user. In one or more other examples, the primary audio may comprise
spatial audio and therefore, the direction from which the users 201, 202 perceives
the audio may be based on the directional information of the primary audio of the
first audio content and second audio content respectively.
[0067] Example figure 4 shows the first user 201 having moved within the presentation space
107. Example figure 4 also shows the second user 202 having moved within the presentation
space 107. The apparatus 100 may include means configured to, based on movement of
the location of one or both of the first user 201 and the second user 202 in the presentation
space 107 as indicated in the respective first position information and second position
information, provide for a change in the speakers 103-106 that comprise the second
selection and the speakers 103-106 that comprise the third selection.
[0068] Thus, in one or more examples, one or both of the second selection and the third
selection comprise a dynamic selection of speakers based at least in part on the current
position of the first user and second user respectively.
[0069] In example figure 4, the apparatus 100 has updated the second selection to comprise
the first speaker 103 and the third speaker 105, as shown by oval 401. In example
figure 4, the apparatus 100 has updated the third selection to comprise the second
speaker 104 and the fourth speaker 106, as shown by oval 402.
[0070] The same criteria may be used to assign speakers to the second and third selections
when updating the selections, as mentioned above. Alternatively, different criteria
may be used to update the second and/or third selections after an initial assignment
of speakers to the second and/or third selections. For example, it may be confusing
if the criteria used to assign speakers to the second and third selections resulted
in the borderline assignment of a particular speaker to either the second or third
selection due to the relative positions of the first and second user 201, 202. In
such a circumstance, small movements of the one or both users may result in updating
of the second and third selections which may be confusing or annoying.
[0071] Thus, in one or more examples, a predetermined amount of time, which may be known
as a "no-update time" may be defined wherein the assignment of speakers to the second
and third selections is not updated despite the first position information and second
position information being indicative of movement that would otherwise result in the
updating of the second and/or third selections. In one or more examples the predetermined
amount of time may be greater than 1 second, 5 seconds, 10 seconds or 30 seconds.
In one or more examples, the predetermined amount of time may be based on a determined
speed of movement of the first and/or second users based on the first position information
and second position information. Thus, if the first and/or second users 201, 202 are
moving quickly around the presentation space 107, then the second and third selections
may be updated more quickly. However, if they are moving slowly, then the predetermined
amount of time may be greater so the second and third selections may be updated more
slowly. In one or more examples, the apparatus may provide for a no-update-distance
similar to the no-update-time. Thus, in one or more examples, the apparatus 100 is
configured to perform an update of the assignment of speakers to the second and third
selections based on the determined distance of movement of the first and/or second
users being above the no-update-distance. Thus, small movements of the users, less
than the no-update distance, may not trigger an update to the speaker selections whereas
a greater movement, greater than the no-update distance, will cause the apparatus
to provide for updating of the selections.
[0072] In terms of the presentation of the primary audio, the same primary-audio-criteria
as described for Figure 2 may be used.
[0073] Example figure 5 shows the first user 201 and the second user 202 having moved to
positions in the presentation space 107 that are within a predetermined distance 500
of one another. If the first and second users 201, 202 move close to each other, they
may have a higher likelihood of hearing each other's audio content. To make the first
and second audio content easier to listen to for the respective users, the apparatus
100 may be configured to stop playing the secondary audio or "ambient audio" of the
first and/or second audio content entirely. In one or more examples, the apparatus
100 may be configured to provide for presentation of the primary audio of the first
and/or second audio content from just one or more (e.g. not all) of the nearest speakers
to the corresponding users 201, 202.
[0074] Thus, example figure 5 shows, based on the first position information and the second
position information, the first user 201 and the second user 202 having moved to being
within a predetermined distance 500 of one another. The apparatus 100 is thus configured
to provide for ceasing of presentation of the secondary audio of one or both of the
first audio content and the second audio content and maintaining presentation of the
primary audio of both the first audio content and the second audio content. The absence
of an oval (or circular) selection in example figure 5 shows the absence of the presentation
of the secondary audio of both the first audio content and the second audio content.
While the primary audio of both the first audio content and the second audio content
may continue to be presented, it will be appreciated that the selection of which speakers
103-106 are used may change based on the first user 201 and the second user 202 having
moved to being within a predetermined distance 500 of one another. In one or more
examples, the same primary-audio-criteria as described for Figure 2 may be used or
a different criterion.
[0075] Example figure 6 shows an example of an alternative action by apparatus 100 in response
to the first user 201 and the second user 202 having moved to positions in the presentation
space 107 that are within a predetermined distance 500 of one another. In this and
one and more examples, the apparatus 100 is configured to, in response to the first
position information and the second position information being indicative of the first
user 201 and the second user 202 having moved to being within a predetermined distance
500 of one another, change from presentation of the secondary audio using the second
and third selections.
[0076] In particular, the apparatus 100 may be configured to provide for presentation of
the secondary audio of the first audio content from the speaker that is closest to
the first user 201 and closer to the first user 201 than the second user 202, which
in this example comprises the third speaker 105, as shown by selection circle 601.
Further, the apparatus 100 may be configured to provide for presentation of the secondary
audio of the second audio content from the speaker that is closest to the second user
202 and closer to the second user 202 than the first user 201, which in this example
comprises the fourth speaker 106, as shown by selection circle 602.
[0077] Thus, in some examples, the ambient, secondary audio may be important for the telecommunication
call because, for example, it carries the sound of children and the call is with the
grandparents. In one or more examples, the apparatus 100 may provide the secondary
audio of the first audio content for presentation additionally based on first-user-input
indicative of them wanting to hear the secondary audio. In one or more examples, the
apparatus 100 may provide the secondary audio of the first audio content for presentation
additionally based on remote-user-input indicative of them wanting the first user
201 to hear the secondary audio. Similarly, the second user may provide corresponding
second-user-input to hear the secondary audio of the second audio content and/or the
remote user with which the second user is telecommunicating may provide said remote-user-input
indicative of them wanting the second user 202 to hear the secondary audio. The apparatus
may thus be configured to act on said second-user-input and/or remote-user-input by
presenting the secondary audio from at least one of the loudspeakers.
[0078] The apparatus 100 may be configured to maintain presentation of the primary audio
of both the first audio content and the second audio content. As before, while the
primary audio of both the first audio content and the second audio content may continue
to be presented, it will be appreciated that the selection of which speakers 103-106
are used may change based on the first user 201 and the second user 202 having moved
to being within a predetermined distance 500 of one another compared to when they
were further than the predetermined distance 500 from one another.
[0079] In one or more examples, the first audio content is audio from a remote user with
which the first user 201 is communicating and the second audio content is audio from
a different remote user with which the second user 202 is communicating. In one or
more examples, the first user 201 and/or the second user 202 may be desirous of joining
the telecommunication calls to provide multi-party "conference call" in which the
first user 201, second user 202 and remote persons with which the first and second
user are communicating can all hear each other and communicate.
[0080] Example figure 7 shows how the apparatus 100 may provide for presentation of the
secondary audio based on a user input received from one or both of the first user
201 and the second user 202 indicative of a desire to join the telecommunication to
provide for telecommunication between any two of the first and second user 201, 202
and the first and second remote persons (not shown).
[0081] In this example, the apparatus 100 is configured to provide for presentation of the
secondary audio of the first audio content using a fourth selection 701 of the speakers
and presentation of the secondary audio of the second audio content using a fifth
selection 702 of the speakers 103-106. The fourth and fifth selections may comprise
overlapping selections of the speakers 103-106. In the example shown in figure 7 and
in one or more other examples, the fourth 701 and fifth 702 selections are the same
selection of speakers 103-106. In one or more examples, the fourth and fifth selections
comprise all of the speakers 103-106 in the presentation space 107.
[0082] The primary audio of the first audio content and the second audio content may also
be presented from the speakers 103-106 that are selected independently using different
criteria than the criteria used to determine the fourth and fifth selections for the
secondary audio.
[0083] Example figure 8 shows how the apparatus 100 may provide for presentation of the
secondary audio based on a user input received from one or both of the first user
201 and the second user 202 indicative of a desire to join the telecommunication to
provide for telecommunication between any two of the first and second user 201, 202
and more than two remote persons (not shown). For example, the first user may be in
communication with a first remote user and the second user may be in communication
with two remote users at the time the telecommunication is combined. Alternatively,
the third remote user may join the telecommunication after the first and second users
have joined their calls. Thus, in example figure 8, there may be third or more audio
content due to the number of remote users. The presentation of the secondary audio
from many, e.g. more than two, remote users may be confusing. Therefore, in one or
more examples, the apparatus 100 is configured to, based on the first audio content,
the second audio content and at least third audio content for presentation to at least
both the first user and the second user, provide for grouping of the secondary audio
of the first, second and at least third audio content into two or more groups for
presentation from different respective groups of the speakers.
[0084] In this example, the secondary audio from the first audio content and the secondary
audio from the second audio content is grouped and presented from the first and third
speakers 103, 105 as shown by selection 801. In this example, the secondary audio
from the third audio content forms the other group and is presented from the second
and fourth speakers 104, 106 as shown by selection 802.
[0085] In one or more examples, when one of the telecommunication calls finishes and therefore
the first audio content or second audio content is no longer available for presentation,
the selection of speakers used for presentation of the secondary audio may be updated.
Thus, based on an end of one of the first audio content and second audio content being
reached such that there is no further audio content to present, the apparatus 100
may provide for presentation of the other of the first audio content and second audio
content with a sixth selection (not shown) of the speakers, the sixth selection comprising
a greater number of speakers than the second and third selections. In one or more
examples, the sixth selection may comprise the first selection shown in example figure
2. Thus, the sixth selection may comprise all of the speakers of the presentation
space 107 or a subset thereof based on distance between the user and the speakers
103-106.
[0086] It will be appreciated that the number of speakers 103-106 may be greater or less
than four as shown in the examples but may be at least two or at least three speakers.
[0087] In one or more examples, one or both of the first audio content and the second audio
content comprises audio-type information which designates which audio of the respective
first and second audio content comprises primary audio and which audio comprises secondary
audio. Alternatively, in one or more examples, the apparatus 100 is configured to
provide for determination of which audio of one or both of the first audio content
and second audio content is the primary audio and which is the secondary audio based
on audio analysis.
[0088] Thus, in one or more examples, the audio of the first audio content may be labelled
with a category comprising either secondary "ambient" audio or primary audio. In one
or more examples, the labelling of the audio may be performed at a device of the remote
person at the time of capture. In such an example, the first audio content may be
separated into a primary audio stream and a secondary audio stream. In one or more
examples, a server may perform the labelling of the audio and/or creation of said
different component audio streams. In one or more examples, the apparatus 100 may
be configured to determine which audio is primary user audio and which is secondary
audio. In any of the examples, the determination of primary audio from secondary audio
may be performed using any appropriate audio analysis technique and may be based on
the frequency range of the audio at a particular time, the degree of reverberation,
the volume or any other audiological factor. The determination of primary audio from
secondary audio may be performed based on predetermined speech samples of the remote
person. In other examples, the first/second audio content may be recorded by a plurality
of microphones and may be determined to be primary audio or secondary audio based
on differences between how the audio was captured by the plurality of microphones.
For example, audio deemed to be received at a greater volume or with a particular
degree of reverberation by a front facing microphone relative to a rearfacing microphone
may be deemed to be primary audio (possibly once subject to audio processing). The
remaining audio or audio received at a greater volume or with a particular degree
of reverberation by the rear facing microphone relative to the front facing microphone
may be determined to be secondary audio (possibly once subject to audio processing).
It will be appreciated that various audio processing techniques may be used to separate
the first user audio and the ambient audio.
[0089] In one or more examples, the first audio content and the second audio content may
comprise telecommunication audio content or any other audio content that may be categorized.
For example, an audiobook may provide the reader of the story as primary audio and
supplementing sounds as secondary audio.
[0090] Figure 9 shows a first stage of an example process for determining loudspeaker selections,
such as the second and third selections. Figure 9 shows an arrangement of six speakers
901-906 (first through sixth) in a presentation space 907 in which the first user
201 and the second user 202 are present. In this and one or more examples, it is a
predetermined requirement of the speaker selection process for each speaker selection
to include two speakers. It will be appreciated that other requirements are possible.
A first stage of an example speaker selection determination process comprises forming
all possible speaker pairs and connecting the pairs with straight lines, shown by
arrows 908.
[0091] An example second stage comprises removal from consideration of loudspeaker pairs
which have associated connecting lines 908 that cross one another. Thus, in this example,
the follow speaker pairs are discounted - first-third speaker pair, first-fourth speaker
pair, fifth-second speaker pair, fifth-third speaker pair and fourth-sixth speaker
pair. This leaves the speakers pairs that have non-intersecting connecting lines,
which comprise first-second pair 910, second-third pair 911, third-fourth pair 912,
fourth-fifth pair 913, fifth-first pair 914, fifth-sixth pair 915 and the first-sixth
pair 916. Example figure 10 shows only the non-intersecting connecting lines and thus
the remaining pairs of speakers 901-906.
[0092] Example figure 10 also illustrates the determination of a mid-point between the remaining
speaker pairs along their associated connecting lines, marked by a cross in Figure
10 along each connecting line.
[0093] Example figure 11 illustrates that, for each user 201, 202, the apparatus is configured
to provide for the determination of the distance between the respective user 201,
202 and each of the determined mid-points. The second selection for the first user
201 is then determined based on which mid-point the first user 201 is closest to.
The closest mid-point is associated with a speaker pair and the speakers of the speaker
pair are determined to form the second selection. In this example, the second selection
for the first user 201 is determined to comprise the first and sixth speakers as shown
by dashed oval 921.
[0094] The third selection may also be determined based on which mid-point the second user
202 is closest to, which in this example is the mid-point of the first-second speaker
pair on connecting line 910. Dashed oval 922 represents the (possible) third selection.
However, in one or more examples, a further qualifier for determination of the speaker
selection comprises that the second and third speaker selections must be mutually
exclusive in terms of the speakers that form the selections. That is, there must not
be any overlap between the speaker selections. Dashed ovals 921 and 922 are shown
to overlap and therefore the apparatus may be configured to identify a different selection
of speakers for the third selection.
[0095] Example figure 12 shows the apparatus 100 providing for determination of which is
the second closest mid-point to the second user 202 for determination of the third
selection. In this example, the mid-point of the connecting line 911 is the second
closest to the second user 202, which thereby identifies the second and third speakers
902, 903 as the speakers that form the third selection, shown by dashed oval 923.
Accordingly, the process has identified the first and sixth speakers 901, 906 as forming
the second selection and the second and third speakers 902, 903 as forming the third
selection.
[0096] Thus, in summary, the apparatus 100 may be configured to identify the speaker or
speakers that are closest to each of the users, wherein if said selected speakers
are not mutually exclusive then the speaker or speakers that are second closest to
one of the users are determined and wherein the speakers determined based on the location
of the first user form the second selection and the speakers determined based on the
location of the second user form the third selection. It will be appreciated that
the process may be followed for determining the speakers of selections disclosed herein
other than the second and third selections. In one or more examples, only once the
speaker selections have been determined is the apparatus 100 configured to begin to
use those speakers for presentation of audio.
[0097] Further, the apparatus may be configured to provide for the following:
identification of all possible speaker pairs from the plurality of speakers;
determination of a connecting line between each of the determined speaker pairs;
discounting of any speaker pairs than have a connecting line that intersects with
any other connecting line to identify a set of speaker pairs for further consideration;
determination of a mid-point along each of the connecting lines of each of the identified
set of speaker pairs;
determination of which mid-point each of the users are closest to; and
provided that the speakers associated with the connecting line having the mid-point
closest to the respective user are all different speakers, provide for determination
of the second selection based on the speakers associated with the connecting line
having the mid-point closest to the first user and provide for determination of the
third selection based on the speakers associated with the connecting line having the
mid-point closest to the second user.
[0098] In one or more examples, the apparatus is configured to, if the speakers are not
all different, for at least one of the users, determine the second closest mid-point
and provided that the speakers associated with the connecting line having the mid-point
that is second closest to the at least one of the users are all different speakers,
provide for determination of one of the second and third selection based on the speakers
associated with the connecting line having the mid-point second closest to the respective
user and provide for determination of the other of the second and third selection
based on the speakers associated with the connecting line having the mid-point closest
to the respective user.
[0099] In one or more examples, the second and third speaker selections may be expanded
to include one or more additional speakers, which may form part of the example selection
process. Example figure 13 shows the apparatus 100 configured to, for each of the
second and third selections, provide for determination of the nearest mid-point to
the mid-point of the speaker pair that currently forms part of the respective selection
and thereby determine at least one further speaker from the speaker pairs associated
with said nearest mid-point, and wherein, provided that said at least one additional
speaker is not part of the other of the second and third selection, expand said second
or third selection to include said at least one additional speaker.
[0100] In this example, the nearest mid-point to the mid-point of the first-sixth speaker
pair 916 of the second selection 921 is the mid-point of the fifth-first speaker pair
914. Accordingly, the second selection 921 may be expanded to include the fifth speaker
905. In this example, the nearest mid-point to the mid-point of the second-third speaker
pair 911 of the third selection is the mid-point of the third-fourth speaker pair
912. Accordingly, the third selection 923 may be expanded to include the fourth speaker
904. The expanded selections 921, 923 do not overlap in terms of the speakers of which
they are formed and therefore the apparatus 100 may be configured to provide for audio
presentation from the determined second and third selections 921, 923 respectively.
[0101] Figure 14 shows a flow diagram illustrating the steps of
receiving first position information 1400 indicative of at least the location of a
first user with respect to a presentation space; and
receiving second position information 1401 indicative of at least the location of
a second user with respect to the presentation space;
providing for presentation 1402, using a first selection of loudspeakers comprising
at least two loudspeakers having known locations in the presentation space, of secondary
audio of first audio content to the first user based on the first position information
being indicative of the first user being in the presentation space and the second
position information being indicative of the second user being outside the presentation
space; and
providing for presentation 1403 of secondary audio of the first audio content using
a second selection of the loudspeakers and the secondary audio of the second audio
content using a third selection of the loudspeakers, the second selection comprising
fewer loudspeakers than the first selection, and the second and third selections comprising
non-overlapping selections of the loudspeakers based on the location of the first
user and the second user in the presentation space determined from the first position
information and the second position information, wherein the first audio content and
the second audio content each comprise primary audio and the secondary audio, the
secondary audio being different to the primary audio and comprising ambient audio.
[0102] Figure 15 illustrates schematically a computer/processor readable medium 1500 providing
a program according to an example. In this example, the computer/processor readable
medium is a disc such as a digital versatile disc (DVD) or a compact disc (CD). In
some examples, the computer readable medium may be any medium that has been programmed
in such a way as to carry out an inventive function. The computer program code may
be distributed between the multiple memories of the same type, or multiple memories
of a different type, such as ROM, RAM, flash, hard disk, solid state, etc.
[0103] User inputs may be gestures which comprise one or more of a tap, a swipe, a slide,
a press, a hold, a rotate gesture, a static hover gesture proximal to the user interface
of the device, a moving hover gesture proximal to the device, bending at least part
of the device, squeezing at least part of the device, a multi-finger gesture, tilting
the device, or flipping a control device. Further the gestures may be any free space
user gesture using the user's body, such as their arms, or a stylus or other element
suitable for performing free space user gestures.
[0104] The apparatus shown in the above examples may be a portable electronic device, a
laptop computer, a mobile phone, a Smartphone, a tablet computer, a personal digital
assistant, a digital camera, a smartwatch, smart eyewear, a pen based computer, a
non-portable electronic device, a desktop computer, a monitor, a smart TV, a server,
a wearable apparatus, a virtual reality apparatus, or a module/circuitry for one or
more of the same.
[0105] Any mentioned apparatus and/or other features of particular mentioned apparatus may
be provided by apparatus arranged such that they become configured to carry out the
desired operations only when enabled, e.g. switched on, or the like. In such cases,
they may not necessarily have the appropriate software loaded into the active memory
in the non-enabled (e.g. switched off state) and only load the appropriate software
in the enabled (e.g. on state). The apparatus may comprise hardware circuitry and/or
firmware. The apparatus may comprise software loaded onto memory. Such software/computer
programs may be recorded on the same memory/processor/functional units and/or on one
or more memories/processors/ functional units.
[0106] In some examples, a particular mentioned apparatus may be pre-programmed with the
appropriate software to carry out desired operations, and wherein the appropriate
software can be enabled for use by a user downloading a "key", for example, to unlock/enable
the software and its associated functionality. Advantages associated with such examples
can include a reduced requirement to download data when further functionality is required
for a device, and this can be useful in examples where a device is perceived to have
sufficient capacity to store such pre-programmed software for functionality that may
not be enabled by a user.
[0107] Any mentioned apparatus/circuitry/elements/processor may have other functions in
addition to the mentioned functions, and that these functions may be performed by
the same apparatus/circuitry/elements/processor. One or more disclosed aspects may
encompass the electronic distribution of associated computer programs and computer
programs (which may be source/transport encoded) recorded on an appropriate carrier
(e.g. memory, signal).
[0108] Any "computer" described herein can comprise a collection of one or more individual
processors/processing elements that may or may not be located on the same circuit
board, or the same region/position of a circuit board or even the same device. In
some examples one or more of any mentioned processors may be distributed over a plurality
of devices. The same or different processor/processing elements may perform one or
more functions described herein.
[0109] The term "signalling" may refer to one or more signals transmitted as a series of
transmitted and/or received electrical/optical signals. The series of signals may
comprise one, two, three, four or even more individual signal components or distinct
signals to make up said signalling. Some or all of these individual signals may be
transmitted/received by wireless or wired communication simultaneously, in sequence,
and/or such that they temporally overlap one another.
[0110] With reference to any discussion of any mentioned computer and/or processor and memory
(e.g. including ROM, CD-ROM etc), these may comprise a computer processor, Application
Specific Integrated Circuit (ASIC), field-programmable gate array (FPGA), and/or other
hardware components that have been programmed in such a way to carry out the inventive
function.
[0111] The applicant hereby discloses in isolation each individual feature described herein
and any combination of two or more such features, to the extent that such features
or combinations are capable of being carried out based on the present specification
as a whole, in the light of the common general knowledge of a person skilled in the
art, irrespective of whether such features or combinations of features solve any problems
disclosed herein, and without limitation to the scope of the claims. The applicant
indicates that the disclosed aspects/examples may consist of any such individual feature
or combination of features. In view of the foregoing description it will be evident
to a person skilled in the art that various modifications may be made within the scope
of the disclosure.
[0112] While there have been shown and described and pointed out fundamental novel features
as applied to examples thereof, it will be understood that various omissions and substitutions
and changes in the form and details of the devices and methods described may be made
by those skilled in the art without departing from the scope of the disclosure. For
example, it is expressly intended that all combinations of those elements and/or method
steps which perform substantially the same function in substantially the same way
to achieve the same results are within the scope of the disclosure. Moreover, it should
be recognized that structures and/or elements and/or method steps shown and/or described
in connection with any disclosed form or examples may be incorporated in any other
disclosed or described or suggested form or example as a general matter of design
choice. Furthermore, in the claims means-plus-function clauses are intended to cover
the structures described herein as performing the recited function and not only structural
equivalents, but also equivalent structures. Thus, although a nail and a screw may
not be structural equivalents in that a nail employs a cylindrical surface to secure
wooden parts together, whereas a screw employs a helical surface, in the environment
of fastening wooden parts, a nail and a screw may be equivalent structures.
1. An apparatus comprising means configured to:
receive first position information indicative of at least the location of a first
user with respect to a presentation space; and
receive second position information indicative of at least the location of a second
user with respect to the presentation space;
provide for presentation, using a first selection of loudspeakers comprising at least
two loudspeakers having known locations in the presentation space, of secondary audio
of first audio content to the first user based on the first position information being
indicative of the first user being in the presentation space and the second position
information being indicative of the second user being outside the presentation space;
and
provide for presentation of secondary audio of the first audio content using a second
selection of the loudspeakers and the secondary audio of the second audio content
using a third selection of the loudspeakers, the second selection comprising fewer
loudspeakers than the first selection, and the second and third selections comprising
non-overlapping selections of the loudspeakers based on the location of the first
user and the second user in the presentation space determined from the first position
information and the second position information, wherein the first audio content and
the second audio content each comprise primary audio and the secondary audio, the
secondary audio being different to the primary audio and comprising ambient audio.
2. An apparatus according to claim 1, wherein the means are configured to provide for
presentation of the primary audio of the first audio content and the primary audio
of the second audio content from at least the loudspeaker of the at least two loudspeakers
that satisfies a criterion, the criterion including at least in part the following
qualifiers:
the loudspeaker that is the closest loudspeaker to the respective user that is closer
to the respective user than it is to the other respective user; and
the primary audio of the first audio content and the primary audio of the second audio
content are presented from different loudspeakers.
3. An apparatus according to claim 1 or claim 2, wherein the second selection is determined
based, at least in part, on which of the at least two loudspeakers are closer to the
first user than the second user and the third selection is determined based, at least
in part, on which of the at least two loudspeakers are closer to the second user than
the first user, wherein the second and third selections are non-overlapping.
4. An apparatus according to claim 1 or claim 2, wherein the first position information
is also indicative of the orientation of the first user in the presentation space
and the second position information is also indicative of the orientation of the second
user in the presentation space; and
wherein the loudspeakers that form part of the second selection are determined based
on a criterion, the criterion including at least in part the following qualifiers:
which of the at least two loudspeakers are closer to the first user; and
which of the at least two loudspeakers the first user is facing; and
wherein the loudspeakers that form part of the third selection are determined based
on a criterion, the criterion including at least in part the following qualifiers:
which of the at least two loudspeakers are closer to the second user; and
which of the at least two loudspeakers the second user is facing.
5. An apparatus according to claim 1 or claim 2, wherein the apparatus includes means
configured to change the loudspeakers that form the second selection and the loudspeakers
that form the third selection based on the respective first position information and
the respective second position information being indicative of movement of the location
of one or both of the first user and the second user in the presentation space.
6. An apparatus according to claim 5, wherein said change in the loudspeakers that form
the second selection and the loudspeakers that form the third selection is further
based on the elapse of a predetermined amount of time since any previous change in
the loudspeakers that form the second selection and the loudspeakers that form the
third selection.
7. An apparatus according to any preceding claim, wherein the means are configured to
cease presentation of the secondary audio of one or both of the first audio content
and the second audio content and maintain the presentation of the primary audio of
both the first audio content and the second audio content, in response to the first
position information and the second position information being indicative of the first
user and the second user having moved to within a predetermined distance of one another.
8. An apparatus according to any one of claims 1 to 6, wherein said means are configured
to, in response to the first position information and the second position information
being indicative of the first user and the second user having moved to being within
a predetermined distance of one another,
provide for presentation of the secondary audio of the first audio content from the
loudspeaker of the at least two loudspeakers that is the closest loudspeaker to the
first user that is closer to the first user than it is to the second user in preference
to the second selection, and
provide for presentation of the secondary audio of the second audio content from the
loudspeaker of the at least two loudspeakers that is the closest loudspeaker to the
second user that is closer to the second user than it is to the first user in preference
to the third selection, and
maintain presentation of the primary audio of both the first audio content and the
second audio content.
9. An apparatus according to any preceding claim, wherein the first audio content and
the second audio content comprise audio from respective first and second remote persons
in telecommunication with the first user and second user respectively, and said means
are configured to provide for presentation of the secondary audio of the first audio
content using a fourth selection of the loudspeakers and presentation of the secondary
audio of the second audio content using a fifth selection of the loudspeakers, the
fourth and fifth selections comprising overlapping selections of the loudspeakers,
in response to user input received from one or both of the first user and the second
user.
10. An apparatus according to any preceding claim, wherein the means are configured to
provide for presentation of one of the first audio content and second audio content
with a sixth selection of the loudspeakers, the sixth selection comprising a greater
number of loudspeakers than the second selection and a greater number of loudspeakers
than the third selection, in response to an end of the other of the first audio content
and second audio content being reached such that there is no further audio content
to present.
11. An apparatus according to any preceding claim, wherein the primary audio is provided
for presentation using a selection of the at least two loudspeakers based on primary-audio-criteria,
wherein the primary-audio-criteria is different to criterion used to determine one
or more of the first selection, the second selection and the third selection of loudspeakers
of the at least two loudspeakers.
12. An apparatus according to any preceding claim, wherein the means are configured to
provide for the simultaneous presentation of the first audio content and the second
audio content to at least three loudspeakers at known locations in the presentation
space.
13. An apparatus according to any preceding claim, wherein one or both of the first audio
content and the second audio content comprises audio-type information which designates
which audio of the respective first and second audio content comprises the primary
audio and which audio of the respective first and second audio content comprises the
secondary audio.
14. A method, the method comprising:
receiving first position information indicative of at least the location of a first
user with respect to a presentation space; and
receiving second position information indicative of at least the location of a second
user with respect to the presentation space;
providing for presentation, using a first selection of loudspeakers comprising at
least two loudspeakers having known locations in the presentation space, of secondary
audio of first audio content to the first user based on the first position information
being indicative of the first user being in the presentation space and the second
position information being indicative of the second user being outside the presentation
space; and
providing for presentation of secondary audio of the first audio content using a second
selection of the loudspeakers and the secondary audio of the second audio content
using a third selection of the loudspeakers, the second selection comprising fewer
loudspeakers than the first selection, and the second and third selections comprising
non-overlapping selections of the loudspeakers based on the location of the first
user and the second user in the presentation space determined from the first position
information and the second position information, wherein the first audio content and
the second audio content each comprise primary audio and the secondary audio, the
secondary audio being different to the primary audio and comprising ambient audio.
15. A computer readable medium comprising computer program code stored thereon, the computer
readable medium and computer program code being configured to, when run on at least
one processor, perform the method of:
receiving first position information indicative of at least the location of a first
user with respect to a presentation space; and
receiving second position information indicative of at least the location of a second
user with respect to the presentation space;
providing for presentation, using a first selection of loudspeakers comprising at
least two loudspeakers having known locations in the presentation space, of secondary
audio of first audio content to the first user based on the first position information
being indicative of the first user being in the presentation space and the second
position information being indicative of the second user being outside the presentation
space; and
providing for presentation of secondary audio of the first audio content using a second
selection of the loudspeakers and the secondary audio of the second audio content
using a third selection of the loudspeakers, the second selection comprising fewer
loudspeakers than the first selection, and the second and third selections comprising
non-overlapping selections of the loudspeakers based on the location of the first
user and the second user in the presentation space determined from the first position
information and the second position information, wherein the first audio content and
the second audio content each comprise primary audio and the secondary audio, the
secondary audio being different to the primary audio and comprising ambient audio.