AN APPARATUS AND ASSOCIATED METHODS FOR PRESENTATION OF AUDIO

(19)

(11)

EP 3 588 986 A1

(12)	EUROPEAN PATENT APPLICATION

(43)	Date of publication:
	01.01.2020 Bulletin 2020/01

(21)	Application number: 18180734.8

(22)	Date of filing: 29.06.2018

(51)

International Patent Classification (IPC):

H04R 27/00^(2006.01)

H04S 7/00^(2006.01)

(84)	Designated Contracting States:
	AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR
	Designated Extension States:
	BA ME
	Designated Validation States:
	KH MA MD TN

(71)	Applicant: Nokia Technologies Oy
	02610 Espoo (FI)

(72)	Inventors:
	ERONEN, Antti 33820 Tampere (FI) LEHTINIEMI, Arto 33880 Lempäälä (FI) MATE, Sujeet Shyamsundar 33720 Tampere (FI) LEPPÄNEN, Jussi 33580 Tampere (FI)

(74)	Representative: Nokia EPO representatives
	Nokia Technologies Oy Karakaari 7 02610 Espoo 02610 Espoo (FI)

(54)	AN APPARATUS AND ASSOCIATED METHODS FOR PRESENTATION OF AUDIO

(57) An apparatus comprising means configured to:
receive first and second position information indicative of the location of a respective first and second user with respect to a presentation space;
provide for presentation, using a first selection of loudspeakers, of secondary audio of first audio content based on the first user being in and the second user being outside the presentation space; and
provide for presentation of secondary audio of the first audio content using a second selection of the loudspeakers and the secondary audio of the second audio content using a third selection of the loudspeakers, the second selection comprising fewer loudspeakers than the first selection, and the second and third selections comprising non-overlapping selections based on the location of the first and second user in the presentation space, wherein the first and second audio content each comprise primary audio and different, ambient secondary audio.

Description

Technical Field

[0001] The present disclosure relates to the field of presentation of audio content and, in particular, to the presentation of a subset of the audio from a selection of loudspeakers. The present disclosure also relates to associated apparatus, methods and computer programs.

Background

[0002] Telecommunication or telephony systems are being developed that provide for more than monophonic capture and presentation of audio. The presentation of such audio may require careful consideration to ensure the telecommunication is clear and effective.

[0003] The listing or discussion of a prior-published document or any background in this specification should not necessarily be taken as an acknowledgement that the document or background is part of the state of the art or is common general knowledge. One or more aspects/examples of the present disclosure may or may not address one or more of the background issues.

Summary

[0004] In an aspect of the disclosure there is provided an apparatus, the apparatus comprising means configured to:

receive first position information indicative of at least the location of a first user with respect to a presentation space; and

receive second position information indicative of at least the location of a second user with respect to the presentation space;

provide for presentation, using a first selection of loudspeakers comprising at least two loudspeakers having known locations in the presentation space, of secondary audio of first audio content to the first user based on the first position information being indicative of the first user being in the presentation space and the second position information being indicative of the second user being outside the presentation space; and

provide for presentation of secondary audio of the first audio content using a second selection of the loudspeakers and the secondary audio of the second audio content using a third selection of the loudspeakers, the second selection comprising fewer loudspeakers than the first selection, and the second and third selections comprising non-overlapping selections of the loudspeakers based on the location of the first user and the second user in the presentation space determined from the first position information and the second position information, wherein the first audio content and the second audio content each comprise primary audio and the secondary audio, the secondary audio being different to the primary audio and comprising ambient audio.

[0005] In one or more examples, the primary audio comprises voice audio comprising audio determined to be generated by a voice of one or more remote users, such as for telecommunication with the first/second user, and the secondary audio comprises ambient audio comprising audio other than that determined to be generated by the voice of the one or more remote users. In one or more examples, the primary audio comprises spatial audio that includes directional information such that, when presented, it is to be perceived as originating from a direction or range of directions in accordance with the directional information and the secondary audio comprises at least one of audio without said directional information and spatial audio with said directional information that defines a range of directions from which the audio should be perceived greater than a threshold range of directions. In one or more examples, the first and/or second audio content comprises telecommunication audio content comprising audio content provided for the purpose of telecommunication, which may be via a traditional telecommunication network or provided by a voice over IP or any other packet-based or circuit switched telephony service.

[0006] In one or more embodiments, the apparatus includes means configured to provide for presentation of the primary audio of the first audio content and the primary audio of the second audio content from at least the speaker of the at least two speakers that satisfies a criterion, the criterion including at least in part the following qualifiers:

the speaker that is the closest speaker to the respective user that is closer to the respective user than it is to the other respective user; and

the primary audio of the first audio content and the primary audio of the second audio content are presented from different speakers.

[0007] In one or more examples, the apparatus includes means configured to provide for presentation of the primary audio of the first audio content and the primary audio of the second audio content from at least the loudspeaker of the at least two loudspeakers that satisfies a criterion, the criterion including at least in part the following qualifiers:
the loudspeaker that is the closest loudspeaker to the respective user that is closer to the respective user than it is to the other respective user; and
the primary audio of the first audio content and the primary audio of the second audio content are presented from different loudspeakers.

[0008] In one or more embodiments, the second selection is determined based, at least in part, on which of the at least two loudspeakers are closer to the first user than the second user and the third selection is determined based, at least in part, on which of the at least two loudspeakers are closer to the second user than the first user, wherein the second and third selections are non-overlapping.

[0009] In one or more embodiments, the first position information is also indicative of the orientation of the first user in the presentation space and the second position information is also indicative of the orientation of the second user in the presentation space; and

wherein the loudspeakers that form part of the second selection are determined based on a criterion, the criterion including at least in part the following qualifiers:

which of the at least two loudspeakers are closer to the first user; and

which of the at least two loudspeakers the first user is facing; and

wherein the loudspeakers that form part of the third selection are determined based on a criterion, the criterion including at least in part the following qualifiers:

which of the at least two loudspeakers are closer to the second user; and

which of the at least two loudspeakers the second user is facing.

[0010] In one or more embodiments, the apparatus includes means configured to change the loudspeakers that form the second selection and the loudspeakers that form the third selection based on the respective first position information and the respective second position information being indicative of movement of the location of one or both of the first user and the second user in the presentation space.

[0011] In one or more embodiments, said change in the loudspeakers that form the second selection and the loudspeakers that form the third selection is further based on the elapse of a predetermined amount of time since any previous change in the loudspeakers that form the second selection and the loudspeakers that form the third selection.

[0012] In one or more embodiments, the means are configured to cease presentation of the secondary audio of one or both of the first audio content and the second audio content and maintain the presentation of the primary audio of both the first audio content and the second audio content, in response to the first position information and the second position information being indicative of the first user and the second user having moved to within a predetermined distance of one another.

[0013] In one or more embodiments, said means are configured to, in response to the first position information and the second position information being indicative of the first user and the second user having moved to being within a predetermined distance of one another,

provide for presentation of the secondary audio of the first audio content from the loudspeaker of the at least two loudspeakers that is the closest loudspeaker to the first user that is closer to the first user than it is to the second user in preference to the second selection, and

provide for presentation of the secondary audio of the second audio content from the loudspeaker of the at least two loudspeakers that is the closest loudspeaker to the second user that is closer to the second user than it is to the first user in preference to the third selection, and

maintain presentation of the primary audio of both the first audio content and the second audio content.

[0014] In one or more embodiments, the first audio content and the second audio content comprise audio from respective first and second remote persons in telecommunication with the first user and second user respectively, and said means are configured to provide for presentation of the secondary audio of the first audio content using a fourth selection of the loudspeakers and presentation of the secondary audio of the second audio content using a fifth selection of the loudspeakers, the fourth and fifth selections comprising overlapping selections of the loudspeakers, in response to user input received from one or both of the first user and the second user.

[0015] In one or more examples, the fourth and fifth selections are formed of the same selection of speakers. In one or more examples, the fourth and fifth selections comprise all of the at least two speakers in the presentation space.

[0016] In one or more embodiments, the means are configured to provide for presentation of one of the first audio content and second audio content with a sixth selection of the loudspeakers, the sixth selection comprising a greater number of loudspeakers than the second selection and a greater number of loudspeakers than the third selection, in response to an end of the other of the first audio content and second audio content being reached such that there is no further audio content to present.

[0017] In one or more embodiments, the primary audio is provided for presentation using a selection of the at least two loudspeakers based on primary-audio-criteria, wherein the primary-audio-criteria is different to criterion used to determine one or more of the first selection, the second selection and the third selection of loudspeakers of the at least two loudspeakers.

[0018] In one or more embodiments, the means are configured to provide for the simultaneous presentation of the first audio content and the second audio content to at least three loudspeakers at known locations in the presentation space.

[0019] In one or more embodiments, one or both of the first audio content and the second audio content comprises audio-type information which designates which audio of the respective first and second audio content comprises the primary audio and which audio of the respective first and second audio content comprises the secondary audio.

[0020] In one or more examples, the apparatus is configured to provide for determination of which audio of one or both of the first audio content and second audio content is the primary audio, and which is the secondary audio based on audio analysis.

[0021] In a further aspect there is provided a method, the method comprising:

receiving first position information indicative of at least the location of a first user with respect to a presentation space; and

receiving second position information indicative of at least the location of a second user with respect to the presentation space;

providing for presentation, using a first selection of loudspeakers comprising at least two loudspeakers having known locations in the presentation space, of secondary audio of first audio content to the first user based on the first position information being indicative of the first user being in the presentation space and the second position information being indicative of the second user being outside the presentation space; and

providing for presentation of secondary audio of the first audio content using a second selection of the loudspeakers and the secondary audio of the second audio content using a third selection of the loudspeakers, the second selection comprising fewer loudspeakers than the first selection, and the second and third selections comprising non-overlapping selections of the loudspeakers based on the location of the first user and the second user in the presentation space determined from the first position information and the second position information, wherein the first audio content and the second audio content each comprise primary audio and the secondary audio, the secondary audio being different to the primary audio and comprising ambient audio.

[0022] In a further aspect there is provided a computer readable medium comprising computer program code stored thereon, the computer readable medium and computer program code being configured to, when run on at least one processor, perform the method of:

receiving first position information indicative of at least the location of a first user with respect to a presentation space; and

receiving second position information indicative of at least the location of a second user with respect to the presentation space;

[0023] In a further example aspect there is provided an apparatus comprising:

at least one processor; and

at least one memory including computer program code,

the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following:

receive first position information indicative of at least the location of a first user with respect to a presentation space; and

receive second position information indicative of at least the location of a second user with respect to the presentation space;

[0024] The present disclosure includes one or more corresponding aspects, examples or features in isolation or in various combinations whether or not specifically stated (including claimed) in that combination or in isolation. Corresponding means and corresponding functional units (e.g., function enabler, speaker selector, amplifier, display device) for performing one or more of the discussed functions are also within the present disclosure.

[0025] Corresponding computer programs for implementing one or more of the methods disclosed are also within the present disclosure and encompassed by one or more of the described examples.

[0026] The above summary is intended to be merely exemplary and non-limiting.

Brief Description of the Figures

[0027] A description is now given, by way of example only, with reference to the accompanying drawings, in which:

figure 1 illustrates an example apparatus for providing for presentation of audio in a presentation space;

figure 2 shows an example presentation space with a first user therein;

figure 3 shows the example presentation space with the first user and a second user therein;

figure 4 shows the example presentation space with the first user and the second user having moved to new locations therein;

figure 5 shows a first example of the presentation of audio when the first user and the second user move close to one another;

figure 6 shows a second example of the presentation of audio when the first user and the second user move close to one another;

figure 7 shows an example of the presentation of audio when the first user and second user have provided an input to join separate telecommunication calls they were engaged in;

figure 8 shows a second example of the presentation of audio when the first user and second user have provided an input to join separate telecommunication calls they were engaged in;

figure 9 shows a first stage of an example method of determining which speakers are within a speaker selection for a user;

figure 10 shows a second stage of an example method of determining which speakers are within a speaker selection for a user;

figure 11 shows a third stage of an example method of determining which speakers are within a speaker selection for a user;

figure 12 shows a fourth stage of an example method of determining which speakers are within a speaker selection for a user;

figure 13 shows a fifth stage of an example method of determining which speakers are within a speaker selection for a user;

figure 14 shows a flowchart illustrating an example method; and

figure 15 shows a computer readable medium.

Description of Example Aspects

[0028] Telecommunication or telephony systems are being developed that provide for more than monophonic capture and monophonic presentation of audio. Immersive telephony systems are being developed, such as by the 3rd Generation Partnership Project (3GPP), that will enable a new level of immersion in telephony services. Such systems may provide for transmission of and presentation of immersive, spatial audio content. This may enable receiving and sending of an enveloping sound scene from/to the telecommunication call participants. Thus, when a remote user or person calls to a first user, the first user can experience the audio environment around the remote user as if he/she was physically located at the location of the remote user and vice versa.

[0029] In one or more examples, the audio provided as part of said telecommunication may be categorised as primary audio and secondary audio. It will be appreciated that while the examples herein relate to the provision of audio in the field of telecommunication, the principles may be applied to other fields of audio presentation.

[0030] In one or more examples, the primary audio may comprise voice audio comprising audio determined to be generated by a voice of one or more remote users in telecommunication with a first user. The "voice" primary audio may be categorised at the point of capture or at the point of play back using audio analysis techniques, or by a server or any other entity involved in said telecommunication. The secondary audio may, in one or more examples, comprise ambient audio comprising audio other than that determined to be generated by the voice of one or more remote persons. The primary and secondary audio may be presented differently.

[0031] In one or more examples, the primary audio may comprise spatial audio content that includes directional information such that, when presented, it is perceived as originating from one or more directions in accordance with the directional information. In one or more examples, the secondary audio may comprise ambient audio comprising audio without said directional information or without a direction of arrival distinguishable above a threshold level. Again, the primary and secondary audio may be presented differently.

[0032] With primary audio that comprises spatial audio content, the direction from which audio was received at the location of the remote user may be reproduced when presenting the audio to the first user (or any other user) by use of spatial audio presentation.

[0033] Spatial audio comprises audio presented in such a way to a user that it is perceived to originate from a particular location, as if the source of the audio was located at that particular location. Spatial audio content comprises audio for presentation as spatial audio and, as such, typically comprises audio having directional information (either explicitly specified as, for example, metadata or inherently present in the way the audio is captured), such that the spatial audio content can be presented such that its component audio is perceived to originate from one or more points or one or more directions in accordance with the directional information.

[0034] In one or more examples, non-spatial audio content may be presented as spatial audio. Thus, "conventional" monophonic or stereophonic audio may be provided for presentation such that it will be perceived to originate from a particular location or direction. One or more of the embodiments described herein may present spatial audio based on spatial audio content or non-spatial audio content.

[0035] The spatial positioning of the spatial audio may be provided by 3D audio effects, such as those that utilise a head related transfer function to create a spatial audio space (aligned with a real-world space in the case of augmented reality) in which audio can be positioned for presentation to a user. Spatial audio may be presented by headphones by using head-related-transfer-function (HRTF) filtering techniques or, for loudspeakers, by using vector-base-amplitude panning techniques to position the perceived aural origin of the audio content. In other embodiments Ambisonics audio presentation may be used to present spatial audio.

[0036] Spatial audio may use one or more of volume differences, timing differences and pitch differences between audible presentation to each of a user's ears to create the perception that the origin of the audio is at a particular location or in a particular direction in space. The perceived distance to the perceived origin of the audio may be rendered by controlling the amount of reverberation and gain to indicate closeness or distance from the perceived source of the spatial audio.

[0037] In one or more examples, an apparatus 100 may be provided that is configured to provide for presentation of secondary audio to one or more users.

[0038] Figure 1 shows an example overview of a system 110 in which such an apparatus 100 may be used. The system may include means to receive audio content, such as a first audio input 101 and second audio input 102. It will be appreciated that the system 110 may include further audio inputs. The audio inputs 101, 102 may receive audio from one or more telecommunication devices (not shown). The telecommunication devices may comprise mobile telephones or any other telecommunication equipment. In one or more other examples, the audio inputs 101, 102 receive audio from other sources, in the field of telecommunication or not.

[0039] The apparatus 100 may be configured to provide for presentation of the audio content from the first audio input 101 and the second audio input 102 to a plurality of loud speakers 103, 104, 105, 106. The speakers 103-106 may be arranged in a presentation space 107, such as a room or part of a room.

[0040] The apparatus 100 may comprise or be connected to a processor 108 and a memory 109 and may be configured to execute computer program code. The apparatus 100 may have only one processor 108 and one memory 109 but it will be appreciated that other embodiments may utilise more than one processor and/or more than one memory (e.g. same or different processor/memory types). Further, the apparatus 100 may be an Application Specific Integrated Circuit (ASIC).

[0041] The processor may be a general purpose processor dedicated to executing/processing information received from other components, such as received from the audio inputs 101, 102, or a user position information sensor 111 in accordance with instructions stored in the form of computer program code in the memory. The output signalling generated by such operations of the processor is provided onwards to further components, such as to the speakers 103-106 or an amplifier or other audio presentation equipment (not shown) to provide the audio to the speakers 103-106.

[0042] The user position information sensor 111 may comprise one or more cameras for tracking the position of the one or more users in the presentation space 107. In one or more other examples, the sensor 111 may be configured to track the position of tags, such as RFID tags, worn by the one or more users. In one or more examples, the sensor 111 may be embodied as a plurality of sensors arranged at least within the presentation space 107. It will be appreciated that there are many ways the position and, optionally, the orientation, of one or more users may be tracked. In one or more embodiments, the sensor 111 may be associated with circuitry for performing the tracking of users and providing position information to the apparatus 100. In one or more examples, the apparatus 100 may use data from the sensor to track the one or more users and generate the position information.

[0043] The memory 109 (not necessarily a single memory unit) is a computer readable medium (solid state memory in this example, but may be other types of memory such as a hard drive, ROM, RAM, Flash or the like) that stores computer program code. This computer program code stores instructions that are executable by the processor, when the program code is run on the processor. The internal connections between the memory and the processor can be understood, in one or more example embodiments, to provide an active coupling between the processor and the memory to allow the processor to access the computer program code stored on the memory.

[0044] In this example, the respective processors and memories are electrically connected to one another internally to allow for electrical communication between the respective components. In this example, the components are all located proximate to one another so as to be formed together as an ASIC, in other words, so as to be integrated together as a single chip/circuit that can be installed into an electronic device. In some examples one or more or all of the components may be located separately from one another.

[0045] Example figures 2 to 8 below show various positions of a first user 201 and a second user 202 inside the presentation space 107 (and outside the presentation space in the case of example figure 2).

[0046] In this and one or more examples, the apparatus 100 may be configured to receive first audio content, such as at first audio input 101, intended for presentation to the first user 201 and, at the same or later time, second audio content, such as at second audio input 102, for presentation to the second user 202 (shown in figures 3 to 8). The first audio content may comprise audio content from a remote user with which the first user 201 is in telecommunication. The second audio content may comprise audio content from a different remote user with which the second user 202 is in telecommunication. In other examples, the first and second audio content may comprise audio content outside the field of telecommunication, such as an audiobook. In one or more other examples, the apparatus 100 may not receive the audio content itself but receives information indicative that it is available for presentation by audio presentation equipment. The apparatus 100 may then provide for control of the audio presentation equipment.

[0047] As described above, the first audio content and the second audio content may be categorised into primary audio content and secondary audio content.

[0048] The example apparatus 100 is configured to receive first position information indicative of at least the location of the first user 201 with respect to the presentation space 107 and, optionally, the orientation of the first user 201. The first position information may be received from the sensor 111 or from a different device.

[0049] The example apparatus 100 is configured to receive second position information indicative of at least the location of the second user 202 with respect to the presentation space 107 and, optionally, the orientation of the second user 202. The second position information may be received from the sensor 111 or from a different device.

[0050] In this example, the apparatus 100 is configured to provide for the simultaneous presentation of the first audio content and the second audio content to the four loudspeakers comprising the first speaker 103, the second speaker 104, third speaker 105, and fourth speaker 106. The simultaneous presentation may be such that the first and second audio content is presented at the same time but not necessarily from the same speaker(s), as will be described in more detail below. The apparatus 100 may, in one or more examples, be configured to provide an audio signal for each loudspeaker 103-106. In one or more examples, the apparatus 100 may be configured to provide one or more control signals to audio presentation equipment to control how the audio presentation equipment mixes the first and second audio content to each speaker 103-106.

[0051] The apparatus 100 may be provided with information regarding the location of the first through fourth loudspeakers 103-106 in the presentation space 107. This known location information may be pre-set or may be determined by the apparatus 100 prior to or at the time of determining selection(s) of loudspeakers to which to present the audio content. It will be appreciated that the location of the loudspeakers are known to the apparatus at the time the second and third speaker selections are determined but it may be inconsequential how far in advance the speakers locations are determined or provided to the apparatus.

[0052] Example figure 2 shows the first user 201 in the presentation space 107. The apparatus 100 or a different apparatus, such as the audio presentation equipment, may provide for presentation of the secondary audio using a first selection of speakers 103-106. In this example, the first selection comprises all of the speakers 103-106 available in the presentation space 107. The circle 203 illustrates that the secondary audio of the first audio content is presented from all of the speakers and therefore the ambient audio of the first audio content can be heard to surround the first user 201, due to the speakers being arranged to surround the first user 201.

[0053] The criteria for selecting which loudspeakers comprise the first selection is based, at least in part, on the first position information being indicative of the first user 201 being in the presentation space 107 and the second position information being indicative of the second user (not present in figure 2) being outside the presentation space 107. The second position information may be considered to be indicative of the second user being outside the presentation space 107 by one or more of specifying a location for the second user that is outside the presentation space, not specifying a location at all, or the absence of second position information at the time of presentation of the first audio content. In this example, the criteria may comprise use all available speakers when only one user requires secondary audio to be presented to them in the presentation space 107. It will be appreciated that in one or more other examples, the criteria may be different. For example, where the presentation space 107 is large and many speakers are provided therein, the criteria for the first selection may comprise present the secondary audio from all speakers within a predetermined distance of the first user 201. Other criteria may be used to determine the speakers of the first selection.

[0054] The apparatus 100 may include means configured to provide for the presentation of the primary audio of the first audio content from one or more of the speakers 102 -106, such as from the loudspeaker or loudspeakers that are closest to the first user 201 (e.g. first speaker 103 or a subset comprising first and second speakers 103, 104). In one or more examples, the primary audio is provided for presentation from the speaker or speakers that most directly faces the first user, such as third and fourth speakers 105, 106. In other examples, the primary audio may comprise spatial audio and therefore, the direction from which the first user 201 perceives the audio may be based on the directional information of the primary audio of the first audio content. Thus, the speakers used may be determined based on the spatial audio presentation of the primary audio. It will be appreciated that various primary-audio-criteria may be used to determine the speakers used for presentation of the primary audio.

[0055] Example figure 3 shows the presentation space 107 including both the first user 201 and the second user 202. The second user 202 may start a telecommunication call and therefore require the presentation of the associated second audio content by the speakers 103-106 or may be conducting a telecommunication call and enter the presentation space 107. Accordingly, the apparatus 100 is configured to select which of the loudspeakers 103-106 to use to render the ambient secondary audio of the first audio content to the first user 201 and which of the speakers 103-106 to use to render the ambient secondary audio of the second audio content to the second user 202. The rendering of the secondary audio for both the first and second users 201, 202 from all of the speakers may be confusing and distracting. Accordingly, the selection of speakers is preferably required to provide an impression of the ambient scene created by the secondary audio but should not be distracting for the other of the first and second user 201, 202.

[0056] In example figure 3, the apparatus 101 is configured to provide for presentation of the secondary audio of the first audio content for the first user 201 using a second selection of the speakers 103-106 rather than the first selection shown in figure 2. The apparatus 101 is also configured to provide for presentation of the secondary audio of the second audio content for the second user 202 using a third selection of the speakers 103-106.

[0057] In this example, the second selection, represented by oval 301, comprises the first and second speakers 103, 104. In this example, the third selection, represented by the second oval 302, comprises the third and fourth speakers 105, 106. Thus, the secondary audio of the first audio content is only presented by a subset of the four speakers and the second audio of the second audio content is only presented by a subset of the four speakers 103-106.

[0058] In particular, the second selection comprises a fewer number of loudspeakers (e.g. two) than the first selection (e.g. four). Thus, as the ambient secondary audio of the first audio content can no longer be presented from all speakers, because the presentation space 107 is shared by a plurality of users who require the presentation of audio content, a compromise may be required.

[0059] Further, in one or more examples, the second selection and third selection comprise non-overlapping selections of the speakers 103-106. Accordingly, speakers used to present the secondary audio of the first audio content are not used to present the secondary audio of the second audio content.

[0060] Still further, the second and third selections may be based on the location of the first and second users 201, 202 in the presentation space 107 determined from the first position information and the second position information.

[0061] The criterion for selecting the loudspeakers for the second and third selections may comprise determining which non-overlapping subset of speakers are closer to the first user 201 and which speakers are closer to the second user 202. In one or more examples, the number of speakers assigned to each of the first and second selection may be equal or close to equal in the event an odd number of speakers are used.

[0062] In one or more examples, the second selection may be determined prior to the third selection or vice versa. In one or more examples, the third selection may comprise the remaining speakers after determining the speakers that make up the second selection or vice versa. In one or more examples, the user who first initiated the telecommunication may have their speaker selection determined first.

[0063] In one or more examples, the criteria for determining the second and third selections may comprise determining, for each speaker, which user 201, 202 is closer thereto and assigning the speaker to the second selection if the first user 201 is closer and assigning the speaker to the third selection if the second user 202 is closer. An example strategy for determination of speaker selections is provided below with reference to figures 9 to 13. However, in principle, various qualifiers or conditions may be used to determine which speakers are assigned to which speaker selection based on the location of the users. In one or more examples, a first qualifier may be to identify the closest speaker or speakers to the respective users. If this does not yield an independent or partially independent selection of speakers then further qualifiers may be applied. In one or more examples, the speaker(s) that is closest to one of the users than the other is assigned to the speaker selection for that user and the other user is assigned the closest or next closest speaker(s) that is not part of the other selection. In one or more examples, the apparatus 100 may continue to assess each of the other speakers to determine which selection it should be part of.

[0064] In one or more examples, the first position information is also indicative of the orientation of the first user 201 in the presentation space 107. In one or more examples, the second position information may be indicative of the orientation of the second user 202 in the presentation space 107. Accordingly, the apparatus 100 may be configured to determine which of the speakers 103-106 the first and second users 201, 202 are facing. Thus, the second selection and the third selection may be determined based on a combination of which of the at least two speakers are closer to the first user 201 and the second user 202 and which of the at least two speakers the first user 201 and the second user 202 are facing. It will be appreciated that the assignment of the speakers to the second and third selections may comprise one or more criteria that needs to be met in terms of one or more of location and orientation in the presentation space 107 and which user 201, 202 is closer to particular speakers than the other user.

[0065] The apparatus 100 may include means configured to provide for the presentation of the primary audio of the first audio content and the primary audio of the second audio content when the first and second position information is indicative of both the first and second users being within the presentation space. In one or more examples, the criteria for selecting which of the one or more speakers 103 -106, from which to present the primary audio of the first audio content is different to the criteria used for selecting which of the one or more speakers 103 -106, from which to present the secondary audio of the first audio content. Likewise, in one or more examples, the criteria for selecting which of the one or more speakers 103 -106, from which to present the primary audio of the second audio content is different to the criteria used for selecting which of the one or more speakers 102 -106, from which to present the secondary audio of the second audio content. In terms of the presentation of the primary audio, the same primary-audio-criteria as described for Figure 2 may be used.

[0066] As mentioned in relation to figure 1 and 2, the apparatus 100 may be configured to provide for presentation of the primary audio from the speaker or speakers that are closest to the respective user or the speaker or speakers that most directly face the respective user. In one or more other examples, the primary audio may comprise spatial audio and therefore, the direction from which the users 201, 202 perceives the audio may be based on the directional information of the primary audio of the first audio content and second audio content respectively.

[0067] Example figure 4 shows the first user 201 having moved within the presentation space 107. Example figure 4 also shows the second user 202 having moved within the presentation space 107. The apparatus 100 may include means configured to, based on movement of the location of one or both of the first user 201 and the second user 202 in the presentation space 107 as indicated in the respective first position information and second position information, provide for a change in the speakers 103-106 that comprise the second selection and the speakers 103-106 that comprise the third selection.

[0068] Thus, in one or more examples, one or both of the second selection and the third selection comprise a dynamic selection of speakers based at least in part on the current position of the first user and second user respectively.

[0069] In example figure 4, the apparatus 100 has updated the second selection to comprise the first speaker 103 and the third speaker 105, as shown by oval 401. In example figure 4, the apparatus 100 has updated the third selection to comprise the second speaker 104 and the fourth speaker 106, as shown by oval 402.

[0070] The same criteria may be used to assign speakers to the second and third selections when updating the selections, as mentioned above. Alternatively, different criteria may be used to update the second and/or third selections after an initial assignment of speakers to the second and/or third selections. For example, it may be confusing if the criteria used to assign speakers to the second and third selections resulted in the borderline assignment of a particular speaker to either the second or third selection due to the relative positions of the first and second user 201, 202. In such a circumstance, small movements of the one or both users may result in updating of the second and third selections which may be confusing or annoying.

[0071] Thus, in one or more examples, a predetermined amount of time, which may be known as a "no-update time" may be defined wherein the assignment of speakers to the second and third selections is not updated despite the first position information and second position information being indicative of movement that would otherwise result in the updating of the second and/or third selections. In one or more examples the predetermined amount of time may be greater than 1 second, 5 seconds, 10 seconds or 30 seconds. In one or more examples, the predetermined amount of time may be based on a determined speed of movement of the first and/or second users based on the first position information and second position information. Thus, if the first and/or second users 201, 202 are moving quickly around the presentation space 107, then the second and third selections may be updated more quickly. However, if they are moving slowly, then the predetermined amount of time may be greater so the second and third selections may be updated more slowly. In one or more examples, the apparatus may provide for a no-update-distance similar to the no-update-time. Thus, in one or more examples, the apparatus 100 is configured to perform an update of the assignment of speakers to the second and third selections based on the determined distance of movement of the first and/or second users being above the no-update-distance. Thus, small movements of the users, less than the no-update distance, may not trigger an update to the speaker selections whereas a greater movement, greater than the no-update distance, will cause the apparatus to provide for updating of the selections.

[0072] In terms of the presentation of the primary audio, the same primary-audio-criteria as described for Figure 2 may be used.

[0073] Example figure 5 shows the first user 201 and the second user 202 having moved to positions in the presentation space 107 that are within a predetermined distance 500 of one another. If the first and second users 201, 202 move close to each other, they may have a higher likelihood of hearing each other's audio content. To make the first and second audio content easier to listen to for the respective users, the apparatus 100 may be configured to stop playing the secondary audio or "ambient audio" of the first and/or second audio content entirely. In one or more examples, the apparatus 100 may be configured to provide for presentation of the primary audio of the first and/or second audio content from just one or more (e.g. not all) of the nearest speakers to the corresponding users 201, 202.

[0074] Thus, example figure 5 shows, based on the first position information and the second position information, the first user 201 and the second user 202 having moved to being within a predetermined distance 500 of one another. The apparatus 100 is thus configured to provide for ceasing of presentation of the secondary audio of one or both of the first audio content and the second audio content and maintaining presentation of the primary audio of both the first audio content and the second audio content. The absence of an oval (or circular) selection in example figure 5 shows the absence of the presentation of the secondary audio of both the first audio content and the second audio content. While the primary audio of both the first audio content and the second audio content may continue to be presented, it will be appreciated that the selection of which speakers 103-106 are used may change based on the first user 201 and the second user 202 having moved to being within a predetermined distance 500 of one another. In one or more examples, the same primary-audio-criteria as described for Figure 2 may be used or a different criterion.

[0075] Example figure 6 shows an example of an alternative action by apparatus 100 in response to the first user 201 and the second user 202 having moved to positions in the presentation space 107 that are within a predetermined distance 500 of one another. In this and one and more examples, the apparatus 100 is configured to, in response to the first position information and the second position information being indicative of the first user 201 and the second user 202 having moved to being within a predetermined distance 500 of one another, change from presentation of the secondary audio using the second and third selections.

[0076] In particular, the apparatus 100 may be configured to provide for presentation of the secondary audio of the first audio content from the speaker that is closest to the first user 201 and closer to the first user 201 than the second user 202, which in this example comprises the third speaker 105, as shown by selection circle 601. Further, the apparatus 100 may be configured to provide for presentation of the secondary audio of the second audio content from the speaker that is closest to the second user 202 and closer to the second user 202 than the first user 201, which in this example comprises the fourth speaker 106, as shown by selection circle 602.

[0077] Thus, in some examples, the ambient, secondary audio may be important for the telecommunication call because, for example, it carries the sound of children and the call is with the grandparents. In one or more examples, the apparatus 100 may provide the secondary audio of the first audio content for presentation additionally based on first-user-input indicative of them wanting to hear the secondary audio. In one or more examples, the apparatus 100 may provide the secondary audio of the first audio content for presentation additionally based on remote-user-input indicative of them wanting the first user 201 to hear the secondary audio. Similarly, the second user may provide corresponding second-user-input to hear the secondary audio of the second audio content and/or the remote user with which the second user is telecommunicating may provide said remote-user-input indicative of them wanting the second user 202 to hear the secondary audio. The apparatus may thus be configured to act on said second-user-input and/or remote-user-input by presenting the secondary audio from at least one of the loudspeakers.

[0078] The apparatus 100 may be configured to maintain presentation of the primary audio of both the first audio content and the second audio content. As before, while the primary audio of both the first audio content and the second audio content may continue to be presented, it will be appreciated that the selection of which speakers 103-106 are used may change based on the first user 201 and the second user 202 having moved to being within a predetermined distance 500 of one another compared to when they were further than the predetermined distance 500 from one another.

[0079] In one or more examples, the first audio content is audio from a remote user with which the first user 201 is communicating and the second audio content is audio from a different remote user with which the second user 202 is communicating. In one or more examples, the first user 201 and/or the second user 202 may be desirous of joining the telecommunication calls to provide multi-party "conference call" in which the first user 201, second user 202 and remote persons with which the first and second user are communicating can all hear each other and communicate.

[0080] Example figure 7 shows how the apparatus 100 may provide for presentation of the secondary audio based on a user input received from one or both of the first user 201 and the second user 202 indicative of a desire to join the telecommunication to provide for telecommunication between any two of the first and second user 201, 202 and the first and second remote persons (not shown).

[0081] In this example, the apparatus 100 is configured to provide for presentation of the secondary audio of the first audio content using a fourth selection 701 of the speakers and presentation of the secondary audio of the second audio content using a fifth selection 702 of the speakers 103-106. The fourth and fifth selections may comprise overlapping selections of the speakers 103-106. In the example shown in figure 7 and in one or more other examples, the fourth 701 and fifth 702 selections are the same selection of speakers 103-106. In one or more examples, the fourth and fifth selections comprise all of the speakers 103-106 in the presentation space 107.

[0082] The primary audio of the first audio content and the second audio content may also be presented from the speakers 103-106 that are selected independently using different criteria than the criteria used to determine the fourth and fifth selections for the secondary audio.

[0083] Example figure 8 shows how the apparatus 100 may provide for presentation of the secondary audio based on a user input received from one or both of the first user 201 and the second user 202 indicative of a desire to join the telecommunication to provide for telecommunication between any two of the first and second user 201, 202 and more than two remote persons (not shown). For example, the first user may be in communication with a first remote user and the second user may be in communication with two remote users at the time the telecommunication is combined. Alternatively, the third remote user may join the telecommunication after the first and second users have joined their calls. Thus, in example figure 8, there may be third or more audio content due to the number of remote users. The presentation of the secondary audio from many, e.g. more than two, remote users may be confusing. Therefore, in one or more examples, the apparatus 100 is configured to, based on the first audio content, the second audio content and at least third audio content for presentation to at least both the first user and the second user, provide for grouping of the secondary audio of the first, second and at least third audio content into two or more groups for presentation from different respective groups of the speakers.

[0084] In this example, the secondary audio from the first audio content and the secondary audio from the second audio content is grouped and presented from the first and third speakers 103, 105 as shown by selection 801. In this example, the secondary audio from the third audio content forms the other group and is presented from the second and fourth speakers 104, 106 as shown by selection 802.

[0085] In one or more examples, when one of the telecommunication calls finishes and therefore the first audio content or second audio content is no longer available for presentation, the selection of speakers used for presentation of the secondary audio may be updated. Thus, based on an end of one of the first audio content and second audio content being reached such that there is no further audio content to present, the apparatus 100 may provide for presentation of the other of the first audio content and second audio content with a sixth selection (not shown) of the speakers, the sixth selection comprising a greater number of speakers than the second and third selections. In one or more examples, the sixth selection may comprise the first selection shown in example figure 2. Thus, the sixth selection may comprise all of the speakers of the presentation space 107 or a subset thereof based on distance between the user and the speakers 103-106.

[0086] It will be appreciated that the number of speakers 103-106 may be greater or less than four as shown in the examples but may be at least two or at least three speakers.

[0087] In one or more examples, one or both of the first audio content and the second audio content comprises audio-type information which designates which audio of the respective first and second audio content comprises primary audio and which audio comprises secondary audio. Alternatively, in one or more examples, the apparatus 100 is configured to provide for determination of which audio of one or both of the first audio content and second audio content is the primary audio and which is the secondary audio based on audio analysis.

[0088] Thus, in one or more examples, the audio of the first audio content may be labelled with a category comprising either secondary "ambient" audio or primary audio. In one or more examples, the labelling of the audio may be performed at a device of the remote person at the time of capture. In such an example, the first audio content may be separated into a primary audio stream and a secondary audio stream. In one or more examples, a server may perform the labelling of the audio and/or creation of said different component audio streams. In one or more examples, the apparatus 100 may be configured to determine which audio is primary user audio and which is secondary audio. In any of the examples, the determination of primary audio from secondary audio may be performed using any appropriate audio analysis technique and may be based on the frequency range of the audio at a particular time, the degree of reverberation, the volume or any other audiological factor. The determination of primary audio from secondary audio may be performed based on predetermined speech samples of the remote person. In other examples, the first/second audio content may be recorded by a plurality of microphones and may be determined to be primary audio or secondary audio based on differences between how the audio was captured by the plurality of microphones. For example, audio deemed to be received at a greater volume or with a particular degree of reverberation by a front facing microphone relative to a rearfacing microphone may be deemed to be primary audio (possibly once subject to audio processing). The remaining audio or audio received at a greater volume or with a particular degree of reverberation by the rear facing microphone relative to the front facing microphone may be determined to be secondary audio (possibly once subject to audio processing). It will be appreciated that various audio processing techniques may be used to separate the first user audio and the ambient audio.

[0089] In one or more examples, the first audio content and the second audio content may comprise telecommunication audio content or any other audio content that may be categorized. For example, an audiobook may provide the reader of the story as primary audio and supplementing sounds as secondary audio.

[0090] Figure 9 shows a first stage of an example process for determining loudspeaker selections, such as the second and third selections. Figure 9 shows an arrangement of six speakers 901-906 (first through sixth) in a presentation space 907 in which the first user 201 and the second user 202 are present. In this and one or more examples, it is a predetermined requirement of the speaker selection process for each speaker selection to include two speakers. It will be appreciated that other requirements are possible. A first stage of an example speaker selection determination process comprises forming all possible speaker pairs and connecting the pairs with straight lines, shown by arrows 908.

[0091] An example second stage comprises removal from consideration of loudspeaker pairs which have associated connecting lines 908 that cross one another. Thus, in this example, the follow speaker pairs are discounted - first-third speaker pair, first-fourth speaker pair, fifth-second speaker pair, fifth-third speaker pair and fourth-sixth speaker pair. This leaves the speakers pairs that have non-intersecting connecting lines, which comprise first-second pair 910, second-third pair 911, third-fourth pair 912, fourth-fifth pair 913, fifth-first pair 914, fifth-sixth pair 915 and the first-sixth pair 916. Example figure 10 shows only the non-intersecting connecting lines and thus the remaining pairs of speakers 901-906.

[0092] Example figure 10 also illustrates the determination of a mid-point between the remaining speaker pairs along their associated connecting lines, marked by a cross in Figure 10 along each connecting line.

[0093] Example figure 11 illustrates that, for each user 201, 202, the apparatus is configured to provide for the determination of the distance between the respective user 201, 202 and each of the determined mid-points. The second selection for the first user 201 is then determined based on which mid-point the first user 201 is closest to. The closest mid-point is associated with a speaker pair and the speakers of the speaker pair are determined to form the second selection. In this example, the second selection for the first user 201 is determined to comprise the first and sixth speakers as shown by dashed oval 921.

[0094] The third selection may also be determined based on which mid-point the second user 202 is closest to, which in this example is the mid-point of the first-second speaker pair on connecting line 910. Dashed oval 922 represents the (possible) third selection. However, in one or more examples, a further qualifier for determination of the speaker selection comprises that the second and third speaker selections must be mutually exclusive in terms of the speakers that form the selections. That is, there must not be any overlap between the speaker selections. Dashed ovals 921 and 922 are shown to overlap and therefore the apparatus may be configured to identify a different selection of speakers for the third selection.

[0095] Example figure 12 shows the apparatus 100 providing for determination of which is the second closest mid-point to the second user 202 for determination of the third selection. In this example, the mid-point of the connecting line 911 is the second closest to the second user 202, which thereby identifies the second and third speakers 902, 903 as the speakers that form the third selection, shown by dashed oval 923. Accordingly, the process has identified the first and sixth speakers 901, 906 as forming the second selection and the second and third speakers 902, 903 as forming the third selection.

[0096] Thus, in summary, the apparatus 100 may be configured to identify the speaker or speakers that are closest to each of the users, wherein if said selected speakers are not mutually exclusive then the speaker or speakers that are second closest to one of the users are determined and wherein the speakers determined based on the location of the first user form the second selection and the speakers determined based on the location of the second user form the third selection. It will be appreciated that the process may be followed for determining the speakers of selections disclosed herein other than the second and third selections. In one or more examples, only once the speaker selections have been determined is the apparatus 100 configured to begin to use those speakers for presentation of audio.

[0097] Further, the apparatus may be configured to provide for the following:

identification of all possible speaker pairs from the plurality of speakers;

determination of a connecting line between each of the determined speaker pairs;

discounting of any speaker pairs than have a connecting line that intersects with any other connecting line to identify a set of speaker pairs for further consideration;

determination of a mid-point along each of the connecting lines of each of the identified set of speaker pairs;

determination of which mid-point each of the users are closest to; and

provided that the speakers associated with the connecting line having the mid-point closest to the respective user are all different speakers, provide for determination of the second selection based on the speakers associated with the connecting line having the mid-point closest to the first user and provide for determination of the third selection based on the speakers associated with the connecting line having the mid-point closest to the second user.

[0098] In one or more examples, the apparatus is configured to, if the speakers are not all different, for at least one of the users, determine the second closest mid-point and provided that the speakers associated with the connecting line having the mid-point that is second closest to the at least one of the users are all different speakers, provide for determination of one of the second and third selection based on the speakers associated with the connecting line having the mid-point second closest to the respective user and provide for determination of the other of the second and third selection based on the speakers associated with the connecting line having the mid-point closest to the respective user.

[0099] In one or more examples, the second and third speaker selections may be expanded to include one or more additional speakers, which may form part of the example selection process. Example figure 13 shows the apparatus 100 configured to, for each of the second and third selections, provide for determination of the nearest mid-point to the mid-point of the speaker pair that currently forms part of the respective selection and thereby determine at least one further speaker from the speaker pairs associated with said nearest mid-point, and wherein, provided that said at least one additional speaker is not part of the other of the second and third selection, expand said second or third selection to include said at least one additional speaker.

[0100] In this example, the nearest mid-point to the mid-point of the first-sixth speaker pair 916 of the second selection 921 is the mid-point of the fifth-first speaker pair 914. Accordingly, the second selection 921 may be expanded to include the fifth speaker 905. In this example, the nearest mid-point to the mid-point of the second-third speaker pair 911 of the third selection is the mid-point of the third-fourth speaker pair 912. Accordingly, the third selection 923 may be expanded to include the fourth speaker 904. The expanded selections 921, 923 do not overlap in terms of the speakers of which they are formed and therefore the apparatus 100 may be configured to provide for audio presentation from the determined second and third selections 921, 923 respectively.

[0101] Figure 14 shows a flow diagram illustrating the steps of

receiving first position information 1400 indicative of at least the location of a first user with respect to a presentation space; and

receiving second position information 1401 indicative of at least the location of a second user with respect to the presentation space;

providing for presentation 1402, using a first selection of loudspeakers comprising at least two loudspeakers having known locations in the presentation space, of secondary audio of first audio content to the first user based on the first position information being indicative of the first user being in the presentation space and the second position information being indicative of the second user being outside the presentation space; and

providing for presentation 1403 of secondary audio of the first audio content using a second selection of the loudspeakers and the secondary audio of the second audio content using a third selection of the loudspeakers, the second selection comprising fewer loudspeakers than the first selection, and the second and third selections comprising non-overlapping selections of the loudspeakers based on the location of the first user and the second user in the presentation space determined from the first position information and the second position information, wherein the first audio content and the second audio content each comprise primary audio and the secondary audio, the secondary audio being different to the primary audio and comprising ambient audio.

[0102] Figure 15 illustrates schematically a computer/processor readable medium 1500 providing a program according to an example. In this example, the computer/processor readable medium is a disc such as a digital versatile disc (DVD) or a compact disc (CD). In some examples, the computer readable medium may be any medium that has been programmed in such a way as to carry out an inventive function. The computer program code may be distributed between the multiple memories of the same type, or multiple memories of a different type, such as ROM, RAM, flash, hard disk, solid state, etc.

[0103] User inputs may be gestures which comprise one or more of a tap, a swipe, a slide, a press, a hold, a rotate gesture, a static hover gesture proximal to the user interface of the device, a moving hover gesture proximal to the device, bending at least part of the device, squeezing at least part of the device, a multi-finger gesture, tilting the device, or flipping a control device. Further the gestures may be any free space user gesture using the user's body, such as their arms, or a stylus or other element suitable for performing free space user gestures.

[0104] The apparatus shown in the above examples may be a portable electronic device, a laptop computer, a mobile phone, a Smartphone, a tablet computer, a personal digital assistant, a digital camera, a smartwatch, smart eyewear, a pen based computer, a non-portable electronic device, a desktop computer, a monitor, a smart TV, a server, a wearable apparatus, a virtual reality apparatus, or a module/circuitry for one or more of the same.

[0105] Any mentioned apparatus and/or other features of particular mentioned apparatus may be provided by apparatus arranged such that they become configured to carry out the desired operations only when enabled, e.g. switched on, or the like. In such cases, they may not necessarily have the appropriate software loaded into the active memory in the non-enabled (e.g. switched off state) and only load the appropriate software in the enabled (e.g. on state). The apparatus may comprise hardware circuitry and/or firmware. The apparatus may comprise software loaded onto memory. Such software/computer programs may be recorded on the same memory/processor/functional units and/or on one or more memories/processors/ functional units.

[0106] In some examples, a particular mentioned apparatus may be pre-programmed with the appropriate software to carry out desired operations, and wherein the appropriate software can be enabled for use by a user downloading a "key", for example, to unlock/enable the software and its associated functionality. Advantages associated with such examples can include a reduced requirement to download data when further functionality is required for a device, and this can be useful in examples where a device is perceived to have sufficient capacity to store such pre-programmed software for functionality that may not be enabled by a user.

[0107] Any mentioned apparatus/circuitry/elements/processor may have other functions in addition to the mentioned functions, and that these functions may be performed by the same apparatus/circuitry/elements/processor. One or more disclosed aspects may encompass the electronic distribution of associated computer programs and computer programs (which may be source/transport encoded) recorded on an appropriate carrier (e.g. memory, signal).

[0108] Any "computer" described herein can comprise a collection of one or more individual processors/processing elements that may or may not be located on the same circuit board, or the same region/position of a circuit board or even the same device. In some examples one or more of any mentioned processors may be distributed over a plurality of devices. The same or different processor/processing elements may perform one or more functions described herein.

[0109] The term "signalling" may refer to one or more signals transmitted as a series of transmitted and/or received electrical/optical signals. The series of signals may comprise one, two, three, four or even more individual signal components or distinct signals to make up said signalling. Some or all of these individual signals may be transmitted/received by wireless or wired communication simultaneously, in sequence, and/or such that they temporally overlap one another.

[0110] With reference to any discussion of any mentioned computer and/or processor and memory (e.g. including ROM, CD-ROM etc), these may comprise a computer processor, Application Specific Integrated Circuit (ASIC), field-programmable gate array (FPGA), and/or other hardware components that have been programmed in such a way to carry out the inventive function.

[0111] The applicant hereby discloses in isolation each individual feature described herein and any combination of two or more such features, to the extent that such features or combinations are capable of being carried out based on the present specification as a whole, in the light of the common general knowledge of a person skilled in the art, irrespective of whether such features or combinations of features solve any problems disclosed herein, and without limitation to the scope of the claims. The applicant indicates that the disclosed aspects/examples may consist of any such individual feature or combination of features. In view of the foregoing description it will be evident to a person skilled in the art that various modifications may be made within the scope of the disclosure.

[0112] While there have been shown and described and pointed out fundamental novel features as applied to examples thereof, it will be understood that various omissions and substitutions and changes in the form and details of the devices and methods described may be made by those skilled in the art without departing from the scope of the disclosure. For example, it is expressly intended that all combinations of those elements and/or method steps which perform substantially the same function in substantially the same way to achieve the same results are within the scope of the disclosure. Moreover, it should be recognized that structures and/or elements and/or method steps shown and/or described in connection with any disclosed form or examples may be incorporated in any other disclosed or described or suggested form or example as a general matter of design choice. Furthermore, in the claims means-plus-function clauses are intended to cover the structures described herein as performing the recited function and not only structural equivalents, but also equivalent structures. Thus, although a nail and a screw may not be structural equivalents in that a nail employs a cylindrical surface to secure wooden parts together, whereas a screw employs a helical surface, in the environment of fastening wooden parts, a nail and a screw may be equivalent structures.

Claims

1. An apparatus comprising means configured to:

receive first position information indicative of at least the location of a first user with respect to a presentation space; and

receive second position information indicative of at least the location of a second user with respect to the presentation space;

2. An apparatus according to claim 1, wherein the means are configured to provide for presentation of the primary audio of the first audio content and the primary audio of the second audio content from at least the loudspeaker of the at least two loudspeakers that satisfies a criterion, the criterion including at least in part the following qualifiers:

the loudspeaker that is the closest loudspeaker to the respective user that is closer to the respective user than it is to the other respective user; and

the primary audio of the first audio content and the primary audio of the second audio content are presented from different loudspeakers.

3. An apparatus according to claim 1 or claim 2, wherein the second selection is determined based, at least in part, on which of the at least two loudspeakers are closer to the first user than the second user and the third selection is determined based, at least in part, on which of the at least two loudspeakers are closer to the second user than the first user, wherein the second and third selections are non-overlapping.

4. An apparatus according to claim 1 or claim 2, wherein the first position information is also indicative of the orientation of the first user in the presentation space and the second position information is also indicative of the orientation of the second user in the presentation space; and
wherein the loudspeakers that form part of the second selection are determined based on a criterion, the criterion including at least in part the following qualifiers:

which of the at least two loudspeakers are closer to the first user; and

which of the at least two loudspeakers the first user is facing; and

wherein the loudspeakers that form part of the third selection are determined based on a criterion, the criterion including at least in part the following qualifiers:

which of the at least two loudspeakers are closer to the second user; and

which of the at least two loudspeakers the second user is facing.

5. An apparatus according to claim 1 or claim 2, wherein the apparatus includes means configured to change the loudspeakers that form the second selection and the loudspeakers that form the third selection based on the respective first position information and the respective second position information being indicative of movement of the location of one or both of the first user and the second user in the presentation space.

6. An apparatus according to claim 5, wherein said change in the loudspeakers that form the second selection and the loudspeakers that form the third selection is further based on the elapse of a predetermined amount of time since any previous change in the loudspeakers that form the second selection and the loudspeakers that form the third selection.

7. An apparatus according to any preceding claim, wherein the means are configured to cease presentation of the secondary audio of one or both of the first audio content and the second audio content and maintain the presentation of the primary audio of both the first audio content and the second audio content, in response to the first position information and the second position information being indicative of the first user and the second user having moved to within a predetermined distance of one another.

8. An apparatus according to any one of claims 1 to 6, wherein said means are configured to, in response to the first position information and the second position information being indicative of the first user and the second user having moved to being within a predetermined distance of one another,
provide for presentation of the secondary audio of the first audio content from the loudspeaker of the at least two loudspeakers that is the closest loudspeaker to the first user that is closer to the first user than it is to the second user in preference to the second selection, and
provide for presentation of the secondary audio of the second audio content from the loudspeaker of the at least two loudspeakers that is the closest loudspeaker to the second user that is closer to the second user than it is to the first user in preference to the third selection, and
maintain presentation of the primary audio of both the first audio content and the second audio content.

9. An apparatus according to any preceding claim, wherein the first audio content and the second audio content comprise audio from respective first and second remote persons in telecommunication with the first user and second user respectively, and said means are configured to provide for presentation of the secondary audio of the first audio content using a fourth selection of the loudspeakers and presentation of the secondary audio of the second audio content using a fifth selection of the loudspeakers, the fourth and fifth selections comprising overlapping selections of the loudspeakers, in response to user input received from one or both of the first user and the second user.

10. An apparatus according to any preceding claim, wherein the means are configured to provide for presentation of one of the first audio content and second audio content with a sixth selection of the loudspeakers, the sixth selection comprising a greater number of loudspeakers than the second selection and a greater number of loudspeakers than the third selection, in response to an end of the other of the first audio content and second audio content being reached such that there is no further audio content to present.

11. An apparatus according to any preceding claim, wherein the primary audio is provided for presentation using a selection of the at least two loudspeakers based on primary-audio-criteria, wherein the primary-audio-criteria is different to criterion used to determine one or more of the first selection, the second selection and the third selection of loudspeakers of the at least two loudspeakers.

12. An apparatus according to any preceding claim, wherein the means are configured to provide for the simultaneous presentation of the first audio content and the second audio content to at least three loudspeakers at known locations in the presentation space.

13. An apparatus according to any preceding claim, wherein one or both of the first audio content and the second audio content comprises audio-type information which designates which audio of the respective first and second audio content comprises the primary audio and which audio of the respective first and second audio content comprises the secondary audio.

14. A method, the method comprising:

receiving first position information indicative of at least the location of a first user with respect to a presentation space; and

receiving second position information indicative of at least the location of a second user with respect to the presentation space;

15. A computer readable medium comprising computer program code stored thereon, the computer readable medium and computer program code being configured to, when run on at least one processor, perform the method of:

receiving first position information indicative of at least the location of a first user with respect to a presentation space; and

receiving second position information indicative of at least the location of a second user with respect to the presentation space;

Drawing

Search report

Search report