TECHNICAL FIELD
[0001] The present invention relates to an object-based three dimensional (3-D) audio service
system using preset audio scenes and a method thereof; and, more particularly, to
an object-based 3-D audio service system using preset audio scenes and a method thereof
for providing an interactive service that enables a user or a viewer to directly form
an audio scene using a 3-D audio related technology for providing realistic broadcasting
to a user or a viewer.
BACKGROUND ART
[0002] Fig. 1 is a diagram illustrating a conventional audio service system.
[0003] As shown in Fig. 1, the conventional audio service system includes an audio service
providing apparatus 10 and an audio service reproducing apparatus 20. The audio service
providing apparatus 10 includes an audio-capture unit 11 for capturing an audio signal
such as sound, an editing/mixing unit 12 for editing and mixing the captured audio
signal to transmit the audio signal to an audio service reproducing apparatus 20,
and a storing/transmitting unit 13 for storing the mixed audio signal and transmitting
the mixed audio signal to the audio service reproducing apparatus 20.
[0004] The audio service reproducing apparatus 20 includes a receiver 21 for receiving an
audio signal transmitted from the audio service providing apparatus 10, a controller
22 for controlling the received audio signal, and a reproducer 23 for reproducing
an audio signal.
[0005] An audio signal, which is provided through broadcasting services such as TV broadcasting,
radio broadcasting, and Digital Multimedia Broadcasting (DMB) based on the conventional
audio service system, is generally created by mixing a plurality of audio signals
captured from various sound sources. For example, an audio signal provided through
a soccer game broadcasting is created by mixing noises in a soccer stadium, yelling
of a crowd, and a voice of an announcer.
[0006] Although a user or a viewer can control the volume of the overall audio signal, it
is impossible to control the volume of each object such as the voice of an announcer,
the yelling of a crowd, and the noises of the soccer stadium. It is because the audio
signal is transmitted after a plurality of object audio signals are mixed into one
audio signal in a general broadcasting service.
[0007] However, if a transmitter such as the audio service providing apparatus 10 independently
transmits object audio signals of the sound sources without the object audio signals
of the sound sources mixed to one audio signal, a receiver such as the audio service
reproducing apparatus 20 can independently control the volumes of the object audio
signals of the sound sources. An object-based audio service denotes such an audio
service that allows a user or a viewer to control each of the object audio signals
at a receiver by independently transmitting the object audio signals of the sound
sources through a transmitter.
[0008] For example, if an audio signal of a soccer game broadcasting is provided based on
an object-based 3-D audio service, a user or a viewer can control each of objects,
such as the noises in the soccer stadium, the yelling of the crowd, and the voices
of an announcer to obtain a desired audio setting. That is, a user or a viewer can
control the noise of the soccer stadium loud, the yelling of the crowd soft, and the
voice of the announcer loud. Or, a viewer can control the audio signal to reproduce
only the noises of the soccer stadium and the voice of an announcer without the yelling
of the crowd reproduced.
[0009] Therefore, there is a great demand for developing a method for providing an object-based
3-D audio service that enables a user to control each of object audio signals of sound
sources, which can be applied to all broadcasting services and multimedia services
providing audio such as digital broadcasting, radio broadcasting, Digital Multimedia
Broadcasting, Internet broadcasting, digital movie, DVD, moving picture contents.
[0010] Although a conventional object-based 3-D audio system and a control method thereof
was introduced in Korean Patent Publication No.
10-2004-0037437, published on May 7th 2004, the conventional object-based 3-D audio system requires a user to control each of
object audio signals of sound sources to set the audio signals according to user's
preference. Therefore, it is very annoying to a user or a viewer.
[0011] Taejin Lee et al., "An Object-based 3D Audio Broadcasting System for Interactive Services",
Audio Engineering Society, Convention paper 6384, 2005, May 28-31, Barcelona, Spain, disclose an object-based 3D audio broadcasting system. This system consists of
an authoring tool, a streaming server, and a client. The authoring tool generates
an MPEG-4 file, made of multiple audio objects, after adapting several kinds of acoustical
effects to the audio objects. The streaming server generates transmission packets
and sends them to the client through the Internet. The client reconstructs the bitstream
and plays the 3D audio with a user interaction. The authors present the design and
implementation method of an object-based 3D audio system and describe the simulation
result and applications thereof.
DISCLOSURE
TECHNICAL PROBLEM
[0012] An embodiment of the present invention is directed to providing an object-based three
dimensional (3-D) audio service system and a method thereof for enabling a user to
easily and conveniently watch and listen an object-based 3-D audio service by eliminating
inconvenience that requires a user to control each of object audio signals of sound
sources.
[0013] Other objects and advantages of the present invention can be understood by the following
description, and become apparent with reference to the embodiments of the present
invention. Also, it is obvious to those skilled in the art of the present invention
that the objects and advantages of the present invention can be realized by the means
as claimed and combinations thereof.
TECHNICAL SOLUTION
[0014] The present invention is defined in the independent claims. The dependent claims
define embodiments thereof.
[0015] In accordance with an embodiment , there is provided an object-based three dimensional
(3-D) audio service providing apparatus using preset audio scenes, including: audio
input means for inputting an audio signal; preset audio scene generating means for
extracting object audio signals from the audio signal inputted through the audio input
means and generating more than one of 3-D audio scene information by arranging the
extracted object audio signals in a 3-D space and editing features of each object;
and encoding means for encoding and multiplexing the audio signal and the 3-D audio
scene information for each object audio signal.
[0016] In accordance with another embodiment , there is provided an object-based 3-D audio
service reproducing apparatus using preset audio scenes including: decoding means
for de-multiplexing and decoding object-based 3-D audio contents; audio scene forming
means for forming 3-D audio scene information according to one selected from a plurality
of 3-D audio scene information in the de-multiplexed and decoded object-based 3-D
audio contents by a user including a viewer; audio signal mixing means for controlling
features of objects in an audio signal of the de-multiplexed and decoded object-based
3-D audio contents according to the formed 3-D audio scene information; and reproducing
means for reproducing the audio signal with one of the features controlled.
[0017] In accordance with another embodiment , there is provided a method for providing
an object-based 3-D audio service using preset audio scenes, including the steps of:
inputting an audio signal; extracting object audio signals from the inputted audio
signal and generating more than one of 3-D audio scene information by arranging the
extracted object audio signals in a 3-D space and editing features of each object;
and encoding and multiplexing the audio signal and the 3-D audio scene information
for each object audio signal.
[0018] In accordance with another embodiment , there is provided a method for reproducing
object-based 3-D audio service using preset audio scenes including the steps of: de-multiplexing
and decoding object-based 3-D audio contents; forming 3-D audio scene information
according to one selected from a plurality of 3-D audio scene information in the de-multiplexed
and decoded object-based 3-D audio contents by a user including a viewer; controlling
features of objects in an audio signal of the de-multiplexed and decoded object-based
3-D audio contents according to the formed 3-D audio scene information; and reproducing
the audio signal with one of the features controlled.
ADVANTAGEOUS EFFECTS
[0019] An object-based three dimensional (3-D) audio service system and a method thereof
according to the present invention provides previously generated preset audio scenes
to a user or a viewer with an object-based 3-D audio service applied to all broadcasting
services and multimedia services providing audio, such as digital broadcasting, radio
broadcasting, Digital Multimedia Broadcasting (DMB), Internet broadcasting, digital
movies, Digital Video Disk (DVD), and moving picture contents. Therefore, the object-based
3-D audio service system and a method thereof according to the present invention eliminates
the inconvenience of a user to control each of object audio signals of sound sources
and enables the user to easily and conveniently watch and listen the object-based
3-D audio service.
[0020] The present invention can be applied to broadcasting services and multimedia services
providing audio, such as digital broadcasting, radio broadcasting, DMB, Internet broadcasting,
digital movies, DVD, and moving picture contents, and the present invention is not
limited to the types of mediums for transmitting and storing object-based audio contents
for broadcasting and multimedia services providing audio.
BRIEF DESCRIPTION OF THE DRAWINGS
[0021]
Fig. 1 is a diagram illustrating a conventional audio service system.
Fig. 2 is a block diagram illustrating an object-based three-dimensional (3-D) audio
service system using preset audio scenes in accordance with an embodiment of the present
invention.
Fig. 3 is a flowchart illustrating a method for providing an object-based 3-D audio
service using preset audio scenes in accordance with an embodiment of the present
invention.
Fig. 4 is a flowchart illustrating a method for reproducing an object-based 3-D audio
service using preset audio scenes in accordance with the embodiment of the present
invention.
BEST MODE FOR THE INVENTION
[0022] The advantages, features and aspects of the invention will become apparent from the
following description of the embodiments with reference to the accompanying drawings,
which is set forth hereinafter.
[0023] Fig. 2 is a block diagram illustrating an object-based three-dimensional (3-D) audio
service system using preset audio scenes in accordance with an embodiment of the present
invention.
[0024] As shown in Fig. 2, the object-based 3-D audio service system includes an object-based
3-D audio service providing apparatus 30, a transmitting medium 50, and an object-based
3-D audio service reproducing apparatus 40. The 3-D service providing apparatus 30
receives an audio signal through various input devices, creates more than one of object-based
3-D audio scene information which can be selected by a user or a viewer, and transmits
the created object-based 3-D audio scene information to the object-based 3-D audio
service reproducing apparatus 40. The transmitting medium 50 is a medium such as a
digital broadcasting network or an Internet network for connecting the object-based
3-D audio service providing apparatus 30 and the object-based 3-D audio service reproducing
apparatus 40 through a network. The object-based 3-D audio service reproducing apparatus
40 generates more than one of 3-D audio scenes based on the object-based 3-D audio
scene information transmitted from the object-based 3-D audio service providing apparatus
30.
[0025] Hereinafter, the constituent elements of the object-based 3-D audio service system
using preset audio scenes according to the present embodiment will be described in
detail.
[0026] The object-based 3-D audio service providing apparatus 30 includes an input unit
31, a preset audio scene generator 32, an encoder 33, and a transmitter 34. The input
unit 31 receives audio signals through various input devices. The preset audio scene
generator 32 extracts object-based audio signals (hereinafter, object audio signals)
from the audio signal received through the input unit 31, arranges the extracted object
audio signals in a three dimensional space, and creates more than one of 3-D audio
scene information by editing features such as a location, a size, a direction, and
a sound field environment of each object. The encoder 33 encodes and multiplexes the
audio signal inputted through the input unit 31 and the object-based 3-D audio scene
information created by the preset audio scene generating unit 32 for transmitting
the input audio signal and the generated preset audio scene information to the object-based
3-D audio service reproducing apparatus 40. For example, the input audio signals and
the generated preset audio scene information are multiplexed to a moving picture experts
group 4 (MPEG-4) file format in a digital broadcasting network. The transmitter 34
transforms the multiplexed object-based audio contents including the input audio signal
and the created object-based 3-D audio scene information from the encoding unit 33
to a transport format. For example, the transmitter 34 transforms the multiplexed
object-based audio contents to a MPEG-2 transport stream (TS) for a digital broadcasting
network.
[0027] The transformed object-based audio contents including the input audio signal and
the generated object-based 3-D audio scene information may be transmitted to the object-based
3-D audio reproducing apparatus 40 and may be stored in a storing medium.
[0028] The transmitter 34 may transmit the object-based audio contents including the input
audio signal and the object-based 3-D audio scene information to the object-based
3-D audio reproducing apparatus 40 through a digital broadcasting network such as
a terrestrial DMB channel 50.
[0029] If the sound source of the audio signal inputted to the input unit 31 is a mixed
sound source, the preset audio scene generator 32 uses a Convolutive Blind Source
Separation technique to extract object audio signals. Especially, the preset audio
scene generator 32 forms more than one of object-based 3-D audio scene information
by controlling a ratio of each object-based the audio scene information of each object
audio signal, which is set according to the control of a user such as an editor.
[0030] The object-based 3-D audio service reproducing apparatus 40 includes a decoder 42,
an audio scene information forming unit 43, an audio signal mixer 44, and an audio
signal reproducer 45. The decoder 42 de-multiplexes and decodes object-based audio
contents including an audio signal and object-based 3-D audio scene information for
reproducing. The audio scene information forming unit 43 provides the object-based
3-D audio scene information of the object-based 3-D audio contents, which is de-multiplexed
and decoded by the decoder 42, to a user such as a viewer to select, and forms the
object-based 3-D audio scene information according to the user selection. The audio
signal mixer 44 mixes object audio signals of the audio signal of the de-multiplexed
and decoded object-based 3-D audio contents from the decoder 42 by controlling features
of each object, such as a location, a direction, a size, and a sound field of each
object according to the object-based 3-D audio scene information formed by the audio
scene information forming unit 43. The audio signal reproducer 45 reproduces the audio
signal mixed to one object-based 3-D audio scene by the audio signal mixer 44.
[0031] The object-based audio contents including the audio signal and the object-based 3-D
audio scene information may be provided through a broadcasting service or a multimedia
service such as digital broadcasting, radio broadcasting, Digital Multimedia Broadcasting
(DMB), Internet broadcasting, digital movies, Digital Video Disk (DVD), and moving
picture contents. Although the object-based audio contents may be received through
the receiver 41 in the present embodiment, the present invention is not limited thereto.
That is, the object-based audio contents may be provided through a transmission medium
or a storage medium that can provide a broadcasting service or a multimedia service
that provides an audio.
[0032] The audio scene information forming unit 43 enables a user or a viewer to select
features of objects such as a location, a direction, a volume, and a sound field environment
of each object and forms new object-based 3-D audio scene information according to
the features including a location, a direction, a volume, and a sound field environment
of each object set by the user.
[0033] A user or a viewer can control features of a 3-D audio space by changing a reverberation
time of a 3-D space through controlling a volume and a delay time of an initial reflected
sound through the audio scene information forming unit 43.
[0034] That is, the object-based 3-D audio service system using the preset audio scene according
to the present embodiment previously generates object-based 3-D audio scenes that
are expected to be frequently used and provides the generated object-based 3-D audio
scenes as preset audio scenes to a user or a viewer. That is, the object-based 3-D
audio service system according to the present embodiment enables a user or a viewer
to select one of the preset audio scenes in order to make a user to conveniently watch
and listen a broadcasting program with the desired audio preference.
[0035] For example, noises of a soccer stadium, yelling of a crowd, a voice of an announcer
are defined as audio objects for a soccer game broadcasting, and the defined audio
objects are transmitted independently. With the audio objects, a first audio scene
having information about volume of the noises of soccer stadium, the yelling of a
crowd, and the voice of an announcer set to 1:1:1, a second audio scene having information
about volume of the noises of a soccer stadium, the yelling of a crowd, and the voice
of an announcer set to 1:0.5:1, and an audio scene having information about volume
of the noises of a soccer stadium, the yelling of a crowd, and the voice of an announcer
set to 1:0:1 are transmitted as the preset audio scenes. Then, a user or a viewer
selects one of the preset audio scenes to watch and listen the soccer game broadcasting
with the desired audio preference.
[0036] A user may directly control each of the audio objects if the user cannot find a desired
audio scene from the provided audio scenes. However, it is preferable to provide a
large number of preset audio scenes to a user in order to enable the user to find
a desired audio scene from the provided preset audio scenes.
[0037] Fig. 3 is a flowchart illustrating a method for providing an object-based 3-D audio
service using preset audio scenes in accordance with an embodiment of the present
invention.
[0038] Referring to Fig. 3, the input unit 31 of the object-based 3-D audio service providing
apparatus 30 receives an object-based audio signal through various input devices at
step S301.
[0039] The preset audio scene generator 32 extracts object-based audio signals, that is,
object audio signals, from the audio signal inputted through the input unit 31 at
step S302. Then, the preset audio scene generator 32 generates more than one of object-based
3-D audio scene information at step S304 by arranging the extracted object audio signals
in a 3-D space and editing the features of each object audio signal such as a location,
a direction, a volume, and a sound field environment of the audio object at step S303.
The encoder 33 encodes and multiplexes the audio signal inputted through the input
unit 31 and the object-based 3-D audio scene information generated by the preset audio
scene generator 32 at step S305. For example, the encoder 33 encodes and multiplexes
the audio signal and the object-based 3-D audio scene information into MPEG-4 file
format for a digital broadcasting network.
[0040] Then, the transmitter 34 transforms the multiplexed object-based audio contents including
the audio signal and the object-based 3-D audio scene information to be proper to
a transport format and transmits the transformed object-based audio contents at step
S306. For example, the multiplexed object-based audio contents are transformed to
a MPEG-2 TS in a digital broadcasting network.
[0041] For example, the transmitter 34 transmits the transformed object-based audio contents
including the audio signal and the object-based 3-D audio scene information to the
object-based 3-D audio reproducing apparatus 40 through a digital broadcasting network
such as a terrestrial DMB channel. The transformed object-based audio contents including
the audio signal and the object-based 3-D audio scene information may be stored in
a storing medium.
[0042] Fig. 4 is a flowchart illustrating a method for reproducing an object-based 3-D audio
service using preset audio scenes in accordance with an embodiment of the present
invention.
[0043] Referring to Fig. 4, the receiver 41 of the object-based 3-D audio service reproducing
apparatus 40 receives the object-based audio contents including an audio signal and
object-based 3-D audio information through, for example, a digital broadcasting network
such as a terrestrial DMB channel 50 or the Internet network at step S401.
[0044] The receiver 41 may receive the object-based audio contents through a transmission
medium that can provide a broadcasting service or a multimedia service that provides
an audio. Or, the object-based audio contents may be inputted through the storing
medium.
[0045] The decoder 42 de-multiplexes and decodes the received or inputted object-based audio
contents including the audio signal and the object-based 3-D audio scene information
at step S402. The audio scene information forming unit 43 provides the object-based
3-D audio scene information of the de-multiplexed and decoded object-based 3-D audio
contents to a user or a viewer to select, and forms object-based 3-D audio scene information
according to the user selection at step S403.
[0046] Then, the audio signal mixer 44 mixes object audio signals by controlling features
of objects in the audio signal of the de-multiplexed and decoded object-based 3-D
audio contents, such as a location, a direction, a volume, and a sound field environment
of each audio object, according to the object-based 3-D audio scene information formed
by the audio scene information forming unit 43 at step S404. Finally, the audio signal
reproducer 45 reproduces the audio signal mixed based on one of the object-based 3-D
audio scenes by the audio signal mixer 44 at step S405.
[0047] The above described method according to the present invention can be embodied as
a program and stored on a computer readable recording medium. The computer readable
recording medium is any data storage device that can store data which can be thereafter
read by the computer system. The computer readable recording medium includes a read-only
memory (ROM), a random-access memory (RAM), a CD-ROM, a floppy disk, a hard disk and
an optical magnetic disk.
[0048] While the present invention has been described with respect to certain preferred
embodiments, it will be apparent to those skilled in the art that various changes
and modifications may be made without departing from the scope of the invention as
defined in the following claims.
1. An object-based three dimensional (3-D) audio service providing apparatus (30) using
preset audio scenes, comprising:
audio input means (31) for inputting an audio signal;
preset audio scene generating means (32) for extracting object audio signals from
the audio signal inputted through the audio input means and generating more than one
of 3-D audio scene information by arranging the extracted object audio signals in
a 3-D space and editing features of each object based on the preset audio scenes;
and
encoding means (33) for encoding and multiplexing the audio signal and the 3-D audio
scene information for each object audio signal,
wherein the preset audio scene generating means (32) is adapted to generate more than
one of the 3-D audio scene information by setting the ratios between the volumes of
the object audio signals, based on audio scene information of each object audio signal,
which ratios are set according to control of a user including a viewer.
2. The object-based 3-D audio service providing apparatus (30) of claim 1, further comprising
processing means for processing the encoded and multiplexed object-based 3-D audio
contents.
3. The object-based 3-D audio service providing apparatus (30) of claim 2, wherein the
processing means is adapted to transmit the encoded and multiplexed object-based 3-D
audio contents to an audio reproducing terminal through a digital broadcasting network.
4. The object-based 3-D audio service providing apparatus (30) of claim 1, wherein the
audio scene generating means (32) is adapted to extract object audio signals using
a Convolutive Blind Source Separation technique when a sound source of the input audio
signal is a mixed sound source.
5. An object-based three dimensional (3-D) audio service reproducing apparatus (40) for
reproducing object-based 3-D audio service provided by the object-based 3-D audio
service providing apparatus (30) according to any one of claims 1 to 4, using preset
audio scenes comprising:
decoding means (42) for de-multiplexing and decoding object-based 3-D audio contents;
audio scene selecting means (43) for selecting one 3-D audio scene information from
a plurality of 3-D audio scene information based on the preset audio scenes in the
de-multiplexed and decoded object-based 3-D audio contents by a user including a viewer;
audio scene forming means for forming 3-D audio scene information according to the
selected 3-D audio scene information;
audio signal mixing means (44) for controlling features of objects in an audio signal
of the de-multiplexed and decoded object-based 3-D audio contents according to the
formed 3-D audio scene information; and
reproducing means (45) for reproducing the audio signal with one of the features controlled.
6. A method for providing an object-based three dimensional (3-D) audio service using
preset audio scenes, comprising the steps of:
inputting an audio signal;
generating (S304) more than one of 3-D audio scene information by extracting object
audio signals (S302) from the inputted audio signal, arranging the extracted object
audio signals in a 3-D space and editing (S303) features of each object based on the
preset audio scenes; and
encoding and multiplexing (S305) the audio signal and the 3-D audio scene information
for each object audio signal,
wherein in the step of generating (S304) 3-D audio scene information, more than one
of the 3-D audio scene information is generated by setting the ratios between the
volumes of the object audio signals, based on audio scene information of each object
audio signal, which ratios are set according to control of a user including a viewer.
7. The method of claim 6, further comprising the step of:
processing the encoded and multiplexed object-based 3-D audio contents.
8. The method of claim 7, wherein in the step of processing the object-based 3-D audio
contents, the encoded and multiplexed object-based 3-D audio contents are transmitted
through a digital broadcasting network,
9. A method for reproducing object-based three dimensional (3-D) audio service provided
by the object-based 3-D audio service providing method according to any one of claims
6 to 8, using preset audio scenes comprising the steps of :
de-multiplexing and decoding (S402) object-based 3-D audio contents;
selecting (S403) one 3-D audio scene information from a plurality of 3-D audio scene
information based on the preset audio scenes in the de-multiplexed and decoded object-based
3-D audio contents by a user including a viewer; forming (S403) 3-D audio scene information
according to the selected 3-D audio scene information;
controlling (S404) features of objects in an audio signal of the de-multiplexed and
decoded object-based 3-D audio contents according to the formed 3-D audio scene information;
and
reproducing (S405) the audio signal with one of the features controlled.
10. The method of claim 9 or apparatus (40) of claim 5, wherein in the step of forming
(S403) 3-D audio scene information, 3-D audio scene information is formed according
to features of each object, which are set by a user including a viewer.
11. The apparatus (30; 40) of claim 1 or 5, or the method of claim 6 or 9, wherein the
features include at least one among a location, a volume, a direction, and a sound
field environment of each object.
12. The apparatus (40) of claim 5 or the method of claim 9, wherein a feature of a 3-D
audio space is controlled by changing a reverberation time of a 3-D audio space through
controlling a volume and a delay time of an initial reflected sound.
1. Eine objekt-basierte dreidimensionale (3-D) Audio-Service-Lieferungsvorrichtung (30),
die voreingestellte Audioszenen verwendet, aufweisend:
Audio-Eingabemittel (31) zum Eingeben eines Audiosignals;
Erzeugungsmittel (32) zur Erzeugung voreingestellter Audioszenen zum Extrahieren von
Objektaudiosignalen aus dem Audiosignal, das durch die Audioeingabemittel eingegeben
wurde und zum Erzeugen von mehr als einer 3-D-Audioszeneninformation durch Anordnen
der extrahierten Objektaudiosignale in einem 3-D-Raum und Editieren von Merkmalen
von jedem Objekt basierend auf den voreingestellten Audio-Szenen; und
Codiermittel (33) zum Codieren und Multiplexen des Audiosignals und der 3-D-Audioszeneninformation
für jedes Objektaudiosignal,
wobei die Erzeugungseinrichtung (32) für voreingestellte Audioszenen angepasst ist,
mehr als eine der 3-D-Audioszeneninformation zu erzeugen durch Setzen des Verhältnisses
zwischen den Lautstärken der Objektaudiosignale basierend auf Audioszeneninformation
von jedem Objektaudiosignal,
wobei die Verhältnisse gesetzt werden in Übereinstimmung mit der Steuerung eines Benutzers
einschließend einen Betrachter.
2. Die objekt-basierte 3-D-Audio-Service-Lieferungsvorrichtung (30) von Anspruch 1, ferner
aufweisend: Verarbeitungsmittel zum Verarbeiten der codierten und gemultiplexten objektbasierten
3-D-Audioinhalte.
3. Die objekt-basierte 3-D-Audio-Service-Lieferungsvorrichtung (30) nach Anspruch 2,
wobei die Verarbeitungsmittel angepasst sind, um die codierten und gemultiplexten
objektbasierten 3-D-Audioinhalte an ein Audio-Wiedergabeterminal über ein digitales
Broadcasting-Netzwerk zu übertragen.
4. Die objekt-basierte 3-D-Audio-Service-Lieferungsvorrichtung (30) nach Anspruch 1,
wobei die Audioszenen-Erzeugungsmittel (32) angepasst sind, Objektaudiosignale unter
Verwendung einer konvolutiven Blindquellenseparationstechnik zu extrahieren, wenn
eine Schallquelle des Audiosignals eine gemischte Schallquelle ist.
5. Eine objekt-basierte dreidimensionale (3-D)-Audio-Service-Wiedergabe-vorrichtung (40)
zum Wiedergeben von objektbasiertem 3-D Audioservice, der geliefert wird durch die
objektbasierte 3-D-Audio-Service-Lieferungsvorrichtung (30) gemäß einem der Ansprüche
1 - 4, welche voreingestellte Audioszenen verwendet, aufweisend:
Decodiermittel (42) zum Demultiplexen und Decodieren objektbasierter 3-D-Audioinhalte;
Audioszenen-Auswahlmittel (43) zum Auswählen einer 3-D-Audioszeneninformation aus
einer Mehrzahl von 3-D-Audioszeneninformationen basierend auf den voreingestellten
Audioszenen in den gedemultiplexten und decodierten objektbasierten 3-D-Audioinhalten
durch einen Benutzer einschließend einen Betrachter;
Audioszenen-Formungsmittel zum Formen von 3-D-Audioszeneninformation gemäß der ausgewählten
3-D-Audioszeneninformation;
Audiosignal-Mischmittel (44) zum Steuern von Merkmalen von Objekten in einem Audiosignal
der gedemultiplexten und decodierten objektbasierten 3-D-Audioinhalte gemäß den geformten
3-D-Audioszeneninformationen; und
Wiedergabemittel (45) zum Wiedergeben des Audiosignals mit einem der gesteuerten Merkmale.
6. Ein Verfahren zum Liefern eines objektbasierten dreidimensionalen (3-D) Audioservice
unter Verwendung voreingestellter Audioszenen, aufweisend die Schritte:
Eingeben eines Audiosignals;
Erzeugen (S304) von mehr als einer 3-D-Audioszeneninformation durch Extrahieren von
Objektaudiosignalen (S302) von dem eingegebenen Audiosignal, Anordnen der extrahierten
Objektaudiosignale in einem 3-D-Raum und Editieren (S303) von Merkmalen von jedem
Objekt basierend auf den voreingestellten Audioszenen; und
Codieren und Multiplexen (S305) des Audiosignals und der 3-D-Audioszeneninformation
für jedes Objektaudiosignal,
wobei in dem Schritt des Erzeugens (S304) von 3-D-Audioszeneninformationen mehr als
eine der 3-D-Audioszeneninformation erzeugt wird durch Setzen des Verhältnisses zwischen
den Lautstärken der Objektaudiosignale basierend auf Audioszeneninformation von jedem
Objektaudiosignal, wobei die Verhältnisse gesetzt werden in Übereinstimmung mit der
Steuerung eines Benutzers einschließend einen Betrachter.
7. Das Verfahren nach Anspruch 6, ferner aufweisend den Schritt:
Verarbeiten der codierten und gemultiplexten objektbasierten 3-D-Audioinhalte.
8. Das Verfahren nach Anspruch 7, wobei in dem Schritt des Verarbeitens der objektbasierten
3-D-Audioinhalte die codierten und gemultiplexten objektbasierten 3-D-Audioinhalte
über ein digitales Broadcasting-Netzwerk übertragen werden.
9. Ein Verfahren zum Wiedergeben von objektbasiertem dreidimensionalem (3-D) Audioservice,
der geliefert wird durch das objektbasierte 3-D-Audioservice-Lieferungsverfahren gemäß
einem der Ansprüche 6 bis 8, unter Verwendung voreingestellter Audioszenen, aufweisend
die Schritte:
Demultiplexen und Decodieren (S402) von objektbasierten 3-D-Audioinhalten;
Auswählen (S403) von einer 3-D-Audioszeneninformation aus einer Mehrzahl von 3-D-Audioszeneninformationen
basierend auf den voreingestellten Audioszenen in dem gedemultiplexten und decodierten
objektbasierten 3-D-Audioinhalten durch einen Benutzer einschließend eine Betrachter;
Bilden (S403) von 3-D-Audioszeneninformationen gemäß der ausgewählten 3-D-Audioszeneninformation;
Steuren (S404) von Merkmalen von Objekten in einem Audiosignal der gedemultiplexten
und decodierten objektbasierten 3-D-Audioinhalte in Übereinstimmung mit der gebildeten
3-D-Audioszeneninformation; und
Wiedergeben (S405) des Audiosignals mit einem der gesteuerten Merkmale.
10. Das Verfahren nach Anspruch 9 oder die Vorrichtung (40) nach Anspruch 5, wobei in
dem Schritt des Bildens (S403) von 3-D-Audioszeneninformation 3-D-Audioszenen-Information
gebildet wird in Übereinstimmung mit Merkmalen von jedem Objekt, die durch einen Benutzer
einschließend einen Betrachter gesetzt werden.
11. Die Vorrichtung (30; 40) nach Anspruch 1 oder 5, oder das Verfahren nach Anspruch
6 oder 9, wobei die Merkmale einschließen: zumindest eines aus einem Ort, einer Lautstärke,
einer Richtung und einer Schallfeldumgebung jedes Objekts.
12. Die Vorrichtung nach Anspruch 5 oder das Verfahren nach Anspruch 9, wobei ein Merkmal
eines 3-D-Audioraums gesteuert wird durch Verändern einer Hallzeit eines 3-D-Audioraums
durch Steuern einer Lautstärke und einer Verzögerungszeit eines initialreflektierten
Schalls.
1. Appareil de fourniture de service audio tridimensionnel (3D) à base d'objet utilisant
des scènes audio préétablies, comprenant :
des moyens d'entrée audio (31) pour entrer un signal audio ;
des moyens de génération de scène audio préétablie (32) pour extraire des signaux
audio d'objet du signal audio entré via les moyens d'entrée audio et pour générer
plus d'une information de scène audio 3-D en agençant les signaux audio d'objet extraits
dans un espace 3-D et en éditant des caractéristiques de chaque objet d'après les
scènes audio préétablies ; et
des moyens de codage (33) pour coder et multiplexer le signal audio et les informations
de scène audio 3-D pour chaque signal audio d'objet,
dans lequel les moyens de génération de scène audio préétablie (32) sont adaptés pour
générer plus d'une des informations de scène audio 3-D en établissant les rapports
entre les volumes des signaux audio d'objet, d'après des informations de scène audio
de chaque signal audio d'objet, lesquels rapports sont établis selon la commande d'un
utilisateur incluant un spectateur.
2. Appareil de fourniture de service audio 3-D à base d'objet (30) selon la revendication
1, comprenant en outre des moyens de traitement pour traiter les contenus audio 3-D
à base d'objet codés et multiplexés.
3. Appareil de fourniture de service audio 3-D à base d'objet (30) selon la revendication
2, dans lequel les moyens de traitement sont adaptés pour transmettre les contenus
audio 3-D à base d'objet codés et multiplexés à un terminal de reproduction audio
via un réseau de diffusion numérique.
4. Appareil de fourniture de service audio 3-D à base d'objet (30) selon la revendication
1, dans lequel les moyens de génération de scène audio (32) sont adaptés pour extraire
des signaux audio d'objet à l'aide d'une technique de séparation de source aveugle
convolutive lorsqu'une source sonore du signal audio entré est une source sonore mélangée.
5. Appareil de reproduction de service audio tridimensionnel (3-D) à base d'objet (40)
pour reproduire un service audio 3-D à base d'objet fourni par l'appareil de fourniture
de service audio 3-D à base d'objet (30) selon l'une quelconque des revendications
1 à 4, utilisant des scènes audio préétablies comprenant :
des moyens de décodage (42) pour démultiplexer et décoder des contenus audio 3-D à
base d'objet ;
des moyens de sélection de scène audio (43) pour sélectionner une information de scène
audio 3-D parmi une pluralité d'informations de scène audio 3-D d'après les scènes
audio préétablies dans les contenus audio 3-D à base d'objet démultiplexés et décodés
par un utilisateur incluant un spectateur ;
des moyens de formation de scène audio pour former les informations de scène audio
3-D selon les informations de scène audio 3-D sélectionnées ;
des moyens de mélange de signal audio (44) pour contrôler les caractéristiques d'objets
dans un signal audio des contenus audio 3-D à base d'objet démultiplexés et décodés
selon les informations de scène audio 3-D formées ; et
des moyens de reproduction (45) pour reproduire le signal audio avec une des caractéristiques
commandées.
6. Procédé de fourniture d'un service audio tridimensionnel (3-D) à base d'objet utilisant
des scènes audio préétablies, comprenant les étapes de :
fourniture en entrée d'un signal audio ;
génération (S304) de plus d'une information de scène audio 3-D en extrayant des signaux
audio d'objet (S302) du signal audio fourni en entrée, agencement des signaux audio
d'objet extraits dans un espace 3-D et édition (S303) de caractéristiques de chaque
objet d'après les scènes audio préétablies ; et
codage et multiplexage (S305) du signal audio et des informations de scène audio 3-D
pour chaque signal audio d'objet,
dans lequel dans l'étape de génération (S304) d'information de scène audio 3-D, plus
d'une des informations de scène audio 3-D est générée en établissant les rapports
entre les volumes des signaux audio d'objet, basés sur les informations de scène audio
de chaque signal audio d'objet, lesquels rapports sont établis selon une commande
d'un utilisateur incluant un spectateur.
7. Procédé selon la revendication 6, comprenant en outre l'étape de :
traitement des contenus audio 3-D à base d'objet codés et multiplexés.
8. Procédé selon la revendication 7, dans lequel dans l'étape de traitement des contenus
audio 3-D à à base d'objet, les contenus audio 3-D à base d'objet codés et multiplexés
sont transmis via un réseau de diffusion numérique.
9. Procédé de reproduction d'un service audio tridimensionnel (3-D) à base d'objet fourni
par le procédé de fourniture de service audio 3-D à base d'objet selon l'une quelconque
des revendications 6 à 8, utilisant des scènes audio préétablies comprenant les étapes
de :
démultiplexage et décodage (S402) de contenus audio 3-D à base d'objet ;
sélection (S403) d'une information de scène audio 3-D parmi une pluralité d'informations
de scène audio 3-D d'après les scènes audio préétablies dans les contenus audio 3-D
à base d'objet démultiplexés et décodés par un utilisateur incluant un spectateur
;
formation (S403) d'information de scène audio 3-D selon les informations de scène
audio 3-D sélectionnées ;
commande (S404) des caractéristiques d'objet dans un signal audio des contenus audio
3-D à base d'objet démultiplexés et décodés selon les informations de scène audio
3-D formées ; et
reproduction (S405) du signal audio avec l'une des caractéristiques commandées.
10. Procédé selon la revendication 9 ou appareil (40) selon la revendication 5, où dans
l'étape de formation (S403) d'information de scène audio 3-D, l'information de scène
audio 3-D est formée selon les caractéristiques de chaque objet, qui sont établies
par un utilisateur incluant un spectateur.
11. Appareil (30 ; 40) selon la revendication 1 ou 5, ou procédé selon la revendication
6 ou 9, où les caractéristiques comprennent au moins un élément parmi un emplacement,
un volume, une direction, et un environnement de champ acoustique de chaque objet.
12. Appareil (40) selon la revendication 5 ou procédé selon la revendication 9, où une
caractéristique d'un espace audio 3-D est commandée en changeant un temps de réverbération
d'un espace audio 3-D par la commande d'un volume et d'un délai d'attente d'un son
réfléchi initial.