Global Patent Index - EP 1542206 A1

EP 1542206 A1 20050615 - Apparatus and method for automatic classification of audio signals

Title (en)

Apparatus and method for automatic classification of audio signals

Title (de)

Vorrichtung und Verfahren zur automatischen Klassifizierung von Audiosignalen

Title (fr)

Dispositif et procédé pour la classification automatique de signaux audio

Publication

EP 1542206 A1 20050615 (EN)

Application

EP 03028573 A 20031211

Priority

EP 03028573 A 20031211

Abstract (en)

The present invention relates to an apparatus and a method for automatic classification of audio signals. <??>Such an apparatus comprises: signal input means (3) for supplying audio signals; audio signal fragmenting means (4) for partitioning audio signals supplied by the signal input means (3) into audio fragments of a predetermined length; feature extracting means (5) for analysing acoustic characteristics of the audio signals comprised in the audio fragments; and classifying means (6) for discriminating the audio fragments provided by the audio signal fragmenting means (4) into a predetermined audio class based on predetermined audio class classsifying models (71,72,73) by using acoustic characteristics of the audio signals comprised in the audio fragments, wherein a predetermined audio class classifying model (71,72,73) is provided for each audio class and each audio class represents a respective kind of audio signals comprised in the corresponding audio fragment. <??>It is a disadvantage that singing voice included in the audio signal frequently is misclassified as speech, particularly when the singing voice is the dominant signal component. The reason is that singing voice is more similar to speech than to music. <??>To solve this problem, according to the present invention an individual predetermined audio class classifying model (71,72,73) is provided for at least each audio class "speech", "music" and "singing voice". <??>Furthermore, the above disadvantage is overcome by the inventive method and the inventive software product. <IMAGE>

IPC 1-7

G10L 11/00

IPC 8 full level

G10L 25/48 (2013.01)

CPC (source: EP)

G10L 25/48 (2013.01)

Citation (search report)

  • [X] US 2002163533 A1 20021107 - TROVATO KAREN I [US], et al
  • [X] WO 0116937 A1 20010308 - WAVEMAKERS RES INC [CA], et al
  • [X] ZHANG T ET AL: "Audio content analysis for online audiovisual data segmentation and classification", IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, IEEE INC. NEW YORK, US, vol. 9, no. 4, May 2001 (2001-05-01), pages 441 - 457, XP001164214, ISSN: 1063-6676
  • [X] WU CHOU ET AL: "Robust singing detection in speech/music discriminator design", 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING. PROCEEDINGS (CAT. NO.01CH37221), 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING. PROCEEDINGS, SALT LAKE CITY, UT, USA, 7-11 MAY 2001, 2001, Piscataway, NJ, USA, IEEE, USA, pages 865 - 868 vol.2, XP002278343, ISBN: 0-7803-7041-4

Designated contracting state (EPC)

AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LI LU MC NL PT RO SE SI SK TR

DOCDB simple family (publication)

EP 1542206 A1 20050615

DOCDB simple family (application)

EP 03028573 A 20031211