Global Patent Index - EP 3203380 A1

EP 3203380 A1 20170809 - MULTI-MODE AUDIO RECOGNITION AND AUXILIARY DATA ENCODING AND DECODING

Title (en)

MULTI-MODE AUDIO RECOGNITION AND AUXILIARY DATA ENCODING AND DECODING

Title (de)

MULTI-MODUS AUDIO ANERKENNUNG UND AUXILIARY DATEN KODIEREN UND DEKODIEREN

Title (fr)

RECONNAISSANCE AUDIO MULTI-MODE ET CODAGE ET DÉCODAGE DE DONNÉES AUXILIAIRES

Publication

EP 3203380 A1 20170809 (EN)

Application

EP 16207395 A 20131015

Priority

  • US 201261714019 P 20121015
  • US 201313841727 A 20130315
  • EP 13847464 A 20131015
  • US 2013065069 W 20131015

Abstract (en)

Audio signal processing enhances audio watermark embedding and detecting processes. Audio signal processes include audio classification and adapting watermark embedding and detecting based on classification. Advances in audio watermark design include adaptive watermark signal structure data protocols, perceptual models, and insertion methods. Perceptual and robustness evaluation is integrated into audio watermark embedding to optimize audio quality relative the original signal, and to optimize robustness or data capacity. These methods are applied to audio segments in audio embedder and detector configurations to support real time operation. Feature extraction and matching are also used to adapt audio watermark embedding and detecting.

IPC 8 full level

G06F 17/00 (2006.01); G10L 19/018 (2013.01); G10L 19/02 (2013.01)

CPC (source: EP US)

G10L 19/018 (2013.01 - EP US); G10L 19/028 (2013.01 - US); G10L 25/87 (2013.01 - US); G10L 19/02 (2013.01 - US)

Citation (applicant)

  • US 2011161076 A1 20110630 - DAVIS BRUCE L [US], et al
  • US 2012134548 A1 20120531 - RHOADS GEOFFREY B [US], et al
  • US 2013150117 A1 20130613 - RODRIGUEZ TONY F [US], et al
  • US 2012214544 A1 20120823 - SHIVAPPA SHANKAR THAGADUR [US], et al
  • US 2012214515 A1 20120823 - DAVIS BRUCE L [US], et al
  • US 5918223 A 19990629 - BLUM THOMAS L [US], et al
  • US 7412072 B2 20080812 - SHARMA RAVI K [US], et al
  • US 6614914 B1 20030902 - RHOADS GEOFFREY B [US], et al
  • US 6674876 B1 20040106 - HANNIGAN BRETT T [US], et al
  • US 2012214515 A1 20120823 - DAVIS BRUCE L [US], et al
  • US 2012214544 A1 20120823 - SHIVAPPA SHANKAR THAGADUR [US], et al
  • US 7352878 B2 20080401 - REED ALASTAIR M [US], et al
  • US 7796826 B2 20100914 - RHOADS GEOFFREY B [US], et al
  • US 2010322469 A1 20101223 - SHARMA RAVI K [US]
  • US 2012082398 A1 20120405 - LYONS ROBERT G [US], et al
  • US 7076082 B2 20060711 - SHARMA RAVI K [US]
  • US 7013021 B2 20060314 - SHARMA RAVI K [US], et al
  • US 201313789126 A 20130307
  • WOLD, E.; BLUM, T.; KEISLAR, D.; WHEATON, J.: "Content-Based Classification, Search, and Rerieval of Audio", IEEE MULTIMEDIA MAGAZINE, 1996
  • KEISLAR ET AL.: "Audio Fingerprints: Technology and Applications", AUDIO ENGINEERING SOCIETY CONVENTION PAPER 6215, 28 October 2004 (2004-10-28)
  • ISO 389-7, ACOUSTICS - REFERENCE ZERO FOR THE CALIBRATION OF AUDIOMETRIC EQUIPMENT, 1996
  • C.M.RADER: "An improved algorithm for high speed autocorrelation with applications to spectral estimation", IEEE TRANSACTIONS ON ACOUSTICS AND ELECTROACOUSTICS, December 1970 (1970-12-01)
  • M. R. SHROEDER: "Computer Speech: Recognition, Compression, Synthesis", 2004, SPRINGER
  • L.R.RABINER; M.R.SAMBUR: "An Algorithm for Determining the Endpoints of Isolated Utterances", THE BELL SYSTEM TECHNICAL JOURNAL, February 1975 (1975-02-01)
  • L.R.RABINER; M.R.SAMBUR: "Voiced-Unvoiced-Silence Detection using the Itakura LPC Distance Measure", ICASSP, 1977
  • M.J. CAREY; E.S. PARRIS; H. LLOYD-THOMAS: "A comparison of features for speech, music discrimination", PROCEEDINGS OF IEEE ICASSP'99, 1999, pages 1432 - 1435
  • J.MAUCLAIR; J. PINQUIER: "Fusion of Descriptors for Speech/Music Classification", PROC. OF 12TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2004, September 2004 (2004-09-01)
  • J.MAUCLAIR; J. PINQUIER: "Fusion of Descriptors for Speech/Music Classification", PROC. OF 12TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2004), VIENNA, AUSTRIA, September 2004 (2004-09-01)
  • B.KEDEM: "Spectral analysis and discrimination by zero-crossings", PROCEEDINGS OF IEEE, vol. 74, no. 11, November 1986 (1986-11-01), XP008046669

Citation (search report)

  • [XI] EP 2362387 A1 20110831 - FRAUNHOFER GES FORSCHUNG [DE]
  • [XI] BONEY LAURENCE ET AL: "Digital watermarks for audio signals", 1996 8TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 1996), IEEE, 10 September 1996 (1996-09-10), pages 1 - 4, XP032770216, ISBN: 978-88-86179-83-6, [retrieved on 20150408]
  • [A] ARNOLD M: "Audio watermarking: features, applications and algorithms", MULTIMEDIA AND EXPO, 2000. ICME 2000. 2000 IEEE INTERNATIONAL CONFEREN CE ON NEW YORK, NY, USA 30 JULY-2 AUG. 2000, PISCATAWAY, NJ, USA,IEEE, US, vol. 2, 30 July 2000 (2000-07-30), pages 1013 - 1016, XP010513181, ISBN: 978-0-7803-6536-0, DOI: 10.1109/ICME.2000.871531

Designated contracting state (EPC)

AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

DOCDB simple family (publication)

US 2014108020 A1 20140417; US 9401153 B2 20160726; EP 2907044 A2 20150819; EP 2907044 A4 20160706; EP 3203380 A1 20170809; EP 3203380 B1 20220504; US 10026410 B2 20180717; US 2017133022 A1 20170511; WO 2014062688 A2 20140424; WO 2014062688 A3 20140619

DOCDB simple family (application)

US 201313841727 A 20130315; EP 13847464 A 20131015; EP 16207395 A 20131015; US 2013065069 W 20131015; US 201615220209 A 20160726