Global Patent Index - EP 1944754 A1

EP 1944754 A1 20080716 - Speech fundamental frequency estimator and method for estimating a speech fundamental frequency

Title (en)

Speech fundamental frequency estimator and method for estimating a speech fundamental frequency

Title (de)

Sprachgrundfrequenzkalkulator und Verfahren zur Kalkulation einer Sprachgrundfrequenz

Title (fr)

Estimateur de la fréquence fondamentale de la parole et méthode pour estimer une fréquence fondamentale de la parole

Publication

EP 1944754 A1 20080716 (EN)

Application

EP 07000568 A 20070112

Priority

EP 07000568 A 20070112

Abstract (en)

The present invention relates to a speech fundamental frequency estimator (1100) which is configured for receiving a first set of values ($Y_1$) and a second set of values ($Y_2$), the first set of values ($Y_1$) being a frequency domain representation of a first set of time domain signal values ($y_1$) within a first time interval ($t_1$) and the second set of values ($Y_2$) being a frequency domain representation of a second set of time domain signal values ($y_2$) within a second time interval ($t_2$), the second time interval ($t_2$) being later than and offset from the first time interval ($t_1$). Furthermore, the speech fundamental frequency estimator (1100) comprises a first power density spectrum calculator (1102) which is configured for storing a version of the first set of values ($Y_1$) and for providing values of a first power density spectrum ($\hat{S}_{\tilde{y}\tilde{y}_d}(\Omega_\mu, n)$) by multiplying the stored version of the first set of values ($Y_1$) with a complex conjugate version of the second set of values ($Y_2$). In addition, the speech fundamental frequency estimator (1100) comprises a second power density spectrum calculator (1104) which is configured for providing values of a second power density spectrum ($\hat{S}_{\tilde{y}\tilde{y}}(\Omega_\mu, n)$) by multiplying a version of the second set of values ($Y_2$) with a complex conjugate version of the second set of values ($Y_2$). Finally, the speech fundamental frequency estimator (1100) includes an analyzer (1106) which is configured for determining the speech fundamental frequency estimate ($f_p(n)$) on the basis of the values of the first power density spectrum ($\hat{S}_{\tilde{y}\tilde{y}_d}(\Omega_\mu, n)$) and the values of the second power density spectrum ($\hat{S}_{\tilde{y}\tilde{y}}(\Omega_\mu, n)$).
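
The two spectral products described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names, frame length, frame offset, Hann window, zero padding and test signal are all assumptions made for illustration. The abstract also does not say how the analyzer combines the two spectra; here they are transformed back to the lag domain, the cross term is shifted by the frame offset, and the strongest peak in a plausible pitch range is taken. That analysis step is an assumption, not the method claimed in the patent.

```python
import numpy as np


def power_density_spectra(y1, y2, n_fft):
    """Form the two products named in the abstract.

    y1, y2: time domain frames, y2 taken later than y1.
    Returns (Y1 * conj(Y2), Y2 * conj(Y2)) on the rfft frequency grid.
    """
    Y1 = np.fft.rfft(y1, n_fft)
    Y2 = np.fft.rfft(y2, n_fft)
    s_cross = Y1 * np.conj(Y2)          # first spectrum (complex valued)
    s_auto = (Y2 * np.conj(Y2)).real    # second spectrum (real, non-negative)
    return s_cross, s_auto


def estimate_f0(s_cross, s_auto, offset, fs, n_fft, f_min=60.0, f_max=400.0):
    """Hypothetical analyzer: lag-domain peak picking (an assumption).

    irfft(s_auto)[k] is the autocorrelation of frame 2 at lag k.
    irfft(s_cross)[k + offset] correlates the same signal at lag k,
    because frame 2 starts `offset` samples after frame 1.
    """
    r_auto = np.fft.irfft(s_auto, n_fft)
    r_cross = np.fft.irfft(s_cross, n_fft)
    lags = np.arange(int(fs / f_max), int(fs / f_min) + 1)
    combined = r_auto[lags] + r_cross[lags + offset]
    best_lag = lags[np.argmax(combined)]
    return fs / best_lag


# Synthetic voiced-like signal: 200 Hz fundamental with two harmonics (assumed).
fs, N, d = 8000, 256, 64
t = np.arange(N + d) / fs
x = sum(a * np.sin(2 * np.pi * 200 * h * t)
        for h, a in ((1, 1.0), (2, 0.5), (3, 0.25)))

w = np.hanning(N)
y1 = w * x[:N]            # first time interval
y2 = w * x[d:d + N]       # second interval, d samples later

# Zero padding to 2N keeps the lag-domain correlations free of wrap-around.
s_cross, s_auto = power_density_spectra(y1, y2, 2 * N)
f0 = estimate_f0(s_cross, s_auto, d, fs, 2 * N)
print(f"estimated fundamental frequency: {f0:.1f} Hz")
```

For this test signal the estimate should land near the 200 Hz fundamental. The resolution is limited to integer lags, so a practical estimator would need interpolation or a finer analysis than this sketch provides.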

IPC 8 full level

G10L 25/90 (2013.01); G10L 21/0216 (2013.01)

CPC (source: EP)

G10L 25/90 (2013.01); G10L 2021/02168 (2013.01)

Citation (search report)

  • [A] WO 0207363 A2 20020124 - IBM [US], et al
  • [AY] QUAST H ET AL: "Robust pitch tracking in the car environment", 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING. PROCEEDINGS (CAT. NO.02CH37334) IEEE PISCATAWAY, NJ, USA, vol. 1, 2002, pages 353 - 356, XP002434822, ISBN: 0-7803-7402-9
  • [A] ROSS M J ET AL: "Average magnitude difference function pitch extractor", IEEE TRANSACTIONS ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING USA, vol. ASSP-22, no. 5, October 1974 (1974-10-01), pages 353 - 362, XP002434823, ISSN: 0096-3518
  • [A] ATKINSON I A ET AL: "Pitch detection of speech signals using segmented autocorrelation", ELECTRONICS LETTERS, IEE STEVENAGE, GB, vol. 31, no. 7, 30 March 1995 (1995-03-30), pages 533 - 535, XP006002624, ISSN: 0013-5194
  • [Y] KLAPURI A P: "Multiple fundamental frequency estimation based on harmonicity and spectral smoothness", IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, IEEE SERVICE CENTER, NEW YORK, NY, US, vol. 11, no. 6, November 2003 (2003-11-01), pages 804 - 816, XP011104552, ISSN: 1063-6676
  • [Y] XU JINFU ET AL: "Noise-robust speech recognition based on difference of power spectrum", ELECTRONICS LETTERS, IEE STEVENAGE, GB, vol. 36, no. 14, 6 July 2000 (2000-07-06), pages 1247 - 1248, XP006015408, ISSN: 0013-5194
  • [A] SHIMAMURA T ET AL: "Noise-robust fundamental frequency extraction method based on exponentiated band-limited amplitude spectrum", CIRCUITS AND SYSTEMS, 2004. MWSCAS '04. THE 2004 47TH MIDWEST SYMPOSIUM ON HIROSHIMA, JAPAN JULY 25-28, 2004, PISCATAWAY, NJ, USA,IEEE, vol. 2, 25 July 2004 (2004-07-25), pages II141 - II144, XP010738725, ISBN: 0-7803-8346-X
  • [A] COSI P ET AL: "Auditory modeling techniques for robust pitch extraction and noise reduction", ICSLP 98 : 5TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING.(INCORPORATING 7TH AUSTRALIAN INTERNATIONAL SPEECH SCIENCE AND TECHNOLOGY CONFERENCE). SYDNEY, AUSTRALIA, NOV. 30 - DEC. 4, 1998, INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCES, vol. CD-ROM, 30 November 1998 (1998-11-30), pages 1053 - 1057, XP002175877, ISBN: 1-876346-17-5

Designated contracting state (EPC)

AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

Designated extension state (EPC)

AL BA HR MK RS

DOCDB simple family (publication)

EP 1944754 A1 20080716; EP 1944754 B1 20160831

DOCDB simple family (application)

EP 07000568 A 20070112