EP 1505570 A1 20050209 - Singing voice synthesizing method
Title (en)
Singing voice synthesizing method
Title (de)
Verfahren zur Synthese einer Singstimme
Title (fr)
Méthode de synthèse de voix chantée
Publication
Application
Priority
EP 03017548 A 20030806
Abstract (en)
A frequency spectrum is detected by analyzing a frequency of a voice waveform corresponding to a voice synthesis unit formed of a phoneme or a phonemic chain. Local peaks are detected on the frequency spectrum, and spectrum distribution regions including the local peaks are designated. For each spectrum distribution region, amplitude spectrum data representing an amplitude spectrum distribution depending on a frequency axis and phase spectrum data representing a phase spectrum distribution depending on the frequency axis are generated. The amplitude spectrum data is adjusted to move the amplitude spectrum distribution represented by the amplitude spectrum data along the frequency axis based on an input note pitch, and the phase spectrum data is adjusted corresponding to the adjustment. Spectrum intensities are adjusted to be along with a spectrum envelope corresponding to a desired tone color. The adjusted amplitude and phase spectrum data are converted into a synthesized voice signal.
IPC 1-7
IPC 8 full level
G10L 13/033 (2013.01); G10L 21/0232 (2013.01)
CPC (source: EP)
G10L 13/033 (2013.01); G10L 21/0232 (2013.01)
Citation (applicant)
- LAROCHE J ET AL.: "New phase-vocoder techniques for pitch-shifting, harmonizing and other exotic effects", APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 1999 IEEE WORKSHOP ON NEW PALTZ, NY, USA 17-20 OCT. 1999, 17 October 1999 (1999-10-17), pages 91 - 94, XP010365068, DOI: doi:10.1109/ASPAA.1999.810857
- CHENG-YUAN LIN ET AL.: "ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2002. THIRD IEEE PACIFIC RIM CONFERENCE ON MULTIMEDIA PROCEEDINGS (LECTURE NOTES IN COMPUTER SCIENCE VOL. 2532), ADVANCES IN MULTIMEDIA INFORMATION PROCESSING", 2002, SPRINGER-VERLAG, article "An on-the-fly Mandarin singing voice synthesis system", pages: 631 - 638
- MOULINES E ET AL.: "SPEECH COMMUNICATION", vol. 16, 1 February 1995, ELSEVIER SCIENCE PUBLISHERS, article "Non-parametric techniques for pitch-scale and time-scale modification of speech", pages: 175 - 205
- DEPALLE P ET AL.: "APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 1995, IEEE ASSP WORKSHOP ON NEW PALTZ, NY, USA 15-18 OCT. 1995", 15 October 1995, article "The recreation of a castrato voice, Farinelli's voice", pages: 242 - 245
Citation (search report)
- [Y] CHENG-YUAN LIN ET AL: "An on-the-fly Mandarin singing voice synthesis system", ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2002. THIRD IEEE PACIFIC RIM CONFERENCE ON MULTIMEDIA. PROCEEDINGS (LECTURE NOTES IN COMPUTER SCIENCE VOL2532), ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, 2002, Berlin, Germany, Springer-Verlag, Germany, pages 631 - 638, XP002265864, ISBN: 3-540-00262-6
- [YD] LAROCHE J ET AL: "New phase-vocoder techniques for pitch-shifting, harmonizing and other exotic effects", APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 1999 IEEE WORKSHOP ON NEW PALTZ, NY, USA 17-20 OCT. 1999, PISCATAWAY, NJ, USA,IEEE, US, 17 October 1999 (1999-10-17), pages 91 - 94, XP010365068, ISBN: 0-7803-5612-8
- [A] MOULINES E ET AL: "Non-parametric techniques for pitch-scale and time-scale modification of speech", SPEECH COMMUNICATION, ELSEVIER SCIENCE PUBLISHERS, AMSTERDAM, NL, vol. 16, no. 2, 1 February 1995 (1995-02-01), pages 175 - 205, XP004024959, ISSN: 0167-6393
- [A] DEPALLE P ET AL: "The recreation of a castrato voice, Farinelli's voice", APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS, 1995., IEEE ASSP WORKSHOP ON NEW PALTZ, NY, USA 15-18 OCT. 1995, NEW YORK, NY, USA,IEEE, US, 15 October 1995 (1995-10-15), pages 242 - 245, XP010154675, ISBN: 0-7803-3064-1
- [A] COOK P R: "TOWARD THE PERFECT AUDIO MORPH? SINGING VOICE SYNTHESIS AND PROCESSING", WORKSHOP ON DIGITAL AUDIO EFFECTS, XX, XX, 19 November 1998 (1998-11-19), pages 223 - 230, XP002151707
- [T] LAROCHE J: "Frequency-domain techniques for high-quality voice modification", DAFX-03 - PROC. OF THE 6TH INT. CONFERENCE ON DIGITAL AUDIO EFFECTS, 8 September 2003 (2003-09-08) - 11 September 2003 (2003-09-11), London, UK, XP002265865
Designated contracting state (EPC)
DE GB IT
DOCDB simple family (publication)
DOCDB simple family (application)
EP 03017548 A 20030806