Global Patent Index - EP 3792915 A1

EP 3792915 A1 20210317 - SYSTEMS AND METHODS FOR ALIGNING LYRICS USING A NEURAL NETWORK

Title (en)

SYSTEMS AND METHODS FOR ALIGNING LYRICS USING A NEURAL NETWORK

Title (de)

SYSTEME UND VERFAHREN ZUM AUSRICHTEN VON LIEDTEXTEN UNTER VERWENDUNG EINES NEURONALEN NETZWERKS

Title (fr)

SYSTÈMES ET PROCÉDÉS D'ALIGNEMENT DE PAROLES À L'AIDE D'UN RÉSEAU NEURONAL

Publication

EP 3792915 A1 20210317 (EN)

Application

EP 19213434 A 20191204

Priority

  • US 201916569372 A 20190912
  • EP 19205617 A 20191028
  • US 201916691463 A 20191121

Abstract (en)

An electronic device receives audio data for a media item. The electronic device generates, from the audio data, a plurality of samples, each sample having a predefined maximum length. The electronic device, using a neural network trained to predict textual unit probabilities, generates a probability matrix of textual units for a first portion of a first sample of the plurality of samples. The probability matrix includes information about textual units, timing information, and respective probabilities of respective textual units at respective times. The electronic device identifies, for the first portion of the first sample, a first sequence of textual units based on the generated probability matrix.

IPC 8 full level

G10L 15/26 (2006.01); G10L 15/16 (2006.01); G10L 15/183 (2013.01); G10L 15/22 (2006.01)

CPC (source: EP)

G10L 15/16 (2013.01); G10L 15/183 (2013.01); G10L 15/26 (2013.01); G10L 2015/226 (2013.01)

Citation (search report)

  • [X] US 2018061439 A1 20180301 - DIAMOS GREGORY FREDERICK [US], et al
  • [A] US 2018174576 A1 20180621 - SOLTAU HAGEN [US], et al
  • [XI] DANIEL STOLLER ET AL: "End-to-end Lyrics Alignment for Polyphonic Music Using an Audio-to-Character Recognition Model", ARXIV.ORG, CORNELL UNIVERSITY LIBRARY, 201 OLIN LIBRARY CORNELL UNIVERSITY ITHACA, NY 14853, 18 February 2019 (2019-02-18), XP081030544
  • [A] CHITRALEKHA GUPTA ET AL: "Semi-supervised Lyrics and Solo-singing Alignment", PROC. OF THE 19TH ISMIR CONFERENCE, 23 September 2018 (2018-09-23), XP055644355, DOI: 10.5281/ZENODO.1492487
  • [A] ANNA M. KRUSPE: "Bootstrapping A System For Phoneme Recognition And Keyword Spotting In Unaccompanied Singing", 17TH INTERNATIONAL SOCIETY FOR MUSIC INFORMATION RETRIEVAL CONFERENCE, 7 August 2016 (2016-08-07), pages 358 - 364, XP055669382, DOI: 10.5281/ZENODO.1417552

Designated contracting state (EPC)

AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

Designated extension state (EPC)

BA ME

DOCDB simple family (publication)

EP 3792915 A1 20210317

DOCDB simple family (application)

EP 19213434 A 20191204