EP 2132657 A1 20091216 - MACHINE LEARNING FOR TRANSLITERATION
Title (en)
MACHINE LEARNING FOR TRANSLITERATION
Title (de)
MASCHINELLES LERNEN FÜR TRANSKRIPTION
Title (fr)
APPRENTISSAGE AUTOMATIQUE POUR TRANSLITTÉRATION
Publication
EP 2132657 A1 20091216
Application
EP 08731575 A 20080306
Priority
- US 2008056087 W 20080306
- US 89337207 P 20070306
Abstract (en)
[origin: WO2008109769A1] Methods, systems, and apparatus, including computer program products, for performing transliteration between text in different scripts. In one aspect, a method includes generating a transliteration model based on statistical information derived from parallel text having first text in an input script and corresponding second text in an output script; and using the transliteration model to transliterate input characters in the input script to output characters in the output script. In another aspect, a method includes performing word level transliterations. In another aspect, a method includes using an entry-aligned dictionary of source and target script pairs in which, whenever a particular source word is mapped to multiple target words, the dictionary includes a separate entry for each target word, with the same source word repeated in each entry. In another aspect, a method includes using phonetic scores of words in different scripts to identify corresponding parallel text.
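As an illustration only, the entry-aligned dictionary and the statistical model described in the abstract can be sketched in Python as follows. This is a toy under stated assumptions and not the method claimed in the application: the function names (entry_align, train_char_model, transliterate), the Latin-to-Cyrillic sample words, and the one-to-one character alignment of equal-length word pairs are all hypothetical choices made for the example.

```python
from collections import Counter, defaultdict


def entry_align(mapping):
    """Flatten {source: [targets]} into one (source, target) entry per target,
    repeating the source word whenever it maps to several target words."""
    return [(src, tgt) for src, targets in mapping.items() for tgt in targets]


def train_char_model(pairs):
    """Count character correspondences in the parallel pairs.
    Assumption: equal-length pairs align character by character."""
    counts = defaultdict(Counter)
    for src, tgt in pairs:
        if len(src) != len(tgt):
            continue  # the toy alignment cannot handle this pair
        for s, t in zip(src, tgt):
            counts[s][t] += 1
    return counts


def transliterate(text, counts):
    """Map each input character to its most frequent output character;
    characters never seen in training are kept unchanged."""
    return "".join(
        counts[ch].most_common(1)[0][0] if ch in counts else ch
        for ch in text
    )


# Hypothetical sample data: "net" maps to two target spellings.
mapping = {"dom": ["дом"], "net": ["нет", "нэт"], "most": ["мост"]}
entries = entry_align(mapping)   # "net" appears in two separate entries
model = train_char_model(entries)
print(transliterate("tom", model))
```

A practical system would need an alignment that handles pairs of unequal length and would score whole candidate words rather than choosing each character independently; the sketch only shows how repeated source entries feed simple co-occurrence statistics.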
IPC 8 full level
G06F 17/28 (2006.01)
CPC (source: EP)
G06F 40/126 (2020.01); G06F 40/16 (2020.01)
Designated contracting state (EPC)
AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MT NL NO PL PT RO SE SI SK TR
DOCDB simple family (publication)
WO 2008109769 A1 20080912; EP 2132657 A1 20091216; EP 2132657 A4 20180103
DOCDB simple family (application)
US 2008056087 W 20080306; EP 08731575 A 20080306