(19)
(11)EP 3 441 887 A1

(12)EUROPEAN PATENT APPLICATION
published in accordance with Art. 153(4) EPC

(43)Date of publication:
13.02.2019 Bulletin 2019/07

(21)Application number: 16897834.4

(22)Date of filing:  04.04.2016
(51)International Patent Classification (IPC): 
G06F 13/00(2006.01)
(86)International application number:
PCT/JP2016/061028
(87)International publication number:
WO 2017/175275 (12.10.2017 Gazette  2017/41)
(84)Designated Contracting States:
AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR
Designated Extension States:
BA ME
Designated Validation States:
MA MD

(71)Applicant: WOVN TECHNOLOGIES, INC.
Tokyo 1080073 (JP)

(72)Inventors:
  • HAYASHI, Takaharu
    Minato-ku, Tokyo 1080073 (JP)
  • SANDFORD, Jeffrey Thomas
    Minato-ku, Tokyo 1080073 (JP)

(74)Representative: EIP 
EIP Europe LLP Fairfax House 15 Fulwood Place
London WC1V 6HU
London WC1V 6HU (GB)

  


(54)TRANSLATION SYSTEM


(57) The present invention provides a translation system that is able to translate a web page by a simple alteration of a website. The translation system comprises: a translation request receiving unit for receiving a translation request from a client device, the translation request including the URL of a web page in which text in a first language is displayed; a translating unit for translating the text in the first language included in the web page indicated by the URL into text in a second language by referring a bilingual database storing words and phrases in the first language associated with words and phrases in the second language constituting translated text of the words and phrases in the first language; and a translation sending unit for sending the translated text in the second language to the client device.




Description

Technical Field



[0001] The present invention relates to a translation system.

Background Art



[0002] Websites published on the internet can be accessed by computers anywhere in the world. However, the web pages constituting a website operated in a foreign country are often created in a language other than the native language of that country. These web pages are translated in order to provide easy access to the website (see, for example, Patent Document 1).

Citation List


Patent Literature



[0003] Patent Literature 1: JP 2013-190975 A

Summary of Invention


Technical Problem



[0004] However, in the configuration disclosed in Patent Document 1, a system for translating websites has to be provided for each website, and it is sometimes difficult to apply this system to small-scale websites due to cost factors.

[0005] It is an object of the present invention to provide a translation system that is able to translate a web page by a simple alteration of a website.

Solution to Problem



[0006] According to one aspect of the present invention, a translation system comprises: a translation request receiving unit for receiving a translation request from a client device, the translation request including the URL of a web page in which text in a first language is displayed; a translating unit for translating the text in the first language included in the web page indicated by the URL into text in a second language by referring to a bilingual database storing words and phrases in the first language associated with words and phrases in the second language constituting translated text of the words and phrases in the first language, and; and a translation sending unit for sending the translated text in the second language to the client device.

[0007] In the present invention, "unit" refers not only to a physical means but also to the implementation of the function of the "unit" using software. The functions of a single "unit" or device may be implemented by two or more physical means or devices, or the functions of two or more "units" and devices may be realized by a single physical means or device.

Advantageous Effects of Invention



[0008] The present invention is able to provide a translation system that is able to translate a web page by a simple alteration of a website.

Brief Description of Drawings



[0009] 

FIG. 1 is a diagram showing a configuration of the information processing system in the first embodiment of the present invention.

FIG. 2 is a diagram summarizing the library method in the first embodiment.

FIG. 3 is a diagram summarizing the script method in the first embodiment.

FIG. 4 is a diagram showing an example of a configuration of the translation system 30 in the first embodiment.

FIG. 5 is a diagram showing an example of the structure of a bilingual database 220.

FIG. 6 is a flowchart showing an example of translation processing in the first embodiment.

FIG. 7 is a diagram summarizing the library method in the second embodiment.

FIG. 8 is a diagram showing an example of a configuration of the translation system 30 in the second embodiment.

FIG. 9 is a flowchart showing an example of translation processing in the second embodiment.

FIG. 10 is a diagram summarizing the library method in the third embodiment.

FIG. 11 is a diagram showing an example of a configuration of the translation system 30 in the third embodiment.

FIG. 12 is a diagram showing an example of the structure of a synonym database 1000.

FIG. 13 is a diagram showing an example of the structure of a score database 1010.

FIG. 14 is a flowchart showing an example of translation processing in the third embodiment.


Description of Embodiments


First Embodiment



[0010] The following is an explanation of a first embodiment of the present invention. FIG. 1 is a diagram showing a configuration of the information processing system in the first embodiment of the present invention. As shown in FIG. 1, the information processing system includes websites 10, user terminals 20, and a translation system 30. The websites 10, user terminals 20, and translation system 30 are connected via a network such as the internet to enable communication.

[0011] The websites 10 provides web pages to the user terminals 20 and are equipped with computers such as web servers, database servers, and application servers. One or more computers may be used by each website 10. The websites 10 can provide web pages to user terminals 20 based on requests from user terminals 20. By using the translation system 30, websites 10 can provide translated pages to user terminals 20.

[0012]  A user terminal 20 is a computer used by a user to access a website 10. The user terminal 20 can be, for example, a personal computer, tablet, or smartphone. The user can access a website 10 by operating a browser installed on the user terminal 20.

[0013] The translation system 30 provides a web page translation service to websites 10. The translation system 30 includes one or more computers. A translation system 30 is not provided for each website 10 but is shared with the plural websites 10.

[0014] The following is an explanation of a translation method for web pages using a translation system 30. In the present embodiment, as examples of the translation method, two types of translation method are used, namely, the "library method" and the "script method".

[0015] FIG. 2 is a diagram summarizing the library method in the first embodiment. In the library method, a library 200 for utilizing the translation system 30 is incorporated into a website 10. The user inputs a URL (Universal Resource Locator) indicating a web page of the desired website 10 in the browser 210 on the user terminal 20. As a result, an HTTP (Hyper Text Transfer Protocol) request is sent to the website 10 from the user terminal 20 (S200).

[0016] When the website 10 receives an HTTP request from a user terminal 20, the website 10 sends a translation request to translate the first language text (for example, Japanese) included in the web pages indicated by the URL into the second language text (for example, English) (S210). The following is an example of the format of this translation request.
url: http://website.com, src: ja, dst: en
"url: http://website.com" is the URL of the web page to be translated, "src: ja" is the translation source language code, and "dst: en" is the translation target language code. In the case of the translation target language is fixed, a translation target language code does not have to be included. In the case of the source language can be determined by analyzing the web page, translation source language does not have to be included.

[0017] When the translation system 30 received a translation request from a website 10 (client device), the translation system 30 obtains the first language text to be translated on the web page to be translated based on the URL included in the translation request. For example, the text in the first language to be translated is the text included in the body of the web page (< BODY> ∼ </BODY>). The translation system 30 acquires translated sentence translated first language text into the second language text by referring to a bilingual database (DB) 220 storing words and phrases in the first language associated with words and phrases in the second language constituting translated text of the words and phrases in the first language, and (S220).Then, the translation system 30 sends the translated text to the website 10 (S230). The following is an example of the format of the translated text sent to the website 10.
src: Kon'nichiwa, dst: Hello, xpath:/body/p/text()
"src: Kon'nichiwa" is the translation source text in the first language, "dst: Hello" is the translation target text in the second language, and "xpath:/body/p/text()" is the code for identifying the location of the text to be translated on the web page.

[0018] The website 10 converts the first language text on the web page into the second language text received from the translation system 30 and sends an HTTP response to the user terminal 20 (S240). As a result, a translated web page is displayed in the browser 210 on the user terminal 20.

[0019] FIG. 3 is a diagram summarizing the script method in the first embodiment. In the script method, the script for utilizing the translation system 30 is embedded into a web page. The user inputs a URL in the browser 210 on the user terminal 20 indicating a web page of the target website 10. As a result, an HTTP request is sent to the website 10 from the user terminal 20 (S300).

[0020] When the website 10 received an HTTP request from a user terminal 20, the website 10 sends an HTTP response to the user terminal 20 for displaying the web page indicated by the URL (S310). As a result, a translated web page including translation source first language text is displayed in the browser 210 on the user terminal 20. And, the script is embedded in the web page for utilizing the translation system 30. This script is executed by the browser 210 and a translation request is sent from the user terminal 20 to the translation system 30. The format of translation requests is the same as the the library method.

[0021] As the library method, the translation system 30 acquires translated sentence translated the first language text into the second language text by referring to the bilingual database (DB) 220 based on the URL included in the translation request received from the user terminal (client device) and (S330). Then, the translation system 30 sends the translated sentence to the user terminal 20 (S340). The format of translated sentence sent to the user terminals 20 is the same as the format of translated sentence sent to the websites 10 in the library method.

[0022] On the user terminal 20, the text on the web page is converted into the second language text received from the translation system 30 by the script executed in the browser 210. As a result, a translated web page is displayed in the browser 210 on the user terminal 20.

[0023] The following is an explanation of a configuration using the library method. This configuration can also be implemented using the script method.

[0024] FIG. 4 is a diagram showing an example of the configuration of the translation system 30 in the first embodiment. The translation system 30 includes a translation request receiving unit 400, a translating unit 410, and a translation sending unit 420. Each unit can be implemented in a computer including memory and a processor, when the processor executes a program stored in the memory, for example.

[0025] The translation request receiving unit 400 receives a translation request including the URL of a web page displayed the first language text from a website 10 (client device). As explained above, the translation request contains a language code for the first language, which is the translation source language, and a language code for the second language, which is the translation target language, in addition to the URL.

[0026] The translating unit 410 translates first language text on the web page indicated by the URL included in the translation request into the second language text by referring to the bilingual database 220. FIG. 5 is a diagram showing an example of the structure of a bilingual database 220. In the example shown in FIG. 5, the bilingual database 220 includes translation source language codes, translation source words and phrases, translation target language codes, translation target words and phrases, and URLs. URLs are provided to customize translations for each website. For example, in the example shown in FIG. 5, the "kon'nichiwa" is translated to "hello" at http://website.com. In another case,the translation can be registered, "kon'nichiwa" is translated to "hi" at http://another.com. In other words, the translating unit 410 can obtain a translation in response to the URL included in the translation request from the bilingual database 220.

[0027] The translation sending unit 420 sends the second language text translated by the translating unit 410 to the website 10 (client device).

[0028] FIG. 6 is a flowchart showing an example of translation processing in the first embodiment. First, the browser 210 in the user terminal 20 sends an HTTP request to display a website of the first language to the website 10 (S600).

[0029] The website 10 receives the HTTP request from the user terminal 20 (S601). Then, the website 10 sends a translation request for translating the first language on the webpage into a second language, to the translation system 30(S602).

[0030]  The translation request receiving unit 400 in the translation system 30 receives the translation request from the website 10 (S603). Based on this translation request, the translating unit 410 in the translation system translates the first language text on the web page into the second language one(S604). Then, the translation sending unit 420 in the translation system 30 sends the translated sentence to the website 10 (S605).

[0031] The website 10 receives the translated sentence from the translation system 30 (S606). The website 10 converts the first language text on the web page into the second language text then sends the HTTP response to the user terminal 20 (S607).

[0032] The browser 210 in the user terminal 20 receives the HTTP response from the website 10 (S608). As a result, the webpage translated into the second language is displayed on the browser 210 (S609).

[0033] According to the first embodiment above, it is possible to translate many websites by making the simple alteration that is incorporating the library or the script.

[0034] Also, it is possible to customize translations for each website by providing items indicating the website in a bilingual database 220.

Second Embodiment



[0035] The following is an explanation of a second embodiment of the present invention. The explanation about common parts with the first embodiment is omitted. The similar architecture with the first embodiment is marked with symbols and omitted to explain.

[0036] FIG. 7 is a diagram summarizing the library method in the second embodiment. The process from the HTTP request (S200) to the HTTP response (S240) is the same as the process in the first embodiment (FIG. 2). In the second embodiment, the script is embedded in a web page to search for untranslated text. For example, there is any possibility of existing untranslated text on a web page displayed on a browser 210, on the web page updated dynamically. When untranslated text is detected by the script, the untranslated first language text is sent to the translation system 30 (S700). Then, translation system 30 updates the bilingual database 220 based on this report (S710).

[0037] FIG. 8 is a diagram showing the configuration of the translation system 30 in the second embodiment. In addition to the configuration shown in FIG. 4, this translation system 30 includes the untranslated text receiving unit 800 and the translation updating unit 810.

[0038] The untranslated text receiving unit 800 receives untranslated first language text included in the webpage from the user terminal 20 displaying the webpage translated into the second language.

[0039] The translation updating unit 810 associates first language words or phrases corresponding to the untranslated text with second language words or phrases and registers the association in the bilingual database 220.

[0040] FIG. 9 is a flowchart showing an example of translation processing in the second embodiment. The process from step S600 to step S609 is the same as the process in the first embodiment (FIG. 6).

[0041] In the user terminal 20, the script searching for untranslated text on the web page is executed on the browser. Then, when the untranslated text is searched (S900), the untranslated first language text is sent from the user terminal 20 to the translation system 30 (S901).

[0042] The untranslated text receiving unit 800 in the translation system 30 receives the untranslated first language text from the user terminal 20 (S902). The translation updating unit 810 in the translation system 30 registers the translation corresponding to the untranslated text to the bilingual database 220 (S903).

[0043] According to the second embodiment above, it is possible to enrich a translation by adding a translation of untranslated text to the bilingual database 220.

Third Embodiment



[0044] The following is an explanation of a third embodiment of the present invention. The explanation about common parts with the first embodiment is omitted. The similar architecture with the first embodiment is marked with symbols and omitted to explain.

[0045] FIG. 10 is a diagram summarizing the library method in the third embodiment. In the third embodiment, the translation system 30 acquires translated text based on the bilingual database 220 (S220), and selects a synonym from a plurality of second language synonyms (for example, "buy" and "purchase") corresponding to the first language word (for example, "konyu") (S1000) by referencing a synonym database (DB) 1000. The synonym can be selected based on a score stored in a score database (DB) 1010, for example. The translation system 30 sends the translated text including the selected synonym to the website 10 (S230).

[0046] In the third embodiment, the script monitoring the user behavior is embedded in a web page. For example, the user behavior is clicks on the web page, time spent on the web page, and leaving the web page. The user behavior information detected by the script is sent from the user terminal 20 to the translation system 30 (S1010). The translation system 30 updates the score related to the user behavior based on this behavior information (S1020).

[0047] FIG. 11 is a diagram showing an example of the configuration of the translation system 30 in the third embodiment. In addition to the configuration shown in FIG. 4, this translation system 30 comprises the synonym selecting unit 1100 and the score updating unit 1110.

[0048] The synonym selecting unit 1100 implements a part of functions in the translating unit 410. The synonym selecting unit 1100 references the synonym database 1000, and selects a first synonym (for example, "buy") for one translation request (the first translation request) and a second synonym (for example, "purchase") for another translation request (the second translation request) as the translation for a first word included in the first language text(for example, "konyu"). The translating unit 410 sends the translated text used the selected synonym to the website 10. FIG. 12 is a diagram showing an example of the structure of a synonym database 1000. In the example shown in FIG. 12, the synonym database 1000 includes translation source words and phrases, translation destination language codes, and synonyms. For example, two synonyms, "buy" and "purchase" are registered as translations for "konyu". Further, two synonyms "login" and "sign in" are registered as translations for "roguin". The number of synonyms is not limited to two, and there may be three or more.

[0049] For example, synonyms can be selected by the synonym selecting unit 1100 based on scores stored in a score database 1010. FIG. 13 is a diagram showing an example of the structure of a score database 1010. In the example shown in FIG. 13, the score database 1010 includes words and phrases, view counts, numbers of clicks, duration of stays, and departure rates. "View counts" means the number of times a website containing a certain word or phrase is displayed. "Number of clicks" means the number of times a click operation is performed on a website (or the number of times a button displayed the word or phrase is clicked). "Duration of stay" means the amount of time a user remains on a website, and "departure rate" means the proportion of users who moved on to another web page or closed the browser without performing a certain operation on the web page (such as clicking a button).

[0050] The synonym selecting unit 1100 can determine the score for each word or phrase based on one or more items in the score database 1010. When the synonym selecting unit 1100 sends translated text for a web page to a website 10, selects the highest score synonym as the best translation by tentatively determining with high ratio, on the other hand uses the remaining proportion to measure the effectiveness of each synonym. For example, in the case of "buy", "purchase" and "add to cart" are registered as synonyms for translating "konyu" and the score of "buy" is the highest, "buy" will be selected with high ratio (for example, 95% of the time). In order to measure the effectiveness of each of the three synonyms "buy", "purchase" and "add to cart", each of the synonyms is selected, for example, one-third of the time in the remaining ratio (for example, 5%).

[0051] The synonym selecting unit 1100 does not have to select a second language synonym for all words and phrases included in the first language text on the website. For example, the synonym selecting unit 1100 can select second language synonyms for words and phrases displayed in a specific location on the website (such as on a button) or for specific words and phrases that have an impact on the users behavior (for example, "konyu").

[0052] The score updating unit 1110 receives user behavior information on a web page from user terminals 20 displaying the web page translated into the second language, and updates the score database 1010 storing the scores for each second language synonym based on the behavior information.

[0053] FIG. 14 is a flowchart showing an example of translation processing in the third embodiment. The explanation about common parts with flowchart shown in FIG. 6 is omitted.

[0054] The translation request receiving unit 400 in the translation system 30 receives a translation request from a website 10 (S603). The translating unit 410 in the translation system 30 translates the first language text on a web page into the second language text based on the translation request (S604). Then, the synonym selecting unit 1100 selects synonyms for the object of words or phrases included in the first language text based on the scores stored in the score database 1010 (S1400). The translation sending unit 420 in the translation system 30 sends the translated sentence including selected synonyms to the website 10 (S605). The translation sending unit 420 can output information indicating that synonyms were selected by the synonym selecting unit 1100 for verification of effectiveness with the translated text. This information is included in the HTTP response to the user terminal 20.

[0055] In the user terminal 20, the script to monitor the user behavior is executed in the browser (S1401). Then, user behavior information is sent from the user terminal 20 to the translation system 36 (S1402). It is no problem that monitoring of the user behavior by the user terminal 20 is performed only when a synonym is selected for verification of effectiveness.

[0056]  The score updating unit 1110 in the translation system 30 receives user behavior information from the user terminal 20 (S1403). Then, the score updating unit 1110 updates the score stored in the score database 1010 (S1404) based on user behavior information.

[0057] According to the third embodiment above, it is possible to analyze user behavior for each synonym. For example, by analyzing the user behavior, it is possible to select a translation which has users take desired action(for example, behavior to purchase a product).

[0058] The present embodiments are for easily understanding about the present invention, not for limitedly interpreting the present invention. This present invention can be modified and improved without deviating from its purpose, and includes any equivalent.

Reference Signs List



[0059] 
10:
Website
20:
User terminal
30:
Translation system
200:
Library
210:
Browser
220:
Bilingual database
400:
Translation request receiving unit
410:
Translating unit
420:
Translation sending unit
800:
Untranslated text receiving unit
810:
Translation updating unit
1000:
Synonym database
1010:
Score database
1100:
Synonym selecting unit
1110:
Score updating unit



Claims

1. A translation system comprising:

a translation request receiving unit for receiving a translation request including the URL of a web page displayed a first language text;

a translating unit for translating the text in the first language included in the web page indicated by the URL into text in a second language by referring to a bilingual database storing words and phrases in the first language associated with words and phrases in the second language constituting translated text of the words and phrases in the first language, and; and

a translation sending unit for sending the translated text in the second language to the client device.


 
2. The translation system according to claim 1, wherein a language code indicating the translation target language is included in the translation request, and the translating unit translates the text in the first language into text in a second language indicated by the language code.
 
3. The translation system according to claim 1 or 2, wherein URLs are included in the bilingual database as items associating words and phrases in the first language with words and phrases in the second language, and
the translating unit translates text in the first language into text in the second language based on the words and phrases in the first language and the words and phrases in the second language corresponding to the URL included in the translation request.
 
4. The translation system according to any one of claims 1 to 3, further comprising:

an untranslated text receiving unit for receiving untranslated text included in the web page from a user terminal on which the web page translated into the second language is displayed; and

a translation updating unit for associating a word or phrase in the first language corresponding to the untranslated text with a word or phrase in the second language constituting a translation of the word or phrase in the first language and registering the association in the bilingual database.


 
5. The translation system according to any one of claims 1 to 4, wherein:

the translating unit selects a first synonym for a first translation request among the translation requests as the translation for a first word or phrase included in the text in the first language, and selects a second synonym for a second request among the translation requests, by referring to a synonym database storing synonyms of words and phrases in the second language;

the translation system further comprises a score updating means for receiving behavior information indicating the behavior of the user on the website from the user terminal on which the website translated into the second language is displayed; and

the translation system further comprises a score updating unit for storing the score for each synonym in the second language based on this behavior information.


 
6. The translation system according to claim 5, wherein the score for the first synonym is higher than the score for the second synonym, and the ratio at which the first synonym is selected is higher than the ratio at which the second synonym is selected.
 
7. A translation method comprising the computer-executed steps of:

receiving a translation request from a client device, the translation request including the URL of a web page in which text in a first language is displayed;

translating the text in the first language included in the web page indicated by the URL into text in the second language by referring to a bilingual database storing words and phrases in the first language associated with words and phrases in a second language constituting translated text of the words and phrases in the first language; and

sending the translated text in the second language to the client device.


 




Drawing








































Search report










Cited references

REFERENCES CITED IN THE DESCRIPTION



This list of references cited by the applicant is for the reader's convenience only. It does not form part of the European patent document. Even though great care has been taken in compiling the references, errors or omissions cannot be excluded and the EPO disclaims all liability in this regard.

Patent documents cited in the description