<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE ep-patent-document PUBLIC "-//EPO//EP PATENT DOCUMENT 1.1//EN" "ep-patent-document-v1-1.dtd">
<ep-patent-document id="EP03748274B1" file="EP03748274NWB1.xml" lang="en" country="EP" doc-number="1454313" kind="B1" date-publ="20061018" status="n" dtd-version="ep-patent-document-v1-1">
<SDOBI lang="en"><B000><eptags><B001EP>......DE....FRGB................................................................</B001EP><B003EP>*</B003EP><B005EP>J</B005EP><B007EP>DIM360 (Ver 1.5  21 Nov 2005) -  2100000/0</B007EP></eptags></B000><B100><B110>1454313</B110><B120><B121>EUROPEAN PATENT SPECIFICATION</B121></B120><B130>B1</B130><B140><date>20061018</date></B140><B190>EP</B190></B100><B200><B210>03748274.2</B210><B220><date>20030922</date></B220><B240><B241><date>20040625</date></B241></B240><B250>en</B250><B251EP>en</B251EP><B260>en</B260></B200><B300><B310>0221945</B310><B320><date>20020920</date></B320><B330><ctry>GB</ctry></B330></B300><B400><B405><date>20061018</date><bnum>200642</bnum></B405><B430><date>20040908</date><bnum>200437</bnum></B430><B450><date>20061018</date><bnum>200642</bnum></B450><B452EP><date>20051010</date></B452EP></B400><B500><B510EP><classification-ipcr sequence="1"><text>G10L  13/08        20060101AFI20040408BHEP        </text></classification-ipcr></B510EP><B540><B541>de</B541><B542>VERFAHREN ZUR ADAPTATION EINES AUSSPRACHE-LEXIKONS FÜR DIE SPRACHSYNTHESE</B542><B541>en</B541><B542>METHOD FOR ADAPTING A PRONUNCIATION DICTIONARY USED FOR SPEECH SYNTHESIS</B542><B541>fr</B541><B542>PROCEDE D'ADAPTATION D'UN DICTIONNAIRE DE PRONONCIATION UTILISE DANS LA SYNTHESE DE LA PAROLE</B542></B540><B560><B561><text>US-A1- 2001 012 999</text></B561><B561><text>US-A1- 2002 103 648</text></B561><B562><text>EDGINGTON M ET AL: "OVERVIEW OF CURRENT TEXT-TO-SPEECH TECHNIQUES: PART I - TEXT AND LINGUISTIC ANALYSIS" BT TECHNOLOGY JOURNAL, BT LABORATORIES, GB, vol. 14, no. 1, 1996, pages 68-83, XP000554640 ISSN: 1358-3948</text></B562></B560></B500><B700><B720><B721><snm>ANDERTON, John</snm><adr><str>Epson Scotland Design,
Centre The Alba Campus</str><city>Livingston EH54 7EG</city><ctry>GB</ctry></adr></B721></B720><B730><B731><snm>SEIKO EPSON CORPORATION</snm><iid>00730004</iid><irf>EPP14202A</irf><adr><str>4-1, Nishi-shinjuku 2-chome</str><city>Shinjuku-ku,
Tokyo 163-0811</city><ctry>JP</ctry></adr></B731></B730><B740><B741><snm>Kenyon, Sarah Elizabeth</snm><sfx>et al</sfx><iid>00062343</iid><adr><str>Miller Sturt Kenyon 
9 John Street</str><city>London WC1N 2ES</city><ctry>GB</ctry></adr></B741></B740></B700><B800><B840><ctry>DE</ctry><ctry>FR</ctry><ctry>GB</ctry></B840><B860><B861><dnum><anum>GB2003004037</anum></dnum><date>20030922</date></B861><B862>en</B862></B860><B870><B871><dnum><pnum>WO2004027757</pnum></dnum><date>20040401</date><bnum>200414</bnum></B871></B870></B800></SDOBI><!-- EPO <DP n="1"> -->
<description id="desc" lang="en">
<p id="p0001" num="0001">The present invention relates to a method for operating a text to speech (TTS) system according to claim 1 and a mobile telephone incorporating such a TTS system according claim 13.</p>
<p id="p0002" num="0002">A text to speech (TTS) system converts text to speech and involves determining the correct pronunciation. Figure 1 illustrates a typical TTS system incorporating four typical processing steps. The input text is analysed, segmented and normalised in the first processing step. In the second step, the normalised text is analysed morphologically and syntactically to determine the words or morphological tokens and the syntax of those words into notation which represents the pronunciation. That notation or linguistic text is combined together with prosodic parameters in step three. Finally, the linguistic text together with the prosody is synthesised using the pronunciation notation to output speech of the input text.</p>
<p id="p0003" num="0003">The present invention is directed towards such a TTS system and involves an improvement of at least step 2 in determining the pronunciation.</p>
<p id="p0004" num="0004">Hitherto, there have been proposed two methods for determining the pronunciation. The first method is to use pronunciation rules. These rules are typically developed (either manually or automatically) based on extensive knowledge and exposure to the language used in the TTS system. Examples for the constructions for such rules can be found from the following references:
<ul id="ul0001" list-style="none" compact="compact">
<li>[1] The CMU Pronouncing Dictionary (http;//www.speech.cs.cmu.edu/cgi-bin/cmudict)</li>
<li>[2] "Phonemic transcription by analogy in text to speech synthesis: Novel word pronunciation and lexicon compression" by Bagshaw, Computer Speech and Language (1998) volume 12 pages 119-142.</li>
</ul></p>
<p id="p0005" num="0005">Historically most early TTS systems used such rules extensively. The primary advantage of this method is the low memory requirements. As the quality of the TTS systems improved, pronunciations errors from the rule based method became more apparent.<!-- EPO <DP n="2"> --> The art developed to derive the second method which uses a dictionary. The dictionary or lexicon stores a potentially large number of input words together with the associated pronunciation, preferably for all syntactic variations of the word. A typical example for two entries in such a dictionary is as follows:
<tables id="tabl0001" num="0001">
<table frame="all">
<tgroup cols="3" colsep="1" rowsep="1">
<colspec colnum="1" colname="col1" colwidth="50mm" colsep="1"/>
<colspec colnum="2" colname="col2" colwidth="34mm" colsep="1"/>
<colspec colnum="3" colname="col3" colwidth="29mm" colsep="1"/>
<thead>
<row>
<entry namest="col1" nameend="col1" align="left" valign="top"><b>Orthographic Representation</b></entry>
<entry namest="col2" nameend="col2" align="left" valign="top"><b>Syntactic Category</b></entry>
<entry namest="col3" nameend="col3" align="left" valign="top"><b>Pronunciation</b></entry></row></thead>
<tbody>
<row>
<entry namest="col1" nameend="col1" align="left" valign="top">record</entry>
<entry namest="col2" nameend="col2" align="left" valign="top">verb</entry>
<entry namest="col3" nameend="col3" align="left" valign="top">R AH0 K AO1 R D</entry></row>
<row>
<entry namest="col1" nameend="col1" align="left" valign="top">record</entry>
<entry namest="col2" nameend="col2" align="left" valign="top">noun</entry>
<entry namest="col3" nameend="col3" align="left" valign="top">R EH1 K ER0 D</entry></row></tbody></tgroup>
</table>
</tables></p>
<p id="p0006" num="0006">Each entry comprises the word in the form of an orthographic representation, a field defining the syntactic category of the word and the pronunciation, in this case using phonetic notation. Thus, the above two entries show the two pronunciations for the word "record" for the two different syntactic categories. Entries for many other words can be found in reference [1] noted above.</p>
<p id="p0007" num="0007">If a word is not found in the dictionary, then the TTS reverts to determining the pronunciation using the pronunciation rules.</p>
<p id="p0008" num="0008">It has been found that the dictionary method produces a higher quality speech than the pronunciation rule method. However, the dictionary method requires a large memory to store the dictionary. In certain applications, there is a trade-off which must be made between the speech quality and the size of the memory which can be used. Previously, the most common way of addressing this trade-off is to design the dictionary so as to include only words which are most likely to be used in that application. Those most frequently used words must be analysed for the particular application. Such a trade-off and alternative ways of addressing this trade-off is described in reference number [2] above and the following references:
<ul id="ul0002" list-style="none" compact="compact">
<li>[3] "Speech Technology for Communications" by Westall, Johnston &amp; Lewis. 1998 ISBN 0 412 79080 7 Chapter 6.</li>
<li>[4] "Letter to sound rules for accented lexicon compression" Pagel, Lenzo &amp; Black ESCA98, 3rd International Workshop on Speech Syntheses, November 1998.</li>
</ul><!-- EPO <DP n="3"> --></p>
<p id="p0009" num="0009">Thus, the present invention aims to address this trade-off and improve the balance between speech quality and size of memory. In essence, the present invention relates to adapting the contents of the memory by adding to or maintaining those words most frequently used and deleting those words which are less frequently used.</p>
<p id="p0010" num="0010">The following document is an example of a text-to-speech synthesis system using both a dictionnary and pronunciation rules for handling exceptions:
<ul id="ul0003" list-style="none">
<li>[5] US 2002/0103648-A1.</li>
</ul></p>
<p id="p0011" num="0011">Accordingly, the present invention relates to a method of implementing a text to speech (TTS) system including a pronunciation dictionary and a memory containing pronunciation rules, said method comprising:
<ul id="ul0004" list-style="none" compact="compact">
<li>comparing each word in said text to words in said pronunciation dictionary to identify if the identical word is present;</li>
<li>if the word is identified, then obtaining the pronunciation of said word;</li>
<li>if the word is not identified, then applying said pronunciation rules to said word to obtain the pronunciation of said word; wherein the improvement lies in</li>
<li>storing each word which is not identified in an exceptions log; and</li>
<li>adding the most frequently referred words from the exceptions log to the pronunciation dictionary.</li>
</ul></p>
<p id="p0012" num="0012">In many instances, the size of the memory is restricted. Thus, the method enables the dictionary stored in the memory to be adapted so as to retain only those most frequently used words. This obviates the need for an initial analysis of the application of the TTS system. Moreover, in many applications where the memory size is severely limited such as in a mobile telephone, the speech quality can be improved considerably.</p>
<p id="p0013" num="0013">Accordingly, the present invention also relates to a mobile telephone incorporating a TTS system as defined in the attached claims.</p>
<p id="p0014" num="0014">Embodiments of the present invention will now be described by way of further example only and with reference to the accompanying drawings, in which:
<ul id="ul0005" list-style="none" compact="compact">
<li>Figure 1 is an illustration of a TTS system to which the present invention may be applied;</li>
<li>Figure 2 is an illustration of the present invention in determining pronunciation;</li>
<li>Figure 3 is an illustration of the interaction between the pronunciation determining stage and dictionary update stage;<!-- EPO <DP n="4"> --><!-- EPO <DP n="5"> --></li>
<li>Figure 4 is an illustration of the dictionary update stage;</li>
<li>Figure 5 is a schematic view of a mobile telephone incorporating a TTS system according to the present invention;</li>
<li>Figure 6 is a schematic view of a mobile personal computer incorporating a TTS system according to the present invention; and</li>
<li>Figure 7 is a schematic view of a digital camera incorporating a TTS system according to the present invention.</li>
</ul></p>
<p id="p0015" num="0015">As discussed above, a typical TTS system as shown in Figure 1 converts text to speech and in step 2 analyses the normalised text morphologically and syntactically to determine the words or morphological tokens and the syntax of those words into notation which represents the pronunciation. The TTS system determines whether the word or token is present in the dictionary and if so obtains the pronunciation but if not then applies pronunciation rules to determine the pronunciation. The present invention provides an improvement as shown in Figure 2.</p>
<p id="p0016" num="0016">In the first instance, in the embodiment, the memory 2 containing the dictionary is full and an exceptions log 4 is empty. The memory 2 contains for each entry at least an orthographic representation of the word, the syntactic category and the pronunciation in accordance with the prior art. In addition, the memory 2 contains two further fields for each entry, namely a long term occurrence count and a current occurrence count The long term occurrence count stores the average number of tokens analysed between each occurrence of a token. The current occurrence count stores the current number of tokens analysed since the last occurrence of a token.</p>
<p id="p0017" num="0017">The exceptions log 4 similarly contains for each entry at least the orthographic representation of the word or token, the syntactic category and the pronunciation together with the long term occurrence count and the current occurrence count.</p>
<p id="p0018" num="0018">A pronunciation module 6 receives the word or morphological token. The module 6 takes each token and searches the dictionary 2 to identify if the token is present in the dictionary 2. If the token is present, then the module obtains the pronunciation. A dictionary search and statistics module 8 increments the current occurrence count for that<!-- EPO <DP n="6"> --> entry if the token is present in the dictionary. Either module 6 or 8 may search the dictionary 2.</p>
<p id="p0019" num="0019">If the token is not present in the dictionary 2, then the pronunciation module 6 searches the exceptions log 4 to identify if the token is present in the exceptions log.</p>
<p id="p0020" num="0020">If the token is present, then the module 6 obtains the pronunciation from the exceptions log 4. In addition, if the token is present in the exceptions log, then the current occurrence count for that entry is incremented.</p>
<p id="p0021" num="0021">If the token is not present, then the module applies the pronunciation rules to obtain the pronunciation for the token. The pronunciation rules are stored in a memory (not shown) integral with or coupled to the pronunciation module 6. In addition, if the token is not present, then, if there is space in the exceptions log, then the token is added to the exceptions log. If there is no space in the exceptions log, then the module 6 identifies the token already stored in the exceptions log with the lowest weighted occurrence statistics as follows: <maths id="math0001" num=""><math display="block"><mrow><msub><mrow><mover><mrow><mi mathvariant="italic">k</mi></mrow><mrow><mo mathvariant="italic">˙</mo></mrow></mover></mrow><mrow><mn mathvariant="italic">1</mn></mrow></msub><mo>*</mo><mi mathvariant="normal">current occurence count</mi><mo>+</mo><msub><mrow><mi>k</mi></mrow><mrow><mn mathvariant="italic">2</mn></mrow></msub><mo>*</mo><mi mathvariant="normal">long term occurence count</mi></mrow></math><img id="ib0001" file="imgb0001.tif" wi="113" he="10" img-content="math" img-format="tif"/></maths></p>
<p id="p0022" num="0022">If the lowest weighted occurrence statistic is below a threshold <i>k</i><sub><i>3</i></sub>, then the new token replaces that stored in the exceptions log. If the lowest weighted occurrence statistic is above a threshold <i>k</i><sub><i>3</i></sub>, then no action is taken with regard to the new token.</p>
<p id="p0023" num="0023">Thus, the exceptions log 4 will be limited to only those entries which occur most frequently which have not been found in the dictionary 2.</p>
<p id="p0024" num="0024">In the present invention, the dictionary 2 is updated periodically with entries from the exception log 2. The updating may be effected at regular intervals in time, or be effected following certain events or a combination of both or even effected through exponentially changing time periods in order to update the dictionary very frequently when first initiated and less frequently thereafter having achieved an optimal content of the dictionary for the application in which the TTS is embodied.<!-- EPO <DP n="7"> --></p>
<p id="p0025" num="0025">Certain events could include that the pronunciation module 6 has processed a total number of tokens which exceeds a predetermined threshold or that the exceptions log is full or the event could be effected by a user operating the system or that the application in which the TTS system is embodied has access to a remote pronunciation dictionary to be discussed in more detail below.</p>
<p id="p0026" num="0026">Figure 3 illustrates the interaction between the pronunciation determining stage and dictionary update stage. The dictionary update stage is only effected when the TTS system is not processing text and outputting speech and visa versa. When the TTS system is to update the dictionary, following one of the events or at a specific time as discussed above, a switch 12 disables the pronunciation stage and enables the dictionary update stage.</p>
<p id="p0027" num="0027">In the first instance, as shown in Figure 4, a dictionary update control module 14 calculates the mean occurrence statistics and updates the long term occurrence count in both the pronunciation dictionary and the exceptions log. The mean occurrence statistics are updated as a weighted function as follows: <maths id="math0002" num=""><math display="block"><mrow><mi mathvariant="normal">long term occurence count</mi><mo>=</mo><msub><mrow><mi mathvariant="italic">k</mi></mrow><mrow><mn mathvariant="italic">4</mn></mrow></msub><mo>*</mo><mi mathvariant="normal">long term occurence count</mi><mo>+</mo><msub><mrow><mi mathvariant="italic">k</mi></mrow><mrow><mn mathvariant="italic">5</mn></mrow></msub><mo>*</mo><mi mathvariant="normal">current occurence count</mi></mrow></math><img id="ib0002" file="imgb0002.tif" wi="165" he="17" img-content="math" img-format="tif"/></maths></p>
<p id="p0028" num="0028">The constants <i>k</i><sub><i>4</i></sub> and <i>k</i><sub><i>5</i></sub> may be difference for the exceptions log and for the pronunciation dictionary and they may be fixed or variable. For example, the constants may change depending on the number of tokens processed since the dictionary was last updated.</p>
<p id="p0029" num="0029">The current occurrence count for each entry is reset to zero.</p>
<p id="p0030" num="0030">Having updated the long term occurrence count and reset the current occurrence count, the dictionary update control module 8 collates a deletion list from the pronunciation dictionary 2 comprising all entries whose long term occurrence count is below a deletion candidate threshold and an addition candidate list from the exceptions log for all entries where the long term occurrence count is above the addition candidate threshold. The addition count threshold may or may not be set to zero.<!-- EPO <DP n="8"> --></p>
<p id="p0031" num="0031">Thus, tokens in the deletion list are those which are rarely encountered by the TTS system. Hence, their deletion from the pronunciation dictionary would imply that there would be no significant speech quality degradation.</p>
<p id="p0032" num="0032">The addition candidate list is sorted in descending order according to the long term occurrence count. That is to say the most likely tokens to be added first. The deletion candidate list is sorted in ascending order according to the long term occurrence count. That is to say the most likely deletion candidates first.</p>
<p id="p0033" num="0033">The dictionary update control module 14 analyses the addition candidate list and the deletion candidate list entry by entry with the most likely addition candidate replacing the next deletion candidate, if the addition candidate long term occurrence count is greater than the deletion candidate long term occurrence count.</p>
<p id="p0034" num="0034">The number of entries in the addition list may or may not equal the number of entries in the deletion list. The various thresholds and constants may be adapted depending upon previous dictionary update stages.</p>
<p id="p0035" num="0035">Whenever a token from the addition list replaces a deletion candidate, then the addition candidate in the exceptions log is deleted and the long term occurrence count is reset to zero.</p>
<p id="p0036" num="0036">In addition, when a token from the addition list is to replace a deletion candidate, verification of the pronunciation of the token is required. Verification may be achieved by one of two ways or both and is effected by a pronunciation acquisition module 16.</p>
<p id="p0037" num="0037">The first way is to request the user to define the correct pronunciation for the token. In this case, the pronunciation acquisition module 16 requests a user dialogue module 18 to commence a verification dialogue with the user. The dialogue comprises interaction with the user via a user interface 20. The user interface comprises a screen, keyboard and speech dialogue. In this embodiment, the user dialogue module 18 announces that a verification dialogue with the user is to commence and requests confirmation from the user. If no confirmation is received, then the verification dialogue is deferred for a period of time. Having received confirmation, the user dialogue module 18 displays tokens on the screen<!-- EPO <DP n="9"> --> together with the syntax category and pronunciation, usually in the form of phonetic notation. The user is requested to input via the keyboard confirmation that the token definition is correct. If the user is not conversant with phonetic notation, then the user may confirm or correct the token more phonetically. There is also the option for the user to indicate that the TTS system should output the token with the defined pronunciation so that the user can check aurally that the correct pronunciation is defined. Finally, if the token is an abbreviation or acronym, then the user may input the token in it's full form.</p>
<p id="p0038" num="0038">The second way for verification comprises the interaction with a remote pronunciation dictionary 22. In this case, the remote pronunciation dictionary 22 provides the verification of the pronunciation of the token. If the token is absent from the remote pronunciation dictionary, then the correct pronunciation must be verified from the user in accordance with the first way.</p>
<p id="p0039" num="0039">The remote pronunciation dictionary 22 is typically quite large and would therefore be unlikely to be stored within the application embodying the TTS system. Availability to the remote pronunciation dictionary would be dependent upon the availability of access which is usually subject to physical or cost considerations.</p>
<p id="p0040" num="0040">The pronunciation acquisition module 16 instructs a remote dictionary access module 24 to ascertain whether the remote pronunciation dictionary 22 can be accessed. This includes the remote dictionary access module 24 activating a communications link control 26. The communications link control 26 initiates a communications link to another communications link control 28. The communications link may comprise any of those known in the art, such as an infrared link; wireless link such as the GSM air interface, GDM data channels or GSM Short message service channels; or a cable link connecting the application in which the TTS is embodied to a computer or host PC. The communications links 26, 28 access the remote pronunciation dictionary 22 via a remote dictionary server 30.</p>
<p id="p0041" num="0041">The addition candidate list and the verified pronunciation is passed to a dictionary update module 32. The dictionary update module 32 updates the pronunciation dictionary 2 by overwriting those entries in the pronunciation dictionary marked for being listed in the deletion list and present in the addition candidate list with those to be added in the addition candidate list.<!-- EPO <DP n="10"> --></p>
<p id="p0042" num="0042">Thus, the present invention enables the size of the pronunciation dictionary to be minimised; hence reducing the memory size of the TTS system. To ensure that this does not result in an unacceptable loss of speech quality, the method provides steps for adapting the contents of the pronunciation dictionary to contain only those entries that are most frequently encountered by the TTS system.</p>
<p id="p0043" num="0043">The present invention is advantageous for use in small, mobile electronic products such as mobile phones, computers, CD players, DVD players and the like - although it is not limited thereto.</p>
<p id="p0044" num="0044">Several electronic apparatuses using the TTS system will now be described.</p>
<heading id="h0001">&lt;1: Portable Phone&gt;</heading>
<p id="p0045" num="0045">An example in which the TTS system is applied to a portable or mobile phone will be described. Fig. 5 is an isometric view illustrating the configuration of the portable phone. In the drawing, the portable phone 1200 is provided with a plurality of operation keys 1202, an ear piece 1204, a mouthpiece 1206, and a display panel 100. The mouthpiece 1206 or ear piece 1204 may be used for outputting speech.</p>
<heading id="h0002">&lt;2: Mobile Computer&gt;</heading>
<p id="p0046" num="0046">An example in which the TTS system according to one of the above embodiments is applied to a mobile personal computer will now be described.</p>
<p id="p0047" num="0047">Figure 6 is an isometric view illustrating the configuration of this personal computer. In the drawing, the personal computer 1100 is provided with a body 1104 including a keyboard 1102 and a display unit 1106. The TTS system may use the display unit 1106 or keyboard 1102 to provide the user interface according to the present invention, as described above.</p>
<heading id="h0003">&lt;3 : Digital Still Camera&gt;</heading>
<p id="p0048" num="0048">Next, a digital still camera using a TTS system will be described. Fig. 7 is an isometric view illustrating the configuration of the digital still camera and the connection to external devices in brief.<!-- EPO <DP n="11"> --></p>
<p id="p0049" num="0049">Typical cameras sensitise films based on optical images from objects, whereas the digital still camera 1300 generates imaging signals from the optical image of an object by photoelectric conversion using, for example, a charge coupled device (CCD). The digital still camera 1300 is provided with an OEL element 100 at the back face of a case 1302 to perform display based on the imaging signals from the CCD. Thus, the display panel 100 functions as a finder for displaying the object. A photo acceptance unit 1304 including optical lenses and the CCD is provided at the front side (behind in the drawing) of the case 1302. The TTS system may be embodied in the digital still camera.</p>
<p id="p0050" num="0050">Further examples of electronic apparatuses, other than the personal computer shown in Fig. 6, the portable phone shown in Fig. 5, and the digital still camera shown in Fig. 7, include television sets, view-finder-type and monitoring-type video tape recorders, car navigation systems, pagers, electronic notebooks, portable calculators, word processors, workstations, TV telephones, point-of-sales system (POS) terminals, and devices provided with touch panels. Of course, the TTS system of the present invention can be applied to any of these electronic apparatuses.</p>
<p id="p0051" num="0051">The aforegoing description has been given by way of example only and it will be appreciated by a person skilled in the art that modifications can be made without departing from the scope of the present invention.</p>
<p id="p0052" num="0052">For example, the exceptions log may not always store the pronunciation of a word which has not been found in the dictionary, merely the word itself together with the average and current occurrence counts. The dictionary may not always initially be full but rather the dictionary is empty and all words referred in the first instance are stored in the dictionary. The pronunciation module and the dictionary update control module may not set the thresholds and constants but rather these are predetermined. In some circumstances, the number of entries in the deletion list may not equal the number in the addition list.</p>
<p id="p0053" num="0053">Moreover, the dictionary could be continuously updated rather than waiting until the pronunciation stage has paused and then switching between the pronunciation stage and the dictionary updating stage.<!-- EPO <DP n="12"> --></p>
<p id="p0054" num="0054">The TTS system according to the present invention may be disposed on a single semiconductor chip with or without memory for the dictionary and the exceptions log and with or without means for outputting speech encoder-decoder (CODEC).</p>
</description><!-- EPO <DP n="13"> -->
<claims id="claims01" lang="en">
<claim id="c-en-01-0001" num="0001">
<claim-text>A method for operating a text to speech (TTS) system including a pronunciation dictionary and a memory containing pronunciation rules, said method comprising:
<claim-text>comparing each word in said text to words in said pronunciation dictionary to identify if the identical word is present;</claim-text>
<claim-text>if the word is identified, then obtaining the pronunciation of said word;</claim-text>
<claim-text>if the word is not identified, then applying said pronunciation rules to said word to obtain the pronunciation of said word; wherein</claim-text>
<claim-text>storing each word which is not identified in an exceptions log; and</claim-text>
<claim-text>adding the most frequently referred words from the exceptions log to the pronunciation dictionary.</claim-text></claim-text></claim>
<claim id="c-en-01-0002" num="0002">
<claim-text>A method for operating a TTS system as claimed in claim 1, in which if the word is not identified in the pronunciation dictionary, then comparing the word with those in the exceptions log to identify whether the word is already stored in the exceptions log.</claim-text></claim>
<claim id="c-en-01-0003" num="0003">
<claim-text>A method for operating a TTS system as claimed in either claim 1 or 2, in which said pronunciation is also stored in the exceptions log.</claim-text></claim>
<claim id="c-en-01-0004" num="0004">
<claim-text>A method for operating a TTS system as claimed in any one of claims 1 to 3, in which said syntax is also stored in the exceptions log.</claim-text></claim>
<claim id="c-en-01-0005" num="0005">
<claim-text>A method for operating a TTS system as claimed in any one of the previous claims, further comprising logging in the exceptions log the number of times a word has not been identified in the dictionary.</claim-text></claim>
<claim id="c-en-01-0006" num="0006">
<claim-text>A method for operating a TTS system as claimed in any one of the previous claims, further comprising logging in the dictionary the number of times a word has been identified.</claim-text></claim>
<claim id="c-en-01-0007" num="0007">
<claim-text>A method for operating a TTS system as claimed in claim 5 and 6, further comprising analysing the most frequently logged words in the exceptions log with the least frequently logged words in the dictionary and deleting the least logged words in the dictionary whilst adding the most frequently logged words from the exceptions log.<!-- EPO <DP n="14"> --></claim-text></claim>
<claim id="c-en-01-0008" num="0008">
<claim-text>A method for operating a TTS system as claimed in any one of the previous claims, further comprising adding the most frequently used words periodically.</claim-text></claim>
<claim id="c-en-01-0009" num="0009">
<claim-text>A method for operating a TTS system as claimed in any one of claims 1 to 7, further comprising adding the most frequently used words when any one of the following events occur, when the number of words compared exceeds a predetermined number or that the exceptions log is full or following receipt of a command from a user of the TTS system or exponentially throughout time following initiation of the TTS system.</claim-text></claim>
<claim id="c-en-01-0010" num="0010">
<claim-text>A method for operating a TTS system as claimed in any one of the previous claims, further comprising verifying the pronunciation of the words stored in the exceptions log prior to adding the words to the dictionary.</claim-text></claim>
<claim id="c-en-01-0011" num="0011">
<claim-text>A method for operating a TTS system as claimed in claim 10, wherein verifying the pronunciation comprising verifying the pronunciation with a user of the TTS system.</claim-text></claim>
<claim id="c-en-01-0012" num="0012">
<claim-text>A method for operating a TTS system as claimed in claim 10, wherein verifying the pronunciation comprising verifying the pronunciation with a remote pronunciation dictionary.</claim-text></claim>
<claim id="c-en-01-0013" num="0013">
<claim-text>A mobile telephone including a text to speech system adapted to be operated in accordance with the method as claimed in any one of the preceding claims.</claim-text></claim>
</claims><!-- EPO <DP n="15"> -->
<claims id="claims02" lang="de">
<claim id="c-de-01-0001" num="0001">
<claim-text>Verfahren zum Betreiben eines Systems zur sprachlichen Wiedergabe von Text (TTS), das ein Aussprachelexikon und einen Speicher aufweist, welcher Ausspracheregeln enthält, wobei das Verfahren umfasst:
<claim-text>Vergleichen jedes Wortes in dem Text mit Wörtern im Aussprachelexikon, um zu ermitteln, ob das gleiche Wort vorhanden ist;</claim-text>
<claim-text>wenn das Wort ermittelt wird, dann Erhalten der Aussprache des Wortes;</claim-text>
<claim-text>wenn das Wort nicht ermittelt wird, dann Anwenden der Ausspracheregeln auf das Wort, um die Aussprache des Wortes zu erhalten; wobei</claim-text>
<claim-text>jedes Wort, das nicht ermittelt wird, in einem Ausnahmenprotokoll gespeichert wird; und</claim-text>
<claim-text>die am häufigsten genannten Wörter aus dem Ausnahmenprotokoll dem Aussprachelexikon hinzugefügt werden.</claim-text></claim-text></claim>
<claim id="c-de-01-0002" num="0002">
<claim-text>Verfahren zum Betreiben eines TTS-Systems nach Anspruch 1, in welchem dann, wenn das Wort im Aussprachelexikon nicht ermittelt wird, das Wort mit jenen im Ausnahmenprotokoll vergleichen wird, um zu ermitteln, ob das Wort bereits im Ausnahmenprotokoll gespeichert ist.</claim-text></claim>
<claim id="c-de-01-0003" num="0003">
<claim-text>Verfahren zum Betreiben eines TTS-Systems nach Anspruch 1 oder 2, in welchem auch die Aussprache in Ausnahmenprotokoll gespeichert wird.<!-- EPO <DP n="16"> --></claim-text></claim>
<claim id="c-de-01-0004" num="0004">
<claim-text>Verfahren zum Betreiben eines TTS-Systems nach einem der Ansprüche 1 bis 3, in welchem auch die Syntax im Ausnahmenprotokoll gespeichert wird.</claim-text></claim>
<claim id="c-de-01-0005" num="0005">
<claim-text>Verfahren zum Betreiben eines TTS-Systems nach einem der vorhergehenden Ansprüche, welches ferner das Protokollieren der Anzahl von Malen, die ein Wort im Lexikon nicht ermittelt wurde, im Ausnahmenprotokoll umfasst.</claim-text></claim>
<claim id="c-de-01-0006" num="0006">
<claim-text>Verfahren zum Betreiben eines TTS-Systems nach einem der vorhergehenden Ansprüche, welches ferner das Protokollieren der Anzahl von Malen, die ein Wort ermittelt wurde, im Lexikon umfasst.</claim-text></claim>
<claim id="c-de-01-0007" num="0007">
<claim-text>Verfahren zum Betreiben eines TTS-Systems nach Anspruch 5 und 6, welcher ferner das Analysieren der am häufigsten protokollierten Wörter im Ausnahmenprotokoll mit den am wenigsten häufig protokollierten Wörtern im Lexikon und das Löschen der am wenigstens protokollierten Wörter im Lexikon bei Hinzufügen der am häufigsten protokollierten Wörter aus dem Ausnahmenprotokoll umfasst.</claim-text></claim>
<claim id="c-de-01-0008" num="0008">
<claim-text>Verfahren zum Betreiben eines TTS-Systems nach einem der vorhergehenden Ansprüche, welches ferner das regelmäßige Hinzufügen der am häufigsten verwendeten Wörter umfasst.</claim-text></claim>
<claim id="c-de-01-0009" num="0009">
<claim-text>Verfahren zum Betreiben eines TTS-Systems nach einem der Ansprüche 1 bis 7, welches ferner das Hinzufügen der am häufigsten verwendeten Wörter umfasst, wenn irgendeines der folgenden Ereignisse eintritt: wenn die Anzahl von Wörtern, die verglichen werden, eine vorbestimmte Anzahl ürberschreitet, oder dass das Ausnahmenprotokoll voll ist, oder nach dem Empfang eines Befehls von einem Benutzer des TTS-Systems, oder exponentiell<!-- EPO <DP n="17"> --> während der ganzen Zeit, welche auf die Inbetriebnahme des TTS-Systems folgt.</claim-text></claim>
<claim id="c-de-01-0010" num="0010">
<claim-text>Verfahren zum Betreiben eines TTS-Systems nach einem der vorhergehenden Ansprüche, welches ferner das Überprüfen der Aussprache der Wörter, die im Ausnahmenprotokoll gespeichert sind, vor dem Hinzufügen der Wörter zum Lexikon umfasst.</claim-text></claim>
<claim id="c-de-01-0011" num="0011">
<claim-text>Verfahren zum Betreiben eines TTS-Systems nach Anspruch 10, wobei das Überprüfen der Aussprache das Überprüfen der Aussprache mit einem Benutzer des TTS-Systems umfasst.</claim-text></claim>
<claim id="c-de-01-0012" num="0012">
<claim-text>Verfahren zum Betreiben eines TTS-Systems nach Anspruch 10, wobei das Überprüfen der Aussprache das Überprüfen der Aussprache mit einem Fernaussprachelexikon umfasst.</claim-text></claim>
<claim id="c-de-01-0013" num="0013">
<claim-text>Mobiltelefon, das ein System zur sprachlichen Wiedergabe von Text aufweist, das so ausgelegt ist, dass es gemäß dem Verfahren nach einem der vorhergehenden Ansprüche betrieben wird.</claim-text></claim>
</claims><!-- EPO <DP n="18"> -->
<claims id="claims03" lang="fr">
<claim id="c-fr-01-0001" num="0001">
<claim-text>Procédé d'utilisation d'un texte pour un système écrit-parlé (télécomposeuse ou TTS) comprenant un dictionnaire de prononciation et une mémoire contenant des règles de prononciation, ledit procédé comprenant :
<claim-text>la comparaison de chaque mot dudit texte aux mots dudit dictionnaire de prononciation pour identifier si le mot identique est présent ;</claim-text>
<claim-text>si le mot est identifié, l'obtention de la prononciation dudit mot ;</claim-text>
<claim-text>si le mot n'est pas identifié, l'application consécutive desdites règles de prononciation audit mot pour obtenir la prononciation dudit mot ;</claim-text>
<claim-text>l'enregistrement de chaque mot qui n'est pas identifié dans un journal d'exceptions ; et</claim-text>
<claim-text>l'adjonction des mots auxquels il est fait référence le plus fréquemment du journal d'exceptions dans le dictionnaire de prononciation.</claim-text></claim-text></claim>
<claim id="c-fr-01-0002" num="0002">
<claim-text>Procédé d'utilisation d'un système TTS selon la revendication 1, dans lequel, si le mot n'est pas identifié dans le dictionnaire de prononciation, il y a ensuite une comparaison du mot avec ceux du journal d'exceptions pour identifier si le mot est déjà enregistré dans le journal d'exceptions.</claim-text></claim>
<claim id="c-fr-01-0003" num="0003">
<claim-text>Procédé d'utilisation d'un système TTS selon la revendication 1 ou 2, dans lequel ladite prononciation est aussi enregistrée dans le journal d'exceptions.</claim-text></claim>
<claim id="c-fr-01-0004" num="0004">
<claim-text>Procédé d'utilisation d'un système TTS selon l'une quelconque des revendications 1 à 3, dans lequel ladite syntaxe est aussi enregistrée dans le journal d'exceptions.<!-- EPO <DP n="19"> --></claim-text></claim>
<claim id="c-fr-01-0005" num="0005">
<claim-text>Procédé d'utilisation d'un système TTS selon l'une quelconque des revendications précédentes, comprenant en outre l'enregistrement dans le journal d'exceptions du nombre de fois qu'un mot n'a pas été identifié dans le dictionnaire.</claim-text></claim>
<claim id="c-fr-01-0006" num="0006">
<claim-text>Procédé d'utilisation d'un système TTS selon l'une quelconque des revendications précédentes, comprenant en outre l'enregistrement dans le journal d'exceptions du nombre de fois qu'un mot a été identifié.</claim-text></claim>
<claim id="c-fr-01-0007" num="0007">
<claim-text>Procédé d'utilisation d'un système TTS selon les revendications 5 et 6, comprenant en outre l'analyse des mots les plus fréquemment enregistrés dans le journal d'exceptions avec les mots les plus fréquemment enregistrés dans le dictionnaire et l'effacement des mots les moins enregistrés dans le dictionnaire tout en ajoutant les mots les plus fréquemment enregistrés du journal d'exceptions.</claim-text></claim>
<claim id="c-fr-01-0008" num="0008">
<claim-text>Procédé d'utilisation d'un système TTS selon l'une quelconque des revendications précédentes, comprenant en outre l'adjonction des mots les plus fréquemment utilisés périodiquement.</claim-text></claim>
<claim id="c-fr-01-0009" num="0009">
<claim-text>Procédé d'utilisation d'un système TTS selon l'une quelconque des revendications 1 à 7, comprenant en outre l'adjonction des mots les plus fréquemment utilisés lorsque n'importe lequel des événements suivants se produit, lorsque le nombre de mots comparés excède un nombre prédéterminé ou lorsque le journal d'exceptions est complet ou suite à la réception d'une commande d'un utilisateur du système TTS ou exponentiellement pendant le temps suivant l'initiation du système TTS.<!-- EPO <DP n="20"> --></claim-text></claim>
<claim id="c-fr-01-0010" num="0010">
<claim-text>Procédé d'utilisation d'un système TTS selon l'une quelconque des revendications précédentes, comprenant en outre la vérification de la prononciation des mots enregistrés dans le journal d'exceptions avant d'ajouter les mots au dictionnaire.</claim-text></claim>
<claim id="c-fr-01-0011" num="0011">
<claim-text>Procédé d'utilisation d'un système TTS selon la revendication 10, dans lequel la vérification de la prononciation comprend la vérification de la prononciation avec un utilisateur du système TTS.</claim-text></claim>
<claim id="c-fr-01-0012" num="0012">
<claim-text>Procédé d'utilisation d'un système TTS selon la revendication 10, dans lequel la vérification de la prononciation comprend la vérification de la prononciation avec un dictionnaire de prononciation distant.</claim-text></claim>
<claim id="c-fr-01-0013" num="0013">
<claim-text>Téléphone mobile comportant un texte pour un système écrit-parlé apte à être utilisé conformément au procédé selon l'une quelconque des revendications précédentes.</claim-text></claim>
</claims><!-- EPO <DP n="21"> -->
<drawings id="draw" lang="en">
<figure id="f0001" num=""><img id="if0001" file="imgf0001.tif" wi="165" he="213" img-content="drawing" img-format="tif"/></figure><!-- EPO <DP n="22"> -->
<figure id="f0002" num=""><img id="if0002" file="imgf0002.tif" wi="121" he="93" img-content="drawing" img-format="tif"/></figure><!-- EPO <DP n="23"> -->
<figure id="f0003" num=""><img id="if0003" file="imgf0003.tif" wi="165" he="145" img-content="drawing" img-format="tif"/></figure><!-- EPO <DP n="24"> -->
<figure id="f0004" num=""><img id="if0004" file="imgf0004.tif" wi="165" he="226" img-content="drawing" img-format="tif"/></figure>
</drawings>
</ep-patent-document>
