(19)
(11) EP 1 132 891 A1

(12) EUROPEAN PATENT APPLICATION
published in accordance with Art. 158(3) EPC

(43) Date of publication:
12.09.2001 Bulletin 2001/37

(21) Application number: 00961220.1

(22) Date of filing: 25.09.2000
(51) International Patent Classification (IPC)7G10L 19/00
// G10L101:06
(86) International application number:
PCT/JP0006/542
(87) International publication number:
WO 0124/164 (05.04.2001 Gazette 2001/14)
(84) Designated Contracting States:
AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT
Designated Extension States:
AL LT LV MK RO SI

(30) Priority: 28.09.1999 JP 27511999

(71) Applicant: MATSUSHITA ELECTRIC INDUSTRIAL CO., LTD.
Kadoma-shi, Osaka 571-8501 (JP)

(72) Inventor:
  • YONEZAKI, Tadashi
    Yokohama-shi, Kanagawa 236-0033 (JP)

(74) Representative: Grünecker, Kinkeldey, Stockmair & Schwanhäusser Anwaltssozietät 
Maximilianstrasse 58
80538 München
80538 München (DE)

   


(54) VOICE ENCODER, VOICE DECODER, AND VOICE ENCODING AND DECODING METHOD


(57) In speech coder 100, speech analysis section 101 extracts a fundamental frequency and spectrum envelope information from an input speech signal. Fundamental frequency quantization section 102 quantizes the fundamental frequency. Matrix generation section 103 generates a spectrum envelope curved surface from the spectrum envelope information and spectrum envelope quantization section 104 quantizes the spectrum envelope curved surface. Multiplexing section 105 multiplexes a quantized value of the spectrum envelope curved surface and a quantized value of the fundamental frequency and sends out the multiplexed quantized value. In speech decoder 200, spectrum envelope configuration section 202 reconfigures the spectrum envelope curved surface quantized from the spectrum envelope information and speech synthesis section 203 extracts the spectrum envelope curved surface based on the fundamental frequency information and synthesizes a decoded speech. This makes it possible to implement speech decoding of high quality even when information is transmitted at a low bit rate.




Description

Technical Field



[0001] The present invention relates to a speech coder, speech decoder and speech coding/decoding method used for a communication apparatus in a radio communication system such as a car telephone or cellular telephone.

Background Art



[0002] In the field of a radio communication system whose demand is rapidly increasing in recent years, the development of an apparatus capable of coding/decoding speech of high quality at a low bit rate is being carried forward for the effective utilization of radio wave resources.

[0003] FIG.1 is a block diagram showing a configuration of a conventional speech coder and speech decoder.

[0004] In speech coder 1 in FIG.1, spectrum envelope analysis section 11 estimates spectrum envelope information of an input speech signal. Spectrum envelope quantization section 12 quantizes the spectrum envelope information estimated by spectrum envelope analysis section 11.

[0005] Inverse filter 13 filters the inverse characteristic of the frequency characteristic of the spectrum envelope information quantized by spectrum envelope quantization section 12 for the input speech signal and removes the spectrum envelope component. This makes it possible to obtain a signal with a flat frequency characteristic. This signal is considered to be simulating an excitation signal generated by the vocal cords in the process of vocalization. Hereinafter, this signal is referred to as an "excitation signal".

[0006] Excitation codebook 14 stores signals having a flat frequency characteristic. Excitation coding section 15 searches the signal closest to the excitation signal from excitation codebook 14 and outputs the code (hereinafter, referred to as "excitation code").

[0007] Multiplexing section 16 multiplexes the code showing the quantized value of the spectrum envelope information output from spectrum envelope quantization section 12 and the excitation code output from excitation coding section 15 into a code string and sends this code string to a communication path.

[0008] In speech decoder 2 in FIG.1, inverse multiplexing section 21 separates the received code string into the code showing the quantized value of the spectrum envelope information and the excitation code.

[0009] Excitation codebook 22 stores signals identical to those in excitation codebook 14. Excitation selection section 23 selects and extracts a signal corresponding to the received excitation code from excitation codebook 22.

[0010] Synthesis filter 24 performs filtering so that the signal extracted by excitation selection section 23 has the frequency characteristic of the received spectrum envelope information and outputs the decoded speech.

[0011] In this way, the conventional speech coder and speech decoder separate spectrum envelope information with different signal dynamic ranges and quantization characteristics from the excitation signal, constitute quantizers according to their respective characteristics, thereby implementing high quality speech coding/decoding.

[0012] However, since the conventional speech coder and speech decoder above perform filtering based on the result from quantizing spectrum envelope information, when sufficient accuracy cannot be obtained in the quantization of the spectrum envelope information using a low bit rate, the conventional speech coder and speech decoder above fail to flatten an excitation signal, producing a problem of reduced quantization efficiency and deteriorated quality of the decoded speech.

Disclosure of Invention



[0013] It is an object of the present invention to provide a speech coder, speech decoder and speech coding/decoding method capable of realizing speech decoding of high quality even when information is transmitted at a low bit rate.

[0014] This object is attained, focused on the fact that a speech signal is generated by extracting a spectrum envelope curved surface, which is continuous on the time scale, based on a fundamental frequency, by completely separating spectrum envelope information from excitation information, implementing speech coding/decoding processing independent of the quantization accuracy of the spectrum envelope information and implementing highly efficient speech coding/decoding processing through a highly efficient technique of quantizing the spectrum envelope information, which is effective in an analysis/synthesis model.

Brief Description of Drawings



[0015] 

FIG.1 is a block diagram showing a configuration of a conventional speech coder and speech decoder;

FIG.2 is a block diagram showing a configuration of a speech coder and speech decoder according to Embodiment 1 of the present invention;

FIG.3 is a block diagram showing an internal configuration of a spectrum envelope quantization section of a speech coder according to Embodiment 2 of the present invention;

FIG.4 is a block diagram showing an internal configuration of a spectrum envelope quantization section of a speech coder according to Embodiment 3 of the present invention;

FIG.5 is a model diagram of a spectrum envelope curved surface according to Embodiment 3 of the present invention;

FIG.6 is a block diagram showing an internal configuration of a spectrum envelope quantization section of a speech coder according to Embodiment 4 of the present invention;

FIG.7 is a block diagram showing an internal configuration of a spectrum envelope quantization section of a speech coder according to Embodiment 5 of the present invention;

FIG.8 is a block diagram showing an internal configuration of a model applier of a speech coder according to Embodiment 6 of the present invention;

FIG.9 is a block diagram showing an internal configuration of a parameter quantizer of a speech coder according to Embodiment 7 of the present invention;

FIG.10 is a block diagram showing an internal configuration of a parameter quantizer of a speech coder according to Embodiment 8 of the present invention;

FIG.11 is a block diagram showing an internal configuration of a parameter quantizer of a speech coder according to Embodiment 9 of the present invention; and

FIG.12 is a block diagram showing an internal configuration of a spectrum envelope configuration section of a speech decoder according to Embodiment 10 of the present invention.


Best Mode for Carrying out the Invention



[0016] With reference now to the attached drawings, embodiments of the present invention will be explained below.

(Embodiment 1)



[0017] FIG.2 is a block diagram showing a configuration of a speech coder and speech decoder according to Embodiment 1 of the present invention.

[0018] In speech coder 100 in FIG.2, speech analysis section 101 extracts a fundamental frequency and short-time spectrum envelope information from an input speech signal. Fundamental frequency quantization section 102 quantizes the fundamental frequency extracted by speech analysis section 101.

[0019] By the way, regarding a speech analysis that extracts a fundamental frequency and short-time spectrum envelope information from an input speech signal, a speech analysis conducted based on a STRAIGHT analysis/synthesis model is already disclosed in "Speech Conversion Using Interpolation in Time Frequency Area" (by Hideki Kawahara, Ikuyo Masuda, TECHNICAL REPORT OF IEICE EA96-28,pp.9-18, 1996), etc. In this model, the excitation information is only a fundamental frequency and fully independent of the spectrum envelope information, and therefore quantization errors of the excitation information and spectrum envelope information have no influence on the quantization of the information of each other.

[0020] Matrix generation section 103 generates a spectrum envelope curved surface on the time-frequency plane by arranging short-time spectrum envelope information extracted by speech analysis section 101 along the time-axis. Spectrum envelope quantization section 104 quantizes the spectrum envelope curved surface generated by matrix generation section 103.

[0021] By the way, the spectrum envelope information is quantized as a continuous function on the time-frequency plane because when only the extracted spectrum envelope is quantized, the spectrum envelope information is quantized depending on the excitation information, making it impossible to realize a separation in information quantization processing, which is the essence of the present invention.

[0022] Multiplexing section 105 multiplexes the code showing the quantized value of the spectrum envelope curved surface output from spectrum envelope quantization section 104 and the code showing the quantized value of the fundamental frequency output from fundamental frequency quantization section 102 and sends out the multiplexed code to a communication path.

[0023] In speech decoder 200 in FIG.2, inverse multiplexing section 201 separates the received code string into the code showing the quantized value of the spectrum envelope information and the code showing the quantized value of the fundamental frequency.

[0024] Spectrum envelope configuration section 202 reconfigures the spectrum envelope curved surface quantized from the received spectrum envelope information. Speech synthesis section 203 extracts the spectrum envelope curved surface reconfigured by spectrum envelope configuration section 202 based on the fundamental frequency information and synthesizes this into a decoded speech and outputs.

[0025] Next, a flow of the information processing operation of the speech coder and speech decoder according to this Embodiment shown in FIG.2 will be explained.

[0026] First, the fundamental frequency and short-time spectrum envelope information are extracted from the input speech signal input by speech analysis section 101 of speech coder 100. The extracted fundamental frequency is quantized by fundamental frequency quantization section 102.

[0027] On the other hand, the extracted short-time spectrum envelope information is arranged along the time-axis by matrix generation section 103 and a spectrum envelope curved surface is generated on the time-frequency plane.The spectrum envelope curved surface is quantized by spectrum envelope quantization section 104.

[0028] The quantized fundamental frequency and spectrum envelope curved surface are multiplexed by multiplexing section 105 and sent out to the communication path. Then, the fundamental frequency and spectrum envelope curved surface are received by inverse multiplexing section 201 of speech decoder 200 and separated into the quantized value of the spectrum envelope information and the quantized value of the fundamental frequency.

[0029] The quantized value of the spectrum envelope information is input to spectrum envelope configuration section 202 and the spectrum envelope curved surface is reconfigured by spectrum envelope configuration section 202.

[0030] Then, speech synthesis section 203 extracts the reconfigured spectrum envelope curved surface based on the fundamental frequency information, synthesizes this into a decoded speech and outputs.

[0031] Thus, by quantizing excitation information and spectrum envelope information independently, even if the quantization accuracy of either information deteriorates it is possible to prevent the quantization efficiency of the other information from decreasing and thereby implement speech decoding of high quality even when information is transmitted at a low bit rate.

(Embodiment 2)



[0032] FIG.3 is the block diagram showing an internal configuration of a spectrum envelope quantization section of a speech coder according to Embodiment 2 of the present invention.

[0033] The configuration of the speech coder according to this Embodiment is the same as the configuration of the speech coder shown in FIG.2 of Embodiment 1, and therefore explanations thereof are omitted.

[0034] In spectrum envelope quantization section 104 in FIG.3, two-dimensional orthogonal transformer 301 performs a two-dimensional orthogonal transformation on a spectrum envelope curved surface in the time-axis direction and frequency-axis direction. Parameter quantizer 302 quantizes the transformation coefficient obtained through the two-dimensional orthogonal transformation processing by two-dimensional orthogonal transformer 301.

[0035] Here, it is generally perceptually difficult to recognize a difference of the high frequency component of the spectrum envelope curved surface. Therefore, the speech quality does not deteriorate a great deal even if a speech is synthesized on the decoding side using only the coefficient information of the low frequency component obtained through an orthogonal transformation. Thus, parameter quantizer 302 quantizes only the coefficient information of the low frequency component.

[0036] In this way, it is possible to delete information, which is perceptually not important by using an orthogonal transformation, and implement speech decoding of high quality even when information is transmitted at a low bit rate.

(Embodiment 3)



[0037] FIG.4 is a block diagram showing an internal configuration of a spectrum envelope quantization section of a speech coder according to Embodiment 3 of the present invention.

[0038] The configuration of the speech coder according to this Embodiment is the same as the configuration of the speech coder of Embodiment 1 shown in FIG.2, and therefore explanations thereof are omitted.

[0039] In spectrum envelope quantization section 104 in FIG. 4, model applier 311 models a spectrum envelope curved surface and extracts a model parameter.

[0040] This model results from modeling of a spectrum envelope curved surface in the time-frequency space and modeling is available by, for example, applying a crossfield model to both sections on the time-axis of the spectrum envelope curved surface and complementing the crossfield model as shown in FIG.5.

[0041] Parameter quantizer 302 quantizes the model parameter extracted by model applier 311.

[0042] By modeling a spectrum envelope curved surface in this way, it is possible to improve the quantization efficiency of the spectrum envelope curved surface and implement speech decoding of high quality even when information is transmitted at a low bit rate.

(Embodiment 4)



[0043] FIG.6 is a block diagram showing an internal configuration of a spectrum envelope quantization section of a speech coder according to Embodiment 4 of the present invention.

[0044] The configuration of the speech coder according to this Embodiment is the same as the configuration of the speech coder of Embodiment 1 shown in FIG.2, and therefore explanations thereof are omitted.

[0045] In spectrum envelope quantization section 104 in FIG.6, time-axis orthogonal transformer 321 performs an orthogonal transformation in the time-axis direction on a spectrum envelope curved surface. Model applier 311 applies a model according to the degree on the time-axis to the orthogonal-transformed time-axis transformation coefficient and extracts a parameter. Parameter quantizer 302 quantizes the model parameter extracted by model applier 311.

[0046] By applying a model according to the degree of the time-axis transformation coefficient in this way, it is possible to improve the quantization efficiency of modeling and implement speech decoding of high quality even when information is transmitted at a low bit rate.

(Embodiment 5)



[0047] FIG.7 is a block diagram showing an internal configuration of a spectrum envelope quantization section of a speech coder according to Embodiment 5 of the present invention.

[0048] The configuration of the speech coder according to this Embodiment is the same as the configuration of the speech coder of Embodiment 1 shown in FIG.2, and therefore explanations thereof are omitted.

[0049] In spectrum envelope quantization section 104 in FIG.7, time-axis orthogonal transformer 331 performs an orthogonal transformation in the time-axis direction on a spectrum envelope curved surface and classifies it into ones whose orthogonal-transformed time-axis transformation coefficient will be modeled and others whose orthogonal-transformed time-axis transformation coefficient will not be modeled. One example of this classification method is a method whereby a crossfield model is applied to the 0th order coefficient of the time-axis because it is a spectrum envelope obtained by averaging the spectrum envelope curved surface and no model is applied to other coefficients.

[0050] Model applier 311 applies a model according to the order on the time-axis to some of the orthogonal-transformed time-axis transformation coefficients and extracts a parameter. Frequency-axis orthogonal transformer 332 performs an orthogonal transformation in the frequency-axis direction on the time-axis transformation coefficients to which no model is applied. Parameter quantizer 302 quantizes the model parameter extracted by model applier 311 and the transformation coefficients output from frequency-axis orthogonal transformer 332.

[0051] By applying a model only to the time-axis transformation coefficient whose quantization efficiency will improve with modeling in this way, it is possible to improve the quantization efficiency of modeling, reduce modeling distortion and implement speech decoding of high quality even when information is transmitted at a low bit rate.

(Embodiment 6)



[0052] FIG.8 is a block diagram showing an internal configuration of a model applier of a speech coder according to Embodiment 6 of the present invention.

[0053] Model applier 311 according to this Embodiment is shown in one of Embodiments 3 to 5 above.

[0054] Model parameter estimator 401 applies a model to an input signal and extracts a parameter.

[0055] For example, in case of speech coding, an input signal is modeled with a crossfield model taking into account the speech generation process. However, when the order of the model is low, the model cannot show zero points contained in the signal producing analysis distortion by the model.

[0056] Therefore, model error estimator 402 estimates the analysis distortion generated when the model is applied and outputs the estimated analysis distortion to the parameter quantizer.

[0057] By quantizing modeling distortion in this way, it is possible to improve the quantization efficiency of modeling, reduce modeling distortion and implement speech decoding of high quality even when information is transmitted at a low bit rate.

(Embodiment 7)



[0058] FIG.9 is a block diagram showing an internal configuration of a parameter quantizer of a speech coder according to Embodiment 7 of the present invention.

[0059] Parameter quantizer 302 according to this Embodiment is shown in one of Embodiments 2 to 5 above.

[0060] Weight calculator 501 determines quantization sensitivity for each quantization target value using fundamental frequency information. An example of the method of determining quantization sensitivity by weight calculator 501 is shown below.

[0061] Speech decoding processing generates a decoded speech by extracting a spectrum envelope curved surface according to the fundamental frequency and connecting it on the time-axis. At this time, the harmonic wave amplitude value of the fundamental frequency becomes information more important than that of the other spectrum amplitude values. Therefore, a weighting factor curved surface obtained by assigning weights to the harmonic wave amplitude value at the position of the extracted spectrum envelope is generated.

[0062] Next, quantization sensitivity of each quantization target value is determined by transforming using a method similar to that of the transformation used to obtain the quantization target value and calculating a weighting factor in the quantization target parameter space.

[0063] Weight calculator 502 determines quantization sensitivity for each quantization target value using the spectrum envelope information. An example of the method of determining quantization sensitivity by weight calculator 502 is shown below.

[0064] When noise of a same magnitude is added to a certain signal, noise of a signal with a small dynamic range is perceptually more conspicuous than noise of a signal with a large dynamic range. Therefore, a weighting factor curved surface is generated with a greater weight assigned for a signal of a smaller amplitude on the spectrum envelope curved surface.

[0065] Then, quantization sensitivity for each quantization target value is determined by carrying out a transformation using a method similar to that for the transformation to obtain the quantization target value and calculating a weighting factor in the quantization target parameter space. Since the method of applying the quantizer by weight calculator 502 is also necessary in decoding processing, it is desirable to use a spectrum envelope information quantized with a preceding frame in order to establish synchronization with the decoding processing.

[0066] Statistical quantity accumulator 503 accumulates a statistical quantity for every quantization target value acquired beforehand. Quantization generator 504 designs a quantizer from the quantization sensitivity corresponding to the quantization target values output from weight calculator 501 and weight calculator 502 and a statistical quantity accumulated in statistical quantity accumulator 503.

[0067] For example, when a scalar quantizer is used, the dispersion of quantization target values is accumulated as a statistical quantity and a quantization step width is determined based on this dispersion and quantization sensitivity. When the dispersion is the same, quantization sensitivity is large. That is, a smaller quantization step width is set for a quantization target value susceptible to quantization errors.

[0068] Quantizer 505 quantizes quantization target values based on the design result of quantizer generator 504.

[0069] Thus, by making a quantizer adaptable to the fundamental frequency and spectrum envelope information, it is possible to reduce objective quantization distortion of the synthesized speech signal and perceptual distortion.

[0070] By the way, this Embodiment determines quantization sensitivity for each quantization target value using two information pieces, the fundamental frequency information and spectrum envelope information, but it is also possible to determine quantization sensitivity using the information of either of the two to design the quantizer.

(Embodiment 8)



[0071] FIG.10 is a block diagram showing an internal configuration of a parameter quantizer of a speech coder according to Embodiment 8 of the present invention.

[0072] Parameter quantizer 302 according to this Embodiment is shown in one of Embodiments 2 to 5 above.

[0073] Error scale determinator 511 adaptively determines a quantization error scale on a spectrum envelope using the fundamental frequency information. Error scale determinator 512 adaptively determines a quantization error scale on a spectrum envelope using the spectrum envelope information. Error scale synthesizer 513 synthesizes the error scales obtained from error scale determinator 511 and error scale determinator 512 into one error scale.

[0074] Codebook 514 stores quantized values. Spectrum envelope configurator 515 converts a quantized value stored in codebook 514 into a spectrum envelope curved surface. Spectrum envelope configurator 516 converts a quantization target value into a spectrum envelope curved surface.

[0075] Error calculator 517 calculates an error between the spectrum envelope curved surface composed by spectrum envelope configurator 515 and the spectrum envelope curved surface composed by spectrum envelope configurator 516 based on the error scale output from error scale synthesizer 513.

[0076] Code selector 518 selects and outputs the code corresponding to the quantized value, which corresponds to the smallest error from codebook 514.

[0077] By calculating the error of the spectrum envelope curved surface on the time-frequency plane using the error scale at the time of the quantization adapted to the fundamental frequency and the spectrum envelope information in this way, it is possible to reduce objective quantization distortion and perceptual distortion of the synthesized speech signal.

[0078] This Embodiment determines a quantization error scale on the spectrum envelope with respect to both the fundamental frequency and the spectrum envelope information, but it is also possible to determine a quantization error scale with respect to either of the two to calculate an error.

(Embodiment 9)



[0079] FIG.11 is a block diagram showing an internal configuration of a parameter quantizer of a speech coder according to Embodiment 9 of the present invention.

[0080] Parameter quantizer 302 according to this Embodiment is shown in one of Embodiments 2 to 5 above.

[0081] Error function determinator 521 adaptively determines a quantization error weighting factor on a spectrum envelope using the fundamental frequency information. Error function determinator 522 adaptively determines a quantization error weighting factor on a spectrum envelope using the spectrum envelope information. Error function synthesizer 523 synthesizes the quantization error weighting functions obtained from error function determinator 521 and error function determinator 522 into one error function. Error function converter 524 defines an error scale on a quantization parameter that converts the quantization error weighting function output from error function synthesizer 523.

[0082] Codebook 525 stores quantized values. Error calculator 526 calculates an error between a quantization target value and a quantized value stored in codebook 525 based on the error scale output from error function converter 524.

[0083] Code selector 527 selects and outputs the code corresponding to the quantized value, which corresponds to the smallest error from codebook 525.

[0084] By calculating the error between quantization parameters using the error scale at the time of the quantization adapted to the fundamental frequency and the spectrum envelope information in this way, it is possible to reduce objective quantization distortion of a synthesized speech signal and perceptual distortion with a small amount of processing.

[0085] This Embodiment determines a quantization error weighting function on a spectrum envelope with respect to both the fundamental frequency and the spectrum envelope information, but it is also possible to determine a quantization error weighting function with respect to either of the two to calculate an error.

(Embodiment 10)



[0086] FIG.12 is a block diagram showing an internal configuration of a spectrum envelope configuration section of a speech decoder according to Embodiment 10 of the present invention.

[0087] By the way, the configuration of the speech decoder according to this embodiment is the same as the configuration of the speech decoder of Embodiment 1 shown in FIG.2, and therefore explanations thereof are omitted.

[0088] As explained in Embodiment 2 above, the speech coding/decoding method using an orthogonal transformation is intended to compress information by not transmitting the high frequency component, which is not perceptually important on the coding side. Therefore, this Embodiment generates an envelope curved surface by complementing the parameter, which has not been received, using the parameter value statistically calculated beforehand on the decoding side.

[0089] In spectrum envelope configuration section 202 in FIG.12, parameter accumulator 601 stores parameter values statistically calculated beforehand, corresponding to parameters other than the quantization target. Spectrum envelope generator 602 generates a spectrum envelope curved surface based on the input spectrum envelope information.

[0090] By using a statistically calculated value as the parameter other than the quantization target in this way, it is possible to restore a more accurate spectrum envelope curved surface than in the case where an arbitrary value is used.

[0091] As described above, the speech coder, speech decoder and speech coding/decoding method of the present invention can completely separate spectrum envelope information from excitation information, achieve speech coding/decoding processing independent of the quantization accuracy of spectrum envelope information and achieve highly efficient speech coding/decoding processing through a highly efficient technique of quantizing spectrum envelope information, which is effective in an analysis/synthesis model, and thereby implement speech decoding of high quality even when information is transmitted at a low bit rate.

[0092] This application is based on the Japanese Patent Application No.HEI 11-275119 filed on September 28, 1999, entire content of which is expressly incorporated by reference herein.

Industrial Applicability



[0093] The present invention is ideally applicable to a base station apparatus or a communication terminal apparatus in a radio communication system that carries out radio communication of speech data.


Claims

1. A speech coder comprising:

speech analyzing means for extracting a fundamental frequency and spectrum envelope information from an input speech signal;

fundamental frequency quantizing means for quantizing the extracted fundamental frequency;

matrix generating means for generating a spectrum envelope curved surface from the extracted spectrum envelope information;

spectrum envelope quantizing means for quantizing the generated spectrum envelope curved surface; and

multiplexing means for multiplexing the quantized value of the spectrum envelope curved surface and the quantized value of the fundamental frequency and sending the multiplexed value.


 
2. The speech coder according to claim 1, wherein the spectrum envelope quantizing means comprising:

two-dimensional orthogonal transforming means for performing a two-dimensional orthogonal transformation in the time-axis direction and frequency-axis direction on the spectrum envelope curved surface; and

parameter quantization means for quantizing the transformation coefficient output from this two-dimensional orthogonal transforming means.


 
3. The speech coder according to claim 1, wherein the spectrum envelope quantizing means comprising:

model applying means for modeling the spectrum envelope curved surface and extracting a model parameter; and

parameter quantizing means for quantizing the extracted model parameter.


 
4. The speech coder according to claim 1, wherein the spectrum envelope quantizing means comprising:

time-axis orthogonal transforming means for performing an orthogonal transformation in the time-axis direction on the spectrum envelope curved surface;

model applying means for applying a model according to the order on the time-axis to the orthogonal-transformed time-axis transformation coefficient and extracting a parameter; and

parameter quantizing means for quantizing the extracted model parameter.


 
5. The speech coder according to claim 4, further comprising frequency-axis orthogonal transforming means for performing an orthogonal transformation in the frequency-axis direction on a time-axis transformation coefficient to which no model is applied, wherein the parameter quantizing means quantizes the extracted model parameter and the transformation coefficient output from said frequency-axis orthogonal transforming means.
 
6. The speech coder according to claim 3, wherein the model applying means comprising:

model parameter estimating means for applying a model to an input signal and extracting a parameter; and

model error estimating means for estimating analysis distortion generated when the model is applied by this model parameter estimating means.


 
7. The speech coder according to claim 2, wherein the parameter quantizing means comprising:

weight calculating means for determining quantization sensitivity for each quantization target value using at least one of the fundamental frequency information and spectrum envelope information;

statistical quantity storing means for storing a statistical quantity calculated beforehand for every quantization target;

quantization generating means for designing a quantizer from quantization sensitivity corresponding to the quantization target value output from said weight calculating means and the statistical quantity stored in the statistical quantity storing means; and

quantizing means for quantizing the quantization target value based on the design result of this quantization generating means.


 
8. The speech coder according to claim 2, wherein the parameter quantizing means comprising:

error scale determining means for adaptively determining a quantization error scale on the spectrum envelope using at least one of the fundamental frequency information and spectrum envelope information;

first spectrum envelope configuring means for converting a quantized value stored in a codebook to a spectrum envelope curved surface;

second spectrum envelope configuring means for converting the quantization target value to a spectrum envelope curved surface;

error calculating means for calculating an error between the spectrum envelope curved surface configured by said first spectrum envelope configuring means and the spectrum envelope curved surface configured by said second spectrum envelope configuring means based on the error scale; and

code selecting means for selecting a code corresponding to a quantized value with a minimum error from the codebook.


 
9. The speech coder according to claim 2, wherein the parameter quantizing means comprising:

error function determining means for adaptively determining a quantization error weighting function on the spectrum envelope using at least one of the fundamental frequency information and spectrum envelope information;

error function converting means for defining an error scale on the quantization parameter that converts the quantization error weighting function;

error calculating means for calculating an error between the quantization target value and quantized value stored in the codebook based on the error scale; and

code selecting means for selecting a code corresponding to a quantized value with a minimum error from the codebook.


 
10. A speech decoder comprising:

inverse multiplexing means for separating a code string sent from the speech coder according to claim 1 into a code showing a quantized value of spectrum envelope information and a code showing a quantized value of a fundamental frequency;

spectrum envelope configuring means for reconfiguring a quantized spectrum envelope curved surface from the received spectrum envelope information; and

speech synthesizing means for extracting the reconfigured spectrum envelope curved surface based on the fundamental frequency information and synthesizing a decoded speech.


 
11. The speech decoder according to claim 10, wherein the spectrum envelope configuring means comprising:

parameter storing means for storing parameter values statistically calculated according to parameters other than the equalization target; and

spectrum envelope generating means for generating a spectrum envelope curved surface based on the input spectrum envelope information.


 
12. A speech coding/decoding method with the coding side comprising the steps of:

extracting a fundamental frequency and spectrum envelope information from an input speech signal;

quantizing the extracted fundamental frequency;

generating and quantizing a spectrum envelope curved surface from the extracted spectrum envelope information; and

sending the quantized value of the spectrum envelope curved surface multiplexed with the quantized value of the fundamental frequency, and

with the decoding side comprising the steps of:

separating the received code string into a code showing a quantized value of spectrum envelope information and a code showing a quantized value of a fundamental frequency;

reconfiguring a quantized spectrum envelope curved surface from the received spectrum envelope information; and

extracting the reconfigured spectrum envelope curved surface based on the fundamental frequency information and synthesizing a decoded speech.


 
13. A mechanically readable recording medium that records a speech coding program to make a computer execute the steps of:

extracting a fundamental frequency and spectrum envelope information from an input speech signal;

quantizing the extracted fundamental frequency;

generating a spectrum envelope curved surface from the extracted spectrum envelope information;

quantizing the generated spectrum envelope curved surface; and

multiplexing the quantized value of the spectrum envelope curved surface and the quantized value of the fundamental frequency.


 
14. A mechanically readable recording medium that records a speech decoding program to make a computer execute the steps of:

separating a received code string into a code showing a quantized value of spectrum envelope information and a code showing a quantized value of a fundamental frequency;

reconfiguring a quantized spectrum envelope curved surface from the received spectrum envelope information; and

extracting the reconfigured spectrum envelope curved surface based on the fundamental frequency information and synthesizing a decoded speech.


 




Drawing








































Search report