Technical Field
[0001] The present invention relates to a speech coder, speech decoder and speech coding/decoding
method used for a communication apparatus in a radio communication system such as
a car telephone or cellular telephone.
Background Art
[0002] In the field of a radio communication system whose demand is rapidly increasing in
recent years, the development of an apparatus capable of coding/decoding speech of
high quality at a low bit rate is being carried forward for the effective utilization
of radio wave resources.
[0003] FIG.1 is a block diagram showing a configuration of a conventional speech coder and
speech decoder.
[0004] In speech coder 1 in FIG.1, spectrum envelope analysis section 11 estimates spectrum
envelope information of an input speech signal. Spectrum envelope quantization section
12 quantizes the spectrum envelope information estimated by spectrum envelope analysis
section 11.
[0005] Inverse filter 13 filters the inverse characteristic of the frequency characteristic
of the spectrum envelope information quantized by spectrum envelope quantization section
12 for the input speech signal and removes the spectrum envelope component. This makes
it possible to obtain a signal with a flat frequency characteristic. This signal is
considered to be simulating an excitation signal generated by the vocal cords in the
process of vocalization. Hereinafter, this signal is referred to as an "excitation
signal".
[0006] Excitation codebook 14 stores signals having a flat frequency characteristic. Excitation
coding section 15 searches the signal closest to the excitation signal from excitation
codebook 14 and outputs the code (hereinafter, referred to as "excitation code").
[0007] Multiplexing section 16 multiplexes the code showing the quantized value of the spectrum
envelope information output from spectrum envelope quantization section 12 and the
excitation code output from excitation coding section 15 into a code string and sends
this code string to a communication path.
[0008] In speech decoder 2 in FIG.1, inverse multiplexing section 21 separates the received
code string into the code showing the quantized value of the spectrum envelope information
and the excitation code.
[0009] Excitation codebook 22 stores signals identical to those in excitation codebook 14.
Excitation selection section 23 selects and extracts a signal corresponding to the
received excitation code from excitation codebook 22.
[0010] Synthesis filter 24 performs filtering so that the signal extracted by excitation
selection section 23 has the frequency characteristic of the received spectrum envelope
information and outputs the decoded speech.
[0011] In this way, the conventional speech coder and speech decoder separate spectrum envelope
information with different signal dynamic ranges and quantization characteristics
from the excitation signal, constitute quantizers according to their respective characteristics,
thereby implementing high quality speech coding/decoding.
[0012] However, since the conventional speech coder and speech decoder above perform filtering
based on the result from quantizing spectrum envelope information, when sufficient
accuracy cannot be obtained in the quantization of the spectrum envelope information
using a low bit rate, the conventional speech coder and speech decoder above fail
to flatten an excitation signal, producing a problem of reduced quantization efficiency
and deteriorated quality of the decoded speech.
Disclosure of Invention
[0013] It is an object of the present invention to provide a speech coder, speech decoder
and speech coding/decoding method capable of realizing speech decoding of high quality
even when information is transmitted at a low bit rate.
[0014] This object is attained, focused on the fact that a speech signal is generated by
extracting a spectrum envelope curved surface, which is continuous on the time scale,
based on a fundamental frequency, by completely separating spectrum envelope information
from excitation information, implementing speech coding/decoding processing independent
of the quantization accuracy of the spectrum envelope information and implementing
highly efficient speech coding/decoding processing through a highly efficient technique
of quantizing the spectrum envelope information, which is effective in an analysis/synthesis
model.
Brief Description of Drawings
[0015]
FIG.1 is a block diagram showing a configuration of a conventional speech coder and
speech decoder;
FIG.2 is a block diagram showing a configuration of a speech coder and speech decoder
according to Embodiment 1 of the present invention;
FIG.3 is a block diagram showing an internal configuration of a spectrum envelope
quantization section of a speech coder according to Embodiment 2 of the present invention;
FIG.4 is a block diagram showing an internal configuration of a spectrum envelope
quantization section of a speech coder according to Embodiment 3 of the present invention;
FIG.5 is a model diagram of a spectrum envelope curved surface according to Embodiment
3 of the present invention;
FIG.6 is a block diagram showing an internal configuration of a spectrum envelope
quantization section of a speech coder according to Embodiment 4 of the present invention;
FIG.7 is a block diagram showing an internal configuration of a spectrum envelope
quantization section of a speech coder according to Embodiment 5 of the present invention;
FIG.8 is a block diagram showing an internal configuration of a model applier of a
speech coder according to Embodiment 6 of the present invention;
FIG.9 is a block diagram showing an internal configuration of a parameter quantizer
of a speech coder according to Embodiment 7 of the present invention;
FIG.10 is a block diagram showing an internal configuration of a parameter quantizer
of a speech coder according to Embodiment 8 of the present invention;
FIG.11 is a block diagram showing an internal configuration of a parameter quantizer
of a speech coder according to Embodiment 9 of the present invention; and
FIG.12 is a block diagram showing an internal configuration of a spectrum envelope
configuration section of a speech decoder according to Embodiment 10 of the present
invention.
Best Mode for Carrying out the Invention
[0016] With reference now to the attached drawings, embodiments of the present invention
will be explained below.
(Embodiment 1)
[0017] FIG.2 is a block diagram showing a configuration of a speech coder and speech decoder
according to Embodiment 1 of the present invention.
[0018] In speech coder 100 in FIG.2, speech analysis section 101 extracts a fundamental
frequency and short-time spectrum envelope information from an input speech signal.
Fundamental frequency quantization section 102 quantizes the fundamental frequency
extracted by speech analysis section 101.
[0019] By the way, regarding a speech analysis that extracts a fundamental frequency and
short-time spectrum envelope information from an input speech signal, a speech analysis
conducted based on a STRAIGHT analysis/synthesis model is already disclosed in "Speech
Conversion Using Interpolation in Time Frequency Area" (by Hideki Kawahara, Ikuyo
Masuda, TECHNICAL REPORT OF IEICE EA96-28,pp.9-18, 1996), etc. In this model, the
excitation information is only a fundamental frequency and fully independent of the
spectrum envelope information, and therefore quantization errors of the excitation
information and spectrum envelope information have no influence on the quantization
of the information of each other.
[0020] Matrix generation section 103 generates a spectrum envelope curved surface on the
time-frequency plane by arranging short-time spectrum envelope information extracted
by speech analysis section 101 along the time-axis. Spectrum envelope quantization
section 104 quantizes the spectrum envelope curved surface generated by matrix generation
section 103.
[0021] By the way, the spectrum envelope information is quantized as a continuous function
on the time-frequency plane because when only the extracted spectrum envelope is quantized,
the spectrum envelope information is quantized depending on the excitation information,
making it impossible to realize a separation in information quantization processing,
which is the essence of the present invention.
[0022] Multiplexing section 105 multiplexes the code showing the quantized value of the
spectrum envelope curved surface output from spectrum envelope quantization section
104 and the code showing the quantized value of the fundamental frequency output from
fundamental frequency quantization section 102 and sends out the multiplexed code
to a communication path.
[0023] In speech decoder 200 in FIG.2, inverse multiplexing section 201 separates the received
code string into the code showing the quantized value of the spectrum envelope information
and the code showing the quantized value of the fundamental frequency.
[0024] Spectrum envelope configuration section 202 reconfigures the spectrum envelope curved
surface quantized from the received spectrum envelope information. Speech synthesis
section 203 extracts the spectrum envelope curved surface reconfigured by spectrum
envelope configuration section 202 based on the fundamental frequency information
and synthesizes this into a decoded speech and outputs.
[0025] Next, a flow of the information processing operation of the speech coder and speech
decoder according to this Embodiment shown in FIG.2 will be explained.
[0026] First, the fundamental frequency and short-time spectrum envelope information are
extracted from the input speech signal input by speech analysis section 101 of speech
coder 100. The extracted fundamental frequency is quantized by fundamental frequency
quantization section 102.
[0027] On the other hand, the extracted short-time spectrum envelope information is arranged
along the time-axis by matrix generation section 103 and a spectrum envelope curved
surface is generated on the time-frequency plane.The spectrum envelope curved surface
is quantized by spectrum envelope quantization section 104.
[0028] The quantized fundamental frequency and spectrum envelope curved surface are multiplexed
by multiplexing section 105 and sent out to the communication path. Then, the fundamental
frequency and spectrum envelope curved surface are received by inverse multiplexing
section 201 of speech decoder 200 and separated into the quantized value of the spectrum
envelope information and the quantized value of the fundamental frequency.
[0029] The quantized value of the spectrum envelope information is input to spectrum envelope
configuration section 202 and the spectrum envelope curved surface is reconfigured
by spectrum envelope configuration section 202.
[0030] Then, speech synthesis section 203 extracts the reconfigured spectrum envelope curved
surface based on the fundamental frequency information, synthesizes this into a decoded
speech and outputs.
[0031] Thus, by quantizing excitation information and spectrum envelope information independently,
even if the quantization accuracy of either information deteriorates it is possible
to prevent the quantization efficiency of the other information from decreasing and
thereby implement speech decoding of high quality even when information is transmitted
at a low bit rate.
(Embodiment 2)
[0032] FIG.3 is the block diagram showing an internal configuration of a spectrum envelope
quantization section of a speech coder according to Embodiment 2 of the present invention.
[0033] The configuration of the speech coder according to this Embodiment is the same as
the configuration of the speech coder shown in FIG.2 of Embodiment 1, and therefore
explanations thereof are omitted.
[0034] In spectrum envelope quantization section 104 in FIG.3, two-dimensional orthogonal
transformer 301 performs a two-dimensional orthogonal transformation on a spectrum
envelope curved surface in the time-axis direction and frequency-axis direction. Parameter
quantizer 302 quantizes the transformation coefficient obtained through the two-dimensional
orthogonal transformation processing by two-dimensional orthogonal transformer 301.
[0035] Here, it is generally perceptually difficult to recognize a difference of the high
frequency component of the spectrum envelope curved surface. Therefore, the speech
quality does not deteriorate a great deal even if a speech is synthesized on the decoding
side using only the coefficient information of the low frequency component obtained
through an orthogonal transformation. Thus, parameter quantizer 302 quantizes only
the coefficient information of the low frequency component.
[0036] In this way, it is possible to delete information, which is perceptually not important
by using an orthogonal transformation, and implement speech decoding of high quality
even when information is transmitted at a low bit rate.
(Embodiment 3)
[0037] FIG.4 is a block diagram showing an internal configuration of a spectrum envelope
quantization section of a speech coder according to Embodiment 3 of the present invention.
[0038] The configuration of the speech coder according to this Embodiment is the same as
the configuration of the speech coder of Embodiment 1 shown in FIG.2, and therefore
explanations thereof are omitted.
[0039] In spectrum envelope quantization section 104 in FIG. 4, model applier 311 models
a spectrum envelope curved surface and extracts a model parameter.
[0040] This model results from modeling of a spectrum envelope curved surface in the time-frequency
space and modeling is available by, for example, applying a crossfield model to both
sections on the time-axis of the spectrum envelope curved surface and complementing
the crossfield model as shown in FIG.5.
[0041] Parameter quantizer 302 quantizes the model parameter extracted by model applier
311.
[0042] By modeling a spectrum envelope curved surface in this way, it is possible to improve
the quantization efficiency of the spectrum envelope curved surface and implement
speech decoding of high quality even when information is transmitted at a low bit
rate.
(Embodiment 4)
[0043] FIG.6 is a block diagram showing an internal configuration of a spectrum envelope
quantization section of a speech coder according to Embodiment 4 of the present invention.
[0044] The configuration of the speech coder according to this Embodiment is the same as
the configuration of the speech coder of Embodiment 1 shown in FIG.2, and therefore
explanations thereof are omitted.
[0045] In spectrum envelope quantization section 104 in FIG.6, time-axis orthogonal transformer
321 performs an orthogonal transformation in the time-axis direction on a spectrum
envelope curved surface. Model applier 311 applies a model according to the degree
on the time-axis to the orthogonal-transformed time-axis transformation coefficient
and extracts a parameter. Parameter quantizer 302 quantizes the model parameter extracted
by model applier 311.
[0046] By applying a model according to the degree of the time-axis transformation coefficient
in this way, it is possible to improve the quantization efficiency of modeling and
implement speech decoding of high quality even when information is transmitted at
a low bit rate.
(Embodiment 5)
[0047] FIG.7 is a block diagram showing an internal configuration of a spectrum envelope
quantization section of a speech coder according to Embodiment 5 of the present invention.
[0048] The configuration of the speech coder according to this Embodiment is the same as
the configuration of the speech coder of Embodiment 1 shown in FIG.2, and therefore
explanations thereof are omitted.
[0049] In spectrum envelope quantization section 104 in FIG.7, time-axis orthogonal transformer
331 performs an orthogonal transformation in the time-axis direction on a spectrum
envelope curved surface and classifies it into ones whose orthogonal-transformed time-axis
transformation coefficient will be modeled and others whose orthogonal-transformed
time-axis transformation coefficient will not be modeled. One example of this classification
method is a method whereby a crossfield model is applied to the 0th order coefficient
of the time-axis because it is a spectrum envelope obtained by averaging the spectrum
envelope curved surface and no model is applied to other coefficients.
[0050] Model applier 311 applies a model according to the order on the time-axis to some
of the orthogonal-transformed time-axis transformation coefficients and extracts a
parameter. Frequency-axis orthogonal transformer 332 performs an orthogonal transformation
in the frequency-axis direction on the time-axis transformation coefficients to which
no model is applied. Parameter quantizer 302 quantizes the model parameter extracted
by model applier 311 and the transformation coefficients output from frequency-axis
orthogonal transformer 332.
[0051] By applying a model only to the time-axis transformation coefficient whose quantization
efficiency will improve with modeling in this way, it is possible to improve the quantization
efficiency of modeling, reduce modeling distortion and implement speech decoding of
high quality even when information is transmitted at a low bit rate.
(Embodiment 6)
[0052] FIG.8 is a block diagram showing an internal configuration of a model applier of
a speech coder according to Embodiment 6 of the present invention.
[0053] Model applier 311 according to this Embodiment is shown in one of Embodiments 3 to
5 above.
[0054] Model parameter estimator 401 applies a model to an input signal and extracts a parameter.
[0055] For example, in case of speech coding, an input signal is modeled with a crossfield
model taking into account the speech generation process. However, when the order of
the model is low, the model cannot show zero points contained in the signal producing
analysis distortion by the model.
[0056] Therefore, model error estimator 402 estimates the analysis distortion generated
when the model is applied and outputs the estimated analysis distortion to the parameter
quantizer.
[0057] By quantizing modeling distortion in this way, it is possible to improve the quantization
efficiency of modeling, reduce modeling distortion and implement speech decoding of
high quality even when information is transmitted at a low bit rate.
(Embodiment 7)
[0058] FIG.9 is a block diagram showing an internal configuration of a parameter quantizer
of a speech coder according to Embodiment 7 of the present invention.
[0059] Parameter quantizer 302 according to this Embodiment is shown in one of Embodiments
2 to 5 above.
[0060] Weight calculator 501 determines quantization sensitivity for each quantization target
value using fundamental frequency information. An example of the method of determining
quantization sensitivity by weight calculator 501 is shown below.
[0061] Speech decoding processing generates a decoded speech by extracting a spectrum envelope
curved surface according to the fundamental frequency and connecting it on the time-axis.
At this time, the harmonic wave amplitude value of the fundamental frequency becomes
information more important than that of the other spectrum amplitude values. Therefore,
a weighting factor curved surface obtained by assigning weights to the harmonic wave
amplitude value at the position of the extracted spectrum envelope is generated.
[0062] Next, quantization sensitivity of each quantization target value is determined by
transforming using a method similar to that of the transformation used to obtain the
quantization target value and calculating a weighting factor in the quantization target
parameter space.
[0063] Weight calculator 502 determines quantization sensitivity for each quantization target
value using the spectrum envelope information. An example of the method of determining
quantization sensitivity by weight calculator 502 is shown below.
[0064] When noise of a same magnitude is added to a certain signal, noise of a signal with
a small dynamic range is perceptually more conspicuous than noise of a signal with
a large dynamic range. Therefore, a weighting factor curved surface is generated with
a greater weight assigned for a signal of a smaller amplitude on the spectrum envelope
curved surface.
[0065] Then, quantization sensitivity for each quantization target value is determined by
carrying out a transformation using a method similar to that for the transformation
to obtain the quantization target value and calculating a weighting factor in the
quantization target parameter space. Since the method of applying the quantizer by
weight calculator 502 is also necessary in decoding processing, it is desirable to
use a spectrum envelope information quantized with a preceding frame in order to establish
synchronization with the decoding processing.
[0066] Statistical quantity accumulator 503 accumulates a statistical quantity for every
quantization target value acquired beforehand. Quantization generator 504 designs
a quantizer from the quantization sensitivity corresponding to the quantization target
values output from weight calculator 501 and weight calculator 502 and a statistical
quantity accumulated in statistical quantity accumulator 503.
[0067] For example, when a scalar quantizer is used, the dispersion of quantization target
values is accumulated as a statistical quantity and a quantization step width is determined
based on this dispersion and quantization sensitivity. When the dispersion is the
same, quantization sensitivity is large. That is, a smaller quantization step width
is set for a quantization target value susceptible to quantization errors.
[0068] Quantizer 505 quantizes quantization target values based on the design result of
quantizer generator 504.
[0069] Thus, by making a quantizer adaptable to the fundamental frequency and spectrum envelope
information, it is possible to reduce objective quantization distortion of the synthesized
speech signal and perceptual distortion.
[0070] By the way, this Embodiment determines quantization sensitivity for each quantization
target value using two information pieces, the fundamental frequency information and
spectrum envelope information, but it is also possible to determine quantization sensitivity
using the information of either of the two to design the quantizer.
(Embodiment 8)
[0071] FIG.10 is a block diagram showing an internal configuration of a parameter quantizer
of a speech coder according to Embodiment 8 of the present invention.
[0072] Parameter quantizer 302 according to this Embodiment is shown in one of Embodiments
2 to 5 above.
[0073] Error scale determinator 511 adaptively determines a quantization error scale on
a spectrum envelope using the fundamental frequency information. Error scale determinator
512 adaptively determines a quantization error scale on a spectrum envelope using
the spectrum envelope information. Error scale synthesizer 513 synthesizes the error
scales obtained from error scale determinator 511 and error scale determinator 512
into one error scale.
[0074] Codebook 514 stores quantized values. Spectrum envelope configurator 515 converts
a quantized value stored in codebook 514 into a spectrum envelope curved surface.
Spectrum envelope configurator 516 converts a quantization target value into a spectrum
envelope curved surface.
[0075] Error calculator 517 calculates an error between the spectrum envelope curved surface
composed by spectrum envelope configurator 515 and the spectrum envelope curved surface
composed by spectrum envelope configurator 516 based on the error scale output from
error scale synthesizer 513.
[0076] Code selector 518 selects and outputs the code corresponding to the quantized value,
which corresponds to the smallest error from codebook 514.
[0077] By calculating the error of the spectrum envelope curved surface on the time-frequency
plane using the error scale at the time of the quantization adapted to the fundamental
frequency and the spectrum envelope information in this way, it is possible to reduce
objective quantization distortion and perceptual distortion of the synthesized speech
signal.
[0078] This Embodiment determines a quantization error scale on the spectrum envelope with
respect to both the fundamental frequency and the spectrum envelope information, but
it is also possible to determine a quantization error scale with respect to either
of the two to calculate an error.
(Embodiment 9)
[0079] FIG.11 is a block diagram showing an internal configuration of a parameter quantizer
of a speech coder according to Embodiment 9 of the present invention.
[0080] Parameter quantizer 302 according to this Embodiment is shown in one of Embodiments
2 to 5 above.
[0081] Error function determinator 521 adaptively determines a quantization error weighting
factor on a spectrum envelope using the fundamental frequency information. Error function
determinator 522 adaptively determines a quantization error weighting factor on a
spectrum envelope using the spectrum envelope information. Error function synthesizer
523 synthesizes the quantization error weighting functions obtained from error function
determinator 521 and error function determinator 522 into one error function. Error
function converter 524 defines an error scale on a quantization parameter that converts
the quantization error weighting function output from error function synthesizer 523.
[0082] Codebook 525 stores quantized values. Error calculator 526 calculates an error between
a quantization target value and a quantized value stored in codebook 525 based on
the error scale output from error function converter 524.
[0083] Code selector 527 selects and outputs the code corresponding to the quantized value,
which corresponds to the smallest error from codebook 525.
[0084] By calculating the error between quantization parameters using the error scale at
the time of the quantization adapted to the fundamental frequency and the spectrum
envelope information in this way, it is possible to reduce objective quantization
distortion of a synthesized speech signal and perceptual distortion with a small amount
of processing.
[0085] This Embodiment determines a quantization error weighting function on a spectrum
envelope with respect to both the fundamental frequency and the spectrum envelope
information, but it is also possible to determine a quantization error weighting function
with respect to either of the two to calculate an error.
(Embodiment 10)
[0086] FIG.12 is a block diagram showing an internal configuration of a spectrum envelope
configuration section of a speech decoder according to Embodiment 10 of the present
invention.
[0087] By the way, the configuration of the speech decoder according to this embodiment
is the same as the configuration of the speech decoder of Embodiment 1 shown in FIG.2,
and therefore explanations thereof are omitted.
[0088] As explained in Embodiment 2 above, the speech coding/decoding method using an orthogonal
transformation is intended to compress information by not transmitting the high frequency
component, which is not perceptually important on the coding side. Therefore, this
Embodiment generates an envelope curved surface by complementing the parameter, which
has not been received, using the parameter value statistically calculated beforehand
on the decoding side.
[0089] In spectrum envelope configuration section 202 in FIG.12, parameter accumulator 601
stores parameter values statistically calculated beforehand, corresponding to parameters
other than the quantization target. Spectrum envelope generator 602 generates a spectrum
envelope curved surface based on the input spectrum envelope information.
[0090] By using a statistically calculated value as the parameter other than the quantization
target in this way, it is possible to restore a more accurate spectrum envelope curved
surface than in the case where an arbitrary value is used.
[0091] As described above, the speech coder, speech decoder and speech coding/decoding method
of the present invention can completely separate spectrum envelope information from
excitation information, achieve speech coding/decoding processing independent of the
quantization accuracy of spectrum envelope information and achieve highly efficient
speech coding/decoding processing through a highly efficient technique of quantizing
spectrum envelope information, which is effective in an analysis/synthesis model,
and thereby implement speech decoding of high quality even when information is transmitted
at a low bit rate.
[0092] This application is based on the Japanese Patent Application No.HEI 11-275119 filed
on September 28, 1999, entire content of which is expressly incorporated by reference
herein.
Industrial Applicability
[0093] The present invention is ideally applicable to a base station apparatus or a communication
terminal apparatus in a radio communication system that carries out radio communication
of speech data.
1. A speech coder comprising:
speech analyzing means for extracting a fundamental frequency and spectrum envelope
information from an input speech signal;
fundamental frequency quantizing means for quantizing the extracted fundamental frequency;
matrix generating means for generating a spectrum envelope curved surface from the
extracted spectrum envelope information;
spectrum envelope quantizing means for quantizing the generated spectrum envelope
curved surface; and
multiplexing means for multiplexing the quantized value of the spectrum envelope curved
surface and the quantized value of the fundamental frequency and sending the multiplexed
value.
2. The speech coder according to claim 1, wherein the spectrum envelope quantizing means
comprising:
two-dimensional orthogonal transforming means for performing a two-dimensional orthogonal
transformation in the time-axis direction and frequency-axis direction on the spectrum
envelope curved surface; and
parameter quantization means for quantizing the transformation coefficient output
from this two-dimensional orthogonal transforming means.
3. The speech coder according to claim 1, wherein the spectrum envelope quantizing means
comprising:
model applying means for modeling the spectrum envelope curved surface and extracting
a model parameter; and
parameter quantizing means for quantizing the extracted model parameter.
4. The speech coder according to claim 1, wherein the spectrum envelope quantizing means
comprising:
time-axis orthogonal transforming means for performing an orthogonal transformation
in the time-axis direction on the spectrum envelope curved surface;
model applying means for applying a model according to the order on the time-axis
to the orthogonal-transformed time-axis transformation coefficient and extracting
a parameter; and
parameter quantizing means for quantizing the extracted model parameter.
5. The speech coder according to claim 4, further comprising frequency-axis orthogonal
transforming means for performing an orthogonal transformation in the frequency-axis
direction on a time-axis transformation coefficient to which no model is applied,
wherein the parameter quantizing means quantizes the extracted model parameter and
the transformation coefficient output from said frequency-axis orthogonal transforming
means.
6. The speech coder according to claim 3, wherein the model applying means comprising:
model parameter estimating means for applying a model to an input signal and extracting
a parameter; and
model error estimating means for estimating analysis distortion generated when the
model is applied by this model parameter estimating means.
7. The speech coder according to claim 2, wherein the parameter quantizing means comprising:
weight calculating means for determining quantization sensitivity for each quantization
target value using at least one of the fundamental frequency information and spectrum
envelope information;
statistical quantity storing means for storing a statistical quantity calculated beforehand
for every quantization target;
quantization generating means for designing a quantizer from quantization sensitivity
corresponding to the quantization target value output from said weight calculating
means and the statistical quantity stored in the statistical quantity storing means;
and
quantizing means for quantizing the quantization target value based on the design
result of this quantization generating means.
8. The speech coder according to claim 2, wherein the parameter quantizing means comprising:
error scale determining means for adaptively determining a quantization error scale
on the spectrum envelope using at least one of the fundamental frequency information
and spectrum envelope information;
first spectrum envelope configuring means for converting a quantized value stored
in a codebook to a spectrum envelope curved surface;
second spectrum envelope configuring means for converting the quantization target
value to a spectrum envelope curved surface;
error calculating means for calculating an error between the spectrum envelope curved
surface configured by said first spectrum envelope configuring means and the spectrum
envelope curved surface configured by said second spectrum envelope configuring means
based on the error scale; and
code selecting means for selecting a code corresponding to a quantized value with
a minimum error from the codebook.
9. The speech coder according to claim 2, wherein the parameter quantizing means comprising:
error function determining means for adaptively determining a quantization error weighting
function on the spectrum envelope using at least one of the fundamental frequency
information and spectrum envelope information;
error function converting means for defining an error scale on the quantization parameter
that converts the quantization error weighting function;
error calculating means for calculating an error between the quantization target value
and quantized value stored in the codebook based on the error scale; and
code selecting means for selecting a code corresponding to a quantized value with
a minimum error from the codebook.
10. A speech decoder comprising:
inverse multiplexing means for separating a code string sent from the speech coder
according to claim 1 into a code showing a quantized value of spectrum envelope information
and a code showing a quantized value of a fundamental frequency;
spectrum envelope configuring means for reconfiguring a quantized spectrum envelope
curved surface from the received spectrum envelope information; and
speech synthesizing means for extracting the reconfigured spectrum envelope curved
surface based on the fundamental frequency information and synthesizing a decoded
speech.
11. The speech decoder according to claim 10, wherein the spectrum envelope configuring
means comprising:
parameter storing means for storing parameter values statistically calculated according
to parameters other than the equalization target; and
spectrum envelope generating means for generating a spectrum envelope curved surface
based on the input spectrum envelope information.
12. A speech coding/decoding method with the coding side comprising the steps of:
extracting a fundamental frequency and spectrum envelope information from an input
speech signal;
quantizing the extracted fundamental frequency;
generating and quantizing a spectrum envelope curved surface from the extracted spectrum
envelope information; and
sending the quantized value of the spectrum envelope curved surface multiplexed with
the quantized value of the fundamental frequency, and
with the decoding side comprising the steps of:
separating the received code string into a code showing a quantized value of spectrum
envelope information and a code showing a quantized value of a fundamental frequency;
reconfiguring a quantized spectrum envelope curved surface from the received spectrum
envelope information; and
extracting the reconfigured spectrum envelope curved surface based on the fundamental
frequency information and synthesizing a decoded speech.
13. A mechanically readable recording medium that records a speech coding program to make
a computer execute the steps of:
extracting a fundamental frequency and spectrum envelope information from an input
speech signal;
quantizing the extracted fundamental frequency;
generating a spectrum envelope curved surface from the extracted spectrum envelope
information;
quantizing the generated spectrum envelope curved surface; and
multiplexing the quantized value of the spectrum envelope curved surface and the quantized
value of the fundamental frequency.
14. A mechanically readable recording medium that records a speech decoding program to
make a computer execute the steps of:
separating a received code string into a code showing a quantized value of spectrum
envelope information and a code showing a quantized value of a fundamental frequency;
reconfiguring a quantized spectrum envelope curved surface from the received spectrum
envelope information; and
extracting the reconfigured spectrum envelope curved surface based on the fundamental
frequency information and synthesizing a decoded speech.