(19)
(11) EP 0 774 750 A2

(12) EUROPEAN PATENT APPLICATION

(43) Date of publication:
21.05.1997 Bulletin 1997/21

(21) Application number: 96308081.7

(22) Date of filing: 07.11.1996
(51) International Patent Classification (IPC)6G10L 9/14
(84) Designated Contracting States:
DE FR GB SE

(30) Priority: 15.11.1995 US 6787
23.10.1996 US 735827

(71) Applicant: NOKIA MOBILE PHONES LTD.
24101 Salo (FI)

(72) Inventor:
  • Ruoppila, Vesa T.
    33720 Tampere (FI)

(74) Representative: Slingsby, Philip Roy et al
NOKIA MOBILE PHONES, Patent Department, St. Georges Court, St. Georges Road, 9 High Street
Camberley, Surrey GU15 3QZ
Camberley, Surrey GU15 3QZ (GB)

   


(54) Determination of line spectrum frequencies for use in a radiotelephone


(57) An audio signal, such as a speech signal, is applied to a LPC filter. A method is disclosed for determining line spectrum frequencies of the LPC filter, expressed by symmetric and antisymmetric auxiliary polynomials, the zeros of which determine the line spectrum frequencies of the LPC filter. The method includes the steps of (a) expressing the auxiliary polynomials using explicit forms of Chebyshev polynomials; (b) iteratively solving a zero of a first of the polynomials using a zero of the other one of the polynomials; and (c) successively eliminating zeroes from the polynomials by polynomial deflation so as to determine the line spectrum frequencies. Also disclosed is a method for determining the immittance spectrum frequencies.




Description


[0001] This invention relates generally to speech encoding methods and apparatus and, in particular, to linear predictive coding (LPC) speech and audio coding techniques that employ a line spectrum frequency representation of an LPC filter.

[0002] Linear predictive coding (LPC) is a known technique for analyzing a speech signal and for characterizing the signal in terms of coefficients which are encoded, broadcast, received and decoded to recover an approximation of the original signal. The parameters of a LPC filter are coded and sent as a part of the information stream. The use of line spectrum frequencies is an alternative to the use of, for example, polynomial coefficients or reflection coefficients for representing the LPC filter. The line spectrum frequencies have useful properties for quantization and interpolation which make them a more attractive representation than polynomial or reflection coefficients.

[0003] However, in known types of speech encoders that employ line spectrum frequencies, the procedure for deriving the line spectrum frequencies is computationally expensive. This disadvantage becomes especially apparent when implemented in real time or substantially real time in a speech encoder of, by example, a digital cellular telephone.

[0004] It is a first potential aim of this invention to provide a simplified and efficient technique for reducing the complexity of the line spectrum frequency computation.

[0005] It is a second potential aim of this invention to provide a radiotelephone having an LPC-based speech/audio encoder that employs line spectrum frequencies that are obtained in accordance with the method of this invention.

[0006] The foregoing and other problems may preferably be overcome and the potential aims of the invention realized by methods and apparatus in accordance with embodiments of this invention.

[0007] In this invention, a method for determining line spectrum frequencies of a LPC filter is disclosed. The predictor polynominal of the LPC filter is decomposed into symmetric and antisymmetric auxiliary polynomials, the zeros of which determine the line spectrum frequencies of the filter. In other words, the line spectrum frequency representation is determined by solving the zeros of the two auxiliary polynomials. Due to the symmetry of the auxiliary polynomials, their zeros are preferably solved from two cosine series. In speech codecs this is usually done by a bisection algorithm, and by employing the definition of Chebyshev polynomials to evaluate the cosine series.

[0008] A potential aim of this invention is to reduce the complexity of the line spectrum frequency computation. This may be obtained by rewriting the cosine series as polynomials using explicit forms of Chebyshev polynomials. This enables an evaluation of the series by nested multiplications. Moreover, the already-computed zeros are successively eliminated from the polynomial by polynomial deflation. This procedure and the properties of the auxiliary polynomials can enable the initial values to be chosen in the zero finding algorithm such that a zero is found by only a few iterations using the zero of the other polynomial. Thus, the invention may reduce considerably the arithmetic operations required to compute the line spectrum frequencies. The method of this invention thus has the potential to be implemented with relatively low complexity, and furthermore to be accomplished using fixed-point arithmetic.

[0009] The above set forth and other features of the invention are made more apparent in the ensuing detailed description of the invention when read in conjunction with the attached Drawings, wherein:

Fig. 1 is a block diagram showing a speech encoder employing a line spectrum frequency representation of the LPC filter.

Fig. 2 is a flow chart describing one method to implement the present invention. The procedure employed for computing the zeros of the auxiliary polynominals R and S (the block Compute zero) is presented in more detail in Figure 3.

Fig. 3 is a flow chart of one method to implement Newton's method. The polynomial G = R if i is odd. Otherwise G = S. G'(x) denotes the first derivative of the polynomial G at the point x. An additional refining procedure is often unnecessary if sufficient numerical accuracy is employed.

Figs. 4A-4D illustrate the progress of the technique (algorithm (29)) employing polynomial deflation.



[0010] A simplified block diagram of a speech encoder 10 employing the spectrum frequency representation of the LPC filter is presented in Fig. 1. The speech encoder 10 may form a portion of a radiotelephone, such as a digital cellular user terminal or a personal communicator device. An input audio signal, such as a speech signal obtained from a speech transducer or microphone 5, is converted into a digital form by an analog-to-digital (A/D) converter 12. The digital output of the A/D converter is preprocessed by separating the signal into frames of convenient length, typically of the order of tens of milliseconds. It should be noted that the A/D conversion is not necessary if the signal is already in digital form. After preprocessing, the signal is applied to an LPC analysis block 14. The LPC analysis produces coefficients for an LPC filter, also referred to herein as an LPC-analysis filter 16. The output of the LPC analysis block 14 is transformed into a line spectrum frequency (LSF) representation in block 18. The LSF coefficients may be quantized in block 20 and then interpolated in block 22 in order to construct a LPC analysis filter for each speech subframe. By example, and for a speech frame having a duration of 20 milliseconds, four 5 millesecond subframes may be used, wherein the analysis filter is constructed separately for each subframe. After the LPC filtering in block 16 the coefficients of a long-term prediction (LTP) filter are searched, and the residual is generated in the block labeled LTP analysis and filtering 24. The residual is encoded in the excitation encoding block 26, and the resulting encoded residual, i.e., an encoded excitation signal, is multiplexed (block 28) with the quantized LSF coefficients into a bit stream transmitted to a speech decoder (not shown) via a communication channel 30. By example, the communication channel 30 is a radio channel linking the mobile terminal to a base station (not shown) by a transmitter 32 and an antenna 34. The "side information" input to the multiplexer 28 determines, for example, the operational mode of the speech coder, particularly for variable rate codecs such as QCELP. For a speech coder operated in a fixed rate mode, this input may not be required.

[0011] The following detailed description of the invention pertains most particularly to the operation of the LPC to LSF block 18 of Fig. 1. The present invention does not require any modifications to other blocks presented in Fig. 1. It is also noted that the LPC is LSF transformation is not used in a decoder of present speech codecs. Therefore, the decoder of the speech codec is not considered in this description, although a decoder that employs a LPC to LSF transformation is also within the scope of the teaching of this invention.

1.1 Line Spectrum Frequency Representation



[0012] The nth degree predictor polynomial of the LPC filter

satisfies the recurrence formula

wherein k1, k2,...,kn+1 are reflection coefficients. The recurrence formula (2) is called the Levinson-Durbin solution to the Yule-Walker equations. It expresses the relationship between the (n+1)th and the nth degree predictor polynomials. For the purposes of this description it is assumed that all roots of the predictor polynomial An(z) are inside the unit circle, i.e., that the predictor polynomial is minimum phase.

[0013] By setting kn+1 = 1, the recurrence formula (2) gives the polynomial:



[0014] By construction, (3) is a symmetric polynomial in a sense that it satisfies the relation



[0015] Similarly, by setting kn+1 =-1 in (2) one obtains an antisymmetric polynomial

which has the property



[0016] From (3) and (4) it follows that the predictor polynomial (1) can be decomposed into a sum of symmetric and antisymmetric polynomials:



[0017] Actually, the roots of the polynomials Pn+1(z) and Qn+1(z) determine the line spectrum frequencies of the predictor polynomial. Soong and Juang ("Line spectrum pair (LSP) and speech data compression", Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, San Diego, CA, pp.1.10.1-1.10.4, March 1984) have shown that if An(z) is minimum phase, then the roots of Pn+1(z) and Qn+1(z) are on the unit circle, and the roots are simple and separate from each other. Therefore, Pn+1(z) and Qn+1(z) can be factored as follows:



where ω1, ω2,...,ωn are the phase angles of the zeros of the polynomials



such that

ω1, ω2,...,ωn are the line spectrum frequencies of An(z).

[0018] Note that both polynomials P(z) and Q(z) are symmetric, and the degree of P(z) is 2mp, where



[0019] The degree of the polynomial Q(z) is 2mQ, where



[0020] By explicitly using the symmetry of the polynomial, P(z) defined in (8) can be written in the form

where p1, p2,..., pmp are the coefficients of P(z). By substituting z = e and by employing the relation

equation (11) gives



[0021] The symmetric polynomial Q(z) can be rewritten similarly to the form

where q1, q2,..., qmQ are the coefficients of the polynomial Q(z).

[0022] From equation (8) it follows that line spectrum frequencies ω13,...,ω2mp are the zeros of P(ω) in the interval [0, π]. Correspondingly, line spectrum frequencies ω2, ω4,..., ω2mQ are the zeros of Q(ω) when ω ∈ [0,π]. Hence, the line spectrum frequencies of An(z) can be found by solving zeros of series



where ω ∈ [0, π]. In a QCELP speech coder, by example, line spectrum frequencies are solved directly from (14) and (15) (see, by example, TIA/EIA/IS-96-A, Speech Service Option Standard for Wideband Spread Spectrum Digital Cellular System (1994)). However, it is more desirable to rewrite the cosine series as polynomials in order to obviate the evaluation of trigonometric functions. This is discussed in more detail in the following section.

[0023] The relations between the coefficients of the predictor polynomial and the coefficients of the cosine series (14) and (15) can be derived from equations (3), (4) and (6)-(9).

1.2 Chebyshev Polynomials



[0024] The Chebyshev polynomials of the first kind are defined by the recurrence formula

with initial conditions T0(x) = 1 and T1(x) = x. For x in the interval [-1,1], the Chebyshev polynomials have the closed-form expression



[0025] The explicit forms of the first few Tk are









[0026] By changing a variable,

and using (12), equations (14) and (15) give





[0027] The line spectrum frequencies {ωi} can be determined by solving the equations R(x) =0 and S(x) =0 for x in the interval [-1,1]. Once the roots {xi} are solved, the corresponding line spectrum frequencies are given by ωi = arccos xi.

[0028] Various methods to solve the zeros of (14) and (15) have been suggested in the literature. So as to provide a background for the teaching of this invention, a brief survey is made of a few algorithms presented in the literature. Also, the methods employed in some currently standardized speech codecs are discussed.

[0029] The procedure introduced by Soong and Juang evaluates (14) and (15) on a fine grid by discrete cosine transformation. Sign changes at adjacent grid points isolate intervals containing roots. After a sign change has been detected, the interval is bisected until a sufficiently accurate numerical estimate for the zero is obtained. A similar bisection based algorithm is used also in the QCELP speech codec (i.e., TIA/EIA/IS-96-A, Speech Service Option Standard for Wideband Spread Spectrum Digital Cellular System (1994)). However, in QCELP the equations (14) and (15) are evaluated by computing directly all the terms of the series without transformations. Since a large number of trigonometric functions have to be evaluated, the algorithm becomes inevitably complex, and the numerical accuracy may be poor in a fixed-point implementation.

[0030] Kang and Fransen ("Application of line spectrum pairs to low bit rate speech encoders", Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Tampa, FL, pp. 7.3.1-7.3.4, March 1984) have proposed autocorrelation based and ratio-filter based methods for finding line spectrum frequencies. However, these two methods also require an evaluation of a large number of trigonometric functions.

[0031] In order to obviate evaluation of trigonometric functions, Kabal and Ramachandran ("The computation of line spectrum frequencies using Chebyshev polynomials", IEEE Transactions on Speech and Audio Processing, vol. 34, no. 6, pp. 1419-1426, 1986) suggested that the cosine series (14) and (15) are transformed to the form (19) and (20). Then line spectrum frequencies {ωi} can be determined by solving the zeros of (19) and (20) in the interval [-1,1]. Also Kabal and Ramachandran employed the bisection algorithm; Equations (19) and (20) are evaluated on a grid to locate sign changes at adjacent grid points. After a sign change has been detected, the zero is computed by successively bisecting the interval. Since this method is currently used in several speech codecs, e.g., in (TIA/EIA/IS-641 TDMA Cellular/PCS-Radio Interface-Enhanced Full-Rate Speech Codec (1996)), it is described in more detail in the next section.

[0032] Saoudi et al. ("A new efficient algorithm to compute the LSP parameters for speech coding", Signal Processing, vol. 28, pp. 201-212, 1992) reformulated the problem. That is, they introduced an algorithm which solves line spectrum frequencies from eigenvalues of tridiagonal symmetric matrices without computing the predictor polynomial. The eigenvalues are computed by the bisection method. Saoudi et al. also compared the complexity of several algorithms introduced for computing the line spectrum frequencies. Unfortunately, the results of the comparison are not comprehensive.

[0033] Also it is noted that Chan ("Computation of LSF parameters from reflection coefficients," Electronic Letters, vol. 27, no. 19, pp. 1773-1774, 1991, and "Efficient interconversion algorithm for PARCOR and LSP parameters," in Proceedings of International Symposium on Speech, Image Processing and Neural Networks, Hong Kong, pp. 603-606, 13-16 April 1994) has introduced a method for forming polynomials Pk=1(z) and Qk=1,2...,n(z) for k without computing explicitly the coefficients of the predictor polynomial. However, this approach does not solve the zero finding problem.

[0034] As was mentioned above, Kabal and Ramachandran suggested that the cosine series (14) and (15) be rewritten to the form (19) and (20) by exploiting the Chebyshev polynomials. This allows an evaluation of the equations with a simple recursion, and without trigonometric functions. Kabal and Ramachandran have used the property (10), or equivalently

to reduce the complexity of their algorithm.
(23) Algorithm

(a) Initialization. Compute the coefficients of (19) and (20) when An(z) is given. Partitate the upper half of the unit circle into N subintervals [ωGk, ωG(k+1], k = 1,2, ..., N and such that ωG1 = 0 and ωGN = π. Map the grid points {ωGk} into the real axis by xGk = cos ωGk. Set k = 1 and start the search from the polynomial R(x).

(b) Check if the polynomial under investigation has a sign change in the interval [xGk,xG(k+1)].

(c) If a sign change is detected, locate the zero xi from the interval [xGk,xG(k+1)] by the bisection algorithm. After the zero has been found, continue the search from another polynomial.

(d) Set k: = k+1. Continue to (b) until all n zeros are found or all intervals are gone through.



[0035] Although the algorithm (22) is known to be used in speech codecs, it has several shortcomings. For example, if some zeros of the polynomials are sufficiently near each other, the algorithm has a tendency to miss zeros, since the sign change is not detected. This shortcoming can be circumvented by making the grid denser. In other words, the intervals are made sufficiently small so that two or more roots do not occur in the same interval. The definitive choice of the grid interval is always a compromise between a reliability and computational burden. That is, as the grid interval is made more dense, the overall processing burden increases as well.

[0036] In accordance with the teaching of this invention, a reduction in the complexity of the speech encoding, and more particularly, the computation of line spectrum frequencies is made possible. In accordance with the teaching of this invention, a significant improvement is achieved by evaluating (19) and (20) with a recursion requiring fewer arithmetic operations than the methods used in current speech encoders.

[0037] Further in accordance with the teaching of this invention, the method successively eliminates already-found zeros from the polynomials. The procedure is known as polynomial deflation, or as synthetic division in numerical analysis, see, e.g., Kincaid and Chaney. The use of polynomial deflation allows for the elimination of the sign-change detection procedure from the algorithm. Another advantage is that efficient algorithms (which have better convergence properties than the bisection method) can be exploited together with the property (21) to locate the zeros. For example, when zeros are computed by Newton's method, the algorithm is guaranteed to find all line spectrum frequencies if sufficient numerical accuracy is used.

[0038] The methods of this invention are presented in more detail in the following discussion.

[0039] Consider now the evaluation of R(x) and S(x) for a given value of x in the interval [-1, 1]. The Chebyshev series (19), and similarly (20), to be evaluated can be represented as

with p0 = 1, a0 = (1/2) pmp and ak = pmp-k. The summation in (23) reduces to

where b0(x) and b2 (x) are given by the backward recurrence formula



Kabal and Ramachandran have employed this technique in their algorithm.

[0040] Although (25) is a numerically robust method for evaluating the Chebyshev series, it requires (mp + 1) multiplications and (2mp - 1) additions in this particular application. Also several data-move operations increase the workload in a DSP implementation.

[0041] In this invention, there is instead employed a more efficient method. To evaluate R(x) and S(x), instead convert (19) and (20) to the polynomials



with the explicit forms of the Chebyshev polynomials. The polynomials (25) and (26) can then be evaluated effectively by the procedure of nested multiplications, known also as Horner's algorithm (see, for example, Kincaid and Cheney, Numerical Analysis: Mathematics of Scientific Computing, Brooks/Cole Publishing Company, 1991).

[0042] Horner's algorithm produces R(x) = b0(x), and similarly S(x), by the backward recurrence relation

with an initial value bmp(x) = rmp. The equation (27) requires mp multiplications and mp additions. Naturally, the recursion (27) can be used also in other mathematically equivalent forms.

Example



[0043] Consideration is made of the case when the degree of An(z) is ten (n = 10). Hence mp = 5. The conversion from (19) to (25) gives polynomial coefficients
r0 = 16,    r1 = 8p1,
r2 = 4p2 - 20,    r3 = 2p3 - 8p1,
r4 = -3p2 + p4 + 5    r5 = p1 - p3 + p5/2,
when the polynomial
R(x) = {16x5 - 20x3 + 5x} + p1 {8x4 - 8x2 + 1}
+ p2{4x3 -3x} + p3{2x2 - 1} + p4x + p5/2
is written in the open form. In this form the evaluation of R(x) requires 5 multiplications and 5 additions using Horner's algorithm, while the method (24) would require 6 multiplications and 9 additions. The difference is a total of five operations (plus data-move operations in a fixed-point implementation). As can be appreciated, in some speech coders the polynomials are evaluated hundreds of times in a single speech frame during the computation of line spectrum frequencies and, as a result, a large number of operations can be eliminated. The computation of r0, r1,..., r5 requires six additions and six multiplications. However, these coefficients are usually computed only once per frame.

[0044] A discussion is now made of the polynomial deflation aspects of this invention. It is first pointed out that it is possible to avoid the checking of sign-changes if already-found zeros are successively eliminated from the polynomials R(x) and S(x), or equivalently from R(ω) and S(ω). In numerical analysis this procedure is known as polynomial deflation or synthetic division. The polynomial deflation is based on the relation



[0045] By example, assume that n is even, and a zero x = x1 of R(x) has been located. Then, the remaining zeros x3,..., xn-1 are also zeros of RQ(x). To compute these zeros, the polynomial R(x) can be replaced by the quotient polynomial RQ(x). This is referred to as the polynomial deflation. The procedure can be repeated; as soon as a zero is found, it can be factored out. In this way, the zero finding technique can operate with polynomials of lower and lower degree. The known zeros can be removed from a polynomial by the use of Horner's algorithm. Clearly, the use of polynomial deflation saves arithmetic operations.

[0046] Now, the algorithm employing polynomial deflation for computing the line spectrum frequencies, in accordance with the teaching of this invention, can be summarized as follows:
(29) Algorithm

(a) Initialization. Compute the coefficients of (25) and (26) when An(z) is given. Start the search from the polynomial R(x) with an initial value x0 = 1. Set k = 1.

(b) Compute the zero xk ∈ (-1, xk-1) by Newton's method. The previous zero xk-1 is a reasonable initial value. See Fig. 3.

(c) Refine the numerical estimate of the zero xk by taking additional steps in Newton's method using the undeflated polynomial (Fig. 3). It is noted that this step of refining the numerical estimate is optional.

(d) If the degree of the polynomial is two, solve the last zero analytically (see the next section). Otherwise eliminate the zero xk from the polynomial by the polynomial division technique. Continue the search from the other polynomial.

(e) Set k: = k + 1. Continue to (b) until all n zeros are found.


Solution of the last zero of R(x) and S(x)



[0047] When the deflated polynomials R(x) and S(x) are of degree 2, and their other zero is known, the last zero of the polynomials can be solved analytically. For example, consider the polynomial

which is obtained by dividing already-found zeros from the polynomial R(x). Zeros of RQ(x) are denoted by xQ1 and xQ2.

[0048] Assume now that the zero xQ1 has already been found, and the task is to solve xQ2. Since the polynomial division (rQ0x2 + rQ1x + rQ2)/(x - xQ1) gives the quotient RQ1 (x) = rQ0x + rQ1 + rQ0xQ1, the zero xQ2 is readily obtained from the equation RQ1 (x) = 0. The equation gives xQ2 = -xQ1 -rQ1/rQ0. By this way the computation of the square root resulting from the solution of RQ(x) = 0 is obviated. Moreover, unnecessary Newton's iterations are not needed.

[0049] This procedure is applied in step (d) of algorithm (29), i.e., in the block Solve the last zero in Fig. 2.

[0050] Newton's method is described in Fig. 3 in more detail. Both the value of the polynomial, and its first derivative, are computed effectively by Horner's method.

[0051] Errors in the numerical estimate of xk accumulate when k increases because the zeros of the successive quotient polynomials deviate more and more from the zeros of the undeflated polynomial. This is obviated by refining the zeros given by step (b) of algorithm (29). The zero can be refined by taking additional steps in Newton's method, but now using the undeflated polynomials (Fig. 3). However, if an adequate numerical accuracy is used, the refining step is not usually necessary.

[0052] The term 1/G(x) employed in the Newton's method can be approximated, for example, by tabulating its values in an appropriate range of G(x). If this approximation is denoted by GA(x), the Newton's step xk: =xk - G(xk)/G(xk) can be simplified to the form xk:=xk - GA(x)G(xk). Hence the division operation is not needed. However, it is obvious that the approximation weakens the convergence properties of the zero finder, although it also reduces the complexity of the fixed-point implementation.

[0053] In addition to the Newton's method, other algorithms such as the secant method, see Kincaid and Cheney, can be used for locating or refining the zeros.

[0054] Also the initial values can be chosen differently. For example, the algorithm can be started from x0 = -1. In this case the estimates of zeros given by the procedure would be in ascending order -1 < x1 < x2 < ... < xn < 1.

[0055] It should be noted that all equations required in the algorithm (29) can be derived straightforwardly in accordance with the teaching of this invention for the more general form of the polynomial An(z), that is



[0056] Consequently, the teaching of the present invention is not limited to the all-pole form of the LPC filter.

[0057] Figs. 4A-4D illustrate the progress of the technique (algorithm (29)) employing polynomial deflation. The zeros of the polynomials R(x) and S(x) corresponding to A10(z) are presented in Fig. 4A by crosses (x) and circles (o), respectively. The zero locating procedure is begun from x0 = 1 (Fig. 4B). After the first zero x1 has been found, (x - x1) is divided from the polynomial R(x). The polynomial R(x) after the first deflation is shown in Fig. 4C. Next, the search is continued from S(x) using x1 as an initial value. When x2 has been located, the procedure is again switched back to the deflated R(x) polynomial, and x2 acts as a new initial value. The procedure is continued until all zeros have been found. Fig. 4D illustrates the polynomial R(x) after two deflations.

[0058] It can thus be appreciated that the is invention teaches in one aspect a method for determining line spectrum frequencies of a linear predictive coder (LPC) filter that is expressed as symmetric and antisymmetric polynomials, the zeros of which determine the line spectrum frequencies of the LPC filter. The method includes the steps of (a) expressing the polynomials using explicit forms of Chebyshev polynomials; (b) interatively solving a zero of a first of the polynomials using a zero of the other one of the polynomials; and (c) successively eliminating zeros from the polynomials by polynomial division so as to determine the line spectrum frequencies.

[0059] With regard to the step of iteratively solving, it should be noted that the first polynomial can be either the symmetric or the antisymmetric polynomial, in which case the other polynomial is then the antisymmetric or the symmetric polynomial, respectively. Reference with regard to this step can also be made to elements (b) and (d) of algorithm (29), the equation xk=xk-1 in Fig. 3, and the two separate branches (k odd/even) in Fig. 2. It should be further noted that any of the zeros of the first polynomial can be solved using any of the zeros of the other polynomial.

1.3 Immittance Spectrum Frequency Representation



[0060] Reference can be made to Y. Bistritz and S. Peller, "Immittance spectral pairs (ISP) for speech encoding," in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, Minneapolis, Ml, U.S.A., Vol. 2, pp. 9-12, 27-30 April 1993, for a more detailed understanding of this section.

[0061] The immittance spectrum frequency representation of the LPC filter is based on a similar polynomial decomposition of a predictor polynomial as the line spectrum frequency representation. The immittance spectrum frequency representation (see Y. Bistritz and Peller) is obtained when the polynomial An(z) is decomposed as

where





[0062] The symmetric polynomial Pn(z) and the antisymmetric polynomial Qn(z) also have similar properties as polynomials (3) and (4). The roots of Pn(z) and Qn(z) are on the unit circle, and they are simple and separate from each other. Hence, Pn(z) and Qn(z) can be factored as follows:



where K = -(kn + 1)/(kn - 1), and ω1, ω, ..., ωn-1 are the phase angles of the zeros of the polynomials



such that



[0063] The phase angles ω1, ω2, ..., ωn-1 and the parameter k give an unique parametrization for the LPC filter. The properties of the immittance spectrum frequencies and their relation to line spectrum frequencies have been discussed in more detail by Bistritz and Peller.

[0064] It should be noted that the polynomials (35) and (36) are not identical to the polynomials defined in (3) and (4).

[0065] The degrees of the symmetric polynomials P(z) and Q(z) defined in (35) and (36) are 2mp and 2mQ, respectively, where

and



[0066] For example, if the degree of the predictor polynomial is ten (n = 10), then mp = 5 and mQ = 4. The polynomial A10(z) has only n - 1 = mp + mQ = 9 immittance spectrum frequencies. This is the most apparent difference to the line spectrum frequency representation. Note also, that immittance spectrum frequencies cannot generally be solved from line spectrum frequencies.

[0067] The relations between the coefficients of An(z) and the coefficients of the polynomials P(z) and Q(z) are obtained straightforwardly from the equations



and equations (31)-(34). Equations (38) and (39) can be written to the form (25) and (26) by proceeding as described above. Hence, the immittance spectrum frequencies ω1, ω2, ..., ωn-1 can be solved in accordance with this invention by the techniques summarized in the algorithm (29).

[0068] Although described above in the context of an audio encoder for use in a radiotelephone, it should be realized that the teachings of this invention are not limited for use in only this one important application. For example, and referring again to Fig. 1, the audio encoder can be used in a PC or workstation connected to a network. The communication channel 30 may then be a wired network (e.g., Internet).

[0069] Thus, the invention has been particularly shown and described with respect to preferred embodiments thereof, it will be understood by those skilled in the art that changes in form and details may be made therein without departing from the scope and spirit of the invention.


Claims

1. A method for determining line spectrum frequencies of a linear predictive coder (LPC) filter that is expressed as symmetric and antisymmetric polynomials, the zeros of which determine the line spectrum frequencies of the LPC filter, comprising the steps of:

expressing the polynomials using explicit forms of Chebyshev polynomials;

interatively solving a zero of a first of the polynomials using a zero of the other one of the polynomials; and

successively eliminating zeros from the polynomials by polynomial division so as to determine the line spectrum frequencies.


 
2. A method as set forth in claim 1, and further comprising a step of transmitting a LPC coded signal to a communication channel.
 
3. A method as set forth in claim 2, wherein the step of transmitting transmits the LPC coded signal to a radio communication channel.
 
4. A method as set forth in any preceding claim, wherein the zero of the other one of the polynomials is used as an initial value in iteratively solving the zero of the first one of the polynomials.
 
5. A mobile station capable of wireless communications over a communication channel, said mobile station comprising a speech transducer for outputting a speech signal, and further comprising:

a linear predictive coder (LPC) having an input coupled to the speech signal and an output coupled to the communication channel; said LPC comprising,

a LPC filter having a first input coupled to the speech signal and an output;

a LPC analysis block having an input coupled to the speech signal and an output for generating LPC coefficients for said LPC filter; and

a transform block having an input coupled to said output of said LPC analysis block for transforming said LPC coefficients into a line spectrum frequency (LSF) representation thereof, said transform block having an output coupled to a second input of said LPC filter; wherein

said LPC filter comprises symmetric and antisymmetric auxiliary polynomials, the zeros of which determine the line spectrum frequencies of the LPC filter, and wherein

said transform block includes first means for expressing the auxiliary polynomials using explicit forms of Chebyshev polynomials;

second means for iteratively solving a zero of a first of the polynomials using a zero of the other one of the polynomials; and

said transform block further comprises third means for successively eliminating zeros from the polynomials by polynomial division so as to determine the line spectrum frequencies.


 
6. A mobile station as set forth in claim 5, and further comprising a quantizer block and an interpolator block coupled in series between said output of said transform block and said second input of said LPC analysis filter.
 
7. A mobile station as set forth in claim 5 or 6, wherein said third means is responsive to a degree of a given one of the polynomials being two, for solving the last zero analytically.
 
8. A method for determining the immittance spectrum frequencies of a Linear Predictive Coder (LPC) filter expressed as symmetric and antisymmetric polynomials, the zeros of which determine the immittance spectrum frequencies, comprising the steps of:

expressing the polynomials using explicit forms of Chebyshev polynomials;

iteratively solving a zero of a first one of the polynomials using a zero of the other one of the polynomials; and

successively eliminating zeros from the polynomials by polynomial division so as to determine the immittance spectrum frequencies.


 
9. A method as set forth in claim 8, and further comprising a step of transmitting a LPC coded signal to a communication channel.
 
10. A method as set forth in claim 9, wherein the step of transmitting transmits the LPC coded signal to a radio communication channel.
 
11. A method as set forth in any of claims 8 to 10, wherein the zero of the other one of the polynomials is used as an initial value in iteratively solving the zero of the first one of the polynomials.
 




Drawing