(19)
(11)EP 1 489 174 B1

(12)EUROPEAN PATENT SPECIFICATION

(45)Mention of the grant of the patent:
12.08.2015 Bulletin 2015/33

(21)Application number: 03743597.1

(22)Date of filing:  04.03.2003
(51)International Patent Classification (IPC): 
C12N 15/09(2006.01)
C12N 1/21(2006.01)
C12P 21/02(2006.01)
C12N 9/00(2006.01)
(86)International application number:
PCT/JP2003/002495
(87)International publication number:
WO 2003/074697 (12.09.2003 Gazette  2003/37)

(54)

METHOD OF MODIFYING PROTEIN PROPERTIES

VERFAHREN ZUR MODIFIKATION VON PROTEINEIGENSCHAFTEN

PROCEDE DE MODIFICATION DES PROPRIETES D'UNE PROTEINE


(84)Designated Contracting States:
DE DK FR GB NL

(30)Priority: 04.03.2002 JP 2002057863

(43)Date of publication of application:
22.12.2004 Bulletin 2004/52

(73)Proprietors:
  • National Institute of Technology and Evaluation
    Tokyo 151-0066 (JP)
  • Ajinomoto Co., Inc.
    Tokyo 104-8315 (JP)

(72)Inventors:
  • Nishio, Yosuke, c/o Ajinomoto Co., Inc.
    Kawasaki-shi, Kanagawa 210-8681 (JP)
  • Kimura, Eiichiro, c/o Ajinomoto Co., Inc.
    Chuo-ku, Tokyo 104-8315 (JP)
  • Usuda, Yoshihiro, c/o Ajinomoto Co., Inc.
    Kawasaki-shi, Kanagawa 210-8681 (JP)
  • Ikeo, Kazuho, c/o National Institute of Genetics
    Mishima-shi, Shizuoka 411-8540 (JP)
  • Nakamura, Yoji, c/o National Institute of Genetics
    Mishima-shi, Shizuoka 411-8540 (JP)
  • Gojobori, Takashi
    Mishima-shi, Shizuoka 411-8540 (JP)
  • KAWARABAYASHI, Yutaka
    Kisarazu-shi, Chiba 292-0044 (JP)
  • HINO, Yumi
    Yokohama-shi, Kanagawa 224-0025 (JP)
  • Hori, Eiichi
    Shibuya-ku, Tokyo 151-0066 (JP)
  • Yamazaki, Jun
    Shibuya-ku, Tokyo 151-0066 (JP)

(74)Representative: Hoffmann Eitle 
Patent- und Rechtsanwälte PartmbB Arabellastraße 30
81925 München (DE)


(56)References cited:
EP-A2- 1 182 253
WO-A-01/47956
  
  • GIANESE G. ET AL.: 'Structural adaptation of enzymes to low temperatures' PROTEIN ENG. vol. 14, no. 3, 2001, pages 141 - 148, XP002969557
  
Note: Within nine months from the publication of the mention of the grant of the European patent, any person may give notice to the European Patent Office of opposition to the European patent granted. Notice of opposition shall be filed in a written reasoned statement. It shall not be deemed to have been filed until the opposition fee has been paid. (Art. 99(1) European Patent Convention).


Description

Technical Field



[0001] The present invention relates to a method for modifying a property of a protein. The present invention also relates to a method for producing a protein having a modified property and a method for producing a microorganism having a modified property. The present invention is useful in the field of microbial industry and the like.

Background Art



[0002] Proteins and microorganisms that are active in an environment different from those preferred by usual microorganisms may have advantages over other proteins and microorganisms. For example, proteins that are active at high temperature, in particular thermostable enzymes, have advantages over proteins that are inactivated at high temperature, such as not needing to be cooled while they are allowed to act. Such proteins are often produced by thermophilic bacteria, which can grow at high temperature. Accordingly, when a thermostable protein is designed, the amino acid sequences of the corresponding proteins from a group of such thermophilic bacteria are analyzed, and characteristics commonly observed in those sequences are used for reference in many cases. Alternatively, techniques are employed in which the three-dimensional structures of proteins produced by thermophilic bacteria are analyzed, a structure that imparts thermostability is estimated from this information, and the structure of a non-thermostable protein is modified so as to have such a structure. Furthermore, a method has been proposed in which amino acid sequences of evolutionarily corresponding proteins derived from two or more species in an evolutionary tree are compared, and the amino acid sequence of an ancestral protein, presumed to be a protein of a hyperthermophilic bacterium, is estimated in order to improve thermostability (Japanese Patent Application No. 2001-164332).

Disclosure of the Invention



[0003] Objects of the present invention are to provide a method for modifying a property of a protein, for example, thermostability, a method for producing a protein having a modified property and a method for producing a microorganism having a modified property.

[0004] The inventors of the present invention assiduously studied in order to achieve the above objects. As a result, they considered that amino acid substitutions involved in a certain property could be identified by using the genome sequences of closely related microorganisms that differ in that property and comparing the primary structure information of a large number of proteins. They then took note of thermostability as a property of a protein, and concluded that amino acid substitutions contributing to thermostabilization could be predicted by comparing not hyperthermophilic bacteria, which grow at about 75°C or higher, but even moderately thermophilic bacteria growing at about 55 to 74°C, or bacteria showing a still lower optimum growth temperature, with closely related bacteria whose optimum growth temperature is clearly lower. Furthermore, they confirmed that the amino acid substitutions that might be involved in temperature resistance could be precisely predicted by comparing a large number of amino acid sequences of orthologous genes extracted from the genome sequences of two types of closely related bacteria showing different optimum growth temperatures, that is, bacteria of a thermophilic type and a mesophilic type, and thus accomplished the present invention.

[0005] That is, the present invention provides the following.

(1) A method for modifying a property of a protein, comprising

  1. (a) selecting 1000 or more pairs of genes which are orthologs to each other from genes encoded by genomes of a first microorganism and a second microorganism, respectively, wherein the second microorganism is closely related to the first microorganism and shows difference in a certain optimum growth condition when compared with the first microorganism,
  2. (b) detecting amino acid substitutions present between an amino acid sequence encoded by a gene of the first microorganism and an amino acid sequence encoded by a gene of the second microorganism for each pair of the selected genes, and compiling the detected amino acid substitutions to calculate the frequency of amino acid substitution for each amino acid substitution type,
    wherein a correction is made by subtracting, from a number of a certain amino acid substitution, a number of amino acid substitution of the direction reverse to the certain amino acid substitution,
  3. (c) identifying the amino acid substitutions which occur at a high frequency as amino acid substitutions which are involved in said optimum growth condition particular to the second microorganism, and
  4. (d) introducing one or more of the amino acid substitutions identified in (c) into the gene encoding the protein to modify a property of the protein and (e) testing the said property to select a protein of which the property has been modified as intended.

(2) The method according to (1), wherein said optimum growth condition is optimum growth temperature, and the property of the protein is thermostability.

(3) The method according to (1) or (2), wherein genes having an identity of 60% or more and less than 95% on the amino acid sequence level are selected as genes which are orthologs to each other.

(4) The method according to any one of (1) to (3), wherein the first microorganism and the second microorganism are coryneform bacteria.

(5) The method according to (4), wherein the first microorganism is Corynebacterium glutamicum, and the second microorganism is Corynebacterium efficiens.

(6) A method for producing a protein having a modified property comprising

  1. (a) selecting 1000 or more pairs of genes which are orthologs to each other from genes encoded by genomes of a first microorganism and a second microorganism, respectively, wherein the second microorganism is closely related to the first microorganism and shows difference in a certain optimum growth condition when compared with the first microorganism,
  2. (b) detecting amino acid substitutions present between an amino acid sequence encoded by a gene of the first microorganism and an amino acid sequence encoded by a gene of the second microorganism for each pair of the selected genes, and compiling the detected amino acid substitutions to calculate the frequency of amino acid substitution for each amino acid substitution type,
    wherein a correction is made by subtracting, from a number of a certain amino acid substitution, a number of amino acid substitution of the direction reverse to the certain amino acid substitution,
  3. (c) identifying the amino acid substitutions which occur at a high frequency as amino acid substitutions which are involved in said optimum growth condition particular to the second microorganism,
  4. (d) introducing one or more of the amino acid substitutions identified in (c) into the gene encoding the protein to modify a property of the protein, and
  5. (e) introducing the gene obtained in (d) into a host suitable for gene expression to express the protein having a modified property, and
  6. (f) testing the property of the protein obtained in (e), and
  7. (g) selecting a protein having an improved property relating to said optimum growth condition.

(7) The method according to (6) wherein said optimum growth condition is optimum growth temperature, and the property of the protein is thermostability.

(8) A method for producing a microorganism having a modified property comprising

  1. (a) selecting 1000 or more pairs of genes which are orthologs to each other from genes encoded by genomes of a first microorganism and a second microorganism, respectively, wherein the second microorganism is closely related to the first microorganism and shows difference in a certain optimum growth condition when compared with the first microorganism,
  2. (b) detecting amino acid substitutions present between an amino acid sequence encoded by a gene of the first microorganism and an amino acid sequence encoded by a gene of the second microorganism for each pair of the selected genes, and compiling the detected amino acid substitutions to calculate the frequency of amino acid substitution for each amino acid substitution type,
    wherein a correction is made by subtracting, from a number of a certain amino acid substitution, a number of amino acid substitution of the direction reverse to the certain amino acid substitution,
  3. (c) identifying the amino acid substitutions which occur at a high frequency as amino acid substitutions which are involved in said optimum growth condition particular to the second microorganism, and
  4. (d) introducing one or more of the amino acid substitutions into a chromosomal DNA of a microorganism of which property is to be modified to obtain a microorganism having a modified property and (e) testing said property to select a microorganism of which the property has been modified as intended.

(9) The method according to (8), wherein said optimum growth condition is optimum growth temperature, and the property of the microorganism is thermostability.



[0006] Hereafter, the present invention will be explained in detail.

[0007] The method for modifying a property of a protein of the present invention comprises the following steps of:
  1. (a) selecting 1000 or more pairs of genes which are orthologs to each other from genes encoded by genomes of a first microorganism and a second microorganism, respectively, wherein the second microorganism is closely related to the first microorganism and shows difference in a certain optimum growth condition when compared with the first microorganism,
  2. (b) detecting amino acid substitutions present between an amino acid sequence encoded by a gene of the first microorganism and an amino acid sequence encoded by a gene of the second microorganism for each pair of the selected genes, and compiling the detected amino acid substitutions to calculate a frequency of amino acid substitution for each amino acid substitution type,
    wherein a correction is made by subtracting, from a number of a certain amino acid substitution, a number of amino acid substitution of the direction reverse to the certain amino acid substitution,
  3. (c) identifying an amino acid substitution occurring at a higher frequency as an amino acid substitution involved in the optimum growth condition particular to the second microorganism, and
  4. (d) introducing one or more of the amino acid substitutions identified in (c) into the gene encoding the protein to modify a property of the protein and (e) testing the said property to select a protein of which the property has been modified as intended.


[0008] In the present invention, the optimum growth condition means a condition suitable for survival or growth of a microorganism, and examples thereof include optimum growth temperature, optimum growth pH, optimum growth osmotic pressure and so forth. Properties of proteins to be modified by the present invention correspond to these optimum growth conditions, and examples thereof include thermostability, acid or alkali resistance, halophilism and so forth. The degree of difference in the optimum growth condition is not particularly limited so long as those of the first microorganism and the second microorganism are different. However, for example, when the condition is growth temperature, the optimum growth temperatures are preferably different by 5°C or more.

[0009] In the present invention, two types of closely related microorganisms between which a difference in a certain optimum growth condition is observed are used as information sources of amino acid substitutions that can impart a desired property to a protein. Examples of the second microorganism closely related to the first microorganism include microorganisms that have taxonomic properties similar to those of the first microorganism or are at a close evolutionary distance from the first microorganism in view of molecular phylogeny. More specifically, the examples include microorganisms belonging to the same genus or bacterial strains belonging to the same species. Furthermore, specifically, the examples include microorganisms containing 1000 or more orthologs having a homology in such a degree that amino acid substitutions between two types of genes can be extracted, preferably an identity of 60% or more.

[0010] Although microorganisms to which the present invention is applied are not particularly limited, they are desirably industrially useful microorganisms. Examples thereof include Gram-negative bacteria of the genus Escherichia, Serratia or the like, and Gram-positive bacteria of the genus Corynebacterium, Brevibacterium, Bacillus or the like.

[0011] Examples of closely related microorganisms which differ in an optimum growth condition include Corynebacterium glutamicum and Corynebacterium efficiens, whose optimum growth temperatures are different. Corynebacterium glutamicum is a mesophilic bacterium whose optimum growth temperature is 25 to 35°C. Its genome sequence is publicly available from the DNA Data Bank of Japan (DDBJ), GenBank, EMBL and so forth (accession numbers AX120085, AX127144, AX127145, AX127146, AX127147, AX127148, AX127149, AX127150, AX127151, AX127152 and AX127153). Furthermore, Corynebacterium efficiens is a moderately thermophilic bacterium whose optimum growth temperature is 35 to 45°C. It was isolated as Corynebacterium thermoaminogenes (Japanese Patent Laid-open (Kokai) No. 63-240779, Japanese Patent Publication (Kokoku) No. 7-63383), and it has since been proposed to be re-classified as Corynebacterium efficiens (Fudou R. et al., Int. J. Syst. Evol. Microbiol., 52:1127-1131, 2002). The term Corynebacterium efficiens used in the present specification refers to a bacterium previously classified as Corynebacterium thermoaminogenes. Specific examples of bacterial strains classified as Corynebacterium efficiens include the Corynebacterium efficiens AJ12340 strain (also referred to as YS-40 strain), AJ12308 strain (also referred to as YS-52 strain), AJ12309 strain (also referred to as YS-155 strain), AJ12310 strain (also referred to as YS-314 strain) and so forth.

[0012] The AJ12340 strain was originally deposited at the Fermentation Research Institute, Agency of Industrial Science and Technology, Ministry of International Trade and Industry (currently the independent administrative institution, International Patent Organism Depository, National Institute of Advanced Industrial Science and Technology, Tsukuba Central 6, 1-1, Higashi 1-Chome, Tsukuba-shi, Ibaraki-ken, 305-8566, Japan) on March 13, 1987 and received an accession number of FERM P-9277. Then, the deposit was converted to an international deposit under the provisions of the Budapest Treaty on October 27, 1987, and received an accession number of FERM BP-1539. Furthermore, the AJ12308, AJ12309 and AJ12310 strains were originally deposited at the aforementioned depository on March 10, 1987 and received accession numbers of FERM P-9244, FERM P-9245 and FERM P-9246, respectively. Then, the deposits were converted to international deposits under the provisions of the Budapest Treaty on October 27, 1987, and received accession numbers of FERM BP-1540, FERM BP-1541 and FERM BP-1542, respectively.

[0013] Genes which are orthologs to each other are extracted from the aforementioned two types of closely related microorganisms. The term "gene" used in the present specification means a region in a genome sequence, which encodes or is predicted to encode a protein. The term "orthologs" means a pair of genes showing the highest homology with each other in two genome sequences.

[0014] As genome sequences of microorganisms used in the present invention, already published sequences or newly determined sequences may be used. For example, genome sequences of a large number of microorganisms have been published since that of Haemophilus influenzae was published (Fleischman R.D. et al., Science, 269:496-512, 1995) and can be utilized. Genome sequences of microorganisms of which sequences have not been published can be determined by methods represented by the whole genome shotgun approach described in the aforementioned report of Fleischman et al.

[0015] Genes which are orthologs to each other are selected as follows, for example. First, sequences predicted to encode proteins are extracted from the genome sequence of each microorganism. Then, each gene sequence is translated into an amino acid sequence and the homology is calculated.
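The translation step mentioned above can be illustrated as follows. This is only a minimal sketch, assuming the standard genetic code and assuming that the alternative bacterial start codons GTG and TTG are read as methionine at the first position; the function name `translate` and the example sequences are hypothetical and are not part of the original disclosure.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: these initiation codons are all translated as methionine.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(gene):
    """Translate a predicted coding sequence (DNA, 5'->3') into protein.

    Translation stops at the first stop codon. Ambiguous bases such as N
    are not handled and raise a KeyError.
    """
    gene = gene.upper()
    protein = []
    for pos in range(0, len(gene) - 2, 3):
        codon = gene[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Hypothetical coding sequences used only for illustration.
example_protein = translate("ATGAAAACCTCTGCGTAA")
```

The resulting amino acid sequences would then be the input to the homology calculation.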

[0016] As programs for predicting genes estimated to encode proteins from a genome sequence of a microorganism, those utilizing the Hidden Markov model are frequently used. Known as major examples thereof are Glimmer (Delcher, A.L. et al., Nucleic Acids Res., 27:4636-4641, 1999), GeneHacker (Yada, T. and Hirosawa, M., DNA Res., 3:336-361, 1996; Yada, T. et al., Proc. Fifth Int. Conf. Intell. Syst. Mol. Biol., pp. 354-357, 1997), GeneMark.hmm (Lukashin, A. and Borodovsky M., Nucleic Acids Res., 26:1107-1115, 1998; Besemer, J. and Borodovsky M., Nucleic Acids Res., 27:3911-3920, 1999) and so forth.

[0017] Genes extracted from the genome sequences of two species are translated into amino acid sequences, and the calculated homologies are used to detect orthologs as ORFs having the highest homology with each other (Snel, B., Bork, P. and Huynen, M.A., Nat. Genet., 21:108-110, 1999; Tatusov, R.L., Koonin, E.V. and Lipman, D.J., Science, 278:631-637, 1997; Tekaia, F., Lazcano, A. and Dujon, B., Genome Res., 9:550-557, 1999). To examine orthologs, commonly used homology search methods such as FASTA (Lipman, D.J. and Pearson, W.R., Science, 227:1435-1441, 1985) and Smith-Waterman (Smith, T.F. and Waterman, M.S., J. Mol. Biol., 147:195-197, 1981) are available. The most commonly used method is BLASTP (Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W. and Lipman, D.J., Nucleic Acids Res., 25:3389-3402, 1997).
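One way to read "ORFs having the highest homology with each other" is as a reciprocal best hit criterion, which can be sketched as follows. This is an illustration under stated assumptions rather than the procedure actually used: it assumes that similarity scores (for example, from a BLASTP search in each direction) have already been computed, and the function names, gene identifiers and scores are all hypothetical.

```python
def best_hits(scores):
    """Map each query gene to its highest-scoring subject gene.

    scores: dict mapping (query, subject) -> similarity score.
    """
    best = {}
    for (query, subject), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(scores_ab, scores_ba):
    """Return pairs (a, b) such that b is a's best hit and a is b's best hit."""
    best_ab = best_hits(scores_ab)
    best_ba = best_hits(scores_ba)
    return sorted((a, b) for a, b in best_ab.items() if best_ba.get(b) == a)


# Hypothetical scores: genes of the first genome searched against the second,
# and genes of the second genome searched against the first.
scores_ab = {
    ("cg1", "ce1"): 950.0,
    ("cg1", "ce2"): 120.0,
    ("cg2", "ce2"): 400.0,
    ("cg3", "ce2"): 610.0,
}
scores_ba = {
    ("ce1", "cg1"): 948.0,
    ("ce2", "cg3"): 605.0,
    ("ce2", "cg2"): 398.0,
}
ortholog_pairs = reciprocal_best_hits(scores_ab, scores_ba)
```

In this toy data, `cg2` has `ce2` as its best hit, but `ce2` matches `cg3` better, so `cg2` is left without an ortholog.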

[0018] To compare two amino acid sequences, alignment of the amino acid sequences is performed in which the sequences are aligned in consideration of properties of amino acids. As an alignment technique, CLUSTAL W (Thompson, J.D., Higgins, D.G. and Gibson, T.J., Nucleic Acids Res., 22:4673-4680, 1994) is well known. However, to align two sequences by comparing them from the N-terminus, a technique called pairwise alignment is effective, and software programs therefor such as needle (Needleman, S.B. and Wunsch, C.D., J. Mol. Biol., 48:443-453, 1970), matcher (Huang, X. and Miller, W., Adv. Appl. Math., 12:373-381, 1991) and stretcher (Myers, E. and Miller, W., CABIOS, 4:11-17, 1988) are available. The stretcher is a software program for performing alignment on the basis of a principle called global alignment, which outputs overall similarities between two sequences having almost the same lengths as a result, and is suitable for the purpose of the present study.
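The principle of global pairwise alignment can be sketched as follows. This is a plain Needleman-Wunsch implementation with a simple match/mismatch score and a linear gap penalty, chosen for brevity; it is not the stretcher program, which uses a linear-space algorithm and an amino acid scoring matrix. The scoring values and the example sequences are assumptions made for illustration.

```python
def global_align(seq_a, seq_b, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment with a linear gap penalty.

    Returns (aligned_a, aligned_b, score), with '-' marking gaps.
    """
    n, m = len(seq_a), len(seq_b)
    # score[i][j] is the best score for aligning seq_a[:i] with seq_b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if seq_a[i - 1] == seq_b[j - 1] else mismatch
            score[i][j] = max(
                score[i - 1][j - 1] + sub,  # align the two residues
                score[i - 1][j] + gap,      # gap in seq_b
                score[i][j - 1] + gap,      # gap in seq_a
            )
    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if seq_a[i - 1] == seq_b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                out_a.append(seq_a[i - 1])
                out_b.append(seq_b[j - 1])
                i -= 1
                j -= 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(seq_a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(seq_b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b)), score[n][m]


# Hypothetical short peptides used only for illustration.
aligned_a, aligned_b, total = global_align("MKTSA", "MRTA")
```

Because the alignment is global, every residue of both sequences appears in the output, so aligned positions can be compared column by column.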

[0019] In the present invention, it is preferable to select 1000 or more pairs of genes which are orthologs to each other, preferably 1500 or more pairs, more preferably 2000 or more pairs. Furthermore, the p-distance (Nei M. et al., Molecular Evolution and Phylogenetics, pp.17-31, Oxford University Press, New York, 2000) between the orthologous genes is preferably 0.3 or less. If the distance is larger than this value, it becomes more likely that parallel or backward substitutions would be extracted as amino acid substitutions. Furthermore, genes which are orthologs to each other preferably have an identity of 60% or more and less than 95% on the amino acid sequence level. If identity is lower than this range, it becomes difficult to perform the correct alignment, and it becomes more likely that amino acids will not correspond to each other one-to-one. Genes having an identity higher than this range do not necessarily need to be excluded. However, since such genes encode extremely conserved functions, it is highly likely that they do not affect phenotypes.
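The selection criteria above can be sketched as a filter on an aligned pair of sequences. This is a minimal illustration under assumptions that the text does not fix: identity is taken here as identical columns divided by the full alignment length, the p-distance as the proportion of differing residues among ungapped columns, and the function names and example sequences are hypothetical.

```python
def identity_and_p_distance(aligned_a, aligned_b):
    """Return (identity, p_distance) for two gapped, equal-length sequences.

    Assumed definitions: identity counts identical columns over the whole
    alignment length; the p-distance counts differing residues over the
    columns in which neither sequence has a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = len(aligned_a)
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    identity = matches / columns
    p_distance = (compared - matches) / compared
    return identity, p_distance


def keep_pair(aligned_a, aligned_b, min_identity=0.60, max_identity=0.95, max_p=0.3):
    """Apply the preferred ranges: 60% <= identity < 95% and p-distance <= 0.3."""
    identity, p_distance = identity_and_p_distance(aligned_a, aligned_b)
    return min_identity <= identity < max_identity and p_distance <= max_p


# Hypothetical aligned pair: 7 identical columns out of 10, one gap column.
identity, p_distance = identity_and_p_distance("MKTSAAGLVK", "MRTAAAGLV-")
selected = keep_pair("MKTSAAGLVK", "MRTAAAGLV-")
```

Under these definitions an identical pair is rejected by the upper identity bound, which corresponds to the optional exclusion of extremely conserved genes.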

[0020] For each pair of orthologous genes selected as described above, amino acid substitutions between an amino acid sequence encoded by a gene of the first microorganism and an amino acid sequence encoded by a gene of the second microorganism are detected. Amino acid substitutions present between two genes can be detected as a result of the aforementioned gene alignment. On the basis of the results, the detected amino acid substitutions are compiled to calculate frequencies of the amino acid substitutions for each amino acid substitution type. Subsequently, a correction is made by subtracting, from a number of a particular amino acid substitution, a number of amino acid substitutions of the direction reverse to the certain amino acid substitution. That is, from a number of a certain substitution (for example, substitution of lysine in the second microorganism gene for arginine in the first microorganism gene), the number of substitution in the direction reverse to the above substitution (substitution of arginine in the second microorganism gene for lysine in the first microorganism gene) is subtracted. Specifically, this is performed as follows, for example.

[0021] On the assumption that amino acid substitutions occur as a one-to-one amino acid correspondence, the alignment results are compiled as a matrix of 20 rows and 20 columns comprising the number of amino acid substitutions for all the 20 types of amino acids. This is treated as a mathematical matrix (hereinafter referred to as "A"), and its transposed matrix (hereinafter referred to as "A-1") is created to simultaneously evaluate the reverse amino acid substitutions. Then, (matrix A - transposed matrix A-1)/2 is calculated (hereinafter, this calculation result will be referred to as the "substitution evaluation index").
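The compilation and correction described above can be sketched as follows. This is a minimal illustration, assuming that rows index the residue in the first microorganism and columns the residue in the second (the text does not state the orientation); the function names and the toy alignments are hypothetical.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}


def count_matrix(aligned_pairs):
    """Compile the 20 x 20 matrix A from aligned ortholog pairs.

    A[i][j] is the number of columns in which residue i of the first
    microorganism is aligned with a different residue j of the second.
    """
    matrix = [[0] * 20 for _ in range(20)]
    for first, second in aligned_pairs:
        for a, b in zip(first, second):
            # Skip identical residues, gaps and ambiguous symbols.
            if a != b and a in INDEX and b in INDEX:
                matrix[INDEX[a]][INDEX[b]] += 1
    return matrix


def substitution_evaluation_index(matrix):
    """Return (A - transpose of A) / 2, element by element."""
    size = len(matrix)
    return [
        [(matrix[i][j] - matrix[j][i]) / 2 for j in range(size)]
        for i in range(size)
    ]


# Hypothetical aligned ortholog pairs (first microorganism, second microorganism).
pairs = [("MKTSK", "MRTAR"), ("RSK-", "KTRA")]
A = count_matrix(pairs)
index = substitution_evaluation_index(A)
k, r, s, t = INDEX["K"], INDEX["R"], INDEX["S"], INDEX["T"]
```

The result is antisymmetric: a positive entry for one direction is mirrored by a negative entry of equal size for the reverse direction, so only the net tendency of each substitution type remains.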

[0022] Among the amino acid substitutions extracted as described above, amino acid substitutions occurring at a higher substitution frequency (having a higher substitution evaluation index) are identified as the amino acid substitutions involved in the optimum growth condition of the second microorganism. The number of amino acid substitutions to be identified is not particularly limited, and is preferably 2 to 10 types, more preferably 2 to 5 types, particularly preferably 2 to 3 types. It is generally known that isoleucine, valine, leucine and methionine are amino acids among which substitutions readily occur (Kreil, D.P. et al., Nucleic Acids Res., 29:1608-1615, 2001). Therefore, among amino acid substitutions occurring at a higher substitution frequency, it is preferable to select substitutions occurring at a higher frequency than those of substitutions among isoleucine, valine, leucine and methionine.
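The selection rule above can be sketched as follows. This is one possible reading, assuming that the highest index among isoleucine, valine, leucine and methionine interconversions serves as a background threshold; the function name is hypothetical and the index values are invented for illustration and are not results from the present specification.

```python
BACKGROUND_RESIDUES = set("IVLM")


def select_substitutions(index, max_types=3):
    """Pick the substitution types with the highest evaluation index.

    index: dict mapping (from_residue, to_residue) -> substitution
    evaluation index. Only types that exceed every substitution among
    I, V, L and M (and exceed zero) are retained.
    """
    background = max(
        (value for (a, b), value in index.items()
         if a in BACKGROUND_RESIDUES and b in BACKGROUND_RESIDUES),
        default=0.0,
    )
    threshold = max(background, 0.0)
    candidates = [(pair, value) for pair, value in index.items() if value > threshold]
    candidates.sort(key=lambda item: item[1], reverse=True)
    return candidates[:max_types]


# Invented index values, for illustration only.
example_index = {
    ("K", "R"): 100.0,
    ("S", "T"): 60.0,
    ("S", "A"): 55.0,
    ("I", "V"): 40.0,
    ("V", "I"): -40.0,
    ("D", "E"): 30.0,
}
chosen = select_substitutions(example_index, max_types=3)
```

With `max_types` set between 2 and 10, the function mirrors the preferred number of substitution types stated above.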

[0023] As described above, the following amino acid substitutions, for example, are identified as amino acid substitutions involved in the impartation of thermostability of Corynebacterium efficiens to Corynebacterium glutamicum.
  1. (i) Substitution of an arginine residue for a lysine residue
  2. (ii) Substitution of a threonine residue for a serine residue
  3. (iii) Substitution of an alanine residue for a serine residue.


[0024] One or more amino acid substitutions identified as described above are introduced into a protein of which property is to be modified. The objective protein of which property is to be modified is not particularly limited, and examples thereof include proteins that preferably function under an optimum growth condition particular to the aforementioned second microorganism. Such proteins may be proteins of the aforementioned first microorganism or proteins of other microorganisms so long as they are proteins of microorganisms having a different optimum growth condition compared with that of the second microorganism. One type of amino acid substitution may be introduced at one site or two or more sites.

[0025] To introduce an amino acid substitution into an objective protein, a mutation can usually be introduced into a gene encoding the aforementioned protein by a technique utilized in the protein engineering art, such as site-directed mutagenesis so that a desired amino acid substitution occurs. Furthermore, such a protein can also be produced by introducing the gene introduced with a mutation into a host suitable for gene expression utilizing a technique used in protein production based on a gene recombination technique to express a mutant protein having a modified property. As for the produced mutant proteins, the aforementioned property is tested, if necessary, to select a protein of which property has been modified as intended.
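Before a mutation is introduced experimentally, the candidate sites for an identified substitution type can be enumerated on the amino acid sequence. The following is only an in silico sketch of that bookkeeping, not a description of site-directed mutagenesis itself; the function names and the example sequence are hypothetical.

```python
def candidate_sites(protein, from_residue):
    """Return 1-based positions at which from_residue occurs."""
    return [pos + 1 for pos, aa in enumerate(protein) if aa == from_residue]


def apply_substitution(protein, position, from_residue, to_residue):
    """Return the sequence with one residue replaced at a 1-based position."""
    if not 1 <= position <= len(protein):
        raise ValueError("position is outside the sequence")
    if protein[position - 1] != from_residue:
        raise ValueError("residue at this position is not " + from_residue)
    return protein[:position - 1] + to_residue + protein[position:]


# Hypothetical target protein; lysine (K) to arginine (R) is used as the
# substitution type for illustration.
target = "MKTSAKL"
sites = candidate_sites(target, "K")
mutant = apply_substitution(target, sites[0], "K", "R")
```

Each candidate would still have to be expressed and tested, since a substitution that is favorable on average across a genome need not improve a given protein.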

[0026] Furthermore, a microorganism having a modified property such as thermostability can be obtained by introducing the identified amino acid substitution into a gene on a chromosome of a target microorganism. The amino acid substitution can be introduced into the gene on a chromosome by, for example, preparing a gene introduced with a target mutation or a fragment thereof beforehand and substituting the mutant gene for the gene on the chromosome on the basis of a gene substitution technique utilizing homologous recombination.

[0027] As methods for isolation of gene, digestion and ligation of DNA, transformation and so forth required for the aforementioned procedures, usual methods known to those skilled in the art can be used. Such methods are described in Sambrook, J., Fritsch, E. F. and Maniatis, T. "Molecular Cloning A Laboratory Manual, Second Edition", Cold Spring Harbor Laboratory Press (1989) and so forth.

[0028] To obtain a microorganism having a modified property, an amino acid substitution that can impart a desired property can be introduced into a gene encoding a protein so that a property of one or more proteins of the microorganism is modified.

[0029] As described above, a protein and microorganism having a modified property can be obtained. Specifically, for example, a protein such as an enzyme that functions at a higher temperature as compared with a wild-type protein or a microorganism having a raised optimum growth temperature can be obtained. Increase of culture temperature is considered to be an important technical factor for improving the economy of the industrial production of amino acids by fermentation, in addition to improvement of yield per saccharide, reduction of culture time, improvement of accumulated amino acid concentrations and so forth. That is, the culture is usually performed at an optimum fermentation temperature, and the optimum temperature of Corynebacterium glutamicum is 31.5°C. Since heat is generated by fermentation once the culture is started, the temperature of the culture increases, and amino acid production markedly decreases. Therefore, a cooling unit is necessary to maintain the culture broth at the optimum temperature. On the other hand, if the culture temperature can be elevated, the energy necessary for cooling can be reduced, and furthermore the cooling power of the unit can be reduced. Therefore, if a bacterial strain having improved thermostability can be produced by imparting thermostability comparable with that of Corynebacterium efficiens to a protein exhibiting low thermostability in a bacterial strain having improved amino acid productivity such as Corynebacterium glutamicum, and further allowing the bacterial strain to have such a thermostabilized protein, the industrial usefulness of the strain will be clearly increased. For example, if a thermostable enzyme or bacterial strain is used, the load of temperature control during the reaction is relieved, the reaction can be performed at a high temperature, and therefore the reaction rate becomes higher. Furthermore, since the reaction can be performed at a high temperature, contamination by other microorganisms can be minimized.

[0030] Furthermore, the method of the present invention can be used to produce a protein or microorganism suitable for a certain type of growth condition other than culture temperature.

Best Mode for Carrying out the Invention



[0031] Hereafter, the present invention will be explained more specifically by way of examples.

Example 1: Determination of genome sequence of Corynebacterium efficiens


(1) Preparation of the genomic DNA and preparation of shotgun clones



[0032] A genomic DNA was extracted from the Corynebacterium efficiens AJ12340 strain by using Bacterial Genome DNA Purification Kit (Advanced Genetic Technologies). The following procedure is described in detail in the reference of Kawarabayashi et al. (Kawarabayashi, Y., et al., DNA Res., 8:123-140, 2001). The genomic DNA of Corynebacterium efficiens was ultrasonicated in three stages of 5, 10 and 20 seconds by using an ultrasonicator, Biorupter (Cosmo Bio), at an output of L. The resultant solution was subjected to electrophoresis using an agarose gel, and DNA fragments having sizes of 0.8 to 1.2 kb and 2.0 to 2.5 kb were excised and cloned into the HincII site of pUC118. As described above, a shotgun library consisting of short (0.8 to 1.2 kb) and long fragments (2.0 to 2.5 kb) was prepared.

(2) Sequencing of shotgun library



[0033] Plasmid DNAs in the aforementioned shotgun library comprising short (0.8 to 1.2 kb) and long fragments (2.0 to 2.5 kb) were prepared by using automatic DNA isolators PI-100 and PI-200 (Kurabo Industries Ltd.). The sequencing reaction was performed by using these plasmid DNAs as templates and ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit (PERKIN ELMER) or ABI PRISM BigDye Primer Cycle Sequencing Ready Reaction Kit (PERKIN ELMER). For the short fragment library, the sequence at one end of each clone was determined by using M13 forward primer (-21 M13) as a primer. For the long fragment library, sequences at both ends were determined by using M13 forward primer (-21 M13) and M13 reverse primer (M13 Reverse). PCR System 9600 (PERKIN ELMER) or DNA Engine PTC-200 (MJ RESEARCH) was used for the reaction. The nucleotide sequences of the sequencing reaction products were analyzed by using ABI PRISM 377 DNA Sequencer.

(3) Assembling and sequencing of gap region



[0034] Raw data of about 70,000 sequences were assembled by using an assembling software program, Phred-Phrap (CodonCode), and the raw data contained in the obtained contigs were assembled again using Sequencher (Gene Codes) to confirm and correct the nucleotide sequences. The nucleotide sequences of the gap portions between the contigs were determined by walking the shotgun clones including a long fragment using synthetic primers or the like. Thus, the entire 3.14 million base pair genome sequence of Corynebacterium efficiens was determined. This genome sequence was given accession numbers of BA000035 or AP005214 to AP005224 by the DNA Data Bank of Japan (DDBJ), and was registered and made available to the public.

Example 2: Extraction of amino acid substitutions by comparison of genome sequences of Corynebacterium glutamicum and Corynebacterium efficiens


(1) Sequence information of Corynebacterium glutamicum



[0035] As the genome sequence of the mesophilic bacterium, Corynebacterium glutamicum, the nucleotide sequences of the Corynebacterium glutamicum ATCC 13032 strain registered with accession numbers of AX120085, AX127144, AX127145, AX127146, AX127147, AX127148, AX127149, AX127150, AX127151, AX127152 and AX127153 at the DNA Data Bank of Japan were used.

(2) Prediction of genes encoding proteins



[0036] Genes encoding proteins were predicted by using a gene identification software program, Glimmer, based on the principle of Hidden Markov models (Delcher, A.L. et al., Nucleic Acids Res., 27:4636-4641, 1999). Models used for the gene prediction were created on the basis of the genome sequences of Corynebacterium glutamicum and Corynebacterium efficiens according to the manual of Glimmer. Furthermore, the Shine-Dalgarno sequence (hereinafter referred to as "SD sequence") was used to enhance the precision of the gene prediction by Glimmer. The SD sequence is the sequence through which mRNA binds to 16S rRNA on a ribosome, which is the translation apparatus, at the time of translation of a gene. The SD sequence used was a sequence (5'-AGAAAGAGG-3') complementary to the sequence at the 3' end of 16S rRNA of Corynebacterium glutamicum (Amador, J.M.E. et al., Microbiology, 145:915-924, 1999).

(3) Extraction of orthologs of Corynebacterium glutamicum and Corynebacterium efficiens



[0037] Pairs of genes that had the highest homology with each other among the extracted genes encoding proteins were assumed to be orthologs (Snel, B. et al., Science, 278:631-637,1997; Tejaua, F. et al., Genome Res., 9:550-557, 1999). Specifically, the nucleotide sequences of the genes encoding proteins were translated into amino acid sequences, and pairs of genes that showed the highest score for each other as a result of execution of BLASTP (Altschul, S.F. et al., Nucleic Acids Res., 25:3389-3402, 1997), which is the program most commonly used for homology searches, were selected as orthologs. BLASTP was executed under the default conditions using BLOSUM62, which is commonly used as a matrix. A total of 2178 genes were extracted and identified as orthologs between the genomes of Corynebacterium glutamicum and Corynebacterium efficiens.
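
By way of illustration only, the selection of gene pairs that show the highest score for each other may be sketched as follows. This is a minimal sketch and not the procedure actually used: the function names, the gene identifiers and the score values are hypothetical, and the BLASTP scores are assumed to have been parsed into dictionaries beforehand.

```python
def best_hit(scores):
    """Map each query to its highest-scoring subject.

    scores: {(query, subject): score}, e.g. parsed from BLASTP output.
    """
    best = {}
    for (query, subject), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b, b_vs_a):
    """Return pairs (a, b) where b is the top hit of a and a is the top hit of b."""
    top_ab = best_hit(a_vs_b)
    top_ba = best_hit(b_vs_a)
    return sorted((a, b) for a, b in top_ab.items() if top_ba.get(b) == a)


# Hypothetical scores; gene names and values are made up for illustration.
cg_vs_ce = {
    ("cg1", "ce1"): 250.0,
    ("cg1", "ce2"): 90.0,
    ("cg2", "ce2"): 180.0,
    ("cg3", "ce2"): 60.0,
}
ce_vs_cg = {
    ("ce1", "cg1"): 248.0,
    ("ce2", "cg2"): 181.0,
    ("ce2", "cg3"): 61.0,
}
orthologs = reciprocal_best_hits(cg_vs_ce, ce_vs_cg)
```

In this toy input, cg3 is not paired because its best hit, ce2, scores higher against cg2.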

(4) Extraction of amino acid substitutions between orthologs



[0038] To compare the orthologs, pairwise alignment was performed, in which the gene sequences of Corynebacterium glutamicum and Corynebacterium efficiens were aligned from the N-terminus in consideration of properties of amino acids. A software program called stretcher was used for the pairwise alignment. The stretcher establishes alignment on the basis of the principle called global alignment, which outputs overall similarities between two sequences having almost the same lengths as a result (Myers, E. and Miller, W., CABIOS, 4:11-17, 1988). As an index for considering properties of amino acids at the time of the alignment, BLOSUM62 was used, which is commonly used as a matrix expressing the relationships of the same or different types of amino acids as numerical values. According to the alignment results, orthologs were classified into three ranks of those having identities of 95% or more, 60% or more and less than 95%, and less than 60%.
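
The classification into three ranks may be illustrated by the following minimal sketch. It assumes that each pairwise alignment is available as two gapped strings of equal length and that identity is taken as the number of identical columns divided by the alignment length; the text does not state the exact denominator, so this choice, like the function names, is an assumption.

```python
def percent_identity(aln_a, aln_b):
    """Percent identity over all alignment columns; '-' marks a gap."""
    if len(aln_a) != len(aln_b):
        raise ValueError("aligned sequences must have equal length")
    matches = sum(1 for x, y in zip(aln_a, aln_b) if x == y and x != "-")
    return 100.0 * matches / len(aln_a)


def identity_rank(identity):
    """Assign one of the three identity ranks described in the text."""
    if identity >= 95.0:
        return "95% or more"
    if identity >= 60.0:
        return "60% or more and less than 95%"
    return "less than 60%"


# Toy alignment (hypothetical sequences): 6 identical columns out of 9.
example_identity = percent_identity("MKSAIV-LK", "MRSAIVTLR")
example_rank = identity_rank(example_identity)
```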

[0039] Because orthologs having an identity of 95% or more constitute a group of extremely conserved proteins primarily consisting of ribosomal proteins, differences in their thermostability are not expected. Furthermore, because orthologs having an identity of less than 60% include an increased number of amino acid substitutions, it becomes more unlikely that the objective amino acid substitutions can be extracted. Therefore, these orthologs were excluded.

[0040] On the other hand, when a value called p-distance (Nei M. et al., Molecular Evolution and Phylogenetics, pp.17-31, Oxford University Press, New York, 2000) was calculated for orthologs having an identity of 60% or more and less than 95%, a result of 0.20 was obtained. It is thought that, when this p-distance value is less than 0.3, it is unnecessary to consider the possibility that different mutations may occur at the same site, or a mutation may occur twice or more at the same site after differentiation of species. Because orthologs having an identity of less than 60% have a p-distance of 0.40, exclusion of orthologs having an identity less than 60% is considered appropriate. For the above reasons, 1430 orthologs having an identity of 60% or more and less than 95% were used for the analysis of amino acid substitutions.
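
For reference, the p-distance of a single aligned pair may be computed as the proportion of compared sites at which the two sequences differ, as in the following minimal sketch. Excluding gapped columns from the comparison is an assumption, as is the function name; the text does not describe how the value for a whole group of orthologs was obtained, so only the per-pair calculation is shown.

```python
def p_distance(aln_a, aln_b):
    """Proportion of ungapped alignment columns at which the residues differ."""
    if len(aln_a) != len(aln_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    differing = 0
    for x, y in zip(aln_a, aln_b):
        if x == "-" or y == "-":
            continue  # gapped columns are not compared (assumption)
        compared += 1
        if x != y:
            differing += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return differing / compared


# Toy alignment (hypothetical sequences): 2 differing sites out of 10.
example_p = p_distance("MKSAIVLKDE", "MRSAIVLRDE")
below_threshold = example_p < 0.3
```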

[0041] The results of the alignment were compiled as a matrix of the numbers of the amino acid substitutions for all the amino acids on the assumption that amino acid substitutions are in one-to-one amino acid correspondence (Table 1). In Table 1, the amino acids of Corynebacterium glutamicum are shown in the vertical direction, and the amino acids of Corynebacterium efficiens are shown in the horizontal direction. For example, when an alanine residue in a gene sequence of Corynebacterium glutamicum is replaced by a proline residue in the corresponding Corynebacterium efficiens gene, this substitution is entered in the cell at the intersection of the first row and the thirteenth column.
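
The compilation of such a matrix may be sketched as follows. The ordering of the amino acids by one-letter code is an assumption; it is chosen because it places alanine in the first row and proline in the thirteenth column, in agreement with the example above. The function name and the toy sequences are hypothetical, and columns containing gaps or non-standard residues are simply skipped.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # assumed ordering (alphabetical by one-letter code)
INDEX = {aa: i for i, aa in enumerate(AMINO_ACIDS)}


def count_substitutions(alignments):
    """Build a 20 x 20 matrix of substitution counts.

    alignments: iterable of (glutamicum_aln, efficiens_aln) gapped strings.
    Rows are C. glutamicum residues; columns are C. efficiens residues.
    """
    matrix = [[0] * len(AMINO_ACIDS) for _ in AMINO_ACIDS]
    for cg_aln, ce_aln in alignments:
        for x, y in zip(cg_aln, ce_aln):
            if x != y and x in INDEX and y in INDEX:
                matrix[INDEX[x]][INDEX[y]] += 1
    return matrix


# Toy pair of aligned sequences: K->R at the second site, A->P at the fourth.
example_matrix = count_substitutions([("MKSA", "MRSP")])
```

With this ordering, the alanine-to-proline substitution lands in `example_matrix[0][12]`, that is, the first row and the thirteenth column.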

[0042] The aforementioned matrix was treated as a mathematical matrix (hereinafter referred to as "A"), and (A - A^T)/2, where A^T is the transposed matrix of A, was calculated so that amino acid substitutions in the reverse direction were evaluated at the same time (hereinafter, this calculation result is referred to as the "substitution evaluation index"). By this procedure, a substitution and its reverse (for example, mutation from lysine to arginine and mutation from arginine to lysine) can be converted into a single numerical value, and a value for an amino acid showing no substitution can be represented as zero. The calculation results obtained as described above were arranged in descending order of the substitution evaluation index.
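
A minimal sketch of this calculation is given below. It reads the calculation as half the difference between the count matrix and its transpose, an interpretation that is consistent with the half-integer values reported in Table 2. The function names, the four-residue labels and the counts are hypothetical.

```python
def substitution_evaluation_index(matrix):
    """Return (A - transpose(A)) / 2 for a square count matrix A."""
    n = len(matrix)
    return [[(matrix[i][j] - matrix[j][i]) / 2 for j in range(n)] for i in range(n)]


def rank_substitutions(matrix, labels):
    """List substitutions with a positive index in descending order of the index."""
    index = substitution_evaluation_index(matrix)
    ranked = [
        (labels[i], labels[j], index[i][j])
        for i in range(len(labels))
        for j in range(len(labels))
        if index[i][j] > 0
    ]
    return sorted(ranked, key=lambda item: item[2], reverse=True)


# Toy counts for four residues only (values are made up).
labels = ["K", "R", "S", "A"]
counts = [
    [0, 30, 0, 0],  # K -> R observed 30 times
    [9, 0, 0, 0],   # R -> K observed 9 times
    [0, 0, 0, 20],  # S -> A observed 20 times
    [0, 0, 8, 0],   # A -> S observed 8 times
]
ranking = rank_substitutions(counts, labels)
```

Because the resulting matrix is antisymmetric, each reverse substitution carries the same magnitude with the opposite sign, so only the positive entries need to be listed.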


(5) Narrowing of the amino acid substitutions involved in thermostability



[0043] Deviations of amino acid substitutions of orthologs belonging to the rank of identity of 60% or more and less than 95% were aligned in the descending order of the number of substitutions. The result is shown in Table 2. When they are described in the order of Corynebacterium glutamicum - Corynebacterium efficiens, substitutions of lysine-arginine, serine-alanine, serine-threonine, and isoleucine-valine were substitutions of the four highest substitution evaluation indices. Among these, it is known that isoleucine, valine, leucine and methionine are amino acids that are easily mutated from one to another (Kreil, D.P. et al., Nucleic Acids Res. 29: 1608-1615, 2001). From the above, the mutation patterns of the highest three substitution evaluation indices, which showed greater directivity than the mutation from isoleucine to valine, were predicted to be the amino acid substitutions involved in the higher thermostability of Corynebacterium efficiens in comparison with Corynebacterium glutamicum.
Table 2 Amino acid substitutions of the highest 10 substitution evaluation indices in orthologs having identity of 60 to 95%
C. glutamicum    C. efficiens    Substitution evaluation index
Lys              Arg             1095.5
Ser              Ala             503
Ser              Thr             450
Ile              Val             373.5
Gln              Arg             303
Asn              Asp             287
Ile              Leu             274.5
Ser              Gly             201.5
Lys              Thr             182.5
Ala              Pro             181.5

Example 3: Verification of amino acid substitutions involved in thermostability by comparison of thermostability of enzymes in Corynebacterium glutamicum and Corynebacterium efficiens


<1> Bacterial strains used



[0044] 

Mesophilic bacterium: Corynebacterium glutamicum ATCC 13869 strain

Thermophilic bacterium: Corynebacterium efficiens AJ12340 strain


<2> Media


[Glutamic acid production medium]



[0045] 80 g/l of glucose, 1 g/l of KH2PO4, 0.4 g/l of magnesium sulfate, 480 mg/l of soybean hydrolysate, 200 µg/l of vitamin B1-HCl, 300 µg/l of biotin, pH 8.0. pH was adjusted with potassium hydroxide.

[Medium for measurement of isocitrate lyase activity]



[0046] 4% of glucose or acetic acid, 5 g/l of ammonium sulfate, 5 g/l of urea, 0.5 g/l of KH2PO4, 0.5 g/l of K2HPO4, 20.9 g/l of 3-[N-morpholino]propanesulfonic acid (MOPS), 0.25 g/l of magnesium sulfate heptahydrate, 10 mM of calcium chloride heptahydrate, 0.2 mg/l of copper sulfate heptahydrate, 0.2 mg/l of biotin, 10 mg/l of manganese sulfate heptahydrate, 10 mg/l of iron sulfate heptahydrate, 1 mg/l of zinc sulfate heptahydrate, pH 6.5. pH was adjusted with potassium hydroxide.

[CM2G medium]



[0047] 20 g/l of polypeptone, 20 g/l of yeast extract, 5 g/l of sodium chloride, 20 g/l of glucose, pH 7.0. pH was adjusted with sodium hydroxide.

[0048] All the media were sterilized at 120°C for 20 minutes.

<3> Measurement of enzymatic activity


(1) Measurement of thermostability of aspartate kinase



[0049] Cells of each strain were cultured in the glutamic acid production medium, collected and then washed with 0.02 M KH2PO4 (pH 6.75)/0.03 M β-mercaptoethanol. Then, the cells were disrupted by ultrasonication and centrifuged at 33,000 rpm for 1 hour. Ammonium sulfate was added to the obtained supernatant to 80% saturation, and precipitates were obtained by centrifugation. The obtained precipitates were dissolved in 20 mM KH2PO4 (pH 6.75)/30 mM β-mercaptoethanol and used as a crude enzyme solution.

[0050] The crude enzyme solution was pretreated at 30 to 80°C for 1, 3, 5 or 10 minutes and reacted with a reaction mixture at 30°C. The reaction mixture was prepared by adding the crude enzyme solution to an aqueous solution containing 100 mM Tris-HCl (pH 7.5), 10 mM ATP (pH 7.5), 600 mM hydroxylamine, 600 mM ammonium sulfate, 10 mM magnesium sulfate and 50 mM L-aspartic acid (pH 7.5), and adjusting the total volume to 500 µl with sterilized water. The reaction was allowed to proceed for a predetermined period of time and was terminated by addition of 750 µl of a reaction terminating solution. The reaction terminating solution had a composition of 4% trichloroacetic acid, 10% FeCl2 and 1.4 N hydrochloric acid. The reaction mixture to which the reaction terminating solution had been added was centrifuged at 15,000 rpm for 5 minutes, and the absorbance of the supernatant was measured at a wavelength of 540 nm. This measurement was performed to quantify the color development of the hydroxamate produced by the reaction, in which L-aspartic acid phosphoric acid salt, ADP and hydroxamate were produced from L-aspartic acid, ATP and hydroxylamine. The reaction mixture not containing L-aspartic acid was used as a blank.

(2) Measurement of thermostability of dihydrodipicolinate synthase



[0051] Cells of each strain were cultured in the glutamic acid production medium, and cells in the logarithmic growth phase were collected and washed with 0.85% NaCl. The cell suspension was ultrasonicated and centrifuged at 60,000 rpm for 30 minutes, and the resultant supernatant was used as a crude enzyme solution.

[0052] The crude enzyme solution was pretreated beforehand at 60°C for 1, 3, 5, 10 or 15 minutes and reacted with a reaction mixture at 37°C. The reaction mixture was prepared by adding the crude enzyme solution to 50 mM imidazole hydrochloride (pH 7.4), 2 mM aspartate β-semialdehyde (ASA) and 2 mM sodium pyruvate and adjusted to the total volume of 700 µl with sterilized water.

[0053] The activity was measured based on the increase in absorbance of the reaction mixture at 270 nm. For this measurement, the absorbance of dihydroxypicolinate non-enzymatically produced from a product of the reaction involving aspartate β-semialdehyde and pyruvic acid as substrates and catalyzed by dihydrodipicolinate synthase was measured. The reaction mixture not containing sodium pyruvate was used as a blank.

(3) Measurement of thermostability of diaminopimelate dehydrogenase



[0054] Cells of each strain were cultured in the glutamic acid production medium, and cells in the logarithmic growth phase were collected, washed twice with 0.2% potassium chloride, and suspended in 40 mM potassium phosphate buffer (pH 7.5). The suspension was ultrasonicated and centrifuged at 15,000 rpm for 30 minutes. The resultant supernatant was used as a crude enzyme solution.

[0055] The crude enzyme solution was pretreated beforehand at 60°C for 1, 3, 5, 10 or 15 minutes and reacted with a reaction mixture at 37°C. The reaction mixture was prepared by adding the crude enzyme solution to 200 mM glycine/potassium chloride buffer (pH 7.5, pH was adjusted with sodium hydroxide), 4 mM mesodiaminopimelic acid and 1 mM NADP, and adjusted to the total volume of 700 µl with sterilized water.

[0056] The activity was measured based on the increase in absorbance of the reaction mixture at 340 nm. For this measurement, the absorbance of NADPH was measured, because the reaction, which uses mesodiaminopimelic acid as a substrate and is catalyzed by diaminopimelate dehydrogenase, requires NADP+ as a coenzyme and produces NADPH. The reaction mixture not containing mesodiaminopimelic acid was used as a blank.

(4) Measurement of thermostability of diaminopimelate decarboxylase



[0057] Cells of each strain were cultured in the glutamic acid production medium, and the cells in the logarithmic growth phase were collected, washed twice with 50 mM potassium phosphate buffer (pH 7.0), suspended in 50 mM potassium phosphate buffer (pH 7.0) containing 6 mM mercaptoethanol, and disrupted by ultrasonication. The suspension was centrifuged at 15,000 rpm for 30 minutes, and the resultant supernatant was used as a crude enzyme solution.

[0058] The crude enzyme solution was pretreated beforehand at 30 to 80°C for 1, 3, 5 or 10 minutes and reacted with a reaction mixture at 37°C for 30 minutes. The reaction mixture was prepared by adding the crude enzyme solution to 20 mM diaminopimelic acid and 67 µM pyridoxal phosphate and adjusting the total volume to 300 µl with sterilized water. Sulfuric acid was used to terminate the reaction, and potassium hydroxide was used for neutralization. The enzymatic activity was determined by measuring the amount of lysine produced by the reaction using Biotech Analyzer. The reaction mixture not containing mesodiaminopimelic acid was used as a blank.

(5) Measurement of thermostability of isocitrate dehydrogenase



[0059] Cells of each strain were cultured in the glutamic acid production medium, washed three times with 50 mM Tris-HCl (pH 7.5), and disrupted by ultrasonication. The suspension was centrifuged at 15,000 rpm for 10 minutes, and the resultant supernatant was used as a crude enzyme solution. In a volume of 20 µl, the crude enzyme solution was pretreated at 45°C for 1, 3, 5, 10 or 15 minutes, and then was reacted with 780 µl of a reaction mixture at 30°C. The reaction mixture contained 35 mM Tris-HCl/0.35 mM EDTA (pH 7.5), 1.5 mM manganese sulfate, 0.1 mM NADP and 1.3 mM sodium isocitrate. The activity was calculated by measuring the absorbance of NADPH produced by the reaction catalyzed by isocitrate dehydrogenase, because the reaction utilizes NADP+ as a coenzyme.

(6) Measurement of thermostability of aconitase



[0060] Cells of each strain were cultured in the glutamic acid production medium, washed three times with 50 mM Tris-HCl (pH 7.5), and disrupted by ultrasonication. The suspension was centrifuged at 15,000 rpm for 10 minutes, and the obtained supernatant was used as a crude enzyme solution. In a volume of 20 µl, the crude enzyme solution was pretreated at 50°C for 1, 3, 5 or 10 minutes, and then was reacted with 780 µl of a reaction mixture at 30°C. The reaction mixture contained 20 mM Tris-HCl (pH 7.5), 50 mM sodium chloride and 20 mM trisodium isocitrate. The activity was calculated by measuring the absorbance at 240 nm originating from cis-aconitate produced by the reaction.

(7) Measurement of thermostability of phosphoenolpyruvate carboxylase



[0061] Cells of each strain were cultured in the glutamic acid production medium, washed three times with a washing buffer, and disrupted by ultrasonication. The suspension was centrifuged at 15,000 rpm for 10 minutes to remove disrupted cell debris. The washing buffer contained 100 mM Tris-HCl (pH 8.0), 10 mM magnesium sulfate, 1 mM dithiothreitol (DTT) and 20% glycerol. The supernatant was further centrifuged at 60,000 rpm for 1 hour, and the resultant supernatant was used as a crude enzyme solution.

[0062] In an amount of 20 µl, the crude enzyme solution was pretreated at 45°C for 1, 3, 5, 10 or 20 minutes, and then was reacted with 780 µl of a reaction mixture at 20°C. The reaction mixture contained 100 mM Tris-H2SO4 (pH 8.5), 5 mM phosphoenolpyruvate, 10 mM KHCO3, 0.1 mM acetyl-CoA, 0.15 mM NADH, 10 mM magnesium sulfate, 10 U of malate dehydrogenase and 0.1 mM dithiothreitol. The activity was calculated on the basis of decrease of NADH consumed by the reaction determined by measuring the absorbance at 340 nm for 2 minutes.
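
For orientation, the conversion of an absorbance decrease at 340 nm into a reaction rate, and the expression of thermostability as residual activity, may be sketched as follows. This is a sketch under stated assumptions and does not reproduce the calculation actually used: the molar absorption coefficient of NADH (6220 M^-1 cm^-1, a commonly cited value), the 1 cm light path, the function names and the absorbance readings are all assumptions, while the 800 µl volume corresponds to 20 µl of enzyme solution plus 780 µl of reaction mixture.

```python
NADH_EPSILON_340 = 6220.0  # M^-1 cm^-1; commonly cited value, not stated in the text


def nadh_rate_umol_per_min(delta_a340, minutes, volume_ul=800.0, path_cm=1.0):
    """Convert a decrease in A340 over a time span into micromoles of NADH per minute."""
    molar_per_min = (delta_a340 / minutes) / (NADH_EPSILON_340 * path_cm)
    # mol/l/min * litres * 1e6 umol/mol; litres = volume_ul * 1e-6
    return molar_per_min * volume_ul


def residual_activity(treated_rate, untreated_rate):
    """Activity after heat pretreatment as a percentage of the untreated activity."""
    return 100.0 * treated_rate / untreated_rate


# Hypothetical readings over a 2-minute measurement.
untreated = nadh_rate_umol_per_min(0.311, 2.0)
treated = nadh_rate_umol_per_min(0.1555, 2.0)
remaining = residual_activity(treated, untreated)
```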

(8) Measurement of thermostability of 2-oxoglutarate dehydrogenase



[0063] Cells of each strain were cultured in the glutamic acid production medium, washed twice with 0.2% potassium chloride, suspended in a solution containing 100 mM N-tris(hydroxymethyl)methyl-2-aminoethanesulfonic acid (TES)-NaOH (pH 7.5) and 30% glycerol, and disrupted by ultrasonication. The suspension was then centrifuged at 10000 x g for 30 minutes to obtain a supernatant. The resultant supernatant was desalted by using Sephadex G-25, and the resultant solution was used as a crude enzyme solution.

[0064] The crude enzyme solution was pretreated beforehand at 50°C for 1, 3, 5 or 10 minutes and reacted with a reaction mixture at 37°C. The reaction mixture contained 100 mM TES-NaOH (pH 7.7), 5 mM magnesium chloride, 0.2 mM coenzyme A (CoA), 0.3 mM thiamin pyrophosphate (TPP), 1 mM α-ketoglutaric acid, 3 mM L-cysteine and 1 mM acetylpyridine adenine dinucleotide (APDPN). The activity was calculated based on the decrease of APDPN, which is an analogue of NADP used as a coenzyme in the aforementioned reaction, and determined by measuring the absorbance at 365 nm (Usuda Y. et al., Microbiology, 142:3347-3354, 1996).

(9) Measurement of thermostability of isocitrate lyase



[0065] Cells of each strain were cultured in the medium for measurement of isocitrate lyase activity, washed twice with 50 mM Tris-HCl (pH 7.3), and disrupted by ultrasonication. The suspension was centrifuged at 13,000 x g for 30 minutes, and the resultant supernatant was used as a crude enzyme solution. The crude enzyme solution was pretreated at 50°C for 5 minutes and reacted with a reaction mixture at 37°C. The reaction mixture contained 50 mM MOPS-NaOH (pH 7.3), 5 mM DTT, 15 mM magnesium chloride, 1 mM EDTA, 5 mM dithiothreitol, 0.2 mM NADH and 18 U of lactate dehydrogenase (LDH). The activity was calculated on the basis of decrease of NADH consumed by the reaction determined by measuring the absorbance at 340 nm (Reinscheid, D.J. et al., J. Bacteriol., 176:3474-3483, 1994).

(10) Measurement of thermostability of phosphofructokinase



[0066] Cells of each strain were cultured in the CM2G medium, washed twice with 0.1 M Tris-HCl (pH 7.5), and disrupted by ultrasonication. Then, the suspension was centrifuged at 13,000 x g for 30 minutes, and the resultant supernatant was used as a crude enzyme solution.

[0067] The crude enzyme solution was pretreated at 50°C for 1, 3, 5 or 10 minutes, and then reacted with a reaction mixture at 37°C. The reaction mixture contained 100 mM Tris-HCl (pH 8.2), 0.2 mM NADH, 10 mM magnesium chloride, 2 mM ammonium chloride, 10 mM potassium chloride, 0.2 mM phosphoenolpyruvate, 6.4 mM fructose-6-phosphate, 1 mM ATP and 40 µg of lactate dehydrogenase/pyruvate kinase (LDH/PK). The activity was calculated based on the decrease of NADH consumed by the reaction determined by measuring the absorbance at 340 nm (Mori M. et al., Agric. Biol. Chem., 51:2671-2678, 1987; Campos G. et al., J. Biol. Chem., 259:6147-6152, 1984).

(11) Measurement of thermostability of fructose-1-phosphate kinase



[0068] Cells of each strain were cultured in the CM2G medium, washed twice with 0.1 M Tris-HCl (pH 7.5), and disrupted by ultrasonication. Then, the suspension was centrifuged at 13,000 x g for 30 minutes, and the resultant supernatant was used as a crude enzyme solution.

[0069] The crude enzyme solution was pretreated at 50°C for 1, 3, 5 or 10 minutes, and then reacted with a reaction mixture at 37°C. The reaction mixture contained 100 mM Tris-HCl (pH 8.2), 0.2 mM NADH, 10 mM magnesium chloride, 2 mM ammonium chloride, 10 mM potassium chloride, 0.2 mM phosphoenolpyruvate, 6.4 mM fructose-1-phosphate, 1 mM ATP and 40 µg of lactate dehydrogenase/pyruvate kinase. The activity was calculated based on decrease of NADH consumed by the reaction determined by measuring the absorbance at 340 nm.

(12) Measurement of the thermostability of citrate synthase



[0070] Cells of each strain were cultured in the glutamic acid production medium, washed three times with 0.2 M sodium glutamate hydrate and 50 mM Tris-HCl (pH 7.5), and disrupted by ultrasonication. The suspension was centrifuged at 10,000 rpm for 10 minutes, and the resultant supernatant was used as a crude enzyme solution.

[0071] The crude enzyme solution was pretreated at 50°C for 5 minutes and then reacted with a reaction mixture at 30°C. The reaction mixture contained 0.1 M sodium glutamate hydrate, 0.1 mM 5,5'-dithiobis-(2-nitrobenzoic acid) (DTNB), 0.3 mM acetyl-CoA and 0.5 mM oxaloacetic acid. The activity was calculated by measuring the increase of the absorbance at 412 nm of thiol-CoA (HS-CoA) mercaptide produced by the reaction (Srere, P.A., Methods Enzymol., 13:11-26, 1969; Eikmanns B.J. et al., Microbiology, 140:1817-1828, 1994).

<4> Verification of correlations between amino acid substitutions and thermostability of enzymes



[0072] For each enzyme, the number of three types of amino acid substitutions predicted in Example 2 as amino acid substitutions involved in thermostability of Corynebacterium efficiens (Lys→Arg, Ser→Ala and Ser→Thr, these directions are defined as positive directions) and the number of substitutions in the reverse directions of these amino acid substitutions (Arg→Lys, Ala→Ser and Thr→Ser, these directions are defined as negative directions) were counted. Then, the number of each substitution in the negative direction was subtracted from the number of the substitution in the positive direction to express the extents of the amino acid substitutions with numerical values (hereinafter, each obtained numerical value is referred to as "point"). Subsequently, for each enzyme, data about which enzyme derived from Corynebacterium glutamicum or enzyme derived from Corynebacterium efficiens showed higher thermostability was compared with the point. The results are shown in Table 3.
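
The counting of points may be illustrated by the following minimal sketch. It assumes that the two orthologous sequences are supplied as an aligned pair of strings in one-letter code; the function names and the toy sequences are hypothetical.

```python
# Positive directions (C. glutamicum -> C. efficiens): Lys->Arg, Ser->Ala, Ser->Thr.
POSITIVE = {("K", "R"), ("S", "A"), ("S", "T")}


def thermostability_point(cg_aln, ce_aln):
    """Positive-direction substitutions minus negative-direction substitutions."""
    point = 0
    for x, y in zip(cg_aln, ce_aln):
        if (x, y) in POSITIVE:
            point += 1
        elif (y, x) in POSITIVE:  # reverse direction, e.g. Arg->Lys
            point -= 1
    return point


def predicted_more_thermostable(point):
    """Species whose enzyme is expected to be more thermostable for a given point."""
    if point > 0:
        return "C. efficiens"
    if point < 0:
        return "C. glutamicum"
    return "undetermined"


# Toy alignment: K->R, S->A and S->T (+3) and R->K (-1) give a point of 2.
example_point = thermostability_point("MKSSRL", "MRATKL")
example_prediction = predicted_more_thermostable(example_point)
```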

[0073] Among the enzymes shown in Table 3, the gene sequences of Corynebacterium efficiens encoding 2-oxoglutarate dehydrogenase, isocitrate lyase, phosphofructokinase, fructose-1-phosphate kinase (phosphofructokinase), isocitrate dehydrogenase, aconitase, phosphoenolpyruvate carboxylase and citrate synthase and the amino acid sequences encoded thereby are disclosed in WO01/25447. Furthermore, the gene sequences of Corynebacterium efficiens encoding aspartate kinase, dihydrodipicolinate synthase, diaminopimelate dehydrogenase and diaminopimelate decarboxylase and the amino acid sequences encoded thereby are disclosed in Japanese Patent Laid-open Publication (Kokai) No. 2001-120270.
Table 3 Results of comparison of experimental data about enzyme thermostability with points
Number   Enzyme                             Species showing higher thermostability   Point   Prediction result
1        2-Oxoglutarate dehydrogenase       C. efficiens                             0       Δ
2        Isocitrate lyase                   C. efficiens                             2       ○
3        Phosphofructokinase                C. efficiens                             -3      X
4        Fructose-1-phosphate kinase        C. efficiens                             5       ○
5        Isocitrate dehydrogenase           C. efficiens                             4       ○
6        Aconitase                          C. efficiens                             0       Δ
7        Phosphoenolpyruvate carboxylase    C. efficiens                             10      ○
8        Citrate synthase                   C. efficiens                             3       ○
9        Aspartate kinase                   C. glutamicum                            -1      ○
10       Dihydrodipicolinate synthase       C. efficiens                             0       Δ
11       Diaminopimelate dehydrogenase      C. glutamicum                            -2      ○
12       Diaminopimelate decarboxylase      C. efficiens                             2       ○


[0074] If the point was positive, the enzyme derived from Corynebacterium efficiens was expected to be more thermostable, whereas if the point was negative, the enzyme derived from Corynebacterium glutamicum was expected to be more thermostable.

[0075] As for 2-oxoglutarate dehydrogenase, the enzyme derived from Corynebacterium efficiens was more thermostable. However, the point was 0, and thus prediction based on the point was impossible (denoted with Δ in Table 3).

[0076] As for isocitrate lyase, the enzyme derived from Corynebacterium efficiens was more thermostabilized, and the point was 2, that is, positive. Thus, the experimental result matched the prediction (denoted with ○ in Table 3).

[0077] As for phosphofructokinase, the enzyme derived from Corynebacterium efficiens was more thermostabilized. However, the point was negative, and thus the experimental result did not match the prediction (denoted with X in Table 3).

[0078] As for fructose-1-phosphate kinase, the enzyme derived from Corynebacterium efficiens was more thermostabilized, and the point was 5, that is, positive. Thus, the experimental result matched the prediction.

[0079] As for isocitrate dehydrogenase, the enzyme derived from Corynebacterium efficiens was more thermostabilized, and the point was 4, that is, positive. Thus, the experimental result matched the prediction.

[0080] As for aconitase, the enzyme derived from Corynebacterium efficiens was more thermostabilized. However, the point was 0, and prediction based on the point was impossible.

[0081] As for phosphoenolpyruvate carboxylase, the enzyme derived from Corynebacterium efficiens was more thermostabilized, and the point was 10, that is, positive. Thus, the experimental result matched the prediction.

[0082] As for citrate synthase, the enzyme derived from Corynebacterium efficiens was more thermostabilized, and the point was 3, that is, positive. Thus, the experimental result matched the prediction.

[0083] As for aspartate kinase, the enzyme derived from Corynebacterium glutamicum was more thermostabilized, and the point was -1, that is, negative. Thus, the experimental result matched the prediction.

[0084] As for dihydrodipicolinate synthase, the enzyme derived from Corynebacterium efficiens was more thermostabilized. However, the point was 0, and thus prediction based on the point was impossible.

[0085] As for diaminopimelate dehydrogenase, the enzyme derived from Corynebacterium glutamicum was more thermostabilized, and the point was -2, that is, negative. Thus, the experimental result matched the prediction.

[0086] As for diaminopimelate decarboxylase, the enzyme derived from Corynebacterium efficiens was more thermostabilized, and the point was 2, that is, positive. Thus, the experimental result matched the prediction.

[0087] As described above, the direction of thermostabilization was correctly predicted for 8 of the 12 enzymes whose enzymatic activities were measured and could not be predicted for 3 enzymes, whereas the experimental result did not match the point for only one enzyme. Prediction with such a high probability was possible by considering the directions of only 3 types of amino acid substitutions.
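
The tally stated above can be checked against the species and point columns of Table 3 with the following short sketch. The data are taken from Table 3; the function and variable names are hypothetical.

```python
from collections import Counter

# (enzyme, species showing higher thermostability, point), as listed in Table 3.
TABLE_3 = [
    ("2-Oxoglutarate dehydrogenase", "C. efficiens", 0),
    ("Isocitrate lyase", "C. efficiens", 2),
    ("Phosphofructokinase", "C. efficiens", -3),
    ("Fructose-1-phosphate kinase", "C. efficiens", 5),
    ("Isocitrate dehydrogenase", "C. efficiens", 4),
    ("Aconitase", "C. efficiens", 0),
    ("Phosphoenolpyruvate carboxylase", "C. efficiens", 10),
    ("Citrate synthase", "C. efficiens", 3),
    ("Aspartate kinase", "C. glutamicum", -1),
    ("Dihydrodipicolinate synthase", "C. efficiens", 0),
    ("Diaminopimelate dehydrogenase", "C. glutamicum", -2),
    ("Diaminopimelate decarboxylase", "C. efficiens", 2),
]


def classify(observed, point):
    """Compare the sign of the point with the experimentally observed species."""
    if point == 0:
        return "undetermined"
    predicted = "C. efficiens" if point > 0 else "C. glutamicum"
    return "match" if predicted == observed else "mismatch"


tally = Counter(classify(observed, point) for _, observed, point in TABLE_3)
```

The resulting counts are 8 matches, 3 undetermined cases and 1 mismatch.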

Industrial Applicability



[0088] According to the present invention, a property of a protein such as thermostability can be modified by using only information on the primary structure, without using information on the secondary structure and tertiary structure of the protein. In particular, the thermostability of proteins produced by mesophilic bacteria currently used industrially, or of the mesophilic bacteria themselves, can be improved.


Claims

1. A method for modifying a property of a protein, comprising

(a) selecting 1000 or more pairs of genes which are orthologs to each other from genes encoded by genomes of a first microorganism and a second microorganism, respectively, wherein the second microorganism is closely related to the first microorganism and shows difference in a certain optimum growth condition when compared with the first microorganism,

(b) detecting amino acid substitutions present between an amino acid sequence encoded by a gene of the first microorganism and an amino acid sequence encoded by a gene of the second microorganism for each pair of the selected genes, and compiling the detected amino acid substitutions to calculate the frequency of amino acid substitution for each amino acid substitution type,
wherein a correction is made by subtracting, from a number of a certain amino acid substitution, a number of amino acid substitution of the direction reverse to the certain amino acid substitution,

(c) identifying the amino acid substitutions which occur at a high frequency as amino acid substitutions which are involved in said optimum growth condition particular to the second microorganism, and

(d) introducing one or more of the amino acid substitutions identified in (c) into the gene encoding the protein to modify a property of the protein, and

(e) testing the said property to select a protein of which the property has been modified as intended.


 
2. The method according to claim 1, wherein said optimum growth condition is optimum growth temperature, and the property of the protein is thermostability.
 
3. The method according to claim 1 or 2, wherein genes having an identity of 60% or more and less than 95% on the amino acid sequence level are selected as genes which are orthologs to each other.
 
4. The method according to any one of claims 1 to 3, wherein the first microorganism and the second microorganism are coryneform bacteria.
 
5. The method according to claim 4, wherein the first microorganism is Corynebacterium glutamicum, and the second microorganism is Corynebacterium efficiens.
 
6. A method for producing a protein having a modified property comprising

(a) selecting 1000 or more pairs of genes which are orthologs to each other from genes encoded by genomes of a first microorganism and a second microorganism, respectively, wherein the second microorganism is closely related to the first microorganism and shows difference in a certain optimum growth condition when compared with the first microorganism,

(b) detecting amino acid substitutions present between an amino acid sequence encoded by a gene of the first microorganism and an amino acid sequence encoded by a gene of the second microorganism for each pair of the selected genes, and compiling the detected amino acid substitutions to calculate the frequency of amino acid substitution for each amino acid substitution type,
wherein a correction is made by subtracting, from a number of a certain amino acid substitution, a number of amino acid substitution of the direction reverse to the certain amino acid substitution,

(c) identifying the amino acid substitutions which occur at a high frequency as amino acid substitutions which are involved in said optimum growth condition particular to the second microorganism,

(d) introducing one or more of the amino acid substitutions identified in (c) into the gene encoding the protein to modify a property of the protein, and

(e) introducing the gene obtained in (d) into a host suitable for gene expression to express the protein of which property is modified, and

(f) testing the property of the protein obtained in (e), and

(g) selecting a protein having an improved property relating to said optimum growth condition.


 
7. The method according to claim 6, wherein said optimum growth condition is optimum growth temperature, and the property of the protein is thermostability.
 
8. A method for producing a microorganism having a modified property comprising

(a) selecting 1000 or more pairs of genes which are orthologs to each other from genes encoded by genomes of a first microorganism and a second microorganism, respectively, wherein the second microorganism is closely related to the first microorganism and shows difference in a certain optimum growth condition when compared with the first microorganism,

(b) detecting amino acid substitutions present between an amino acid sequence encoded by a gene of the first microorganism and an amino acid sequence encoded by a gene of the second microorganism for each pair of the selected genes, and compiling the detected amino acid substitutions to calculate the frequency of amino acid substitution for each amino acid substitution type,
wherein a correction is made by subtracting, from a number of a certain amino acid substitution, a number of amino acid substitution of the direction reverse to the certain amino acid substitution,

(c) identifying the amino acid substitutions which occur at a high frequency as amino acid substitutions which are involved in said optimum growth condition particular to the second microorganism, and

(d) introducing one or more of the amino acid substitutions into a chromosomal DNA of a microorganism of which property is to be modified to obtain a microorganism of which property is modified, and

(e) testing said property to select a microorganism of which the property has been modified as intended.


 
9. The method according to claim 8, wherein said optimum growth condition is optimum growth temperature, and the property of the microorganism is thermostability.
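
For illustration only, steps (a) to (c) common to claims 1, 6 and 8 can be sketched in Python as follows. The sketch is not part of the claims and is not the applicant's implementation. It assumes that each ortholog pair has already been aligned into two equal-length strings with "-" marking gaps, and it applies the identity window of claim 3 (60% or more and less than 95%) as the ortholog filter. All function names and the toy sequences are hypothetical, and the toy data set is far smaller than the 1000 or more pairs the claims require.

```python
from collections import Counter


def percent_identity(first, second):
    """Percent identity over aligned columns in which neither sequence has a gap."""
    columns = [(x, y) for x, y in zip(first, second) if x != "-" and y != "-"]
    if not columns:
        return 0.0
    return 100.0 * sum(x == y for x, y in columns) / len(columns)


def count_substitutions(aligned_pairs, min_identity=60.0, max_identity=95.0):
    """Steps (a)-(b): keep pairs inside the identity window and tally
    substitutions by type, directed from the first to the second organism."""
    counts = Counter()
    for first, second in aligned_pairs:
        if len(first) != len(second):
            raise ValueError("aligned sequences must have equal length")
        identity = percent_identity(first, second)
        if not (min_identity <= identity < max_identity):
            continue  # assumption: pairs outside the window are not used
        for x, y in zip(first, second):
            if x != y and x != "-" and y != "-":
                counts[(x, y)] += 1
    return counts


def net_substitutions(counts):
    """Correction of step (b): subtract the count of the reverse direction."""
    return {(x, y): n - counts.get((y, x), 0) for (x, y), n in counts.items()}


def rank_substitutions(net, top=5):
    """Step (c): substitution types with the highest positive net count."""
    positive = [(pair, n) for pair, n in net.items() if n > 0]
    return sorted(positive, key=lambda item: (-item[1], item[0]))[:top]


# Hypothetical aligned ortholog pairs (first organism, second organism).
TOY_PAIRS = [
    ("MKLAVKDEGS", "MRLAVRDEGS"),  # 80% identity: K->R twice
    ("MTRLKAAGSE", "MTKLRAAGSE"),  # 80% identity: R->K once, K->R once
    ("MKKLSAVDEG", "MKKLSAVDEG"),  # 100% identity: outside the window
    ("MSKAAGLDEV", "MTRAAGLDEV"),  # 80% identity: S->T once, K->R once
]

if __name__ == "__main__":
    raw = count_substitutions(TOY_PAIRS)
    for (x, y), n in rank_substitutions(net_substitutions(raw)):
        print(f"{x}->{y}: net count {n}")
```

In this toy data set K->R is observed four times and R->K once, so the corrected count for K->R is three. The ranked list would then supply candidate substitutions for step (d), with the actual effect on the property still to be confirmed by the testing step.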
 


 


 






Cited references

REFERENCES CITED IN THE DESCRIPTION



This list of references cited by the applicant is for the reader's convenience only. It does not form part of the European patent document. Even though great care has been taken in compiling the references, errors or omissions cannot be excluded and the EPO disclaims all liability in this regard.

Patent documents cited in the description




Non-patent literature cited in the description