                         SEQUENCE LISTING

<110>  Ajinomoto Co., Inc.
 
<120>  Heparan sulfate having high 3-O-sulfation rate of glucosamine 
       residue

<130>  PAMA-281305

<150>  JP 2015-257022
<151>  2015-12-28

<160>  41    

<170>  PatentIn version 3.5

<210>  1
<211>  7467
<212>  DNA
<213>  Escherichia coli


<220>
<221>  CDS
<222>  (445)..(1164)

<220>
<221>  CDS
<222>  (1593)..(3284)

<220>
<221>  CDS
<222>  (4576)..(6138)

<220>
<221>  CDS
<222>  (6180)..(7358)

<400>  1
ggaggcctga ttactgttgc actaacagtg tcattgccgg agattgtaat cacactctat     60
ataattatat aaactctatt gtatttagtg tatgaggagg atggacagta tactttgaac    120
taggtaatta tgaatttgat cgtgatctcg taatacgttg ctgttattct ttaattaatt    180
atctgccaat ttatttttag atagttacag gaaatgttta tgcaaagagt ggtttgatat    240
ggtaagagta ataatttaga tgaagataaa tatatcaaac gtacacccta gtagttattt    300
ttaattaaac atatcgtcca tgaggtgcgg agtcattcta atcaacttaa tgtgttctgt    360
ttattaagca tttcctataa ataaacgact atcaatacgt tgatagtttt cattaacatg    420
caatattaat taaaatatta cccc atg att gtt gca aat atg tca tca tac       471
                           Met Ile Val Ala Asn Met Ser Ser Tyr          
                           1               5                            
cca cct cga aaa aaa gag ttg gtg cat tct ata caa agt tta cat gct      519
Pro Pro Arg Lys Lys Glu Leu Val His Ser Ile Gln Ser Leu His Ala         
10                  15                  20                  25          
caa gta gat aaa att aat ctt tgc ctg aat gag ttt gaa gaa att cct      567
Gln Val Asp Lys Ile Asn Leu Cys Leu Asn Glu Phe Glu Glu Ile Pro         
                30                  35                  40              
gag gaa tta gat ggt ttt tca aaa tta aat cca gtt att cca gat aaa      615
Glu Glu Leu Asp Gly Phe Ser Lys Leu Asn Pro Val Ile Pro Asp Lys         
            45                  50                  55                  
gat tat aag gat gtg ggc aaa ttt ata ttt cct tgc gct aaa aat gat      663
Asp Tyr Lys Asp Val Gly Lys Phe Ile Phe Pro Cys Ala Lys Asn Asp         
        60                  65                  70                      
atg atc gta ctt aca gat gat gat att att tac cct ccc gat tat gta      711
Met Ile Val Leu Thr Asp Asp Asp Ile Ile Tyr Pro Pro Asp Tyr Val         
    75                  80                  85                          
gaa aaa atg ctc aat ttt tat aat tcc ttt gca ata ttc aat tgc att      759
Glu Lys Met Leu Asn Phe Tyr Asn Ser Phe Ala Ile Phe Asn Cys Ile         
90                  95                  100                 105         
gtt ggg att cat ggc tgt ata tac ata gat gca ttt gat gga gat cag      807
Val Gly Ile His Gly Cys Ile Tyr Ile Asp Ala Phe Asp Gly Asp Gln         
                110                 115                 120             
tct aaa aga aaa gta ttt tca ttt act caa ggg cta ttg cga ccg aga      855
Ser Lys Arg Lys Val Phe Ser Phe Thr Gln Gly Leu Leu Arg Pro Arg         
            125                 130                 135                 
gtt gta aat caa tta ggt aca ggg act gtt ttt ctt aag gca gat caa      903
Val Val Asn Gln Leu Gly Thr Gly Thr Val Phe Leu Lys Ala Asp Gln         
        140                 145                 150                     
tta cca tct tta aaa tat atg gat ggt tct caa cga ttc gtc gat gtt      951
Leu Pro Ser Leu Lys Tyr Met Asp Gly Ser Gln Arg Phe Val Asp Val         
    155                 160                 165                         
aga ttt tct cgc tat atg tta gag aat gaa att ggt atg ata tgt gtt      999
Arg Phe Ser Arg Tyr Met Leu Glu Asn Glu Ile Gly Met Ile Cys Val         
170                 175                 180                 185         
ccc aga gaa aaa aac tgg cta aga gag gtc tca tca ggt tca atg gaa     1047
Pro Arg Glu Lys Asn Trp Leu Arg Glu Val Ser Ser Gly Ser Met Glu         
                190                 195                 200             
gga ctt tgg aac aca ttt aca aaa aaa tgg cct tta gac atc ata aaa     1095
Gly Leu Trp Asn Thr Phe Thr Lys Lys Trp Pro Leu Asp Ile Ile Lys         
            205                 210                 215                 
gaa aca caa gca atc gca gga tat tca aaa ctt aac ctc gaa tta gtg     1143
Glu Thr Gln Ala Ile Ala Gly Tyr Ser Lys Leu Asn Leu Glu Leu Val         
        220                 225                 230                     
tat aat gtg gaa ggg taa aaa cttacttttt tattcacatt cctgtatttt        1194
Tyr Asn Val Glu Gly     Lys                                             
    235                                                                 
gtgttggttt ctgaagttta tagtataaat acttgtttta aatagttgta cgttgatatt   1254
ttgttatata cttatttaaa ccatttgttt tatgattttg aaaaatatca gcgttagttt   1314
ggtagagttt ataattaaga tttttgtcta aaagaaggtg gtaacgcaat atgtcaatta   1374
ttaggaggtg ctctgagtta tattgatatt gtttattgat gaatggctat accaaataaa   1434
tcagatgtgc tattgagata tagatagttt catttagtat tatcacataa cgccacctaa   1494
attacattac agatttgaaa tatatgtctg caatatcacc attacgataa acgacagtgt   1554
ttaaaataaa gtaatcttgt agataataaa gaggaaat atg atg aat aaa tta gtg   1610
                                          Met Met Asn Lys Leu Val       
                                          240                 245       
cta gtc gga cat cct ggc tca aag tat cag ata gtt gaa cat ttt ttg     1658
Leu Val Gly His Pro Gly Ser Lys Tyr Gln Ile Val Glu His Phe Leu         
                250                 255                 260             
aaa gaa att ggc atg aac tca cca aat tat tct aca agt aat aaa att     1706
Lys Glu Ile Gly Met Asn Ser Pro Asn Tyr Ser Thr Ser Asn Lys Ile         
            265                 270                 275                 
tcc cca gaa tat atc acc gct tca tta tgt caa ttt tat caa aca cca     1754
Ser Pro Glu Tyr Ile Thr Ala Ser Leu Cys Gln Phe Tyr Gln Thr Pro         
        280                 285                 290                     
gaa gtt aat gat gta gta gat gag aga gaa ttc tca gct gtt caa gtc     1802
Glu Val Asn Asp Val Val Asp Glu Arg Glu Phe Ser Ala Val Gln Val         
    295                 300                 305                         
tca acc atg tgg gat agc atg gtt ctt gaa cta atg atg aac aat cta     1850
Ser Thr Met Trp Asp Ser Met Val Leu Glu Leu Met Met Asn Asn Leu         
310                 315                 320                 325         
aat aac aaa ctt tgg ggg tgg gca gat cca tct ata ata ttt ttt ctt     1898
Asn Asn Lys Leu Trp Gly Trp Ala Asp Pro Ser Ile Ile Phe Phe Leu         
                330                 335                 340             
gat ttt tgg aaa aat ata gat aaa agc ata aaa ttc atc atg ata tat     1946
Asp Phe Trp Lys Asn Ile Asp Lys Ser Ile Lys Phe Ile Met Ile Tyr         
            345                 350                 355                 
gat cac cct aaa tat aat tta atg cgt tca gta aat aat gcc cct ctc     1994
Asp His Pro Lys Tyr Asn Leu Met Arg Ser Val Asn Asn Ala Pro Leu         
        360                 365                 370                     
tct tta aat ata aat aat agt gta gat aac tgg att gca tat aat aaa     2042
Ser Leu Asn Ile Asn Asn Ser Val Asp Asn Trp Ile Ala Tyr Asn Lys         
    375                 380                 385                         
aga ttg ctt gat ttt ttt ttg gag aat aaa gaa cga tgt gtg ttg att     2090
Arg Leu Leu Asp Phe Phe Leu Glu Asn Lys Glu Arg Cys Val Leu Ile         
390                 395                 400                 405         
aat ttt gag gcg ttt caa agc aat aag aaa aat att ata aag cca ttg     2138
Asn Phe Glu Ala Phe Gln Ser Asn Lys Lys Asn Ile Ile Lys Pro Leu         
                410                 415                 420             
agt aat att ata aaa ata gat aat cta atg tct gcg cat tac aaa aat     2186
Ser Asn Ile Ile Lys Ile Asp Asn Leu Met Ser Ala His Tyr Lys Asn         
            425                 430                 435                 
tca ata ttg ttt gat gtg gtt gag aat aat gat tat aca aaa tca aat     2234
Ser Ile Leu Phe Asp Val Val Glu Asn Asn Asp Tyr Thr Lys Ser Asn         
        440                 445                 450                     
gaa att gcc ctg ctt gaa aaa tat aca act tta ttt tct tta agt gca     2282
Glu Ile Ala Leu Leu Glu Lys Tyr Thr Thr Leu Phe Ser Leu Ser Ala         
    455                 460                 465                         
aat gag act gaa att aca ttt aat gat aca aag gtt agt gag tac tta     2330
Asn Glu Thr Glu Ile Thr Phe Asn Asp Thr Lys Val Ser Glu Tyr Leu         
470                 475                 480                 485         
gta tct gaa tta ata aaa gaa aga acc gag gtt ctg aag ctt tat aat     2378
Val Ser Glu Leu Ile Lys Glu Arg Thr Glu Val Leu Lys Leu Tyr Asn         
                490                 495                 500             
gag tta caa gcc tat gca aac cta cct tat ata gaa aca tcg aaa gat     2426
Glu Leu Gln Ala Tyr Ala Asn Leu Pro Tyr Ile Glu Thr Ser Lys Asp         
            505                 510                 515                 
aac gtt tcg gct gag gct gca tta tgg gag gta gtc gaa gag aga aat     2474
Asn Val Ser Ala Glu Ala Ala Leu Trp Glu Val Val Glu Glu Arg Asn         
        520                 525                 530                     
tct atc ttc aat att gta tct cat ttg gtg caa gag tca aaa aag aag     2522
Ser Ile Phe Asn Ile Val Ser His Leu Val Gln Glu Ser Lys Lys Lys         
    535                 540                 545                         
gat gca gat att gaa ttg act aaa tct ata ttt aag aaa aga caa ttt     2570
Asp Ala Asp Ile Glu Leu Thr Lys Ser Ile Phe Lys Lys Arg Gln Phe         
550                 555                 560                 565         
tta tta ttg aac agg att aat gag cta aaa aaa gaa aag gaa gag gta     2618
Leu Leu Leu Asn Arg Ile Asn Glu Leu Lys Lys Glu Lys Glu Glu Val         
                570                 575                 580             
att aaa ctt tca aaa ata aat cac aac gat gtt gtg aga caa gaa aaa     2666
Ile Lys Leu Ser Lys Ile Asn His Asn Asp Val Val Arg Gln Glu Lys         
            585                 590                 595                 
tat cca gat gat att gaa aaa aaa ata aat gac ata cag aaa tat gaa     2714
Tyr Pro Asp Asp Ile Glu Lys Lys Ile Asn Asp Ile Gln Lys Tyr Glu         
        600                 605                 610                     
gaa gag ata agc gaa aaa gaa tca aaa ctc act cag gca ata tca gaa     2762
Glu Glu Ile Ser Glu Lys Glu Ser Lys Leu Thr Gln Ala Ile Ser Glu         
    615                 620                 625                         
aaa gaa cag att tta aaa caa ttg cat aaa tat gaa gaa gag ata agc     2810
Lys Glu Gln Ile Leu Lys Gln Leu His Lys Tyr Glu Glu Glu Ile Ser         
630                 635                 640                 645         
gaa aaa gaa tca aaa ctc act cag gca ata tca gaa aaa gaa cag att     2858
Glu Lys Glu Ser Lys Leu Thr Gln Ala Ile Ser Glu Lys Glu Gln Ile         
                650                 655                 660             
tta aaa caa ttg cat ata gtg caa gag cag ttg gaa cac tat ttt ata     2906
Leu Lys Gln Leu His Ile Val Gln Glu Gln Leu Glu His Tyr Phe Ile         
            665                 670                 675                 
gaa aat cag gaa att aaa aag aaa ctt cca cct gtg cta tat gga gca     2954
Glu Asn Gln Glu Ile Lys Lys Lys Leu Pro Pro Val Leu Tyr Gly Ala         
        680                 685                 690                     
gct gag cag ata aaa caa gag tta ggt tat cga ctt ggt tat att ata     3002
Ala Glu Gln Ile Lys Gln Glu Leu Gly Tyr Arg Leu Gly Tyr Ile Ile         
    695                 700                 705                         
gtc tcg tat tct aaa tcc ctc aag ggg att att acc atg cca ttt gca     3050
Val Ser Tyr Ser Lys Ser Leu Lys Gly Ile Ile Thr Met Pro Phe Ala         
710                 715                 720                 725         
ctt atc cgt gag tgt gtt ttt gaa aaa aaa cgt aag aag agt tat ggc     3098
Leu Ile Arg Glu Cys Val Phe Glu Lys Lys Arg Lys Lys Ser Tyr Gly         
                730                 735                 740             
gtt gat gtg cca ctc tat tta tat gct gat gct gat aag gct gaa aga     3146
Val Asp Val Pro Leu Tyr Leu Tyr Ala Asp Ala Asp Lys Ala Glu Arg         
            745                 750                 755                 
gtt aag aaa cat tta tct tat caa tta ggg cag gct att atc tcc agt     3194
Val Lys Lys His Leu Ser Tyr Gln Leu Gly Gln Ala Ile Ile Ser Ser         
        760                 765                 770                     
gct aat tcg ata ttt gga ttc att acc ctt cca ttt aag tta att gtt     3242
Ala Asn Ser Ile Phe Gly Phe Ile Thr Leu Pro Phe Lys Leu Ile Val         
    775                 780                 785                         
gtt gtt tat aaa tat agg aga gct aaa atc aag ggc tgt taa             3284
Val Val Tyr Lys Tyr Arg Arg Ala Lys Ile Lys Gly Cys                     
790                 795                 800                             
aaatgtgaac cctaatgaga tatattgcaa attttatttt tctctttgtg gtgttttgct   3344
ttcgttaaaa tagttagtta ttttatttat ataatatcac gcattataat accaatttat   3404
acttttgcaa gtgaccgtat agattcgcca catattgcaa attttgttct ctcgtaaaat   3464
attttcttct ggtgtcagta attcgagcac ttcatgctgt cttttactgc agtatagtac   3524
taggttttca gctagtttct gattaaatat gtttagctct ttaaagagtc tatttattta   3584
aattcaatga actgttcttt cggcgttcgc ttaaaacgtt cagaggtgcg ttctaggcgt   3644
aaaggggtag gtccataggg cttgctagcg gattcctgcg gtgctttgtc gaagtttcct   3704
gggaactctt tcccgttatc tgcgatgccc tggttgatat cgatagggaa tagccggtct   3764
gcacggcaaa agaagagttt gatgatatca cagttaagtg acggcacggc ctgtgccaga   3824
gcgtagttgc tgtgttcgtc gatcatggta atgacataac agcgcagctc gcccattctg   3884
agctcaatag catctatacc aacgagctca ctgctcttta ccgggcgata gcgttttgat   3944
ctgcgggttg gaggattgta ttttttatga acagtgtttt tccttgggga ggcagtcaca   4004
tcggtatcag tcgcatttta tcgtgtgcgg ccgtgaacat cctgccgatg gttgaaatgc   4064
tcggacagac caggcggtgc tgttcactcc acggcttcaa gcaaacaaaa atctgttctt   4124
ctaggttggg aagctctatt ctcagtcgac gtatttctta cagaattacg agatgccatt   4184
gctttgtgcg atgtactagc ggagctttgc tacgcgaaat aggtgcctct gggccataat   4244
acagcgttcg tgtggataca gcaaaaactt ccgcaactgt attgatctca tgtttctccc   4304
agaagtatat tttttcatcc ttaattttgt aatctcaggt ataacaaagt gtttcatcac   4364
atagatgttg gcatggtaat gcctcaaata tccgccgcag atacgttgca tcaacttagc   4424
atttccctcg cttgtccgga gataattgca atatctctgt gagcttacac tgtgacattc   4484
gttgagtttt agtgatgttt ttaaagattt atatttataa tatttagtaa atgcagtttt   4544
attctcattt tatttatcat taagtgaatg t atg aac gca gaa tat ata aat      4596
                                   Met Asn Ala Glu Tyr Ile Asn          
                                           805                          
tta gtt gaa cgt aaa aag aaa tta ggg aca aat att ggt gct ctt gat     4644
Leu Val Glu Arg Lys Lys Lys Leu Gly Thr Asn Ile Gly Ala Leu Asp         
810                 815                 820                 825         
ttt tta tta tca att cat aag gag aaa gtt gat ctt caa cat aaa aac     4692
Phe Leu Leu Ser Ile His Lys Glu Lys Val Asp Leu Gln His Lys Asn         
                830                 835                 840             
tcg cct tta aaa ggt aac gat aac ctt att cac aaa aga ata aac gaa     4740
Ser Pro Leu Lys Gly Asn Asp Asn Leu Ile His Lys Arg Ile Asn Glu         
            845                 850                 855                 
tac gac aat gta ctt gaa cta tct aag aat gta tca gct cag aat tct     4788
Tyr Asp Asn Val Leu Glu Leu Ser Lys Asn Val Ser Ala Gln Asn Ser         
        860                 865                 870                     
ggc aat gag ttt tct tat tta ttg gga tat gca gat tct ctt aga aaa     4836
Gly Asn Glu Phe Ser Tyr Leu Leu Gly Tyr Ala Asp Ser Leu Arg Lys         
    875                 880                 885                         
gtt ggt atg ttg gat act tat att aaa att gtt tgt tat cta aca att     4884
Val Gly Met Leu Asp Thr Tyr Ile Lys Ile Val Cys Tyr Leu Thr Ile         
890                 895                 900                 905         
caa tct cgt tat ttt aaa aat ggc gaa cga gtt aag ctt ttt gaa cat     4932
Gln Ser Arg Tyr Phe Lys Asn Gly Glu Arg Val Lys Leu Phe Glu His         
                910                 915                 920             
ata agt aac gct cta cgg tat tca agg agt gat ttt ctc att aat ctt     4980
Ile Ser Asn Ala Leu Arg Tyr Ser Arg Ser Asp Phe Leu Ile Asn Leu         
            925                 930                 935                 
att ttt gaa cga tat atc gaa tat ata aac cat cta aaa ttg tcg ccc     5028
Ile Phe Glu Arg Tyr Ile Glu Tyr Ile Asn His Leu Lys Leu Ser Pro         
        940                 945                 950                     
aaa caa aaa gat ttt tat ttt tgt acg aag ttt tca aaa ttt cat gat     5076
Lys Gln Lys Asp Phe Tyr Phe Cys Thr Lys Phe Ser Lys Phe His Asp         
    955                 960                 965                         
tat act aaa aat gga tat aaa tat tta gca ttt gat aat caa gcc gat     5124
Tyr Thr Lys Asn Gly Tyr Lys Tyr Leu Ala Phe Asp Asn Gln Ala Asp         
970                 975                 980                 985         
gca ggg tat ggc ctg act tta tta tta aat gca aac gat gat atg  caa    5172
Ala Gly Tyr Gly Leu Thr Leu Leu Leu Asn Ala Asn Asp Asp Met  Gln        
                990                 995                 1000            
gat agt tat aat  cta ctc cct gag caa  gaa ctt ttt att tgt  aat      5217
Asp Ser Tyr Asn  Leu Leu Pro Glu Gln  Glu Leu Phe Ile Cys  Asn          
            1005                 1010                 1015              
gct gta ata gat  aat atg aat att tat  agg agt caa ttt aac  aaa      5262
Ala Val Ile Asp  Asn Met Asn Ile Tyr  Arg Ser Gln Phe Asn  Lys          
            1020                 1025                 1030              
tgt cta cga aaa  tac gat tta tca gaa  ata act gat ata tac  cca      5307
Cys Leu Arg Lys  Tyr Asp Leu Ser Glu  Ile Thr Asp Ile Tyr  Pro          
            1035                 1040                 1045              
aat aaa att ata  ttg caa gga att aag  ttc gat aag aaa aaa  aat      5352
Asn Lys Ile Ile  Leu Gln Gly Ile Lys  Phe Asp Lys Lys Lys  Asn          
            1050                 1055                 1060              
gtt tat gga aaa  gat ctt gtt agt ata  ata atg tca gta ttc  aat      5397
Val Tyr Gly Lys  Asp Leu Val Ser Ile  Ile Met Ser Val Phe  Asn          
            1065                 1070                 1075              
tca gaa gat act  att gca tac tca tta  cat tca ttg ttg aat  caa      5442
Ser Glu Asp Thr  Ile Ala Tyr Ser Leu  His Ser Leu Leu Asn  Gln          
            1080                 1085                 1090              
aca tat gaa aat  att gaa att ctc gtg  tgc gat gat tgt tca  tcg      5487
Thr Tyr Glu Asn  Ile Glu Ile Leu Val  Cys Asp Asp Cys Ser  Ser          
            1095                 1100                 1105              
gac aaa agc ctt  gaa ata att aag agc  ata gct tat tct gat  tca      5532
Asp Lys Ser Leu  Glu Ile Ile Lys Ser  Ile Ala Tyr Ser Asp  Ser          
            1110                 1115                 1120              
aga gtg aaa gta  tat agc tca cga aaa  aac caa ggc cct tat  aat      5577
Arg Val Lys Val  Tyr Ser Ser Arg Lys  Asn Gln Gly Pro Tyr  Asn          
            1125                 1130                 1135              
ata aga aat gag  cta ata aaa aaa gca  cac ggt aat ttc atc  acc      5622
Ile Arg Asn Glu  Leu Ile Lys Lys Ala  His Gly Asn Phe Ile  Thr          
            1140                 1145                 1150              
ttt caa gat gca  gat gat ctt tct cat  ccg gag aga ata caa  aga      5667
Phe Gln Asp Ala  Asp Asp Leu Ser His  Pro Glu Arg Ile Gln  Arg          
            1155                 1160                 1165              
caa gtt gag gtt  ctt cgc aat aat aag  gct gta atc tgt atg  gct      5712
Gln Val Glu Val  Leu Arg Asn Asn Lys  Ala Val Ile Cys Met  Ala          
            1170                 1175                 1180              
aac tgg atc cgt  gtt gcg tca aat gga  aaa att caa ttc ttc  tat      5757
Asn Trp Ile Arg  Val Ala Ser Asn Gly  Lys Ile Gln Phe Phe  Tyr          
            1185                 1190                 1195              
gat gat aaa gcc  aca aga atg tct gtt  gta tcg tca atg ata  aaa      5802
Asp Asp Lys Ala  Thr Arg Met Ser Val  Val Ser Ser Met Ile  Lys          
            1200                 1205                 1210              
aaa gat att ttt  gcg aca gtt ggt ggc  tat aga caa tct tta  att      5847
Lys Asp Ile Phe  Ala Thr Val Gly Gly  Tyr Arg Gln Ser Leu  Ile          
            1215                 1220                 1225              
ggt gca gat acg  gag ttt tat gaa aca  gta ata atg cgt tat  ggg      5892
Gly Ala Asp Thr  Glu Phe Tyr Glu Thr  Val Ile Met Arg Tyr  Gly          
            1230                 1235                 1240              
cga gaa agt att  gta aga tta ctg cag  cca ttg ata ttg ggg  tta      5937
Arg Glu Ser Ile  Val Arg Leu Leu Gln  Pro Leu Ile Leu Gly  Leu          
            1245                 1250                 1255              
tgg gga gac tcc  gga ctt acc agg aat  aaa gga aca gaa gct  cta      5982
Trp Gly Asp Ser  Gly Leu Thr Arg Asn  Lys Gly Thr Glu Ala  Leu          
            1260                 1265                 1270              
cct gat gga tat  ata tca caa tct cga  aga gaa tat agt gat  atc      6027
Pro Asp Gly Tyr  Ile Ser Gln Ser Arg  Arg Glu Tyr Ser Asp  Ile          
            1275                 1280                 1285              
gcg gca aga caa  cga gtg tta ggg aaa  agt atc gta agt gat  aaa      6072
Ala Ala Arg Gln  Arg Val Leu Gly Lys  Ser Ile Val Ser Asp  Lys          
            1290                 1295                 1300              
gat gta cgt ggt  tta tta tct cgc tat  ggt ttg ttt aaa gat  gta      6117
Asp Val Arg Gly  Leu Leu Ser Arg Tyr  Gly Leu Phe Lys Asp  Val          
            1305                 1310                 1315              
tca gga ata att  gaa caa tag tttgttattc tatatatatt aaatttttgg       6168
Ser Gly Ile Ile  Glu Gln                                                
            1320                                                        
ggctatataa a atg ttc gga  aca cta aaa ata act  gtt tca ggc gct      6215
             Met Phe Gly  Thr Leu Lys Ile Thr  Val Ser Gly Ala          
                     1325                 1330                          
ggt  tac gtt ggg ctt tca  aat gga att cta atg  gct caa aat cat      6260
Gly  Tyr Val Gly Leu Ser  Asn Gly Ile Leu Met  Ala Gln Asn His          
1335                 1340                 1345                          
gaa  gtg gtt gca ttt gat  acc cat caa aaa aaa  gtt gac tta ctt      6305
Glu  Val Val Ala Phe Asp  Thr His Gln Lys Lys  Val Asp Leu Leu          
1350                 1355                 1360                          
aat  gat aaa ctc tct cct  ata gag gat aag gaa  att gaa aat tat      6350
Asn  Asp Lys Leu Ser Pro  Ile Glu Asp Lys Glu  Ile Glu Asn Tyr          
1365                 1370                 1375                          
ctt  tca act aaa ata ctt  aat ttt cgc gca act  act aac aaa tat      6395
Leu  Ser Thr Lys Ile Leu  Asn Phe Arg Ala Thr  Thr Asn Lys Tyr          
1380                 1385                 1390                          
gaa  gcc tat aaa aat gcc  aat tac gtt att att  gct aca cca acg      6440
Glu  Ala Tyr Lys Asn Ala  Asn Tyr Val Ile Ile  Ala Thr Pro Thr          
1395                 1400                 1405                          
aat  tat gac cca ggt tca  aat tac ttt gat aca  tca agc gtt gaa      6485
Asn  Tyr Asp Pro Gly Ser  Asn Tyr Phe Asp Thr  Ser Ser Val Glu          
1410                 1415                 1420                          
gct  gtc att cgt gac gta  acg gaa atc aac cca  aac gca att atg      6530
Ala  Val Ile Arg Asp Val  Thr Glu Ile Asn Pro  Asn Ala Ile Met          
1425                 1430                 1435                          
gtg  gtt aaa tct acg gtc  cca gta ggt ttc aca  aaa aca att aaa      6575
Val  Val Lys Ser Thr Val  Pro Val Gly Phe Thr  Lys Thr Ile Lys          
1440                 1445                 1450                          
gaa  cat tta ggt att aat  aat att atc ttc tct  cca gaa ttt tta      6620
Glu  His Leu Gly Ile Asn  Asn Ile Ile Phe Ser  Pro Glu Phe Leu          
1455                 1460                 1465                          
cga  gaa gga aga gcc cta  tac gat aat ctc cat  cca tct cgc att      6665
Arg  Glu Gly Arg Ala Leu  Tyr Asp Asn Leu His  Pro Ser Arg Ile          
1470                 1475                 1480                          
att  atc ggt gaa tgt tct  gaa cgg gca gaa cgt  ttg gca gtg tta      6710
Ile  Ile Gly Glu Cys Ser  Glu Arg Ala Glu Arg  Leu Ala Val Leu          
1485                 1490                 1495                          
ttt  cag gaa gga gcg att  aaa caa aat ata ccc  gtt tta ttt aca      6755
Phe  Gln Glu Gly Ala Ile  Lys Gln Asn Ile Pro  Val Leu Phe Thr          
1500                 1505                 1510                          
gat  tct acg gaa gcg gaa  gcg att aag tta ttt  tca aat act tat      6800
Asp  Ser Thr Glu Ala Glu  Ala Ile Lys Leu Phe  Ser Asn Thr Tyr          
1515                 1520                 1525                          
ttg  gct atg cga gtt gca  ttt ttt aat gaa ttg  gat agt tac gca      6845
Leu  Ala Met Arg Val Ala  Phe Phe Asn Glu Leu  Asp Ser Tyr Ala          
1530                 1535                 1540                          
gaa  agt ttt ggt ctg aat  acg cgt cag att att  gac ggt gtt tgt      6890
Glu  Ser Phe Gly Leu Asn  Thr Arg Gln Ile Ile  Asp Gly Val Cys          
1545                 1550                 1555                          
ttg  gat ccg cgc att ggt  aat tac tac aat aat  cct tct ttt ggt      6935
Leu  Asp Pro Arg Ile Gly  Asn Tyr Tyr Asn Asn  Pro Ser Phe Gly          
1560                 1565                 1570                          
tat  ggt ggc tac tgt ttg  cca aaa gat acc aag  caa tta tta gcc      6980
Tyr  Gly Gly Tyr Cys Leu  Pro Lys Asp Thr Lys  Gln Leu Leu Ala          
1575                 1580                 1585                          
aac  tat cag tct gtt ccg  aat aaa ctt ata tct  gca att gtt gat      7025
Asn  Tyr Gln Ser Val Pro  Asn Lys Leu Ile Ser  Ala Ile Val Asp          
1590                 1595                 1600                          
gct  aac cgt aca cgt aag  gac ttt atc act aat  gtt att ttg aaa      7070
Ala  Asn Arg Thr Arg Lys  Asp Phe Ile Thr Asn  Val Ile Leu Lys          
1605                 1610                 1615                          
cat  aga cca caa gtt gtg  ggg gtt tat cgt ttg  att atg aaa agt      7115
His  Arg Pro Gln Val Val  Gly Val Tyr Arg Leu  Ile Met Lys Ser          
1620                 1625                 1630                          
ggt  tca gat aat ttt aga  gat tct tct att ctt  ggt att ata aag      7160
Gly  Ser Asp Asn Phe Arg  Asp Ser Ser Ile Leu  Gly Ile Ile Lys          
1635                 1640                 1645                          
cgt  atc aag aaa aaa ggc  gtg aaa gta att att  tat gag ccg ctt      7205
Arg  Ile Lys Lys Lys Gly  Val Lys Val Ile Ile  Tyr Glu Pro Leu          
1650                 1655                 1660                          
att  tct gga gat aca ttc  ttt aac tca cct ttg  gaa cgg gag ctg      7250
Ile  Ser Gly Asp Thr Phe  Phe Asn Ser Pro Leu  Glu Arg Glu Leu          
1665                 1670                 1675                          
gcg  atc ttt aaa ggg aaa  gct gat att att atc  act aac cga atg      7295
Ala  Ile Phe Lys Gly Lys  Ala Asp Ile Ile Ile  Thr Asn Arg Met          
1680                 1685                 1690                          
tca  gag gag ttg aac gat  gtg gtc gac aaa gtc  tat agt cgc gat      7340
Ser  Glu Glu Leu Asn Asp  Val Val Asp Lys Val  Tyr Ser Arg Asp          
1695                 1700                 1705                          
ttg  ttt aaa tgt gac taa tgtattgtta tatactatta actattaaga           7388
Leu  Phe Lys Cys Asp                                                    
1710                                                                    
gaaggaaatg cattatttaa tccgttaaaa atatgcctcg ttggtatgtt ctttattaat   7448
cctcgatcgt aaaataaga                                                7467


<210>  2
<211>  238
<212>  PRT
<213>  Escherichia coli

<400>  2
Met Ile Val Ala Asn Met Ser Ser Tyr Pro Pro Arg Lys Lys Glu Leu 
1               5                   10                  15      
Val His Ser Ile Gln Ser Leu His Ala Gln Val Asp Lys Ile Asn Leu 
            20                  25                  30          
Cys Leu Asn Glu Phe Glu Glu Ile Pro Glu Glu Leu Asp Gly Phe Ser 
        35                  40                  45              
Lys Leu Asn Pro Val Ile Pro Asp Lys Asp Tyr Lys Asp Val Gly Lys 
    50                  55                  60                  
Phe Ile Phe Pro Cys Ala Lys Asn Asp Met Ile Val Leu Thr Asp Asp 
65                  70                  75                  80  
Asp Ile Ile Tyr Pro Pro Asp Tyr Val Glu Lys Met Leu Asn Phe Tyr 
                85                  90                  95      
Asn Ser Phe Ala Ile Phe Asn Cys Ile Val Gly Ile His Gly Cys Ile 
            100                 105                 110         
Tyr Ile Asp Ala Phe Asp Gly Asp Gln Ser Lys Arg Lys Val Phe Ser 
        115                 120                 125             
Phe Thr Gln Gly Leu Leu Arg Pro Arg Val Val Asn Gln Leu Gly Thr 
    130                 135                 140                 
Gly Thr Val Phe Leu Lys Ala Asp Gln Leu Pro Ser Leu Lys Tyr Met 
145                 150                 155                 160 
Asp Gly Ser Gln Arg Phe Val Asp Val Arg Phe Ser Arg Tyr Met Leu 
                165                 170                 175     
Glu Asn Glu Ile Gly Met Ile Cys Val Pro Arg Glu Lys Asn Trp Leu 
            180                 185                 190         
Arg Glu Val Ser Ser Gly Ser Met Glu Gly Leu Trp Asn Thr Phe Thr 
        195                 200                 205             
Lys Lys Trp Pro Leu Asp Ile Ile Lys Glu Thr Gln Ala Ile Ala Gly 
    210                 215                 220                 
Tyr Ser Lys Leu Asn Leu Glu Leu Val Tyr Asn Val Glu Gly 
225                 230                 235             


<210>  3
<211>  563
<212>  PRT
<213>  Escherichia coli

<400>  3
Met Met Asn Lys Leu Val Leu Val Gly His Pro Gly Ser Lys Tyr Gln 
1               5                   10                  15      
Ile Val Glu His Phe Leu Lys Glu Ile Gly Met Asn Ser Pro Asn Tyr 
            20                  25                  30          
Ser Thr Ser Asn Lys Ile Ser Pro Glu Tyr Ile Thr Ala Ser Leu Cys 
        35                  40                  45              
Gln Phe Tyr Gln Thr Pro Glu Val Asn Asp Val Val Asp Glu Arg Glu 
    50                  55                  60                  
Phe Ser Ala Val Gln Val Ser Thr Met Trp Asp Ser Met Val Leu Glu 
65                  70                  75                  80  
Leu Met Met Asn Asn Leu Asn Asn Lys Leu Trp Gly Trp Ala Asp Pro 
                85                  90                  95      
Ser Ile Ile Phe Phe Leu Asp Phe Trp Lys Asn Ile Asp Lys Ser Ile 
            100                 105                 110         
Lys Phe Ile Met Ile Tyr Asp His Pro Lys Tyr Asn Leu Met Arg Ser 
        115                 120                 125             
Val Asn Asn Ala Pro Leu Ser Leu Asn Ile Asn Asn Ser Val Asp Asn 
    130                 135                 140                 
Trp Ile Ala Tyr Asn Lys Arg Leu Leu Asp Phe Phe Leu Glu Asn Lys 
145                 150                 155                 160 
Glu Arg Cys Val Leu Ile Asn Phe Glu Ala Phe Gln Ser Asn Lys Lys 
                165                 170                 175     
Asn Ile Ile Lys Pro Leu Ser Asn Ile Ile Lys Ile Asp Asn Leu Met 
            180                 185                 190         
Ser Ala His Tyr Lys Asn Ser Ile Leu Phe Asp Val Val Glu Asn Asn 
        195                 200                 205             
Asp Tyr Thr Lys Ser Asn Glu Ile Ala Leu Leu Glu Lys Tyr Thr Thr 
    210                 215                 220                 
Leu Phe Ser Leu Ser Ala Asn Glu Thr Glu Ile Thr Phe Asn Asp Thr 
225                 230                 235                 240 
Lys Val Ser Glu Tyr Leu Val Ser Glu Leu Ile Lys Glu Arg Thr Glu 
                245                 250                 255     
Val Leu Lys Leu Tyr Asn Glu Leu Gln Ala Tyr Ala Asn Leu Pro Tyr 
            260                 265                 270         
Ile Glu Thr Ser Lys Asp Asn Val Ser Ala Glu Ala Ala Leu Trp Glu 
        275                 280                 285             
Val Val Glu Glu Arg Asn Ser Ile Phe Asn Ile Val Ser His Leu Val 
    290                 295                 300                 
Gln Glu Ser Lys Lys Lys Asp Ala Asp Ile Glu Leu Thr Lys Ser Ile 
305                 310                 315                 320 
Phe Lys Lys Arg Gln Phe Leu Leu Leu Asn Arg Ile Asn Glu Leu Lys 
                325                 330                 335     
Lys Glu Lys Glu Glu Val Ile Lys Leu Ser Lys Ile Asn His Asn Asp 
            340                 345                 350         
Val Val Arg Gln Glu Lys Tyr Pro Asp Asp Ile Glu Lys Lys Ile Asn 
        355                 360                 365             
Asp Ile Gln Lys Tyr Glu Glu Glu Ile Ser Glu Lys Glu Ser Lys Leu 
    370                 375                 380                 
Thr Gln Ala Ile Ser Glu Lys Glu Gln Ile Leu Lys Gln Leu His Lys 
385                 390                 395                 400 
Tyr Glu Glu Glu Ile Ser Glu Lys Glu Ser Lys Leu Thr Gln Ala Ile 
                405                 410                 415     
Ser Glu Lys Glu Gln Ile Leu Lys Gln Leu His Ile Val Gln Glu Gln 
            420                 425                 430         
Leu Glu His Tyr Phe Ile Glu Asn Gln Glu Ile Lys Lys Lys Leu Pro 
        435                 440                 445             
Pro Val Leu Tyr Gly Ala Ala Glu Gln Ile Lys Gln Glu Leu Gly Tyr 
    450                 455                 460                 
Arg Leu Gly Tyr Ile Ile Val Ser Tyr Ser Lys Ser Leu Lys Gly Ile 
465                 470                 475                 480 
Ile Thr Met Pro Phe Ala Leu Ile Arg Glu Cys Val Phe Glu Lys Lys 
                485                 490                 495     
Arg Lys Lys Ser Tyr Gly Val Asp Val Pro Leu Tyr Leu Tyr Ala Asp 
            500                 505                 510         
Ala Asp Lys Ala Glu Arg Val Lys Lys His Leu Ser Tyr Gln Leu Gly 
        515                 520                 525             
Gln Ala Ile Ile Ser Ser Ala Asn Ser Ile Phe Gly Phe Ile Thr Leu 
    530                 535                 540                 
Pro Phe Lys Leu Ile Val Val Val Tyr Lys Tyr Arg Arg Ala Lys Ile 
545                 550                 555                 560 
Lys Gly Cys 
            


<210>  4
<211>  520
<212>  PRT
<213>  Escherichia coli

<400>  4
Met Asn Ala Glu Tyr Ile Asn Leu Val Glu Arg Lys Lys Lys Leu Gly 
1               5                   10                  15      
Thr Asn Ile Gly Ala Leu Asp Phe Leu Leu Ser Ile His Lys Glu Lys 
            20                  25                  30          
Val Asp Leu Gln His Lys Asn Ser Pro Leu Lys Gly Asn Asp Asn Leu 
        35                  40                  45              
Ile His Lys Arg Ile Asn Glu Tyr Asp Asn Val Leu Glu Leu Ser Lys 
    50                  55                  60                  
Asn Val Ser Ala Gln Asn Ser Gly Asn Glu Phe Ser Tyr Leu Leu Gly 
65                  70                  75                  80  
Tyr Ala Asp Ser Leu Arg Lys Val Gly Met Leu Asp Thr Tyr Ile Lys 
                85                  90                  95      
Ile Val Cys Tyr Leu Thr Ile Gln Ser Arg Tyr Phe Lys Asn Gly Glu 
            100                 105                 110         
Arg Val Lys Leu Phe Glu His Ile Ser Asn Ala Leu Arg Tyr Ser Arg 
        115                 120                 125             
Ser Asp Phe Leu Ile Asn Leu Ile Phe Glu Arg Tyr Ile Glu Tyr Ile 
    130                 135                 140                 
Asn His Leu Lys Leu Ser Pro Lys Gln Lys Asp Phe Tyr Phe Cys Thr 
145                 150                 155                 160 
Lys Phe Ser Lys Phe His Asp Tyr Thr Lys Asn Gly Tyr Lys Tyr Leu 
                165                 170                 175     
Ala Phe Asp Asn Gln Ala Asp Ala Gly Tyr Gly Leu Thr Leu Leu Leu 
            180                 185                 190         
Asn Ala Asn Asp Asp Met Gln Asp Ser Tyr Asn Leu Leu Pro Glu Gln 
        195                 200                 205             
Glu Leu Phe Ile Cys Asn Ala Val Ile Asp Asn Met Asn Ile Tyr Arg 
    210                 215                 220                 
Ser Gln Phe Asn Lys Cys Leu Arg Lys Tyr Asp Leu Ser Glu Ile Thr 
225                 230                 235                 240 
Asp Ile Tyr Pro Asn Lys Ile Ile Leu Gln Gly Ile Lys Phe Asp Lys 
                245                 250                 255     
Lys Lys Asn Val Tyr Gly Lys Asp Leu Val Ser Ile Ile Met Ser Val 
            260                 265                 270         
Phe Asn Ser Glu Asp Thr Ile Ala Tyr Ser Leu His Ser Leu Leu Asn 
        275                 280                 285             
Gln Thr Tyr Glu Asn Ile Glu Ile Leu Val Cys Asp Asp Cys Ser Ser 
    290                 295                 300                 
Asp Lys Ser Leu Glu Ile Ile Lys Ser Ile Ala Tyr Ser Asp Ser Arg 
305                 310                 315                 320 
Val Lys Val Tyr Ser Ser Arg Lys Asn Gln Gly Pro Tyr Asn Ile Arg 
                325                 330                 335     
Asn Glu Leu Ile Lys Lys Ala His Gly Asn Phe Ile Thr Phe Gln Asp 
            340                 345                 350         
Ala Asp Asp Leu Ser His Pro Glu Arg Ile Gln Arg Gln Val Glu Val 
        355                 360                 365             
Leu Arg Asn Asn Lys Ala Val Ile Cys Met Ala Asn Trp Ile Arg Val 
    370                 375                 380                 
Ala Ser Asn Gly Lys Ile Gln Phe Phe Tyr Asp Asp Lys Ala Thr Arg 
385                 390                 395                 400 
Met Ser Val Val Ser Ser Met Ile Lys Lys Asp Ile Phe Ala Thr Val 
                405                 410                 415     
Gly Gly Tyr Arg Gln Ser Leu Ile Gly Ala Asp Thr Glu Phe Tyr Glu 
            420                 425                 430         
Thr Val Ile Met Arg Tyr Gly Arg Glu Ser Ile Val Arg Leu Leu Gln 
        435                 440                 445             
Pro Leu Ile Leu Gly Leu Trp Gly Asp Ser Gly Leu Thr Arg Asn Lys 
    450                 455                 460                 
Gly Thr Glu Ala Leu Pro Asp Gly Tyr Ile Ser Gln Ser Arg Arg Glu 
465                 470                 475                 480 
Tyr Ser Asp Ile Ala Ala Arg Gln Arg Val Leu Gly Lys Ser Ile Val 
                485                 490                 495     
Ser Asp Lys Asp Val Arg Gly Leu Leu Ser Arg Tyr Gly Leu Phe Lys 
            500                 505                 510         
Asp Val Ser Gly Ile Ile Glu Gln 
        515                 520 


<210>  5
<211>  392
<212>  PRT
<213>  Escherichia coli

<400>  5
Met Phe Gly Thr Leu Lys Ile Thr Val Ser Gly Ala Gly Tyr Val Gly 
1               5                   10                  15      
Leu Ser Asn Gly Ile Leu Met Ala Gln Asn His Glu Val Val Ala Phe 
            20                  25                  30          
Asp Thr His Gln Lys Lys Val Asp Leu Leu Asn Asp Lys Leu Ser Pro 
        35                  40                  45              
Ile Glu Asp Lys Glu Ile Glu Asn Tyr Leu Ser Thr Lys Ile Leu Asn 
    50                  55                  60                  
Phe Arg Ala Thr Thr Asn Lys Tyr Glu Ala Tyr Lys Asn Ala Asn Tyr 
65                  70                  75                  80  
Val Ile Ile Ala Thr Pro Thr Asn Tyr Asp Pro Gly Ser Asn Tyr Phe 
                85                  90                  95      
Asp Thr Ser Ser Val Glu Ala Val Ile Arg Asp Val Thr Glu Ile Asn 
            100                 105                 110         
Pro Asn Ala Ile Met Val Val Lys Ser Thr Val Pro Val Gly Phe Thr 
        115                 120                 125             
Lys Thr Ile Lys Glu His Leu Gly Ile Asn Asn Ile Ile Phe Ser Pro 
    130                 135                 140                 
Glu Phe Leu Arg Glu Gly Arg Ala Leu Tyr Asp Asn Leu His Pro Ser 
145                 150                 155                 160 
Arg Ile Ile Ile Gly Glu Cys Ser Glu Arg Ala Glu Arg Leu Ala Val 
                165                 170                 175     
Leu Phe Gln Glu Gly Ala Ile Lys Gln Asn Ile Pro Val Leu Phe Thr 
            180                 185                 190         
Asp Ser Thr Glu Ala Glu Ala Ile Lys Leu Phe Ser Asn Thr Tyr Leu 
        195                 200                 205             
Ala Met Arg Val Ala Phe Phe Asn Glu Leu Asp Ser Tyr Ala Glu Ser 
    210                 215                 220                 
Phe Gly Leu Asn Thr Arg Gln Ile Ile Asp Gly Val Cys Leu Asp Pro 
225                 230                 235                 240 
Arg Ile Gly Asn Tyr Tyr Asn Asn Pro Ser Phe Gly Tyr Gly Gly Tyr 
                245                 250                 255     
Cys Leu Pro Lys Asp Thr Lys Gln Leu Leu Ala Asn Tyr Gln Ser Val 
            260                 265                 270         
Pro Asn Lys Leu Ile Ser Ala Ile Val Asp Ala Asn Arg Thr Arg Lys 
        275                 280                 285             
Asp Phe Ile Thr Asn Val Ile Leu Lys His Arg Pro Gln Val Val Gly 
    290                 295                 300                 
Val Tyr Arg Leu Ile Met Lys Ser Gly Ser Asp Asn Phe Arg Asp Ser 
305                 310                 315                 320 
Ser Ile Leu Gly Ile Ile Lys Arg Ile Lys Lys Lys Gly Val Lys Val 
                325                 330                 335     
Ile Ile Tyr Glu Pro Leu Ile Ser Gly Asp Thr Phe Phe Asn Ser Pro 
            340                 345                 350         
Leu Glu Arg Glu Leu Ala Ile Phe Lys Gly Lys Ala Asp Ile Ile Ile 
        355                 360                 365             
Thr Asn Arg Met Ser Glu Glu Leu Asn Asp Val Val Asp Lys Val Tyr 
    370                 375                 380                 
Ser Arg Asp Leu Phe Lys Cys Asp 
385                 390         


<210>  6
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  6
agctgagtcg acccccagga aaaattggtt aataac                               36


<210>  7
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  7
agctgagcat gcttccaact gcgctaatga cgc                                  33


<210>  8
<211>  313
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  promoter

<400>  8
gcatgcttcc aactgcgcta atgacgcagc tggacgaagg cgggattctc gtcttacccg     60
taggggagga gcaccagtat ttgaaacggg tgcgtcgtcg gggaggcgaa tttattatcg    120
ataccgtgga ggccgtgcgc tttgtccctt tagtgaaggg tgagctggct taaaacgtga    180
ggaaatacct ggatttttcc tggttatttt gccgcaggtc agcgtatcgt gaacatcttt    240
tccagtgttc agtagggtgc cttgcacggt aattatgtca ctggttatta accaattttt    300
cctgggggtc gac                                                       313


<210>  9
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  9
agctgatcta gaaaacagaa tttgcctggc ggc                                  33


<210>  10
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  10
agctgaggat ccaggaagag tttgtagaaa cgc                                  33


<210>  11
<211>  369
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  terminator

<400>  11
tctagaaaca gaatttgcct ggcggcagta gcgcggtggt cccacctgac cccatgccga     60
actcagaagt gaaacgccgt agcgccgatg gtagtgtggg gtctccccat gcgagagtag    120
ggaactgcca ggcatcaaat aaaacgaaag gctcagtcga aagactgggc ctttcgtttt    180
atctgttgtt tgtcggtgaa cgctctcctg agtaggacaa atccgccggg agcggatttg    240
aacgttgcga agcaacggcc cggagggtgg cgggcaggac gcccgccata aactgccagg    300
catcaaatta agcagaaggc catcctgacg gatggccttt ttgcgtttct acaaactctt    360
cctggatcc                                                            369


<210>  12
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  12
ttcctggggg tcgacatgac tacgaaaatt tttaa                                35


<210>  13
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  13
attctgtttt ctagactaag gaaccaacac aagct                                35


<210>  14
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  14
gtcgaccccc aggaaaaatt ggttaataac                                      30


<210>  15
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  15
tctagaaaac agaatttgcc tggcggcagt                                      30


<210>  16
<211>  1980
<212>  DNA
<213>  Flavobacterium heparinum

<400>  16
atgactacga aaatttttaa aaggatcatt gtatttgctg taattgccct atcgtcggga     60
aatatacttg cacaaagctc ttccattacc aggaaagatt ttgaccacat caaccttgag    120
tattccggac tggaaaaggt taataaagca gttgctgccg gcaactatga cgatgcggcc    180
aaagcattac tggcatacta cagggaaaaa agtaaggcca gggaacctga tttcagtaat    240
gcagaaaagc ctgccgatat acgccagccc atagataagg ttacgcgtga aatggccgac    300
aaggctttgg tccaccagtt tcaaccgcac aaaggctacg gctattttga ttatggtaaa    360
gacatcaact ggcagatgtg gccggtaaaa gacaatgaag tacgctggca gttgcaccgt    420
gtaaaatggt ggcaggctat ggccctggtt tatcacgcta cgggcgatga aaaatatgca    480
agagaatggg tatatcagta cagcgattgg gccagaaaaa acccattggg cctgtcgcag    540
gataatgata aatttgtgtg gcggcccctt gaagtgtcgg acagggtaca aagtcttccc    600
ccaaccttca gcttatttgt aaactcgcca gcctttaccc cagccttttt aatggaattt    660
ttaaacagtt accaccaaca ggccgattat ttatctacgc attatgccga acagggaaac    720
caccgtttat ttgaagccca acgcaacttg tttgcagggg tatctttccc tgaatttaaa    780
gattcaccaa gatggaggca aaccggcata tcggtgctga acaccgagat caaaaaacag    840
gtttatgccg atgggatgca gtttgaactt tcaccaattt accatgtagc tgccatcgat    900
atcttcttaa aggcctatgg ttctgcaaaa cgagttaacc ttgaaaaaga atttccgcaa    960
tcttatgtac aaactgtaga aaatatgatt atggcgctga tcagtatttc actgccagat   1020
tataacaccc ctatgtttgg agattcatgg attacagata aaaatttcag gatggcacag   1080
tttgccagct gggcccgggt tttcccggca aaccaggcca taaaatattt tgctacagat   1140
ggcaaacaag gtaaggcgcc taacttttta tccaaagcat tgagcaatgc aggcttttat   1200
acgtttagaa gcggatggga taaaaatgca accgttatgg tattaaaagc cagtcctccc   1260
ggagaatttc atgcccagcc ggataacggg acttttgaac tttttataaa gggcagaaac   1320
tttaccccag acgccggggt atttgtgtat agcggcgacg aagccatcat gaaactgcgg   1380
aactggtacc gtcaaacccg catacacagc acgcttacac tcgacaatca aaatatggtc   1440
attaccaaag cccggcaaaa caaatgggaa acaggaaata accttgatgt gcttacctat   1500
accaacccaa gctatccgaa tctggaccat cagcgcagtg tacttttcat caacaaaaaa   1560
tactttctgg tcatcgatag ggcaataggc gaagctaccg gaaacctggg cgtacactgg   1620
cagcttaaag aagacagcaa ccctgttttc gataagacaa agaaccgggt ttacaccact   1680
tacagagatg gtaacaacct gatgatccaa tcgttgaatg cggacaggac cagcctcaat   1740
gaagaagaag gaaaggtatc ttatgtttac aataaggagc tgaaaagacc tgctttcgta   1800
tttgaaaagc ctaaaaagaa tgccggcaca caaaattttg tcagtatagt ttatccatac   1860
gacggccaga aggctccaga gatcagcata cgggaaaaca agggcaatga ttttgagaaa   1920
ggcaagctta atctaaccct taccattaac ggaaaacaac agcttgtgtt ggttccttag   1980


<210>  17
<211>  659
<212>  PRT
<213>  Flavobacterium heparinum

<400>  17
Met Thr Thr Lys Ile Phe Lys Arg Ile Ile Val Phe Ala Val Ile Ala 
1               5                   10                  15      
Leu Ser Ser Gly Asn Ile Leu Ala Gln Ser Ser Ser Ile Thr Arg Lys 
            20                  25                  30          
Asp Phe Asp His Ile Asn Leu Glu Tyr Ser Gly Leu Glu Lys Val Asn 
        35                  40                  45              
Lys Ala Val Ala Ala Gly Asn Tyr Asp Asp Ala Ala Lys Ala Leu Leu 
    50                  55                  60                  
Ala Tyr Tyr Arg Glu Lys Ser Lys Ala Arg Glu Pro Asp Phe Ser Asn 
65                  70                  75                  80  
Ala Glu Lys Pro Ala Asp Ile Arg Gln Pro Ile Asp Lys Val Thr Arg 
                85                  90                  95      
Glu Met Ala Asp Lys Ala Leu Val His Gln Phe Gln Pro His Lys Gly 
            100                 105                 110         
Tyr Gly Tyr Phe Asp Tyr Gly Lys Asp Ile Asn Trp Gln Met Trp Pro 
        115                 120                 125             
Val Lys Asp Asn Glu Val Arg Trp Gln Leu His Arg Val Lys Trp Trp 
    130                 135                 140                 
Gln Ala Met Ala Leu Val Tyr His Ala Thr Gly Asp Glu Lys Tyr Ala 
145                 150                 155                 160 
Arg Glu Trp Val Tyr Gln Tyr Ser Asp Trp Ala Arg Lys Asn Pro Leu 
                165                 170                 175     
Gly Leu Ser Gln Asp Asn Asp Lys Phe Val Trp Arg Pro Leu Glu Val 
            180                 185                 190         
Ser Asp Arg Val Gln Ser Leu Pro Pro Thr Phe Ser Leu Phe Val Asn 
        195                 200                 205             
Ser Pro Ala Phe Thr Pro Ala Phe Leu Met Glu Phe Leu Asn Ser Tyr 
    210                 215                 220                 
His Gln Gln Ala Asp Tyr Leu Ser Thr His Tyr Ala Glu Gln Gly Asn 
225                 230                 235                 240 
His Arg Leu Phe Glu Ala Gln Arg Asn Leu Phe Ala Gly Val Ser Phe 
                245                 250                 255     
Pro Glu Phe Lys Asp Ser Pro Arg Trp Arg Gln Thr Gly Ile Ser Val 
            260                 265                 270         
Leu Asn Thr Glu Ile Lys Lys Gln Val Tyr Ala Asp Gly Met Gln Phe 
        275                 280                 285             
Glu Leu Ser Pro Ile Tyr His Val Ala Ala Ile Asp Ile Phe Leu Lys 
    290                 295                 300                 
Ala Tyr Gly Ser Ala Lys Arg Val Asn Leu Glu Lys Glu Phe Pro Gln 
305                 310                 315                 320 
Ser Tyr Val Gln Thr Val Glu Asn Met Ile Met Ala Leu Ile Ser Ile 
                325                 330                 335     
Ser Leu Pro Asp Tyr Asn Thr Pro Met Phe Gly Asp Ser Trp Ile Thr 
            340                 345                 350         
Asp Lys Asn Phe Arg Met Ala Gln Phe Ala Ser Trp Ala Arg Val Phe 
        355                 360                 365             
Pro Ala Asn Gln Ala Ile Lys Tyr Phe Ala Thr Asp Gly Lys Gln Gly 
    370                 375                 380                 
Lys Ala Pro Asn Phe Leu Ser Lys Ala Leu Ser Asn Ala Gly Phe Tyr 
385                 390                 395                 400 
Thr Phe Arg Ser Gly Trp Asp Lys Asn Ala Thr Val Met Val Leu Lys 
                405                 410                 415     
Ala Ser Pro Pro Gly Glu Phe His Ala Gln Pro Asp Asn Gly Thr Phe 
            420                 425                 430         
Glu Leu Phe Ile Lys Gly Arg Asn Phe Thr Pro Asp Ala Gly Val Phe 
        435                 440                 445             
Val Tyr Ser Gly Asp Glu Ala Ile Met Lys Leu Arg Asn Trp Tyr Arg 
    450                 455                 460                 
Gln Thr Arg Ile His Ser Thr Leu Thr Leu Asp Asn Gln Asn Met Val 
465                 470                 475                 480 
Ile Thr Lys Ala Arg Gln Asn Lys Trp Glu Thr Gly Asn Asn Leu Asp 
                485                 490                 495     
Val Leu Thr Tyr Thr Asn Pro Ser Tyr Pro Asn Leu Asp His Gln Arg 
            500                 505                 510         
Ser Val Leu Phe Ile Asn Lys Lys Tyr Phe Leu Val Ile Asp Arg Ala 
        515                 520                 525             
Ile Gly Glu Ala Thr Gly Asn Leu Gly Val His Trp Gln Leu Lys Glu 
    530                 535                 540                 
Asp Ser Asn Pro Val Phe Asp Lys Thr Lys Asn Arg Val Tyr Thr Thr 
545                 550                 555                 560 
Tyr Arg Asp Gly Asn Asn Leu Met Ile Gln Ser Leu Asn Ala Asp Arg 
                565                 570                 575     
Thr Ser Leu Asn Glu Glu Glu Gly Lys Val Ser Tyr Val Tyr Asn Lys 
            580                 585                 590         
Glu Leu Lys Arg Pro Ala Phe Val Phe Glu Lys Pro Lys Lys Asn Ala 
        595                 600                 605             
Gly Thr Gln Asn Phe Val Ser Ile Val Tyr Pro Tyr Asp Gly Gln Lys 
    610                 615                 620                 
Ala Pro Glu Ile Ser Ile Arg Glu Asn Lys Gly Asn Asp Phe Glu Lys 
625                 630                 635                 640 
Gly Lys Leu Asn Leu Thr Leu Thr Ile Asn Gly Lys Gln Gln Leu Val 
                645                 650                 655     
Leu Val Pro 
            


<210>  18
<211>  44
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  18
ttcagaattc ggatccaata aatgtagcag cgataaagca attc                      44


<210>  19
<211>  44
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  19
ggccagtgcc aagcttttaa ttgtgttttg cacggctacc tttc                      44


<210>  20
<211>  6646
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  plasmid

<400>  20
ccgacaccat cgaatggtgc aaaacctttc gcggtatggc atgatagcgc ccggaagaga     60
gtcaattcag ggtggtgaat gtgaaaccag taacgttata cgatgtcgca gagtatgccg    120
gtgtctctta tcagaccgtt tcccgcgtgg tgaaccaggc cagccacgtt tctgcgaaaa    180
cgcgggaaaa agtggaagcg gcgatggcgg agctgaatta cattcccaac cgcgtggcac    240
aacaactggc gggcaaacag tcgttgctga ttggcgttgc cacctccagt ctggccctgc    300
acgcgccgtc gcaaattgtc gcggcgatta aatctcgcgc cgatcaactg ggtgccagcg    360
tggtggtgtc gatggtagaa cgaagcggcg tcgaagcctg taaagcggcg gtgcacaatc    420
ttctcgcgca acgcgtcagt gggctgatca ttaactatcc gctggatgac caggatgcca    480
ttgctgtgga agctgcctgc actaatgttc cggcgttatt tcttgatgtc tctgaccaga    540
cacccatcaa cagtattatt ttctcccatg aagacggtac gcgactgggc gtggagcatc    600
tggtcgcatt gggtcaccag caaatcgcgc tgttagcggg cccattaagt tctgtctcgg    660
cgcgtctgcg tctggctggc tggcataaat atctcactcg caatcaaatt cagccgatag    720
cggaacggga aggcgactgg agtgccatgt ccggttttca acaaaccatg caaatgctga    780
atgagggcat cgttcccact gcgatgctgg ttgccaacga tcagatggcg ctgggcgcaa    840
tgcgcgccat taccgagtcc gggctgcgcg ttggtgcgga tatctcggta gtgggatacg    900
acgataccga agacagctca tgttatatcc cgccgttaac caccatcaaa caggattttc    960
gcctgctggg gcaaaccagc gtggaccgct tgctgcaact ctctcagggc caggcggtga   1020
agggcaatca gctgttgccc gtctcactgg tgaaaagaaa aaccaccctg gcgcccaata   1080
cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca cgacaggttt   1140
cccgactgga aagcgggcag tgagcgcaac gcaattaatg taagttagct cactcattag   1200
gcacaattct catgtttgac agcttatcat cgactgcacg gtgcaccaat gcttctggcg   1260
tcaggcagcc atcggaagct gtggtatggc tgtgcaggtc gtaaatcact gcataattcg   1320
tgtcgctcaa ggcgcactcc cgttctggat aatgtttttt gcgccgacat cataacggtt   1380
ctggcaaata ttctgaaatg agctgttgac aattaatcat cggctcgtat aatgtgtgga   1440
attgtgagcg gataacaatt tcacacagga aacagccagt ccgtttaggt gttttcacga   1500
gcacttcacc aacaaggacc atagcatatg aaaatcgaag aaggtaaact ggtaatctgg   1560
attaacggcg ataaaggcta taacggtctc gctgaagtcg gtaagaaatt cgagaaagat   1620
accggaatta aagtcaccgt tgagcatccg gataaactgg aagagaaatt cccacaggtt   1680
gcggcaactg gcgatggccc tgacattatc ttctgggcac acgaccgctt tggtggctac   1740
gctcaatctg gcctgttggc tgaaatcacc ccggacaaag cgttccagga caagctgtat   1800
ccgtttacct gggatgccgt acgttacaac ggcaagctga ttgcttaccc gatcgctgtt   1860
gaagcgttat cgctgattta taacaaagat ctgctgccga acccgccaaa aacctgggaa   1920
gagatcccgg cgctggataa agaactgaaa gcgaaaggta agagcgcgct gatgttcaac   1980
ctgcaagaac cgtacttcac ctggccgctg attgctgctg acgggggtta tgcgttcaag   2040
tatgaaaacg gcaagtacga cattaaagac gtgggcgtgg ataacgctgg cgcgaaagcg   2100
ggtctgacct tcctggttga cctgattaaa aacaaacaca tgaatgcaga caccgattac   2160
tccatcgcag aagctgcctt taataaaggc gaaacagcga tgaccatcaa cggcccgtgg   2220
gcatggtcca acatcgacac cagcaaagtg aattatggtg taacggtact gccgaccttc   2280
aagggtcaac catccaaacc gttcgttggc gtgctgagcg caggtattaa cgccgccagt   2340
ccgaacaaag agctggcaaa agagttcctc gaaaactatc tgctgactga tgaaggtctg   2400
gaagcggtta ataaagacaa accgctgggt gccgtagcgc tgaagtctta cgaggaagag   2460
ttggcgaaag atccacgtat tgccgccact atggaaaacg cccagaaagg tgaaatcatg   2520
ccgaacatcc cgcagatgtc cgctttctgg tatgccgtgc gtactgcggt gatcaacgcc   2580
gccagcggtc gtcagactgt cgatgaagcc ctgaaagacg cgcagactaa ttcgagctcg   2640
aacaacaaca acaataacaa taacaacaac ctcgggatcg agggaaggat ttcagaattc   2700
ggatcctcta gagtcgacct gcaggcaagc ttggcactgg ccgtcgtttt acaacgtcgt   2760
gactgggaaa accctggcgt tacccaactt aatcgccttg cagcacatcc ccctttcgcc   2820
agctggcgta atagcgaaga ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg   2880
aatggcgaat ggcagcttgg ctgttttggc ggatgagata agattttcag cctgatacag   2940
attaaatcag aacgcagaag cggtctgata aaacagaatt tgcctggcgg cagtagcgcg   3000
gtggtcccac ctgaccccat gccgaactca gaagtgaaac gccgtagcgc cgatggtagt   3060
gtggggtctc cccatgcgag agtagggaac tgccaggcat caaataaaac gaaaggctca   3120
gtcgaaagac tgggcctttc gttttatctg ttgtttgtcg gtgaacgctc tcctgagtag   3180
gacaaatccg ccgggagcgg atttgaacgt tgcgaagcaa cggcccggag ggtggcgggc   3240
aggacgcccg ccataaactg ccaggcatca aattaagcag aaggccatcc tgacggatgg   3300
cctttttgcg tttctacaaa ctctttttgt ttatttttct aaatacattc aaatatgtat   3360
ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg   3420
agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt   3480
tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga   3540
gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa   3600
gaacgttctc caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt   3660
gttgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt   3720
gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc   3780
agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga   3840
ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat   3900
cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct   3960
gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc   4020
cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg   4080
gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc   4140
ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg   4200
acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca   4260
ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta gattgattta   4320
ccccggttga taatcagaaa agccccaaaa acaggaagat tgtataagca aatatttaaa   4380
ttgtaaacgt taatattttg ttaaaattcg cgttaaattt ttgttaaatc agctcatttt   4440
ttaaccaata ggccgaaatc ggcaaaatcc cttataaatc aaaagaatag accgagatag   4500
ggttgagtgt tgttccagtt tggaacaaga gtccactatt aaagaacgtg gactccaacg   4560
tcaaagggcg aaaaaccgtc tatcagggcg atggcccact acgtgaacca tcacccaaat   4620
caagtttttt ggggtcgagg tgccgtaaag cactaaatcg gaaccctaaa gggagccccc   4680
gatttagagc ttgacgggga aagccggcga acgtggcgag aaaggaaggg aagaaagcga   4740
aaggagcggg cgctagggcg ctggcaagtg tagcggtcac gctgcgcgta accaccacac   4800
ccgccgcgct taatgcgccg ctacagggcg cgtaaaagga tctaggtgaa gatccttttt   4860
gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc   4920
gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg   4980
caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact   5040
ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt ccttctagtg   5100
tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg   5160
ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac   5220
tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca   5280
cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga   5340
gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc   5400
ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct   5460
gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg   5520
agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct   5580
tttgctcaca tgttctttcc tgcgttatcc cctgattctg tggataaccg tattaccgcc   5640
tttgagtgag ctgataccgc tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc   5700
gaggaagcgg aagagcgcct gatgcggtat tttctcctta cgcatctgtg cggtatttca   5760
caccgcatat atggtgcact ctcagtacaa tctgctctga tgccgcatag ttaagccagt   5820
atacactccg ctatcgctac gtgactgggt catggctgcg ccccgacacc cgccaacacc   5880
cgctgacgcg ccctgacggg cttgtctgct cccggcatcc gcttacagac aagctgtgac   5940
cgtctccggg agctgcatgt gtcagaggtt ttcaccgtca tcaccgaaac gcgcgaggca   6000
gctgcggtaa agctcatcag cgtggtcgtg cagcgattca cagatgtctg cctgttcatc   6060
cgcgtccagc tcgttgagtt tctccagaag cgttaatgtc tggcttctga taaagcgggc   6120
catgttaagg gcggtttttt cctgtttggt cactgatgcc tccgtgtaag ggggatttct   6180
gttcatgggg gtaatgatac cgatgaaacg agagaggatg ctcacgatac gggttactga   6240
tgatgaacat gcccggttac tggaacgttg tgagggtaaa caactggcgg tatggatgcg   6300
gcgggaccag agaaaaatca ctcagggtca atgccagcgc ttcgttaata cagatgtagg   6360
tgttccacag ggtagccagc agcatcctgc gatgcagatc cggaacataa tggtgcaggg   6420
cgctgacttc cgcgtttcca gactttacga aacacggaaa ccgaagacca ttcatgttgt   6480
tgctcaggtc gcagacgttt tgcagcagca gtcgcttcac gttcgctcgc gtatcggtga   6540
ttcattctgc taaccagtaa ggcaaccccg ccagcctagc cgggtcctca acgacaggag   6600
cacgatcatg cgcacccgtg gccaggaccc aacgctgccc gaaatt                  6646


<210>  21
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  21
aagcttggca ctggccgtcg ttttacaacg tcgtg                                35


<210>  22
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  22
ggatccgaat tctgaaatcc ttccctcgat cccga                                35


<210>  23
<211>  1770
<212>  DNA
<213>  Homo sapiens

<400>  23
aataaatgta gcagcgataa agcaattcag tttccgcgtc gtagcagcag cggttttcgt     60
gttgatggtt ttgaaaaacg tgcagcagcc agcgaaagca ataactatat gaatcatgtt    120
gccaaacagc agagcgaaga agcatttccg caagaacagc agaaagcacc gcctgttgtt    180
ggtggtttta atagcaatgt tggtagcaaa gttctgggcc tgaaatatga agaaattgac    240
tgcctgatca acgatgagca taccattaaa ggtcgtcgtg aaggtaatga agtttttctg    300
ccgtttacct gggtggagaa atactttgat gtttatggta aagtggtgca gtatgatggc    360
tatgatcgtt ttgaatttag ccatagctac agcaaagttt atgcacagcg tgcaccgtat    420
catcctgatg gtgtttttat gagctttgag ggctataatg ttgaagttcg tgatcgcgtt    480
aaatgcatta gcggtgttga aggtgttccg ctgagcaccc agtggggtcc gcagggttat    540
ttctatccga ttcagattgc acagtatggc ctgagccatt atagcaaaaa tctgaccgaa    600
aaaccgcctc acattgaagt ttatgaaacc gcagaagatc gcgacaaaaa caaaccgaat    660
gattggaccg ttccgaaagg ttgttttatg gcaaatgttg cagataaaag ccgcttcacc    720
aatgtgaaac agtttattgc accggaaacc agcgaaggtg ttagcctgca gctgggtaat    780
accaaagatt ttatcattag cttcgatctg aaatttctga ccaatggtag cgttagcgtt    840
gttctggaaa ccaccgaaaa aaatcagctg tttaccatcc attatgtgag caatgcccag    900
ctgattgcat ttaaagaacg cgatatctat tatggcattg gtccgcgtac cagttggagc    960
accgttaccc gtgatctggt taccgatctg cgtaaaggtg ttggtctgag caatacaaaa   1020
gcagttaaac cgaccaaaat tatgccgaaa aaagttgttc gtctgatcgc caaaggtaaa   1080
ggttttctgg ataacattac cattagcacc accgcacata tggcagcatt ttttgcagca   1140
agcgattggc tggttcgtaa ccaggatgaa aaaggtggtt ggccgattat ggttacccgt   1200
aaactgggtg aaggttttaa aagcctggaa ccgggttggt atagcgcaat ggcacagggt   1260
caggcaatta gcaccctggt tcgtgcatat ctgctgacca aagatcatat ttttctgaat   1320
agcgcactgc gtgcaaccgc accgtacaaa tttctgtcag aacagcatgg tgttaaagcc   1380
gtgtttatga acaaacacga ttggtatgaa gaatatccga ccaccccgag cagctttgtt   1440
ctgaatggtt ttatgtatag cctgatcggt ctgtacgacc tgaaagaaac agccggtgaa   1500
aaactgggta aagaagcacg tagcctgtac gaacgtggta tggaaagcct gaaagcaatg   1560
ctgccgctgt atgataccgg tagcggcacc atttatgatc tgcgtcattt tatgctgggt   1620
atcgcaccga atctggcacg ttgggattat cataccaccc atattaatca gctgcaactg   1680
ctgagtacca ttgatgaaag tccggtgttt aaagaatttg tgaaacgctg gaaaagctac   1740
ctgaaaggta gccgtgcaaa acacaattaa                                    1770


<210>  24
<211>  589
<212>  PRT
<213>  Homo sapiens

<400>  24
Asn Lys Cys Ser Ser Asp Lys Ala Ile Gln Phe Pro Arg Arg Ser Ser 
1               5                   10                  15      
Ser Gly Phe Arg Val Asp Gly Phe Glu Lys Arg Ala Ala Ala Ser Glu 
            20                  25                  30          
Ser Asn Asn Tyr Met Asn His Val Ala Lys Gln Gln Ser Glu Glu Ala 
        35                  40                  45              
Phe Pro Gln Glu Gln Gln Lys Ala Pro Pro Val Val Gly Gly Phe Asn 
    50                  55                  60                  
Ser Asn Val Gly Ser Lys Val Leu Gly Leu Lys Tyr Glu Glu Ile Asp 
65                  70                  75                  80  
Cys Leu Ile Asn Asp Glu His Thr Ile Lys Gly Arg Arg Glu Gly Asn 
                85                  90                  95      
Glu Val Phe Leu Pro Phe Thr Trp Val Glu Lys Tyr Phe Asp Val Tyr 
            100                 105                 110         
Gly Lys Val Val Gln Tyr Asp Gly Tyr Asp Arg Phe Glu Phe Ser His 
        115                 120                 125             
Ser Tyr Ser Lys Val Tyr Ala Gln Arg Ala Pro Tyr His Pro Asp Gly 
    130                 135                 140                 
Val Phe Met Ser Phe Glu Gly Tyr Asn Val Glu Val Arg Asp Arg Val 
145                 150                 155                 160 
Lys Cys Ile Ser Gly Val Glu Gly Val Pro Leu Ser Thr Gln Trp Gly 
                165                 170                 175     
Pro Gln Gly Tyr Phe Tyr Pro Ile Gln Ile Ala Gln Tyr Gly Leu Ser 
            180                 185                 190         
His Tyr Ser Lys Asn Leu Thr Glu Lys Pro Pro His Ile Glu Val Tyr 
        195                 200                 205             
Glu Thr Ala Glu Asp Arg Asp Lys Asn Lys Pro Asn Asp Trp Thr Val 
    210                 215                 220                 
Pro Lys Gly Cys Phe Met Ala Asn Val Ala Asp Lys Ser Arg Phe Thr 
225                 230                 235                 240 
Asn Val Lys Gln Phe Ile Ala Pro Glu Thr Ser Glu Gly Val Ser Leu 
                245                 250                 255     
Gln Leu Gly Asn Thr Lys Asp Phe Ile Ile Ser Phe Asp Leu Lys Phe 
            260                 265                 270         
Leu Thr Asn Gly Ser Val Ser Val Val Leu Glu Thr Thr Glu Lys Asn 
        275                 280                 285             
Gln Leu Phe Thr Ile His Tyr Val Ser Asn Ala Gln Leu Ile Ala Phe 
    290                 295                 300                 
Lys Glu Arg Asp Ile Tyr Tyr Gly Ile Gly Pro Arg Thr Ser Trp Ser 
305                 310                 315                 320 
Thr Val Thr Arg Asp Leu Val Thr Asp Leu Arg Lys Gly Val Gly Leu 
                325                 330                 335     
Ser Asn Thr Lys Ala Val Lys Pro Thr Lys Ile Met Pro Lys Lys Val 
            340                 345                 350         
Val Arg Leu Ile Ala Lys Gly Lys Gly Phe Leu Asp Asn Ile Thr Ile 
        355                 360                 365             
Ser Thr Thr Ala His Met Ala Ala Phe Phe Ala Ala Ser Asp Trp Leu 
    370                 375                 380                 
Val Arg Asn Gln Asp Glu Lys Gly Gly Trp Pro Ile Met Val Thr Arg 
385                 390                 395                 400 
Lys Leu Gly Glu Gly Phe Lys Ser Leu Glu Pro Gly Trp Tyr Ser Ala 
                405                 410                 415     
Met Ala Gln Gly Gln Ala Ile Ser Thr Leu Val Arg Ala Tyr Leu Leu 
            420                 425                 430         
Thr Lys Asp His Ile Phe Leu Asn Ser Ala Leu Arg Ala Thr Ala Pro 
        435                 440                 445             
Tyr Lys Phe Leu Ser Glu Gln His Gly Val Lys Ala Val Phe Met Asn 
    450                 455                 460                 
Lys His Asp Trp Tyr Glu Glu Tyr Pro Thr Thr Pro Ser Ser Phe Val 
465                 470                 475                 480 
Leu Asn Gly Phe Met Tyr Ser Leu Ile Gly Leu Tyr Asp Leu Lys Glu 
                485                 490                 495     
Thr Ala Gly Glu Lys Leu Gly Lys Glu Ala Arg Ser Leu Tyr Glu Arg 
            500                 505                 510         
Gly Met Glu Ser Leu Lys Ala Met Leu Pro Leu Tyr Asp Thr Gly Ser 
        515                 520                 525             
Gly Thr Ile Tyr Asp Leu Arg His Phe Met Leu Gly Ile Ala Pro Asn 
    530                 535                 540                 
Leu Ala Arg Trp Asp Tyr His Thr Thr His Ile Asn Gln Leu Gln Leu 
545                 550                 555                 560 
Leu Ser Thr Ile Asp Glu Ser Pro Val Phe Lys Glu Phe Val Lys Arg 
                565                 570                 575     
Trp Lys Ser Tyr Leu Lys Gly Ser Arg Ala Lys His Asn 
            580                 585                 


<210>  25
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  25
ttcagaattc ggatcccgtg aaattgaaca gcgtca                               36


<210>  26
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  26
ggccagtgcc aagcttttaa ttgcttttcg gataga                               36


<210>  27
<211>  918
<212>  DNA
<213>  Cricetulus griseus

<400>  27
cgtgaaattg aacagcgtca taccatggat ggtccgcgtc aggatgcagc agttgatgaa     60
gaagaagata tcgtcattat ctataaccgt gttccgaaaa ccgcaagcac cagctttacc    120
aatattgcaa ttgatctgtg cgccaaaaat cgctatcatg tgctgcatat caacaccacc    180
aaaaataacc cggttatgag cctgcaggat caggttcgtt ttgttaaaaa cattaccacc    240
tggaacgaaa tgaaaccggg tttttatcat ggccatatca gctatctgga ttttgcgaaa    300
tttggcgtga aaaaaaaacc gatctacatc aacgttattc gcgatccgat tgaacgtctg    360
gttagctatt attactttct gcgcttcggt gatgattatc gtccgggtct gcgtcgtcgt    420
aaacagggcg acaaaaaaac ctttgatgaa tgtgttgccg aaggtggtag cgattgtgca    480
ccggaaaaac tgtggctgca gattccgttt ttttgcggtc atagcagcga atgttggaat    540
gttggtagcc gttgggcaat ggatcaggcc aaatataacc tgatcaacga atattttctg    600
gtgggtgtga ccgaagaact ggaagatttc attatgctgc tggaagcagc actgcctcgt    660
ttttttcgtg gtgcaaccga tctgtatcgt accggtaaaa aaagccatct gcgtaaaacg    720
acggaaaaaa aactgccgac caaacagacc attgcaaaac tgcagcagag cgatatttgg    780
aaaatggaaa acgagtttta tgaatttgcc ctggaacagt ttcagtttat tcgtgcacat    840
gcagttcgtg aaaaagatgg tgatctgtat attctggccc agaacttctt ctacgaaaaa    900
atctatccga aaagcaat                                                  918


<210>  28
<211>  306
<212>  PRT
<213>  Cricetulus griseus

<400>  28
Arg Glu Ile Glu Gln Arg His Thr Met Asp Gly Pro Arg Gln Asp Ala 
1               5                   10                  15      
Ala Val Asp Glu Glu Glu Asp Ile Val Ile Ile Tyr Asn Arg Val Pro 
            20                  25                  30          
Lys Thr Ala Ser Thr Ser Phe Thr Asn Ile Ala Ile Asp Leu Cys Ala 
        35                  40                  45              
Lys Asn Arg Tyr His Val Leu His Ile Asn Thr Thr Lys Asn Asn Pro 
    50                  55                  60                  
Val Met Ser Leu Gln Asp Gln Val Arg Phe Val Lys Asn Ile Thr Thr 
65                  70                  75                  80  
Trp Asn Glu Met Lys Pro Gly Phe Tyr His Gly His Ile Ser Tyr Leu 
                85                  90                  95      
Asp Phe Ala Lys Phe Gly Val Lys Lys Lys Pro Ile Tyr Ile Asn Val 
            100                 105                 110         
Ile Arg Asp Pro Ile Glu Arg Leu Val Ser Tyr Tyr Tyr Phe Leu Arg 
        115                 120                 125             
Phe Gly Asp Asp Tyr Arg Pro Gly Leu Arg Arg Arg Lys Gln Gly Asp 
    130                 135                 140                 
Lys Lys Thr Phe Asp Glu Cys Val Ala Glu Gly Gly Ser Asp Cys Ala 
145                 150                 155                 160 
Pro Glu Lys Leu Trp Leu Gln Ile Pro Phe Phe Cys Gly His Ser Ser 
                165                 170                 175     
Glu Cys Trp Asn Val Gly Ser Arg Trp Ala Met Asp Gln Ala Lys Tyr 
            180                 185                 190         
Asn Leu Ile Asn Glu Tyr Phe Leu Val Gly Val Thr Glu Glu Leu Glu 
        195                 200                 205             
Asp Phe Ile Met Leu Leu Glu Ala Ala Leu Pro Arg Phe Phe Arg Gly 
    210                 215                 220                 
Ala Thr Asp Leu Tyr Arg Thr Gly Lys Lys Ser His Leu Arg Lys Thr 
225                 230                 235                 240 
Thr Glu Lys Lys Leu Pro Thr Lys Gln Thr Ile Ala Lys Leu Gln Gln 
                245                 250                 255     
Ser Asp Ile Trp Lys Met Glu Asn Glu Phe Tyr Glu Phe Ala Leu Glu 
            260                 265                 270         
Gln Phe Gln Phe Ile Arg Ala His Ala Val Arg Glu Lys Asp Gly Asp 
        275                 280                 285             
Leu Tyr Ile Leu Ala Gln Asn Phe Phe Tyr Glu Lys Ile Tyr Pro Lys 
    290                 295                 300                 
Ser Asn 
305     


<210>  29
<211>  311
<212>  PRT
<213>  Mus musculus

<400>  29
Met Thr Leu Leu Leu Leu Gly Ala Val Leu Leu Val Ala Gln Pro Gln 
1               5                   10                  15      
Leu Val His Ser His Pro Ala Ala Pro Gly Pro Gly Leu Lys Gln Gln 
            20                  25                  30          
Glu Leu Leu Arg Lys Val Ile Ile Leu Pro Glu Asp Thr Gly Glu Gly 
        35                  40                  45              
Thr Ala Ser Asn Gly Ser Thr Gln Gln Leu Pro Gln Thr Ile Ile Ile 
    50                  55                  60                  
Gly Val Arg Lys Gly Gly Thr Arg Ala Leu Leu Glu Met Leu Ser Leu 
65                  70                  75                  80  
His Pro Asp Val Ala Ala Ala Glu Asn Glu Val His Phe Phe Asp Trp 
                85                  90                  95      
Glu Glu His Tyr Ser Gln Gly Leu Gly Trp Tyr Leu Thr Gln Met Pro 
            100                 105                 110         
Phe Ser Ser Pro His Gln Leu Thr Val Glu Lys Thr Pro Ala Tyr Phe 
        115                 120                 125             
Thr Ser Pro Lys Val Pro Glu Arg Ile His Ser Met Asn Pro Thr Ile 
    130                 135                 140                 
Arg Leu Leu Leu Ile Leu Arg Asp Pro Ser Glu Arg Val Leu Ser Asp 
145                 150                 155                 160 
Tyr Thr Gln Val Leu Tyr Asn His Leu Gln Lys His Lys Pro Tyr Pro 
                165                 170                 175     
Pro Ile Glu Asp Leu Leu Met Arg Asp Gly Arg Leu Asn Leu Asp Tyr 
            180                 185                 190         
Lys Ala Leu Asn Arg Ser Leu Tyr His Ala His Met Leu Asn Trp Leu 
        195                 200                 205             
Arg Phe Phe Pro Leu Gly His Ile His Ile Val Asp Gly Asp Arg Leu 
    210                 215                 220                 
Ile Arg Asp Pro Phe Pro Glu Ile Gln Lys Val Glu Arg Phe Leu Lys 
225                 230                 235                 240 
Leu Ser Pro Gln Ile Asn Ala Ser Asn Phe Tyr Phe Asn Lys Thr Lys 
                245                 250                 255     
Gly Phe Tyr Cys Leu Arg Asp Ser Gly Lys Asp Arg Cys Leu His Glu 
            260                 265                 270         
Ser Lys Gly Arg Ala His Pro Gln Val Asp Pro Lys Leu Leu Asp Lys 
        275                 280                 285             
Leu His Glu Tyr Phe His Glu Pro Asn Lys Lys Phe Phe Lys Leu Val 
    290                 295                 300                 
Gly Arg Thr Phe Asp Trp His 
305                 310     


<210>  30
<211>  808
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  3-OST-1 gene optimized for expression in E. coli

<400>  30
gaattcgggc accgcaagca atggtagcac ccagcagctg ccgcagacca ttattatcgg     60
tgttcgtaaa ggtggcaccc gtgcactgct ggaaatgctg agcctgcatc ctgatgttgc    120
agcagcagaa aatgaagtgc atttttttga ttgggaggaa cattatagcc agggtctggg    180
ttggtatctg acccagatgc cgtttagcag tccgcatcag ctgaccgttg aaaaaacacc    240
ggcatatttc accagcccga aagtgccgga acgtattcat agcatgaatc cgaccattcg    300
cctgctgctg attctgcgtg atccgagcga acgtgttctg agcgattata cccaggttct    360
gtataatcat ctgcagaaac ataaaccgta tccgcctatt gaagatctgc tgatgcgtga    420
tggtcgtctg aatctggatt ataaagcact gaatcgtagc ctgtatcatg cccatatgct    480
gaattggctg cgtttttttc cgctgggtca tattcatatt gttgatggtg atcgtctgat    540
tcgtgatccg tttcctgaaa ttcagaaagt ggaacgtttt ctgaaactga gtccgcagat    600
taatgccagc aacttctatt ttaacaaaac caaaggcttc tattgcctgc gtgatagcgg    660
taaagatcgt tgtctgcatg aaagcaaagg tcgtgcacat ccgcaggttg atccgaaact    720
gctggataaa ctgcatgaat attttcatga accgaacaaa aaattcttta aactggtggg    780
tcgtaccttc gattggcatt aagtcgac                                       808


<210>  31
<211>  26
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  31
ataagatctg ctgccgaacc cgccaa                                            26


<210>  32
<211>  59
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  32
ataaagcttg gatccgagct cgaggcggcc gccagggctg catcgacagt ctgacgacc        59


<210>  33
<211>  6556
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  plasmid

<400>  33
ccgacaccat cgaatggtgc aaaacctttc gcggtatggc atgatagcgc ccggaagaga       60

gtcaattcag ggtggtgaat gtgaaaccag taacgttata cgatgtcgca gagtatgccg      120

gtgtctctta tcagaccgtt tcccgcgtgg tgaaccaggc cagccacgtt tctgcgaaaa      180

cgcgggaaaa agtggaagcg gcgatggcgg agctgaatta cattcccaac cgcgtggcac      240

aacaactggc gggcaaacag tcgttgctga ttggcgttgc cacctccagt ctggccctgc      300

acgcgccgtc gcaaattgtc gcggcgatta aatctcgcgc cgatcaactg ggtgccagcg      360

tggtggtgtc gatggtagaa cgaagcggcg tcgaagcctg taaagcggcg gtgcacaatc      420

ttctcgcgca acgcgtcagt gggctgatca ttaactatcc gctggatgac caggatgcca      480

ttgctgtgga agctgcctgc actaatgttc cggcgttatt tcttgatgtc tctgaccaga      540

cacccatcaa cagtattatt ttctcccatg aagacggtac gcgactgggc gtggagcatc      600

tggtcgcatt gggtcaccag caaatcgcgc tgttagcggg cccattaagt tctgtctcgg      660

cgcgtctgcg tctggctggc tggcataaat atctcactcg caatcaaatt cagccgatag      720

cggaacggga aggcgactgg agtgccatgt ccggttttca acaaaccatg caaatgctga      780

atgagggcat cgttcccact gcgatgctgg ttgccaacga tcagatggcg ctgggcgcaa      840

tgcgcgccat taccgagtcc gggctgcgcg ttggtgcgga tatctcggta gtgggatacg      900

acgataccga agacagctca tgttatatcc cgccgttaac caccatcaaa caggattttc      960

gcctgctggg gcaaaccagc gtggaccgct tgctgcaact ctctcagggc caggcggtga     1020

agggcaatca gctgttgccc gtctcactgg tgaaaagaaa aaccaccctg gcgcccaata     1080

cgcaaaccgc ctctccccgc gcgttggccg attcattaat gcagctggca cgacaggttt     1140

cccgactgga aagcgggcag tgagcgcaac gcaattaatg taagttagct cactcattag     1200

gcacaattct catgtttgac agcttatcat cgactgcacg gtgcaccaat gcttctggcg     1260

tcaggcagcc atcggaagct gtggtatggc tgtgcaggtc gtaaatcact gcataattcg     1320

tgtcgctcaa ggcgcactcc cgttctggat aatgtttttt gcgccgacat cataacggtt     1380

ctggcaaata ttctgaaatg agctgttgac aattaatcat cggctcgtat aatgtgtgga     1440

attgtgagcg gataacaatt tcacacagga aacagccagt ccgtttaggt gttttcacga     1500

gcacttcacc aacaaggacc atagcatatg aaaatcgaag aaggtaaact ggtaatctgg     1560

attaacggcg ataaaggcta taacggtctc gctgaagtcg gtaagaaatt cgagaaagat     1620

accggaatta aagtcaccgt tgagcatccg gataaactgg aagagaaatt cccacaggtt     1680

gcggcaactg gcgatggccc tgacattatc ttctgggcac acgaccgctt tggtggctac     1740

gctcaatctg gcctgttggc tgaaatcacc ccggacaaag cgttccagga caagctgtat     1800

ccgtttacct gggatgccgt acgttacaac ggcaagctga ttgcttaccc gatcgctgtt     1860

gaagcgttat cgctgattta taacaaagat ctgctgccga acccgccaaa aacctgggaa     1920

gagatcccgg cgctggataa agaactgaaa gcgaaaggta agagcgcgct gatgttcaac     1980

ctgcaagaac cgtacttcac ctggccgctg attgctgctg acgggggtta tgcgttcaag     2040

tatgaaaacg gcaagtacga cattaaagac gtgggcgtgg ataacgctgg cgcgaaagcg     2100

ggtctgacct tcctggttga cctgattaaa aacaaacaca tgaatgcaga caccgattac     2160

tccatcgcag aagctgcctt taataaaggc gaaacagcga tgaccatcaa cggcccgtgg     2220

gcatggtcca acatcgacac cagcaaagtg aattatggtg taacggtact gccgaccttc     2280

aagggtcaac catccaaacc gttcgttggc gtgctgagcg caggtattaa cgccgccagt     2340

ccgaacaaag agctggcaaa agagttcctc gaaaactatc tgctgactga tgaaggtctg     2400

gaagcggtta ataaagacaa accgctgggt gccgtagcgc tgaagtctta cgaggaagag     2460

ttggcgaaag atccacgtat tgccgccact atggaaaacg cccagaaagg tgaaatcatg     2520

ccgaacatcc cgcagatgtc cgctttctgg tatgccgtgc gtactgcggt gatcaacgcc     2580

gccagcggtc gtcagactgt cgatgcagcc ctggcggccg cctcgagctc ggatccaagc     2640

ttggcactgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt     2700

aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc     2760

gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcagcttgg ctgttttggc     2820

ggatgagata agattttcag cctgatacag attaaatcag aacgcagaag cggtctgata     2880

aaacagaatt tgcctggcgg cagtagcgcg gtggtcccac ctgaccccat gccgaactca     2940

gaagtgaaac gccgtagcgc cgatggtagt gtggggtctc cccatgcgag agtagggaac     3000

tgccaggcat caaataaaac gaaaggctca gtcgaaagac tgggcctttc gttttatctg     3060

ttgtttgtcg gtgaacgctc tcctgagtag gacaaatccg ccgggagcgg atttgaacgt     3120

tgcgaagcaa cggcccggag ggtggcgggc aggacgcccg ccataaactg ccaggcatca     3180

aattaagcag aaggccatcc tgacggatgg cctttttgcg tttctacaaa ctctttttgt     3240

ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg     3300

cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt     3360

cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta     3420

aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc     3480

ggtaagatcc ttgagagttt tcgccccgaa gaacgttctc caatgatgag cacttttaaa     3540

gttctgctat gtggcgcggt attatcccgt gttgacgccg ggcaagagca actcggtcgc     3600

cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt     3660

acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact     3720

gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac     3780

aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata     3840

ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta     3900

ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg     3960

gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat     4020

aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt     4080

aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga     4140

aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa     4200

gtttactcat atatacttta gattgattta ccccggttga taatcagaaa agccccaaaa     4260

acaggaagat tgtataagca aatatttaaa ttgtaaacgt taatattttg ttaaaattcg     4320

cgttaaattt ttgttaaatc agctcatttt ttaaccaata ggccgaaatc ggcaaaatcc     4380

cttataaatc aaaagaatag accgagatag ggttgagtgt tgttccagtt tggaacaaga     4440

gtccactatt aaagaacgtg gactccaacg tcaaagggcg aaaaaccgtc tatcagggcg     4500

atggcccact acgtgaacca tcacccaaat caagtttttt ggggtcgagg tgccgtaaag     4560

cactaaatcg gaaccctaaa gggagccccc gatttagagc ttgacgggga aagccggcga     4620

acgtggcgag aaaggaaggg aagaaagcga aaggagcggg cgctagggcg ctggcaagtg     4680

tagcggtcac gctgcgcgta accaccacac ccgccgcgct taatgcgccg ctacagggcg     4740

cgtaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt     4800

gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat     4860

cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg     4920

gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga     4980

gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca cttcaagaac     5040

tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt     5100

ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag     5160

cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc     5220

gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag     5280

gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca     5340

gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt     5400

cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc     5460

tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc     5520

cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc     5580

cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgcct gatgcggtat     5640

tttctcctta cgcatctgtg cggtatttca caccgcatat atggtgcact ctcagtacaa     5700

tctgctctga tgccgcatag ttaagccagt atacactccg ctatcgctac gtgactgggt     5760

catggctgcg ccccgacacc cgccaacacc cgctgacgcg ccctgacggg cttgtctgct     5820

cccggcatcc gcttacagac aagctgtgac cgtctccggg agctgcatgt gtcagaggtt     5880

ttcaccgtca tcaccgaaac gcgcgaggca gctgcggtaa agctcatcag cgtggtcgtg     5940

cagcgattca cagatgtctg cctgttcatc cgcgtccagc tcgttgagtt tctccagaag     6000

cgttaatgtc tggcttctga taaagcgggc catgttaagg gcggtttttt cctgtttggt     6060

cactgatgcc tccgtgtaag ggggatttct gttcatgggg gtaatgatac cgatgaaacg     6120

agagaggatg ctcacgatac gggttactga tgatgaacat gcccggttac tggaacgttg     6180

tgagggtaaa caactggcgg tatggatgcg gcgggaccag agaaaaatca ctcagggtca     6240

atgccagcgc ttcgttaata cagatgtagg tgttccacag ggtagccagc agcatcctgc     6300

gatgcagatc cggaacataa tggtgcaggg cgctgacttc cgcgtttcca gactttacga     6360

aacacggaaa ccgaagacca ttcatgttgt tgctcaggtc gcagacgttt tgcagcagca     6420

gtcgcttcac gttcgctcgc gtatcggtga ttcattctgc taaccagtaa ggcaaccccg     6480

ccagcctagc cgggtcctca acgacaggag cacgatcatg cgcacccgtg gccaggaccc     6540

aacgctgccc gaaatt                                                     6556


<210>  34
<211>  48
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  34
atagcggccg cgtcttctgg aggcctgaaa tatgaagaaa ttgactgc                    48


<210>  35
<211>  31
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  35
atactcgagt taattgtgtt ttgcacggct a                                      31


<210>  36
<211>  1575
<212>  DNA
<213>  Homo sapiens

<400>  36
ggccgcgtct tctggaggcc tgaaatatga agaaattgac tgcctgatca acgatgagca       60

taccattaaa ggtcgtcgtg aaggtaatga agtttttctg ccgtttacct gggtggagaa      120

atactttgat gtttatggta aagtggtgca gtatgatggc tatgatcgtt ttgaatttag      180

ccatagctac agcaaagttt atgcacagcg tgcaccgtat catcctgatg gtgtttttat      240

gagctttgag ggctataatg ttgaagttcg tgatcgcgtt aaatgcatta gcggtgttga      300

aggtgttccg ctgagcaccc agtggggtcc gcagggttat ttctatccga ttcagattgc      360

acagtatggc ctgagccatt atagcaaaaa tctgaccgaa aaaccgcctc acattgaagt      420

ttatgaaacc gcagaagatc gcgacaaaaa caaaccgaat gattggaccg ttccgaaagg      480

ttgttttatg gcaaatgttg cagataaaag ccgcttcacc aatgtgaaac agtttattgc      540

accggaaacc agcgaaggtg ttagcctgca gctgggtaat accaaagatt ttatcattag      600

cttcgatctg aaatttctga ccaatggtag cgttagcgtt gttctggaaa ccaccgaaaa      660

aaatcagctg tttaccatcc attatgtgag caatgcccag ctgattgcat ttaaagaacg      720

cgatatctat tatggcattg gtccgcgtac cagttggagc accgttaccc gtgatctggt      780

taccgatctg cgtaaaggtg ttggtctgag caatacaaaa gcagttaaac cgaccaaaat      840

tatgccgaaa aaagttgttc gtctgatcgc caaaggtaaa ggttttctgg ataacattac      900

cattagcacc accgcacata tggcagcatt ttttgcagca agcgattggc tggttcgtaa      960

ccaggatgaa aaaggtggtt ggccgattat ggttacccgt aaactgggtg aaggttttaa     1020

aagcctggaa ccgggttggt atagcgcaat ggcacagggt caggcaatta gcaccctggt     1080

tcgtgcatat ctgctgacca aagatcatat ttttctgaat agcgcactgc gtgcaaccgc     1140

accgtacaaa tttctgtcag aacagcatgg tgttaaagcc gtgtttatga acaaacacga     1200

ttggtatgaa gaatatccga ccaccccgag cagctttgtt ctgaatggtt ttatgtatag     1260

cctgatcggt ctgtacgacc tgaaagaaac agccggtgaa aaactgggta aagaagcacg     1320

tagcctgtac gaacgtggta tggaaagcct gaaagcaatg ctgccgctgt atgataccgg     1380

tagcggcacc atttatgatc tgcgtcattt tatgctgggt atcgcaccga atctggcacg     1440

ttgggattat cataccaccc atattaatca gctgcaactg ctgagtacca ttgatgaaag     1500

tccggtgttt aaagaatttg tgaaacgctg gaaaagctac ctgaaaggta gccgtgcaaa     1560

acacaattaa ctcga                                                      1575


<210>  37
<211>  517
<212>  PRT
<213>  Homo sapiens

<400>  37

Gly Leu Lys Tyr Glu Glu Ile Asp Cys Leu Ile Asn Asp Glu His Thr 
1               5                   10                  15      


Ile Lys Gly Arg Arg Glu Gly Asn Glu Val Phe Leu Pro Phe Thr Trp 
            20                  25                  30          


Val Glu Lys Tyr Phe Asp Val Tyr Gly Lys Val Val Gln Tyr Asp Gly 
        35                  40                  45              


Tyr Asp Arg Phe Glu Phe Ser His Ser Tyr Ser Lys Val Tyr Ala Gln 
    50                  55                  60                  


Arg Ala Pro Tyr His Pro Asp Gly Val Phe Met Ser Phe Glu Gly Tyr 
65                  70                  75                  80  


Asn Val Glu Val Arg Asp Arg Val Lys Cys Ile Ser Gly Val Glu Gly 
                85                  90                  95      


Val Pro Leu Ser Thr Gln Trp Gly Pro Gln Gly Tyr Phe Tyr Pro Ile 
            100                 105                 110         


Gln Ile Ala Gln Tyr Gly Leu Ser His Tyr Ser Lys Asn Leu Thr Glu 
        115                 120                 125             


Lys Pro Pro His Ile Glu Val Tyr Glu Thr Ala Glu Asp Arg Asp Lys 
    130                 135                 140                 


Asn Lys Pro Asn Asp Trp Thr Val Pro Lys Gly Cys Phe Met Ala Asn 
145                 150                 155                 160 


Val Ala Asp Lys Ser Arg Phe Thr Asn Val Lys Gln Phe Ile Ala Pro 
                165                 170                 175     


Glu Thr Ser Glu Gly Val Ser Leu Gln Leu Gly Asn Thr Lys Asp Phe 
            180                 185                 190         


Ile Ile Ser Phe Asp Leu Lys Phe Leu Thr Asn Gly Ser Val Ser Val 
        195                 200                 205             


Val Leu Glu Thr Thr Glu Lys Asn Gln Leu Phe Thr Ile His Tyr Val 
    210                 215                 220                 


Ser Asn Ala Gln Leu Ile Ala Phe Lys Glu Arg Asp Ile Tyr Tyr Gly 
225                 230                 235                 240 


Ile Gly Pro Arg Thr Ser Trp Ser Thr Val Thr Arg Asp Leu Val Thr 
                245                 250                 255     


Asp Leu Arg Lys Gly Val Gly Leu Ser Asn Thr Lys Ala Val Lys Pro 
            260                 265                 270         


Thr Lys Ile Met Pro Lys Lys Val Val Arg Leu Ile Ala Lys Gly Lys 
        275                 280                 285             


Gly Phe Leu Asp Asn Ile Thr Ile Ser Thr Thr Ala His Met Ala Ala 
    290                 295                 300                 


Phe Phe Ala Ala Ser Asp Trp Leu Val Arg Asn Gln Asp Glu Lys Gly 
305                 310                 315                 320 


Gly Trp Pro Ile Met Val Thr Arg Lys Leu Gly Glu Gly Phe Lys Ser 
                325                 330                 335     


Leu Glu Pro Gly Trp Tyr Ser Ala Met Ala Gln Gly Gln Ala Ile Ser 
            340                 345                 350         


Thr Leu Val Arg Ala Tyr Leu Leu Thr Lys Asp His Ile Phe Leu Asn 
        355                 360                 365             


Ser Ala Leu Arg Ala Thr Ala Pro Tyr Lys Phe Leu Ser Glu Gln His 
    370                 375                 380                 


Gly Val Lys Ala Val Phe Met Asn Lys His Asp Trp Tyr Glu Glu Tyr 
385                 390                 395                 400 


Pro Thr Thr Pro Ser Ser Phe Val Leu Asn Gly Phe Met Tyr Ser Leu 
                405                 410                 415     


Ile Gly Leu Tyr Asp Leu Lys Glu Thr Ala Gly Glu Lys Leu Gly Lys 
            420                 425                 430         


Glu Ala Arg Ser Leu Tyr Glu Arg Gly Met Glu Ser Leu Lys Ala Met 
        435                 440                 445             


Leu Pro Leu Tyr Asp Thr Gly Ser Gly Thr Ile Tyr Asp Leu Arg His 
    450                 455                 460                 


Phe Met Leu Gly Ile Ala Pro Asn Leu Ala Arg Trp Asp Tyr His Thr 
465                 470                 475                 480 


Thr His Ile Asn Gln Leu Gln Leu Leu Ser Thr Ile Asp Glu Ser Pro 
                485                 490                 495     


Val Phe Lys Glu Phe Val Lys Arg Trp Lys Ser Tyr Leu Lys Gly Ser 
            500                 505                 510         


Arg Ala Lys His Asn 
        515         


<210>  38
<211>  63
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  38
atagcggccg cgcagactaa tgcagcagcg gatgaagaag aagatatcgt cattatctat       60

aac                                                                     63


<210>  39
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer

<400>  39
atactcgagt taattgcttt tcggatagat tttttc                                 36


<210>  40
<211>  897
<212>  DNA
<213>  Cricetulus griseus

<400>  40
ggccgcgcag actaatgcag cagcggatga agaagaagat atcgtcatta tctataaccg       60

tgttccgaaa accgcaagca ccagctttac caatattgca tatgatctgt gcgccaaaaa      120

tcgctatcat gtgctgcata ttaacaccac caaaaataac ccggttatga gcctgcagga      180

tcaggttcgt tttgttaaaa acattaccac ctggaacgaa atgaaaccgg gtttttatca      240

tggccatatc agctatctgg attttgcgaa atttggcgtg aaaaaaaaac cgatctacat      300

caacgttatt cgcgatccga ttgaacgtct ggttagctat tattactttc tgcgcttcgg      360

tgatgattat cgtccgggtc tgcgtcgtcg taaacagggc gacaaaaaaa cctttgatga      420

atgtgttgcc gaaggtggta gcgattgtgc accggaaaaa ctgtggctgc agattccgtt      480

tttttgcggt catagcagcg aatgttggaa tgttggtagc cgttgggcaa tggatcaggc      540

caaatataac ctgatcaacg aatattttct ggtgggtgtg accgaagaac tggaagattt      600

cattatgctg ctggaagcag cactgcctcg tttttttcgt ggtgcaaccg atctgtatcg      660

taccggtaaa aaaagccatc tgcgtaaaac gacggaaaaa aaactgccga ccaaacagac      720

cattgcaaaa ctgcagcaga gcgatatttg gaaaatggaa aacgagtttt atgaatttgc      780

cctggaacag tttcagttta ttcgtgcaca tgcagttcgt gaaaaagatg gtgatctgta      840

tattctggcc cagaacttct tctacgaaaa aatctatccg aaaagcaatt aactcga         897


<210>  41
<211>  288
<212>  PRT
<213>  Cricetulus griseus

<400>  41

Asp Glu Glu Glu Asp Ile Val Ile Ile Tyr Asn Arg Val Pro Lys Thr 
1               5                   10                  15      


Ala Ser Thr Ser Phe Thr Asn Ile Ala Tyr Asp Leu Cys Ala Lys Asn 
            20                  25                  30          


Arg Tyr His Val Leu His Ile Asn Thr Thr Lys Asn Asn Pro Val Met 
        35                  40                  45              


Ser Leu Gln Asp Gln Val Arg Phe Val Lys Asn Ile Thr Thr Trp Asn 
    50                  55                  60                  


Glu Met Lys Pro Gly Phe Tyr His Gly His Ile Ser Tyr Leu Asp Phe 
65                  70                  75                  80  


Ala Lys Phe Gly Val Lys Lys Lys Pro Ile Tyr Ile Asn Val Ile Arg 
                85                  90                  95      


Asp Pro Ile Glu Arg Leu Val Ser Tyr Tyr Tyr Phe Leu Arg Phe Gly 
            100                 105                 110         


Asp Asp Tyr Arg Pro Gly Leu Arg Arg Arg Lys Gln Gly Asp Lys Lys 
        115                 120                 125             


Thr Phe Asp Glu Cys Val Ala Glu Gly Gly Ser Asp Cys Ala Pro Glu 
    130                 135                 140                 


Lys Leu Trp Leu Gln Ile Pro Phe Phe Cys Gly His Ser Ser Glu Cys 
145                 150                 155                 160 


Trp Asn Val Gly Ser Arg Trp Ala Met Asp Gln Ala Lys Tyr Asn Leu 
                165                 170                 175     


Ile Asn Glu Tyr Phe Leu Val Gly Val Thr Glu Glu Leu Glu Asp Phe 
            180                 185                 190         


Ile Met Leu Leu Glu Ala Ala Leu Pro Arg Phe Phe Arg Gly Ala Thr 
        195                 200                 205             


Asp Leu Tyr Arg Thr Gly Lys Lys Ser His Leu Arg Lys Thr Thr Glu 
    210                 215                 220                 


Lys Lys Leu Pro Thr Lys Gln Thr Ile Ala Lys Leu Gln Gln Ser Asp 
225                 230                 235                 240 


Ile Trp Lys Met Glu Asn Glu Phe Tyr Glu Phe Ala Leu Glu Gln Phe 
                245                 250                 255     


Gln Phe Ile Arg Ala His Ala Val Arg Glu Lys Asp Gly Asp Leu Tyr 
            260                 265                 270         


Ile Leu Ala Gln Asn Phe Phe Tyr Glu Lys Ile Tyr Pro Lys Ser Asn 
        275                 280                 285             


