(used for mutation description)
(last modified October 19, 2013)
This file was created to facilitate the description of sequence variants
in the NEB gene based on a coding DNA reference sequence following the
HGVS recommendations. The sequence was taken from NG_009382.2,
covering transcript variant-4 (NM_001271208.1),
containing
all 183 exons. In reality exons 143 and 144 have thus far not been found
together in one transcript (see variant-1 (NM_001164507.1)
and variant-2 (NM_001164508.1)).
Transcript
variant-3 misses exons 82-105 (NM_004543.4).
NOTE: the sequence contains three
identical blocks (exons 82-89,
90-97 and
98-105); variants found here can not be mapped exactly. The
three repeated blocks roughly go from position c.12330+176_13789-968
(81i_89i), c.13789-969_15247-968 (89i_97i) and c.15247-969_16705-1485
(97i_105i). Blocks can be discriminated using intronic variants.
Please note that introns are available by clicking on the exon numbers above the sequence.
(upstream sequence) . . g.5023 aggcatcacgctggcttttgcct c.-181 . . . . . . g.5083 ggggttctcaggaggggagagttgggagaggctttgctgctgaggaaatttatttggtag c.-121 . | 02 . . . . . g.5745 attgaag | gtttgaacgagagctacagaaacgaaagaaaaagtctgtataagccaatggtg c.-61 . . . . | 03 . . g.6331 ttcgggaagaaaataaccccattgccttgag | tttgtaggtgccactactactctggaaaa c.-1 . . . | 04 . . . g.9855 ATGGCAGATGACGAAGACTATGAGGAGGTGGTGGAG | TACTACACAGAAGAAGTGGTTTAC c.60 M A D D E D Y E E V V E | Y Y T E E V V Y p.20 . | 05 . . . . g.11623 GAAGAGGTGCCGGGAGAG | ACAATAACAAAAATTTATGAGACTACGACAACAAGGACATCT c.120 E E V P G E | T I T K I Y E T T T T R T S p.40 . . . . . . g.11683 GACTATGAGCAATCAGAAACTTCCAAACCAGCTCTGGCACAGCCAGCACTGGCACAGCCA c.180 D Y E Q S E T S K P A L A Q P A L A Q P p.60 . . . . . . g.11743 GCATCAGCAAAGCCGGTGGAGAGGAGGAAGGTCATCCGGAAGAAAGTGGATCCTTCAAAG c.240 A S A K P V E R R K V I R K K V D P S K p.80 . . . . . | 06 . g.13933 TTCATGACCCCCTACATTGCACACAGTCAGAAAATGCAGGATCTTTTTAGCCCA | AATAAA c.300 F M T P Y I A H S Q K M Q D L F S P | N K p.100 . . . . . . g.13993 TACAAGGAGAAGTTTGAGAAAACAAAAGGACAGCCATACGCCAGCACAACAGATACTCCA c.360 Y K E K F E K T K G Q P Y A S T T D T P p.120 . . . . | 07 . . g.14544 GAACTTCGCAGAATCAAAAAAGTACAAGATCAACTCAGTGAG | GTTAAGTATCGAATGGAT c.420 E L R R I K K V Q D Q L S E | V K Y R M D p.140 . . . . . . g.14604 GGTGATGTTGCTAAGACTATATGTCACGTAGATGAAAAAGCAAAGGATATTGAACATGCA c.480 G D V A K T I C H V D E K A K D I E H A p.160 . . | 08. . . . g.15156 AAGAAAGTGTCGCAGCAAGTCAGTAAG | GTTTTATACAAGCAGAACTGGGAAGACACCAAG c.540 K K V S Q Q V S K | V L Y K Q N W E D T K p.180 . . . . . . g.15216 GATAAGTACCTGCTTCCTCCTGATGCCCCTGAACTTGTCCAGGCCGTTAAGAACACCGCC c.600 D K Y L L P P D A P E L V Q A V K N T A p.200 . | 09 . . . . . g.16049 ATGTTCAGCAAG | AAACTGTACACTGAAGACTGGGAAGCAGACAAAAGTTTGTTTTACCCC c.660 M F S K | K L Y T E D W E A D K S L F Y P p.220 . . . . . | 10. g.21970 TATAATGATAGCCCGGAACTGAGGAGAGTTGCCCAGGCCCAGAAAGCTCTCAGTGAT | GTT c.720 Y N D S P E L R R V A Q A Q K A L S D | V p.240 . . . . . . g.22030 GCCTACAAAAAAGGTCTCGCTGAACAGCAAGCTCAATTCACGCCTCTGGCTGATCCTCCA c.780 A Y K K G L A E Q Q A Q F T P L A D P P p.260 . . . . | 11 . . g.28967 GATATAGAATTTGCCAAGAAAGTAACCAATCAAGTGAGCAAG | CAAAAATACAAAGAAGAC c.840 D I E F A K K V T N Q V S K | Q K Y K E D p.280 . . . . . . g.29027 TATGAAAATAAAATCAAAGGCAAATGGAGTGAGACACCTTGCTTTGAAGTTGCAAATGCC c.900 Y E N K I K G K W S E T P C F E V A N A p.300 . . | 12. . . . g.29757 AGAATGAATGCTGATAACATTAGCACA | AGGAAATACCAGGAAGATTTTGAAAACATGAAA c.960 R M N A D N I S T | R K Y Q E D F E N M K p.320 . . . . . . g.29817 GACCAGATCTACTTCATGCAGACCGAAACACCAGAGTATAAAATGAATAAAAAAGCTGGT c.1020 D Q I Y F M Q T E T P E Y K M N K K A G p.340 . | 13 . . . . . g.32535 GTGGCAGCTAGCAAG | GTAAAATACAAAGAAGACTATGAAAAGAATAAAGGAAAAGCAGAT c.1080 V A A S K | V K Y K E D Y E K N K G K A D p.360 . . . . . . g.32595 TATAATGTGCTTCCTGCTTCAGAGAACCCACAGCTTAGGCAGCTGAAGGCAGCAGGAGAT c.1140 Y N V L P A S E N P Q L R Q L K A A G D p.380 . | 14 . . . . . g.41887 GCCCTAAGTGAC | AAACTATACAAGGAAAACTATGAAAAGACAAAAGCAAAGAGCATAAAT c.1200 A L S D | K L Y K E N Y E K T K A K S I N p.400 . . . . . | 15. g.42033 TACTGCGAGACCCCCAAATTCAAGCTCGATACTGTTCTGCAGAACTTCAGTAGTGAT | AAA c.1260 Y C E T P K F K L D T V L Q N F S S D | K p.420 . . . . . . g.42093 AAATATAAAGATTCCTACTTAAAAGATATTTTGGGACATTATGTAGGCAGCTTCGAGGAT c.1320 K Y K D S Y L K D I L G H Y V G S F E D p.440 . . . . | 16 . . g.42250 CCATACCATTCACACTGCATGAAAGTCACAGCTCAAAACAGTGAT | AAAAACTACAAAGCA c.1380 P Y H S H C M K V T A Q N S D | K N Y K A p.460 . . . . . . g.42310 GAATACGAAGAAGACAGAGGCAAAGGCTTCTTCCCTCAGACCATAACTCAAGAATATGAA c.1440 E Y E E D R G K G F F P Q T I T Q E Y E p.480 . . . | 17 . . . g.42782 GCAATTAAGAAACTAGATCAGTGTAAAGAC | CACACCTACAAAGTCCATCCAGATAAGACA c.1500 A I K K L D Q C K D | H T Y K V H P D K T p.500 . . . . . . g.42842 AAATTCACCCAAGTTACAGACTCTCCTGTTCTGCTACAAGCCCAAGTCAATTCCAAACAA c.1560 K F T Q V T D S P V L L Q A Q V N S K Q p.520 | 18 . . . . . g.43856 CTGAGTGAC | TTAAATTACAAAGCAAAACATGAAAGTGAAAAGTTCAAGTGCCATATCCCC c.1620 L S D | L N Y K A K H E S E K F K C H I P p.540 . . . . . | 19 . g.44864 CCTGATACTCCTGCTTTTATCCAGCACAAAGTCAATGCCTATAACTTGAGTGAT | AATCTT c.1680 P D T P A F I Q H K V N A Y N L S D | N L p.560 . . . . . . g.44924 TATAAGCAAGACTGGGAGAAGAGCAAAGCCAAAAAGTTTGACATTAAAGTGGATGCCATT c.1740 Y K Q D W E K S K A K K F D I K V D A I p.580 . . . . | 20 . . g.45069 CCCCTGCTGGCAGCCAAAGCCAACACCAAGAACACCAGCGAT | GTGATGTACAAGAAAGAC c.1800 P L L A A K A N T K N T S D | V M Y K K D p.600 . . . . . . g.45129 TATGAAAAAAACAAAGGGAAAATGATTGGAGTCCTCAGCATTAATGACGATCCCAAGATG c.1860 Y E K N K G K M I G V L S I N D D P K M p.620 . . . | 21 . . . g.47149 CTGCACTCCTTGAAGGTGGCCAAAAACCAGAGTGAT | AGATTATACAAGGAAAACTATGAG c.1920 L H S L K V A K N Q S D | R L Y K E N Y E p.640 . . . . . . g.47209 AAGACAAAGGCAAAGAGTATGAATTACTGTGAGACCCCAAAATATCAACTTGATACTCAG c.1980 K T K A K S M N Y C E T P K Y Q L D T Q p.660 . | 22 . . . . g.47363 CTGAAGAACTTCAGTGAG | GCTAGATATAAAGACTTATATGTAAAGGATGTTTTGGGACAT c.2040 L K N F S E | A R Y K D L Y V K D V L G H p.680 . . . . . . g.47423 TATGTAGGCAGCATGGAGGACCCATATCACACACACTGCATGAAAGTTGCAGCTCAAAAC c.2100 Y V G S M E D P Y H T H C M K V A A Q N p.700 | 23 . . . . . . g.47573 AGTGAT | AAAAGTTACAAAGCAGAATATGAAGAAGATAAAGGAAAATGCTATTTCCCTCAG c.2160 S D | K S Y K A E Y E E D K G K C Y F P Q p.720 . . . . . | 24 . g.48671 ACAATAACACAAGAATATGAAGCAATCAAGAAGCTGGACCAGTGTAAAGAT | CATACCTAC c.2220 T I T Q E Y E A I K K L D Q C K D | H T Y p.740 . . . . . . g.48731 AAAGTTCATCCAGATAAGACCAAATTCACGGCAGTCACTGATTCTCCTGTACTGTTGCAA c.2280 K V H P D K T K F T A V T D S P V L L Q p.760 . . . | 25 . . . g.51121 GCCCAGCTCAACACGAAACAGCTTAGTGAT | CTGAATTACAAAGCAAAACATGAAGGTGAG c.2340 A Q L N T K Q L S D | L N Y K A K H E G E p.780 . . . . . . g.51181 AAGTTCAAGTGCCATATACCAGCAGATGCTCCACAGTTTATCCAACACAGAGTCAATGCC c.2400 K F K C H I P A D A P Q F I Q H R V N A p.800 . | 26 . . . . . g.51799 TATAATCTGAGTGAT | AATGTTTATAAGCAAGACTGGGAGAAGAGCAAAGCCAAGAAGTTT c.2460 Y N L S D | N V Y K Q D W E K S K A K K F p.820 . . . . . . g.51859 GACATTAAAGTGGACGCCATTCCCCTGTTGGCAGCCAAAGCCAACACCAAGAACACCAGC c.2520 D I K V D A I P L L A A K A N T K N T S p.840 | 27 . . . . . . g.52012 GAT | GTGATGTACAAGAAAGACTATGAAAAGAGCAAAGGGAAAATGATTGGAGCCCTCAGC c.2580 D | V M Y K K D Y E K S K G K M I G A L S p.860 . . . . . | 28. g.54515 ATTAATGACGATCCAAAGATGCTGCACTCCTTGAAGACAGCCAAAAACCAGAGTGAT | CGC c.2640 I N D D P K M L H S L K T A K N Q S D | R p.880 . . . . . . g.54575 GAATATCGAAAAGATTATGAAAAGTCAAAAACTATCTACACGGCACCTCTTGATATGCTC c.2700 E Y R K D Y E K S K T I Y T A P L D M L p.900 . . . . . . g.54635 CAAGTCACTCAAGCTAAGAAATCTCAGGCAATTGCCAGCGACGTTGATTATAAGCACATC c.2760 Q V T Q A K K S Q A I A S D V D Y K H I p.920 . . . . . . g.54695 TTACACAGTTACAGCTACCCCCCTGATAGCATCAATGTGGACCTTGCCAAGAAGGCATAT c.2820 L H S Y S Y P P D S I N V D L A K K A Y p.940 . | 29 . . . . . g.56763 GCGCTGCAGAGCGAT | GTTGAATACAAAGCTGACTACAATAGCTGGATGAAAGGTTGTGGC c.2880 A L Q S D | V E Y K A D Y N S W M K G C G p.960 . . . . . . g.56823 TGGGTGCCTTTTGGGTCCTTAGAAATGGAAAAGGCAAAGCGAGCTTCAGACATCCTCAAT c.2940 W V P F G S L E M E K A K R A S D I L N p.980 | 30 . . . . . . g.58716 GAG | AAAAAATATCGCCAACATCCAGACACCCTCAAGTTTACCTCGATTGAAGATGCTCCA c.3000 E | K K Y R Q H P D T L K F T S I E D A P p.1000 . . . . | 31 . . g.59483 ATTACAGTACAGTCTAAAATTAACCAGGCCCAGAGGAGTGAT | ATCGCTTACAAAGCCAAA c.3060 I T V Q S K I N Q A Q R S D | I A Y K A K p.1020 . . . . . . g.59543 GGAGAGGAAATTATTCACAAATACAACCTGCCACCAGACCTGCCCCAGTTCATCCAGGCT c.3120 G E E I I H K Y N L P P D L P Q F I Q A p.1040 . . | 32. . . . g.59692 AAAGTTAATGCCTACAATATCAGTGAG | AATATGTACAAAGCAGACTTGAAAGACTTGAGC c.3180 K V N A Y N I S E | N M Y K A D L K D L S p.1060 . . . . . . g.59752 AAGAAGGGATATGACCTGAGAACTGATGCGATTCCCATCAGAGCTGCCAAAGCTGCCAGG c.3240 K K G Y D L R T D A I P I R A A K A A R p.1080 . | 33 . . . . . g.61345 CAGGCGGCGAGTGAC | GTTCAGTACAAAAAAGACTATGAAAAGGCTAAAGGGAAAATGGTT c.3300 Q A A S D | V Q Y K K D Y E K A K G K M V p.1100 . . . . . . g.61405 GGCTTCCAAAGTCTTCAAGATGACCCTAAACTGGTTCATTATATGAACGTGGCCAAGATA c.3360 G F Q S L Q D D P K L V H Y M N V A K I p.1120 . . . . . . g.61465 CAATCAGATCGGGAGTATAAAAAAGACTATGAGAAGACAAAGTCCAAATACAACACGCCC c.3420 Q S D R E Y K K D Y E K T K S K Y N T P p.1140 . . . . . . g.61525 CATGATATGTTCAATGTCGTGGCGGCTAAGAAAGCCCAGGATGTGGTCAGCAATGTCAAC c.3480 H D M F N V V A A K K A Q D V V S N V N p.1160 . . . . . . g.61585 TATAAGCATTCTCTCCATCATTACACCTACTTGCCTGACGCCATGGACCTGGAGCTGTCT c.3540 Y K H S L H H Y T Y L P D A M D L E L S p.1180 . . | 34. . . . g.61749 AAGAACATGATGCAGATACAGAGTGAT | AACGTCTACAAGGAAGACTACAACAACTGGATG c.3600 K N M M Q I Q S D | N V Y K E D Y N N W M p.1200 . . . . . . g.61809 AAAGGCATTGGCTGGATTCCTATTGGCAGTCTCGACGTCGAAAAAGTTAAAAAGGCCGGT c.3660 K G I G W I P I G S L D V E K V K K A G p.1220 . . . . . . g.61869 GATGCTCTGAATGAAAAGAAGTACAGGCAACATCCAGACACCCTCAAATTTACCAGCATT c.3720 D A L N E K K Y R Q H P D T L K F T S I p.1240 . . . . . | 35 . g.64102 GTGGACTCCCCAGTTATGGTCCAGGCAAAACAGAACACGAAGCAAGTCAGTGAT | ATCTTA c.3780 V D S P V M V Q A K Q N T K Q V S D | I L p.1260 . . . . . . g.64162 TACAAGGCTAAAGGAGAAGATGTGAAACATAAATACACCATGAGTCCTGATCTTCCTCAG c.3840 Y K A K G E D V K H K Y T M S P D L P Q p.1280 . . . | 36 . . g.64924 TTTCTCCAGGCCAAGTGCAATGCTTACAATATAAGTGAC | GTCTGTTATAAACGGGATTGG c.3900 F L Q A K C N A Y N I S D | V C Y K R D W p.1300 . . . . . . g.64984 CATGACTTAATAGCCAAGGGCAACAATGTGCTGGGCGATGCTATTCCCATCACTGCAGCC c.3960 H D L I A K G N N V L G D A I P I T A A p.1320 . . | 37. . . . g.66840 AAGGCATCGAGAAACATTGCCAGTGAT | TATAAATACAAGGAAGCTTATGAGAAGTCAAAG c.4020 K A S R N I A S D | Y K Y K E A Y E K S K p.1340 . . . . . . g.66900 GGAAAGCATGTGGGTTTCAGAAGCCTCCAGGATGATCCCAAGCTGGTCCACTATATGAAT c.4080 G K H V G F R S L Q D D P K L V H Y M N p.1360 . . . . . . g.66960 GTGGCAAAGCTGCAGTCTGATCGTGAATACAAGAAGAACTATGAGAACACCAAAACCAGC c.4140 V A K L Q S D R E Y K K N Y E N T K T S p.1380 . . . . . . g.67020 TACCATACCCCTGGGGACATGGTTAGCATCACAGCTGCAAAGATGGCCCAGGATGTCGCT c.4200 Y H T P G D M V S I T A A K M A Q D V A p.1400 . . . . . . g.67080 ACCAATGTCAACTACAAACAGCCATTGCATCATTACACATACCTACCTGACGCCATGAGT c.4260 T N V N Y K Q P L H H Y T Y L P D A M S p.1420 . . . | 38 . . g.68279 CTTGAGCATACGAGGAATGTCAATCAAATTCAGAGTGAT | AATGTGTATAAAGACGAGTAT c.4320 L E H T R N V N Q I Q S D | N V Y K D E Y p.1440 . . . . . . g.68339 AACAGCTTCTTGAAGGGCATCGGATGGATCCCTATTGGTTCCCTGGAGGTGGAGAAGGTC c.4380 N S F L K G I G W I P I G S L E V E K V p.1460 . . . . . . g.68399 AAGAAAGCAGGCGATGCATTAAATGAGAGGAAGTATCGACAGCACCCAGATACCGTCAAG c.4440 K K A G D A L N E R K Y R Q H P D T V K p.1480 . . . . . . g.68459 TTCACAAGTGTGCCTGATTCCATGGGCATGGTGTTGGCTCAGCATAACACAAAGCAGCTA c.4500 F T S V P D S M G M V L A Q H N T K Q L p.1500 | 39 . . . . . . g.70410 AGTGAT | TTGAACTACAAGGTAGAGGGAGAGAAACTGAAGCACAAGTATACTATTGACCCT c.4560 S D | L N Y K V E G E K L K H K Y T I D P p.1520 . . . . . | 40 . g.71585 GAATTGCCTCAGTTTATTCAAGCCAAAGTCAACGCCCTCAACATGAGTGAT | GCTCATTAT c.4620 E L P Q F I Q A K V N A L N M S D | A H Y p.1540 . . . . . . g.71645 AAAGCAGATTGGAAGAAAACCATTGCCAAGGGCTATGATTTGAGACCAGATGCCATCCCA c.4680 K A D W K K T I A K G Y D L R P D A I P p.1560 . . . | 41 . . g.73107 ATTGTTGCTGCAAAAAGTTCAAGGAATATTGCTAGTGAT | TGCAAATATAAGGAGGCCTAC c.4740 I V A A K S S R N I A S D | C K Y K E A Y p.1580 . . . . . . g.73167 GAGAAAGCCAAAGGCAAGCAAGTTGGATTTCTCAGTCTTCAGGATGATCCTAAACTGGTT c.4800 E K A K G K Q V G F L S L Q D D P K L V p.1600 . . . . . . g.73227 CACTACATGAATGTGGCCAAAATCCAGTCTGATCGTGAGTACAAAAAGGGCTATGAAGCC c.4860 H Y M N V A K I Q S D R E Y K K G Y E A p.1620 . . . . . . g.73287 AGCAAGACCAAGTACCACACACCTCTGGATATGGTCAGTGTGACAGCTGCAAAGAAATCT c.4920 S K T K Y H T P L D M V S V T A A K K S p.1640 . . . . . . g.73347 CAGGAGGTTGCCACCAACGCCAACTACAGACAGTCATACCACCACTACACTCTCCTGCCC c.4980 Q E V A T N A N Y R Q S Y H H Y T L L P p.1660 . . . . . | 42 . g.73957 GATGCCTTGAATGTGGAGCACTCCAGGAATGCCATGCAGATTCAGAGTGAT | AATCTGTAC c.5040 D A L N V E H S R N A M Q I Q S D | N L Y p.1680 . . . . . . g.74017 AAATCTGACTTCACCAATTGGATGAAAGGGATCGGCTGGGTGCCCATAGAGTCCCTGGAG c.5100 K S D F T N W M K G I G W V P I E S L E p.1700 . . . . . . g.74077 GTGGAGAAGGCAAAGAAAGCAGGAGAGATTCTTAGTGAGAAGAAGTATCGCCAGCACCCC c.5160 V E K A K K A G E I L S E K K Y R Q H P p.1720 . . . . . . g.74137 GAGAAGCTGAAGTTCACTTACGCCATGGACACAATGGAACAGGCACTTAACAAGAGTAAC c.5220 E K L K F T Y A M D T M E Q A L N K S N p.1740 . | 43 . . . . g.74666 AAACTGAACATGGACAAG | AGGCTCTACACTGAAAAATGGAACAAGGACAAGACCACCATT c.5280 K L N M D K | R L Y T E K W N K D K T T I p.1760 . . . . . . g.74726 CATGTCATGCCTGACACACCGGATATTTTACTCTCCAGAGTAAACCAAATCACCATGAGT c.5340 H V M P D T P D I L L S R V N Q I T M S p.1780 | 44 . . . . . . g.74936 GAT | AAACTGTACAAAGCTGGCTGGGAAGAGGAAAAGAAGAAAGGATATGACCTGAGGCCT c.5400 D | K L Y K A G W E E E K K K G Y D L R P p.1800 . . . . . | 45 . g.75637 GATGCCATTGCAATAAAGGCTGCAAGAGCCTCTAGAGACATTGCCAGTGAT | TACAAATAC c.5460 D A I A I K A A R A S R D I A S D | Y K Y p.1820 . . . . . . g.75697 AAGAAAGCCTATGAACAAGCCAAAGGGAAACACATTGGCTTCCGGAGCCTGGAAGATGAC c.5520 K K A Y E Q A K G K H I G F R S L E D D p.1840 . . . . . . g.75757 CCCAAGCTGGTGCACTTCATGCAAGTGGCCAAGATGCAGTCAGACCGGGAATACAAGAAG c.5580 P K L V H F M Q V A K M Q S D R E Y K K p.1860 . . . . . . g.75817 GGATATGAGAAATCCAAGACCTCCTTCCACACCCCGGTGGACATGCTCAGTGTGGTGGCA c.5640 G Y E K S K T S F H T P V D M L S V V A p.1880 . . . . . . g.75877 GCCAAGAAGTCTCAGGAAGTGGCCACCAATGCCAACTACAGGAACGTGATCCATACCTAC c.5700 A K K S Q E V A T N A N Y R N V I H T Y p.1900 . . . . . . g.75937 AACATGCTTCCTGATGCCATGAGCTTTGAATTGGCCAAAAATATGATGCAGATTCAAAGT c.5760 N M L P D A M S F E L A K N M M Q I Q S p.1920 | 46 . . . . . . g.77203 GAT | AATCAGTACAAGGCTGACTATGCTGACTTCATGAAGGGCATTGGATGGCTCCCTCTG c.5820 D | N Q Y K A D Y A D F M K G I G W L P L p.1940 . . . . . . g.77263 GGCTCCCTGGAAGCAGAGAAAAACAAGAAAGCCATGGAGATTATTAGTGAAAAGAAGTAC c.5880 G S L E A E K N K K A M E I I S E K K Y p.1960 . . . . . . g.77323 CGCCAGCACCCAGACACTTTGAAGTATTCCACACTCATGGACTCGATGAACATGGTTTTG c.5940 R Q H P D T L K Y S T L M D S M N M V L p.1980 . . . | 47 . . . g.80348 GCCCAGAATAATGCAAAAATTATGAACGAA | CATCTCTACAAACAAGCATGGGAGGCTGAC c.6000 A Q N N A K I M N E | H L Y K Q A W E A D p.2000 . . . . . . g.80408 AAAACCAAAGTCCACATCATGCCTGATATCCCCCAGATTATTTTGGCAAAGGCAAATGCA c.6060 K T K V H I M P D I P Q I I L A K A N A p.2020 . | 48 . . . . . g.81442 ATTAATATGAGTGAT | AAACTCTACAAACTTTCCTTGGAAGAGTCTAAAAAGAAAGGCTAT c.6120 I N M S D | K L Y K L S L E E S K K K G Y p.2040 . . . . . . g.81502 GATCTCAGACCTGATGCAATTCCTATCAAAGCTGCCAAGGCTTCCAGAGATATTGCAAGT c.6180 D L R P D A I P I K A A K A S R D I A S p.2060 | 49 . . . . . . g.83080 GAT | TATAAATACAAGTACAATTATGAAAAAGGGAAGGGGAAAATGGTTGGTTTCCGCAGT c.6240 D | Y K Y K Y N Y E K G K G K M V G F R S p.2080 . . . . . . g.83140 CTCGAGGATGATCCCAAATTAGTCCATTCCATGCAAGTGGCTAAGATGCAATCTGATCGG c.6300 L E D D P K L V H S M Q V A K M Q S D R p.2100 . . . . . . g.83200 GAGTACAAGAAAAACTATGAGAACACAAAGACCAGCTACCACACCCCTGCCGACATGCTC c.6360 E Y K K N Y E N T K T S Y H T P A D M L p.2120 . . . . . . g.83260 AGTGTCACGGCTGCAAAGGATGCCCAAGCCAACATCACCAACACTAACTACAAGCACCTG c.6420 S V T A A K D A Q A N I T N T N Y K H L p.2140 . . . . . . g.83320 ATTCACAAGTACATCCTCCTTCCAGATGCAATGAACATTGAGCTGACCAGGAATATGAAT c.6480 I H K Y I L L P D A M N I E L T R N M N p.2160 . | 50 . . . . . g.83509 CGCATACAGAGTGAT | AATGAATATAAGCAAGATTACAATGAATGGTACAAAGGGCTTGGC c.6540 R I Q S D | N E Y K Q D Y N E W Y K G L G p.2180 . . . . . . g.83569 TGGAGTCCAGCAGGTTCTCTGGAAGTGGAGAAGGCCAAGAAAGCAACTGAATATGCCAGT c.6600 W S P A G S L E V E K A K K A T E Y A S p.2200 . . . . . . g.83629 GATCAGAAATACCGCCAGCACCCGAGCAACTTCCAGTTTAAGAAGCTGACTGATTCCATG c.6660 D Q K Y R Q H P S N F Q F K K L T D S M p.2220 . . . . | 51 . . g.84131 GACATGGTGCTTGCCAAGCAGAATGCACATACCATGAACAAG | CATTTATACACCATTGAT c.6720 D M V L A K Q N A H T M N K | H L Y T I D p.2240 . . . . . . g.84191 TGGAATAAAGATAAGACCAAGATTCATGTGATGCCTGATACACCAGATATTTTACAAGCC c.6780 W N K D K T K I H V M P D T P D I L Q A p.2260 . . | 52. . . . g.85421 AAGCAGAATCAAACACTGTATAGTCAG | AAACTCTATAAACTTGGATGGGAAGAAGCTTTG c.6840 K Q N Q T L Y S Q | K L Y K L G W E E A L p.2280 . . . . . . g.85481 AAGAAAGGCTATGATCTCCCAGTTGATGCAATTTCTGTACAGCTAGCTAAAGCTTCAAGA c.6900 K K G Y D L P V D A I S V Q L A K A S R p.2300 . | 53 . . . . . g.88647 GACATTGCTAGTGAT | TATAAATACAAACAAGGCTACCGAAAGCAACTTGGCCACCATGTT c.6960 D I A S D | Y K Y K Q G Y R K Q L G H H V p.2320 . . . . . . g.88707 GGATTCCGGAGTCTGCAAGATGACCCAAAACTTGTGTTGTCCATGAATGTAGCCAAAATG c.7020 G F R S L Q D D P K L V L S M N V A K M p.2340 . . . . . . g.88767 CAGAGTGAAAGAGAATACAAGAAGGACTTTGAGAAGTGGAAAACTAAGTTCTCCAGCCCA c.7080 Q S E R E Y K K D F E K W K T K F S S P p.2360 . . . . . . g.88827 GTGGACATGTTGGGAGTGGTACTGGCCAAGAAGTGTCAGGAGTTGGTTAGTGACGTGGAC c.7140 V D M L G V V L A K K C Q E L V S D V D p.2380 . . . . . . g.88887 TACAAGAACTACCTGCATCAGTGGACATGTCTGCCTGATCAGAACGATGTTGTGCAAGCT c.7200 Y K N Y L H Q W T C L P D Q N D V V Q A p.2400 . . | 54. . . . g.89141 AAGAAAGTTTATGAACTGCAAAGTGAG | AATCTATATAAATCTGACCTTGAGTGGCTGAGA c.7260 K K V Y E L Q S E | N L Y K S D L E W L R p.2420 . . . . . . g.89201 GGCATAGGATGGAGTCCCTTGGGTTCTTTAGAGGCAGAAAAGAACAAGCGGGCTTCGGAA c.7320 G I G W S P L G S L E A E K N K R A S E p.2440 . . . . . . g.89261 ATCATCAGTGAGAAGAAATATCGTCAGCCTCCAGACAGAAACAAGTTCACCAGCATTCCT c.7380 I I S E K K Y R Q P P D R N K F T S I P p.2460 . . . . . | 55 . g.93262 GATGCCATGGATATAGTTCTGGCAAAGACAAATGCCAAAAATAGGAGTGAT | AGACTTTAT c.7440 D A M D I V L A K T N A K N R S D | R L Y p.2480 . . . . . . g.93322 AGAGAAGCTTGGGACAAAGACAAGACTCAGATCCACATCATGCCTGATACACCTGACATT c.7500 R E A W D K D K T Q I H I M P D T P D I p.2500 . . . | 56 . . . g.94936 GTTCTGGCTAAAGCAAACTTAATCAACACAAGTGAT | AAACTCTACCGAATGGGTTATGAG c.7560 V L A K A N L I N T S D | K L Y R M G Y E p.2520 . . . . . . g.94996 GAGCTGAAGAGAAAAGGTTACGATCTTCCTGTTGATGCCATACCAATCAAAGCAGCAAAA c.7620 E L K R K G Y D L P V D A I P I K A A K p.2540 . . | 57 . . . . g.95394 GCCTCCCGGGAAATTGCCAGTGAA | TACAAGTACAAGGAAGGCTTTCGCAAGCAGCTCGGC c.7680 A S R E I A S E | Y K Y K E G F R K Q L G p.2560 . . . . . . g.95454 CACCACATTGGTGCCCGGAACATTGAAGATGACCCCAAGATGATGTGGTCCATGCATGTG c.7740 H H I G A R N I E D D P K M M W S M H V p.2580 . . . . . . g.95514 GCCAAGATCCAGAGTGACAGGGAGTACAAGAAGGACTTTGAGAAGTGGAAGACCAAGTTC c.7800 A K I Q S D R E Y K K D F E K W K T K F p.2600 . . . . . . g.95574 AGCAGCCCAGTGGACATGCTGGGGGTGGTGTTGGCCAAGAAGTGCCAGACCTTAGTCAGC c.7860 S S P V D M L G V V L A K K C Q T L V S p.2620 . . . . . . g.95634 GACGTGGACTACAAGAACTACCTGCACCAGTGGACATGCCTGCCCGACCAGAGCGATGTC c.7920 D V D Y K N Y L H Q W T C L P D Q S D V p.2640 . . . | 58 . . . g.96158 ATCCATGCTCGGCAGGCCTATGACCTCCAGAGCGAT | AATTTGTACAAGTCAGACCTTCAG c.7980 I H A R Q A Y D L Q S D | N L Y K S D L Q p.2660 . . . . . . g.96218 TGGCTAAAAGGCATTGGCTGGATGACTAGTGGTTCTCTCGAGGATGAGAAAAATAAACGA c.8040 W L K G I G W M T S G S L E D E K N K R p.2680 . . . . . . g.96278 GCCACCCAGATTTTGAGTGACCATGTTTACCGTCAGCACCCAGATCAATTTAAGTTTTCC c.8100 A T Q I L S D H V Y R Q H P D Q F K F S p.2700 . . . . . . g.96338 AGCCTTATGGATTCCATACCAATGGTTTTGGCAAAAAACAATGCTATTACCATGAATCAT c.8160 S L M D S I P M V L A K N N A I T M N H p.2720 | 59 . . . . . . g.96678 | CGCCTCTATACAGAAGCTTGGGATAAAGATAAAACCACTGTCCACATTATGCCAGATACC c.8220 | R L Y T E A W D K D K T T V H I M P D T p.2740 . . . . | 60 . . g.96821 CCTGAAGTTTTATTAGCTAAACAAAACAAAGTAAATTACAGTGAG | AAATTGTATAAGCTT c.8280 P E V L L A K Q N K V N Y S E | K L Y K L p.2760 . . . . . . g.96881 GGCCTAGAAGAAGCCAAGAGGAAAGGTTATGACATGCGGGTAGATGCCATTCCTATCAAG c.8340 G L E E A K R K G Y D M R V D A I P I K p.2780 . . . | 61 . . . g.98848 GCAGCCAAGGCCTCCAGAGATATTGCAAGTGAA | TTCAAGTACAAAGAAGGCTATCGTAAG c.8400 A A K A S R D I A S E | F K Y K E G Y R K p.2800 . . . . . . g.98908 CAGCTCGGCCACCACATTGGTGCCCGAGCTATACGTGATGACCCCAAGATGATGTGGTCC c.8460 Q L G H H I G A R A I R D D P K M M W S p.2820 . . . . . . g.98968 ATGCACGTGGCCAAGATCCAGAGTGACAGGGAGTACAAGAAGGACTTTGAGAAGTGGAAG c.8520 M H V A K I Q S D R E Y K K D F E K W K p.2840 . . . . . . g.99028 ACCAAGTTCAGCAGCCCAGTGGACATGCTGGGGGTGGTGCTGGCCAAGAAGTGCCAGACC c.8580 T K F S S P V D M L G V V L A K K C Q T p.2860 . . . . . . g.99088 TTAGTCAGCGATGTGGACTACAAGAACTACCTGCACCAGTGGACATGCCTGCCCGACCAG c.8640 L V S D V D Y K N Y L H Q W T C L P D Q p.2880 . . . . | 62 . . g.99442 AGCGACGTCATCCATGCTCGGCAGGCCTATGACCTCCAGAGCGAT | AATATGTACAAGTCT c.8700 S D V I H A R Q A Y D L Q S D | N M Y K S p.2900 . . . . . . g.99502 GATCTCCAGTGGATGAGAGGCATTGGCTGGGTGTCCATTGGCTCTTTGGATGTGGAAAAA c.8760 D L Q W M R G I G W V S I G S L D V E K p.2920 . . . . . . g.99562 TGCAAAAGGGCAACTGAAATTTTGAGTGATAAAATCTATCGCCAGCCTCCAGACAGATTC c.8820 C K R A T E I L S D K I Y R Q P P D R F p.2940 . . . . . . g.99622 AAATTTACCAGTGTGACTGACTCTCTGGAACAAGTGTTGGCCAAAAATAATGCCATCACT c.8880 K F T S V T D S L E Q V L A K N N A I T p.2960 | 63 . . . . . g.100154 ATGAACAAG | CGTTTATACACAGAAGCCTGGGACAAAGACAAGACTCAGATCCACATAATG c.8940 M N K | R L Y T E A W D K D K T Q I H I M p.2980 . . . . . | 64 . g.103159 CCAGACACACCAGAAATTATGTTGGCAAGAATGAACAAAATCAACTACAGTGAG | AGTCTG c.9000 P D T P E I M L A R M N K I N Y S E | S L p.3000 . . . . . . g.103219 TACAAACTTGCTAATGAAGAAGCAAAGAAGAAAGGCTATGACTTGCGAAGCGACGCCATC c.9060 Y K L A N E E A K K K G Y D L R S D A I p.3020 . . . . | 65 . . g.105540 CCCATCGTGGCGGCCAAGGCCTCCAGGGACATCATCAGTGAC | TACAAATATAAAGATGGT c.9120 P I V A A K A S R D I I S D | Y K Y K D G p.3040 . . . . . . g.105600 TACTGCAAGCAACTTGGCCACCATATTGGAGCCCGGAACATTGAAGATGACCCCAAGATG c.9180 Y C K Q L G H H I G A R N I E D D P K M p.3060 . . . . . . g.105660 ATGTGGTCCATGCACGTAGCCAAGATCCAGAGTGACAGGGAGTACAAAAAGGACTTTGAG c.9240 M W S M H V A K I Q S D R E Y K K D F E p.3080 . . . . . . g.105720 AAGTGGAAGACCAAGTTCAGCAGCCCAGTGGACATGCTGGGGGTGGTGCTGGCCAAGAAG c.9300 K W K T K F S S P V D M L G V V L A K K p.3100 . . . . . . g.105780 TGCCAGACCTTAGTCAGTGACGTGGACTATAAGAACTACCTGCACGAGTGGACATGCCTG c.9360 C Q T L V S D V D Y K N Y L H E W T C L p.3120 . . . . . | 66 . g.108147 CCTGACCAGAGCGATGTCATCCATGCTCGGCAGGCCTATGACCTCCAGAGTGAC | AATATT c.9420 P D Q S D V I H A R Q A Y D L Q S D | N I p.3140 . . . . . . g.108207 TACAAGTCAGATCTCCAGTGGCTGAGAGGCATTGGCTGGGTCCCCATTGGGTCTATGGAT c.9480 Y K S D L Q W L R G I G W V P I G S M D p.3160 . . . . . . g.108267 GTGGTCAAGTGCAAGAGAGCTACCGAAATACTGAGTGATAACATCTACCGCCAGCCTCCG c.9540 V V K C K R A T E I L S D N I Y R Q P P p.3180 . . . . . . g.108327 GACAAGCTGAAATTTACCAGTGTGACTGATTCTCTAGAGCAGGTGCTGGCCAAGAACAAT c.9600 D K L K F T S V T D S L E Q V L A K N N p.3200 . | 67 . . . . g.108710 GCTCTCAACATGAATAAG | CGTTTATACACAGAGGCCTGGGACAAAGACAAGACTCAAATT c.9660 A L N M N K | R L Y T E A W D K D K T Q I p.3220 . . . . . . g.108770 CACATAATGCCTGATACACCAGAGATTATGTTGGCAAGGCAGAACAAAATCAACTACAGT c.9720 H I M P D T P E I M L A R Q N K I N Y S p.3240 | 68 . . . . . . g.109898 GAG | ACTCTATACAAACTTGCCAATGAAGAAGCAAAAAAGAAAGGCTACGACTTGCGAAGT c.9780 E | T L Y K L A N E E A K K K G Y D L R S p.3260 . . . . . | 69 . g.111662 GACGCCATCCCCATCGTGGCTGCCAAGGCCTCCAGGGACGTTATCAGTGAT | TACAAATAC c.9840 D A I P I V A A K A S R D V I S D | Y K Y p.3280 . . . . . . g.111722 AAAGATGGTTACCGCAAGCAGCTCGGCCACCACATTGGAGCCCGGAACATTGAAGATGAC c.9900 K D G Y R K Q L G H H I G A R N I E D D p.3300 . . . . . . g.111782 CCCAAGATGATGTGGTCCATGCATGTGGCCAAGATCCAGAGTGACAGGGAGTATAAGAAG c.9960 P K M M W S M H V A K I Q S D R E Y K K p.3320 . . . . . . g.111842 GACTTTGAGAAGTGGAAGACCAAGTTCAGCAGCCCAGTGGACATGCTGGGAGTGGTGTTA c.10020 D F E K W K T K F S S P V D M L G V V L p.3340 . . . . . . g.111902 GCCAAGAAGTGCCAGACCTTAGTCAGCGATGTGGACTACAAGAACTACCTGCACGAGTGG c.10080 A K K C Q T L V S D V D Y K N Y L H E W p.3360 . . . . . . g.111962 ACGTGCCTGCCCGACCAGAATGATGTCATCCATGCTCGGCAGGCCTATGACCTCCAGAGC c.10140 T C L P D Q N D V I H A R Q A Y D L Q S p.3380 | 70 . . . . . . g.112339 GAT | AACATTTACAAATCTGATCTCCAGTGGCTGAGAGGCATTGGCTGGGTCCCCATTGGG c.10200 D | N I Y K S D L Q W L R G I G W V P I G p.3400 . . . . . . g.112399 TCTATGGATGTGGTCAAGTGCAAGAGAGCTGCTGAAATACTGAGTGATAACATCTACCGC c.10260 S M D V V K C K R A A E I L S D N I Y R p.3420 . . . . . . g.112459 CAGCCTCCGGACAAGCTGAAATTTACCAGTGTGACTGACTCTCTAGAGCAGGTGCTGGCC c.10320 Q P P D K L K F T S V T D S L E Q V L A p.3440 . . | 71. . . . g.113882 AAGAACAATGCTCTCAATATGAACAAG | CGCTTATACACAGAAGCCTGGGACAAAGACAAG c.10380 K N N A L N M N K | R L Y T E A W D K D K p.3460 . . . . . . g.113942 ACCCAAGTCCATATTATGCCTGATACACCTGAAATCATGTTGGCAAGACAAAATAAAATA c.10440 T Q V H I M P D T P E I M L A R Q N K I p.3480 . | 72 . . . . . g.118509 AATTATAGTGAG | AGCCTCTATCGTCAGGCCATGGAAGAAGCCAAGAAAGAAGGCTATGAC c.10500 N Y S E | S L Y R Q A M E E A K K E G Y D p.3500 . . . . . . g.118569 TTGAGAAGTGATGCCATTCCCATTGTGGCTGCCAAGGCCTCTCGGGATATTGCCAGTGAT c.10560 L R S D A I P I V A A K A S R D I A S D p.3520 | 73 . . . . . . g.119785 | TACAAATACAAAGAAGCATATCGTAAACAGTTGGGTCACCACATTGGCGCCCGAGCAGTA c.10620 | Y K Y K E A Y R K Q L G H H I G A R A V p.3540 . . . . . . g.119845 CACGATGACCCCAAGATAATGTGGTCCCTCCACATTGCCAAAGTGCAGAGTGACCGTGAG c.10680 H D D P K I M W S L H I A K V Q S D R E p.3560 . . . . . . g.119905 TACAAGAAAGATTTTGAGAAATACAAGACAAGGTACAGCAGCCCAGTGGACATGCTTGGT c.10740 Y K K D F E K Y K T R Y S S P V D M L G p.3580 . . . . . . g.119965 ATCGTTTTGGCCAAGAAGTGTCAGACCTTGGTCAGCGATGTGGACTATAAACATCCTCTG c.10800 I V L A K K C Q T L V S D V D Y K H P L p.3600 . . . . . . g.120025 CATGAATGGATCTGCCTGCCCGACCAGAATGACATCATTCATGCACGGAAAGCCTATGAC c.10860 H E W I C L P D Q N D I I H A R K A Y D p.3620 . | 74 . . . . . g.121057 CTCCAGAGTGAC | AATTTGTATAAGTCAGACCTTGAATGGATGAAAGGCATTGGCTGGGTT c.10920 L Q S D | N L Y K S D L E W M K G I G W V p.3640 . . . . . . g.121117 CCGATTGATTCCTTGGAAGTTGTTAGGGCCAAGAGAGCTGGAGAATTACTTAGTGATACT c.10980 P I D S L E V V R A K R A G E L L S D T p.3660 . . . . . . g.121177 ATCTACCGTCAGCGTCCAGAAACGCTGAAATTTACCAGTATAACGGACACTCCGGAGCAG c.11040 I Y R Q R P E T L K F T S I T D T P E Q p.3680 . . . | 75 . . . g.122043 GTGCTGGCAAAAAACAATGCTTTAAACATGAATAAG | CGCTTATATACTGAAGCCTGGGAC c.11100 V L A K N N A L N M N K | R L Y T E A W D p.3700 . . . . . . g.122103 AATGACAAGAAAACTATTCATGTCATGCCTGATACACCAGAAATCATGTTAGCCAAACTC c.11160 N D K K T I H V M P D T P E I M L A K L p.3720 . . | 76 . . . . g.123417 AACCGAATAAACTACAGTGAT | AAACTCTATAAACTTGCTTTGGAAGAGTCCAAGAAGGAA c.11220 N R I N Y S D | K L Y K L A L E E S K K E p.3740 . . . . . . g.123477 GGCTATGACTTGCGTCTGGATGCCATTCCAATCCAAGCAGCCAAGGCTTCAAGAGATATT c.11280 G Y D L R L D A I P I Q A A K A S R D I p.3760 | 77 . . . . . g.124951 GCTAGTGAT | TACAAGTACAAGGAAGGCTACCGCAAACAGCTTGGCCACCATATTGGGGCC c.11340 A S D | Y K Y K E G Y R K Q L G H H I G A p.3780 . . . . . . g.125011 CGGAACATTAAGGATGACCCGAAGATGATGTGGTCCATCCATGTGGCCAAGATCCAGAGT c.11400 R N I K D D P K M M W S I H V A K I Q S p.3800 . . . . . . g.125071 GACAGGGAGTACAAGAAGGAGTTTGAGAAGTGGAAGACCAAGTTCAGCAGCCCAGTGGAC c.11460 D R E Y K K E F E K W K T K F S S P V D p.3820 . . . . . . g.125131 ATGCTGGGGGTGGTGCTGGCCAAGAAGTGTCAGATCCTTGTAAGCGACATAGACTACAAG c.11520 M L G V V L A K K C Q I L V S D I D Y K p.3840 . . . . . . g.125191 CATCCCCTGCATGAATGGACCTGCCTGCCTGATCAGAATGACGTCATTCAGGCTCGGAAG c.11580 H P L H E W T C L P D Q N D V I Q A R K p.3860 . . | 78 . . . . g.127137 GCCTATGACCTGCAGAGTGAT | GCTATTTACAAATCTGATCTTGAGTGGCTGAGAGGCATA c.11640 A Y D L Q S D | A I Y K S D L E W L R G I p.3880 . . . . . . g.127197 GGATGGGTTCCCATTGGCTCTGTAGAGGTCGAGAAAGTGAAGAGAGCTGGAGAAATCCTG c.11700 G W V P I G S V E V E K V K R A G E I L p.3900 . . . . . . g.127257 AGTGACAGGAAGTATCGCCAGCCTGCAGACCAGCTCAAATTCACATGCATTACCGACACT c.11760 S D R K Y R Q P A D Q L K F T C I T D T p.3920 . . . . | 79 . . g.128636 CCGGAAATTGTCCTAGCAAAGAATAATGCCCTGACAATGAGCAAG | CATTTATACACAGAA c.11820 P E I V L A K N N A L T M S K | H L Y T E p.3940 . . . . . . g.128696 GCTTGGGATGCTGACAAAACCTCCATCCACGTGATGCCAGACACCCCAGATATCCTGCTG c.11880 A W D A D K T S I H V M P D T P D I L L p.3960 . . . | 80 . . . g.128894 GCCAAGAGTAATTCTGCCAATATCAGCCAA | AAACTTTACACCAAGGGATGGGATGAATCA c.11940 A K S N S A N I S Q | K L Y T K G W D E S p.3980 . . . . . . g.128954 AAGATGAAGGACTATGATCTGAGAGCAGATGCTATTTCCATCAAAAGTGCCAAGGCCTCC c.12000 K M K D Y D L R A D A I S I K S A K A S p.4000 . | 81 . . . . g.129409 AGGGACATCGCCAGTGAC | TACAAATACAAGGAAGCCTATGAGAAACAGAAAGGCCACCAC c.12060 R D I A S D | Y K Y K E A Y E K Q K G H H p.4020 . . . . . . g.129469 ATTGGAGCCCAGAGCATTGAAGATGATCCCAAGATTATGTGTGCCATACATGCAGGAAAA c.12120 I G A Q S I E D D P K I M C A I H A G K p.4040 . . . . . . g.129529 ATTCAAAGTGAAAGGGAGTACAAGAAGGAATTCCAAAAGTGGAAAACCAAGTTCTCTAGC c.12180 I Q S E R E Y K K E F Q K W K T K F S S p.4060 . . . . . . g.129589 CCAGTGGACATGTTAAGCATCTTGCTGGCCAAGAAATGTCAGACTTTGGTCACTGACATT c.12240 P V D M L S I L L A K K C Q T L V T D I p.4080 . . . . . . g.129649 GATTATCGCAATTACCTGCATGAATGGACATGCATGCCGGATCAAAACGACATTATCCAA c.12300 D Y R N Y L H E W T C M P D Q N D I I Q p.4100 . . . | 82 . . . g.130841 GCAAAAAAGGCCTATGACCTGCAGAGTGAT | AGTGTGTATAAGGCAGACCTGGAGTGGCTG c.12360 A K K A Y D L Q S D | S V Y K A D L E W L p.4120 ^ sequence block 1 . . . . . . g.130901 CGTGGCATCGGCTGGATGCCAGAAGGCTCAGTGGAAATGAACAGAGTGAAGGTTGCTCAA c.12420 R G I G W M P E G S V E M N R V K V A Q p.4140 . . . . . . g.130961 GACCTCGTGAATGAAAGACTCTATAGGACACGTCCAGAAGCTTTGTCATTCACCAGCATT c.12480 D L V N E R L Y R T R P E A L S F T S I p.4160 . . . . . | 83 . g.131885 GTCGACACTCCAGAAGTTGTCTTGGCAAAAGCCAATTCTCTGCAAATAAGTGAG | AAACTG c.12540 V D T P E V V L A K A N S L Q I S E | K L p.4180 . . . . . . g.131945 TATCAGGAAGCCTGGAATAAAGATAAAAGCAACATCACCATTCCTTCTGATACTCCGGAG c.12600 Y Q E A W N K D K S N I T I P S D T P E p.4200 . . . | 84 . . g.132795 ATGCTGCAGGCCCACATCAATGCCTTGCAAATCAGCAAT | AAACTCTACCAAAAAGACTGG c.12660 M L Q A H I N A L Q I S N | K L Y Q K D W p.4220 . . . . . . g.132855 AATGACGCCAAGCAGAAAGGCTATGACATAAGGGCAGATGCCATTGAAATCAAGCACGCC c.12720 N D A K Q K G Y D I R A D A I E I K H A p.4240 . . | 85. . . . g.134649 AAGGCCTCCAGAGAAATTGCCAGTGAG | TACAAATACAAAGAAGGTTACCGTAAGCAACTG c.12780 K A S R E I A S E | Y K Y K E G Y R K Q L p.4260 . . . . . . g.134709 GGCCACCACATGGGTTTCCGCACCCTACAAGATGACCCCAAGTCAGTATGGGCTATACAT c.12840 G H H M G F R T L Q D D P K S V W A I H p.4280 . . . . . . g.134769 GCTGCCAAGATCCAGAGTGACAGAGAATATAAGAAAGCTTATGAGAAGTCTAAAGGAATT c.12900 A A K I Q S D R E Y K K A Y E K S K G I p.4300 . . . . . . g.134829 CACAACACACCGTTGGACATGATGTCAATTGTTCAAGCCAAGAAATGCCAGGTCCTGGTT c.12960 H N T P L D M M S I V Q A K K C Q V L V p.4320 . . . . . . g.134889 AGCGACATTGATTATCGCAATTATCTGCACCAGTGGACGTGTCTGCCAGATCAGAACGAT c.13020 S D I D Y R N Y L H Q W T C L P D Q N D p.4340 . . . | 86 . . g.135736 GTGATCCAGGCCAAGAAAGCCTACGACCTGCAGAGCGAT | AACTTGTACAAGTCAGACCTG c.13080 V I Q A K K A Y D L Q S D | N L Y K S D L p.4360 . . . . . . g.135796 GAATGGCTGAAGGGTATTGGATGGCTGCCAGAGGGTTCTGTGGAAGTGATGAGAGTGAAG c.13140 E W L K G I G W L P E G S V E V M R V K p.4380 . . . . . . g.135856 AATGCCCAGAATCTCTTGAATGAAAGGCTGTATCGTATAAAGCCTGAGGCCCTCAAATTC c.13200 N A Q N L L N E R L Y R I K P E A L K F p.4400 . . . . . . g.135916 ACCAGCATTGTTGACACCCCGGAAGTAATCCAGGCAAAGATCAATGCTGTACAGATCAGT c.13260 T S I V D T P E V I Q A K I N A V Q I S p.4420 | 87 . . . . . . g.136853 GAG | CCATTGTATCGCGATGCCTGGGAGAAAGAGAAGGCTAATGTGAACGTGCCAGCTGAC c.13320 E | P L Y R D A W E K E K A N V N V P A D p.4440 . . . . | 88 . g.137491 ACTCCCCTGATGCTGCAATCCAAAATCAATGCCCTGCAGATCAGCAAT | AAACGCTATCAG c.13380 T P L M L Q S K I N A L Q I S N | K R Y Q p.4460 . . . . . . g.137551 CAAGCTTGGGAAGATGTCAAGATGACTGGTTATGACCTGCGAGCAGATGCCATTGGGATC c.13440 Q A W E D V K M T G Y D L R A D A I G I p.4480 . . . | 89 . . . g.138758 CAGCATGCCAAGGCTTCCAGGGATATTGCCAGTGAC | TATCTGTACAAAACTGCTTATGAG c.13500 Q H A K A S R D I A S D | Y L Y K T A Y E p.4500 . . . . . . g.138818 AAACAGAAAGGCCATTACATTGGCTGTCGCAGCGCCAAGGAAGACCCTAAACTGGTTTGG c.13560 K Q K G H Y I G C R S A K E D P K L V W p.4520 . . . . . . g.138878 GCAGCAAATGTGTTGAAGATGCAGAATGACAGGCTGTACAAAAAGGCCTACAACGACCAC c.13620 A A N V L K M Q N D R L Y K K A Y N D H p.4540 . . . . . . g.138938 AAAGCCAAGATCTCCATCCCTGTGGACATGGTGTCCATCAGCGCTGCCAAAGAAGGTCAG c.13680 K A K I S I P V D M V S I S A A K E G Q p.4560 . . . . . . g.138998 GCACTGGCAAGTGATGTGGACTATCGCCATTACCTGCACCACTGGTCTTGCTTTCCCGAC c.13740 A L A S D V D Y R H Y L H H W S C F P D p.4580 . . . . | 90 . g.141369 CAGAATGATGTGATCCAAGCCAGGAAAGCCTACGACCTACAGAGCGAC | AGTGTGTATAAG c.13800 Q N D V I Q A R K A Y D L Q S D | S V Y K p.4600 ^ sequence block 2 . . . . . . g.141429 GCAGACCTGGAGTGGCTGCGTGGCATCGGCTGGATGCCAGAAGGCTCAGTGGAAATGAAC c.13860 A D L E W L R G I G W M P E G S V E M N p.4620 . . . . . . g.141489 AGAGTGAAGGTTGCTCAAGACCTCGTGAATGAAAGACTCTATAGGACACGTCCAGAAGCT c.13920 R V K V A Q D L V N E R L Y R T R P E A p.4640 . . . . . . g.141549 TTGTCATTCACCAGCATTGTCGACACTCCAGAAGTTGTCTTGGCAAAAGCCAATTCTCTG c.13980 L S F T S I V D T P E V V L A K A N S L p.4660 . | 91 . . . . . g.142473 CAAATAAGTGAG | AAACTGTATCAGGAAGCCTGGAATAAAGATAAAAGCAACATCACCATT c.14040 Q I S E | K L Y Q E A W N K D K S N I T I p.4680 . . . . . | 92. g.143323 CCTTCTGATACTCCGGAGATGCTGCAGGCCCACATCAATGCCTTGCAAATCAGCAAT | AAA c.14100 P S D T P E M L Q A H I N A L Q I S N | K p.4700 . . . . . . g.143383 CTCTACCAAAAAGACTGGAATGACACCAAGCAGAAAGGCTATGACATAAGGGCAGATGCC c.14160 L Y Q K D W N D T K Q K G Y D I R A D A p.4720 . . . . | 93 . . g.145184 ATTGAAATCAAGCACGCCAAGGCCTCCAGAGAAATTGCCAGTGAG | TACAAATACAAAGAA c.14220 I E I K H A K A S R E I A S E | Y K Y K E p.4740 . . . . . . g.145244 GGTTACCGTAAGCAACTGGGCCACCACATGGGTTTCCGCACCCTACAAGATGACCCCAAG c.14280 G Y R K Q L G H H M G F R T L Q D D P K p.4760 . . . . . . g.145304 TCAGTATGGGCTATACATGCTGCCAAGATCCAGAGTGACAGAGAATATAAGAAAGCTTAT c.14340 S V W A I H A A K I Q S D R E Y K K A Y p.4780 . . . . . . g.145364 GAGAAGTCTAAAGGAATTCACAACACACCGTTGGACATGATGTCAATTGTTCAAGCCAAG c.14400 E K S K G I H N T P L D M M S I V Q A K p.4800 . . . . . . g.145424 AAATGCCAGGTCCTGGTTAGCGACATTGATTATCGCAATTATCTGCACCAGTGGACGTGT c.14460 K C Q V L V S D I D Y R N Y L H Q W T C p.4820 . . . . . | 94. g.146271 CTGCCAGATCAGAACGATGTGATCCAGGCCAAGAAAGCCTACGACCTGCAGAGCGAT | AAC c.14520 L P D Q N D V I Q A K K A Y D L Q S D | N p.4840 . . . . . . g.146331 TTGTACAAGTCAGACCTGGAATGGCTGAAGGGTATTGGATGGTTGCCAGAGGGTTCTGTG c.14580 L Y K S D L E W L K G I G W L P E G S V p.4860 . . . . . . g.146391 GAAGTGATGAGAGTGAAGAATGCCCAGAATCTCTTGAATGAAAGGCTGTATCGTATAAAG c.14640 E V M R V K N A Q N L L N E R L Y R I K p.4880 . . . . . . g.146451 CCTGAGGCCCTCAAATTCACCAGCATTGTTGACACCCCGGAAGTAATCCAGGCAAAGATC c.14700 P E A L K F T S I V D T P E V I Q A K I p.4900 . . | 95 . . . . g.147388 AATGCTGTACAGATCAGTGAG | CCATTGTATCGCAATGCCTGGGAGAAAGAGAAGGCTAAT c.14760 N A V Q I S E | P L Y R N A W E K E K A N p.4920 . . . . . . g.147448 GTGAACGTGCCAGCTGACACTCCCCTGATGCTGCAATCCAAAATCAATGCTCTGCAGATC c.14820 V N V P A D T P L M L Q S K I N A L Q I p.4940 | 96 . . . . . . g.148086 AGCAAT | AAACGCTATCAGCAAGCTTGGGAAGATGTCAAGATGACTGGTTATGACCTGCGA c.14880 S N | K R Y Q Q A W E D V K M T G Y D L R p.4960 . . . . . | 97 . g.149293 GCAGATGCCATTGGGATCCAGCATGCCAAGGCTTCCAGGGATATTGCCAGTGAT | TATCTG c.14940 A D A I G I Q H A K A S R D I A S D | Y L p.4980 . . . . . . g.149353 TACAAAACTGCTTATGAGAAACAGAAAGGCCATTACATTGGCTGTCGCAGCGCCAAGGAA c.15000 Y K T A Y E K Q K G H Y I G C R S A K E p.5000 . . . . . . g.149413 GACCCTAAACTGGTTTGGGCAGCAAATGTGTTGAAGATGCAGAATGACAGGCTGTACAAA c.15060 D P K L V W A A N V L K M Q N D R L Y K p.5020 . . . . . . g.149473 AAGGCCTACAACGACCACAAAGCCAAGATCTCCATCCCTGTGGACATGGTGTCCATCAGC c.15120 K A Y N D H K A K I S I P V D M V S I S p.5040 . . . . . . g.149533 GCTGCCAAAGAAGGTCAGGCACTGGCAAGTGATGTGGACTATCGCCATTACCTGCACCAC c.15180 A A K E G Q A L A S D V D Y R H Y L H H p.5060 . . . . . . g.149593 TGGTCTTGCTTTCCCGACCAGAATGATGTGATCCAAGCCAGGAAAGCCTACGACCTACAG c.15240 W S C F P D Q N D V I Q A R K A Y D L Q p.5080 | 98 . . . . . . g.151964 AGCGAC | AGTGTGTATAAGGCAGACCTGGAGTGGCTGCGTGGCATCGGCTGGATGCCAGAA c.15300 S D | S V Y K A D L E W L R G I G W M P E p.5100 ^ sequence block 3 . . . . . . g.152024 GGCTCAGTGGAAATGAACAGAGTGAAGGTTGCTCAAGACCTCGTGAATGAAAGACTCTAT c.15360 G S V E M N R V K V A Q D L V N E R L Y p.5120 . . . . . . g.152084 AGGACACGTCCAGAAGCTTTGTCATTCACCAGCATTGTCGACACTCCAGAAGTTGTCTTG c.15420 R T R P E A L S F T S I V D T P E V V L p.5140 . . . | 99 . . . g.153008 GCAAAAGCCAATTCTCTGCAAATAAGTGAG | AAACTGTATCAGGAAGCCTGGAATAAAGAT c.15480 A K A N S L Q I S E | K L Y Q E A W N K D p.5160 . . . . . . g.153068 AAAAGCAACATCACCATTCCTTCTGATACTCCGGAGATGCTGCAGGCCCACATCAATGCC c.15540 K S N I T I P S D T P E M L Q A H I N A p.5180 . | 100 . . . . . g.153918 TTGCAAATCAGCAAT | AAACTCTACCAAAAAGACTGGAATGACACCAAGCAGAAAGGCTAT c.15600 L Q I S N | K L Y Q K D W N D T K Q K G Y p.5200 . . . . . . g.153978 GACATAAGGGCAGATGCCATTGAAATCAAGCACGCCAAGGCCTCCAGAGAAATTGCCAGT c.15660 D I R A D A I E I K H A K A S R E I A S p.5220 | 101 . . . . . . g.155778 GAG | TACAAATACAAAGAAGGTTACCGTAAGCAACTGGGCCACCACATGGGTTTCCGCACC c.15720 E | Y K Y K E G Y R K Q L G H H M G F R T p.5240 . . . . . . g.155838 CTACAAGATGACCCCAAGTCAGTATGGGCTATACATGCTGCCAAGATCCAGAGTGACAGA c.15780 L Q D D P K S V W A I H A A K I Q S D R p.5260 . . . . . . g.155898 GAATATAAGAAAGCTTATGAGAAGTCTAAAGGAATTCACAACACACCGTTGGACATGATG c.15840 E Y K K A Y E K S K G I H N T P L D M M p.5280 . . . . . . g.155958 TCAATTGTTCAAGCCAAGAAATGCCAGGTCCTGGTTAGCGACATTGATTATCGCAATTAT c.15900 S I V Q A K K C Q V L V S D I D Y R N Y p.5300 . . . . . . g.156018 CTGCACCAGTGGACGTGTCTGCCAGATCAGAACGATGTGATCCAGGCCAAGAAAGCCTAC c.15960 L H Q W T C L P D Q N D V I Q A K K A Y p.5320 . | 102 . . . . . g.156865 GACCTGCAGAGCGAT | AACTTGTACAAGTCAGACCTGGAATGGCTGAAGGGTATTGGATGG c.16020 D L Q S D | N L Y K S D L E W L K G I G W p.5340 . . . . . . g.156925 TTGCCAGAGGGTTCTGTGGAAGTGATGAGAGTGAAGAATGCCCAGAATCTCTTGAATGAA c.16080 L P E G S V E V M R V K N A Q N L L N E p.5360 . . . . . . g.156985 AGGCTGTATCGTATAAAGCCTGAGGCCCTCAAATTCACCAGCATTGTTGACACCCCGGAA c.16140 R L Y R I K P E A L K F T S I V D T P E p.5380 . . . | 103 . . g.157921 GTAATCCAGGCAAAGATCAATGCTGTACAGATCAGTGAG | CCATTGTATCGCGATGCCTGG c.16200 V I Q A K I N A V Q I S E | P L Y R D A W p.5400 . . . . . . g.157981 GAGAAAGAGAAGGCTAATGTGAACGTGCCAGCTGACACTCCCCTGATGCTGCAATCCAAA c.16260 E K E K A N V N V P A D T P L M L Q S K p.5420 . . | 104 . . . . g.158619 ATCAATGCCCTGCAGATCAGCAAT | AAACGCTATCAGCAAGCTTGGGAAGATGTCAAGATG c.16320 I N A L Q I S N | K R Y Q Q A W E D V K M p.5440 . . . . . . g.158679 ACTGGTTATGACCTGCGAGCAGATGCCATTGGGATCCAGCATGCCAAGGCTTCCAGGGAT c.16380 T G Y D L R A D A I G I Q H A K A S R D p.5460 . | 105 . . . . . g.159886 ATTGCCAGTGAC | TATCTGTACAAAACTGCTTATGAGAAACAGAAAGGCCATTACATTGGC c.16440 I A S D | Y L Y K T A Y E K Q K G H Y I G p.5480 . . . . . . g.159946 TGTCGCAGCGCCAAGGAAGACCCTAAACTGGTTTGGGCAGCAAATGTGTTGAAGATGCAG c.16500 C R S A K E D P K L V W A A N V L K M Q p.5500 . . . . . . g.160006 AATGACAGGCTGTACAAAAAGGCCTACAACGACCACAAAGCCAAGATCTCCATCCCTGTG c.16560 N D R L Y K K A Y N D H K A K I S I P V p.5520 . . . . . . g.160066 GACATGGTGTCCATCAGCGCTGCCAAAGAAGGTCAGGCACTGGCAAGTGATGTGGACTAT c.16620 D M V S I S A A K E G Q A L A S D V D Y p.5540 . . . . . . g.160126 CGCCATTACCTGCACCGCTGGTCTTGCTTTCCCGACCAGAATGATGTGATCCAAGCCAGG c.16680 R H Y L H R W S C F P D Q N D V I Q A R p.5560 . . | 106 . . . . g.163169 AAAGCCTACGACCTACAGAGCGAC | GCCCTCTACAAGGCTGACTTGGAGTGGTTGCGTGGC c.16740 K A Y D L Q S D | A L Y K A D L E W L R G p.5580 . . . . . . g.163229 ATTGGCTGGATGCCCCAAGGGTCTCCTGAAGTGTTGAGAGTCAAAAACGCCCAGAATATC c.16800 I G W M P Q G S P E V L R V K N A Q N I p.5600 . . . . . . g.163289 TTTTGTGACAGTGTCTATCGGACGCCTGTGGTGAACCTTAAGTACACAAGCATTGTTGAC c.16860 F C D S V Y R T P V V N L K Y T S I V D p.5620 . . . . | 107 . g.163700 ACACCTGAAGTGGTCCTTGCTAAATCAAATGCTGAAAATATTAGTATT | CCAAAGTACAGA c.16920 T P E V V L A K S N A E N I S I | P K Y R p.5640 . . . . . . g.163760 GAGGTTTGGGACAAGGATAAAACTTCAATACACATAATGCCAGATACTCCAGAAATTAAT c.16980 E V W D K D K T S I H I M P D T P E I N p.5660 . . . | 108 . . . g.168913 CTCGCTAGAGCAAATGCTCTTAATGTGAGCAAT | AAACTTTACCGTGAGGGCTGGGATGAA c.17040 L A R A N A L N V S N | K L Y R E G W D E p.5680 . . . . . . g.168973 ATGAAGGCGGGCTGTGATGTCCGGCTGGATGCCATCCCCATCCAGGCTGCCAAGGCCTCC c.17100 M K A G C D V R L D A I P I Q A A K A S p.5700 . | 109 . . . . g.169137 AGGGAGATTGCCAGTGAC | TATAAATATAAGCTTGACCATGAGAAGCAGAAGGGACACTAC c.17160 R E I A S D | Y K Y K L D H E K Q K G H Y p.5720 . . . . . . g.169197 GTGGGCACCCTCACAGCCAGGGATGACAACAAGATCCGCTGGGCCCTCATAGCTGACAAG c.17220 V G T L T A R D D N K I R W A L I A D K p.5740 . . . . . . g.169257 CTCCAGAATGAACGAGAGTACCGGCTGGACTGGGCCAAATGGAAGGCCAAGATCCAGAGC c.17280 L Q N E R E Y R L D W A K W K A K I Q S p.5760 . . . . . . g.169317 CCTGTGGACATGCTTTCCATCCTGCACTCTAAAAATTCCCAGGCTCTGGTCAGTGACATG c.17340 P V D M L S I L H S K N S Q A L V S D M p.5780 . . . . . . g.169377 GATTACCGCAATTACCTGCACCAGTGGACCTGCATGCCCGACCAGAACGATGTGATTCAG c.17400 D Y R N Y L H Q W T C M P D Q N D V I Q p.5800 . . . | 110 . . . g.170145 GCCAAGAAGGCCTACGAACTGCAGAGCGAT | AATGTTTACAAGGCTGACTTGGAATGGTTG c.17460 A K K A Y E L Q S D | N V Y K A D L E W L p.5820 . . . . . . g.170205 CGTGGAATTGGGTGGATGCCAAATGACTCCGTGTCCGTCAATCATGCCAAACATGCCGCG c.17520 R G I G W M P N D S V S V N H A K H A A p.5840 . | 111 . . . . . g.170816 GACATCTTCAGTGAG | AAAAAATATCGCACAAAAATAGAAACTCTCAACTTTACGCCTGTG c.17580 D I F S E | K K Y R T K I E T L N F T P V p.5860 . . . . . | 112 . g.171076 GATGACAGAGTTGATTATGTGACAGCGAAACAAAGTGGCGAGATCCTCGATGAT | ATTAAA c.17640 D D R V D Y V T A K Q S G E I L D D | I K p.5880 . . . . . . g.171136 TACCGGAAAGACTGGAATGCCACCAAATCAAAGTACACCCTCACAGAAACCCCCCTGCTG c.17700 Y R K D W N A T K S K Y T L T E T P L L p.5900 . . . | 113. . . g.171333 CACACTGCCCAGGAGGCTGCTAGGATACTGGACCAG | TATCTCTACAAGGAAGGCTGGGAG c.17760 H T A Q E A A R I L D Q | Y L Y K E G W E p.5920 . . . . . . g.171393 AGACAAAAAGCCACAGGTTACATTTTGCCTCCAGATGCTGTGCCATTTGTTCATGCCCAT c.17820 R Q K A T G Y I L P P D A V P F V H A H p.5940 . . | 114 . . . . g.172044 CACTGCAATGACGTTCAGAGTGAG | CTGAAATACAAAGCTGAACATGTGAAGCAAAAAGGT c.17880 H C N D V Q S E | L K Y K A E H V K Q K G p.5960 . . . . . . g.172104 CATTATGTTGGTGTCCCGACGATGAGAGATGATCCTAAGCTGGTTTGGTTTGAGCATGCA c.17940 H Y V G V P T M R D D P K L V W F E H A p.5980 . . . . . . g.172164 GGCCAGATTCAGAATGAGAGACTATACAAAGAGGACTATCACAAAACAAAGGCCAAAATC c.18000 G Q I Q N E R L Y K E D Y H K T K A K I p.6000 . . . . . . g.172224 AATATACCTGCTGATATGGTGTCAGTCTTGGCCGCCAAGCAGGGGCAGACCCTTGTCAGT c.18060 N I P A D M V S V L A A K Q G Q T L V S p.6020 . . . . . . g.172284 GATATTGATTATCGTAATTACTTGCACCAATGGATGTGTCATCCTGACCAGAACGATGTT c.18120 D I D Y R N Y L H Q W M C H P D Q N D V p.6040 . . . | 115. . . g.173691 ATTCAGGCAAGAAAGGCCTATGACCTACAGAGTGAT | AATGTCTACAGAGCTGACCTGGAG c.18180 I Q A R K A Y D L Q S D | N V Y R A D L E p.6060 . . . . . . g.173751 TGGCTCCGAGGCATTGGCTGGATCCCACTGGATTCTGTGGACCATGTAAGGGTTACTAAG c.18240 W L R G I G W I P L D S V D H V R V T K p.6080 . . | 116 . . . . g.173921 AACCAGGAAATGATGAGTCAG | ATCAAATATAAGAAAAATGCCCTTGAAAACTATCCTAAC c.18300 N Q E M M S Q | I K Y K K N A L E N Y P N p.6100 . . . . . . g.173981 TTTAGAAGTGTGGTGGATCCTCCAGAGATTGTTTTAGCCAAGATTAATTCTGTCAATCAA c.18360 F R S V V D P P E I V L A K I N S V N Q p.6120 | 117. . . . . . g.174393 AGTGAT | GTAAAATATAAAGAAACATTTAATAAAGCAAAGGGCAAATATACGTTTTCACCA c.18420 S D | V K Y K E T F N K A K G K Y T F S P p.6140 . . . . . | 118 . g.175566 GATACACCACATATCTCCCACTCCAAAGACATGGGAAAACTCTACAGTACT | ATACTGTAT c.18480 D T P H I S H S K D M G K L Y S T | I L Y p.6160 . . . . . . g.175626 AAAGGGGCGTGGGAGGGCACCAAGGCCTATGGCTACACCCTGGATGAGCGCTACATTCCC c.18540 K G A W E G T K A Y G Y T L D E R Y I P p.6180 . . . | 119 . . g.175789 ATTGTTGGAGCCAAGCATGCTGATCTGGTGAACAGTGAG | CTTAAATACAAAGAGACATAT c.18600 I V G A K H A D L V N S E | L K Y K E T Y p.6200 . . . . . . g.175849 GAGAAGCAGAAAGGTCACTACCTGGCTGGAAAAGTGATCGGTGAATTCCCTGGTGTGGTT c.18660 E K Q K G H Y L A G K V I G E F P G V V p.6220 . . . | 120 . . . g.176706 CACTGTCTGGATTTCCAAAAGATGAGGAGTGCG | TTGAACTACAGAAAACATTATGAGGAT c.18720 H C L D F Q K M R S A | L N Y R K H Y E D p.6240 . . . . . . g.176766 ACCAAAGCAAATGTTCATATCCCCAATGACATGATGAATCACGTGCTGGCTAAAAGGTGC c.18780 T K A N V H I P N D M M N H V L A K R C p.6260 . . . . . . g.176826 CAGTACATCCTCAGTGACCTGGAGTATCGACACTATTTCCACCAGTGGACGTCTCTTCTG c.18840 Q Y I L S D L E Y R H Y F H Q W T S L L p.6280 . . . . . | 121 . g.177282 GAAGAACCCAATGTTATACGCGTCCGAAACGCCCAGGAGATCTTGAGTGAT | AATGTGTAT c.18900 E E P N V I R V R N A Q E I L S D | N V Y p.6300 . . . . . . g.177342 AAAGATGACCTGAATTGGTTGAAAGGCATTGGTTGCTACGTTTGGGATACACCCCAAATC c.18960 K D D L N W L K G I G C Y V W D T P Q I p.6320 . . . | 122. . . g.178199 CTCCATGCCAAGAAATCATACGACCTTCAGAGTCAG | CTACAATATACAGCAGCAGGTAAA c.19020 L H A K K S Y D L Q S Q | L Q Y T A A G K p.6340 . . . . . . g.178259 GAAAATCTACAAAACTATAATCTGGTCACAGACACGCCCCTCTATGTGACTGCTGTTCAG c.19080 E N L Q N Y N L V T D T P L Y V T A V Q p.6360 . . | 123 . . . . g.178418 AGTGGCATTAATGCCAGTGAG | GTAAAATATAAAGAAAATTATCATCAGATTAAGGACAAA c.19140 S G I N A S E | V K Y K E N Y H Q I K D K p.6380 . . . . . . g.178478 TACACAACAGTTCTAGAAACAGTGGATTATGACAGAACCAGAAACCTGAAGAATCTTTAC c.19200 Y T T V L E T V D Y D R T R N L K N L Y p.6400 | 124. . . . . . g.178842 AGCAGT | AACCTGTACAAGGAGGCCTGGGATAGAGTGAAAGCCACCAGCTACATCCTGCCT c.19260 S S | N L Y K E A W D R V K A T S Y I L P p.6420 . . . . . | 125 . g.184449 TCCAGCACCTTGTCCCTGACACACGCCAAGAACCAGAAGCATCTGGCCAGCCAT | ATCAAA c.19320 S S T L S L T H A K N Q K H L A S H | I K p.6440 . . . . . . g.184509 TATCGGGAAGAATATGAAAAGTTCAAAGCTCTTTATACGTTACCAAGAAGTGTTGACGAT c.19380 Y R E E Y E K F K A L Y T L P R S V D D p.6460 . . . . | 126 . g.185474 GATCCGAACACAGCACGGTGCCTCCGAGTTGGCAAGCTTAACATCGAT | CGCCTGTACAGA c.19440 D P N T A R C L R V G K L N I D | R L Y R p.6480 . . . . . . g.185534 TCAGTTTATGAAAAGAACAAGATGAAAATCCACATCGTGCCCGACATGGTAGAGATGGTT c.19500 S V Y E K N K M K I H I V P D M V E M V p.6500 . . . . . . g.185594 ACTGCCAAGGATTCCCAGAAGAAAGTCAGTGAGATTGATTACCGCCTGCGCCTCCACGAA c.19560 T A K D S Q K K V S E I D Y R L R L H E p.6520 . . . . . . g.185654 TGGATTTGCCACCCCGACTTGCAAGTCAATGATCACGTCAGGAAAGTCACAGATCAGATC c.19620 W I C H P D L Q V N D H V R K V T D Q I p.6540 | 127. . . . . . g.186039 AGCGAT | ATTGTATACAAGGATGACCTCAACTGGCTGAAAGGCATTGGTTGCTACGTCTGG c.19680 S D | I V Y K D D L N W L K G I G C Y V W p.6560 . . . . . | 128 . g.186720 GACACTCCTGAAATCCTCCATGCCAAGCATGCTTATGATCTACGTGATGAT | ATCAAGTAT c.19740 D T P E I L H A K H A Y D L R D D | I K Y p.6580 . . . . . . g.186780 AAAGCTCACATGTTGAAAACAAGGAATGACTACAAGCTTGTCACAGATACACCAGTCTAC c.19800 K A H M L K T R N D Y K L V T D T P V Y p.6600 . . . | 129. . . g.187666 GTGCAGGCTGTCAAAAGTGGGAAACAGCTAAGTGAC | GCTGTCTACCACTATGACTATGTG c.19860 V Q A V K S G K Q L S D | A V Y H Y D Y V p.6620 . . . . . . g.187726 CACAGTGTCAGAGGCAAAGTGGCTCCAACTACCAAAACCGTGGATCTGGACCGGGCCCTT c.19920 H S V R G K V A P T T K T V D L D R A L p.6640 . . | 130 . . . . g.189783 CATGCATACAAGCTCCAGAGTTCG | AATCTATACAAAACCAGCCTGCGCACCCTGCCCACT c.19980 H A Y K L Q S S | N L Y K T S L R T L P T p.6660 . . . . . . g.189843 GGATATAGACTTCCAGGTGACACTCCTCACTTCAAACACATCAAGGACACCCGTTACATG c.20040 G Y R L P G D T P H F K H I K D T R Y M p.6680 | 131 . . . . . g.191123 AGCAGTTAT | TTCAAGTACAAAGAAGCCTATGAACACACCAAGGCATATGGGTATACACTT c.20100 S S Y | F K Y K E A Y E H T K A Y G Y T L p.6700 . . . . . | 132 g.191752 GGCCCCAAAGATGTTCCATTTGTCCACGTCCGGAGAGTCAACAATGTTACCAGCGAG | AGA c.20160 G P K D V P F V H V R R V N N V T S E | R p.6720 . . . . . . g.191812 CTGTATCGGGAATTGTACCACAAACTGAAAGACAAGATCCATACAACTCCCGATACCCCT c.20220 L Y R E L Y H K L K D K I H T T P D T P p.6740 . . . . | 133 . . g.191972 GAGATCCGCCAAGTCAAGAAGACACAAGAGGCTGTCAGTGAG | TTGATCTACAAATCAGAC c.20280 E I R Q V K K T Q E A V S E | L I Y K S D p.6760 . . . . . . g.192032 TTCTTCAAGATGCAGGGCCACATGATCTCTCTGCCATACACACCCCAAGTGATCCATTGC c.20340 F F K M Q G H M I S L P Y T P Q V I H C p.6780 . . | 134 . . . g.193077 CGCTATGTGGGAGACATCACCAGTGAT | ATTAAATACAAAGAGGACTTGCAGGTCCTGAAG c.20400 R Y V G D I T S D | I K Y K E D L Q V L K p.6800 . . . . . . g.193137 GGATTTGGCTGCTTCCTGTATGACACTCCTGACATGGTCCGCTCCCGGCACCTGCGGAAG c.20460 G F G C F L Y D T P D M V R S R H L R K p.6820 | 135. . . . . . g.193543 CTCTGG | TCTAATTACCTATACACTGATAAGGCAAGGAAGATGCGAGACAAATACAAAGTG c.20520 L W | S N Y L Y T D K A R K M R D K Y K V p.6840 . . . . . | 136 g.197939 GTGCTTGACACTCCAGAATACAGAAAAGTGCAAGAACTGAAGACACATCTGAGTGAG | CTG c.20580 V L D T P E Y R K V Q E L K T H L S E | L p.6860 . . . . . . g.197999 GTCTACAGAGCTGCAGGCAAGAAGCAGAAGTCAATCTTTACTTCAGTTCCTGATACTCCT c.20640 V Y R A A G K K Q K S I F T S V P D T P p.6880 . . . . | 137 . . g.198704 GATCTTTTAAGAGCCAAGCGAGGGCAGAAGCTTCAGAGTCAG | TATCTGTATGTTGAACTT c.20700 D L L R A K R G Q K L Q S Q | Y L Y V E L p.6900 . . . . . . g.198764 GCCACCAAAGAGAGACCCCATCATCACGCTGGAAACCAGACCACAGCCTTGAAGCATGCT c.20760 A T K E R P H H H A G N Q T T A L K H A p.6920 . . | 138 . . . g.199072 AAAGACGTGAAGGACATGGTCAGTGAG | AAAAAGTACAAGATTCAATATGAAAAGATGAAA c.20820 K D V K D M V S E | K K Y K I Q Y E K M K p.6940 . . . . . . g.199132 GACAAGTACACTCCGGTTCCAGATACGCCAATCCTCATCAGAGCCAAGAGGGCTTACTGG c.20880 D K Y T P V P D T P I L I R A K R A Y W p.6960 . | 139 . . . . . g.201291 AATGCCAGTGAT | CTACGCTACAAAGAAACATTTCAAAAGACCAAAGGGAAATACCACACG c.20940 N A S D | L R Y K E T F Q K T K G K Y H T p.6980 . . . . . | 140 g.201514 GTGAAAGATGCCCTAGACATTGTCTATCATCGCAAAGTCACAGATGACATCAGTAAA | ATA c.21000 V K D A L D I V Y H R K V T D D I S K | I p.7000 . . . . . . g.201574 AAATACAAGGAGAACTACATGAGCCAGTTGGGTATCTGGAGGTCCATTCCTGATCGTCCA c.21060 K Y K E N Y M S Q L G I W R S I P D R P p.7020 . . . . | 141 . . g.202269 GAGCATTTCCACCACCGAGCAGTCACTGACACAGTCAGTGAT | GTAAAATATAAAGAAGAC c.21120 E H F H H R A V T D T V S D | V K Y K E D p.7040 . . . . . . g.202329 TTGACTTGGCTTAAAGGCATTGGTTGCTATGCCTATGATACCCCTGATTTCACTCTGGCT c.21180 L T W L K G I G C Y A Y D T P D F T L A p.7060 . . | 142 . . . g.203725 GAAAAGAACAAGACTCTCTACAGCAAG | TATAAGTATAAAGAAGTATTTGAAAGGACAAAG c.21240 E K N K T L Y S K | Y K Y K E V F E R T K p.7080 . . . . . . g.203785 TCAGATTTCAAGTATGTTGCCGACTCTCCGATCAATAGGCATTTCAAGTATGCAACTCAA c.21300 S D F K Y V A D S P I N R H F K Y A T Q p.7100 . | 143 . . . . . g.205216 TTGATGAATGAG | AAAAAATACAGAGCTGATTATGAGCAGCGGAAAGATAAATACCACCTG c.21360 L M N E | K K Y R A D Y E Q R K D K Y H L p.7120 ^ alternative splicing; exon 143 or exon 144 is present . . . . . | 144 g.205944 GTAGTCGATGAGCCTAGACATCTGCTGGCTAAGACCGCAGGCGACCAGATCAGTCAG | AGA c.21420 V V D E P R H L L A K T A G D Q I S Q | R p.7140 . . . . . . g.206004 AAATATAAATCTAGTGCCAAGATGTTTCTGCAACATGGATGTAATGAAATTCTGCGTCCA c.21480 K Y K S S A K M F L Q H G C N E I L R P p.7160 . . . . | 145 . . g.207609 GATATGTTGACTGCTCTCTACAATTCGCATATGTGGAGCCAG | ATCAAATACAGGAAAAAC c.21540 D M L T A L Y N S H M W S Q | I K Y R K N p.7180 . . . . . . g.207669 TATGAAAAATCAAAGGACAAATTTACCTCAATTGTGGATACTCCAGAACACCTGCGTACT c.21600 Y E K S K D K F T S I V D T P E H L R T p.7200 . . | 146 . . . g.208419 ACAAAAGTCAACAAACAAATCAGCGAT | ATCCTTTATAAATTGGAATACAACAAGGCCAAA c.21660 T K V N K Q I S D | I L Y K L E Y N K A K p.7220 . . . . . . g.208479 CCCAGAGGCTACACCACAATCCACGACACACCCATGTTGCTGCATGTCCGCAAGGTTAAA c.21720 P R G Y T T I H D T P M L L H V R K V K p.7240 . | 147 . . . . . g.210218 GATGAAGTCAGTGAT | CTGAAATACAAAGAAGTATACCAAAGAAATAAATCCAACTGCACC c.21780 D E V S D | L K Y K E V Y Q R N K S N C T p.7260 . . . . . . g.210278 ATTGAGCCAGATGCTGTTCATATCAAAGCAGCCAAGGACGCCTACAAAGTCAACACCAAT c.21840 I E P D A V H I K A A K D A Y K V N T N p.7280 | 148 . . . . . . g.211962 | CTGGACTATAAGAAACAGTACGAAGCCAACAAAGCCCACTGGAAGTGGACTCCTGACCGA c.21900 | L D Y K K Q Y E A N K A H W K W T P D R p.7300 . . . . | 149 . . g.212480 CCGGACTTCCTCCAGGCTGCCAAGTCATCCCTGCAGCAAAGCGAT | TTTGAATATAAGCTG c.21960 P D F L Q A A K S S L Q Q S D | F E Y K L p.7320 . . . . . . g.212540 GACCGGGAGTTCCTCAAGGGTTGCAAGCTTTCTGTCACTGATGACAAAAACACGGTGCTC c.22020 D R E F L K G C K L S V T D D K N T V L p.7340 . . . | 150 . . . g.213255 GCCCTCAGGAATACTTTAATAGAAAGTGAT | CTGAAATACAAAGAGAAACATGTCAAGGAA c.22080 A L R N T L I E S D | L K Y K E K H V K E p.7360 . . . . . . g.213315 AGAGGAACCTGCCATGCCGTACCTGACACGCCTCAGATCCTGCTGGCGAAGACTGTCAGC c.22140 R G T C H A V P D T P Q I L L A K T V S p.7380 . | 151 . . . . . g.213464 AACCTGGTGTCTGAG | AACAAGTACAAGGACCATGTCAAGAAGCACTTGGCACAGGGCTCA c.22200 N L V S E | N K Y K D H V K K H L A Q G S p.7400 . . . . . . g.213524 TACACAACACTACCAGAGACCCGGGACACTGTTCACGTCAAGGAAGTGACCAAGCATGTC c.22260 Y T T L P E T R D T V H V K E V T K H V p.7420 | 152. . . . . . g.214268 AGTGAT | ACAAATTACAAAAAGAAGTTTGTCAAGGAGAAAGGAAAATCCAACTACTCCATC c.22320 S D | T N Y K K K F V K E K G K S N Y S I p.7440 . . . . . | 153 g.214874 ATGCTGGAGCCACCAGAGGTGAAACATGCTATGGAAGTGGCCAAGAAGCAAAGTGAT | GTC c.22380 M L E P P E V K H A M E V A K K Q S D | V p.7460 . . . . . . g.214934 GCTTACAGAAAAGATGCCAAAGAGAACCTGCATTACACCACAGTGGCTGATCGACCAGAC c.22440 A Y R K D A K E N L H Y T T V A D R P D p.7480 . . . | 154 . . g.215093 ATCAAGAAGGCCACACAGGCAGCCAAACAGGCCAGTGAG | GTGGAGTACAGAGCCAAGCAC c.22500 I K K A T Q A A K Q A S E | V E Y R A K H p.7500 . . . . . . g.215153 CGCAAGGAAGGCAGCCATGGCTTAAGCATGCTCGGTCGCCCAGACATAGAAATGGCCAAG c.22560 R K E G S H G L S M L G R P D I E M A K p.7520 . . | 155 . . . . g.219755 AAGGCAGCCAAGCTGAGCAGCCAG | GTTAAATACCGAGAAAATTTCGATAAAGAAAAGGGC c.22620 K A A K L S S Q | V K Y R E N F D K E K G p.7540 . . . . . . g.219815 AAGACACCAAAATACAATCCAAAAGACAGCCAGCTCTACAAAGTCATGAAAGATGCTAAT c.22680 K T P K Y N P K D S Q L Y K V M K D A N p.7560 . | 156 . . . . . g.220463 AATCTTGCAAGTGAG | GTTAAATACAAGGCTGACCTGAAGAAACTTCACAAACCCGTGACT c.22740 N L A S E | V K Y K A D L K K L H K P V T p.7580 . . . . . . g.220523 GACATGAAGGAGTCTCTGATCATGAATCATGTCCTGAATACAAGCCAACTTGCCAGTTCT c.22800 D M K E S L I M N H V L N T S Q L A S S p.7600 | 157 . . . . . . g.221125 | TACCAGTACAAGAAGAAGTATGAGAAGAGTAAAGGCCACTACCACACCATACCCGATAAT c.22860 | Y Q Y K K K Y E K S K G H Y H T I P D N p.7620 . . . . | 158 . . g.222939 CTGGAGCAGCTTCACCTAAAAGAGGCCACAGAATTACAGAGTATA | GTGAAATACAAAGAA c.22920 L E Q L H L K E A T E L Q S I | V K Y K E p.7640 . . . . . . g.222999 AAGTATGAAAAGGAACGAGGAAAACCCATGCTGGACTTTGAAACACCAACGTACATCACT c.22980 K Y E K E R G K P M L D F E T P T Y I T p.7660 . . . | 159 . . . g.224589 GCCAAAGAGTCTCAGCAGATGCAGAGTGGG | AAAGAATATAGGAAAGATTATGAAGAGTCC c.23040 A K E S Q Q M Q S G | K E Y R K D Y E E S p.7680 . . . . . . g.224649 ATTAAAGGCAGAAACCTGACTGGCCTGGAGGTCACGCCAGCTTTGTTACATGTCAAATAT c.23100 I K G R N L T G L E V T P A L L H V K Y p.7700 . . | 160 . . . . g.225098 GCAACTAAAATAGCAAGCGAG | AAAGAGTACAGGAAAGATCTAGAGGAAAGCATCCGTGGG c.23160 A T K I A S E | K E Y R K D L E E S I R G p.7720 . . . . . . g.225158 AAGGGCCTCACTGAAATGGAAGATACACCTGACATGCTAAGAGCAAAGAATGCCACTCAA c.23220 K G L T E M E D T P D M L R A K N A T Q p.7740 . | 161 . . . . . g.225842 ATCCTCAATGAG | AAAGAATATAAGCGAGACCTGGAACTGGAAGTCAAAGGAAGAGGCCTG c.23280 I L N E | K E Y K R D L E L E V K G R G L p.7760 . . . . . . g.225902 AATGCCATGGCCAATGAAACTCCGGATTTTATGAGGGCCAGGAATGCTACTGATATTGCC c.23340 N A M A N E T P D F M R A R N A T D I A p.7780 | 162. . . . . . g.226704 AGTCAG | ATTAAGTATAAGCAATCAGCAGAAATGGAGAAAGCCAATTTCACTTCTGTGGTT c.23400 S Q | I K Y K Q S A E M E K A N F T S V V p.7800 . . . . . | 163 . g.231387 GATACTCCAGAGATCATTCATGCCCAACAAGTCAAGAATCTTTCAAGCCAG | AAAAAGTAC c.23460 D T P E I I H A Q Q V K N L S S Q | K K Y p.7820 . . . . . . g.231447 AAGGAAGATGCTGAGAAGTCCATGTCGTATTATGAGACTGTTTTGGACACCCCAGAGATA c.23520 K E D A E K S M S Y Y E T V L D T P E I p.7840 . . . | 164. . . g.232498 CAGAGAGTCCGGGAGAACCAAAAGAACTTCAGCCTT | CTCCAATACCAGTGTGACCTTAAA c.23580 Q R V R E N Q K N F S L | L Q Y Q C D L K p.7860 . . . . . . g.232558 AACAGTAAAGGAAAAATTACAGTTGTTCAAGACACGCCAGAAATACTGCGTGTAAAAGAA c.23640 N S K G K I T V V Q D T P E I L R V K E p.7880 . . | 165 . . . . g.233268 AATCAGAAGAATTTCAGCTCG | GTTTTATATAAAGAGGATGTCTCACCAGGAACGGCTATC c.23700 N Q K N F S S | V L Y K E D V S P G T A I p.7900 . . . . . | 166 . g.233923 GGAAAGACACCTGAGATGATGAGAGTGAAACAAACACAGGACCACATTAGCTCG | GTGAAG c.23760 G K T P E M M R V K Q T Q D H I S S | V K p.7920 . . . . . . g.233983 TATAAGGAAGCAATAGGACAAGGAACTCCAATCCCTGACCTGCCTGAAGTGAAACGTGTG c.23820 Y K E A I G Q G T P I P D L P E V K R V p.7940 . . | 167 . . . g.236079 AAGGAGACGCAGAAGCACATTAGCTCG | GTTATGTACAAAGAAAACTTGGGAACAGGCATT c.23880 K E T Q K H I S S | V M Y K E N L G T G I p.7960 . . . . . . g.236139 CCAACCACTGTGACTCCAGAGATTGAGAGAGTCAAACGCAATCAAGAGAACTTTAGCTCG c.23940 P T T V T P E I E R V K R N Q E N F S S p.7980 | 168 . . . . . . g.236662 | GTTTTGTACAAAGAAAATTTGGGGAAAGGAATCCCAACACCTATCACTCCAGAGATGGAG c.24000 | V L Y K E N L G K G I P T P I T P E M E p.8000 . . . | 169 . . . g.238031 AGAGTCAAACGCAATCAAGAGAACTTTAGCTCG | ATATTGTACAAAGAGAACTTGAGCAAG c.24060 R V K R N Q E N F S S | I L Y K E N L S K p.8020 . . . . . . g.238091 GGGACTCCCCTACCTGTCACTCCTGAGATGGAGCGAGTCAAACTCAATCAAGAAAACTTT c.24120 G T P L P V T P E M E R V K L N Q E N F p.8040 | 170. . . . . . g.240151 AGCTCG | GTGTTGTATAAAGAAAACGTTGGAAAAGGGATTCCAATCCCCATCACTCCAGAG c.24180 S S | V L Y K E N V G K G I P I P I T P E p.8060 . . . | 171 . . g.241156 ATGGAGAGAGTCAAACACAATCAAGAAAACTTTAGTTCG | GTGCTATACAAAGAAAACCTG c.24240 M E R V K H N Q E N F S S | V L Y K E N L p.8080 . . . . . . g.241216 GGGACAGGAATTCCAATCCCCATCACTCCTGAGATGCAGAGAGTCAAACACAATCAAGAA c.24300 G T G I P I P I T P E M Q R V K H N Q E p.8100 . | 172 . . . . . g.241817 AACCTTAGCTCG | GTGTTATACAAAGAAAACATGGGCAAGGGAACCCCTTTACCTGTCACT c.24360 N L S S | V L Y K E N M G K G T P L P V T p.8120 . . . . | 173 . . g.242469 CCCGAGATGGAAAGAGTCAAACACAATCAAGAAAATATTAGCTCG | GTGTTATACAAAGAA c.24420 P E M E R V K H N Q E N I S S | V L Y K E p.8140 . . . . . . g.242529 AACATGGGCAAGGGAACCCCTCTACCTGTCACTCCTGAGATGGAGAGAGTCAAACACAAT c.24480 N M G K G T P L P V T P E M E R V K H N p.8160 . | 174 . . . . g.243161 CAAGAAAATATTAGCTCG | GTGTTATACAAAGAAAACATGGGCAAGGGAACTCCTTTAGCT c.24540 Q E N I S S | V L Y K E N M G K G T P L A p.8180 . . . . . | 175 . g.245243 GTCACTCCCGAGATGGAGCGAGTCAAACACAATCAAGAAAATATTAGCTCG | GTTTTGTAC c.24600 V T P E M E R V K H N Q E N I S S | V L Y p.8200 . . . . . . g.245303 AAAGAAAATGTGGGGAAAGCCACCGCAACCCCTGTCACTCCTGAGATGCAGAGAGTCAAA c.24660 K E N V G K A T A T P V T P E M Q R V K p.8220 . . | 176 . . . . g.245656 CGCAATCAAGAAAACATTAGCTCG | GTGTTATACAAAGAGAACCTGGGGAAAGCAACCCCC c.24720 R N Q E N I S S | V L Y K E N L G K A T P p.8240 . . . . . | 177 g.246045 ACACCCTTTACTCCTGAGATGGAAAGAGTGAAACGCAATCAAGAAAACTTTAGCTCG | GTA c.24780 T P F T P E M E R V K R N Q E N F S S | V p.8260 . . . . . . g.246105 TTGTACAAAGAGAACATGAGAAAAGCAACTCCGACACCTGTTACTCCAGAGATGGAGAGA c.24840 L Y K E N M R K A T P T P V T P E M E R p.8280 . . . | 178 . . . g.247023 GCTAAGCGCAACCAAGAAAACATTAGCTCG | GTTCTTTATTCTGATAGTTTCCGGAAACAA c.24900 A K R N Q E N I S S | V L Y S D S F R K Q p.8300 . . . . . . g.247083 ATACAAGGCAAAGCTGCCTATGTATTGGATACCCCCGAGATGAGACGGGTGAGGGAGACC c.24960 I Q G K A A Y V L D T P E M R R V R E T p.8320 . | 179 . . . . g.247248 CAACGGCACATCTCAACG | GTGAAATATCATGAAGACTTTGAGAAACACAAGGGTTGCTTC c.25020 Q R H I S T | V K Y H E D F E K H K G C F p.8340 . . . . . . g.247308 ACACCAGTGGTGACAGATCCTATCACTGAACGAGTAAAGAAGAACATGCAGGACTTCAGT c.25080 T P V V T D P I T E R V K K N M Q D F S p.8360 . . . . . . g.247368 GACATTAACTACCGAGGTATTCAGAGGAAAGTGGTAGAAATGGAACAAAAACGGAATGAC c.25140 D I N Y R G I Q R K V V E M E Q K R N D p.8380 . . | 180 . . . . g.247750 CAAGATCAGGAGACTATTACAG | GTTTACGTGTCTGGCGTACTAATCCTGGTTCGGTTTTT c.25200 Q D Q E T I T G | L R V W R T N P G S V F p.8400 . . . . . | 181 . g.248974 GACTATGATCCAGCAGAAGACAACATCCAGTCCCGAAGCTTACACATGATTAATG | TCCAA c.25260 D Y D P A E D N I Q S R S L H M I N V | Q p.8420 . . . . . . g.249034 GCTCAGCGCCGGAGCCGGGAGCAGTCACGATCTGCCAGTGCACTAAGCATCAGTGGGGGT c.25320 A Q R R S R E Q S R S A S A L S I S G G p.8440 . . . . . . g.249094 GAGGAGAAGTCTGAGCATTCAGAAGCACCAGACCACCACCTTTCGACTTACAGCGACGGG c.25380 E E K S E H S E A P D H H L S T Y S D G p.8460 . . | 182 . . . . g.249448 GGTGTCTTTGCAGTCTCAACAG | CTTACAAACATGCAAAAACCACAGAGCTCCCACAACAA c.25440 G V F A V S T A | Y K H A K T T E L P Q Q p.8480 . . . . . . g.249508 CGATCATCTTCAGTTGCTACCCAACAGACAACGGTATCTTCCATCCCATCTCATCCATCT c.25500 R S S S V A T Q Q T T V S S I P S H P S p.8500 | 183 . . . . . g.253605 ACTGCTGGA | AAAATCTTCCGTGCCATGTATGACTATATGGCTGCTGATGCAGATGAGGTG c.25560 T A G | K I F R A M Y D Y M A A D A D E V p.8520 . . . . . . g.253665 TCCTTCAAGGATGGAGATGCCATCATAAATGTTCAAGCAATTGATGAAGGCTGGATGTAT c.25620 S F K D G D A I I N V Q A I D E G W M Y p.8540 . . . . . . g.253725 GGCACTGTGCAGAGGACTGGCAGGACCGGAATGCTCCCAGCCAACTACGTTGAAGCTATT c.25680 G T V Q R T G R T G M L P A N Y V E A I p.8560 g.253728 TAG c.25683 X p.8560 . . . . . . g.253788 gcatttcaaagcatcacacttgtctgcaggacttacagatcctgcagtcaatgtttcggt c.*60 . . . . . . g.253848 ttagactctccactgttacctaagttctcaagctgcctatggtttttctgtgtcaatgtg c.*120 . . . . . . g.253908 atttatggtagtaccatcctttctcctttgggttttaaaataagttgcagaacagacact c.*180 . . . . . . g.253968 ttaaaagcttctgcaatattatttctgtgcctagagtctttctccattataaacatgttt c.*240 . . . . . . g.254028 taacattatttcttttctaaaacagggattttgaatatgccaaacacattaaaggaaaaa c.*300 . . . . . . g.254088 tagcagagatgttcaccttttccttgctgattgctaatgcttattatttctaattcagtt c.*360 . . . . . . g.254148 ctgaagttataaacttataatcaatacaaaccagcaactaataaaacctctaattctgca c.*420 g.254149 a c.*421 (downstream sequence)
Legend:
Nucleotide numbering (following the rules of the HGVS
for a 'Coding DNA Reference Sequence') is indicated at the right of the
sequence, counting the A of the ATG translation initiating Methionine as
1. Every 10th nucleotide is indicated by a "." above the
sequence. The Nebulin protein sequence is shown below the coding DNA
sequence, with numbering indicated at the right starting with 1 for the
translation initiating Methionine. Every 10th amino acid is
shown in bold. The position of introns is indicated by a vertical line,
splitting the two exons. The start of the first exon (transcription
initiation site) is indicated by a '\', the end of the last exon (poly-A
addition site) by a '/'. The exon number is indicated above the first
nucleotide(s) of the exon. To aid the description of frame shift
mutations, all stop codons in the +1 frame are shown in bold
while all stop codons in the +2 frame are underlined.
Powered by LOVD
v.2.0 Build 29
©2004-2010 Leiden
University Medical Center