Leiden Muscular Dystrophy pages©

DMD (dystrophin) cDNA Reference Sequence

used for mutation description

(last modified November 29, 2005)


We have introduced a change in the numbering of the nucleotides 3' of the stop codon, following the most recent HGVS-recommendations.

Please note that the ancient dystrophin, lacking exon 78, encodes a protein that is has a different, longer C-terminal end. Consequently, variants up to nucleotide c.*86 affect the protein.

As of January 1, 2003 we use a new coding DNA Reference Sequence for nucleotide numbering to describe sequence changes in the DMD gene. This reference sequence is based on GenBank file NM_004006.1 (with one difference 12505G>A), containing the Dp427m isoform (muscle) of dystrophin. This new file replaces the old coding DNA reference sequence. The gene flanking and intronic sequences are derived from a range of GenBank files (see Genomic reference sequence of the DMD gene).


(upstream sequence)
 exon 01                                                 tcct       -241

 .         .         .         .         .         .         
 ggcatcagttactgtgttgactcactcagtgttgggatcactcactttccccctacagga       -181

 .         .         .         .         .         .         
 ctcagatctgggaggcaattaccttcggagaaaaacgaataggaaaaactgaagtgttac       -121

 .         .         .         .         .         .         
 tttttttaaagctgctgaagtttgttggtttctcattgtttttaagcctactggagcaat       -61

 .         .         .         .         .         .         
 aaagtttgaagaacttttaccaggttttttttatcgctgccttgatatacacttttcaaa       -1

          .         .         .  | 2       .         .         .
 ATGCTTTGGTGGGAAGAAGTAGAGGACTGTT | ATGAAAGAGAAGATGTTCAAAAGAAAACA     60
 M  L  W  W  E  E  V  E  D  C  Y |   E  R  E  D  V  Q  K  K  T       20

          .         .         .    | 3     .         .         .
 TTCACAAAATGGGTAAATGCACAATTTTCTAAG | TTTGGGAAGCAGCATATTGAGAACCTC     120
 F  T  K  W  V  N  A  Q  F  S  K   | F  G  K  Q  H  I  E  N  L       40

          .         .         .         .         .         .
 TTCAGTGACCTACAGGATGGGAGGCGCCTCCTAGACCTCCTCGAAGGCCTGACAGGGCAA        180
 F  S  D  L  Q  D  G  R  R  L  L  D  L  L  E  G  L  T  G  Q          60

        | 4  .         .         .         .         .         .
 AAACTG | CCAAAAGAAAAAGGATCCACAAGAGTTCATGCCCTGAACAATGTCAACAAGGCA     240
 K  L   | P  K  E  K  G  S  T  R  V  H  A  L  N  N  V  N  K  A       80

          .         .     | 5    .         .         .         .
 CTGCGGGTTTTGCAGAACAATAAT | GTTGATTTAGTGAATATTGGAAGTACTGACATCGTA     300
 L  R  V  L  Q  N  N  N   | V  D  L  V  N  I  G  S  T  D  I  V       100

          .         .         .         .         .        | 6 .
 GATGGAAATCATAAACTGACTCTTGGTTTGATTTGGAATATAATCCTCCACTGGCAG | GTC     360
 D  G  N  H  K  L  T  L  G  L  I  W  N  I  I  L  H  W  Q   | V       120

          .         .         .         .         .         .
 AAAAATGTAATGAAAAATATCATGGCTGGATTGCAACAAACCAACAGTGAAAAGATTCTC        420
 K  N  V  M  K  N  I  M  A  G  L  Q  Q  T  N  S  E  K  I  L          140

          .         .         .         .         .         .
 CTGAGCTGGGTCCGACAATCAACTCGTAATTATCCACAGGTTAATGTAATCAACTTCACC        480
 L  S  W  V  R  Q  S  T  R  N  Y  P  Q  V  N  V  I  N  F  T          160

          .         .         .         .         . | 7        .
 ACCAGCTGGTCTGATGGCCTGGCTTTGAATGCTCTCATCCATAGTCATAG | GCCAGACCTA     540
 T  S  W  S  D  G  L  A  L  N  A  L  I  H  S  H  R  |  P  D  L       180

          .         .         .         .         .         .
 TTTGACTGGAATAGTGTGGTTTGCCAGCAGTCAGCCACACAACGACTGGAACATGCATTC        600
 F  D  W  N  S  V  V  C  Q  Q  S  A  T  Q  R  L  E  H  A  F          200

          .         .         .         .          | 8         .
 AACATCGCCAGATATCAATTAGGCATAGAGAAACTACTCGATCCTGAAG | ATGTTGATACC     660
 N  I  A  R  Y  Q  L  G  I  E  K  L  L  D  P  E  D |   V  D  T       220

          .         .         .         .         .         .
 ACCTATCCAGATAAGAAGTCCATCTTAATGTACATCACATCACTCTTCCAAGTTTTGCCT        720
 T  Y  P  D  K  K  S  I  L  M  Y  I  T  S  L  F  Q  V  L  P          240

          .         .         .         .         .         .
 CAACAAGTGAGCATTGAAGCCATCCAGGAAGTGGAAATGTTGCCAAGGCCACCTAAAGTG        780
 Q  Q  V  S  I  E  A  I  Q  E  V  E  M  L  P  R  P  P  K  V          260

          .         .         .         .         .  | 9       .
 ACTAAAGAAGAACATTTTCAGTTACATCATCAAATGCACTATTCTCAACAG | ATCACGGTC     840
 T  K  E  E  H  F  Q  L  H  H  Q  M  H  Y  S  Q  Q   | I  T  V       280

          .         .         .         .         .         .
 AGTCTAGCACAGGGATATGAGAGAACTTCTTCCCCTAAGCCTCGATTCAAGAGCTATGCC        900
 S  L  A  Q  G  Y  E  R  T  S  S  P  K  P  R  F  K  S  Y  A          300

          .         .         .         .         .         .
 TACACACAGGCTGCTTATGTCACCACCTCTGACCCTACACGGAGCCCATTTCCTTCACAG        960
 Y  T  Q  A  A  Y  V  T  T  S  D  P  T  R  S  P  F  P  S  Q          320

  | 10       .         .         .         .         .         .
  | CATTTGGAAGCTCCTGAAGACAAGTCATTTGGCAGTTCATTGATGGAGAGTGAAGTAAAC     1020
  | H  L  E  A  P  E  D  K  S  F  G  S  S  L  M  E  S  E  V  N       340

          .         .         .         .         .         .
 CTGGACCGTTATCAAACAGCTTTAGAAGAAGTATTATCGTGGCTTCTTTCTGCTGAGGAC        1080
 L  D  R  Y  Q  T  A  L  E  E  V  L  S  W  L  L  S  A  E  D          360

          .         .         .         .         .         .
 ACATTGCAAGCACAAGGAGAGATTTCTAATGATGTGGAAGTGGTGAAAGACCAGTTTCAT        1140
 T  L  Q  A  Q  G  E  I  S  N  D  V  E  V  V  K  D  Q  F  H          380

           | 11        .         .         .         .         .
 ACTCATGAG | GGGTACATGATGGATTTGACAGCCCATCAGGGCCGGGTTGGTAATATTCTA     1200
 T  H  E   | G  Y  M  M  D  L  T  A  H  Q  G  R  V  G  N  I  L       400

          .         .         .         .         .         .
 CAATTGGGAAGTAAGCTGATTGGAACAGGAAAATTATCAGAAGATGAAGAAACTGAAGTA        1260
 Q  L  G  S  K  L  I  G  T  G  K  L  S  E  D  E  E  T  E  V          420

          .         .         .         .         .         .
 CAAGAGCAGATGAATCTCCTAAATTCAAGATGGGAATGCCTCAGGGTAGCTAGCATGGAA        1320
 Q  E  Q  M  N  L  L  N  S  R  W  E  C  L  R  V  A  S  M  E          440

          .  | 12      .         .         .         .         .
 AAACAAAGCAA | TTTACATAGAGTTTTAATGGATCTCCAGAATCAGAAACTGAAAGAGTTG     1380
 K  Q  S  N  |  L  H  R  V  L  M  D  L  Q  N  Q  K  L  K  E  L       460

          .         .         .         .         .         .
 AATGACTGGCTAACAAAAACAGAAGAAAGAACAAGGAAAATGGAGGAAGAGCCTCTTGGA        1440
 N  D  W  L  T  K  T  E  E  R  T  R  K  M  E  E  E  P  L  G          480

          .         .         .         .   | 13     .         .
 CCTGATCTTGAAGACCTAAAACGCCAAGTACAACAACATAAG | GTGCTTCAAGAAGATCTA     1500
 P  D  L  E  D  L  K  R  Q  V  Q  Q  H  K   | V  L  Q  E  D  L       500

          .         .         .         .         .         .
 GAACAAGAACAAGTCAGGGTCAATTCTCTCACTCACATGGTGGTGGTAGTTGATGAATCT        1560
 E  Q  E  Q  V  R  V  N  S  L  T  H  M  V  V  V  V  D  E  S          520

          .         .         .         .   | 14     .         .
 AGTGGAGATCACGCAACTGCTGCTTTGGAAGAACAACTTAAG | GTATTGGGAGATCGATGG     1620
 S  G  D  H  A  T  A  A  L  E  E  Q  L  K   | V  L  G  D  R  W       540

          .         .         .         .         .         .
 GCAAACATCTGTAGATGGACAGAAGACCGCTGGGTTCTTTTACAAGACATCCTTCTCAAA        1680
 A  N  I  C  R  W  T  E  D  R  W  V  L  L  Q  D  I  L  L  K          560

          .         .     | 15   .         .         .         .
 TGGCAACGTCTTACTGAAGAACAG | TGCCTTTTTAGTGCATGGCTTTCAGAAAAAGAAGAT     1740
 W  Q  R  L  T  E  E  Q   | C  L  F  S  A  W  L  S  E  K  E  D       580

          .         .         .         .         .         .
 GCAGTGAACAAGATTCACACAACTGGCTTTAAAGATCAAAATGAAATGTTATCAAGTCTT        1800
 A  V  N  K  I  H  T  T  G  F  K  D  Q  N  E  M  L  S  S  L          600

          .   | 16     .         .         .         .         .
 CAAAAACTGGCC | GTTTTAAAAGCGGATCTAGAAAAGAAAAAGCAATCCATGGGCAAACTG     1860
 Q  K  L  A   | V  L  K  A  D  L  E  K  K  K  Q  S  M  G  K  L       620

          .         .         .         .         .         .
 TATTCACTCAAACAAGATCTTCTTTCAACACTGAAGAATAAGTCAGTGACCCAGAAGACG        1920
 Y  S  L  K  Q  D  L  L  S  T  L  K  N  K  S  V  T  Q  K  T          640

          .         .         .         .         .         .
 GAAGCATGGCTGGATAACTTTGCCCGGTGTTGGGATAATTTAGTCCAAAAACTTGAAAAG        1980
 E  A  W  L  D  N  F  A  R  C  W  D  N  L  V  Q  K  L  E  K          660

          .   | 17     .         .         .         .         .
 AGTACAGCACAG | ATTTCACAGGCTGTCACCACCACTCAGCCATCACTAACACAGACAACT     2040
 S  T  A  Q   | I  S  Q  A  V  T  T  T  Q  P  S  L  T  Q  T  T       680

          .         .         .         .         .         .
 GTAATGGAAACAGTAACTACGGTGACCACAAGGGAACAGATCCTGGTAAAGCATGCTCAA        2100
 V  M  E  T  V  T  T  V  T  T  R  E  Q  I  L  V  K  H  A  Q          700

          .         .         .         .         .         .
 GAGGAACTTCCACCACCACCTCCCCAAAAGAAGAGGCAGATTACTGTGGATTCTGAAATT        2160
 E  E  L  P  P  P  P  P  Q  K  K  R  Q  I  T  V  D  S  E  I          720

          | 18         .         .         .         .         .
 AGGAAAAG | GTTGGATGTTGATATAACTGAACTTCACAGCTGGATTACTCGCTCAGAAGCT     2220
 R  K  R  |  L  D  V  D  I  T  E  L  H  S  W  I  T  R  S  E  A       740

          .         .         .         .         .         .
 GTGTTGCAGAGTCCTGAATTTGCAATCTTTCGGAAGGAAGGCAACTTCTCAGACTTAAAA        2280
 V  L  Q  S  P  E  F  A  I  F  R  K  E  G  N  F  S  D  L  K          760

          .   | 19     .         .         .         .         .
 GAAAAAGTCAAT | GCCATAGAGCGAGAAAAAGCTGAGAAGTTCAGAAAACTGCAAGATGCC     2340
 E  K  V  N   | A  I  E  R  E  K  A  E  K  F  R  K  L  Q  D  A       780

          .         .         .         . | 20       .         .
 AGCAGATCAGCTCAGGCCCTGGTGGAACAGATGGTGAATG | AGGGTGTTAATGCAGATAGC     2400
 S  R  S  A  Q  A  L  V  E  Q  M  V  N  E |   G  V  N  A  D  S       800

          .         .         .         .         .         .
 ATCAAACAAGCCTCAGAACAACTGAACAGCCGGTGGATCGAATTCTGCCAGTTGCTAAGT        2460
 I  K  Q  A  S  E  Q  L  N  S  R  W  I  E  F  C  Q  L  L  S          820

          .         .         .         .         .         .
 GAGAGACTTAACTGGCTGGAGTATCAGAACAACATCATCGCTTTCTATAATCAGCTACAA        2520
 E  R  L  N  W  L  E  Y  Q  N  N  I  I  A  F  Y  N  Q  L  Q          840

          .         .         .         .         .         .
 CAATTGGAGCAGATGACAACTACTGCTGAAAACTGGTTGAAAATCCAACCCACCACCCCA        2580
 Q  L  E  Q  M  T  T  T  A  E  N  W  L  K  I  Q  P  T  T  P          860

          .         .         .         .   | 21     .         .
 TCAGAGCCAACAGCAATTAAAAGTCAGTTAAAAATTTGTAAG | GATGAAGTCAACCGGCTA     2640
 S  E  P  T  A  I  K  S  Q  L  K  I  C  K   | D  E  V  N  R  L       880

          .         .         .         .         .         .
 TCAGGTCTTCAACCTCAAATTGAACGATTAAAAATTCAAAGCATAGCCCTGAAAGAGAAA        2700
 S  G  L  Q  P  Q  I  E  R  L  K  I  Q  S  I  A  L  K  E  K          900

          .         .         .         .         .         .
 GGACAAGGACCCATGTTCCTGGATGCAGACTTTGTGGCCTTTACAAATCATTTTAAGCAA        2760
 G  Q  G  P  M  F  L  D  A  D  F  V  A  F  T  N  H  F  K  Q          920

          .         .         .         .    | 22    .         .
 GTCTTTTCTGATGTGCAGGCCAGAGAGAAAGAGCTACAGACAA | TTTTTGACACTTTGCCA     2820
 V  F  S  D  V  Q  A  R  E  K  E  L  Q  T  I |   F  D  T  L  P       940

          .         .         .         .         .         .
 CCAATGCGCTATCAGGAGACCATGAGTGCCATCAGGACATGGGTCCAGCAGTCAGAAACC        2880
 P  M  R  Y  Q  E  T  M  S  A  I  R  T  W  V  Q  Q  S  E  T          960

          .         .         .         .         .         .
 AAACTCTCCATACCTCAACTTAGTGTCACCGACTATGAAATCATGGAGCAGAGACTCGGG        2940
 K  L  S  I  P  Q  L  S  V  T  D  Y  E  I  M  E  Q  R  L  G          980

           | 23        .         .         .         .         .
 GAATTGCAG | GCTTTACAAAGTTCTCTGCAAGAGCAACAAAGTGGCCTATACTATCTCAGC     3000
 E  L  Q   | A  L  Q  S  S  L  Q  E  Q  Q  S  G  L  Y  Y  L  S       1000

          .         .         .         .         .         .
 ACCACTGTGAAAGAGATGTCGAAGAAAGCGCCCTCTGAAATTAGCCGGAAATATCAATCA        3060
 T  T  V  K  E  M  S  K  K  A  P  S  E  I  S  R  K  Y  Q  S          1020

          .         .         .         .         .         .
 GAATTTGAAGAAATTGAGGGACGCTGGAAGAAGCTCTCCTCCCAGCTGGTTGAGCATTGT        3120
 E  F  E  E  I  E  G  R  W  K  K  L  S  S  Q  L  V  E  H  C          1040

          .         .         .         .   | 24     .         .
 CAAAAGCTAGAGGAGCAAATGAATAAACTCCGAAAAATTCAG | AATCACATACAAACCCTG     3180
 Q  K  L  E  E  Q  M  N  K  L  R  K  I  Q   | N  H  I  Q  T  L       1060

          .         .         .         .         .         .
 AAGAAATGGATGGCTGAAGTTGATGTTTTTCTGAAGGAGGAATGGCCTGCCCTTGGGGAT        3240
 K  K  W  M  A  E  V  D  V  F  L  K  E  E  W  P  A  L  G  D          1080

          .         .         .       | 25 .         .         .
 TCAGAAATTCTAAAAAAGCAGCTGAAACAGTGCAGA | CTTTTAGTCAGTGATATTCAGACA     3300
 S  E  I  L  K  K  Q  L  K  Q  C  R   | L  L  V  S  D  I  Q  T       1100

          .         .         .         .         .         .
 ATTCAGCCCAGTCTAAACAGTGTCAATGAAGGTGGGCAGAAGATAAAGAATGAAGCAGAG        3360
 I  Q  P  S  L  N  S  V  N  E  G  G  Q  K  I  K  N  E  A  E          1120

          .         .         .         .         .         .
 CCAGAGTTTGCTTCGAGACTTGAGACAGAACTCAAAGAACTTAACACTCAGTGGGATCAC        3420
 P  E  F  A  S  R  L  E  T  E  L  K  E  L  N  T  Q  W  D  H          1140

          .   | 26     .         .         .         .         .
 ATGTGCCAACAG | GTCTATGCCAGAAAGGAGGCCTTGAAGGGAGGTTTGGAGAAAACTGTA     3480
 M  C  Q  Q   | V  Y  A  R  K  E  A  L  K  G  G  L  E  K  T  V       1160

          .         .         .         .         .         .
 AGCCTCCAGAAAGATCTATCAGAGATGCACGAATGGATGACACAAGCTGAAGAAGAGTAT        3540
 S  L  Q  K  D  L  S  E  M  H  E  W  M  T  Q  A  E  E  E  Y          1180

          .         .         .         .         .         .
 CTTGAGAGAGATTTTGAATATAAAACTCCAGATGAATTACAGAAAGCAGTTGAAGAGATG        3600
 L  E  R  D  F  E  Y  K  T  P  D  E  L  Q  K  A  V  E  E  M          1200

     | 27    .         .         .         .         .         .
 AAG | AGAGCTAAAGAAGAGGCCCAACAAAAAGAAGCGAAAGTGAAACTCCTTACTGAGTCT     3660
 K   | R  A  K  E  E  A  Q  Q  K  E  A  K  V  K  L  L  T  E  S       1220

          .         .         .         .         .         .
 GTAAATAGTGTCATAGCTCAAGCTCCACCTGTAGCACAAGAGGCCTTAAAAAAGGAACTT        3720
 V  N  S  V  I  A  Q  A  P  P  V  A  Q  E  A  L  K  K  E  L          1240

          .         .         .         .         .         .
 GAAACTCTAACCACCAACTACCAGTGGCTCTGCACTAGGCTGAATGGGAAATGCAAGACT        3780
 E  T  L  T  T  N  Y  Q  W  L  C  T  R  L  N  G  K  C  K  T          1260

        | 28 .         .         .         .         .         .
 TTGGAA | GAAGTTTGGGCATGTTGGCATGAGTTATTGTCATACTTGGAGAAAGCAAACAAG     3840
 L  E   | E  V  W  A  C  W  H  E  L  L  S  Y  L  E  K  A  N  K       1280

          .         .         .         .         .         .
 TGGCTAAATGAAGTAGAATTTAAACTTAAAACCACTGAAAACATTCCTGGCGGAGCTGAG        3900
 W  L  N  E  V  E  F  K  L  K  T  T  E  N  I  P  G  G  A  E          1300

          .         .  | 29      .         .         .         .
 GAAATCTCTGAGGTGCTAGAT | TCACTTGAAAATTTGATGCGACATTCAGAGGATAACCCA     3960
 E  I  S  E  V  L  D   | S  L  E  N  L  M  R  H  S  E  D  N  P       1320

          .         .         .         .         .         .
 AATCAGATTCGCATATTGGCACAGACCCTAACAGATGGCGGAGTCATGGATGAGCTAATC        4020
 N  Q  I  R  I  L  A  Q  T  L  T  D  G  G  V  M  D  E  L  I          1340

          .         .         .         .         .  | 30      .
 AATGAGGAACTTGAGACATTTAATTCTCGTTGGAGGGAACTACATGAAGAG | GCTGTAAGG     4080
 N  E  E  L  E  T  F  N  S  R  W  R  E  L  H  E  E   | A  V  R       1360

          .         .         .         .         .         .
 AGGCAAAAGTTGCTTGAACAGAGCATCCAGTCTGCCCAGGAGACTGAAAAATCCTTACAC        4140
 R  Q  K  L  L  E  Q  S  I  Q  S  A  Q  E  T  E  K  S  L  H          1380

          .         .         .         .         .         .
 TTAATCCAGGAGTCCCTCACATTCATTGACAAGCAGTTGGCAGCTTATATTGCAGACAAG        4200
 L  I  Q  E  S  L  T  F  I  D  K  Q  L  A  A  Y  I  A  D  K          1400

          .         .         .    | 31    .         .         .
 GTGGACGCAGCTCAAATGCCTCAGGAAGCCCAG | AAAATCCAATCTGATTTGACAAGTCAT     4260
 V  D  A  A  Q  M  P  Q  E  A  Q   | K  I  Q  S  D  L  T  S  H       1420

          .         .         .         .         .         .
 GAGATCAGTTTAGAAGAAATGAAGAAACATAATCAGGGGAAGGAGGCTGCCCAAAGAGTC        4320
 E  I  S  L  E  E  M  K  K  H  N  Q  G  K  E  A  A  Q  R  V          1440

          .         .     | 32   .         .         .         .
 CTGTCTCAGATTGATGTTGCACAG | AAAAAATTACAAGATGTCTCCATGAAGTTTCGATTA     4380
 L  S  Q  I  D  V  A  Q   | K  K  L  Q  D  V  S  M  K  F  R  L       1460

          .         .         .         .         .         .
 TTCCAGAAACCAGCCAATTTTGAGCAGCGTCTACAAGAAAGTAAGATGATTTTAGATGAA        4440
 F  Q  K  P  A  N  F  E  Q  R  L  Q  E  S  K  M  I  L  D  E          1480

          .         .         .         .         .         .
 GTGAAGATGCACTTGCCTGCATTGGAAACAAAGAGTGTGGAACAGGAAGTAGTACAGTCA        4500
 V  K  M  H  L  P  A  L  E  T  K  S  V  E  Q  E  V  V  Q  S          1500

          .         | 33         .         .         .         .
 CAGCTAAATCATTGTGTG | AACTTGTATAAAAGTCTGAGTGAAGTGAAGTCTGAAGTGGAA     4560
 Q  L  N  H  C  V   | N  L  Y  K  S  L  S  E  V  K  S  E  V  E       1520

          .         .         .         .         .         .
 ATGGTGATAAAGACTGGACGTCAGATTGTACAGAAAAAGCAGACGGAAAATCCCAAAGAA        4620
 M  V  I  K  T  G  R  Q  I  V  Q  K  K  Q  T  E  N  P  K  E          1540

          .         .         .         .         .     | 34   .
 CTTGATGAAAGAGTAACAGCTTTGAAATTGCATTATAATGAGCTGGGAGCAAAG | GTAACA     4680
 L  D  E  R  V  T  A  L  K  L  H  Y  N  E  L  G  A  K   | V  T       1560

          .         .         .         .         .         .
 GAAAGAAAGCAACAGTTGGAGAAATGCTTGAAATTGTCCCGTAAGATGCGAAAGGAAATG        4740
 E  R  K  Q  Q  L  E  K  C  L  K  L  S  R  K  M  R  K  E  M          1580

          .         .         .         .         .         .
 AATGTCTTGACAGAATGGCTGGCAGCTACAGATATGGAATTGACAAAGAGATCAGCAGTT        4800
 N  V  L  T  E  W  L  A  A  T  D  M  E  L  T  K  R  S  A  V          1600

          .         .         .         .      | 35  .         .
 GAAGGAATGCCTAGTAATTTGGATTCTGAAGTTGCCTGGGGAAAG | GCTACTCAAAAAGAG     4860
 E  G  M  P  S  N  L  D  S  E  V  A  W  G  K   | A  T  Q  K  E       1620

          .         .         .         .         .         .
 ATTGAGAAACAGAAGGTGCACCTGAAGAGTATCACAGAGGTAGGAGAGGCCTTGAAAACA        4920
 I  E  K  Q  K  V  H  L  K  S  I  T  E  V  G  E  A  L  K  T          1640

          .         .         .         .         .         .
 GTTTTGGGCAAGAAGGAGACGTTGGTGGAAGATAAACTCAGTCTTCTGAATAGTAACTGG        4980
 V  L  G  K  K  E  T  L  V  E  D  K  L  S  L  L  N  S  N  W          1660

          .         .         .         .      | 36  .         .
 ATAGCTGTCACCTCCCGAGCAGAAGAGTGGTTAAATCTTTTGTTG | GAATACCAGAAACAC     5040
 I  A  V  T  S  R  A  E  E  W  L  N  L  L  L   | E  Y  Q  K  H       1680

          .         .         .         .         .         .
 ATGGAAACTTTTGACCAGAATGTGGACCACATCACAAAGTGGATCATTCAGGCTGACACA        5100
 M  E  T  F  D  Q  N  V  D  H  I  T  K  W  I  I  Q  A  D  T          1700

          .         .         .         .         .     | 37   .
 CTTTTGGATGAATCAGAGAAAAAGAAACCCCAGCAAAAAGAAGACGTGCTTAAG | CGTTTA     5160
 L  L  D  E  S  E  K  K  K  P  Q  Q  K  E  D  V  L  K   | R  L       1720

          .         .         .         .         .         .
 AAGGCAGAACTGAATGACATACGCCCAAAGGTGGACTCTACACGTGACCAAGCAGCAAAC        5220
 K  A  E  L  N  D  I  R  P  K  V  D  S  T  R  D  Q  A  A  N          1740

          .         .         .         .         .         .
 TTGATGGCAAACCGCGGTGACCACTGCAGGAAATTAGTAGAGCCCCAAATCTCAGAGCTC        5280
 L  M  A  N  R  G  D  H  C  R  K  L  V  E  P  Q  I  S  E  L          1760

          .         .         .         .      | 38  .         .
 AACCATCGATTTGCAGCCATTTCACACAGAATTAAGACTGGAAAG | GCCTCCATTCCTTTG     5340
 N  H  R  F  A  A  I  S  H  R  I  K  T  G  K   | A  S  I  P  L       1780

          .         .         .         .         .         .
 AAGGAATTGGAGCAGTTTAACTCAGATATACAAAAATTGCTTGAACCACTGGAGGCTGAA        5400
 K  E  L  E  Q  F  N  S  D  I  Q  K  L  L  E  P  L  E  A  E          1800

          .         .         .         .         | 39         .
 ATTCAGCAGGGGGTGAATCTGAAAGAGGAAGACTTCAATAAAGATATG | AATGAAGACAAT     5460
 I  Q  Q  G  V  N  L  K  E  E  D  F  N  K  D  M   | N  E  D  N       1820

          .         .         .         .         .         .
 GAGGGTACTGTAAAAGAATTGTTGCAAAGAGGAGACAACTTACAACAAAGAATCACAGAT        5520
 E  G  T  V  K  E  L  L  Q  R  G  D  N  L  Q  Q  R  I  T  D          1840

          .         .         .         .         .         .
 GAGAGAAAGCGAGAGGAAATAAAGATAAAACAGCAGCTGTTACAGACAAAACATAATGCT        5580
 E  R  K  R  E  E  I  K  I  K  Q  Q  L  L  Q  T  K  H  N  A          1860

        | 40 .         .         .         .         .         .
 CTCAAG | GATTTGAGGTCTCAAAGAAGAAAAAAGGCTCTAGAAATTTCTCATCAGTGGTAT     5640
 L  K   | D  L  R  S  Q  R  R  K  K  A  L  E  I  S  H  Q  W  Y       1880

          .         .         .         .         .         .
 CAGTACAAGAGGCAGGCTGATGATCTCCTGAAATGCTTGGATGACATTGAAAAAAAATTA        5700
 Q  Y  K  R  Q  A  D  D  L  L  K  C  L  D  D  I  E  K  K  L          1900

          .         .         .          | 41        .         .
 GCCAGCCTACCTGAGCCCAGAGATGAAAGGAAAATAAAG | GAAATTGATCGGGAATTGCAG     5760
 A  S  L  P  E  P  R  D  E  R  K  I  K   | E  I  D  R  E  L  Q       1920

          .         .         .         .         .         .
 AAGAAGAAAGAGGAGCTGAATGCAGTGCGTAGGCAAGCTGAGGGCTTGTCTGAGGATGGG        5820
 K  K  K  E  E  L  N  A  V  R  R  Q  A  E  G  L  S  E  D  G          1940

          .         .         .         .         .         .
 GCCGCAATGGCAGTGGAGCCAACTCAGATCCAGCTCAGCAAGCGCTGGCGGGAAATTGAG        5880
 A  A  M  A  V  E  P  T  Q  I  Q  L  S  K  R  W  R  E  I  E          1960

          .         .         .         .   | 42     .         .
 AGCAAATTTGCTCAGTTTCGAAGACTCAACTTTGCACAAATT | CACACTGTCCGTGAAGAA     5940
 S  K  F  A  Q  F  R  R  L  N  F  A  Q  I   | H  T  V  R  E  E       1980

          .         .         .         .         .         .
 ACGATGATGGTGATGACTGAAGACATGCCTTTGGAAATTTCTTATGTGCCTTCTACTTAT        6000
 T  M  M  V  M  T  E  D  M  P  L  E  I  S  Y  V  P  S  T  Y          2000

          .         .         .         .         .         .
 TTGACTGAAATCACTCATGTCTCACAAGCCCTATTAGAAGTGGAACAACTTCTCAATGCT        6060
 L  T  E  I  T  H  V  S  Q  A  L  L  E  V  E  Q  L  L  N  A          2020

          .         .         .         .         .        | 43.
 CCTGACCTCTGTGCTAAGGACTTTGAAGATCTCTTTAAGCAAGAGGAGTCTCTGAAG | AAT     6120
 P  D  L  C  A  K  D  F  E  D  L  F  K  Q  E  E  S  L  K   | N       2040

          .         .         .         .         .         .
 ATAAAAGATAGTCTACAACAAAGCTCAGGTCGGATTGACATTATTCATAGCAAGAAGACA        6180
 I  K  D  S  L  Q  Q  S  S  G  R  I  D  I  I  H  S  K  K  T          2060

          .         .         .         .         .         .
 GCAGCATTGCAAAGTGCAACGCCTGTGGAAAGGGTGAAGCTACAGGAAGCTCTCTCCCAG        6240
 A  A  L  Q  S  A  T  P  V  E  R  V  K  L  Q  E  A  L  S  Q          2080

          .         .         .         .         . | 44       .
 CTTGATTTCCAATGGGAAAAAGTTAACAAAATGTACAAGGACCGACAAGG | GCGATTTGAC     6300
 L  D  F  Q  W  E  K  V  N  K  M  Y  K  D  R  Q  G  |  R  F  D       2100

          .         .         .         .         .         .
 AGATCTGTTGAGAAATGGCGGCGTTTTCATTATGATATAAAGATATTTAATCAGTGGCTA        6360
 R  S  V  E  K  W  R  R  F  H  Y  D  I  K  I  F  N  Q  W  L          2120

          .         .         .         .         .         .
 ACAGAAGCTGAACAGTTTCTCAGAAAGACACAAATTCCTGAGAATTGGGAACATGCTAAA        6420
 T  E  A  E  Q  F  L  R  K  T  Q  I  P  E  N  W  E  H  A  K          2140

          .         | 45         .         .         .         .
 TACAAATGGTATCTTAAG | GAACTCCAGGATGGCATTGGGCAGCGGCAAACTGTTGTCAGA     6480
 Y  K  W  Y  L  K   | E  L  Q  D  G  I  G  Q  R  Q  T  V  V  R       2160

          .         .         .         .         .         .
 ACATTGAATGCAACTGGGGAAGAAATAATTCAGCAATCCTCAAAAACAGATGCCAGTATT        6540
 T  L  N  A  T  G  E  E  I  I  Q  Q  S  S  K  T  D  A  S  I          2180

          .         .         .         .         .         .
 CTACAGGAAAAATTGGGAAGCCTGAATCTGCGGTGGCAGGAGGTCTGCAAACAGCTGTCA        6600
 L  Q  E  K  L  G  S  L  N  L  R  W  Q  E  V  C  K  Q  L  S          2200

          .     | 46   .         .         .         .         .
 GACAGAAAAAAGAG | GCTAGAAGAACAAAAGAATATCTTGTCAGAATTTCAAAGAGATTTA     6660
 D  R  K  K  R  |  L  E  E  Q  K  N  I  L  S  E  F  Q  R  D  L       2220

          .         .         .         .         .         .
 AATGAATTTGTTTTATGGTTGGAGGAAGCAGATAACATTGCTAGTATCCCACTTGAACCT        6720
 N  E  F  V  L  W  L  E  E  A  D  N  I  A  S  I  P  L  E  P          2240

          .         .         .         .   | 47     .         .
 GGAAAAGAGCAGCAACTAAAAGAAAAGCTTGAGCAAGTCAAG | TTACTGGTGGAAGAGTTG     6780
 G  K  E  Q  Q  L  K  E  K  L  E  Q  V  K   | L  L  V  E  E  L       2260

          .         .         .         .         .         .
 CCCCTGCGCCAGGGAATTCTCAAACAATTAAATGAAACTGGAGGACCCGTGCTTGTAAGT        6840
 P  L  R  Q  G  I  L  K  Q  L  N  E  T  G  G  P  V  L  V  S          2280

          .         .         .         .         .         .
 GCTCCCATAAGCCCAGAAGAGCAAGATAAACTTGAAAATAAGCTCAAGCAGACAAATCTC        6900
 A  P  I  S  P  E  E  Q  D  K  L  E  N  K  L  K  Q  T  N  L          2300

          .   | 48     .         .         .         .         .
 CAGTGGATAAAG | GTTTCCAGAGCTTTACCTGAGAAACAAGGAGAAATTGAAGCTCAAATA     6960
 Q  W  I  K   | V  S  R  A  L  P  E  K  Q  G  E  I  E  A  Q  I       2320

          .         .         .         .         .         .
 AAAGACCTTGGGCAGCTTGAAAAAAAGCTTGAAGACCTTGAAGAGCAGTTAAATCATCTG        7020
 K  D  L  G  Q  L  E  K  K  L  E  D  L  E  E  Q  L  N  H  L          2340

          .         .         .         .         .         .
 CTGCTGTGGTTATCTCCTATTAGGAATCAGTTGGAAATTTATAACCAACCAAACCAAGAA        7080
 L  L  W  L  S  P  I  R  N  Q  L  E  I  Y  N  Q  P  N  Q  E          2360

          .         | 49         .         .         .         .
 GGACCATTTGACGTTCAG | GAAACTGAAATAGCAGTTCAAGCTAAACAACCGGATGTGGAA     7140
 G  P  F  D  V  Q   | E  T  E  I  A  V  Q  A  K  Q  P  D  V  E       2380

          .         .         .         .         .         .
 GAGATTTTGTCTAAAGGGCAGCATTTGTACAAGGAAAAACCAGCCACTCAGCCAGTGAAG        7200
 E  I  L  S  K  G  Q  H  L  Y  K  E  K  P  A  T  Q  P  V  K          2400

  | 50       .         .         .         .         .         .
  | AGGAAGTTAGAAGATCTGAGCTCTGAGTGGAAGGCGGTAAACCGTTTACTTCAAGAGCTG     7260
  | R  K  L  E  D  L  S  S  E  W  K  A  V  N  R  L  L  Q  E  L       2420

          .         .         .         .          | 51        .
 AGGGCAAAGCAGCCTGACCTAGCTCCTGGACTGACCACTATTGGAGCCT | CTCCTACTCAG     7320
 R  A  K  Q  P  D  L  A  P  G  L  T  T  I  G  A  S |   P  T  Q       2440

          .         .         .         .         .         .
 ACTGTTACTCTGGTGACACAACCTGTGGTTACTAAGGAAACTGCCATCTCCAAACTAGAA        7380
 T  V  T  L  V  T  Q  P  V  V  T  K  E  T  A  I  S  K  L  E          2460

          .         .         .         .         .         .
 ATGCCATCTTCCTTGATGTTGGAGGTACCTGCTCTGGCAGATTTCAACCGGGCTTGGACA        7440
 M  P  S  S  L  M  L  E  V  P  A  L  A  D  F  N  R  A  W  T          2480

          .         .         .         .         .         .
 GAACTTACCGACTGGCTTTCTCTGCTTGATCAAGTTATAAAATCACAGAGGGTGATGGTG        7500
 E  L  T  D  W  L  S  L  L  D  Q  V  I  K  S  Q  R  V  M  V          2500

          .         .         .         .   | 52     .         .
 GGTGACCTTGAGGATATCAACGAGATGATCATCAAGCAGAAG | GCAACAATGCAGGATTTG     7560
 G  D  L  E  D  I  N  E  M  I  I  K  Q  K   | A  T  M  Q  D  L       2520

          .         .         .         .         .         .
 GAACAGAGGCGTCCCCAGTTGGAAGAACTCATTACCGCTGCCCAAAATTTGAAAAACAAG        7620
 E  Q  R  R  P  Q  L  E  E  L  I  T  A  A  Q  N  L  K  N  K          2540

          .         .         .         . | 53       .         .
 ACCAGCAATCAAGAGGCTAGAACAATCATTACGGATCGAA | TTGAAAGAATTCAGAATCAG     7680
 T  S  N  Q  E  A  R  T  I  I  T  D  R  I |   E  R  I  Q  N  Q       2560

          .         .         .         .         .         .
 TGGGATGAAGTACAAGAACACCTTCAGAACCGGAGGCAACAGTTGAATGAAATGTTAAAG        7740
 W  D  E  V  Q  E  H  L  Q  N  R  R  Q  Q  L  N  E  M  L  K          2580

          .         .         .         .         .         .
 GATTCAACACAATGGCTGGAAGCTAAGGAAGAAGCTGAGCAGGTCTTAGGACAGGCCAGA        7800
 D  S  T  Q  W  L  E  A  K  E  E  A  E  Q  V  L  G  Q  A  R          2600

          .         .         .         .         .         .
 GCCAAGCTTGAGTCATGGAAGGAGGGTCCCTATACAGTAGATGCAATCCAAAAGAAAATC        7860
 A  K  L  E  S  W  K  E  G  P  Y  T  V  D  A  I  Q  K  K  I          2620

          .   | 54     .         .         .         .         .
 ACAGAAACCAAG | CAGTTGGCCAAAGACCTCCGCCAGTGGCAGACAAATGTAGATGTGGCA     7920
 T  E  T  K   | Q  L  A  K  D  L  R  Q  W  Q  T  N  V  D  V  A       2640

          .         .         .         .         .         .
 AATGACTTGGCCCTGAAACTTCTCCGGGATTATTCTGCAGATGATACCAGAAAAGTCCAC        7980
 N  D  L  A  L  K  L  L  R  D  Y  S  A  D  D  T  R  K  V  H          2660

          .         .         .         .        | 55.         .
 ATGATAACAGAGAATATCAATGCCTCTTGGAGAAGCATTCATAAAAG | GGTGAGTGAGCGA     8040
 M  I  T  E  N  I  N  A  S  W  R  S  I  H  K  R  |  V  S  E  R       2680

          .         .         .         .         .         .
 GAGGCTGCTTTGGAAGAAACTCATAGATTACTGCAACAGTTCCCCCTGGACCTGGAAAAG        8100
 E  A  A  L  E  E  T  H  R  L  L  Q  Q  F  P  L  D  L  E  K          2700

          .         .         .         .         .         .
 TTTCTTGCCTGGCTTACAGAAGCTGAAACAACTGCCAATGTCCTACAGGATGCTACCCGT        8160
 F  L  A  W  L  T  E  A  E  T  T  A  N  V  L  Q  D  A  T  R          2720

          .         .         .         .         .        | 56.
 AAGGAAAGGCTCCTAGAAGACTCCAAGGGAGTAAAAGAGCTGATGAAACAATGGCAA | GAC     8220
 K  E  R  L  L  E  D  S  K  G  V  K  E  L  M  K  Q  W  Q   | D       2740

          .         .         .         .         .         .
 CTCCAAGGTGAAATTGAAGCTCACACAGATGTTTATCACAACCTGGATGAAAACAGCCAA        8280
 L  Q  G  E  I  E  A  H  T  D  V  Y  H  N  L  D  E  N  S  Q          2760

          .         .         .         .         .         .
 AAAATCCTGAGATCCCTGGAAGGTTCCGATGATGCAGTCCTGTTACAAAGACGTTTGGAT        8340
 K  I  L  R  S  L  E  G  S  D  D  A  V  L  L  Q  R  R  L  D          2780

          .         .         .         .         . | 57       .
 AACATGAACTTCAAGTGGAGTGAACTTCGGAAAAAGTCTCTCAACATTAG | GTCCCATTTG     8400
 N  M  N  F  K  W  S  E  L  R  K  K  S  L  N  I  R  |  S  H  L       2800

          .         .         .         .         .         .
 GAAGCCAGTTCTGACCAGTGGAAGCGTCTGCACCTTTCTCTGCAGGAACTTCTGGTGTGG        8460
 E  A  S  S  D  Q  W  K  R  L  H  L  S  L  Q  E  L  L  V  W          2820

          .         .         .         .         .         .
 CTACAGCTGAAAGATGATGAATTAAGCCGGCAGGCACCTATTGGAGGCGACTTTCCAGCA        8520
 L  Q  L  K  D  D  E  L  S  R  Q  A  P  I  G  G  D  F  P  A          2840

          .         .        | 58.         .         .         .
 GTTCAGAAGCAGAACGATGTACATAGG | GCCTTCAAGAGGGAATTGAAAACTAAAGAACCT     8580
 V  Q  K  Q  N  D  V  H  R   | A  F  K  R  E  L  K  T  K  E  P       2860

          .         .         .         .         .         .
 GTAATCATGAGTACTCTTGAGACTGTACGAATATTTCTGACAGAGCAGCCTTTGGAAGGA        8640
 V  I  M  S  T  L  E  T  V  R  I  F  L  T  E  Q  P  L  E  G          2880

          .         .         | 59         .         .         .
 CTAGAGAAACTCTACCAGGAGCCCAGAG | AGCTGCCTCCTGAGGAGAGAGCCCAGAATGTC     8700
 L  E  K  L  Y  Q  E  P  R  E |   L  P  P  E  E  R  A  Q  N  V       2900

          .         .         .         .         .         .
 ACTCGGCTTCTACGAAAGCAGGCTGAGGAGGTCAATACTGAGTGGGAAAAATTGAACCTG        8760
 T  R  L  L  R  K  Q  A  E  E  V  N  T  E  W  E  K  L  N  L          2920

          .         .         .         .         .         .
 CACTCCGCTGACTGGCAGAGAAAAATAGATGAGACCCTTGAAAGACTCCAGGAACTTCAA        8820
 H  S  A  D  W  Q  R  K  I  D  E  T  L  E  R  L  Q  E  L  Q          2940

          .         .         .         .         .         .
 GAGGCCACGGATGAGCTGGACCTCAAGCTGCGCCAAGCTGAGGTGATCAAGGGATCCTGG        8880
 E  A  T  D  E  L  D  L  K  L  R  Q  A  E  V  I  K  G  S  W          2960

          .         .         .         .         .        | 60.
 CAGCCCGTGGGCGATCTCCTCATTGACTCTCTCCAAGATCACCTCGAGAAAGTCAAG | GCA     8940
 Q  P  V  G  D  L  L  I  D  S  L  Q  D  H  L  E  K  V  K   | A       2980

          .         .         .         .         .         .
 CTTCGAGGAGAAATTGCGCCTCTGAAAGAGAACGTGAGCCACGTCAATGACCTTGCTCGC        9000
 L  R  G  E  I  A  P  L  K  E  N  V  S  H  V  N  D  L  A  R          3000

          .         .         .         .         .         .
 CAGCTTACCACTTTGGGCATTCAGCTCTCACCGTATAACCTCAGCACTCTGGAAGACCTG        9060
 Q  L  T  T  L  G  I  Q  L  S  P  Y  N  L  S  T  L  E  D  L          3020

          .         .     | 61   .         .         .         .
 AACACCAGATGGAAGCTTCTGCAG | GTGGCCGTCGAGGACCGAGTCAGGCAGCTGCATGAA     9120
 N  T  R  W  K  L  L  Q   | V  A  V  E  D  R  V  R  Q  L  H  E       3040

          .         .         .         .    | 62    .         .
 GCCCACAGGGACTTTGGTCCAGCATCTCAGCACTTTCTTTCCA | CGTCTGTCCAGGGTCCC     9180
 A  H  R  D  F  G  P  A  S  Q  H  F  L  S  T |   S  V  Q  G  P       3060

          .         .         .         .     | 63   .         .
 TGGGAGAGAGCCATCTCGCCAAACAAAGTGCCCTACTATATCAA | CCACGAGACTCAAACA     9240
 W  E  R  A  I  S  P  N  K  V  P  Y  Y  I  N  |  H  E  T  Q  T       3080

          .         .         .         .       | 64 .         .
 ACTTGCTGGGACCATCCCAAAATGACAGAGCTCTACCAGTCTTTAG | CTGACCTGAATAAT     9300
 T  C  W  D  H  P  K  M  T  E  L  Y  Q  S  L  A |   D  L  N  N       3100

          .         .         .         .         .         .
 GTCAGATTCTCAGCTTATAGGACTGCCATGAAACTCCGAAGACTGCAGAAGGCCCTTTGC        9360
 V  R  F  S  A  Y  R  T  A  M  K  L  R  R  L  Q  K  A  L  C          3120

   | 65      .         .         .         .         .         .
 T | TGGATCTCTTGAGCCTGTCAGCTGCATGTGATGCCTTGGACCAGCACAACCTCAAGCAA     9420
 L |   D  L  L  S  L  S  A  A  C  D  A  L  D  Q  H  N  L  K  Q       3140

          .         .         .         .         .         .
 AATGACCAGCCCATGGATATCCTGCAGATTATTAATTGTTTGACCACTATTTATGACCGC        9480
 N  D  Q  P  M  D  I  L  Q  I  I  N  C  L  T  T  I  Y  D  R          3160

          .         .         .         .         .         .
 CTGGAGCAAGAGCACAACAATTTGGTCAACGTCCCTCTCTGCGTGGATATGTGTCTGAAC        9540
 L  E  Q  E  H  N  N  L  V  N  V  P  L  C  V  D  M  C  L  N          3180

          .         .    | 66    .         .         .         .
 TGGCTGCTGAATGTTTATGATAC | GGGACGAACAGGGAGGATCCGTGTCCTGTCTTTTAAA     9600
 W  L  L  N  V  Y  D  T  |  G  R  T  G  R  I  R  V  L  S  F  K       3200

          .         .         .         .          | 67        .
 ACTGGCATCATTTCCCTGTGTAAAGCACATTTGGAAGACAAGTACAGAT | ACCTTTTCAAG     9660
 T  G  I  I  S  L  C  K  A  H  L  E  D  K  Y  R  Y |   L  F  K       3220

          .         .         .         .         .         .
 CAAGTGGCAAGTTCAACAGGATTTTGTGACCAGCGCAGGCTGGGCCTCCTTCTGCATGAT        9720
 Q  V  A  S  S  T  G  F  C  D  Q  R  R  L  G  L  L  L  H  D          3240

          .         .         .         .         .         .
 TCTATCCAAATTCCAAGACAGTTGGGTGAAGTTGCATCCTTTGGGGGCAGTAACATTGAG        9780
 S  I  Q  I  P  R  Q  L  G  E  V  A  S  F  G  G  S  N  I  E          3260

          .         .        | 68.         .         .         .
 CCAAGTGTCCGGAGCTGCTTCCAATTT | GCTAATAATAAGCCAGAGATCGAAGCGGCCCTC     9840
 P  S  V  R  S  C  F  Q  F   | A  N  N  K  P  E  I  E  A  A  L       3280

          .         .         .         .         .         .
 TTCCTAGACTGGATGAGACTGGAACCCCAGTCCATGGTGTGGCTGCCCGTCCTGCACAGA        9900
 F  L  D  W  M  R  L  E  P  Q  S  M  V  W  L  P  V  L  H  R          3300

          .         .         .         .         .         .
 GTGGCTGCTGCAGAAACTGCCAAGCATCAGGCCAAATGTAACATCTGCAAAGAGTGTCCA        9960
 V  A  A  A  E  T  A  K  H  Q  A  K  C  N  I  C  K  E  C  P          3320

          .     | 69   .         .         .         .         .
 ATCATTGGATTCAG | GTACAGGAGTCTAAAGCACTTTAATTATGACATCTGCCAAAGCTGC     10020
 I  I  G  F  R  |  Y  R  S  L  K  H  F  N  Y  D  I  C  Q  S  C       3340

          .         .         .         .         .         .
 TTTTTTTCTGGTCGAGTTGCAAAAGGCCATAAAATGCACTATCCCATGGTGGAATATTGC        10080
 F  F  S  G  R  V  A  K  G  H  K  M  H  Y  P  M  V  E  Y  C          3360

        | 70 .         .         .         .         .         .
 ACTCCG | ACTACATCAGGAGAAGATGTTCGAGACTTTGCCAAGGTACTAAAAAACAAATTT     10140
 T  P   | T  T  S  G  E  D  V  R  D  F  A  K  V  L  K  N  K  F       3380

          .         .         .         .         .         .
 CGAACCAAAAGGTATTTTGCGAAGCATCCCCGAATGGGCTACCTGCCAGTGCAGACTGTC        10200
 R  T  K  R  Y  F  A  K  H  P  R  M  G  Y  L  P  V  Q  T  V          3400

          .         .    | 71    .         .         .         .
 TTAGAGGGGGACAACATGGAAAC | TCCCGTTACTCTGATCAACTTCTGGCCAGTAGATTCT     10260
 L  E  G  D  N  M  E  T  |  P  V  T  L  I  N  F  W  P  V  D  S       3420

    | 72     .         .         .         .         .         .
 GC | GCCTGCCTCGTCCCCTCAGCTTTCACACGATGATACTCATTCACGCATTGAACATTAT     10320
 A  |  P  A  S  S  P  Q  L  S  H  D  D  T  H  S  R  I  E  H  Y       3440

          | 73         .         .         .         .         .
 GCTAGCAG | GCTAGCAGAAATGGAAAACAGCAATGGATCTTATCTAAATGATAGCATCTCT     10380
 A  S  R  |  L  A  E  M  E  N  S  N  G  S  Y  L  N  D  S  I  S       3460

          .     | 74   .         .         .         .         .
 CCTAATGAGAGCAT | AGATGATGAACATTTGTTAATCCAGCATTACTGCCAAAGTTTGAAC     10440
 P  N  E  S  I  |  D  D  E  H  L  L  I  Q  H  Y  C  Q  S  L  N       3480

          .         .         .         .         .         .
 CAGGACTCCCCCCTGAGCCAGCCTCGTAGTCCTGCCCAGATCTTGATTTCCTTAGAGAGT        10500
 Q  D  S  P  L  S  Q  P  R  S  P  A  Q  I  L  I  S  L  E  S          3500

          .         .         .         .         .    | 75    .
 GAGGAAAGAGGGGAGCTAGAGAGAATCCTAGCAGATCTTGAGGAAGAAAACAG | GAATCTG     10560
 E  E  R  G  E  L  E  R  I  L  A  D  L  E  E  E  N  R  |  N  L       3520

          .         .         .         .         .         .
 CAAGCAGAATATGACCGTCTAAAGCAGCAGCACGAACATAAAGGCCTGTCCCCACTGCCG        10620
 Q  A  E  Y  D  R  L  K  Q  Q  H  E  H  K  G  L  S  P  L  P          3540

          .         .         .         .         .         .
 TCCCCTCCTGAAATGATGCCCACCTCTCCCCAGAGTCCCCGGGATGCTGAGCTCATTGCT        10680
 S  P  P  E  M  M  P  T  S  P  Q  S  P  R  D  A  E  L  I  A          3560

          .         .         .         .         .         .
 GAGGCCAAGCTACTGCGTCAACACAAAGGCCGCCTGGAAGCCAGGATGCAAATCCTGGAA        10740
 E  A  K  L  L  R  Q  H  K  G  R  L  E  A  R  M  Q  I  L  E          3580

          .         .         .         .         .        | 76.
 GACCACAATAAACAGCTGGAGTCACAGTTACACAGGCTAAGGCAGCTGCTGGAGCAA | CCC     10800
 D  H  N  K  Q  L  E  S  Q  L  H  R  L  R  Q  L  L  E  Q   | P       3600

          .         .         .         .         .         .
 CAGGCAGAGGCCAAAGTGAATGGCACAACGGTGTCCTCTCCTTCTACCTCTCTACAGAGG        10860
 Q  A  E  A  K  V  N  G  T  T  V  S  S  P  S  T  S  L  Q  R          3620

          .         .         .         .         .         .
 TCCGACAGCAGTCAGCCTATGCTGCTCCGAGTGGTTGGCAGTCAAACTTCGGACTCCATG        10920
 S  D  S  S  Q  P  M  L  L  R  V  V  G  S  Q  T  S  D  S  M          3640

   | 77      .         .         .         .         .         .
 G | GTGAGGAAGATCTTCTCAGTCCTCCCCAGGACACAAGCACAGGGTTAGAGGAGGTGATG     10980
 G |   E  E  D  L  L  S  P  P  Q  D  T  S  T  G  L  E  E  V  M       3660

          .         .         .     | 78   .         .         .
 GAGCAACTCAACAACTCCTTCCCTAGTTCAAGAG | GAAGAAATACCCCTGGAAAGCCAATG     11040
 E  Q  L  N  N  S  F  P  S  S  R  G |   R  N  T  P  G  K  P  M       3680

        | 79 .  
 AGAGAG | GACACAATGTAG      11058
 R  E   | D  T  M  *         3685
            H  N  V  G
          .         .         .         .         .         .
 gaagtcttttccacatggcagatgatttgggcagagcgatggagtccttagtatcagtca        *60
   S  L  F  H  M  A  D  D  L  G  R  A  M  E  S  L  V  S  V  M

          .         .         .         .         .         .
 tgacagatgaagaaggagcagaataaatgttttacaactcctgattcccgcatggttttt        *120
   T  D  E  E  G  A  E  *
      (C-terminal end ancient dystrophin, -ex78 transcript)

          .         .         .         .         .         .
 ataatattcatacaacaaagaggattagacagtaagagtttacaagaaataaatctatat        *180

          .         .         .         .         .         .
 ttttgtgaagggtagtggtattatactgtagatttcagtagtttctaagtctgttattgt        *240

          .         .         .         .         .         .
 tttgttaacaatggcaggttttacacgtctatgcaattgtacaaaaaagttataagaaaa        *300

          .         .         .         .         .         .
 ctacatgtaaaatcttgatagctaaataacttgccatttctttatatggaacgcattttg        *360

          .         .         .         .         .         .
 ggttgtttaaaaatttataacagttataaagaaagattgtaaactaaagtgtgctttata        *420

          .         .         .         .         .         .
 aaaaaaagttgtttataaaaacccctaaaaacaaaacaaacacacacacacacacataca        *480

          .         .         .         .         .         .
 cacacacacacaaaactttgaggcagcgcattgttttgcatccttttggcgtgatatcca        *540

          .         .         .         .         .         .
 tatgaaattcatggctttttctttttttgcatattaaagataagacttcctctaccacca        *600

          .         .         .         .         .         .
 caccaaatgactactacacactgctcatttgagaactgtcagctgagtggggcaggcttg        *660

          .         .         .         .         .         .
 agttttcatttcatatatctatatgtctataagtatataaatactatagttatatagata        *720

          .         .         .         .         .         .
 aagagatacgaatttctatagactgactttttccattttttaaatgttcatgtcacatcc        *780

          .         .         .         .         .         .
 taatagaaagaaattacttctagtcagtcatccaggcttacctgcttggtctagaatgga        *840

          .         .         .         .         .         .
 tttttcccggagccggaagccaggaggaaactacaccacactaaaacattgtctacagct        *900

          .         .         .         .         .         .
 ccagatgtttctcattttaaacaactttccactgacaacgaaagtaaagtaaagtattgg        *960

          .         .         .         .         .         .
 atttttttaaagggaacatgtgaatgaatacacaggacttattatatcagagtgagtaat        *1020

          .         .         .         .         .         .
 cggttggttggttgattgattgattgattgatacattcagcttcctgctgctagcaatgc        *1080

          .         .         .         .         .         .
 cacgatttagatttaatgatgcttcagtggaaatcaatcagaaggtattctgaccttgtg        *1140

          .         .         .         .         .         .
 aacatcagaaggtattttttaactcccaagcagtagcaggacgatgatagggctggaggg        *1200

          .         .         .         .         .         .
 ctatggattcccagcccatccctgtgaaggagtaggccactctttaagtgaaggattgga        *1260

          .         .         .         .         .         .
 tgattgttcataatacataaagttctctgtaattacaactaaattattatgccctcttct        *1320

          .         .         .         .         .         .
 cacagtcaaaaggaactgggtggtttggtttttgttgcttttttagatttattgtcccat        *1380

          .         .         .         .         .         .
 gtgggatgagtttttaaatgccacaagacataatttaaaataaataaactttgggaaaag        *1440

          .         .         .         .         .         .
 gtgtaaaacagtagccccatcacatttgtgatactgacaggtatcaacccagaagcccat        *1500

          .         .         .         .         .         .
 gaactgtgtttccatcctttgcatttctctgcgagtagttccacacaggtttgtaagtaa        *1560

          .         .         .         .         .         .
 gtaagaaagaaggcaaattgattcaaatgttacaaaaaaacccttcttggtggattagac        *1620

          .         .         .         .         .         .
 aggttaaatatataaacaaacaaacaaaaattgctcaaaaaagaggagaaaagctcaaga        *1680

          .         .         .         .         .         .
 ggaaaagctaaggactggtaggaaaaagctttactctttcatgccattttatttcttttt        *1740

          .         .         .         .         .         .
 gatttttaaatcattcattcaatagataccaccgtgtgacctataattttgcaaatctgt        *1800

          .         .         .         .         .         .
 tacctctgacatcaagtgtaattagcttttggagagtgggctgacatcaagtgtaattag        *1860

          .         .         .         .         .         .
 cttttggagagtgggttttgtccattattaataattaattaattaacatcaaacacggct        *1920

          .         .         .         .         .         .
 tctcatgctatttctacctcactttggttttggggtgttcctgataattgtgcacacctg        *1980

          .         .         .         .         .         .
 agttcacagcttcaccacttgtccattgcgttattttctttttcctttataattctttct        *2040

          .         .         .         .         .         .
 ttttccttcataattttcaaaagaaaacccaaagctctaaggtaacaaattaccaaatta        *2100

          .         .         .         .         .         .
 catgaagatttggtttttgtcttgcatttttttcctttatgtgacgctggaccttttctt        *2160

          .         .         .         .         .         .
 tacccaaggatttttaaaactcagatttaaaacaaggggttactttacatcctactaaga        *2220

          .         .         .         .         .         .
 agtttaagtaagtaagtttcattctaaaatcagaggtaaatagagtgcataaataatttt        *2280

          .         .         .         .         .         .
 gttttaatctttttgtttttcttttagacacattagctctggagtgagtctgtcataata        *2340

          .         .         .         .         .         .
 tttgaacaaaaattgagagctttattgctgcattttaagcataattaatttggacattat        *2400

          .         .         .         .         .         .
 ttcgtgttgtgttctttataaccaccaagtattaaactgtaaatcataatgtaactgaag        *2460

          .         .         .         .         .         .
 cataaacatcacatggcatgttttgtcattgttttcaggtactgagttcttacttgagta        *2520

          .         .         .         .         .         .
 tcataatatattgtgttttaacaccaacactgtaacatttacgaattatttttttaaact        *2580

          .         .         .         .         .         .
 tcagttttactgcattttcacaacatatcagacttcaccaaatatatgccttactattgt        *2640

          .         .         .         .         . 
 attatagtactgctttactgtgtatctcaataaagcacgcagttatgttac                 *2691

 (downstream sequence)
Legend:
Please note that the ancient dystrophin, lacking exon 78, encodes a protein that is has a different, longer C-terminal end. Consequently, variants up to nucleotide c.*86 affect the protein.
Nucleotide numbering (following the rules of the HGVS for a 'Coding DNA Reference Sequence') is indicated at the right of the sequence, counting the A of the ATG translation initiating Methionine as 1. Every 10th nucleotide is indicated by a "." above the sequence. The DMD protein sequence is shown below the coding DNA sequence, with numbering indicated at the right starting with 1 for the translation initiating Methionine. Every 10th amino acid is shown in bold. The position of introns is indicated by a vertical line, splitting the two exons. The start of the first exon (transcription initiation site) is indicated by a '\', the end of the last exon (poly-A addition site) by a '/'. The exon number is indicated above the first nucleotide(s) of the exon. To aid the description of frame shift mutations, all stop codons in the +1 frame are shown in bold while all stop codons in the +2 frame are underlined.

| Top of page | LMDp home page |
| Remarks / information | Copyright©, liability |