collagen type VI alpha 3 (COL6A3) - coding DNA reference sequence

(used for mutation description)

(last modified February 5, 2011)


This file was created to facilitate the description of sequence variants in the COL6A3 gene based on a coding DNA reference sequence, following the HGVS recommendations. The sequence was taken from NG_008676.1 and covers COL6A3 transcript variant-1 (NM_004369.3). Exons 3 and 4 are alternatively spliced: transcript variant-5 (NM_057167.3) lacks exon 3, and variant-4 (NM_057166.4) lacks both exons. Transcript variants-3 (NM_057165.4) and -2 (NM_057164.4) use an alternative transcription termination site in intron 9 (exon 9b). In addition, transcript variant-3 skips exon 3 while variant-4 skips both exons 3 and 4.
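
As an illustration of how the c. and p. labels in the sequence blocks relate, the following minimal Python sketch translates the first coding block (c.1 to c.60) and converts a coding DNA position into its codon number. It assumes the standard genetic code; the names translate, codon_number and FIRST_BLOCK are hypothetical helpers introduced for this example and are not part of the reference file or of any HGVS tool.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence that starts at c.1 into one-letter amino acids."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore an incomplete trailing codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def codon_number(c_pos):
    """Return the p. (codon) number that contains coding position c_pos."""
    if c_pos < 1:
        # c.-N positions lie upstream of the ATG and are not translated.
        raise ValueError("c_pos must be 1 or greater")
    return (c_pos - 1) // 3 + 1


# First coding block of this file, copied from the line labelled c.60.
FIRST_BLOCK = "ATGAGGAAACATCGGCACTTGCCCTTAGTGGCCGTCTTTTGCCTCTTTCTCTCAGGCTTT"

print(translate(FIRST_BLOCK))  # MRKHRHLPLVAVFCLFLSGF
print(codon_number(60))        # 20
```

The printed peptide matches the amino acid line under the first coding block, and c.60 falls in codon 20, in agreement with the p.20 label on that line.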

Please note that introns are available by clicking on the exon numbers above the sequence.
 (upstream sequence)
                     .         .         .         .                g.5045
                aagccctgactggtatccctggccccagtccagtttggagctcag       c.-241

 .         .         .         .         .         .                g.5105
 tcttccaccaaaggccgttcagttctcctgggctccagcctcctgcaaggactgcaagag       c.-181

 .         .         .         .         .         .                g.5165
 ttttcctccgcagctctgagtctccacttttttggtggagaaaggctgcaaaaagaaaaa       c.-121

 .         .         .         .         .         .                g.5225
 gagacgcagtgagtgggaaaagtatgcatcctattcaaacctaattgaatcgaggagccc       c.-61

 .         .         .          | 2         .         .             g.22390
 agggacacacgccttcaggtttgctcaggg | gttcatatttggtgcttagacaaattcaaa    c.-1

          .         .         .         .         .         .       g.22450
 ATGAGGAAACATCGGCACTTGCCCTTAGTGGCCGTCTTTTGCCTCTTTCTCTCAGGCTTT       c.60
 M  R  K  H  R  H  L  P  L  V  A  V  F  C  L  F  L  S  G  F         p.20

          .         .         .  | 3       .         .         .    g.24032
 CCTACAACTCATGCCCAGCAGCAGCAAGCAG | ATGTCAAAAATGGTGCGGCTGCTGATATA    c.120
 P  T  T  H  A  Q  Q  Q  Q  A  D |   V  K  N  G  A  A  A  D  I      p.40
                                 ^  alternatively spliced exon

          .         .         .         .         .         .       g.24092
 ATATTTCTAGTGGATTCCTCTTGGACCATTGGAGAGGAACATTTCCAACTTGTTCGAGAG       c.180
 I  F  L  V  D  S  S  W  T  I  G  E  E  H  F  Q  L  V  R  E         p.60

          .         .         .         .         .         .       g.24152
 TTTCTATATGATGTTGTAAAATCCTTAGCTGTGGGAGAAAATGATTTCCATTTTGCTCTG       c.240
 F  L  Y  D  V  V  K  S  L  A  V  G  E  N  D  F  H  F  A  L         p.80

          .         .         .         .         .         .       g.24212
 GTCCAGTTCAACGGAAACCCACATACCGAGTTCCTGTTAAATACGTATCGTACTAAACAA       c.300
 V  Q  F  N  G  N  P  H  T  E  F  L  L  N  T  Y  R  T  K  Q         p.100

          .         .         .         .         .         .       g.24272
 GAAGTCCTTTCTCATATTTCCAACATGTCTTATATTGGGGGAACCAATCAGACTGGAAAA       c.360
 E  V  L  S  H  I  S  N  M  S  Y  I  G  G  T  N  Q  T  G  K         p.120

          .         .         .         .         .         .       g.24332
 GGATTAGAATACATAATGCAAAGCCACCTCACCAAGGCTGCTGGAAGCCGGGCCGGTGAC       c.420
 G  L  E  Y  I  M  Q  S  H  L  T  K  A  A  G  S  R  A  G  D         p.140

          .         .         .         .         .         .       g.24392
 GGAGTCCCTCAGGTTATCGTAGTGTTAACTGATGGACACTCGAAGGATGGCCTTGCTCTG       c.480
 G  V  P  Q  V  I  V  V  L  T  D  G  H  S  K  D  G  L  A  L         p.160

          .         .         .         .         .         .       g.24452
 CCCTCAGCGGAACTTAAGTCTGCTGATGTTAACGTGTTTGCAATTGGAGTTGAGGATGCA       c.540
 P  S  A  E  L  K  S  A  D  V  N  V  F  A  I  G  V  E  D  A         p.180

          .         .         .         .         .         .       g.24512
 GATGAAGGAGCGTTAAAAGAAATAGCAAGTGAACCGCTCAATATGCATATGTTCAACCTA       c.600
 D  E  G  A  L  K  E  I  A  S  E  P  L  N  M  H  M  F  N  L         p.200

          .         .         .         .         .         .       g.24572
 GAGAATTTTACCTCACTTCATGACATAGTAGGAAACTTAGTGTCCTGTGTGCATTCATCC       c.660
 E  N  F  T  S  L  H  D  I  V  G  N  L  V  S  C  V  H  S  S         p.220

          .         .         .         .          | 4         .    g.31034
 GTGAGTCCAGAAAGGGCTGGGGACACGGAAACCCTTAAAGACATCACAG | CACAAGACTCT    c.720
 V  S  P  E  R  A  G  D  T  E  T  L  K  D  I  T  A |   Q  D  S      p.240
                                                   ^  alternatively spliced exon

          .         .         .         .         .         .       g.31094
 GCTGACATTATTTTCCTTATTGATGGATCAAACAACACCGGAAGTGTCAATTTCGCAGTC       c.780
 A  D  I  I  F  L  I  D  G  S  N  N  T  G  S  V  N  F  A  V         p.260

          .         .         .         .         .         .       g.31154
 ATTCTCGACTTCCTTGTAAATCTCCTTGAGAAACTCCCAATTGGAACTCAGCAGATCCGA       c.840
 I  L  D  F  L  V  N  L  L  E  K  L  P  I  G  T  Q  Q  I  R         p.280

          .         .         .         .         .         .       g.31214
 GTGGGGGTGGTCCAGTTTAGCGATGAGCCCAGAACCATGTTCTCCTTGGACACCTACTCC       c.900
 V  G  V  V  Q  F  S  D  E  P  R  T  M  F  S  L  D  T  Y  S         p.300

          .         .         .         .         .         .       g.31274
 ACCAAGGCCCAGGTTCTGGGTGCAGTGAAAGCCCTCGGGTTTGCTGGTGGGGAGTTGGCC       c.960
 T  K  A  Q  V  L  G  A  V  K  A  L  G  F  A  G  G  E  L  A         p.320

          .         .         .         .         .         .       g.31334
 AATATCGGCCTCGCCCTTGATTTCGTGGTGGAGAACCACTTCACCCGGGCAGGGGGCAGC       c.1020
 N  I  G  L  A  L  D  F  V  V  E  N  H  F  T  R  A  G  G  S         p.340

          .         .         .         .         .         .       g.31394
 CGCGTGGAGGAAGGGGTTCCCCAGGTGCTGGTCCTCATAAGTGCCGGGCCTTCTAGTGAC       c.1080
 R  V  E  E  G  V  P  Q  V  L  V  L  I  S  A  G  P  S  S  D         p.360

          .         .         .         .         .         .       g.31454
 GAGATTCGCTACGGGGTGGTAGCACTGAAGCAGGCTAGCGTGTTCTCATTCGGCCTTGGA       c.1140
 E  I  R  Y  G  V  V  A  L  K  Q  A  S  V  F  S  F  G  L  G         p.380

          .         .         .         .         .         .       g.31514
 GCCCAGGCCGCCTCCAGGGCAGAGCTTCAGCACATAGCTACCGATGACAACTTGGTGTTT       c.1200
 A  Q  A  A  S  R  A  E  L  Q  H  I  A  T  D  D  N  L  V  F         p.400

          .         .         .         .         .         .       g.31574
 ACTGTCCCGGAATTCCGTAGCTTTGGGGACCTCCAGGAGAAATTACTGCCGTACATTGTT       c.1260
 T  V  P  E  F  R  S  F  G  D  L  Q  E  K  L  L  P  Y  I  V         p.420

          .         .         .         .         .   | 5      .    g.37716
 GGCGTGGCCCAAAGGCACATTGTCTTGAAACCGCCAACCATTGTCACACAAG | TCATTGAA    c.1320
 G  V  A  Q  R  H  I  V  L  K  P  P  T  I  V  T  Q  V |   I  E      p.440

          .         .         .         .         .         .       g.37776
 GTCAACAAGAGAGACATAGTCTTCCTGGTGGATGGCTCATCTGCACTGGGACTGGCCAAC       c.1380
 V  N  K  R  D  I  V  F  L  V  D  G  S  S  A  L  G  L  A  N         p.460

          .         .         .         .         .         .       g.37836
 TTCAATGCCATCCGAGACTTCATTGCTAAAGTCATCCAGAGGCTGGAAATCGGACAGGAT       c.1440
 F  N  A  I  R  D  F  I  A  K  V  I  Q  R  L  E  I  G  Q  D         p.480

          .         .         .         .         .         .       g.37896
 CTTATCCAGGTGGCAGTGGCCCAGTATGCAGACACTGTGAGGCCTGAATTTTATTTCAAT       c.1500
 L  I  Q  V  A  V  A  Q  Y  A  D  T  V  R  P  E  F  Y  F  N         p.500

          .         .         .         .         .         .       g.37956
 ACCCATCCAACAAAAAGGGAAGTCATAACCGCTGTGCGGAAAATGAAGCCCCTGGACGGC       c.1560
 T  H  P  T  K  R  E  V  I  T  A  V  R  K  M  K  P  L  D  G         p.520

          .         .         .         .         .         .       g.38016
 TCGGCCCTGTACACGGGCTCTGCTCTAGACTTTGTTCGTAACAACCTATTCACGAGTTCA       c.1620
 S  A  L  Y  T  G  S  A  L  D  F  V  R  N  N  L  F  T  S  S         p.540

          .         .         .         .         .         .       g.38076
 GCCGGCTACCGGGCTGCCGAGGGGATTCCTAAGCTTTTGGTGCTGATCACAGGTGGTAAG       c.1680
 A  G  Y  R  A  A  E  G  I  P  K  L  L  V  L  I  T  G  G  K         p.560

          .         .         .         .         .         .       g.38136
 TCCCTAGATGAAATCAGCCAGCCTGCCCAGGAGCTGAAGAGAAGCAGCATAATGGCCTTT       c.1740
 S  L  D  E  I  S  Q  P  A  Q  E  L  K  R  S  S  I  M  A  F         p.580

          .         .         .         .         .         .       g.38196
 GCCATTGGGAACAAGGGTGCCGATCAGGCTGAGCTGGAAGAGATCGCTTTCGACTCCTCC       c.1800
 A  I  G  N  K  G  A  D  Q  A  E  L  E  E  I  A  F  D  S  S         p.600

          .         .         .         .         .         .       g.38256
 CTGGTGTTCATCCCAGCTGAGTTCCGAGCCGCCCCATTGCAAGGCATGCTGCCTGGCTTG       c.1860
 L  V  F  I  P  A  E  F  R  A  A  P  L  Q  G  M  L  P  G  L         p.620

          .         .         .        | 6 .         .         .    g.39995
 CTGGCACCTCTCAGGACCCTCTCTGGAACCCCTGAAG | TTCACTCAAACAAAAGGGATATC    c.1920
 L  A  P  L  R  T  L  S  G  T  P  E  V |   H  S  N  K  R  D  I      p.640

          .         .         .         .         .         .       g.40055
 ATCTTTCTTTTGGATGGATCAGCCAACGTTGGAAAAACCAATTTCCCTTATGTGCGCGAC       c.1980
 I  F  L  L  D  G  S  A  N  V  G  K  T  N  F  P  Y  V  R  D         p.660

          .         .         .         .         .         .       g.40115
 TTTGTAATGAACCTAGTTAACAGCCTTGATATTGGAAATGACAATATTCGTGTTGGTTTA       c.2040
 F  V  M  N  L  V  N  S  L  D  I  G  N  D  N  I  R  V  G  L         p.680

          .         .         .         .         .         .       g.40175
 GTGCAATTTAGTGACACTCCTGTAACGGAGTTCTCTTTAAACACATACCAGACCAAGTCA       c.2100
 V  Q  F  S  D  T  P  V  T  E  F  S  L  N  T  Y  Q  T  K  S         p.700

          .         .         .         .         .         .       g.40235
 GATATCCTTGGTCATCTGAGGCAGCTGCAGCTCCAGGGAGGTTCGGGCCTGAACACAGGC       c.2160
 D  I  L  G  H  L  R  Q  L  Q  L  Q  G  G  S  G  L  N  T  G         p.720

          .         .         .         .         .         .       g.40295
 TCAGCCCTAAGCTATGTCTATGCCAACCACTTCACGGAAGCTGGCGGCAGCAGGATCCGT       c.2220
 S  A  L  S  Y  V  Y  A  N  H  F  T  E  A  G  G  S  R  I  R         p.740

          .         .         .         .         .         .       g.40355
 GAACACGTGCCGCAGCTCCTGCTTCTGCTCACAGCTGGGCAGTCTGAGGACTCCTATTTG       c.2280
 E  H  V  P  Q  L  L  L  L  L  T  A  G  Q  S  E  D  S  Y  L         p.760

          .         .         .         .         .         .       g.40415
 CAAGCTGCCAACGCCTTGACACGCGCGGGCATCCTGACTTTTTGTGTGGGAGCTAGCCAG       c.2340
 Q  A  A  N  A  L  T  R  A  G  I  L  T  F  C  V  G  A  S  Q         p.780

          .         .         .         .         .         .       g.40475
 GCGAATAAGGCAGAGCTTGAGCAGATTGCTTTTAACCCAAGCCTGGTGTATCTCATGGAT       c.2400
 A  N  K  A  E  L  E  Q  I  A  F  N  P  S  L  V  Y  L  M  D         p.800

          .         .         .         .         .         .       g.40535
 GATTTCAGCTCCCTGCCAGCTTTGCCTCAGCAGCTGATTCAGCCCCTAACCACATATGTT       c.2460
 D  F  S  S  L  P  A  L  P  Q  Q  L  I  Q  P  L  T  T  Y  V         p.820

          .         .         .        | 7 .         .         .    g.41886
 AGTGGAGGTGTGGAGGAAGTACCACTCGCTCAGCCAG | AGAGCAAGCGAGACATTCTGTTC    c.2520
 S  G  G  V  E  E  V  P  L  A  Q  P  E |   S  K  R  D  I  L  F      p.840

          .         .         .         .         .         .       g.41946
 CTCTTTGACGGCTCAGCCAATCTTGTGGGCCAGTTCCCTGTTGTCCGTGACTTTCTCTAC       c.2580
 L  F  D  G  S  A  N  L  V  G  Q  F  P  V  V  R  D  F  L  Y         p.860

          .         .         .         .         .         .       g.42006
 AAGATTATCGATGAGCTCAATGTGAAGCCAGAGGGGACCCGAATTGCGGTGGCTCAGTAC       c.2640
 K  I  I  D  E  L  N  V  K  P  E  G  T  R  I  A  V  A  Q  Y         p.880

          .         .         .         .         .         .       g.42066
 AGCGATGATGTCAAGGTGGAGTCCCGTTTTGATGAGCACCAGAGTAAGCCTGAGATCCTG       c.2700
 S  D  D  V  K  V  E  S  R  F  D  E  H  Q  S  K  P  E  I  L         p.900

          .         .         .         .         .         .       g.42126
 AATCTTGTGAAGAGAATGAAGATCAAGACGGGCAAAGCCCTCAACCTGGGCTACGCGCTG       c.2760
 N  L  V  K  R  M  K  I  K  T  G  K  A  L  N  L  G  Y  A  L         p.920

          .         .         .         .         .         .       g.42186
 GACTATGCACAGAGGTACATTTTTGTGAAGTCTGCTGGCAGCCGGATCGAGGATGGAGTG       c.2820
 D  Y  A  Q  R  Y  I  F  V  K  S  A  G  S  R  I  E  D  G  V         p.940

          .         .         .         .         .         .       g.42246
 CTTCAGTTCCTGGTGCTGCTGGTCGCAGGAAGGTCATCTGACCGTGTGGATGGGCCAGCA       c.2880
 L  Q  F  L  V  L  L  V  A  G  R  S  S  D  R  V  D  G  P  A         p.960

          .         .         .         .         .         .       g.42306
 AGTAACCTGAAGCAGAGTGGGGTTGTGCCTTTCATCTTCCAAGCCAAGAACGCAGACCCT       c.2940
 S  N  L  K  Q  S  G  V  V  P  F  I  F  Q  A  K  N  A  D  P         p.980

          .         .         .         .         .         .       g.42366
 GCTGAGTTAGAGCAGATCGTGCTGTCTCCAGCGTTTATCCTGGCTGCAGAGTCGCTTCCC       c.3000
 A  E  L  E  Q  I  V  L  S  P  A  F  I  L  A  A  E  S  L  P         p.1000

          .         .         .         .         .         .       g.42426
 AAGATTGGAGATCTTCATCCACAGATAGTGAATCTCTTAAAATCAGTGCACAACGGAGCA       c.3060
 K  I  G  D  L  H  P  Q  I  V  N  L  L  K  S  V  H  N  G  A         p.1020

          . | 8        .         .         .         .         .    g.44237
 CCAGCACCAG | TTTCAGGTGAAAAGGACGTGGTGTTTCTGCTTGATGGCTCTGAGGGCGTC    c.3120
 P  A  P  V |   S  G  E  K  D  V  V  F  L  L  D  G  S  E  G  V      p.1040

          .         .         .         .         .         .       g.44297
 AGGAGCGGCTTCCCTCTGTTGAAAGAGTTTGTCCAGAGAGTGGTGGAAAGCCTGGATGTG       c.3180
 R  S  G  F  P  L  L  K  E  F  V  Q  R  V  V  E  S  L  D  V         p.1060

          .         .         .         .         .         .       g.44357
 GGCCAGGACCGGGTCCGCGTGGCCGTGGTGCAGTACAGCGACCGGACCAGGCCCGAGTTC       c.3240
 G  Q  D  R  V  R  V  A  V  V  Q  Y  S  D  R  T  R  P  E  F         p.1080

          .         .         .         .         .         .       g.44417
 TACCTGAATTCATACATGAACAAGCAGGACGTCGTCAACGCTGTCCGCCAGCTGACCCTG       c.3300
 Y  L  N  S  Y  M  N  K  Q  D  V  V  N  A  V  R  Q  L  T  L         p.1100

          .         .         .         .         .         .       g.44477
 CTGGGAGGGCCGACCCCCAACACCGGGGCCGCCCTGGAGTTTGTCCTGAGGAACATCCTG       c.3360
 L  G  G  P  T  P  N  T  G  A  A  L  E  F  V  L  R  N  I  L         p.1120

          .         .         .         .         .         .       g.44537
 GTCAGCTCTGCGGGAAGCAGGATAACAGAAGGTGTGCCCCAGCTGCTGATCGTCCTCACG       c.3420
 V  S  S  A  G  S  R  I  T  E  G  V  P  Q  L  L  I  V  L  T         p.1140

          .         .         .         .         .         .       g.44597
 GCCGACAGGTCTGGGGATGATGTGCGGAACCCCTCCGTGGTCGTGAAGAGGGGTGGGGCT       c.3480
 A  D  R  S  G  D  D  V  R  N  P  S  V  V  V  K  R  G  G  A         p.1160

          .         .         .         .         .         .       g.44657
 GTGCCCATTGGCATTGGCATCGGGAACGCTGACATCACAGAGATGCAGACCATCTCCTTC       c.3540
 V  P  I  G  I  G  I  G  N  A  D  I  T  E  M  Q  T  I  S  F         p.1180

          .         .         .         .         .         .       g.44717
 ATCCCGGACTTTGCCGTGGCCATTCCCACCTTTCGCCAGCTGGGGACCGTCCAACAGGTC       c.3600
 I  P  D  F  A  V  A  I  P  T  F  R  Q  L  G  T  V  Q  Q  V         p.1200

          .         .         .         .         .         .       g.44777
 ATCTCTGAGAGGGTGACCCAGCTCACCCGCGAGGAGCTGAGCAGGCTGCAGCCGGTGTTG       c.3660
 I  S  E  R  V  T  Q  L  T  R  E  E  L  S  R  L  Q  P  V  L         p.1220

          .          | 9         .         .         .         .    g.46911
 CAGCCTCTACCGAGCCCAG | GTGTTGGTGGCAAGAGGGACGTGGTCTTTCTCATCGATGGG    c.3720
 Q  P  L  P  S  P  G |   V  G  G  K  R  D  V  V  F  L  I  D  G      p.1240

          .         .         .         .         .         .       g.46971
 TCCCAAAGTGCCGGGCCTGAGTTCCAGTACGTTCGCACCCTCATAGAGAGGCTGGTTGAC       c.3780
 S  Q  S  A  G  P  E  F  Q  Y  V  R  T  L  I  E  R  L  V  D         p.1260

          .         .         .         .         .         .       g.47031
 TACCTGGACGTGGGCTTTGACACCACCCGGGTGGCTGTCATCCAGTTCAGCGATGACCCC       c.3840
 Y  L  D  V  G  F  D  T  T  R  V  A  V  I  Q  F  S  D  D  P         p.1280

          .         .         .         .         .         .       g.47091
 AAGGTGGAGTTCCTGCTGAACGCCCATTCCAGCAAGGATGAAGTGCAGAACGCGGTGCAG       c.3900
 K  V  E  F  L  L  N  A  H  S  S  K  D  E  V  Q  N  A  V  Q         p.1300

          .         .         .         .         .         .       g.47151
 CGGCTGAGGCCCAAGGGAGGGCGGCAGATCAACGTGGGCAATGCCCTGGAGTACGTGTCC       c.3960
 R  L  R  P  K  G  G  R  Q  I  N  V  G  N  A  L  E  Y  V  S         p.1320

          .         .         .         .         .         .       g.47211
 AGGAACATCTTCAAGAGGCCCCTGGGGAGCCGCATTGAAGAGGGCGTCCCGCAGTTCCTG       c.4020
 R  N  I  F  K  R  P  L  G  S  R  I  E  E  G  V  P  Q  F  L         p.1340

          .         .         .         .         .         .       g.47271
 GTCCTCATCTCGTCTGGAAAGTCTGACGATGAGGTGGACGACCCGGCGGTGGAGCTCAAG       c.4080
 V  L  I  S  S  G  K  S  D  D  E  V  D  D  P  A  V  E  L  K         p.1360

          .         .         .         .         .         .       g.47331
 CAGTTTGGCGTGGCCCCTTTCACGATCGCCAGGAACGCAGACCAGGAGGAGCTGGTGAAG       c.4140
 Q  F  G  V  A  P  F  T  I  A  R  N  A  D  Q  E  E  L  V  K         p.1380

          .         .         .         .         .         .       g.47391
 ATCTCGCTGAGCCCCGAATATGTGTTCTCGGTGAGCACCTTCCGGGAGCTGCCCAGCCTG       c.4200
 I  S  L  S  P  E  Y  V  F  S  V  S  T  F  R  E  L  P  S  L         p.1400

          .         .         .         .         .         .       g.47451
 GAGCAGAAACTGCTGACGCCCATCACGACCCTGACCTCAGAGCAGATCCAGAAGCTCTTA       c.4260
 E  Q  K  L  L  T  P  I  T  T  L  T  S  E  Q  I  Q  K  L  L         p.1420

          .         .      | 10  .         .         .         .    g.50065
 GCCAGCACTCGCTATCCACCTCCAG | CAGTTGAGAGTGATGCTGCAGACATTGTCTTTCTG    c.4320
 A  S  T  R  Y  P  P  P  A |   V  E  S  D  A  A  D  I  V  F  L      p.1440

          .         .         .         .         .         .       g.50125
 ATCGACAGCTCTGAGGGAGTTAGGCCAGATGGCTTTGCACATATTCGAGATTTTGTTAGC       c.4380
 I  D  S  S  E  G  V  R  P  D  G  F  A  H  I  R  D  F  V  S         p.1460

          .         .         .         .         .         .       g.50185
 AGGATTGTTCGAAGACTCAACATCGGCCCCAGTAAAGTGAGAGTTGGGGTCGTGCAGTTC       c.4440
 R  I  V  R  R  L  N  I  G  P  S  K  V  R  V  G  V  V  Q  F         p.1480

          .         .         .         .         .         .       g.50245
 AGCAATGATGTCTTCCCAGAATTCTATCTGAAAACCTACAGATCCCAGGCCCCGGTGCTG       c.4500
 S  N  D  V  F  P  E  F  Y  L  K  T  Y  R  S  Q  A  P  V  L         p.1500

          .         .         .         .         .         .       g.50305
 GACGCCATACGGCGCCTGAGGCTCAGAGGGGGGTCCCCACTGAACACTGGCAAGGCTCTC       c.4560
 D  A  I  R  R  L  R  L  R  G  G  S  P  L  N  T  G  K  A  L         p.1520

          .         .         .         .         .         .       g.50365
 GAATTTGTGGCAAGAAACCTCTTTGTTAAGTCTGCGGGGAGTCGCATAGAAGACGGGGTG       c.4620
 E  F  V  A  R  N  L  F  V  K  S  A  G  S  R  I  E  D  G  V         p.1540

          .         .         .         .         .         .       g.50425
 CCCCAACACCTGGTCCTGGTCCTGGGTGGAAAATCCCAGGACGATGTGTCCAGGTTCGCC       c.4680
 P  Q  H  L  V  L  V  L  G  G  K  S  Q  D  D  V  S  R  F  A         p.1560

          .         .         .         .         .         .       g.50485
 CAGGTGATCCGTTCCTCGGGCATTGTGAGTTTAGGGGTAGGAGACCGGAACATCGACAGA       c.4740
 Q  V  I  R  S  S  G  I  V  S  L  G  V  G  D  R  N  I  D  R         p.1580

          .         .         .         .         .         .       g.50545
 ACAGAGCTGCAGACCATCACCAATGACCCCAGACTGGTCTTCACAGTGCGAGAGTTCAGA       c.4800
 T  E  L  Q  T  I  T  N  D  P  R  L  V  F  T  V  R  E  F  R         p.1600

          .         .         .         .         .         .       g.50605
 GAGCTTCCCAACATAGAAGAAAGAATCATGAACTCGTTTGGACCCTCCGCAGCCACTCCT       c.4860
 E  L  P  N  I  E  E  R  I  M  N  S  F  G  P  S  A  A  T  P         p.1620

          .         .         .         . | 11       .         .    g.51941
 GCACCTCCAGGGGTGGACACCCCTCCTCCTTCACGGCCAG | AGAAGAAGAAAGCAGACATT    c.4920
 A  P  P  G  V  D  T  P  P  P  S  R  P  E |   K  K  K  A  D  I      p.1640

          .         .         .         .         .         .       g.52001
 GTGTTCCTGTTGGATGGTTCCATCAACTTCAGGAGGGACAGTTTCCAGGAAGTGCTTCGT       c.4980
 V  F  L  L  D  G  S  I  N  F  R  R  D  S  F  Q  E  V  L  R         p.1660

          .         .         .         .         .         .       g.52061
 TTTGTGTCTGAAATAGTGGACACAGTTTATGAAGATGGCGACTCCATCCAAGTGGGGCTT       c.5040
 F  V  S  E  I  V  D  T  V  Y  E  D  G  D  S  I  Q  V  G  L         p.1680

          .         .         .         .         .         .       g.52121
 GTCCAGTACAACTCTGACCCCACTGACGAATTCTTCCTGAAGGACTTCTCTACCAAGAGG       c.5100
 V  Q  Y  N  S  D  P  T  D  E  F  F  L  K  D  F  S  T  K  R         p.1700

          .         .         .         .         .         .       g.52181
 CAGATTATTGACGCCATCAACAAAGTGGTCTACAAAGGGGGAAGACACGCCAACACTAAG       c.5160
 Q  I  I  D  A  I  N  K  V  V  Y  K  G  G  R  H  A  N  T  K         p.1720

          .         .         .         .         .         .       g.52241
 GTGGGCCTTGAGCACCTGCGGGTAAACCACTTTGTGCCTGAGGCAGGCAGCCGCCTGGAC       c.5220
 V  G  L  E  H  L  R  V  N  H  F  V  P  E  A  G  S  R  L  D         p.1740

          .         .         .         .         .         .       g.52301
 CAGCGGGTCCCTCAGATTGCCTTTGTGATCACGGGAGGAAAGTCGGTGGAAGATGCACAG       c.5280
 Q  R  V  P  Q  I  A  F  V  I  T  G  G  K  S  V  E  D  A  Q         p.1760

          .         .         .         .         .         .       g.52361
 GATGTGAGCCTGGCCCTCACCCAGAGGGGGGTCAAAGTGTTTGCTGTTGGAGTGAGGAAT       c.5340
 D  V  S  L  A  L  T  Q  R  G  V  K  V  F  A  V  G  V  R  N         p.1780

          .         .         .         .         .         .       g.52421
 ATCGACTCGGAGGAGGTTGGAAAGATAGCGTCCAACAGCGCCACAGCGTTCCGCGTGGGC       c.5400
 I  D  S  E  E  V  G  K  I  A  S  N  S  A  T  A  F  R  V  G         p.1800

          .         .         .         .         .         .       g.52481
 AACGTCCAGGAGCTGTCCGAACTGAGCGAGCAAGTTTTGGAAACTTTGCATGATGCGATG       c.5460
 N  V  Q  E  L  S  E  L  S  E  Q  V  L  E  T  L  H  D  A  M         p.1820

          .         .         .         . | 12       .         .    g.53192
 CATGAAACCCTTTGCCCTGGTGTAACTGATGCTGCCAAAG | CTTGTAATCTGGATGTGATT    c.5520
 H  E  T  L  C  P  G  V  T  D  A  A  K  A |   C  N  L  D  V  I      p.1840

          .         .         .         .         .         .       g.53252
 CTGGGGTTTGATGGTTCTAGAGACCAGAATGTTTTTGTGGCCCAGAAGGGCTTCGAGTCC       c.5580
 L  G  F  D  G  S  R  D  Q  N  V  F  V  A  Q  K  G  F  E  S         p.1860

          .         .         .         .         .         .       g.53312
 AAGGTGGACGCCATCTTGAACAGAATCAGCCAGATGCACAGGGTCAGCTGCAGCGGTGGC       c.5640
 K  V  D  A  I  L  N  R  I  S  Q  M  H  R  V  S  C  S  G  G         p.1880

          .         .         .         .         .         .       g.53372
 CGCTCGCCCACCGTGCGTGTGTCAGTGGTGGCCAACACGCCCTCGGGCCCGGTGGAGGCC       c.5700
 R  S  P  T  V  R  V  S  V  V  A  N  T  P  S  G  P  V  E  A         p.1900

          .         .         .         .         .         .       g.53432
 TTTGACTTTGACGAGTACCAGCCAGAGATGCTCGAGAAGTTCCGGAACATGCGCAGCCAG       c.5760
 F  D  F  D  E  Y  Q  P  E  M  L  E  K  F  R  N  M  R  S  Q         p.1920

          .         .         .         .         .         .       g.53492
 CACCCCTACGTCCTCACGGAGGACACCCTGAAGGTCTACCTGAACAAGTTCAGACAGTCC       c.5820
 H  P  Y  V  L  T  E  D  T  L  K  V  Y  L  N  K  F  R  Q  S         p.1940

          .         | 13         .         .         .         .    g.54821
 TCGCCGGACAGCGTGAAG | GTGGTCATTCATTTTACTGATGGAGCAGACGGAGATCTGGCT    c.5880
 S  P  D  S  V  K   | V  V  I  H  F  T  D  G  A  D  G  D  L  A      p.1960

          .         .         .        | 14.         .         .    g.55832
 GATTTACACAGAGCATCTGAGAACCTCCGCCAAGAAG | GAGTCCGTGCCTTGATCCTGGTG    c.5940
 D  L  H  R  A  S  E  N  L  R  Q  E  G |   V  R  A  L  I  L  V      p.1980

          .         .         .         .         .         .       g.55892
 GGCCTTGAACGAGTGGTCAACTTGGAGCGGCTAATGCATCTGGAGTTTGGGCGAGGGTTT       c.6000
 G  L  E  R  V  V  N  L  E  R  L  M  H  L  E  F  G  R  G  F         p.2000

          .         .         .         .         .         .       g.55952
 ATGTATGACAGGCCCCTGAGGCTTAACTTGCTGGACTTGGATTATGAACTAGCGGAGCAG       c.6060
 M  Y  D  R  P  L  R  L  N  L  L  D  L  D  Y  E  L  A  E  Q         p.2020

     | 15    .         .         .         .         .         .    g.57433
 CTT | GACAACATTGCCGAGAAAGCTTGCTGTGGGGTTCCCTGCAAGTGCTCTGGGCAGAGG    c.6120
 L   | D  N  I  A  E  K  A  C  C  G  V  P  C  K  C  S  G  Q  R      p.2040

          .         .         .       | 16 .         .         .    g.58057
 GGAGACCGCGGGCCCATCGGCAGCATCGGGCCAAAG | GGTATTCCTGGAGAAGACGGCTAC    c.6180
 G  D  R  G  P  I  G  S  I  G  P  K   | G  I  P  G  E  D  G  Y      p.2060

          .         .         . | 17       .         .         .    g.59078
 CGAGGCTATCCTGGTGATGAGGGTGGACCC | GGTGAGCGTGGTCCGCCTGGTGTGAACGGC    c.6240
 R  G  Y  P  G  D  E  G  G  P   | G  E  R  G  P  P  G  V  N  G      p.2080

          .         .         .         .   | 18     .         .    g.59837
 ACTCAAGGTTTCCAGGGCTGCCCGGGCCAGAGAGGAGTAAAG | GGCTCTCGGGGATTCCCA    c.6300
 T  Q  G  F  Q  G  C  P  G  Q  R  G  V  K   | G  S  R  G  F  P      p.2100

           | 19        .         .         .         .     | 20   . g.60125
 GGAGAGAAG | GGCGAAGTAGGAGAAATTGGACTGGATGGTCTGGATGGTGAAGAT | GGAGAC c.6360
 G  E  K   | G  E  V  G  E  I  G  L  D  G  L  D  G  E  D   | G  D   p.2120

          .         .         .         .         | 21         .    g.60636
 AAAGGATTGCCTGGTTCTTCTGGAGAGAAAGGGAATCCTGGAAGAAGG | GGTGATAAAGGA    c.6420
 K  G  L  P  G  S  S  G  E  K  G  N  P  G  R  R   | G  D  K  G      p.2140

          .         .         .         .         .  | 22      .    g.61334
 CCTCGAGGAGAGAAAGGAGAAAGAGGAGATGTTGGGATTCGAGGGGACCCG | GGTAACCCA    c.6480
 P  R  G  E  K  G  E  R  G  D  V  G  I  R  G  D  P   | G  N  P      p.2160

          .         .         .         .         .        | 23.    g.61819
 GGACAAGACAGCCAGGAGAGAGGACCCAAAGGAGAAACCGGTGACCTCGGCCCCATG | GGT    c.6540
 G  Q  D  S  Q  E  R  G  P  K  G  E  T  G  D  L  G  P  M   | G      p.2180

          .         .         .         .         .  | 24      .    g.64282
 GTCCCAGGGAGAGATGGAGTACCTGGAGGACCTGGAGAAACTGGGAAGAAT | GGTGGCTTT    c.6600
 V  P  G  R  D  G  V  P  G  G  P  G  E  T  G  K  N   | G  G  F      p.2200

          .         .        | 25.         .         .         .    g.65837
 GGCCGAAGGGGACCCCCCGGAGCTAAG | GGCAACAAGGGCGGTCCTGGCCAGCCGGGCTTT    c.6660
 G  R  R  G  P  P  G  A  K   | G  N  K  G  G  P  G  Q  P  G  F      p.2220

          .         .         . | 26       .         .         .    g.66653
 GAGGGAGAGCAGGGGACCAGAGGTGCACAG | GGCCCAGCTGGTCCTGCTGGTCCTCCAGGG    c.6720
 E  G  E  Q  G  T  R  G  A  Q   | G  P  A  G  P  A  G  P  P  G      p.2240

          .         .         .    | 27    .         .         .    g.68042
 CTGATAGGAGAACAAGGCATTTCTGGACCTCGG | GGAAGCGGAGGTGCCGCTGGTGCTCCT    c.6780
 L  I  G  E  Q  G  I  S  G  P  R   | G  S  G  G  A  A  G  A  P      p.2260

          .         .         .       | 28 .         .         .    g.69022
 GGAGAACGAGGCAGAACCGGTCCACTGGGAAGAAAG | GGTGAGCCCGGAGAGCCAGGACCA    c.6840
 G  E  R  G  R  T  G  P  L  G  R  K   | G  E  P  G  E  P  G  P      p.2280

          .         .         .          | 29        .         .    g.70565
 AAAGGAGGAATCGGGAACCGGGGCCCTCGTGGGGAGACG | GGAGATGACGGGAGAGACGGA    c.6900
 K  G  G  I  G  N  R  G  P  R  G  E  T   | G  D  D  G  R  D  G      p.2300

          .         .         . | 30       .         .         .    g.70853
 GTTGGCAGTGAAGGACGCAGAGGCAAAAAA | GGAGAAAGAGGATTCCCTGGATACCCAGGA    c.6960
 V  G  S  E  G  R  R  G  K  K   | G  E  R  G  F  P  G  Y  P  G      p.2320

        | 31 .         .         .         .         .         .    g.71392
 CCAAAG | GGTAACCCAGGTGAACCTGGGCTAAATGGAACAACAGGACCCAAAGGCATCAGA    c.7020
 P  K   | G  N  P  G  E  P  G  L  N  G  T  T  G  P  K  G  I  R      p.2340

           | 32        .         .         .         .         .    g.72693
 GGCCGAAGG | GGAAATTCGGGACCTCCAGGGATAGTTGGACAGAAGGGAGACCCTGGCTAC    c.7080
 G  R  R   | G  N  S  G  P  P  G  I  V  G  Q  K  G  D  P  G  Y      p.2360

          .   | 33     .         .         .      | 34  .         . g.74128
 CCAGGACCAGCT | GGTCCCAAGGGCAACAGGGGCGACTCCATCGAT | CAATGTGCCCTCATC c.7140
 P  G  P  A   | G  P  K  G  N  R  G  D  S  I  D   | Q  C  A  L  I   p.2380

          .         .   | 35     .     | 36   .         .         . g.74390
 CAAAGCATCAAAGATAAATGCC | CTTGCTGTTACG | GGCCCCTGGAGTGCCCCGTCTTCCCA c.7200
 Q  S  I  K  D  K  C  P |   C  C  Y  G |   P  L  E  C  P  V  F  P   p.2400

          .         .         .         .         .         .       g.74450
 ACAGAACTAGCCTTTGCTTTAGACACCTCTGAGGGAGTCAACCAAGACACTTTCGGCCGG       c.7260
 T  E  L  A  F  A  L  D  T  S  E  G  V  N  Q  D  T  F  G  R         p.2420

          .         .         .         .         .         .       g.74510
 ATGCGAGATGTGGTCTTGAGTATTGTGAATGACCTGACCATTGCTGAGAGCAACTGCCCA       c.7320
 M  R  D  V  V  L  S  I  V  N  D  L  T  I  A  E  S  N  C  P         p.2440

          .         .         .         .         .         .       g.74570
 CGGGGGGCCCGGGTGGCTGTGGTCACCTACAACAACGAGGTGACCACGGAGATCCGGTTT       c.7380
 R  G  A  R  V  A  V  V  T  Y  N  N  E  V  T  T  E  I  R  F         p.2460

          .         .         .         .         .         .       g.74630
 GCTGACTCCAAGAGGAAGTCGGTCCTCCTGGACAAGATTAAGAACCTTCAGGTGGCTCTG       c.7440
 A  D  S  K  R  K  S  V  L  L  D  K  I  K  N  L  Q  V  A  L         p.2480

          .         .         .         .         .         .       g.74690
 ACATCCAAACAGCAGAGTCTGGAGACTGCCATGTCGTTTGTGGCCAGGAACACATTTAAG       c.7500
 T  S  K  Q  Q  S  L  E  T  A  M  S  F  V  A  R  N  T  F  K         p.2500

          .         .         .         .         .         .       g.74750
 CGTGTGAGGAACGGATTCCTAATGAGGAAAGTGGCTGTTTTCTTCAGCAACACACCCACA       c.7560
 R  V  R  N  G  F  L  M  R  K  V  A  V  F  F  S  N  T  P  T         p.2520

          .         .         .         .         .         .       g.74810
 AGAGCATCCCCACAGCTCAGAGAGGCTGTGCTCAAGCTCTCAGATGCGGGGATCACCCCC       c.7620
 R  A  S  P  Q  L  R  E  A  V  L  K  L  S  D  A  G  I  T  P         p.2540

          .         .         .         .         | 37         .    g.77058
 TTGTTCCTTACAAGGCAGGAAGACCGGCAGCTCATCAACGCTTTGCAG | ATCAATAACACA    c.7680
 L  F  L  T  R  Q  E  D  R  Q  L  I  N  A  L  Q   | I  N  N  T      p.2560

          .         .         .         .         .         .       g.77118
 GCAGTGGGGCATGCGCTTGTCCTGCCTGCAGGGAGAGACCTCACAGACTTCCTGGAGAAT       c.7740
 A  V  G  H  A  L  V  L  P  A  G  R  D  L  T  D  F  L  E  N         p.2580

          .         .      | 38  .         .         .         .    g.78092
 GTCCTCACGTGTCATGTTTGCTTGG | ACATCTGCAACATCGACCCATCCTGTGGATTTGGC    c.7800
 V  L  T  C  H  V  C  L  D |   I  C  N  I  D  P  S  C  G  F  G      p.2600

          .         .         .         .         .         .       g.78152
 AGTTGGAGGCCTTCCTTCAGGGACAGGAGAGCGGCAGGGAGCGATGTGGACATCGACATG       c.7860
 S  W  R  P  S  F  R  D  R  R  A  A  G  S  D  V  D  I  D  M         p.2620

          .         .         .         .         .         .       g.78212
 GCTTTCATCTTAGACAGCGCTGAGACCACCACCCTGTTCCAGTTCAATGAGATGAAGAAG       c.7920
 A  F  I  L  D  S  A  E  T  T  T  L  F  Q  F  N  E  M  K  K         p.2640

          .         .         .         .         .         .       g.78272
 TACATAGCGTACCTGGTCAGACAACTGGACATGAGCCCAGATCCCAAGGCCTCCCAGCAC       c.7980
 Y  I  A  Y  L  V  R  Q  L  D  M  S  P  D  P  K  A  S  Q  H         p.2660

          .         .         .         .         .         .       g.78332
 TTCGCCAGAGTGGCAGTTGTGCAGCACGCGCCCTCTGAGTCCGTGGACAATGCCAGCATG       c.8040
 F  A  R  V  A  V  V  Q  H  A  P  S  E  S  V  D  N  A  S  M         p.2680

          .         .         .         .         .         .       g.78392
 CCACCTGTGAAGGTGGAATTCTCCCTGACTGACTATGGCTCCAAGGAGAAGCTGGTGGAC       c.8100
 P  P  V  K  V  E  F  S  L  T  D  Y  G  S  K  E  K  L  V  D         p.2700

          .         .         .         .         .         .       g.78452
 TTCCTCAGCAGGGGAATGACACAGTTGCAGGGAACCAGGGCCTTAGGCAGTGCCATTGAA       c.8160
 F  L  S  R  G  M  T  Q  L  Q  G  T  R  A  L  G  S  A  I  E         p.2720

          .         .         .         .         .         .       g.78512
 TACACCATAGAGAATGTCTTTGAAAGTGCCCCAAACCCACGGGACCTGAAAATTGTGGTC       c.8220
 Y  T  I  E  N  V  F  E  S  A  P  N  P  R  D  L  K  I  V  V         p.2740

          .         .         .         .         .         .       g.78572
 CTGATGCTGACGGGCGAGGTGCCGGAGCAGCAGCTGGAGGAGGCCCAGAGAGTCATCCTG       c.8280
 L  M  L  T  G  E  V  P  E  Q  Q  L  E  E  A  Q  R  V  I  L         p.2760

          .         .         .         .         .         .       g.78632
 CAGGCCAAATGCAAGGGCTACTTCTTCGTGGTCCTGGGCATTGGCAGGAAGGTGAACATC       c.8340
 Q  A  K  C  K  G  Y  F  F  V  V  L  G  I  G  R  K  V  N  I         p.2780

          .         .         .         .         .         .       g.78692
 AAGGAGGTATACACCTTCGCCAGTGAGCCAAACGACGTCTTCTTCAAATTAGTGGACAAG       c.8400
 K  E  V  Y  T  F  A  S  E  P  N  D  V  F  F  K  L  V  D  K         p.2800

          .         .         .         .         .         .       g.78752
 TCCACCGAGCTCAACGAGGAGCCTTTGATGCGCTTCGGGAGGCTGTTGCCATCCTTCGTC       c.8460
 S  T  E  L  N  E  E  P  L  M  R  F  G  R  L  L  P  S  F  V         p.2820

      | 39   .         .         .         .         .         .    g.80146
 AGCA | GTGAAAATGCTTTTTACTTGTCCCCAGATATCAGGAAACAGTGTGATTGGTTCCAA    c.8520
 S  S |   E  N  A  F  Y  L  S  P  D  I  R  K  Q  C  D  W  F  Q      p.2840

          .         .         .         .        | 40.         .    g.82688
 GGGGACCAACCCACAAAGAACCTTGTGAAGTTTGGTCACAAACAAGT | AAATGTTCCGAAT    c.8580
 G  D  Q  P  T  K  N  L  V  K  F  G  H  K  Q  V  |  N  V  P  N      p.2860

          .         .         .         .         .         .       g.82748
 AACGTTACTTCAAGTCCTACATCCAACCCAGTGACGACAACGAAGCCGGTGACTACGACG       c.8640
 N  V  T  S  S  P  T  S  N  P  V  T  T  T  K  P  V  T  T  T         p.2880

          .         .         .         .         .         .       g.82808
 AAGCCGGTGACCACCACAACAAAGCCTGTAACCACCACAACAAAGCCTGTGACTATTATA       c.8700
 K  P  V  T  T  T  T  K  P  V  T  T  T  T  K  P  V  T  I  I         p.2900

          .         .         .         .         .         .       g.82868
 AATCAGCCATCTGTGAAGCCAGCCGCTGCAAAGCCGGCCCCTGCGAAACCTGTGGCTGCC       c.8760
 N  Q  P  S  V  K  P  A  A  A  K  P  A  P  A  K  P  V  A  A         p.2920

          .         .         .         .         .         .       g.82928
 AAGCCTGTGGCCACAAAGATGGCCACTGTTAGACCCCCAGTGGCGGTGAAGCCAGCAACG       c.8820
 K  P  V  A  T  K  M  A  T  V  R  P  P  V  A  V  K  P  A  T         p.2940

          .         .         .         .         .         .       g.82988
 GCAGCGAAGCCTGTAGCAGCAAAGCCAGCAGCTGTAAGACCCCCCGCTGCTGCTGCTGCA       c.8880
 A  A  K  P  V  A  A  K  P  A  A  V  R  P  P  A  A  A  A  A         p.2960

          .         .         .         .         .         .       g.83048
 AAACCAGTGGCGACCAAGCCTGAGGTCCCTAGGCCACAGGCAGCCAAACCAGCTGCCACC       c.8940
 K  P  V  A  T  K  P  E  V  P  R  P  Q  A  A  K  P  A  A  T         p.2980

          .         .      | 41  .         .         .         .    g.84353
 AAGCCAGCCACCACTAAGCCCATGG | TTAAGATGTCCCGTGAAGTCCAGGTGTTTGAGATA    c.9000
 K  P  A  T  T  K  P  M  V |   K  M  S  R  E  V  Q  V  F  E  I      p.3000

          .         .         .         .         .         .       g.84413
 ACAGAGAACAGCGCCAAACTCCACTGGGAGAGGGCTGAGCCCCCCGGTCCTTATTTTTAT       c.9060
 T  E  N  S  A  K  L  H  W  E  R  A  E  P  P  G  P  Y  F  Y         p.3020

          .         .         .         .         .         .       g.84473
 GACCTCACCGTCACCTCAGCCCATGATCAGTCCCTGGTTCTGAAGCAGAACCTCACGGTC       c.9120
 D  L  T  V  T  S  A  H  D  Q  S  L  V  L  K  Q  N  L  T  V         p.3040

          .         .         .         .         .         .       g.84533
 ACGGACCGCGTCATTGGAGGCCTGCTCGCTGGGCAGACATACCATGTGGCTGTGGTCTGC       c.9180
 T  D  R  V  I  G  G  L  L  A  G  Q  T  Y  H  V  A  V  V  C         p.3060

          .         .         .         .          | 42        .    g.85670
 TACCTGAGGTCTCAGGTCAGAGCCACCTACCACGGAAGTTTCAGTACAA | AGAAATCTCAG    c.9240
 Y  L  R  S  Q  V  R  A  T  Y  H  G  S  F  S  T  K |   K  S  Q      p.3080

          .         .         .         .         .         .       g.85730
 CCCCCACCTCCACAGCCAGCAAGGTCAGCTTCTAGTTCAACCATCAATCTAATGGTGAGC       c.9300
 P  P  P  P  Q  P  A  R  S  A  S  S  S  T  I  N  L  M  V  S         p.3100

          .         .         | 43         .         .         .    g.93515
 ACAGAACCATTGGCTCTCACTGAAACAG | ATATATGCAAGTTGCCGAAAGACGAAGGAACT    c.9360
 T  E  P  L  A  L  T  E  T  D |   I  C  K  L  P  K  D  E  G  T      p.3120

          .         .         .         .         .         .       g.93575
 TGCAGGGATTTCATATTAAAATGGTACTATGATCCAAACACCAAAAGCTGTGCAAGATTC       c.9420
 C  R  D  F  I  L  K  W  Y  Y  D  P  N  T  K  S  C  A  R  F         p.3140

          .         .         .         .         .         .       g.93635
 TGGTATGGAGGTTGTGGTGGAAACGAAAACAAATTTGGATCACAGAAAGAATGTGAAAAG       c.9480
 W  Y  G  G  C  G  G  N  E  N  K  F  G  S  Q  K  E  C  E  K         p.3160

          .    | 44    .         .         .         .              g.94440
 GTTTGCGCTCCTG | TGCTCGCCAAACCCGGAGTCATCAGTGTGATGGGAACCTAA          c.9534
 V  C  A  P  V |   L  A  K  P  G  V  I  S  V  M  G  T  X            p.3177

          .         .         .         .         .         .       g.94500
 gcgtgggtggccaacatcatatacctcttgaagaagaaggagtcagccatcgccaacttg       c.*60

          .         .         .         .         .         .       g.94560
 tctctgtagaagctccgggtgtagattcccttgcactgtatcatttcatgctttgattta       c.*120

          .         .         .         .         .         .       g.94620
 cactcgaactcgggagggaacatcctgctgcatgacctatcagtatggtgctaatgtgtc       c.*180

          .         .         .         .         .         .       g.94680
 tgtggaccctcgctctctgtctccaggcagttctctcgaatactttgaatgttgtgtaac       c.*240

          .         .         .         .         .         .       g.94740
 agttagccactgctggtgtttatgtgaacattcctatcaatccaaattccctctggagtt       c.*300

          .         .         .         .         .         .       g.94800
 tcatgttatgcctgttgcaggcaaatgtaaagtctagaaaataatgcaaatgtcacggct       c.*360

          .         .         .         .         .         .       g.94860
 actctatatacttttgcttggttcattttttttcccttttagttaagcatgactttagat       c.*420

          .         .         .         .         .         .       g.94920
 gggaagcctgtgtatcgtggagaaacaagagaccaactttttcattccctgcccccaatt       c.*480

          .         .         .         .         .         .       g.94980
 tcccagactagatttcaagctaattttctttttctgaagcctctaacaaatgatctagtt       c.*540

          .         .         .         .         .         .       g.95040
 cagaaggaagcaaaatcccttaatctatgtgcaccgttgggaccaatgccttaattaaag       c.*600

          .         .         .         .         .         .       g.95100
 aatttaaaaaagttgtaatagagaatatttttggcattcctctaatgttgtgtgtttttt       c.*660

          .         .         .         .         .         .       g.95160
 ttttgtgtgtgctggagggaggggatttaattttaattttaaaatgtttaggaaatttat       c.*720

          .         .         .         .                           g.95202
 acaaagaaactttttaataaagtatattgaaagtttcctggg                         c.*762

 (downstream sequence)

Legend:
Nucleotide numbering follows the HGVS rules for a 'Coding DNA Reference Sequence' and is shown at the right of the sequence (c. numbers), counting the A of the ATG translation initiation codon as 1. Nucleotides 5' of the ATG are numbered negatively (c.-1 is the nucleotide immediately upstream of the A), and nucleotides 3' of the stop codon are marked with an asterisk (c.*1 is the first nucleotide after the stop codon). The g. numbers give the corresponding position in the genomic reference sequence. Every 10th nucleotide is indicated by a "." above the sequence. The collagen type VI alpha 3 protein sequence is shown below the coding DNA sequence, with numbering (p. numbers) at the right starting at 1 for the translation-initiating methionine. Every 10th amino acid is shown in bold. The position of each intron is indicated by a vertical line separating the two flanking exons. The start of the first exon (transcription initiation site) is indicated by a '\', and the end of the last exon (poly-A addition site) by a '/'. The exon number is shown above the first nucleotide(s) of the exon. To aid the description of frameshift mutations, all stop codons in the +1 frame are shown in bold, and all stop codons in the +2 frame are underlined.
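The numbering rules in the legend can be illustrated with a minimal Python sketch. It is not part of LOVD; the function names cdna_label and stops_in_frame, and the choice of 0-based string indices as input, are assumptions made only for this illustration. The first function maps a position in a transcript string to its c. label (negative numbers 5' of the ATG, an asterisk 3' of the stop codon, no position 0). The second lists the c. positions at which a stop codon begins when the coding sequence is read in the +1 or +2 frame.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cdna_label(index, atg_index, cds_length):
    """Return the HGVS-style c. label for a 0-based index in a transcript.

    atg_index  -- 0-based index of the A of the ATG start codon (assumed input)
    cds_length -- length of the coding sequence, including the stop codon
    """
    offset = index - atg_index
    if offset < 0:
        # 5' UTR: the base just before the A of the ATG is c.-1
        return f"c.{offset}"
    if offset < cds_length:
        # Coding sequence: the A of the ATG is c.1 (there is no c.0)
        return f"c.{offset + 1}"
    # 3' UTR: the first base after the stop codon is c.*1
    return f"c.*{offset - cds_length + 1}"


def stops_in_frame(cds, shift):
    """List the 1-based c. positions where a stop codon starts in frame +shift.

    shift = 1 or 2 reads the coding sequence starting one or two
    nucleotides downstream of the A of the ATG.
    """
    cds = cds.upper()
    return [
        i + 1
        for i in range(shift, len(cds) - 2, 3)
        if cds[i:i + 3] in STOP_CODONS
    ]


if __name__ == "__main__":
    # Toy transcript: 2 nt of 5' UTR, a 9 nt coding sequence, 3 nt of 3' UTR
    transcript = "aa" + "ATGAGGTAA" + "gcg"
    for i in (0, 1, 2, 10, 11):
        print(i, transcript[i], cdna_label(i, atg_index=2, cds_length=9))

    # First 60 coding nucleotides of COL6A3 as listed in this file (c.1 to c.60)
    first_row = "ATGAGGAAACATCGGCACTTGCCCTTAGTGGCCGTCTTTTGCCTCTTTCTCTCAGGCTTT"
    print("+1 frame stops:", stops_in_frame(first_row, 1))
    print("+2 frame stops:", stops_in_frame(first_row, 2))
```

For the full COL6A3 sequence, cds_length would be 9534, the last c. number of the coding region, which includes the TAA stop codon.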

Powered by LOVD v.2.0 Build 25
©2004-2010 Leiden University Medical Center