( Elementary genetic overlappings
Preprint, Inst. Appl. Math., the Russian Academy of Science)

..
(N.N.Kozlov)

. ..

, 2004
, ( 04-01-00320, 02-07-90027), ( 37.011.11.0012), (-2003.2003.1)

, . , , . , . , , . . . .

Abstract

The set of elements produced by a genetic code is examined. The set is calculated for unusual ways of record of the genetic information named as overlapped genes, when the same site of DNA can code two protein sequences. The concept of elementary overlapping is entered which corresponds to overlapping for single amino acids. Five sets of elementary overlappings are constructed, each of which corresponds to one of five ways of the genetic overlappings which have been found out experimentally. The structure of sets of elementary overlappings and their properties is analyzed. The detailed and compressed representation of sets of elementary overlappings is given. The questions of application of sets are examined.

 

[1]. . - , 4- : A,T,C,G - , , . , , - G. , : AAA,AAT,AAC,...GGG, . 64 . , 61 20- - , TAA,TAG,TGA . , , , 6-. . (. . 1 [1]) 20- . ( ) . 1. - ( - ), , . 13 ( 1979.) , , , . 10 . [2].

, , , , , +1 -1 . . . , , , . , ( ). , - : TAA, TAG, TGA ( , ).

1 2 3 80

Met Met Met ... Arg

W1(80) Tyr His Asn Ser

TATG CATG AATG TCGN

 

 

1 2 3 80

 

Met Met Trp ... Arg

W2(80) Trp Cys Gly Gly

ATGG ATGY TGGN ZGGN

 

 

1 2 3 35

 

Met Met Trp Arg

W3(35) ATG ATG TGG ... AGX

GTA MTA YAC NTC

Met Ile His Leu

 

 

1 2 3 52

 

Met Trp Phe Arg

W4(52) ATG TGG TTT ... CGC

TAC ACC AAA GCG

His Pro Lys Ala

 

 

1 2 3 196

 

Met Met Met Arg

W5(196) ATG ATG ATG ... AGG

ACC ACA ACG CCT

Pro Thr Ala Ser

 

 

. 1. W1-W5.

4- .. .

 

- () , ( ). . , . 1976 - 174 [3]. 5386 . , , GSHV [4] 1300 , 1704 ( - 3311 ).

, . 2: +1 -1 . , , + - . , ( ) + - . 3: +1, 0 -1 - + . , 5. . 1 [1].

 

2. .

[1, 2] - .. - 5: W1-W5, 5- . .. - . : 1, 2 3 ( ) , .. : 3 4 2 . .1 ..: 4 .. W1-W5. .. . W1 .. Met - Tyr, , -1 . W2 .. Met Trp, TG, +1 . W1 W2 .. W3-W5 .

.. + , - . W3 .. ATG ( Met ) Met , ATG.

 

1

(..)

20- W1-W5

 

 

 

 

W1

W2

W3

W4

W5

å

1

Met

4

2

2

1

4

13

2

Trp

3

1

2

1

4

11

3

Phe

4

3

2

2

5

16

4

Tyr

3

3

1

2

8

17

5

His

4

3

2

2

8

19

6

Asn

3

3

2

2

8

18

7

Asp

2

3

1

2

8

16

8

Cys

3

2

2

2

8

17

9

Gln

4

4

2

1

8

19

10

Lys

3

4

2

2

8

19

11

Glu

2

4

1

2

8

17

12

Ile

4

4

2

3

8

21

13

Val

4

6

1

4

12

27

14

Pro

4

5

1

3

13

26

15

Thr

4

5

1

4

13

27

16

Ala

4

5

1

4

13

27

17

Gly

3

5

1

4

15

28

18

Ser

7

7

3

4

18

39

19

Leu

8

6

4

3

12

33

20

Arg

7

5

2

4

15

33

 

å

 

 

80

 

80

 

35

 

52

 

196

 

443

 

 

4

4

1.75

2.6

9.8

~22

 

2

1

1

1

4

11

 

8

7

4

4

18

39

 

, , -1 . + - : . W4 .. Met His(). . 3 : ,,GC. W5 .. Met Pro(CCA). +1 , 2 ,GC.

.. : W1 W2 80, W3 - 35, W4 - 52, W5 - 196. , . N:A,C,T,G; Y:T,C; X:A,G; Z:A,C; I:T,G. 1: Met, Trp, Phe, Arg. , W5 .. 5 - 196. : Gly, Ser, Arg. Gly .. 144.1 144.2 , : Gly Ser . Ser: .. Ser TCN, - AGY. .. : 156.1 156.2, 157.1 157.2, 164.1 164.2, Ser, Arg: CGN AGX. Ser Arg .. 188.1 188.2. 3 .. 5- ( ) .. , .., .

 

3. .

W1-W5 .. , . W1-W5 20, .. .. . 100 - - 1.

W1-W5. , W4, , , . , : Trp, Gln, Glu, .. 17 . : Pro, Gly, Leu - W1, W2, W5 : Phe, Lys W1, W2, Tyr, Ile, Ala, Arg W3, W5. : His, Asn, Asp, Cys, Val, Thr, Ser W5, Met W3.

.. . 160 283, - 80 283 .. .. 3,5 . . , W1 W2 ; . , .. 1 W1 .. 7 W2. , W3-W5 : .. - .., . W3-W5 .. , .. W1, W2. , 2 W1 W2? , ( W1-W5): .. .. , . , . . .. W1 [6]. .. W2 [7]: , 5- : MetMet, MenAsn, MetLys, MetIle, MetThr. .. W1-W5 [1]. , .., . , , .. , , .. . , .. .

1, .. , - , - ( W1-W5 ). , , , , .. 20- .. - 5- . , - 100 ; , [1]. W2, W3, W4, W1 , W5 - . 12: W2, 3 W4 8 W3. Ser (W2, W5), Leu (W1, W3). W4 6- : Arg, Ser, Gly, Ala, Thr, Val. 100 Ser (W5) 18. , , .. 20- . .. Ser , Phe Ile.

.. W1-W5 443. - 22, Trp, 11, - Ser, - 39. .. . 2 .., - .. () 400. , 443 .. .., . 16- 12, 14, 75, 113. 182 1-5, .. W1-W5. , 400 295 (113+182) - .., 105 . .. W1-W5 . 10 Trp, - 0 Ser, 5- (105/20). 15 .. W1-W5. Ser - 20 Ser .. 20- , 5 .

 

 

 

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

Met

1

3

1

 

2

24

2

2

1

 

 

 

3

 

5

5

5

 

5

 

 

Trp

2

2

 

 

 

3

 

 

 

3

 

 

 

2

45

5

5

1

5

2

 

Phe

3

 

 

12

 

 

3

 

 

5

39

45

2

2

 

 

 

5

1

12

5

Tyr

4

1

 

 

35

5

5

5

5

 

 

 

16

24

 

1

 

5

5

2

5

His

5

14

3

 

5

5

5

5

35

 

 

 

1

4

2

12

2

5

25

 

5

Asn

6

1

 

3

5

5

5

5

5

2

2

2

14

4

 

1

 

5

5

3

5

Asp

7

1

 

 

5

5

5

5

5

 

 

 

14

4

 

1

 

25

35

 

25

Cys

8

2

 

 

5

35

5

5

5

3

 

 

 

12

 

4

14

5

5

2

5

Gln

9

 

3

5

 

 

1

 

3

 

1

 

5

5

25

25

25

 

17

45

1

Lys

10

 

 

39

 

 

1

 

 

2

12

2

5

5

5

5

5

 

15

39

1

Glu

11

 

 

45

 

 

1

 

 

 

1

 

5

5

5

5

5

2

18

45

12

Ile

12

3

 

1

16

2

24

24

 

5

5

5

35

5

 

 

 

5

1

15

5

Val

13

 

1

1

14

4

4

4

12

5

5

5

5

5

5

35

5

25

17

15

25

Pro

14

5

45

 

 

1

 

 

 

15

5

5

 

5

17

25

25

39

25

15

19

Thr

15

5

5

 

2

12

2

2

4

15

5

5

 

35

15

5

5

45

45

15

19

Ala

16

5

5

 

 

1

 

 

24

15

5

5

 

5

15

5

35

29

29

15

75

Gly

17

 

2

5

5

5

5

15

5

 

 

1

5

15

39

45

45

17

45

5

25

Ser

18

5

5

2

5

15

5

35

5

17

25

28

2

17

15

45

19

45

5

65

19

Leu

19

 

1

12

1

 

3

 

1

45

39

45

25

25

25

25

25

5

65

17

35

Arg

20

 

 

5

5

5

5

15

5

2

2

12

5

15

29

29

75

15

29

35

35

 

. 2

443 .., W1-W5.


, 2- .. 4 , 4 ..: 65 - .. 1, 2, 3, 5 75 - .. 1, 2, 4, 5. , 82 .. ; .. 7 : 10 12(12 - .. W1, W2), 5 14, 19 15, 5 24, 18 25, 12 35, 13 45. , .. . W, .. . 27, .. 7 : 2 16 (1, 2, 4), 7 17 (1, 2,5), 1 18 (1, 3,5), 5 .. 19 (1, 4,5), 1 28 (2, 3, 5), 5 29 (2, 4, 5) 6 39 (3, 4, 5). . 2 , .. ( ). 25 .. 4-8 (Tyr-Cys) .. 13-20 (Val-Arg). 64 .. 4- 6- ; . .. (Tyr, His, Cys) 9- , .

 

4. .

 

1.     W1-W5 [1]. , 5- . , 16 , .


2.     [1] . , 100 W1-W5. , 12 (.. 28 ). , .


3.     W1-W5 [2]. , . W1-W5. , , [2].


4.     .. Leu, Ser, Arg [7]. . .. 12 (. . 1 [7]). .. , (. . 2 [7]).


.. .. .

 

 

 

 

1.     ... . . 2002. . 382. 5.

. 593-597.

2.     ... . . 2004.

T. 396. 6. . 740-745.

3.     Sanger F., Coulson A.R., Friedmann et al. // J. Mol. Biol. 1978. V. 125.

P. 225-246.

4.     Seeger C., Ganem D., Varmus H.E. // J.Virol. 1984. V. 51. P. 367-375.

5.     ... . . 1999. . 369. 4. . 553-556.

6.     ... . .

2000. . 373. 1. . 108-111.

7.     ... . . 1997.

. 355. 6. C. 830-833.

 

W1

 

 

1 Met Met Met Met

Met Tyr His Asn Asp

(4) TATG CATG AATG GATG

1 2 3 4

 

2 Trp Trp Trp

Trp Met Val Leu

(3) ATGG GTGG YTGG

5 6 7

 

3 Phe Phe Phe Phe

Phe Phe Ile Val Leu

(4) TTTY ATTY GTTY CTTY

8 9 10 11

 

4 Tyr Tyr Tyr

Tyr Ile Val Leu

(3) ATAY GTAY YTAY

12 13 14

 

5 His His His His

His Pro Thr Ala Ser

(4) CCAY ACAY GCAY TCAY

15 16 17 18

 

6 Asn Asn Asn

Asn Gln Lys Glu

(3) CAAY AAAY GAAY

19 20 21

 

7 Asp Asp

Asp Gly Arg

(2) GGAY ZGAY

22 23

 

8 Cys Cys Cys

Cys Met Val Leu

(3) ATGY GTGY YTGY

24 25 26

 

 

9 Gln Gln Gln Gln

Gln Pro Thr Ala Ser

(4) CCAX ACAX GCAX TCAX

27 28 29 30

 

 

10 Lys Lys Lys

Lys Gln Lys Glu

(3) CAAX AAAX GAAX

31 32 33

 

11 Glu Glu

Glu Gly Arg

(2) GGAX ZGAX

34 35

 

12 Ile Ile Ile Ile

Ile Tyr His Asn Asp

(4) TATM CATM AATM GATM

36 37 38 39

 

13 Val Val Val Val

Val Cys Gly Ser Arg

(4) TGTN GGTN AGTN CGTN

40 41 42 43

 

14 Pro Pro Pro Pro

Pro Pro Thr Ala Ser

(4) CCCN ACCN GCCN TCCN

44 45 46 47

 

15 Thr Thr Thr Thr

Thr Tyr His Asn Asp

(4) TACN CACN AACN GACN

48 49 50 51

 

16 Ala Ala Ala Ala

Ala Cys Gly Ser Arg

(4) TGCN GGCN AGCN CGCN

52 53 54 55

 

17 Gly Gly Gly

Gly Trp Gly Arg

(3) TGGN GGGN ZGGN

56 57 58

 

18 Ser Ser Ser Ser Ser Ser Ser

Ser Phe Gln Lys Glu Ile Val Leu

(7) TTCN CAGY AAGY GAGY ATCN GTCN CTCN

59 60 61 62 63 64 65

 

19 Leu Leu Leu Leu

Leu Phe Ile Val Pro

(8) TTTX ATTX GTTX CCTN

66 67 68 69

 

Leu Leu Leu Leu

Thr Ala Ser Leu

ACTN GCTN TCTN CTTX

70 71 72 73

 

 

20 Arg Arg Arg Arg Arg Arg Arg

Arg Gln Lys Glu Pro Thr Ala Ser

(7) CAGX AAGX GAGX CCGN ACGN GCGN TCGN

74 75 76 77 78 79 80

 

 

 

W2

 

1 Met Met

Met Trp Cys

(2) ATGG ATGY

1 2

 

2 Trp

Trp Gly

(1) TGGN

3

 

3 Phe Phe Phe

Phe Phe Ser Leu

(3) TTTY TTCN TTTX

4 5 6

 

4 Tyr Tyr Tyr

Tyr Met Ile Thr

(3) TATG TATM TACN

7 8 9

 

5 His His His

His Met Ile Thr

(3) CATG CATM CACN

10 11 12

 

6 Asn Asn Asn

Asn Met Ile Thr

(3) AATG AATM AACN

13 14 15

 

7 Asp Asp Asp

Asp Met Ile Thr

(3) GATG GATM GACN

16 17 18

 

8 Cys Cys

Cys Val Ala

(2) TGTN TGCN

19                        20

 

 

9 Gln Gln Gln Gln

Gln Asn Lys Ser Arg

(4) CAAY CAAX CAGY CAGX

21 22 23 24

10 Lys Lys Lys Lys

Lys Asn Lys Ser Arg

(4) AAAY AAAX AAGY AAGX

25 26 27 28

 

11 Glu Glu Glu Glu

Glu Asn Lys Ser Arg

(4) GAAY GAAX GAGY GAGX

29 30 31 32

 

12 Ile Ile Ile Ile

Ile Phe Tyr Ser Leu

(4) ATTY ATAY ATCN ATTX

33 34 35 36

 

13 Val Val Val Val Val Val

Val Trp Phe Tyr Cys Ser Leu

(6) GTGG GTTY GTAY GTGY GTCN GTTX

37 38 39 40 41 42

 

14 Pro Pro Pro Pro Pro

Pro His Gln Pro Leu Arg

(5) CCAY CCAX CCCN CCTN CCGN

43 44 45 46 47

 

15 Thr Thr Thr Thr Thr

Thr His Gln Pro Leu Arg

(5) ACAY ACAX ACCN ACTN ACGN

48 49 50 51 52

 

16 Ala Ala Ala Ala Ala

Ala His Gln Pro Leu Arg

(5) GCAY GCAX GCCN GCTN GCGN

53 54 55 56 57

 

17 Gly Gly Gly Gly Gly

Gly Asp Glu Val Ala Gly

(5) GGAY GGAX GGTN GGCN GGGN

58 59 60 61 62

 

18 Ser Ser Ser Ser

Ser His Gln Val Pro

(7) TCAY TCAX AGTN TCCN

63 64 65 66

 

Ser Ser Ser

Ala Leu Arg

AGCN TCTN TCGN

67 68 69

 

 

19 Leu Leu Leu Leu Leu Leu

Leu Trp Phe Tyr Cys Ser Leu

(6) YTGG CTTY YTAY YTGY CTCN CTTX

70 71 72 73 74 75

 

20 Arg Arg Arg Arg Arg

Arg Asp Glu Val Ala Gly

(5) ZGAY ZGAX CGTN CGCN ZGGN

76 77 78 79 80

 

 

 

 

W3

 

1 Met Met 2 Trp Trp

Met ATG ATG Trp TGG TGG

(2)         GTA MTA (2) YAC XAC

Met Ile His Gln

1 2 3 4

 

 

3 Phe Phe 4 Tyr

Phe TTY TTY Tyr TAY

(2)             YAA XAA (1) YAT

Asn Lys Tyr

5 6 7

 

 

 

5 His His 6 Asn Asn

His CAY CAY Asn AAY AAY

(2)             GGT YGT (2) YTT XTT

Trp Cys Phe Leu

8 9 10 11

 

 

7 Asp 8 Cys Cys

Asp GAY Cys TGY TGY

(1)             NCT (2) YAC XAC

Ser His Gln

12 13 14

 

 

9 Gln Gln 10 Lys Lys

Gln CAX CAX Lys AAX AAX

(2)             GGT YGT (2) YTT XTT

Trp Cys Phe Leu

15 16 17 18

 

 

11 Glu 12 Ile Ile

Glu GAX Ile ATM ATM

(1)             NCT (2) GTA MTA

Ser Met Ile

19 20 21

 

13 Val 14 Pro 15 Thr

Val GTN Pro CCN Thr ACN

(1)             NCA (1) NGG (1) NTG

Thr Gly Val

22 23 24

 

 

16 Ala 17 Gly

Ala GCN Gly GGN

(1)             NCG (1) NCC

Ala Pro

25 26

 

 

18 Ser Ser Ser

Ser TCN TCN AGY

(3) YAG XAG NTC

Asp Glu Leu

27 28 29

 

 

 

 

19 Leu Leu Leu Leu

Leu CTN CTN TTX TTX

(4) YGA XGA YAA XAA

Ser Arg Asn Lys

30 31 32 33

 

 

20 Arg Arg

Arg CGN AGX

(2)             NGC NTC

Arg Leu

34 35

 

 

 

 

 

W4

 

 

1 Met 2 Trp

Met ATG Trp TGG

(1) TAC (1) ACC

His Pro

1 2

 

3 Phe Phe 4 Tyr Tyr

Phe TTT TTC Tyr TAT TAC

(2)         AAA AAG (2) ATA ATG

Lys Glu Ile Val

3 4 5 6

 

5 His His 6 Asn Asn

His CAC CAT Asn AAT AAC

(2)                   GTG GTA (2) TTA TTG

Val Met Ile Val

7 8 9 10

 

7 Asp Asp 8 Cys Cys

Asp GAT GAC Cys TGT TGC

(2)                   CTA CTG (2) ACA ACG

Ile Val Thr Ala

11 12 13 14

 

9 Gln 10 Lys Lys

Gln CAX Lys AAA AAG

(1)                   GTY (2) TTT TTC

Leu Phe Leu

15 16 17

 

 

11 Glu Glu 12 Ile Ile Ile

Glu GAA GAG Ile ATA ATC ATT

(2)                   CTT CTC (3) TAT TAG TAA

Phe Leu Tyr Asp Asn

18 19 20 21 22

 

13 Val Val Val Val

Val GTT GTC GTA GTG

(4) CAA CAG CAT CAC

Asn Asp Tyr His

23 24 25 26

 

14 Pro Pro Pro

Pro CCC CCA CCI

(3) GGG GGT GGZ

Gly Trp Arg

27 28 29

 

15 Thr Thr Thr Thr

Thr ACT ACC ACA ACG

(4) TGA TGG TGT TGC

Ser Gly Cys Arg

30 31 32 33

 

16 Ala Ala Ala Ala

Ala GCT GCC GCA GCG

(4) CGA CGG CGT CGC

Ser Gly Cys Arg

34 35 36 37

 

17 Gly Gly Gly Gly

Gly GGT GGC GGA GGG

(4) CCA CCG CCT CCC

Thr Ala Ser Pro

38 39 40 41

 

18 Ser Ser Ser Ser

Ser TCI TCC AGT AGC

(4) AGZ AGG TCA TCG

Arg Gly Thr Ala

42 43 44 45

 

19 Leu Leu Leu

Leu CTT CTC YTG

(3) GAA GAG XAC

Lys Glu Gln

46 47 48

 

 

 

 

20 Arg Arg Arg Arg

Arg CGT CGC ZGA ZGG

(4) GCA GCG ICT ICC

Thr Ala Ser Pro

49 50 51 52

 

 

 

W5

 

 

1                        Met Met Met Met

Met ATG ATG ATG ATG

(4)                        ACC ACA ACG ACT

Pro Thr Ala Ser

1 2 3 4

 

 

2 Trp Trp Trp Trp

Trp TGG TGG TGG TGG

(4)                        CCC CCA CCG CCT

Pro Thr Ala Ser

5 6 7 8

 

 

3 Phe Phe Phe Phe Phe

Phe TTT TTT TTT TTC TTC

(5)                        AAC AAA AAG AGG AGZ

Gln Lys Glu Gly Arg

9 10 11 12 13

 

 

4 Tyr Tyr Tyr Tyr

Tyr TAT TAT TAT TAT

(8)                        TAT TAC TAA TAG

Tyr His Asn Asp

14 15 16 17

 

Tyr Tyr Tyr Tyr

TAC TAC TAC TAC

TGT TGG TGA TGC

Cys Gly Ser Arg

18 19 20 21

 

 

5 His His His His

His CAT CAT CAT CAT

(8)                        TAT TAC TAA TAG

Tyr His Asn Asp

22 23 24 25

 

 

His His His His

CAC CAC CAC CAC

TGT TGG TGA TGC

Cys Gly Ser Arg

26 27 28 29

 

6 Asn Asn Asn Asn

Asn AAT AAT AAT AAT

(8)                        TAT TAC TAA TAG

Tyr His Asn Asp

30 31 32 33

 

Asn Asn Asn Asn

AAC AAC AAC AAC

TGT TGG TGA TGC

Cys Gly Ser Arg

34 35 36 37

 

 

7 Asp Asp Asp Asp

Asp GAT GAT GAT GAT

(8)                        TAT TAC TAA TAG

Tyr His Asn Asp

38 39 40 41

 

Asp Asp Asp Asp

GAC GAC GAC GAC

TGT TGG TGA TGC

Cys Gly Ser Arg

42 43 44 45

 

 

8 Cys Cys Cys Cys

Cys TGT TGT TGT TGT

(8)                        CAT CAC CAG CAA

Tyr His Asp Asn

46 47 48 49

 

Cys Cys Cys Cys

TGC TGC TGC TGC

CGT CGG CGC CGA

Cys Gly Arg Ser

50 51 52 53

 

 

9 Gln Gln Gln Gln

Gln CAA CAA CAA CAA

(8) TTT TTA TTG TTC

Phe Ile Val Leu

54 55 56 57

 

Gln Gln Gln Gln

CAG CAG CAG CAG

TCC TCA TCG TCT

Pro Thr Ala Ser

58 59 60 61

 

10 Lys Lys Lys Lys

Lys AAA AAA AAA AAA

(8)                        TTT TTA TTG TTC

Phe Ile Val Leu

62 63 64 65

 

Lys Lys Lys Lys

AAG AAG AAG AAG

TCC TCA TCG TCT

Pro Thr Ala Ser

66 67 68 69

 

 

11 Glu Glu Glu Glu

Glu GAA GAA GAA GAA

(8)                        TTT TTA TTG TTC

Phe Ile Val Leu

70 71 72 73

 

Glu Glu Glu Glu

GAG GAG GAG GAG

TCC TCA TCG TCT

Pro Thr Ala Ser

74 75 76 77

 

 

12 Ile Ile Ile

Ile ATA ATA ATA

(8)                        ATA ATG ATY

Ile Val Leu

78 79 80

 

Ile Ile Ile

ATT ATT ATT

AAC AAA AAG

Gln Lys Glu

81 82 83

 

Ile Ile

ATC ATC

AGG AGZ

Gly Arg

84                        85

 

 

13 Val Val Val

Val GTT GTT GTT

(12)                   AAC AAA AAG

Gln Lys Glu

86 87 88

 

Val Val

GTC GTC

AGG AGZ

Gly Arg

89                        90

 

Val Val Val

GTA GTA GTG

ATA ATG ATY

Ile Val Leu

91 92 93

 

Val Val Val Val

GTG GTG GTG GTG

ACC ACA ACG ACT

Pro Thr Ala Ser

94 95 96 97

 

 

14 Pro Pro Pro

Pro CCT CCT CCT

(13)                   GAC GAA GAG

Gln Lys Glu

98 99 100

 

Pro Pro Pro

CCC CCC CCC

GGT GGG GGZ

Trp Gly Arg

101 102 103

 

Pro Pro Pro

CCA CCA CCA

GTA GTG GTY

Met Val Leu

104 105 106

 

Pro Pro Pro Pro

CCG CCG CCG CCG

GCC GCA GCG GCT

Pro Thr Ala Ser

107 108 109 110

 

 

 

15 Thr Thr Thr

Thr ACT ACT ACT

(13)    GAC GAA GAG

Gln Lys Glu

111 112 113

 

Thr Thr Thr

ACC ACC ACC

GGT GGG GGZ

Trp Gly Arg

114 115 116

 

Thr Thr Thr

ACA ACA ACA

GTA GTG GTY

Met Val Leu

117 118 119

 

Thr Thr Thr Thr

ACG ACG ACG ACG

GCC GCA GCG GCT

Pro Thr Ala Ser

120 121 122 123

 

 

16 Ala Ala Ala

Ala GCT GCT GCT

(13) GAC GAA GAG

Gln Lys Glu

124 125 126

 

Ala Ala Ala

GCC GCC GCC

GGT GGG GGZ

Trp Gly Arg

127 128 129

 

Ala Ala Ala

GCA GCA GCA

GTA GTG GTY

Met Val Leu

130 131 132

 

Ala Ala Ala Ala

GCG GCG GCG GCG

GCC GCA GCG GCT

Pro Thr Ala Ser

133 134 135 136

 

 

 

17 Gly Gly Gly Gly

Gly GGT GGT GGT GGT

(15) CAT CAC CAA CAG

Tyr His Asn Asp

137 138 139 140

 

Gly Gly Gly Gly Gly

GGC GGC GGC GGC GGG

CGT CGG CGC CGA CCT

Cys Gly Arg Ser Ser

141 142 143 144.1 144.2

Gly Gly Gly Gly

GGA GGA GGA GGA

CTT CTA CTG CTC

Phe Ile Val Leu

145 146 147 148

 

Gly Gly Gly

GGG GGG GGG

CCC CCA CCG

Pro Thr Ala

149 150 151

 

 

18 Ser Ser Ser Ser Ser

Ser TCT TCT TCT TCC TCC

(18) GAC GAA GAG GGT GGG

Gln Lys Glu Trp Gly

152 153 154 155 156.1

 

Ser Ser Ser Ser Ser

AGC TCC AGC TCA TCA

CGG GGZ CGC GTA GTG

Gly Arg Arg Met Val

156.2 157.1 157.2 158 159

 

Ser Ser Ser Ser Ser

TCA TCG TCG TCG TCG

GTY GCC GCA GCG GCT

Leu Pro Thr Ala Ser

160 161 162 163 164.1

Ser Ser Ser Ser Ser

AGC AGT AGT AGT AGT

CGA CAT CAC CAA CAG

Ser Tyr His Asn Asp

164.2 165 166 167 168

 

 

 

Ser

AGC

CGT

Cys

169

 

19 Leu Leu Leu

Leu CTT CTT CTT

(12) AAC AAA AAG

Gln Lys Glu

170 171 172

 

Leu Leu Leu Leu Leu

CTC CTC YTA YTA YTA

AGG AGZ ATA ATG ATY

Gly Arg Ile Val Leu

173 174 175 176 177

 

Leu Leu Leu Leu

YTG YTG YTG YTG

ACC ACA ACG ACT

Pro Thr Ala Ser

178 179 180 181

 

 

20 Arg Arg Arg Arg

Arg CGT CGT CGT CGT

(15) CAT CAC CAA CAG

Tyr His Asn Asp

182 183 184 185

 

Arg Arg Arg Arg Arg

CGC CGC CGC ZGG CGC

CGT CGG CGA CCT CGC

Cys Gly Ser Ser Arg

186 187 188.1 188.2 189

 

Arg Arg Arg Arg

ZGA ZGA ZGA ZGA

CTT CTA CTG CTC

Phe Ile Val Leu

190 191 192 193

 

Arg Arg Arg

ZGG ZGG ZGG

CCC CCA CCG

Pro Thr Ala

194 195 196