CuGenDBv2

PI0006451 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0006451
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: RNA-directed DNA polymerase
Genome location: chr03:23798901..23800777
RNA-Seq Expression: PI0006451
Synteny: PI0006451
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
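
The genome location field can be split into chromosome, start, and end to get the locus length. The sketch below is illustrative only: `parse_location` is a hypothetical helper, not part of CuGenDB, and it assumes the coordinates are 1-based and inclusive, which this page does not state explicitly.

```python
import re

def parse_location(loc: str):
    """Split a 'chrom:start..end' string into (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so
    length = end - start + 1.
    """
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom = m.group(1)
    start, end = int(m.group(2)), int(m.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("chr03:23798901..23800777")
print(chrom, start, end, length)
```

Under the inclusive-coordinate assumption, the locus spans 1877 bp.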


Homology
GenBank top hits (e value, % identity, alignment)
KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]; e value: 1.8e-43; % identity: 41.31
Query:  MKERFLPADYEQILYQQYQRCKQGKRR-AKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK
        MK+RF+P +YEQ LY QYQ C+QG R+ A+Y EEFHRL  RT + E E + I+ FV GL+ +++E++  QP   LS AI+ A   E   E R K+  ++K
Subjt:  MKERFLPADYEQILYQQYQRCKQGKRR-AKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK

Query:  SPWDKPTTSYQQKGFENTKYGQSSSQPRSKEDQLPKASQCSKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDDTAPNSKEEVN
         PW+   +     G  N+K   ++S+   ++++     +  + ++   N YQRP  G CY+C Q  H SNQC Q K +A  +  +   + +     EE  
Subjt:  SPWDKPTTSYQQKGFENTKYGQSSSQPRSKEDQLPKASQCSKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDDTAPNSKEEVN

Query:  ELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKVCYVIIDSRNSENILFK
         +  +EG+ LSC++Q++L++PK E   QRHSLF+TRCTI GKVC VIIDS +SEN + K
Subjt:  ELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKVCYVIIDSRNSENILFK

XP_022138327.1 uncharacterized protein LOC111009540 isoform X1 [Momordica charantia]; e value: 7.7e-42; % identity: 44.18
Query:  MKERFLPADYEQILYQQYQRCKQG-KRRAKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK
        M+ERFLP ++EQ+LYQ YQRC+QG K  A Y E FHRL A+T I E+E+Y+IARFVDGL+E+IQ+QMD QPI  L+ AI MA K E   ++++  + +++
Subjt:  MKERFLPADYEQILYQQYQRCKQG-KRRAKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK

Query:  SPWDKPT---TSYQQKGFENTKYGQSSSQPRSKEDQLPKASQC---SKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDDTAPN
        +PWDKP+   T+    G +  + G +S+      D   K+S       + +   N Y RP LG C++C Q  HLSN+C Q + +A V+ ++  E D    
Subjt:  SPWDKPT---TSYQQKGFENTKYGQSSSQPRSKEDQLPKASQC---SKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDDTAPN

Query:  SKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKV
        ++++   + P+EG+ LSC++Q++ LTPK E  PQR+SLF+T  TINGK+
Subjt:  SKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKV

XP_031741035.1 uncharacterized protein LOC116403692 [Cucumis sativus]; e value: 1.1e-45; % identity: 44.61
Query:  MKERFLPADYEQILYQQYQRCKQGKRR-AKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK
        +K RFLP +YEQ LY QYQ C+QG R  A+Y EEFHRL+ART ++E+E +Q+ARFV GL+ +I+E++  QP   LS AIS A   E     R KN ++++
Subjt:  MKERFLPADYEQILYQQYQRCKQGKRR-AKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK

Query:  SPW----------DKPTTSYQQKGFENTKYGQSSSQPRSKEDQLPKASQCSKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDD
        S W          D+P+TS + KG E     Q  +  R KE     + Q         N+Y RP+LG C++C Q  HLS+ C Q K +A  E E  Q  +
Subjt:  SPW----------DKPTTSYQQKGFENTKYGQSSSQPRSKEDQLPKASQCSKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDD

Query:  TAPNSKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKVCYVIIDSRNSENILFK
         +  ++EE   +  ++GE++SC+IQ++L+TPK E + QRH LF+TRCTING+VC VIIDS +SEN + K
Subjt:  TAPNSKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKVCYVIIDSRNSENILFK

XP_031743026.1 uncharacterized protein LOC116404533 [Cucumis sativus]; e value: 2.3e-46; % identity: 44.98
Query:  MKERFLPADYEQILYQQYQRCKQGKRR-AKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK
        +K RFLP +YEQ LY QYQ C+QG R  A+Y EEFHRL+ART ++E+E +Q+ARFV GL+ +I+E++  QP   LS AIS A   E     R KN ++++
Subjt:  MKERFLPADYEQILYQQYQRCKQGKRR-AKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK

Query:  SPW----------DKPTTSYQQKGFENTKYGQSSSQPRSKEDQLPKASQCSKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDD
        S W          D+P+TS + KG E     Q  +  R KE     + Q         N Y RP+LG C++C Q  HLSN C Q K +A  E E  Q  +
Subjt:  SPW----------DKPTTSYQQKGFENTKYGQSSSQPRSKEDQLPKASQCSKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDD

Query:  TAPNSKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKVCYVIIDSRNSENILFK
         +  ++EE   +  ++GE++SC+IQ++L+TPK E + QRH LF+TRCTING+VC VIIDS +SEN + K
Subjt:  TAPNSKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKVCYVIIDSRNSENILFK

XP_031745468.1 uncharacterized protein LOC116405837 [Cucumis sativus]; e value: 8.7e-46; % identity: 44.81
Query:  MKERFLPADYEQILYQQYQRCKQGKRR-AKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK
        +K RFLP +YEQ LY QYQ C+QG R  A+Y EEFHRL+ART ++E+E +Q+ARFV GL+ +I+E++  QP   LS AIS A   E     R KN ++++
Subjt:  MKERFLPADYEQILYQQYQRCKQGKRR-AKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK

Query:  SPW----------DKPTTSYQQKGFENTKYGQSSSQPRSKEDQLPKASQCSKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVE-GEESQED
        S W          D+P+TS + KG E     Q  +  R KE     + Q         N Y RP+LG C++C Q  HLSN C Q K +A  E G ++ ED
Subjt:  SPW----------DKPTTSYQQKGFENTKYGQSSSQPRSKEDQLPKASQCSKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVE-GEESQED

Query:  DTAPNSKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKVCYVIIDSRNSENILFK
             ++EE   +  ++GE++SC+IQ++L+TPK E + QRH LF+TRCTING+VC VIID+ +SEN + K
Subjt:  DTAPNSKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKVCYVIIDSRNSENILFK

TrEMBL top hits (e value, % identity, alignment)
A0A5D3D655 Retrotrans_gag domain-containing protein; e value: 5.0e-39; % identity: 40.38
Query:  MKERFLPADYEQILYQQYQRCKQGKRR-AKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK
        MK+RFLP +YEQ LY QYQ C+QG R+ A+Y EEFHRL ART + ESE + IARF  GL+ +++E++  Q    LS  I  AY   ++     ++  ++K
Subjt:  MKERFLPADYEQILYQQYQRCKQGKRR-AKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK

Query:  SPWDKPTT--SYQQKGFENTKYGQSSSQPRSKEDQLPKASQCSKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEE----SQEDDTAPN
          W+  T+  +   +     K  +   +P  K+D   K      N++ + N Y RP  G CY+C Q  H S    Q K VA+V+ EE    S E+D    
Subjt:  SPWDKPTT--SYQQKGFENTKYGQSSSQPRSKEDQLPKASQCSKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEE----SQEDDTAPN

Query:  SKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKVCYVIIDSRNSENILFK
         +++   +  +EG+ LSC++Q++L+ PK E  PQ HSLF+TRCT+ GK+C VIIDS +SEN + K
Subjt:  SKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKVCYVIIDSRNSENILFK

A0A5D3DGR0 Reverse transcriptase; e value: 8.8e-44; % identity: 41.31
Query:  MKERFLPADYEQILYQQYQRCKQGKRR-AKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK
        MK+RF+P +YEQ LY QYQ C+QG R+ A+Y EEFHRL  RT + E E + I+ FV GL+ +++E++  QP   LS AI+ A   E   E R K+  ++K
Subjt:  MKERFLPADYEQILYQQYQRCKQGKRR-AKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK

Query:  SPWDKPTTSYQQKGFENTKYGQSSSQPRSKEDQLPKASQCSKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDDTAPNSKEEVN
         PW+   +     G  N+K   ++S+   ++++     +  + ++   N YQRP  G CY+C Q  H SNQC Q K +A  +  +   + +     EE  
Subjt:  SPWDKPTTSYQQKGFENTKYGQSSSQPRSKEDQLPKASQCSKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDDTAPNSKEEVN

Query:  ELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKVCYVIIDSRNSENILFK
         +  +EG+ LSC++Q++L++PK E   QRHSLF+TRCTI GKVC VIIDS +SEN + K
Subjt:  ELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKVCYVIIDSRNSENILFK

A0A6J1CAS9 uncharacterized protein LOC111009540 isoform X1; e value: 3.7e-42; % identity: 44.18
Query:  MKERFLPADYEQILYQQYQRCKQG-KRRAKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK
        M+ERFLP ++EQ+LYQ YQRC+QG K  A Y E FHRL A+T I E+E+Y+IARFVDGL+E+IQ+QMD QPI  L+ AI MA K E   ++++  + +++
Subjt:  MKERFLPADYEQILYQQYQRCKQG-KRRAKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK

Query:  SPWDKPT---TSYQQKGFENTKYGQSSSQPRSKEDQLPKASQC---SKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDDTAPN
        +PWDKP+   T+    G +  + G +S+      D   K+S       + +   N Y RP LG C++C Q  HLSN+C Q + +A V+ ++  E D    
Subjt:  SPWDKPT---TSYQQKGFENTKYGQSSSQPRSKEDQLPKASQC---SKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDDTAPN

Query:  SKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKV
        ++++   + P+EG+ LSC++Q++ LTPK E  PQR+SLF+T  TINGK+
Subjt:  SKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKV

A0A6J1CCQ8 uncharacterized protein LOC111009540 isoform X2; e value: 4.8e-42; % identity: 44.35
Query:  MKERFLPADYEQILYQQYQRCKQG-KRRAKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK
        M+ERFLP ++EQ+LYQ YQRC+QG K  A Y E FHRL A+T I E+E+Y+IARFVDGL+E+IQ+QMD QPI  L+ AI MA K E   ++++  + +++
Subjt:  MKERFLPADYEQILYQQYQRCKQG-KRRAKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK

Query:  SPWDKPT---TSYQQKGFENTKYGQSSSQPRSKEDQLPKASQC---SKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDDTAPN
        +PWDKP+   T+    G +  + G +S+      D   K+S       + +   N Y RP LG C++C Q  HLSN+C Q + +A V+ ++  E D    
Subjt:  SPWDKPT---TSYQQKGFENTKYGQSSSQPRSKEDQLPKASQC---SKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDDTAPN

Query:  SKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGK
        ++++   + P+EG+ LSC++Q++ LTPK E  PQR+SLF+T  TINGK
Subjt:  SKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGK

A0A6P9EIQ8 uncharacterized protein LOC108991242; e value: 6.3e-42; % identity: 40.66
Query:  MKERFLPADYEQILYQQYQRCKQGKRRA-KYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK
        M+ RFLP DYEQ+LYQQYQ C+QG R   +Y EEF+RLN+R  ++E+E  Q+AR++ GL+  IQ+++    + TLS A+++A K E++  R         
Subjt:  MKERFLPADYEQILYQQYQRCKQGKRRA-KYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKK

Query:  SPWDKPTTSYQQKGFENTKYGQSSSQPRSKED-------QLPKASQCSKNQ--ELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDDT
         P   P+ S   KG E          P S  D       Q PK +  +         N Y+RP  G C++CNQ  H S +C   + V  V+G+ES ++D 
Subjt:  SPWDKPTTSYQQKGFENTKYGQSSSQPRSKED-------QLPKASQCSKNQ--ELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDDT

Query:  APNSKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKVCYVIIDSRNSENILFKARAST
           S+EE   +  +EG+ ++C+IQ++LLTPK E H QRH +F+TRCTIN KVC +IIDS + ENI+ +A  +T
Subjt:  APNSKEEVNELAPNEGEQLSCMIQQILLTPKTETHPQRHSLFQTRCTINGKVCYVIIDSRNSENILFKARAST

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAAAGAAAGATTTTTGCCGGCGGATTATGAACAAATATTGTACCAACAATATCAAAGATGCAAACAAGGGAAGCGGAGAGCTAAATATGCAGAAGAATTTCACCGACT
TAATGCAAGAACAAGAATCAACGAAAGTGAAAACTACCAAATTGCGAGATTTGTCGACGGCCTTAAAGAAGAAATCCAAGAACAAATGGACTTCCAACCAATCAGCACCC
TTTCAGCAGCAATCTCAATGGCTTACAAAGCCGAGATCAAGGCGGAAAGAAGACAAAAAAACAGCATTAGCAAGAAGAGCCCATGGGACAAACCGACTACTTCGTATCAA
CAAAAGGGATTCGAAAACACCAAATATGGTCAAAGCTCAAGTCAACCAAGAAGTAAAGAAGATCAACTCCCCAAAGCAAGTCAATGTTCAAAAAATCAAGAACTCGCGAT
CAATACTTACCAAAGACCGAATCTTGGATTTTGCTACCAATGCAACCAAAAGAGACACTTGTCTAACCAATGCTTACAAGGGAAGATGGTAGCATATGTTGAAGGGGAAG
AAAGCCAAGAAGACGATACGGCACCAAATTCCAAAGAAGAAGTCAATGAGTTAGCACCGAATGAGGGGGAGCAACTTTCTTGCATGATTCAACAAATTCTACTCACACCA
AAAACCGAGACTCACCCACAACGACATTCATTATTCCAAACACGTTGTACAATCAATGGCAAGGTTTGCTACGTCATAATTGATAGTCGGAATAGCGAAAACATTCTCTT
CAAAGCTCGTGCAAGCACTTGA
mRNA sequence
ATGAAAGAAAGATTTTTGCCGGCGGATTATGAACAAATATTGTACCAACAATATCAAAGATGCAAACAAGGGAAGCGGAGAGCTAAATATGCAGAAGAATTTCACCGACT
TAATGCAAGAACAAGAATCAACGAAAGTGAAAACTACCAAATTGCGAGATTTGTCGACGGCCTTAAAGAAGAAATCCAAGAACAAATGGACTTCCAACCAATCAGCACCC
TTTCAGCAGCAATCTCAATGGCTTACAAAGCCGAGATCAAGGCGGAAAGAAGACAAAAAAACAGCATTAGCAAGAAGAGCCCATGGGACAAACCGACTACTTCGTATCAA
CAAAAGGGATTCGAAAACACCAAATATGGTCAAAGCTCAAGTCAACCAAGAAGTAAAGAAGATCAACTCCCCAAAGCAAGTCAATGTTCAAAAAATCAAGAACTCGCGAT
CAATACTTACCAAAGACCGAATCTTGGATTTTGCTACCAATGCAACCAAAAGAGACACTTGTCTAACCAATGCTTACAAGGGAAGATGGTAGCATATGTTGAAGGGGAAG
AAAGCCAAGAAGACGATACGGCACCAAATTCCAAAGAAGAAGTCAATGAGTTAGCACCGAATGAGGGGGAGCAACTTTCTTGCATGATTCAACAAATTCTACTCACACCA
AAAACCGAGACTCACCCACAACGACATTCATTATTCCAAACACGTTGTACAATCAATGGCAAGGTTTGCTACGTCATAATTGATAGTCGGAATAGCGAAAACATTCTCTT
CAAAGCTCGTGCAAGCACTTGATCTCAAGCTAGACTCACATCCACATCCTTACAAAGTTATTTGGATCAAGAAAAGCGGCGAAGCACAGAAAGTTCAACTTGCACAATCC
CTCTCTCTATCGGTAATTTCTATAAAGATCAAATCATTTGTGATGTTCTTAATATGGATGTTTGTCGTATTTTGCTAGGATGTGGCAATACGACTTATAAGCAATACACC
GAGAAAGAGAAAACACTTATGAATTCACTTGGATGGGACAAAAAGTGAAGCTTCTACCTTCTATGAGCCCAACGGATAAAGTGAGTACCAAAACAAGAAAGAAATAAAGA
AACAACTTTTTTGCATCCAAGAAAGTGGAAGAATCATTGACAAGGAAGACAAGGAAGTGTGGGCCTTGATAGTAAAAGATCAAGTAACAACTCCGTTCTTCCAAGAAGAA
AATGAAGATATCAAAAAATTGCTAGAAGAATTTCACCAAGTGTTGGAAACTCCCACTAATTTGCCACCATTGAGAGATATCCAACACAACATTGATCTCATCCCGGGCTC
AGCGATTCCAAATTTACCTCACTACAGAATGAGTCCAAAAGAGTACGAGATTCTTCAAGAACAAGTAAGTGAACTTCTAGAGAAAGGACATGTTAGACCAAGCCTAAGTC
CTTGCACCGTGCCCGTTCTCCCAACTCCCAAGAAAGACGAATCGTGGAAGATGTGCGTGGATCAACAAAATTACAATCAAATACCGCTTTCCAATTCCAAGGATGTCCGA
CTTATTTGATCAACTTGGAGGAGCGAAAATTCTCCAAGATTGACTTAAAAAGTGGGTACCACCAAATAAGGATCCGATGGGGCGATGAATGGAAGACGGCATTTAAGACT
AATGAGGGTCTCATTGAGTGGTTAGTGATGCCTTTTGGACTTTCCAATGCGCCTAGTACATTCATGAGGTTAATGACCCAGGTTCTACACTCTATCTTAATAAATTTTTG
GTAGTGTATTTTGATAACATTTTAGTATATAGCACTTCATATGAAGAGCCTCTATCATACTTGCATAAACCTTTTAAAATGCTTCAAGAAAACCAACTTACTATTAATTT
TATTAAT
Protein sequence
MKERFLPADYEQILYQQYQRCKQGKRRAKYAEEFHRLNARTRINESENYQIARFVDGLKEEIQEQMDFQPISTLSAAISMAYKAEIKAERRQKNSISKKSPWDKPTTSYQ
QKGFENTKYGQSSSQPRSKEDQLPKASQCSKNQELAINTYQRPNLGFCYQCNQKRHLSNQCLQGKMVAYVEGEESQEDDTAPNSKEEVNELAPNEGEQLSCMIQQILLTP
KTETHPQRHSLFQTRCTINGKVCYVIIDSRNSENILFKARAST
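
The protein sequence above should be recoverable by translating the CDS with the standard genetic code. The sketch below is a minimal, dependency-free way to check that. It assumes the standard nuclear codon table applies to this gene, and `translate` is a hypothetical helper written for illustration rather than a CuGenDB tool. Whitespace and line breaks in the pasted CDS are ignored, and translation stops at the first stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Codons containing characters outside A/C/G/T become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First ten codons of the CDS sequence listed above.
print(translate("ATGAAAGAAAGATTTTTGCCGGCGGATTAT"))  # MKERFLPADY
```

Passing the full CDS and comparing the result with the listed protein sequence gives a quick consistency check on the record.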