; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MELO3C034992 (gene) of Melon (DHL92) v4 genome

Gene IDMELO3C034992
OrganismCucumis melo DHL92 (Melon (DHL92) v4)
DescriptionCACTA en-spm transposon protein
Genome locationchr11:13432792..13434750
RNA-Seq ExpressionMELO3C034992
SyntenyMELO3C034992
Gene Ontology termsNA
InterPro domainsIPR004252 - Probable transposase, Ptta/En/Spm, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045433.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]3.0e-9771.43Show/hide
Query:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN
        MTIAPGAEKPISPHV+RFSQAIGVCVRKTFP                      R FVLDFNDQAMNRFVEHQM+TTFKEF ADCHRHFKKY DPEEARAN
Subjt:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN

Query:  PPNAL--------------------EQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPT
        PPNAL                    EQSRT KAARQKQPYNHSSGS SFLQRQY+L ER+G+ VD V+LF+ETHVRAG F+SQA EDAHNQMLELQSQPT
Subjt:  PPNAL--------------------EQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPT

Query:  LEGSQPLSEDEICDQVLGRRPGYSKGLGGDPSRRPGERRVQAVH--RHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKN
         EGSQPLSEDEICDQVLGRRPGYSKGLG  P  +P  RR  +    RHLVRSP K+RLNYKLNF KLWN LKYKIEITK+
Subjt:  LEGSQPLSEDEICDQVLGRRPGYSKGLGGDPSRRPGERRVQAVH--RHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKN

KAA0048108.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]6.3e-9574.32Show/hide
Query:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTF----------------------PRFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN
        MTIAP A+KPISPH +RFSQAIGVCVRK F                       R FVLDFN+QAMNRFVEHQM+TTFKEF ADC +HFKKYSDPEEARAN
Subjt:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTF----------------------PRFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN

Query:  PPNALEQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPTLEGSQPLSEDEICDQVLGRR
        PPNALEQSRT KAARQKQPYNHSSGS SFLQRQYELAERKG+LVD V+L ++THVRAGTF+SQA EDAHNQ+LELQ QPT EGSQPLS+DEICDQVLGR+
Subjt:  PPNALEQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPTLEGSQPLSEDEICDQVLGRR

Query:  PGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITK
        PGYSKGLG  P  +P  RR+    RHLV SPQK+RLNYKLNF KLW  LKYKIEITK
Subjt:  PGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITK

KAA0066607.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]4.7e-9866.13Show/hide
Query:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN
        MTIAPGAEKPISPH           VRKTFP                      R FVLDFNDQAMNRFVEHQM+TTFKEF ADCHRHFKKYSDPEEARAN
Subjt:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN

Query:  PPNAL--------------------EQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPT
        PPNAL                    EQSRT KAARQKQPYNHSSGS SFLQRQYELAERKG+ VD V+LF+ETHVRAGTF+SQA EDAHNQMLELQSQPT
Subjt:  PPNAL--------------------EQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPT

Query:  LEGSQPLSEDEICDQVLGRRPGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKNSRRNETSGKKRRERGFPDAGKC
        LEGSQPLSE+EICDQVLGRR GYSKGLG  P  RP ER+VQA+ RHLVRSP K+RLNYKLNF KLWNELK    I  ++ R            FPDAG+C
Subjt:  LEGSQPLSEDEICDQVLGRRPGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKNSRRNETSGKKRRERGFPDAGKC

Query:  AGIVCVGKEAFPT
         GI  VG + FPT
Subjt:  AGIVCVGKEAFPT

TYK14211.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]7.0e-10270.65Show/hide
Query:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN
        MTIAPGAEKPISPH           VRKTFP                      R FVLDFNDQAMNRFVEHQM+TTFKEF ADCHRHFKKYSDPEEARAN
Subjt:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN

Query:  PPNALEQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPTLEGSQPLSEDEICDQVLGRR
        PPNALEQSRT KAARQKQPYNHSSGS SFLQRQYELAERKG+ VD V+LF+ETHVRAGTF+SQA EDAHNQMLELQSQPTLEGSQPLSE+EICDQVLGRR
Subjt:  PPNALEQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPTLEGSQPLSEDEICDQVLGRR

Query:  PGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKNSRRNETSGKKRRERGFPDAGKCAGIVCVGKEAFPT
         GYSKGLG  P  RP ER+VQA+ RHLVRSP K+RLNYKLNF KLWNELK    I  ++ R            FPDAG+C GI  VG + FPT
Subjt:  PGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKNSRRNETSGKKRRERGFPDAGKCAGIVCVGKEAFPT

TYK21553.1 CACTA en-spm transposon protein [Cucumis melo var. makuwa]6.3e-9562.39Show/hide
Query:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN
        MTIAPGAEKPISPH VRFSQAIGVCVRKTFP                      R FVLDFNDQAMNRFVEHQM+TTFKEF ADCHRHFKKYSDPEEARAN
Subjt:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN

Query:  PPNAL--------------------EQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPT
        PPNAL                    EQSRT KAARQKQPYNHSSGS SFLQRQYELAER+G+ VD V+LF+ETHVRAGTF+SQA EDAHNQMLELQSQP 
Subjt:  PPNAL--------------------EQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPT

Query:  LEGSQPLSEDEICDQVLGRRPGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKNSRRNETSG-------KKRRERG
         EGSQPLSEDEICDQVLGRRPGYSKGLG  P  +P  RR  +         Q  +   +L  K   +E   +IE+   + +   S        KKRRER 
Subjt:  LEGSQPLSEDEICDQVLGRRPGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKNSRRNETSG-------KKRRERG

Query:  FPDAGKCAGIVCVGKEAFPTRAMPTLS
        FPDAG+CAGI C+   +F T+AMPTL+
Subjt:  FPDAGKCAGIVCVGKEAFPTRAMPTLS

TrEMBL top hitse value%identityAlignment
A0A5A7TPH1 CACTA en-spm transposon protein1.5e-9771.43Show/hide
Query:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN
        MTIAPGAEKPISPHV+RFSQAIGVCVRKTFP                      R FVLDFNDQAMNRFVEHQM+TTFKEF ADCHRHFKKY DPEEARAN
Subjt:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN

Query:  PPNAL--------------------EQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPT
        PPNAL                    EQSRT KAARQKQPYNHSSGS SFLQRQY+L ER+G+ VD V+LF+ETHVRAG F+SQA EDAHNQMLELQSQPT
Subjt:  PPNAL--------------------EQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPT

Query:  LEGSQPLSEDEICDQVLGRRPGYSKGLGGDPSRRPGERRVQAVH--RHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKN
         EGSQPLSEDEICDQVLGRRPGYSKGLG  P  +P  RR  +    RHLVRSP K+RLNYKLNF KLWN LKYKIEITK+
Subjt:  LEGSQPLSEDEICDQVLGRRPGYSKGLGGDPSRRPGERRVQAVH--RHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKN

A0A5A7TYQ7 CACTA en-spm transposon protein3.1e-9574.32Show/hide
Query:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTF----------------------PRFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN
        MTIAP A+KPISPH +RFSQAIGVCVRK F                       R FVLDFN+QAMNRFVEHQM+TTFKEF ADC +HFKKYSDPEEARAN
Subjt:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTF----------------------PRFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN

Query:  PPNALEQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPTLEGSQPLSEDEICDQVLGRR
        PPNALEQSRT KAARQKQPYNHSSGS SFLQRQYELAERKG+LVD V+L ++THVRAGTF+SQA EDAHNQ+LELQ QPT EGSQPLS+DEICDQVLGR+
Subjt:  PPNALEQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPTLEGSQPLSEDEICDQVLGRR

Query:  PGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITK
        PGYSKGLG  P  +P  RR+    RHLV SPQK+RLNYKLNF KLW  LKYKIEITK
Subjt:  PGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITK

A0A5A7VHM7 CACTA en-spm transposon protein2.3e-9866.13Show/hide
Query:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN
        MTIAPGAEKPISPH           VRKTFP                      R FVLDFNDQAMNRFVEHQM+TTFKEF ADCHRHFKKYSDPEEARAN
Subjt:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN

Query:  PPNAL--------------------EQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPT
        PPNAL                    EQSRT KAARQKQPYNHSSGS SFLQRQYELAERKG+ VD V+LF+ETHVRAGTF+SQA EDAHNQMLELQSQPT
Subjt:  PPNAL--------------------EQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPT

Query:  LEGSQPLSEDEICDQVLGRRPGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKNSRRNETSGKKRRERGFPDAGKC
        LEGSQPLSE+EICDQVLGRR GYSKGLG  P  RP ER+VQA+ RHLVRSP K+RLNYKLNF KLWNELK    I  ++ R            FPDAG+C
Subjt:  LEGSQPLSEDEICDQVLGRRPGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKNSRRNETSGKKRRERGFPDAGKC

Query:  AGIVCVGKEAFPT
         GI  VG + FPT
Subjt:  AGIVCVGKEAFPT

A0A5D3CQY3 CACTA en-spm transposon protein3.4e-10270.65Show/hide
Query:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN
        MTIAPGAEKPISPH           VRKTFP                      R FVLDFNDQAMNRFVEHQM+TTFKEF ADCHRHFKKYSDPEEARAN
Subjt:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN

Query:  PPNALEQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPTLEGSQPLSEDEICDQVLGRR
        PPNALEQSRT KAARQKQPYNHSSGS SFLQRQYELAERKG+ VD V+LF+ETHVRAGTF+SQA EDAHNQMLELQSQPTLEGSQPLSE+EICDQVLGRR
Subjt:  PPNALEQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPTLEGSQPLSEDEICDQVLGRR

Query:  PGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKNSRRNETSGKKRRERGFPDAGKCAGIVCVGKEAFPT
         GYSKGLG  P  RP ER+VQA+ RHLVRSP K+RLNYKLNF KLWNELK    I  ++ R            FPDAG+C GI  VG + FPT
Subjt:  PGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKNSRRNETSGKKRRERGFPDAGKCAGIVCVGKEAFPT

A0A5D3DDA9 CACTA en-spm transposon protein3.1e-9562.39Show/hide
Query:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN
        MTIAPGAEKPISPH VRFSQAIGVCVRKTFP                      R FVLDFNDQAMNRFVEHQM+TTFKEF ADCHRHFKKYSDPEEARAN
Subjt:  MTIAPGAEKPISPHVVRFSQAIGVCVRKTFP----------------------RFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARAN

Query:  PPNAL--------------------EQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPT
        PPNAL                    EQSRT KAARQKQPYNHSSGS SFLQRQYELAER+G+ VD V+LF+ETHVRAGTF+SQA EDAHNQMLELQSQP 
Subjt:  PPNAL--------------------EQSRTKKAARQKQPYNHSSGSMSFLQRQYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPT

Query:  LEGSQPLSEDEICDQVLGRRPGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKNSRRNETSG-------KKRRERG
         EGSQPLSEDEICDQVLGRRPGYSKGLG  P  +P  RR  +         Q  +   +L  K   +E   +IE+   + +   S        KKRRER 
Subjt:  LEGSQPLSEDEICDQVLGRRPGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNFKKLWNELKYKIEITKNSRRNETSG-------KKRRERG

Query:  FPDAGKCAGIVCVGKEAFPTRAMPTLS
        FPDAG+CAGI C+   +F T+AMPTL+
Subjt:  FPDAGKCAGIVCVGKEAFPTRAMPTLS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGATCGCCCCTGGAGCGGAGAAGCCTATTTCTCCACACGTCGTTCGCTTCAGTCAGGCGATAGGCGTGTGCGTGCGAAAGACATTTCCCCGATTCTTTGTGCTTGA
TTTCAATGATCAAGCAATGAACAGGTTTGTTGAGCATCAGATGGTCACGACCTTTAAAGAGTTCTGGGCCGACTGTCATAGACATTTCAAAAAGTACAGCGACCCGGAGG
AGGCTCGTGCCAACCCACCAAACGCATTGGAGCAATCACGGACGAAGAAGGCTGCTAGACAGAAACAGCCTTACAATCATAGTAGCGGGTCTATGTCGTTTCTACAACGA
CAGTATGAACTCGCTGAGAGAAAAGGGGAGCTGGTCGATCATGTGAAATTGTTCCAGGAAACACACGTTCGAGCTGGGACATTTATGTCGCAGGCCGTCGAGGATGCACA
TAATCAAATGCTGGAACTCCAATCCCAGCCTACCCTAGAGGGTAGTCAGCCACTCTCTGAGGATGAGATATGTGATCAGGTATTGGGTAGACGACCAGGCTACTCAAAAG
GCCTTGGTGGGGACCCAAGCCGAAGGCCCGGAGAACGGCGAGTGCAAGCAGTTCATCGACATCTTGTTCGTAGTCCACAGAAAAGGAGATTGAATTACAAGCTAAACTTC
AAGAAGCTTTGGAACGAATTGAAGTACAAGATAGAAATCACCAAGAATTCCCGACGTAACGAAACATCGGGAAAAAAACGTCGGGAAAGAGGATTTCCCGACGCTGGAAA
GTGCGCCGGCATAGTCTGCGTCGGGAAAGAGGCATTCCCGACGCGGGCTATGCCGACGCTTAGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCCTCCAGTTTGAGGACGATTTAGATAACATCGCGGGAGGGTCGTCATCTGTGGACGACAATACGGGTGAGCCTAAGTATTTTTTTCATTTTAACTTACAAATATT
TCATAGTCTTCTTCTCAACAAGCGACTCCGACTCCGACTCCTAGGAGACATGCGCATTGTCGTCTCTTAGAGTTAGAGCGCCACATTGCAATAAATGGACGCATTCCGAT
GACGATCGCCCCTGGAGCGGAGAAGCCTATTTCTCCACACGTCGTTCGCTTCAGTCAGGCGATAGGCGTGTGCGTGCGAAAGACATTTCCCCGATTCTTTGTGCTTGATT
TCAATGATCAAGCAATGAACAGGTTTGTTGAGCATCAGATGGTCACGACCTTTAAAGAGTTCTGGGCCGACTGTCATAGACATTTCAAAAAGTACAGCGACCCGGAGGAG
GCTCGTGCCAACCCACCAAACGCATTGGAGCAATCACGGACGAAGAAGGCTGCTAGACAGAAACAGCCTTACAATCATAGTAGCGGGTCTATGTCGTTTCTACAACGACA
GTATGAACTCGCTGAGAGAAAAGGGGAGCTGGTCGATCATGTGAAATTGTTCCAGGAAACACACGTTCGAGCTGGGACATTTATGTCGCAGGCCGTCGAGGATGCACATA
ATCAAATGCTGGAACTCCAATCCCAGCCTACCCTAGAGGGTAGTCAGCCACTCTCTGAGGATGAGATATGTGATCAGGTATTGGGTAGACGACCAGGCTACTCAAAAGGC
CTTGGTGGGGACCCAAGCCGAAGGCCCGGAGAACGGCGAGTGCAAGCAGTTCATCGACATCTTGTTCGTAGTCCACAGAAAAGGAGATTGAATTACAAGCTAAACTTCAA
GAAGCTTTGGAACGAATTGAAGTACAAGATAGAAATCACCAAGAATTCCCGACGTAACGAAACATCGGGAAAAAAACGTCGGGAAAGAGGATTTCCCGACGCTGGAAAGT
GCGCCGGCATAGTCTGCGTCGGGAAAGAGGCATTCCCGACGCGGGCTATGCCGACGCTTAGTTAG
Protein sequenceShow/hide protein sequence
MTIAPGAEKPISPHVVRFSQAIGVCVRKTFPRFFVLDFNDQAMNRFVEHQMVTTFKEFWADCHRHFKKYSDPEEARANPPNALEQSRTKKAARQKQPYNHSSGSMSFLQR
QYELAERKGELVDHVKLFQETHVRAGTFMSQAVEDAHNQMLELQSQPTLEGSQPLSEDEICDQVLGRRPGYSKGLGGDPSRRPGERRVQAVHRHLVRSPQKRRLNYKLNF
KKLWNELKYKIEITKNSRRNETSGKKRRERGFPDAGKCAGIVCVGKEAFPTRAMPTLS