; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G25470 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G25470
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTransposase
Genome locationChr3:22822333..22823854
RNA-Seq ExpressionCSPI03G25470
SyntenyCSPI03G25470
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0008234 - cysteine-type peptidase activity (molecular function)
InterPro domainsIPR025452 - Domain of unknown function DUF4218


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK18940.1 transposase [Cucumis melo var. makuwa]2.0e-12972.65Show/hide
Query:  GSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-----------------------
        G T LDIPGKTKD LNARRDLADLKIRPELTPIN +KKIFIPP CYTLTKKEKRFLLK+LSEMKVPRGYSSN+ N V                       
Subjt:  GSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-----------------------

Query:  -----SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKNVVRN
             SVLPKHVRYAIT LCLFFNSICNKVI+VTQVEKLQE+IVITLCLLEKYFPPSFFTIMV LTVHLVREVKLCG IYLRWMYPFERFMK IKN VRN
Subjt:  -----SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKNVVRN

Query:  RHRPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSRSKNQ
        R+RP+ CIAEG                                    GRPLSSGVT+IPERELL+QAHRYVLENTVDVQPYIEKHLI LQQQHRSRSKNQ
Subjt:  RHRPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSRSKNQ

Query:  KWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG
        KWIQDEHN+TF++WLREKV TEL TGDVE+SDNLRWIAHG
Subjt:  KWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG

TYK22670.1 transposase [Cucumis melo var. makuwa]1.1e-12972.94Show/hide
Query:  GSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-----------------------
        G T LDIPGKTKD LNARRDLADLKIRPELTPIN +KKIFIPP CYTLTKKEKRFLLK+LSEMKVPRGYSSN+ N V                       
Subjt:  GSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-----------------------

Query:  -----SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKNVVRN
             SVLPKHVRYAIT LCLFFNSICNKVI+VTQVEKLQE+IVITLCLLEKYFPPSFFTIMV LTVHLVREVKLCG IYLRWMYPFERFMK IKN VRN
Subjt:  -----SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKNVVRN

Query:  RHRPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSRSKNQ
        R+RP+ CIAEG                                    GRPLSSGVT+IPERELL+QAHRYVLENTVDVQPYIEKHLI LQQQHRSRSKNQ
Subjt:  RHRPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSRSKNQ

Query:  KWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG
        KWIQDEHN+TFI+WLREKV TEL TGDVE+SDNLRWIAHG
Subjt:  KWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG

TYK22869.1 hypothetical protein E5676_scaffold334G00040 [Cucumis melo var. makuwa]5.7e-14573.91Show/hide
Query:  AYLGHRKFLPQNHPFRHQRKSFNGQRELGSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSN
        AYLGH+KFLP NHPF  Q+KSFNGQRELGST LDIPGKTKD LNARRDLADLKIRPELTPIN +KKIFIPP CYTLTKKEKRFLLK+LSEMKVPRGYSSN
Subjt:  AYLGHRKFLPQNHPFRHQRKSFNGQRELGSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSN

Query:  IKNFV----------------------------SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVRE
        + N V                            SVLPKHVRYAIT LCLFFNSICNKVI+VTQVEKLQE+IVITLCLLEKYFPPSFFTIMV LTVHLVRE
Subjt:  IKNFV----------------------------SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVRE

Query:  VKLCGLIYLRWMYPFERFMKDIKNVVRNRHRPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVL
        VKLCG IYLRWMYPFERFMK IKN VRNR+RP+ CIAEG                                    GRPLSSGVT+IPERELL+QAHRYVL
Subjt:  VKLCGLIYLRWMYPFERFMKDIKNVVRNRHRPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVL

Query:  ENTVDVQPYIEKHLITLQQQHRSRSKNQKWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG
        ENTVDVQPYIEKHLI LQQQHRSRSKNQKWIQDEHN+TFI+WLREKV TEL TGDVE+SDNLRWIAHG
Subjt:  ENTVDVQPYIEKHLITLQQQHRSRSKNQKWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG

TYK24859.1 transposase [Cucumis melo var. makuwa]1.4e-13280.66Show/hide
Query:  GSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-----------------------
        G T LDIPGKTKD LNARRDLADLKIRPELTPIN +KKIFIPP CYTLTKKEKRFLLK+LSEMKVPRGYSSN+ N V                       
Subjt:  GSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-----------------------

Query:  -----SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKNVVRN
             SVLPKHVRYAIT LCLFFNSICNKVI+VTQVEKLQE+IVITLCLLEKYFPPSFFTIMV LTVHLVREVKLCG IYLRWMYPFERFMK IKN VRN
Subjt:  -----SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKNVVRN

Query:  RHRPDCCIAEG-GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSRSKNQKWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLR
        R+RP+ CIAE  GRPLSSGVT+IPERELL+QAHRYVLENTVDVQPYIEKHLI LQQQHRSRSKNQKWIQDEHN+TF++WLREKV TEL TGDVE+SDNLR
Subjt:  RHRPDCCIAEG-GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSRSKNQKWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLR

Query:  WIAHG
        WIAHG
Subjt:  WIAHG

XP_031745762.1 uncharacterized protein LOC116406207 [Cucumis sativus]1.0e-14163.96Show/hide
Query:  MAYLGHRKFLPQNHPFRHQRKSFNGQRELGS---------------------------------------------------------------------
        MAYLGHRKFLPQNHPFR ++KSFNGQRELGS                                                                     
Subjt:  MAYLGHRKFLPQNHPFRHQRKSFNGQRELGS---------------------------------------------------------------------

Query:  ------TFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-------------------
              T LDIPGKTKD LNARRDLADLKIRPELTPINED+ IFIPP CYTLTKKEKRFLLKTLSEMKVPRGYSSNI+N V                   
Subjt:  ------TFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-------------------

Query:  ---------SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKN
                 SVLPKHVRYAIT LCLFFNSICNKVI+VTQVEKLQE+IVITLCLLEKYFPPSFFTIMV LTVHLVREVKLCG IYLRWMYPFERFMK IKN
Subjt:  ---------SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKN

Query:  VVRNRHRPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSR
         VRNRH P+ CIAEG                                    GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSR
Subjt:  VVRNRHRPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSR

Query:  SKNQKWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG
        SKNQKWIQDEHNKTFIAWLREKVGTEL TGDVEISDNLRWIAHG
Subjt:  SKNQKWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG

TrEMBL top hitse value%identityAlignment
A0A5D3BSE4 Transposase2.1e-12973.08Show/hide
Query:  TFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-------------------------
        T LDIPGKTKD LNARRDLADLKIRPELTPIN +KKIFIPP CYTLTKKEKRFLLK+LSEMKVPRGYSSN+ N V                         
Subjt:  TFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-------------------------

Query:  ---SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKNVVRNRH
           SVLPKHVRYAIT LCLFFNSICNKVI+VTQVEKLQE+IVITLCLLEKYFPPSFFTIMV LTVHLVREVKLCG IYLRWMYPFERFMK IKN VRNR+
Subjt:  ---SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKNVVRNRH

Query:  RPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSRSKNQKW
        RP+ CIAEG                                    GRPLSSGVT+IPERELL+QAHRYVLENTVDVQPYIEKHLI LQQQHRSRSKNQKW
Subjt:  RPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSRSKNQKW

Query:  IQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG
        IQDEHN+TFI+WLREKV TEL TGDVE+SDNLRWIAHG
Subjt:  IQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG

A0A5D3D5Z2 Transposase9.5e-13072.65Show/hide
Query:  GSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-----------------------
        G T LDIPGKTKD LNARRDLADLKIRPELTPIN +KKIFIPP CYTLTKKEKRFLLK+LSEMKVPRGYSSN+ N V                       
Subjt:  GSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-----------------------

Query:  -----SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKNVVRN
             SVLPKHVRYAIT LCLFFNSICNKVI+VTQVEKLQE+IVITLCLLEKYFPPSFFTIMV LTVHLVREVKLCG IYLRWMYPFERFMK IKN VRN
Subjt:  -----SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKNVVRN

Query:  RHRPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSRSKNQ
        R+RP+ CIAEG                                    GRPLSSGVT+IPERELL+QAHRYVLENTVDVQPYIEKHLI LQQQHRSRSKNQ
Subjt:  RHRPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSRSKNQ

Query:  KWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG
        KWIQDEHN+TF++WLREKV TEL TGDVE+SDNLRWIAHG
Subjt:  KWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG

A0A5D3DH49 Transposase5.5e-13072.94Show/hide
Query:  GSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-----------------------
        G T LDIPGKTKD LNARRDLADLKIRPELTPIN +KKIFIPP CYTLTKKEKRFLLK+LSEMKVPRGYSSN+ N V                       
Subjt:  GSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-----------------------

Query:  -----SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKNVVRN
             SVLPKHVRYAIT LCLFFNSICNKVI+VTQVEKLQE+IVITLCLLEKYFPPSFFTIMV LTVHLVREVKLCG IYLRWMYPFERFMK IKN VRN
Subjt:  -----SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKNVVRN

Query:  RHRPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSRSKNQ
        R+RP+ CIAEG                                    GRPLSSGVT+IPERELL+QAHRYVLENTVDVQPYIEKHLI LQQQHRSRSKNQ
Subjt:  RHRPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSRSKNQ

Query:  KWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG
        KWIQDEHN+TFI+WLREKV TEL TGDVE+SDNLRWIAHG
Subjt:  KWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG

A0A5D3DHZ4 ULP_PROTEASE domain-containing protein2.7e-14573.91Show/hide
Query:  AYLGHRKFLPQNHPFRHQRKSFNGQRELGSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSN
        AYLGH+KFLP NHPF  Q+KSFNGQRELGST LDIPGKTKD LNARRDLADLKIRPELTPIN +KKIFIPP CYTLTKKEKRFLLK+LSEMKVPRGYSSN
Subjt:  AYLGHRKFLPQNHPFRHQRKSFNGQRELGSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSN

Query:  IKNFV----------------------------SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVRE
        + N V                            SVLPKHVRYAIT LCLFFNSICNKVI+VTQVEKLQE+IVITLCLLEKYFPPSFFTIMV LTVHLVRE
Subjt:  IKNFV----------------------------SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVRE

Query:  VKLCGLIYLRWMYPFERFMKDIKNVVRNRHRPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVL
        VKLCG IYLRWMYPFERFMK IKN VRNR+RP+ CIAEG                                    GRPLSSGVT+IPERELL+QAHRYVL
Subjt:  VKLCGLIYLRWMYPFERFMKDIKNVVRNRHRPDCCIAEG------------------------------------GRPLSSGVTNIPERELLHQAHRYVL

Query:  ENTVDVQPYIEKHLITLQQQHRSRSKNQKWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG
        ENTVDVQPYIEKHLI LQQQHRSRSKNQKWIQDEHN+TFI+WLREKV TEL TGDVE+SDNLRWIAHG
Subjt:  ENTVDVQPYIEKHLITLQQQHRSRSKNQKWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHG

A0A5D3DNH1 Transposase7.0e-13380.66Show/hide
Query:  GSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-----------------------
        G T LDIPGKTKD LNARRDLADLKIRPELTPIN +KKIFIPP CYTLTKKEKRFLLK+LSEMKVPRGYSSN+ N V                       
Subjt:  GSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFV-----------------------

Query:  -----SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKNVVRN
             SVLPKHVRYAIT LCLFFNSICNKVI+VTQVEKLQE+IVITLCLLEKYFPPSFFTIMV LTVHLVREVKLCG IYLRWMYPFERFMK IKN VRN
Subjt:  -----SVLPKHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKNVVRN

Query:  RHRPDCCIAEG-GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSRSKNQKWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLR
        R+RP+ CIAE  GRPLSSGVT+IPERELL+QAHRYVLENTVDVQPYIEKHLI LQQQHRSRSKNQKWIQDEHN+TF++WLREKV TEL TGDVE+SDNLR
Subjt:  RHRPDCCIAEG-GRPLSSGVTNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSRSKNQKWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLR

Query:  WIAHG
        WIAHG
Subjt:  WIAHG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATACCTTGGGCATCGAAAATTTCTACCACAGAATCATCCTTTTCGTCATCAAAGGAAATCGTTTAATGGTCAGCGAGAACTTGGAAGTACATTTCTTGATATTCC
AGGAAAAACTAAGGACAGTTTGAATGCTAGACGTGATTTAGCTGATTTAAAGATTCGACCTGAGCTTACTCCTATTAATGAGGATAAAAAGATATTCATTCCCCCTACTT
GTTATACTCTTACTAAGAAAGAAAAACGTTTTCTTTTAAAGACGTTATCAGAAATGAAAGTTCCTCGGGGTTACTCTTCCAATATTAAGAATTTTGTATCTGTGCTTCCA
AAACATGTTCGATATGCTATAACTCATTTGTGTCTTTTCTTCAATTCTATATGTAACAAAGTTATAAATGTTACACAAGTAGAGAAGTTGCAAGAAGAAATTGTTATTAC
ATTATGTTTACTAGAGAAATACTTCCCTCCTTCATTCTTCACAATAATGGTGCTTCTCACTGTACACCTTGTTAGAGAAGTAAAACTTTGTGGGCTCATTTATTTGCGAT
GGATGTATCCATTTGAAAGGTTCATGAAGGATATAAAAAATGTTGTGAGAAATCGACATCGTCCAGATTGTTGTATTGCTGAAGGTGGTAGACCATTGTCAAGTGGAGTT
ACTAACATACCTGAACGAGAGCTTTTACATCAAGCTCATCGATATGTTTTGGAGAATACTGTTGATGTGCAACCATATATAGAGAAACATTTGATCACATTGCAACAACA
ACACCGAAGTAGATCAAAAAACCAAAAATGGATTCAAGATGAACACAACAAAACCTTCATAGCTTGGTTACGAGAAAAGGTTGGAACGGAACTTGTAACAGGAGATGTTG
AAATTTCAGATAACTTGCGGTGGATTGCTCATGGCGCCCTCATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCATACCTTGGGCATCGAAAATTTCTACCACAGAATCATCCTTTTCGTCATCAAAGGAAATCGTTTAATGGTCAGCGAGAACTTGGAAGTACATTTCTTGATATTCC
AGGAAAAACTAAGGACAGTTTGAATGCTAGACGTGATTTAGCTGATTTAAAGATTCGACCTGAGCTTACTCCTATTAATGAGGATAAAAAGATATTCATTCCCCCTACTT
GTTATACTCTTACTAAGAAAGAAAAACGTTTTCTTTTAAAGACGTTATCAGAAATGAAAGTTCCTCGGGGTTACTCTTCCAATATTAAGAATTTTGTATCTGTGCTTCCA
AAACATGTTCGATATGCTATAACTCATTTGTGTCTTTTCTTCAATTCTATATGTAACAAAGTTATAAATGTTACACAAGTAGAGAAGTTGCAAGAAGAAATTGTTATTAC
ATTATGTTTACTAGAGAAATACTTCCCTCCTTCATTCTTCACAATAATGGTGCTTCTCACTGTACACCTTGTTAGAGAAGTAAAACTTTGTGGGCTCATTTATTTGCGAT
GGATGTATCCATTTGAAAGGTTCATGAAGGATATAAAAAATGTTGTGAGAAATCGACATCGTCCAGATTGTTGTATTGCTGAAGGTGGTAGACCATTGTCAAGTGGAGTT
ACTAACATACCTGAACGAGAGCTTTTACATCAAGCTCATCGATATGTTTTGGAGAATACTGTTGATGTGCAACCATATATAGAGAAACATTTGATCACATTGCAACAACA
ACACCGAAGTAGATCAAAAAACCAAAAATGGATTCAAGATGAACACAACAAAACCTTCATAGCTTGGTTACGAGAAAAGGTTGGAACGGAACTTGTAACAGGAGATGTTG
AAATTTCAGATAACTTGCGGTGGATTGCTCATGGCGCCCTCATCTAG
Protein sequenceShow/hide protein sequence
MAYLGHRKFLPQNHPFRHQRKSFNGQRELGSTFLDIPGKTKDSLNARRDLADLKIRPELTPINEDKKIFIPPTCYTLTKKEKRFLLKTLSEMKVPRGYSSNIKNFVSVLP
KHVRYAITHLCLFFNSICNKVINVTQVEKLQEEIVITLCLLEKYFPPSFFTIMVLLTVHLVREVKLCGLIYLRWMYPFERFMKDIKNVVRNRHRPDCCIAEGGRPLSSGV
TNIPERELLHQAHRYVLENTVDVQPYIEKHLITLQQQHRSRSKNQKWIQDEHNKTFIAWLREKVGTELVTGDVEISDNLRWIAHGALI