; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0008617 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0008617
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr04:30941222..30944133
RNA-Seq ExpressionPI0008617
SyntenyPI0008617
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032146.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.2e-7142.89Show/hide
Query:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI
        MS++  +GKA  DRLVEIEE MLYL+E PD++R+LESR++E+SEKA+ IDAV GR++GLPI+ELL RVDTLE  T+    INYERG SSS  AAH+EER+
Subjt:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI

Query:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------
        GELD++QK+++EMING+SEDFRVTLDVVRNEIADV+AR+ LTMRAMA+QAP GGAI                                            
Subjt:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------

Query:  ----------------------------------------------------------------------------QGVQELSSAYAAAERLFDLSPDSQ
                                                                                    Q VQ+L+SAYAAAERLFDL+ DSQ
Subjt:  ----------------------------------------------------------------------------QGVQELSSAYAAAERLFDLSPDSQ

Query:  DGSRPPSPSPERDEDDCVSSPKARGG----ALPIKP-----GSPERRQNHQKPRGRPPGCFICKGPHLARECPNRATFYAFQAALVPDPEDKKGRTGDEG
        D  R  S SP R+ D   SSPKA GG        KP      +  +R N + P  RP  CFIC+GPHLARECPN+  F+AFQA+L+ D +DK  +  DE 
Subjt:  DGSRPPSPSPERDEDDCVSSPKARGG----ALPIKP-----GSPERRQNHQKPRGRPPGCFICKGPHLARECPNRATFYAFQAALVPDPEDKKGRTGDEG

Query:  AQIEGDCPRTDIRQPNGLKMISALQLREG
          I+G   +T I     +K +S+LQ + G
Subjt:  AQIEGDCPRTDIRQPNGLKMISALQLREG

KAA0035958.1 uncharacterized protein E6C27_scaffold56G001620 [Cucumis melo var. makuwa]2.7e-7048.15Show/hide
Query:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI
        MS+   +GKA  DRLVE+EE MLYL+E  D++R+LESR++E+SEK DAIDAV GR++ LPI+ELL RVDTLE  T+    +NYERG SSSS A H+EER+
Subjt:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI

Query:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------
         ELD SQK+++EMING+SEDFR TLDVVRNEI DV+ R+ LTMRAMANQ P GGA+                                            
Subjt:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------

Query:  ----------------------QGVQELSSAYAAAERLFDLSPDSQDGSRPPSPSPERDEDDCVSSPKA-RGGALPIKPGSPE--------RRQNHQKPR
                                VQ L+SAYAAAERLFDL  DSQD  R  S SP R+ +   SSPKA  G     K   P         RR N + P 
Subjt:  ----------------------QGVQELSSAYAAAERLFDLSPDSQDGSRPPSPSPERDEDDCVSSPKA-RGGALPIKPGSPE--------RRQNHQKPR

Query:  GRPPGCFICKGPHLARECPNRATFYAFQAALVPDPEDKKGRTGDEGAQIEG
         RP  CFIC+GPHLARECPNR  F+AFQA+L+ D +DK     DE   I+G
Subjt:  GRPPGCFICKGPHLARECPNRATFYAFQAALVPDPEDKKGRTGDEGAQIEG

KAA0040892.1 uncharacterized protein E6C27_scaffold345G00520 [Cucumis melo var. makuwa]2.5e-6840.31Show/hide
Query:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI
        MS++  +GKA  DRLVEIEE MLYL+E PD++R+LESR++E+SEKA+ IDAV GR++GLPIKELL RVD LE  T+    INYERG SSS  AAH+EER+
Subjt:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI

Query:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------
         ELD++QK+++EMING+SEDF+VTLDVVRNEIADV+AR+ LTMRAMANQAP GGAI                                            
Subjt:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------QGVQELSSAYAAAERLFDLSPDSQDGSRPPSPSPERDEDDCVSSPKARGG-ALPIKPGSPE--------RRQNHQKPRGRPPGCFICKGPHLAR
              Q VQ+L+SAYAAAERLFDL+ DSQD  R  S SP R+ D   SSPKA GG   P K   P         RR N + P  RP  CFIC+GPHLAR
Subjt:  ------QGVQELSSAYAAAERLFDLSPDSQDGSRPPSPSPERDEDDCVSSPKARGG-ALPIKPGSPE--------RRQNHQKPRGRPPGCFICKGPHLAR

Query:  ECPNRATFYAFQAALVPDPEDKKGRTGDEGAQIEGDCPRTDIRQPNGLKMISALQLREG
        ECPN+  F+AFQA+L+ D +DK  +  DE   I+G   +T I     +K +S+LQ + G
Subjt:  ECPNRATFYAFQAALVPDPEDKKGRTGDEGAQIEGDCPRTDIRQPNGLKMISALQLREG

TYK03044.1 uncharacterized protein E5676_scaffold46G001390 [Cucumis melo var. makuwa]4.3e-6840.31Show/hide
Query:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI
        MS++  +GKA  DRLVEIEE MLYL+E PD++R+LESR++E+SEKA+ IDAV GR++GLPIKELL RVD LE  T+    INYERG SSS  AAH+EER+
Subjt:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI

Query:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------
         ELD++QK+++EMING+SEDF+VTLDVVRNEIADV+AR+ LTMRAMANQAP GGAI                                            
Subjt:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------QGVQELSSAYAAAERLFDLSPDSQDGSRPPSPSPERDEDDCVSSPKARGG-ALPIKPGSPE--------RRQNHQKPRGRPPGCFICKGPHLAR
              Q VQ+L+SAYAAAERLFDL+ DSQD  R  S SP R+ D   SSPKA GG   P K   P         RR N + P  RP  CFIC+GPHLAR
Subjt:  ------QGVQELSSAYAAAERLFDLSPDSQDGSRPPSPSPERDEDDCVSSPKARGG-ALPIKPGSPE--------RRQNHQKPRGRPPGCFICKGPHLAR

Query:  ECPNRATFYAFQAALVPDPEDKKGRTGDEGAQIEGDCPRTDIRQPNGLKMISALQLREG
        ECPN+  F+AFQA+L+ D +DK  +  DE   I+G   +T I     +K +S+LQ + G
Subjt:  ECPNRATFYAFQAALVPDPEDKKGRTGDEGAQIEGDCPRTDIRQPNGLKMISALQLREG

TYK14391.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]8.4e-7243.12Show/hide
Query:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI
        MS++  +GKA  DRLVEIEE MLYL+E PD++R+LESR++E+SEKA+ IDAV GR++GLPI+ELL RVDTLE  T+    INYERG SSS  AAH+EER+
Subjt:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI

Query:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------
        GELD++QK+++EMING+SEDFRVTLDVVRNEIADV+AR+ LTMRAMA+QAP GGAI                                            
Subjt:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------

Query:  ----------------------------------------------------------------------------QGVQELSSAYAAAERLFDLSPDSQ
                                                                                    Q VQ+L+SAYAAAERLFDL+ DSQ
Subjt:  ----------------------------------------------------------------------------QGVQELSSAYAAAERLFDLSPDSQ

Query:  DGSRPPSPSPERDEDDCVSSPKARGG----ALPIKP-----GSPERRQNHQKPRGRPPGCFICKGPHLARECPNRATFYAFQAALVPDPEDKKGRTGDEG
        D  R  S SP R+ D   SSPKA GG        KP      +  RR+N + P  RP  CFIC+GPHLARECPN+  F+AFQA+L+ D +DK  +  DE 
Subjt:  DGSRPPSPSPERDEDDCVSSPKARGG----ALPIKP-----GSPERRQNHQKPRGRPPGCFICKGPHLARECPNRATFYAFQAALVPDPEDKKGRTGDEG

Query:  AQIEGDCPRTDIRQPNGLKMISALQLREG
          I+G   +T I     +K +S+LQ + G
Subjt:  AQIEGDCPRTDIRQPNGLKMISALQLREG

TrEMBL top hitse value%identityAlignment
A0A5A7SQ54 Ty3-gypsy retrotransposon protein2.0e-7142.89Show/hide
Query:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI
        MS++  +GKA  DRLVEIEE MLYL+E PD++R+LESR++E+SEKA+ IDAV GR++GLPI+ELL RVDTLE  T+    INYERG SSS  AAH+EER+
Subjt:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI

Query:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------
        GELD++QK+++EMING+SEDFRVTLDVVRNEIADV+AR+ LTMRAMA+QAP GGAI                                            
Subjt:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------

Query:  ----------------------------------------------------------------------------QGVQELSSAYAAAERLFDLSPDSQ
                                                                                    Q VQ+L+SAYAAAERLFDL+ DSQ
Subjt:  ----------------------------------------------------------------------------QGVQELSSAYAAAERLFDLSPDSQ

Query:  DGSRPPSPSPERDEDDCVSSPKARGG----ALPIKP-----GSPERRQNHQKPRGRPPGCFICKGPHLARECPNRATFYAFQAALVPDPEDKKGRTGDEG
        D  R  S SP R+ D   SSPKA GG        KP      +  +R N + P  RP  CFIC+GPHLARECPN+  F+AFQA+L+ D +DK  +  DE 
Subjt:  DGSRPPSPSPERDEDDCVSSPKARGG----ALPIKP-----GSPERRQNHQKPRGRPPGCFICKGPHLARECPNRATFYAFQAALVPDPEDKKGRTGDEG

Query:  AQIEGDCPRTDIRQPNGLKMISALQLREG
          I+G   +T I     +K +S+LQ + G
Subjt:  AQIEGDCPRTDIRQPNGLKMISALQLREG

A0A5A7TBQ6 Retrotrans_gag domain-containing protein1.2e-6840.31Show/hide
Query:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI
        MS++  +GKA  DRLVEIEE MLYL+E PD++R+LESR++E+SEKA+ IDAV GR++GLPIKELL RVD LE  T+    INYERG SSS  AAH+EER+
Subjt:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI

Query:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------
         ELD++QK+++EMING+SEDF+VTLDVVRNEIADV+AR+ LTMRAMANQAP GGAI                                            
Subjt:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------QGVQELSSAYAAAERLFDLSPDSQDGSRPPSPSPERDEDDCVSSPKARGG-ALPIKPGSPE--------RRQNHQKPRGRPPGCFICKGPHLAR
              Q VQ+L+SAYAAAERLFDL+ DSQD  R  S SP R+ D   SSPKA GG   P K   P         RR N + P  RP  CFIC+GPHLAR
Subjt:  ------QGVQELSSAYAAAERLFDLSPDSQDGSRPPSPSPERDEDDCVSSPKARGG-ALPIKPGSPE--------RRQNHQKPRGRPPGCFICKGPHLAR

Query:  ECPNRATFYAFQAALVPDPEDKKGRTGDEGAQIEGDCPRTDIRQPNGLKMISALQLREG
        ECPN+  F+AFQA+L+ D +DK  +  DE   I+G   +T I     +K +S+LQ + G
Subjt:  ECPNRATFYAFQAALVPDPEDKKGRTGDEGAQIEGDCPRTDIRQPNGLKMISALQLREG

A0A5D3BV48 Retrotrans_gag domain-containing protein2.1e-6840.31Show/hide
Query:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI
        MS++  +GKA  DRLVEIEE MLYL+E PD++R+LESR++E+SEKA+ IDAV GR++GLPIKELL RVD LE  T+    INYERG SSS  AAH+EER+
Subjt:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI

Query:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------
         ELD++QK+++EMING+SEDF+VTLDVVRNEIADV+AR+ LTMRAMANQAP GGAI                                            
Subjt:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------QGVQELSSAYAAAERLFDLSPDSQDGSRPPSPSPERDEDDCVSSPKARGG-ALPIKPGSPE--------RRQNHQKPRGRPPGCFICKGPHLAR
              Q VQ+L+SAYAAAERLFDL+ DSQD  R  S SP R+ D   SSPKA GG   P K   P         RR N + P  RP  CFIC+GPHLAR
Subjt:  ------QGVQELSSAYAAAERLFDLSPDSQDGSRPPSPSPERDEDDCVSSPKARGG-ALPIKPGSPE--------RRQNHQKPRGRPPGCFICKGPHLAR

Query:  ECPNRATFYAFQAALVPDPEDKKGRTGDEGAQIEGDCPRTDIRQPNGLKMISALQLREG
        ECPN+  F+AFQA+L+ D +DK  +  DE   I+G   +T I     +K +S+LQ + G
Subjt:  ECPNRATFYAFQAALVPDPEDKKGRTGDEGAQIEGDCPRTDIRQPNGLKMISALQLREG

A0A5D3CRH9 Ty3-gypsy retrotransposon protein4.1e-7243.12Show/hide
Query:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI
        MS++  +GKA  DRLVEIEE MLYL+E PD++R+LESR++E+SEKA+ IDAV GR++GLPI+ELL RVDTLE  T+    INYERG SSS  AAH+EER+
Subjt:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI

Query:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------
        GELD++QK+++EMING+SEDFRVTLDVVRNEIADV+AR+ LTMRAMA+QAP GGAI                                            
Subjt:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------

Query:  ----------------------------------------------------------------------------QGVQELSSAYAAAERLFDLSPDSQ
                                                                                    Q VQ+L+SAYAAAERLFDL+ DSQ
Subjt:  ----------------------------------------------------------------------------QGVQELSSAYAAAERLFDLSPDSQ

Query:  DGSRPPSPSPERDEDDCVSSPKARGG----ALPIKP-----GSPERRQNHQKPRGRPPGCFICKGPHLARECPNRATFYAFQAALVPDPEDKKGRTGDEG
        D  R  S SP R+ D   SSPKA GG        KP      +  RR+N + P  RP  CFIC+GPHLARECPN+  F+AFQA+L+ D +DK  +  DE 
Subjt:  DGSRPPSPSPERDEDDCVSSPKARGG----ALPIKP-----GSPERRQNHQKPRGRPPGCFICKGPHLARECPNRATFYAFQAALVPDPEDKKGRTGDEG

Query:  AQIEGDCPRTDIRQPNGLKMISALQLREG
          I+G   +T I     +K +S+LQ + G
Subjt:  AQIEGDCPRTDIRQPNGLKMISALQLREG

A0A5D3E568 Uncharacterized protein1.3e-7048.15Show/hide
Query:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI
        MS+   +GKA  DRLVE+EE MLYL+E  D++R+LESR++E+SEK DAIDAV GR++ LPI+ELL RVDTLE  T+    +NYERG SSSS A H+EER+
Subjt:  MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTS---RINYERGSSSSSPAAHIEERI

Query:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------
         ELD SQK+++EMING+SEDFR TLDVVRNEI DV+ R+ LTMRAMANQ P GGA+                                            
Subjt:  GELDSSQKSIVEMINGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAI--------------------------------------------

Query:  ----------------------QGVQELSSAYAAAERLFDLSPDSQDGSRPPSPSPERDEDDCVSSPKA-RGGALPIKPGSPE--------RRQNHQKPR
                                VQ L+SAYAAAERLFDL  DSQD  R  S SP R+ +   SSPKA  G     K   P         RR N + P 
Subjt:  ----------------------QGVQELSSAYAAAERLFDLSPDSQDGSRPPSPSPERDEDDCVSSPKA-RGGALPIKPGSPE--------RRQNHQKPR

Query:  GRPPGCFICKGPHLARECPNRATFYAFQAALVPDPEDKKGRTGDEGAQIEG
         RP  CFIC+GPHLARECPNR  F+AFQA+L+ D +DK     DE   I+G
Subjt:  GRPPGCFICKGPHLARECPNRATFYAFQAALVPDPEDKKGRTGDEGAQIEG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTACCGCGAAACAGACGGGCAAGGCCCACGCAGACCGATTGGTAGAGATCGAAGAGCATATGCTCTACCTGTTGGAATTTCCAGACACCGTCCGCTTCTTGGAAAG
CCGAATTGAAGAAGTTTCCGAGAAAGCCGACGCTATTGATGCGGTGACCGGTCGCCTCGACGGTTTACCTATAAAGGAGTTGTTAACAAGGGTTGACACGTTAGAGGCAA
AGACCTCAAGAATAAACTACGAGCGTGGGAGCAGCTCGTCGAGCCCTGCTGCCCACATAGAGGAACGGATTGGCGAGCTAGATAGCTCCCAAAAGTCGATTGTCGAGATG
ATAAACGGCCTGTCAGAAGATTTTCGTGTCACTCTCGATGTCGTTAGGAATGAGATCGCAGACGTGAGCGCGAGGGTGGACCTCACGATGCGAGCCATGGCAAACCAAGC
TCCAACTGGAGGAGCTATACAAGGGGTCCAAGAACTCTCGTCGGCATATGCAGCAGCAGAACGACTGTTTGATCTGTCCCCTGACTCTCAAGATGGGAGTCGTCCTCCAA
GTCCCTCACCTGAAAGAGATGAGGATGATTGTGTAAGTTCTCCGAAGGCCAGAGGGGGAGCCCTCCCAATCAAACCCGGGAGTCCGGAGCGAAGACAGAACCACCAGAAG
CCTCGCGGTCGTCCCCCTGGCTGTTTCATATGTAAGGGGCCGCACCTGGCGAGAGAATGTCCGAATCGGGCCACCTTCTATGCCTTTCAGGCTGCCTTAGTCCCAGATCC
AGAAGACAAGAAGGGTCGGACAGGGGACGAAGGTGCCCAAATAGAAGGCGATTGCCCCCGGACAGACATCCGTCAGCCCAACGGACTAAAAATGATTTCCGCCCTACAGC
TGAGGGAGGGCCTTACTCGCGATGAACCGACCTTCATCACCCCTCCGCTTAAACCGCTGGGGATTCAGGAAAAAGTAACCTCGAAAGACGTTCAGCGTGCCGTAGAGAAA
TATAACGATAAGGGGCTTGATACTCACAATTCCACAAGGGAGTGGAAACGGATAGCAGACATCACTCGAGCTTATTCAGAGAAGACCTCCGAGCGGATGAAGAAGCGGGC
AGGGAAGAAACGTCGCCCCCTCGAGTTTCAGGCGAGAGACCGAGTCCTCATCAAAAGGCGATTAGGACAAGGCTGGTTTCGAGGACGTACGGACCAACACCTCGTTAGGG
AATACGAGGGCCCTGTTGAAGTCCTCAAGAAGGTGGGAAAAGCATCCTACCGAGTGGCGTTGCCCACGTGGATGAAAATGCATCCAATAATTCACGTGAGTAACTTAAAG
CCCTACCACCAAGACCCCGGTGACACCCGATGCAACGTTGTCAGCCGGCCAAACATCGGTCTGAACAAAAGGGAAGAAGACCGAGATGTTGAAGCGATCCTTGCCGACCG
GGGCGGTGTTTGCCTGTGGCCGCATGTTCGAACACCTTCACCCCACCTTGAGTTATGTTTTTCCTTTAGTTTCCATGTCACTTTGCTTTTACTTTCCGTGTCGCTTTATG
CTTTCATTTACTTTCCGTGTCGCATTGCTTTTACCTTCTATGTCGCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTACCGCGAAACAGACGGGCAAGGCCCACGCAGACCGATTGGTAGAGATCGAAGAGCATATGCTCTACCTGTTGGAATTTCCAGACACCGTCCGCTTCTTGGAAAG
CCGAATTGAAGAAGTTTCCGAGAAAGCCGACGCTATTGATGCGGTGACCGGTCGCCTCGACGGTTTACCTATAAAGGAGTTGTTAACAAGGGTTGACACGTTAGAGGCAA
AGACCTCAAGAATAAACTACGAGCGTGGGAGCAGCTCGTCGAGCCCTGCTGCCCACATAGAGGAACGGATTGGCGAGCTAGATAGCTCCCAAAAGTCGATTGTCGAGATG
ATAAACGGCCTGTCAGAAGATTTTCGTGTCACTCTCGATGTCGTTAGGAATGAGATCGCAGACGTGAGCGCGAGGGTGGACCTCACGATGCGAGCCATGGCAAACCAAGC
TCCAACTGGAGGAGCTATACAAGGGGTCCAAGAACTCTCGTCGGCATATGCAGCAGCAGAACGACTGTTTGATCTGTCCCCTGACTCTCAAGATGGGAGTCGTCCTCCAA
GTCCCTCACCTGAAAGAGATGAGGATGATTGTGTAAGTTCTCCGAAGGCCAGAGGGGGAGCCCTCCCAATCAAACCCGGGAGTCCGGAGCGAAGACAGAACCACCAGAAG
CCTCGCGGTCGTCCCCCTGGCTGTTTCATATGTAAGGGGCCGCACCTGGCGAGAGAATGTCCGAATCGGGCCACCTTCTATGCCTTTCAGGCTGCCTTAGTCCCAGATCC
AGAAGACAAGAAGGGTCGGACAGGGGACGAAGGTGCCCAAATAGAAGGCGATTGCCCCCGGACAGACATCCGTCAGCCCAACGGACTAAAAATGATTTCCGCCCTACAGC
TGAGGGAGGGCCTTACTCGCGATGAACCGACCTTCATCACCCCTCCGCTTAAACCGCTGGGGATTCAGGAAAAAGTAACCTCGAAAGACGTTCAGCGTGCCGTAGAGAAA
TATAACGATAAGGGGCTTGATACTCACAATTCCACAAGGGAGTGGAAACGGATAGCAGACATCACTCGAGCTTATTCAGAGAAGACCTCCGAGCGGATGAAGAAGCGGGC
AGGGAAGAAACGTCGCCCCCTCGAGTTTCAGGCGAGAGACCGAGTCCTCATCAAAAGGCGATTAGGACAAGGCTGGTTTCGAGGACGTACGGACCAACACCTCGTTAGGG
AATACGAGGGCCCTGTTGAAGTCCTCAAGAAGGTGGGAAAAGCATCCTACCGAGTGGCGTTGCCCACGTGGATGAAAATGCATCCAATAATTCACGTGAGTAACTTAAAG
CCCTACCACCAAGACCCCGGTGACACCCGATGCAACGTTGTCAGCCGGCCAAACATCGGTCTGAACAAAAGGGAAGAAGACCGAGATGTTGAAGCGATCCTTGCCGACCG
GGGCGGTGTTTGCCTGTGGCCGCATGTTCGAACACCTTCACCCCACCTTGAGTTATGTTTTTCCTTTAGTTTCCATGTCACTTTGCTTTTACTTTCCGTGTCGCTTTATG
CTTTCATTTACTTTCCGTGTCGCATTGCTTTTACCTTCTATGTCGCTTAG
Protein sequenceShow/hide protein sequence
MSTAKQTGKAHADRLVEIEEHMLYLLEFPDTVRFLESRIEEVSEKADAIDAVTGRLDGLPIKELLTRVDTLEAKTSRINYERGSSSSSPAAHIEERIGELDSSQKSIVEM
INGLSEDFRVTLDVVRNEIADVSARVDLTMRAMANQAPTGGAIQGVQELSSAYAAAERLFDLSPDSQDGSRPPSPSPERDEDDCVSSPKARGGALPIKPGSPERRQNHQK
PRGRPPGCFICKGPHLARECPNRATFYAFQAALVPDPEDKKGRTGDEGAQIEGDCPRTDIRQPNGLKMISALQLREGLTRDEPTFITPPLKPLGIQEKVTSKDVQRAVEK
YNDKGLDTHNSTREWKRIADITRAYSEKTSERMKKRAGKKRRPLEFQARDRVLIKRRLGQGWFRGRTDQHLVREYEGPVEVLKKVGKASYRVALPTWMKMHPIIHVSNLK
PYHQDPGDTRCNVVSRPNIGLNKREEDRDVEAILADRGGVCLWPHVRTPSPHLELCFSFSFHVTLLLLSVSLYAFIYFPCRIAFTFYVA