; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0015052 (gene) of Chayote v1 genome

Gene IDSed0015052
OrganismSechium edule (Chayote v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationLG09:28512027..28513679
RNA-Seq ExpressionSed0015052
SyntenySed0015052
Gene Ontology termsGO:0005488 - binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA8516701.1 hypothetical protein F0562_016793 [Nyssa sinensis]2.4e-3641.16Show/hide
Query:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN
        + K  ++IDSYIQ+IK     LA+V VLI+ ED++IY +NGLP  YN FKTSIRT+S++I+L E++ MLK EE  +++ +K+++      A  +     N
Subjt:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN

Query:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS
        F  NRG    N+ G GRGRGR +N  GR  S+ +  + + G S+      Q   SN   N S     PV   CQICN+ GH+ALDCYH  +F+YQGK PS
Subjt:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS

Query:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSH
         +L AM +++Y       +T +      W  D+G   H++++L+N N    Y GDDN+T+ N Q   ISH G    H
Subjt:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSH

KAA8519786.1 hypothetical protein F0562_014124 [Nyssa sinensis]2.4e-3641.16Show/hide
Query:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN
        + K  ++IDSYIQ+IK     LA+V VLI+ ED++IY +NGLP  YN FKTSIRT+S++I+L E++ MLK EE  +++ +K+++      A  +     N
Subjt:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN

Query:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS
        F  NRG    N+ G GRGRGR +N  GR  S+ +  + + G S+      Q   SN   N S     PV   CQICN+ GH+ALDCYH  +F+YQGK PS
Subjt:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS

Query:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSH
         +L AM +++Y       +T +      W  D+G   H++++L+N N    Y GDDN+T+ N Q   ISH G    H
Subjt:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSH

KAA8521875.1 hypothetical protein F0562_012811 [Nyssa sinensis]2.4e-3641.16Show/hide
Query:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN
        + K  ++IDSYIQ+IK     LA+V VLI+ ED++IY +NGLP  YN FKTSIRT+S++I+L E++ MLK EE  +++ +K+++      A  +     N
Subjt:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN

Query:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS
        F  NRG    N+ G GRGRGR +N  GR  S+ +  + + G S+      Q   SN   N S     PV   CQICN+ GH+ALDCYH  +F+YQGK PS
Subjt:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS

Query:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSH
         +L AM +++Y       +T +      W  D+G   H++++L+N N    Y GDDN+T+ N Q   ISH G    H
Subjt:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSH

KAA8523789.1 hypothetical protein F0562_010212 [Nyssa sinensis]1.4e-3641.73Show/hide
Query:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN
        + K  ++IDSYIQ+IK     LA+V VLI+ ED++IY +NGLP  YN FKTSIRT+S++I+L E++ MLK EE  +++ +K+++      A  +     N
Subjt:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN

Query:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS
        F  NRG    N+ G GRGRGR +N  GR  S+ +  + + G S+      Q   SN   N S     PV   CQICN+ GH+ALDCYH  +F+YQGK PS
Subjt:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS

Query:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSHK
         +L AM +++Y       +T +      W  D+G   H++++L+N N    Y GDDN+T+ N Q   ISH   GQSH+
Subjt:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSHK

KAA8524269.1 hypothetical protein F0562_010692 [Nyssa sinensis]2.4e-3641.16Show/hide
Query:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN
        + K  ++IDSYIQ+IK     LA+V VLI+ ED++IY +NGLP  YN FKTSIRT+S++I+L E++ MLK EE  +++ +K+++      A  +     N
Subjt:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN

Query:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS
        F  NRG    N+ G GRGRGR +N  GR  S+ +  + + G S+      Q   SN   N S     PV   CQICN+ GH+ALDCYH  +F+YQGK PS
Subjt:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS

Query:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSH
         +L AM +++Y       +T +      W  D+G   H++++L+N N    Y GDDN+T+ N Q   ISH G    H
Subjt:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSH

TrEMBL top hitse value%identityAlignment
A0A2N9G2I0 Uncharacterized protein3.1e-3740.88Show/hide
Query:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAAL--DAQNKKDDYATQSLAMTENFNR
        +KK GE+I SY+Q++KN   +L AV +LID E+L+   + GLP  Y  F ++IRTR++ +S  E+ ++L+ EE +L   + + KD  A    A     NR
Subjt:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAAL--DAQNKKDDYATQSLAMTENFNR

Query:  GGNWRGS-------GRGRGRGNNGRGRGSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPSAK
          N + S        RGRGR N+ RGRG  +  +N  S    SS SN+   G +TF   S        P CQIC + GH ALDCYH  +FAYQG+ P AK
Subjt:  GGNWRGS-------GRGRGRGNNGRGRGSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPSAK

Query:  LAAMVTSSYRQPRFPAHTSNTQDSG-IWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQ
        LAAM           A TSN   +G  WL D+G   HL++ ++N N  +PY G+D V VGN Q   I++ G GQ
Subjt:  LAAMVTSSYRQPRFPAHTSNTQDSG-IWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQ

A0A2N9G2N5 Uncharacterized protein1.1e-3740.29Show/hide
Query:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAAL--DAQNKKDDYATQSLAMTENFNR
        +KK GE+I SY+Q++KN   +L AV +LID E+L+   + GLP  Y  F ++IRTR++ +S  E+ ++L+ +E +L   + + KD  A    A     NR
Subjt:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAAL--DAQNKKDDYATQSLAMTENFNR

Query:  GGNWRGS-------GRGRGRGNNGRGRGSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPSAK
          N + S        RGRGR N+ RGRG  +  +N  S    SS SN+   G +TF   S        P CQIC + GH ALDCYH  +FAYQG+ P AK
Subjt:  GGNWRGS-------GRGRGRGNNGRGRGSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPSAK

Query:  LAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQ
        LAAM ++S          + TQ    WL D+G   HL++ +SN N  +PY G+D V VGN Q   I++ G GQ
Subjt:  LAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQ

A0A5J4ZT09 Flavin-containing monooxygenase1.2e-3641.16Show/hide
Query:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN
        + K  ++IDSYIQ+IK     LA+V VLI+ ED++IY +NGLP  YN FKTSIRT+S++I+L E++ MLK EE  +++ +K+++      A  +     N
Subjt:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN

Query:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS
        F  NRG    N+ G GRGRGR +N  GR  S+ +  + + G S+      Q   SN   N S     PV   CQICN+ GH+ALDCYH  +F+YQGK PS
Subjt:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS

Query:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSH
         +L AM +++Y       +T +      W  D+G   H++++L+N N    Y GDDN+T+ N Q   ISH G    H
Subjt:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSH

A0A5J5A0G3 Retrotran_gag_3 domain-containing protein6.9e-3741.73Show/hide
Query:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN
        + K  ++IDSYIQ+IK     LA+V VLI+ ED++IY +NGLP  YN FKTSIRT+S++I+L E++ MLK EE  +++ +K+++      A  +     N
Subjt:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN

Query:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS
        F  NRG    N+ G GRGRGR +N  GR  S+ +  + + G S+      Q   SN   N S     PV   CQICN+ GH+ALDCYH  +F+YQGK PS
Subjt:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS

Query:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSHK
         +L AM +++Y       +T +      W  D+G   H++++L+N N    Y GDDN+T+ N Q   ISH   GQSH+
Subjt:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSHK

A0A5J5B049 Retrotran_gag_3 domain-containing protein1.2e-3641.16Show/hide
Query:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN
        + K  ++IDSYIQ+IK     LA+V VLI+ ED++IY +NGLP  YN FKTSIRT+S++I+L E++ MLK EE  +++ +K+++      A  +     N
Subjt:  MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDY-----ATQSLAMTEN

Query:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS
        F  NRG    N+ G GRGRGR +N  GR  S+ +  + + G S+      Q   SN   N S     PV   CQICN+ GH+ALDCYH  +F+YQGK PS
Subjt:  F--NRG---GNWRGSGRGRGRGNNGRGR-GSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPS

Query:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSH
         +L AM +++Y       +T +      W  D+G   H++++L+N N    Y GDDN+T+ N Q   ISH G    H
Subjt:  AKLAAMVTSSYRQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSH

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.5e-0925.35Show/hide
Query:  KSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDYATQSLAMTENFNRGGNW
        K  +TID Y+Q +   + +LA +   +D ++ V   +  LP  Y      I  +    +L E+H  L   E+ + A +        + A++       N 
Subjt:  KSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDYATQSLAMTENFNRGGNW

Query:  RGSGRGRGRGNNGRGRGSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNF--AYQGKQPSAKLAAMVTSSY
                  NNG     Y  ++N ++ S     S+T    +N   NQS    +P    CQIC  +GH+A  C  + +F  +   +QP +          
Subjt:  RGSGRGRGRGNNGRGRGSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNF--AYQGKQPSAKLAAMVTSSY

Query:  RQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSHKQNSLLWSKQGWYLPNLY
         QPR      +   S  WL DSG   H++S+ +N +   PY+G D+V V +     ISH G      ++  L      Y+PN++
Subjt:  RQPRFPAHTSNTQDSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSHKQNSLLWSKQGWYLPNLY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-0825.46Show/hide
Query:  YHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDYATQSLAMTE---NFNRGGNWRGSGRGRGRGNNG
        + +LA +   +D ++ V   +  LP  Y      I  +    SL E+H  L   E+ L A N  +     +  +T    N NR  N RG  R     NN 
Subjt:  YHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDYATQSLAMTE---NFNRGGNWRGSGRGRGRGNNG

Query:  RGRGSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPSAKLAAMVTSSYR--QPRFPAHTSNTQ
               + +++   SS S S N Q               +P    CQIC+ +GH+A  C  +  F     Q  +      TS +   QPR     ++  
Subjt:  RGRGSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPSAKLAAMVTSSYR--QPRFPAHTSNTQ

Query:  DSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSHKQNSLLWSKQGWYLPNLY
        ++  WL DSG   H++S+ +N +   PY+G D+V + +     I+H G       +  L   +  Y+PN++
Subjt:  DSGIWLYDSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSHKQNSLLWSKQGWYLPNLY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAATCTGGAGAAACAATTGATTCCTACATTCAAAGAATTAAGAATTTGTATCATCGACTTGCTGCTGTTGAAGTTCTAATTGATCCGGAAGATTTGGTGATCTA
CACTGTTAATGGCCTTCCTTTGGCTTACAATGTCTTCAAGACCTCTATTCGCACTCGTTCACAGAGTATTTCATTATTTGAGTTGCATATTATGTTAAAATTTGAAGAAG
CTGCCCTTGATGCTCAAAACAAGAAAGATGATTATGCAACACAATCTTTGGCAATGACTGAAAATTTTAATCGAGGGGGTAACTGGCGCGGTTCTGGTCGAGGTCGTGGC
AGAGGTAATAATGGAAGAGGTCGAGGTTCATATTCGCAACAATCAAATTTCTCCTCTGGTTCTTCTTCTTCTTCTTCTTCGAACACACAAGGTTTTGGATCAAATACCTT
TGGGAATCAGTCAGATCTTGCAGACCGACCAGTCAATCCTCCATGTCAAATCTGTAACAGAGAAGGACATAATGCCTTAGATTGTTACCACATAGGGAATTTCGCATATC
AAGGAAAACAACCTTCAGCAAAGTTAGCAGCCATGGTAACATCTTCATACAGACAACCTCGTTTCCCTGCTCATACAAGTAATACTCAAGATTCTGGAATTTGGTTATAT
GACTCGGGCTGTAATATACATCTCTCTAGTGAACTAAGCAACTTCAACACAACCTCACCTTATTCCGGAGATGATAATGTAACAGTTGGCAACGACCAACCACCACAAAT
TTCCCATCAAGGCATTGGACAAAGCCACAAGCAAAACTCTCTATTATGGTCCAAGCAAGGATGGTATTTACCCAATCTATACTTCAAATTCTTTGCCTCCCTTCCCAAAA
GCTAA
mRNA sequenceShow/hide mRNA sequence
GCTACACCGGCTTCGTCATCTTCTACTTCTTTCTCAAATTCGTCCATTCATCTACTAACAAACATTTGCAATCTTGTATCTATGCGATTCGATTCGACCAAATTTGTATT
ATGGCGATTCCAAATCTCCCCACTGCTGAAAGCTCACAAGCTCTATGGCTATGTCGATGGAACCATATCTGCACCAAAAATCAGCGAAACATCTGAGACAATGACTACTG
AAGAAAGAACTGCGTACGACCAATGGTTCGAAAGAGAACAAGCATTTATGACACTGCTCAATGCAACATTATCTCCAACAGCGCTTTCCTTAGCGAGAGGGTGTGAAACA
TCTAAAGATCTATGGGAAACCCTAGAAAAACAATTTTCATCTTCTACCCATACCTAGATTGTTGGTCTCAAGACTGATCTTCAAGGTATAATGAAGAAATCTGGAGAAAC
AATTGATTCCTACATTCAAAGAATTAAGAATTTGTATCATCGACTTGCTGCTGTTGAAGTTCTAATTGATCCGGAAGATTTGGTGATCTACACTGTTAATGGCCTTCCTT
TGGCTTACAATGTCTTCAAGACCTCTATTCGCACTCGTTCACAGAGTATTTCATTATTTGAGTTGCATATTATGTTAAAATTTGAAGAAGCTGCCCTTGATGCTCAAAAC
AAGAAAGATGATTATGCAACACAATCTTTGGCAATGACTGAAAATTTTAATCGAGGGGGTAACTGGCGCGGTTCTGGTCGAGGTCGTGGCAGAGGTAATAATGGAAGAGG
TCGAGGTTCATATTCGCAACAATCAAATTTCTCCTCTGGTTCTTCTTCTTCTTCTTCTTCGAACACACAAGGTTTTGGATCAAATACCTTTGGGAATCAGTCAGATCTTG
CAGACCGACCAGTCAATCCTCCATGTCAAATCTGTAACAGAGAAGGACATAATGCCTTAGATTGTTACCACATAGGGAATTTCGCATATCAAGGAAAACAACCTTCAGCA
AAGTTAGCAGCCATGGTAACATCTTCATACAGACAACCTCGTTTCCCTGCTCATACAAGTAATACTCAAGATTCTGGAATTTGGTTATATGACTCGGGCTGTAATATACA
TCTCTCTAGTGAACTAAGCAACTTCAACACAACCTCACCTTATTCCGGAGATGATAATGTAACAGTTGGCAACGACCAACCACCACAAATTTCCCATCAAGGCATTGGAC
AAAGCCACAAGCAAAACTCTCTATTATGGTCCAAGCAAGGATGGTATTTACCCAATCTATACTTCAAATTCTTTGCCTCCCTTCCCAAAAGCTAATATTAGTATTAAAGA
TTCGTCCAAACTTTGGCATGATAGATTAGGACACCCAAATGATGTTGTTCTAAAGAATTTTCTTAGTACTTTTTCTTCTCAAGTTTCTATAGATACTGAGTTTTGTAATG
AATGTAAAAGTGGAAAAATGTGTAAGCAATTTTTTCCCATCTCTGTTTCTGTTTCTCACTCACCTTTAGAGTTGTTA
Protein sequenceShow/hide protein sequence
MKKSGETIDSYIQRIKNLYHRLAAVEVLIDPEDLVIYTVNGLPLAYNVFKTSIRTRSQSISLFELHIMLKFEEAALDAQNKKDDYATQSLAMTENFNRGGNWRGSGRGRG
RGNNGRGRGSYSQQSNFSSGSSSSSSSNTQGFGSNTFGNQSDLADRPVNPPCQICNREGHNALDCYHIGNFAYQGKQPSAKLAAMVTSSYRQPRFPAHTSNTQDSGIWLY
DSGCNIHLSSELSNFNTTSPYSGDDNVTVGNDQPPQISHQGIGQSHKQNSLLWSKQGWYLPNLYFKFFASLPKS