; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0007512 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0007512
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationchr07:2875047..2882230
RNA-Seq ExpressionPI0007512
SyntenyPI0007512
Gene Ontology termsNA
InterPro domainsIPR037119 - Haem oxygenase HugZ-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152092.1 uncharacterized protein At3g49140 isoform X2 [Cucumis sativus]2.4e-23793.27Show/hide
Query:  MAIAVASSLTFEGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTA
        MAIAVASSLTFEGA C   YAFTSSWNRSSFDVCGRNKK GSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKV K  R TELTA
Subjt:  MAIAVASSLTFEGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVH EPHEQVSWDEFQYV DDYGDLYFEIFD VNMLEDR AHNPVNALIGMDMQMYESRR VGDYS VDSGYGDVAPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDY

Query:  DYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESK
        DYIEVVEADLA+IPVDWGVPDVSS+VHPVYFAKCL KVINMEY+R MKHPSNGVSILGCLRPAYADEESY+RRLFYFEESEGYNTEWKGLEGE SNLESK
Subjt:  DYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESK

Query:  IDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQ
        IDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERF+EKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEV+
Subjt:  IDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQ

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF
        TFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLRSH DGLRDTVSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF

XP_008453943.1 PREDICTED: uncharacterized protein At3g49140 isoform X1 [Cucumis melo]1.6e-23392.15Show/hide
Query:  MAIAVASSLTFEGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTA
        MA+AVASSLTFEGASC   YAFTS WNRSSFDVCGRNKK GSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSY TNKGYHPLEDLKV KRAR TELTA
Subjt:  MAIAVASSLTFEGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVH EPHEQVSWDE QYV DDYGDLYFEIFD VNMLEDR AHNPVNALIGMDMQMYESRR +GDYSAVDSGYGDVAPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDY

Query:  DYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESK
        DYIE VEADLA+IPVDWGVPDVSSLVHPVYFAKCLNKV+N+EY+R MKHPSNGV+ILG LRP YADEESYVRRLF FEESEGYNTEWKGLEGE SNLE K
Subjt:  DYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESK

Query:  IDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQ
        IDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHST +ILERF+EKGIKCNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEV+
Subjt:  IDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQ

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF
        TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRS+ DGLRDTVSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF

XP_031740660.1 uncharacterized protein At3g49140 isoform X1 [Cucumis sativus]6.7e-23292.87Show/hide
Query:  EGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTAAEVARTAVEVN
        +GA C   YAFTSSWNRSSFDVCGRNKK GSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKV K  R TELTAAEVARTAVEVN
Subjt:  EGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTAAEVARTAVEVN

Query:  SNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDYDYIEVVEADLA
        SNALLLFPGTVH EPHEQVSWDEFQYV DDYGDLYFEIFD VNMLEDR AHNPVNALIGMDMQMYESRR VGDYS VDSGYGDVAPFDYDYIEVVEADLA
Subjt:  SNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDYDYIEVVEADLA

Query:  DIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESKIDRSSQRSTLY
        +IPVDWGVPDVSS+VHPVYFAKCL KVINMEY+R MKHPSNGVSILGCLRPAYADEESY+RRLFYFEESEGYNTEWKGLEGE SNLESKIDRSSQRSTLY
Subjt:  DIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESKIDRSSQRSTLY

Query:  RLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQTFRFPFKIRAT
        RLEI+RIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERF+EKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEV+TFRFPFKIRAT
Subjt:  RLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQTFRFPFKIRAT

Query:  SEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF
        SE AAEKQIQQLLFPRSRRKKLRSH DGLRDTVSF
Subjt:  SEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF

XP_038898170.1 uncharacterized protein At3g49140 isoform X1 [Benincasa hispida]3.0e-23290.95Show/hide
Query:  MAIAVASSLTFEGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTA
        M IAVAS+LTFEGA C   YAFTSSWNRSSFDV GRNK+ GSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSS+ TNKGYHPLEDLKV KRAR TELTA
Subjt:  MAIAVASSLTFEGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVH EPHEQVSW+EFQYVIDDYGDLYFEIFD VNMLEDR AHNPVNALIGMDMQMYESRRTVGDYSA DSGYGDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDY

Query:  DYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWK-------GLEGE
        DYIEVVE DLADIPVDWG PD SSLVHPVYFAKCLNKVINMEY+RKM HPSNGVSILGCLRPAYADEESYVRRLF+FEESEGYNTEWK       GLEGE
Subjt:  DYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWK-------GLEGE

Query:  ASNLESKIDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRV
          +LESKIDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQDFQ AEPDILLHSTAEI+ERFSEKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRV
Subjt:  ASNLESKIDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRV

Query:  CFGTEVQTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF
        CFGTEVQTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSH DGLRDTVSF
Subjt:  CFGTEVQTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF

XP_038898179.1 uncharacterized protein At3g49140 isoform X2 [Benincasa hispida]2.5e-23492.38Show/hide
Query:  MAIAVASSLTFEGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTA
        M IAVAS+LTFEGA C   YAFTSSWNRSSFDV GRNK+ GSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSS+ TNKGYHPLEDLKV KRAR TELTA
Subjt:  MAIAVASSLTFEGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVH EPHEQVSW+EFQYVIDDYGDLYFEIFD VNMLEDR AHNPVNALIGMDMQMYESRRTVGDYSA DSGYGDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDY

Query:  DYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESK
        DYIEVVE DLADIPVDWG PD SSLVHPVYFAKCLNKVINMEY+RKM HPSNGVSILGCLRPAYADEESYVRRLF+FEESEGYNTEWKGLEGE  +LESK
Subjt:  DYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESK

Query:  IDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQ
        IDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQDFQ AEPDILLHSTAEI+ERFSEKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQ
Subjt:  IDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQ

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF
        TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSH DGLRDTVSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF

TrEMBL top hitse value%identityAlignment
A0A0A0KW72 Uncharacterized protein1.2e-23793.27Show/hide
Query:  MAIAVASSLTFEGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTA
        MAIAVASSLTFEGA C   YAFTSSWNRSSFDVCGRNKK GSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKV K  R TELTA
Subjt:  MAIAVASSLTFEGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVH EPHEQVSWDEFQYV DDYGDLYFEIFD VNMLEDR AHNPVNALIGMDMQMYESRR VGDYS VDSGYGDVAPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDY

Query:  DYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESK
        DYIEVVEADLA+IPVDWGVPDVSS+VHPVYFAKCL KVINMEY+R MKHPSNGVSILGCLRPAYADEESY+RRLFYFEESEGYNTEWKGLEGE SNLESK
Subjt:  DYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESK

Query:  IDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQ
        IDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERF+EKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEV+
Subjt:  IDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQ

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF
        TFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLRSH DGLRDTVSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF

A0A1S3BY92 uncharacterized protein At3g49140 isoform X17.7e-23492.15Show/hide
Query:  MAIAVASSLTFEGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTA
        MA+AVASSLTFEGASC   YAFTS WNRSSFDVCGRNKK GSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSY TNKGYHPLEDLKV KRAR TELTA
Subjt:  MAIAVASSLTFEGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVH EPHEQVSWDE QYV DDYGDLYFEIFD VNMLEDR AHNPVNALIGMDMQMYESRR +GDYSAVDSGYGDVAPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDY

Query:  DYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESK
        DYIE VEADLA+IPVDWGVPDVSSLVHPVYFAKCLNKV+N+EY+R MKHPSNGV+ILG LRP YADEESYVRRLF FEESEGYNTEWKGLEGE SNLE K
Subjt:  DYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESK

Query:  IDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQ
        IDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHST +ILERF+EKGIKCNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEV+
Subjt:  IDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQ

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF
        TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRS+ DGLRDTVSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF

A0A1S3BYP0 uncharacterized protein At3g49140 isoform X25.7e-22992.18Show/hide
Query:  EGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTAAEVARTAVEVN
        EGASC   YAFTS WNRSSFDVCGRNKK GSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSY TNKGYHPLEDLKV KRAR TELTAAEVARTAVEVN
Subjt:  EGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTAAEVARTAVEVN

Query:  SNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDYDYIEVVEADLA
        SNALLLFPGTVH EPHEQVSWDE QYV DDYGDLYFEIFD VNMLEDR AHNPVNALIGMDMQMYESRR +GDYSAVDSGYGDVAPFDYDYIE VEADLA
Subjt:  SNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDYDYIEVVEADLA

Query:  DIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESKIDRSSQRSTLY
        +IPVDWGVPDVSSLVHPVYFAKCLNKV+N+EY+R MKHPSNGV+ILG LRP YADEESYVRRLF FEESEGYNTEWKGLEGE SNLE KIDRSSQRSTLY
Subjt:  DIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESKIDRSSQRSTLY

Query:  RLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQTFRFPFKIRAT
        RLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHST +ILERF+EKGIKCNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEV+TFRFPFKIRAT
Subjt:  RLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQTFRFPFKIRAT

Query:  SEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF
        SEVAAEKQIQQLLFPRSRRKKLRS+ DGLRDTVSF
Subjt:  SEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF

A0A5A7TTC0 Pentatricopeptide repeat (PPR) superfamily protein isoform 25.4e-21992.72Show/hide
Query:  GRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTAAEVARTAVEVNSNALLLFPGTVHREPHEQVSWDE
        GRNKK GSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSY TNKGYHPLEDLKV KRAR TELTAAEVARTAVEVNSNALLLFPGTVH EPHEQVSWDE
Subjt:  GRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTAAEVARTAVEVNSNALLLFPGTVHREPHEQVSWDE

Query:  FQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDYDYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKC
         QYV +DYGDLYFEIFD VNMLEDR AHNPVNALIGMDMQMYESRR +GDYSAVDSGYGDVAPFDYDYIE VEADLA+IPVDWGVPDVSSLVHPVYFAKC
Subjt:  FQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDYDYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKC

Query:  LNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESKIDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQD
        LNKV+N+EY+R MKHPSNGV+ILGCLRP YADEESYVRRLF FEESEGYNTEWKGLEGE SNLE KIDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQD
Subjt:  LNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESKIDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQD

Query:  FQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQDAEPDILLHST +ILERF+EKGIKCNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEV+TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
Subjt:  FQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHADGLRDTVSF
        S+ DGLRDTVSF
Subjt:  SHADGLRDTVSF

A0A6J1C800 uncharacterized protein At3g49140 isoform X11.1e-22488.57Show/hide
Query:  MAIAVASSLTFEGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTA
        MAIAVASSLTFEGA C   YAFTSSWNR S DV GRN   GSTE HWLSKGRDL  SKVSVAADYPDSVPDSSSY TN+GYHPLEDLKV KRAR TELTA
Subjt:  MAIAVASSLTFEGASC---YAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVH EPHEQVSWDEFQYVIDDYGDLYFEIFD VNMLEDR AHNPVNALIGMDMQMYESRR VGDY+A DSG GDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDY

Query:  DYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESK
        DYIEVVE DL+DIPVDWGVPDVSSLVHPVYFAKCLNKVINMEY++KMKHPSNGVSILGCLRPA+ADEESY+RRLFYFE SEGY TEWKGL+GEA + ESK
Subjt:  DYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESK

Query:  IDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQ
         D+SSQRSTLYRLEI+RIELFSVYGVQ+E+SLQDFQ+AEPDIL+HSTAEI+E FSEKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEV+
Subjt:  IDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQ

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF
        TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSH DG RD+VSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRDTVSF

SwissProt top hitse value%identityAlignment
Q0WMN5 Uncharacterized protein At3g491405.3e-4628.94Show/hide
Query:  RDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLK--VRKRARKTELTAAEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEI
        + L  ++    A+Y DS  D         YHP E+++  + +    + L+ AE  RT +EVN+   L+  G++    HE + W +  Y+ D  G+LYF++
Subjt:  RDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLK--VRKRARKTELTAAEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEI

Query:  FDGVNMLED-REAHNPVNALIGMD-MQMYESRRTVG----DYSAVDSGYGDVAPFD-------YDYIEVVE------------------ADLADIPVDWG
         +  ++++     +N V  ++G D M+M +    +G    D+   D   GD    D        +++ ++E                  +D  +   DW 
Subjt:  FDGVNMLED-REAHNPVNALIGMD-MQMYESRRTVG----DYSAVDSGYGDVAPFD-------YDYIEVVE------------------ADLADIPVDWG

Query:  VPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYV-RRLFYFEESEGYNTEWKGL------EGEASNLESKIDRSSQRS---
          +     HP++FAK + +V + +    M  PS G++I G L     ++ S + ++L     +   N + + L        +A   ES+ID S       
Subjt:  VPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYV-RRLFYFEESEGYNTEWKGL------EGEASNLESKIDRSSQRS---

Query:  --TLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVQTFRFP
            Y+LE++RI+L +  G Q+EV ++D + A+PD + H++AEI+ R  E G K   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++++ RF 
Subjt:  --TLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVQTFRFP

Query:  FKIRATSEVAAEKQIQQLLFPRSRR
        F  RATSE  AE QI++LLFP++ +
Subjt:  FKIRATSEVAAEKQIQQLLFPRSRR

Arabidopsis top hitse value%identityAlignment
AT3G49140.1 Pentatricopeptide repeat (PPR) superfamily protein3.7e-4728.94Show/hide
Query:  RDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLK--VRKRARKTELTAAEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEI
        + L  ++    A+Y DS  D         YHP E+++  + +    + L+ AE  RT +EVN+   L+  G++    HE + W +  Y+ D  G+LYF++
Subjt:  RDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLK--VRKRARKTELTAAEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEI

Query:  FDGVNMLED-REAHNPVNALIGMD-MQMYESRRTVG----DYSAVDSGYGDVAPFD-------YDYIEVVE------------------ADLADIPVDWG
         +  ++++     +N V  ++G D M+M +    +G    D+   D   GD    D        +++ ++E                  +D  +   DW 
Subjt:  FDGVNMLED-REAHNPVNALIGMD-MQMYESRRTVG----DYSAVDSGYGDVAPFD-------YDYIEVVE------------------ADLADIPVDWG

Query:  VPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYV-RRLFYFEESEGYNTEWKGL------EGEASNLESKIDRSSQRS---
          +     HP++FAK + +V + +    M  PS G++I G L     ++ S + ++L     +   N + + L        +A   ES+ID S       
Subjt:  VPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYV-RRLFYFEESEGYNTEWKGL------EGEASNLESKIDRSSQRS---

Query:  --TLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVQTFRFP
            Y+LE++RI+L +  G Q+EV ++D + A+PD + H++AEI+ R  E G K   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++++ RF 
Subjt:  --TLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVQTFRFP

Query:  FKIRATSEVAAEKQIQQLLFPRSRR
        F  RATSE  AE QI++LLFP++ +
Subjt:  FKIRATSEVAAEKQIQQLLFPRSRR

AT3G59300.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-13455.41Show/hide
Query:  MAIAVASSLTFEGASC-------------YAFTSSWNRSSFDVCG-RNKKLGSTE---------FHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKG
        M IA ASS +   + C             Y  TS+     FD CG  N  + S+          FH  S G DL  +KVSVAADY DSVPDSS Y    G
Subjt:  MAIAVASSLTFEGASC-------------YAFTSSWNRSSFDVCG-RNKKLGSTE---------FHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKG

Query:  YHPLEDLKVRKRARKTELTAAEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRR
        YHPLEDLK  KR ++T+L+A+EVART VE NS+A+L+FPG +H EPH+  SW EF+YVIDDYGD++FEI D  N+LED  A NPV A  GMD+  YE+ R
Subjt:  YHPLEDLKVRKRARKTELTAAEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRR

Query:  TVGDYSAVDSGYGDVAPFDYDYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEES
           +Y+  D G  D   FD  Y E+++++  DIP+DWG+PD S+ VHP+YFAK L+K I+M+Y+RKM +PSNGVSILGCLRPA+ DEESY+RRLF  E+ 
Subjt:  TVGDYSAVDSGYGDVAPFDYDYIEVVEADLADIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEES

Query:  EGYNTEWKGLEGEASNLESKIDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAI
        + Y+ E +G +   ++  S+ D +   S+LYRLEI+ IEL S+YG +S +SLQDFQDAEPDIL+HST+ I+ERF+ +GI  +IALKALCKK+GLH E+A 
Subjt:  EGYNTEWKGLEGEASNLESKIDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAI

Query:  LIGVDSLGMDVRVCFGTEVQTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRD
        LI VDSLGMDVRV  G +VQT RFPFK RAT+E+AAEK+I QLLFPRSRR+KL+ H + L+D
Subjt:  LIGVDSLGMDVRVCFGTEVQTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRD

AT5G24060.1 Pentatricopeptide repeat (PPR) superfamily protein3.7e-4729.21Show/hide
Query:  SKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKV---RKRARKTELTAAEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDL
        S G+ L  ++    A+Y  S  D         YHP ED++     K    + L+  E ART +EVN    L+  G +    HE + W +  YV D +G++
Subjt:  SKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKV---RKRARKTELTAAEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDL

Query:  YFEI-----------------------FDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDYDYIEVVE---------ADLADI
        YF++                       FD + M++D E  +P     G++ ++ +    V D +  D   G+    D +++ V+E         +D  + 
Subjt:  YFEI-----------------------FDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDYDYIEVVE---------ADLADI

Query:  PVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLF---------YFEESEGYNTEWKGLEGEASNLESKIDRS
          DW   +     HP+YFA+ + +V + +    M  PS G++I G L P   ++ S +++             +E E     ++G+ GE  +    ++ S
Subjt:  PVDWGVPDVSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLF---------YFEESEGYNTEWKGLEGEASNLESKIDRS

Query:  SQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVQTFR
              Y+LEI+RI+L +  G Q+EV ++D + A+PD++  ++  IL R  E G K   AL++LC +  G+  E+  LIG+DSLG D+R+C G +++T R
Subjt:  SQRSTLYRLEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVQTFR

Query:  FPFKIRATSEVAAEKQIQQLLFPRSRRK
        F F IRATSE  AE Q+++LLF  +  K
Subjt:  FPFKIRATSEVAAEKQIQQLLFPRSRRK

AT5G24060.2 Pentatricopeptide repeat (PPR) superfamily protein4.1e-4629.05Show/hide
Query:  SKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKV---RKRARKTELTAAEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEI----
        ++    A+Y  S  D         YHP ED++     K    + L+  E ART +EVN    L+  G +    HE + W +  YV D +G++YF++    
Subjt:  SKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKV---RKRARKTELTAAEVARTAVEVNSNALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEI----

Query:  -------------------FDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDYDYIEVVE---------ADLADIPVDWGVPD
                           FD + M++D E  +P     G++ ++ +    V D +  D   G+    D +++ V+E         +D  +   DW   +
Subjt:  -------------------FDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDYDYIEVVE---------ADLADIPVDWGVPD

Query:  VSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLF---------YFEESEGYNTEWKGLEGEASNLESKIDRSSQRSTLYR
             HP+YFA+ + +V + +    M  PS G++I G L P   ++ S +++             +E E     ++G+ GE  +    ++ S      Y+
Subjt:  VSSLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLF---------YFEESEGYNTEWKGLEGEASNLESKIDRSSQRSTLYR

Query:  LEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVQTFRFPFKIRAT
        LEI+RI+L +  G Q+EV ++D + A+PD++  ++  IL R  E G K   AL++LC +  G+  E+  LIG+DSLG D+R+C G +++T RF F IRAT
Subjt:  LEILRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFSEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVQTFRFPFKIRAT

Query:  SEVAAEKQIQQLLFPRSRRK
        SE  AE Q+++LLF  +  K
Subjt:  SEVAAEKQIQQLLFPRSRRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATTGCTGTAGCTTCTTCACTTACCTTTGAAGGGGCTTCTTGCTATGCATTCACAAGCAGTTGGAATAGATCTTCTTTTGACGTTTGTGGCAGAAATAAAAAATT
GGGATCAACAGAATTTCATTGGTTGTCTAAGGGACGTGACCTTTGCTCGTCAAAAGTTTCAGTCGCTGCTGATTACCCAGATTCAGTTCCAGATTCTTCAAGTTATTTTA
CTAACAAAGGTTATCATCCCCTAGAAGATCTAAAAGTTCGCAAAAGAGCACGAAAGACTGAACTCACTGCTGCAGAAGTAGCAAGGACTGCTGTGGAGGTCAATAGCAAT
GCTCTGCTGTTATTTCCTGGAACTGTGCACAGGGAACCACACGAACAAGTATCGTGGGATGAGTTTCAATATGTTATTGACGATTATGGAGATTTGTATTTTGAAATTTT
TGATGGTGTGAACATGTTAGAAGATCGTGAAGCACACAACCCTGTGAATGCTTTGATTGGAATGGACATGCAAATGTATGAGAGTAGGAGGACAGTTGGAGATTATAGTG
CGGTAGATAGTGGCTATGGTGATGTTGCTCCTTTTGATTATGATTATATTGAGGTAGTGGAGGCTGATTTAGCCGATATTCCAGTTGACTGGGGAGTTCCAGATGTTTCT
AGCTTGGTTCATCCTGTATATTTTGCCAAGTGCTTGAATAAGGTTATCAATATGGAATATGAGAGAAAGATGAAGCATCCTTCCAATGGGGTTTCCATATTGGGATGTCT
CAGACCTGCATATGCTGATGAAGAATCTTATGTAAGAAGATTATTTTACTTTGAAGAAAGTGAAGGCTACAACACAGAATGGAAAGGTTTAGAAGGCGAAGCCTCAAACT
TGGAGTCCAAAATCGATAGAAGCAGTCAAAGATCCACTCTCTACAGGTTGGAGATACTGAGAATTGAGCTCTTCTCTGTGTATGGAGTTCAGTCTGAAGTTAGTTTGCAA
GATTTTCAAGATGCTGAACCTGATATTCTTTTGCACTCTACTGCGGAAATTCTAGAGCGTTTTAGTGAGAAGGGTATTAAGTGCAATATTGCCCTTAAAGCTCTTTGCAA
AAAGAGGGGTCTTCATGTTGAGGATGCTATTTTGATCGGAGTCGATAGTCTTGGCATGGATGTGAGAGTATGTTTTGGGACAGAAGTACAGACTTTTCGATTTCCCTTTA
AAATCAGGGCAACATCAGAAGTTGCAGCAGAGAAGCAGATTCAGCAACTCTTGTTCCCACGATCTCGTCGTAAAAAATTACGAAGCCATGCGGATGGATTGAGAGATACT
GTCAGTTTTTAG
mRNA sequenceShow/hide mRNA sequence
AAAATTTTGTAGGTAATATATAAACAGAATTGAATATTGATAGACCCATTTGTTATTTTCTACATTTATTAAACTGAGATTTCTGAACACGTTGGGACTTGCGGCATCTT
TCCCTTTTAGCTGACGTTTGAATCTTCTTTATGGCCAAATCCTACTGCGACCTGCTCGTAGAAGCCGATACCCATCACACCAAAATTCGATTTTGATCTCTCTCTCTCTC
TCTCTCTCTCTCTCTCTCTCATGGCAATTGCTGTAGCTTCTTCACTTACCTTTGAAGGGGCTTCTTGCTATGCATTCACAAGCAGTTGGAATAGATCTTCTTTTGACGTT
TGTGGCAGAAATAAAAAATTGGGATCAACAGAATTTCATTGGTTGTCTAAGGGACGTGACCTTTGCTCGTCAAAAGTTTCAGTCGCTGCTGATTACCCAGATTCAGTTCC
AGATTCTTCAAGTTATTTTACTAACAAAGGTTATCATCCCCTAGAAGATCTAAAAGTTCGCAAAAGAGCACGAAAGACTGAACTCACTGCTGCAGAAGTAGCAAGGACTG
CTGTGGAGGTCAATAGCAATGCTCTGCTGTTATTTCCTGGAACTGTGCACAGGGAACCACACGAACAAGTATCGTGGGATGAGTTTCAATATGTTATTGACGATTATGGA
GATTTGTATTTTGAAATTTTTGATGGTGTGAACATGTTAGAAGATCGTGAAGCACACAACCCTGTGAATGCTTTGATTGGAATGGACATGCAAATGTATGAGAGTAGGAG
GACAGTTGGAGATTATAGTGCGGTAGATAGTGGCTATGGTGATGTTGCTCCTTTTGATTATGATTATATTGAGGTAGTGGAGGCTGATTTAGCCGATATTCCAGTTGACT
GGGGAGTTCCAGATGTTTCTAGCTTGGTTCATCCTGTATATTTTGCCAAGTGCTTGAATAAGGTTATCAATATGGAATATGAGAGAAAGATGAAGCATCCTTCCAATGGG
GTTTCCATATTGGGATGTCTCAGACCTGCATATGCTGATGAAGAATCTTATGTAAGAAGATTATTTTACTTTGAAGAAAGTGAAGGCTACAACACAGAATGGAAAGGTTT
AGAAGGCGAAGCCTCAAACTTGGAGTCCAAAATCGATAGAAGCAGTCAAAGATCCACTCTCTACAGGTTGGAGATACTGAGAATTGAGCTCTTCTCTGTGTATGGAGTTC
AGTCTGAAGTTAGTTTGCAAGATTTTCAAGATGCTGAACCTGATATTCTTTTGCACTCTACTGCGGAAATTCTAGAGCGTTTTAGTGAGAAGGGTATTAAGTGCAATATT
GCCCTTAAAGCTCTTTGCAAAAAGAGGGGTCTTCATGTTGAGGATGCTATTTTGATCGGAGTCGATAGTCTTGGCATGGATGTGAGAGTATGTTTTGGGACAGAAGTACA
GACTTTTCGATTTCCCTTTAAAATCAGGGCAACATCAGAAGTTGCAGCAGAGAAGCAGATTCAGCAACTCTTGTTCCCACGATCTCGTCGTAAAAAATTACGAAGCCATG
CGGATGGATTGAGAGATACTGTCAGTTTTTAGAACACCCTCTGCGTTATTTTGACATTTGGTGGTTCCTAGAAATAAATTTCACAGCTAAGGCGACTTCTTTTACTCAAA
TCCGTCATAGCATTAGTCTTAGTTTGAGAATTGAACAGTTCAAGTTGATAATTTGCCTCTGTTTTTATTGGATTTTGTCTGCTCTTATTTTTTCTTGGTATTACCCTCTT
GACTTTCATATCCGAATTCCCTTGGTGTCACTACAAGGTTTTTTTCCCTATGCCGATGCAAAAATGCGTCGATATATGTAACAAAACGTGTCGACAGAGACTATGCCGAC
ACATGCATGGGCAAACACCATCGGCAGAAGACAGTCAACATGCCTTCTGCGCATGCCATACATGCCAAAGTCTTCGGTTGTCGGAAGAGAAGGTCGGCACAACCCAAAGT
TCTTGTAGTGTGTTTTCCTTCACGGTTCCTCTTGGAGATTCATTCTCATTAATATTACTTGATTAGTTTAAGGCTGCTAAAAGTAGTAGGAAAAAAAAAAAAGAATAAAA
GCAAGCAGGGCCACATTAATCGATTTCTGAGATGAAAATGAAATGCTAGCGTGTGTGATTGACTATGAGGACACATCCTTTCGCTGGAT
Protein sequenceShow/hide protein sequence
MAIAVASSLTFEGASCYAFTSSWNRSSFDVCGRNKKLGSTEFHWLSKGRDLCSSKVSVAADYPDSVPDSSSYFTNKGYHPLEDLKVRKRARKTELTAAEVARTAVEVNSN
ALLLFPGTVHREPHEQVSWDEFQYVIDDYGDLYFEIFDGVNMLEDREAHNPVNALIGMDMQMYESRRTVGDYSAVDSGYGDVAPFDYDYIEVVEADLADIPVDWGVPDVS
SLVHPVYFAKCLNKVINMEYERKMKHPSNGVSILGCLRPAYADEESYVRRLFYFEESEGYNTEWKGLEGEASNLESKIDRSSQRSTLYRLEILRIELFSVYGVQSEVSLQ
DFQDAEPDILLHSTAEILERFSEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVQTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHADGLRDT
VSF