; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024980 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024980
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationtig00002486:5071926..5086043
RNA-Seq ExpressionSgr024980
SyntenySgr024980
Gene Ontology termsNA
InterPro domainsIPR037119 - Haem oxygenase HugZ-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152092.1 uncharacterized protein At3g49140 isoform X2 [Cucumis sativus]6.4e-22389.81Show/hide
Query:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS
        GA CS SYAFTSSWNRSSFDV GRN  FGSTE HWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKVCK  R+TELTAAEVART VEVNS
Subjt:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS

Query:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD
        NALLLFP TVHSEPHEQVSWDEFQYV DDYGDLYFEIFDS+NMLEDR AHNPVNALIGMDMQMYESRRIVGDY+  DSG GDV PFDYDYIEVVE DL++
Subjt:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD

Query:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY
        IPVDWGVPDV SS+VHPVYFAKCL KVINMEYD+ MKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGET + ESK D SS+RSTLY
Subjt:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY

Query:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT
        RLEIMRIELFSVYGVQSE+SLQDFQDAEPDIL+HSTAEI+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEVRTFRFPFK+RAT
Subjt:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT

Query:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS
        SE AAEKQIQQLLFPRSRRKKLRSHGDG RD+
Subjt:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS

XP_022137915.1 uncharacterized protein At3g49140 isoform X1 [Momordica charantia]3.5e-23795.37Show/hide
Query:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS
        GACCSTSYAFTSSWNR S DVRGRNP+FGSTELHWLSKGRDL LSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVART VEVNS
Subjt:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS

Query:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD
        NALLLFP TVHSEPHEQVSWDEFQYVIDDYGDLYFEIFD++NMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD
Subjt:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD

Query:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY
        IPVDWGVPDV SSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPA+ADEESYIRRLFYFE SEGY TEWKGL+GE LSFESKSD SS+RSTLY
Subjt:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY

Query:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT
        RLEIMRIELFSVYGVQ+EISLQDFQ+AEPDILVHSTAEIVE FSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFK+RAT
Subjt:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT

Query:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS
        SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS
Subjt:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS

XP_031740660.1 uncharacterized protein At3g49140 isoform X1 [Cucumis sativus]6.4e-22389.81Show/hide
Query:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS
        GA CS SYAFTSSWNRSSFDV GRN  FGSTE HWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKVCK  R+TELTAAEVART VEVNS
Subjt:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS

Query:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD
        NALLLFP TVHSEPHEQVSWDEFQYV DDYGDLYFEIFDS+NMLEDR AHNPVNALIGMDMQMYESRRIVGDY+  DSG GDV PFDYDYIEVVE DL++
Subjt:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD

Query:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY
        IPVDWGVPDV SS+VHPVYFAKCL KVINMEYD+ MKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGET + ESK D SS+RSTLY
Subjt:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY

Query:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT
        RLEIMRIELFSVYGVQSE+SLQDFQDAEPDIL+HSTAEI+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEVRTFRFPFK+RAT
Subjt:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT

Query:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS
        SE AAEKQIQQLLFPRSRRKKLRSHGDG RD+
Subjt:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS

XP_038898170.1 uncharacterized protein At3g49140 isoform X1 [Benincasa hispida]1.0e-22890.66Show/hide
Query:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS
        GACCSTSYAFTSSWNRSSFDVRGRN  FGSTE HWLSKGRDLCLSKVSVAADYPDSVPDSSS+LTN+GYHPLEDLKVCKRAR+TELTAAEVART VEVNS
Subjt:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS

Query:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD
        NALLLFP TVHSEPHEQVSW+EFQYVIDDYGDLYFEIFDS+NMLEDRGAHNPVNALIGMDMQMYESRR VGDY+A DSG GDVVPFDYDYIEVVETDL+D
Subjt:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD

Query:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWK-------GLEGETLSFESKSDGS
        IPVDWG PD  SSLVHPVYFAKCLNKVINMEYD+KM HPSNGVSILGCLRPAYADEESY+RRLF+FEESEGYNTEWK       GLEGETLS ESK D S
Subjt:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWK-------GLEGETLSFESKSDGS

Query:  SRRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRF
        S+RSTLYRLEIMRIELFSVYGVQSE+SLQDFQ AEPDIL+HSTAEI+ERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEV+TFRF
Subjt:  SRRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRF

Query:  PFKMRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS
        PFK+RATSEVAAEKQIQQLLFPRSRRKKLRSHGDG RD+
Subjt:  PFKMRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS

XP_038898179.1 uncharacterized protein At3g49140 isoform X2 [Benincasa hispida]8.3e-23192.13Show/hide
Query:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS
        GACCSTSYAFTSSWNRSSFDVRGRN  FGSTE HWLSKGRDLCLSKVSVAADYPDSVPDSSS+LTN+GYHPLEDLKVCKRAR+TELTAAEVART VEVNS
Subjt:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS

Query:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD
        NALLLFP TVHSEPHEQVSW+EFQYVIDDYGDLYFEIFDS+NMLEDRGAHNPVNALIGMDMQMYESRR VGDY+A DSG GDVVPFDYDYIEVVETDL+D
Subjt:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD

Query:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY
        IPVDWG PD  SSLVHPVYFAKCLNKVINMEYD+KM HPSNGVSILGCLRPAYADEESY+RRLF+FEESEGYNTEWKGLEGETLS ESK D SS+RSTLY
Subjt:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY

Query:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT
        RLEIMRIELFSVYGVQSE+SLQDFQ AEPDIL+HSTAEI+ERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEV+TFRFPFK+RAT
Subjt:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT

Query:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS
        SEVAAEKQIQQLLFPRSRRKKLRSHGDG RD+
Subjt:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS

TrEMBL top hitse value%identityAlignment
A0A0A0KW72 Uncharacterized protein3.1e-22389.81Show/hide
Query:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS
        GA CS SYAFTSSWNRSSFDV GRN  FGSTE HWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKVCK  R+TELTAAEVART VEVNS
Subjt:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS

Query:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD
        NALLLFP TVHSEPHEQVSWDEFQYV DDYGDLYFEIFDS+NMLEDR AHNPVNALIGMDMQMYESRRIVGDY+  DSG GDV PFDYDYIEVVE DL++
Subjt:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD

Query:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY
        IPVDWGVPDV SS+VHPVYFAKCL KVINMEYD+ MKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGET + ESK D SS+RSTLY
Subjt:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY

Query:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT
        RLEIMRIELFSVYGVQSE+SLQDFQDAEPDIL+HSTAEI+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEVRTFRFPFK+RAT
Subjt:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT

Query:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS
        SE AAEKQIQQLLFPRSRRKKLRSHGDG RD+
Subjt:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS

A0A1S3BY92 uncharacterized protein At3g49140 isoform X12.3e-21887.5Show/hide
Query:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS
        GA CSTSYAFTS WNRSSFDV GRN  FGSTE HWLSKGRDLC SKVSVAADYPDSVPDSSSY TN+GYHPLEDLKVCKRAR+TELTAAEVART VEVNS
Subjt:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS

Query:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD
        NALLLFP TVHSEPHEQVSWDE QYV DDYGDLYFEIFDS+NMLEDRGAHNPVNALIGMDMQMYESRRI+GDY+A DSG GDV PFDYDYIE VE DL++
Subjt:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD

Query:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY
        IPVDWGVPDV SSLVHPVYFAKCLNKV+N+EYD+ MKHPSNGV+ILG LRP YADEESY+RRLF FEESEGYNTEWKGLEGET + E K D SS+RSTLY
Subjt:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY

Query:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT
        RLEI+RIELFSVYGVQSE+SLQDFQDAEPDIL+HST +I+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEVRTFRFPFK+RAT
Subjt:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT

Query:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS
        SEVAAEKQIQQLLFPRSRRKKLRS+GDG RD+
Subjt:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS

A0A1S3BYP0 uncharacterized protein At3g49140 isoform X22.3e-21887.5Show/hide
Query:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS
        GA CSTSYAFTS WNRSSFDV GRN  FGSTE HWLSKGRDLC SKVSVAADYPDSVPDSSSY TN+GYHPLEDLKVCKRAR+TELTAAEVART VEVNS
Subjt:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS

Query:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD
        NALLLFP TVHSEPHEQVSWDE QYV DDYGDLYFEIFDS+NMLEDRGAHNPVNALIGMDMQMYESRRI+GDY+A DSG GDV PFDYDYIE VE DL++
Subjt:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD

Query:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY
        IPVDWGVPDV SSLVHPVYFAKCLNKV+N+EYD+ MKHPSNGV+ILG LRP YADEESY+RRLF FEESEGYNTEWKGLEGET + E K D SS+RSTLY
Subjt:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY

Query:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT
        RLEI+RIELFSVYGVQSE+SLQDFQDAEPDIL+HST +I+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEVRTFRFPFK+RAT
Subjt:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT

Query:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS
        SEVAAEKQIQQLLFPRSRRKKLRS+GDG RD+
Subjt:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS

A0A6J1C800 uncharacterized protein At3g49140 isoform X11.7e-23795.37Show/hide
Query:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS
        GACCSTSYAFTSSWNR S DVRGRNP+FGSTELHWLSKGRDL LSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVART VEVNS
Subjt:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS

Query:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD
        NALLLFP TVHSEPHEQVSWDEFQYVIDDYGDLYFEIFD++NMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD
Subjt:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD

Query:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY
        IPVDWGVPDV SSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPA+ADEESYIRRLFYFE SEGY TEWKGL+GE LSFESKSD SS+RSTLY
Subjt:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY

Query:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT
        RLEIMRIELFSVYGVQ+EISLQDFQ+AEPDILVHSTAEIVE FSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFK+RAT
Subjt:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT

Query:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS
        SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS
Subjt:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS

A0A6J1GXY6 uncharacterized protein At3g49140-like isoform X21.0e-21887.99Show/hide
Query:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS
        GACCSTS+AFTS W+RSSFDVRGRNPIFGSTE HWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKV KRAR+TELTAAEVART VEVNS
Subjt:  GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNS

Query:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD
        NALLLFP TVHSEPHE+VSWDEFQYVIDDYGDLYFEIFDS NMLEDRGAHNPV ALIGMD+QMYES R VGDY A DS  GDV+PF +DYIE VETDL+D
Subjt:  NALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSD

Query:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY
         PVDWGV DV SSLVHP+YFAKCLNKVINMEYD+KMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEG+N EWK L GETL FESKSD SS+RSTLY
Subjt:  IPVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY

Query:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT
        RLE MRIELFSVYGVQSE+SLQDF+DAEPDIL+HSTAEIVERF EKGIRCNIALKALCKK+GLHV+DA LIGVDSLGMDVRVCFG EVRT+RFPFK+RAT
Subjt:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRAT

Query:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSS
        SEVAAEKQIQQLLFPRSRRK+LRSHGDG  D++
Subjt:  SEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSS

SwissProt top hitse value%identityAlignment
Q0WMN5 Uncharacterized protein At3g491401.3e-4528.5Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC--KRARDTELTAAEVARTTVEVNSNALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMN
        ++    A+Y DS  D         YHP E+++    +   D+ L+ AE  RT +EVN+   L+   ++    HE + W +  Y+ D  G+LYF++ +  +
Subjt:  SKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC--KRARDTELTAAEVARTTVEVNSNALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMN

Query:  MLED-RGAHNPVNALIGMD-MQMYESRRIVG----DYNAPDSGNGDVVPFD-------YDYIEVVE------------------TDLSDIPVDWGVPDVS
        +++     +N V  ++G D M+M +   ++G    D+   D  +GD    D        +++ ++E                  +D  +   DW   +  
Subjt:  MLED-RGAHNPVNALIGMD-MQMYESRRIVG----DYNAPDSGNGDVVPFD-------YDYIEVVE------------------TDLSDIPVDWGVPDVS

Query:  SSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETLSFESKSDGSSRR-STL
         S  HP++FAK + +V + +    M  PS G++I G L     ++ S I ++L     +   N + + L           G+    +S  D  +R     
Subjt:  SSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETLSFESKSDGSSRR-STL

Query:  YRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMR
        Y+LE++RI+L +  G Q+E+ ++D + A+PD + H++AEI+ R  E G +   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++ + RF F  R
Subjt:  YRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMR

Query:  ATSEVAAEKQIQQLLFPRSRR
        ATSE  AE QI++LLFP++ +
Subjt:  ATSEVAAEKQIQQLLFPRSRR

Arabidopsis top hitse value%identityAlignment
AT3G49140.1 Pentatricopeptide repeat (PPR) superfamily protein9.4e-4728.5Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC--KRARDTELTAAEVARTTVEVNSNALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMN
        ++    A+Y DS  D         YHP E+++    +   D+ L+ AE  RT +EVN+   L+   ++    HE + W +  Y+ D  G+LYF++ +  +
Subjt:  SKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC--KRARDTELTAAEVARTTVEVNSNALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMN

Query:  MLED-RGAHNPVNALIGMD-MQMYESRRIVG----DYNAPDSGNGDVVPFD-------YDYIEVVE------------------TDLSDIPVDWGVPDVS
        +++     +N V  ++G D M+M +   ++G    D+   D  +GD    D        +++ ++E                  +D  +   DW   +  
Subjt:  MLED-RGAHNPVNALIGMD-MQMYESRRIVG----DYNAPDSGNGDVVPFD-------YDYIEVVE------------------TDLSDIPVDWGVPDVS

Query:  SSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETLSFESKSDGSSRR-STL
         S  HP++FAK + +V + +    M  PS G++I G L     ++ S I ++L     +   N + + L           G+    +S  D  +R     
Subjt:  SSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETLSFESKSDGSSRR-STL

Query:  YRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMR
        Y+LE++RI+L +  G Q+E+ ++D + A+PD + H++AEI+ R  E G +   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++ + RF F  R
Subjt:  YRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMR

Query:  ATSEVAAEKQIQQLLFPRSRR
        ATSE  AE QI++LLFP++ +
Subjt:  ATSEVAAEKQIQQLLFPRSRR

AT3G59300.1 Pentatricopeptide repeat (PPR) superfamily protein4.7e-13960.24Show/hide
Query:  SWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNSNALLLFPATVHS
        S N S    R + P FGS   H  S G DL L+KVSVAADY DSVPDSS Y    GYHPLEDLK  KR ++T+L+A+EVARTTVE NS+A+L+FP  +H 
Subjt:  SWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNSNALLLFPATVHS

Query:  EPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDIPVDWGVPDVSS
        EPH+  SW EF+YVIDDYGD++FEI D  N+LED GA NPV A  GMD+  YE+ R   +YN  D GN D + FD  Y E+++++  DIP+DWG+PD S+
Subjt:  EPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDIPVDWGVPDVSS

Query:  SLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLYRLEIMRIELFSV
          VHP+YFAK L+K I+M+YD+KM +PSNGVSILGCLRPA+ DEESYIRRLF  E+ + Y+ E +G +    S  S+ D +   S+LYRLEI+ IEL S+
Subjt:  SLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLYRLEIMRIELFSV

Query:  YGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRATSEVAAEKQIQQL
        YG +S ISLQDFQDAEPDILVHST+ I+ERF+ +GI  +IALKALCKK+GLH E+A LI VDSLGMDVRV  G +V+T RFPFK RAT+E+AAEK+I QL
Subjt:  YGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRATSEVAAEKQIQQL

Query:  LFPRSRRKKLRSHGDGFRDS
        LFPRSRR+KL+ H +  +D+
Subjt:  LFPRSRRKKLRSHGDGFRDS

AT5G24060.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-4528.9Show/hide
Query:  SKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC---KRARDTELTAAEVARTTVEVNSNALLLFPATVHSEPHEQVSWDEFQYVIDDYGDL
        S G+ L  ++    A+Y  S  D         YHP ED++     K   D+ L+  E ART +EVN    L+    +    HE + W +  YV D +G++
Subjt:  SKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC---KRARDTELTAAEVARTTVEVNSNALLLFPATVHSEPHEQVSWDEFQYVIDDYGDL

Query:  YFEI-----------------------FDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVE---------TDLSDI
        YF++                       FD+M M++D    +P     G++ ++ +    V D N  D   G+    D +++ V+E         +D  + 
Subjt:  YFEI-----------------------FDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVE---------TDLSDI

Query:  PVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETLSFESKSDG
          DW   + +    HP+YFA+ + +V + +    M  PS G++I G L P   ++ S I++             +E E     ++G+ GE  S     + 
Subjt:  PVDWGVPDVSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETLSFESKSDG

Query:  SSRRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTF
        S      Y+LEI+RI+L +  G Q+E+ ++D + A+PD++  ++  I+ R  E G +   AL++LC +  G+  E+  LIG+DSLG D+R+C G ++ T 
Subjt:  SSRRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTF

Query:  RFPFKMRATSEVAAEKQIQQLLFPRSRRK
        RF F +RATSE  AE Q+++LLF  +  K
Subjt:  RFPFKMRATSEVAAEKQIQQLLFPRSRRK

AT5G24060.2 Pentatricopeptide repeat (PPR) superfamily protein2.0e-4428.74Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC---KRARDTELTAAEVARTTVEVNSNALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEI----
        ++    A+Y  S  D         YHP ED++     K   D+ L+  E ART +EVN    L+    +    HE + W +  YV D +G++YF++    
Subjt:  SKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC---KRARDTELTAAEVARTTVEVNSNALLLFPATVHSEPHEQVSWDEFQYVIDDYGDLYFEI----

Query:  -------------------FDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVE---------TDLSDIPVDWGVPD
                           FD+M M++D    +P     G++ ++ +    V D N  D   G+    D +++ V+E         +D  +   DW   +
Subjt:  -------------------FDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVE---------TDLSDIPVDWGVPD

Query:  VSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY
         +    HP+YFA+ + +V + +    M  PS G++I G L P   ++ S I++             +E E     ++G+ GE  S     + S      Y
Subjt:  VSSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLY

Query:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRA
        +LEI+RI+L +  G Q+E+ ++D + A+PD++  ++  I+ R  E G +   AL++LC +  G+  E+  LIG+DSLG D+R+C G ++ T RF F +RA
Subjt:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDILVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRA

Query:  TSEVAAEKQIQQLLFPRSRRK
        TSE  AE Q+++LLF  +  K
Subjt:  TSEVAAEKQIQQLLFPRSRRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AAGGGGCTTGTTGCTCGACATCATATGCATTCACAAGCAGTTGGAATAGATCTTCTTTTGATGTTCGTGGCAGAAATCCAATATTTGGATCAACAGAATTACATTGGTTG
TCTAAGGGACGTGACCTTTGCTTGTCAAAAGTTTCAGTTGCTGCAGATTACCCAGATTCAGTTCCAGATTCATCAAGTTACTTGACTAACCAAGGTTATCATCCTCTTGA
AGATCTAAAAGTTTGCAAAAGAGCACGGGACACTGAACTCACTGCTGCAGAAGTAGCAAGGACAACTGTGGAGGTCAATAGCAATGCTTTGCTGTTATTTCCTGCAACTG
TGCACAGTGAACCACATGAACAAGTATCTTGGGATGAGTTTCAATATGTTATTGATGATTATGGAGATTTGTATTTTGAAATTTTTGACAGTATGAACATGTTGGAAGAT
CGTGGAGCACACAATCCTGTGAATGCTTTGATTGGAATGGACATGCAAATGTATGAGAGTAGGAGGATAGTTGGAGATTATAATGCGCCAGATAGTGGCAATGGTGATGT
TGTTCCTTTTGATTATGACTATATTGAGGTAGTGGAAACTGATTTGTCCGATATTCCAGTTGACTGGGGAGTTCCAGATGTTTCTTCTAGCTTGGTTCATCCTGTATATT
TTGCCAAGTGCTTGAATAAGGTTATCAATATGGAATATGACAAAAAGATGAAGCATCCTTCAAATGGAGTTTCCATTTTGGGATGTCTCAGGCCTGCATATGCTGATGAA
GAATCTTATATAAGAAGATTATTTTACTTTGAAGAAAGTGAAGGCTACAACACAGAATGGAAAGGTTTAGAAGGTGAAACCTTGAGCTTCGAGTCCAAAAGTGATGGAAG
CAGCCGAAGATCAACTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCTTCTCTGTGTATGGAGTTCAGTCTGAAATTAGTTTGCAAGATTTTCAAGATGCTGAACCTG
ATATTCTTGTGCACTCTACTGCGGAAATTGTAGAGCGTTTTAGTGAGAAGGGTATTAGGTGCAATATTGCTCTTAAAGCTCTATGCAAAAAGAGGGGACTTCATGTTGAG
GACGCTATTTTGATCGGAGTTGATAGTCTTGGCATGGACGTGAGGGTATGTTTTGGGACAGAAGTACGGACTTTTCGATTTCCCTTTAAAATGCGAGCAACGTCTGAAGT
TGCAGCAGAGAAGCAGATTCAGCAACTCTTGTTCCCTCGATCTCGTCGTAAGAAATTACGAAGCCATGGGGATGGATTTAGAGATAGTAGCCTAAGATTTAACTTGGGAG
TGATGGCAACAAGCAATGGAAATGGAAATGGACAACAAATTGGTGGTAGTGGAGCTAGGCAAGAGTCTGGTGAAATGGAGCTAGATGCTCAAACTTTGTGGGCAAGAAAT
TTTGACTTGGAGCAAATGATGGAAGCTAGAGTTCAATTTTGTAGAGGCCAGCTGAAATTGTGCAATTGCTAA
mRNA sequenceShow/hide mRNA sequence
AAGGGGCTTGTTGCTCGACATCATATGCATTCACAAGCAGTTGGAATAGATCTTCTTTTGATGTTCGTGGCAGAAATCCAATATTTGGATCAACAGAATTACATTGGTTG
TCTAAGGGACGTGACCTTTGCTTGTCAAAAGTTTCAGTTGCTGCAGATTACCCAGATTCAGTTCCAGATTCATCAAGTTACTTGACTAACCAAGGTTATCATCCTCTTGA
AGATCTAAAAGTTTGCAAAAGAGCACGGGACACTGAACTCACTGCTGCAGAAGTAGCAAGGACAACTGTGGAGGTCAATAGCAATGCTTTGCTGTTATTTCCTGCAACTG
TGCACAGTGAACCACATGAACAAGTATCTTGGGATGAGTTTCAATATGTTATTGATGATTATGGAGATTTGTATTTTGAAATTTTTGACAGTATGAACATGTTGGAAGAT
CGTGGAGCACACAATCCTGTGAATGCTTTGATTGGAATGGACATGCAAATGTATGAGAGTAGGAGGATAGTTGGAGATTATAATGCGCCAGATAGTGGCAATGGTGATGT
TGTTCCTTTTGATTATGACTATATTGAGGTAGTGGAAACTGATTTGTCCGATATTCCAGTTGACTGGGGAGTTCCAGATGTTTCTTCTAGCTTGGTTCATCCTGTATATT
TTGCCAAGTGCTTGAATAAGGTTATCAATATGGAATATGACAAAAAGATGAAGCATCCTTCAAATGGAGTTTCCATTTTGGGATGTCTCAGGCCTGCATATGCTGATGAA
GAATCTTATATAAGAAGATTATTTTACTTTGAAGAAAGTGAAGGCTACAACACAGAATGGAAAGGTTTAGAAGGTGAAACCTTGAGCTTCGAGTCCAAAAGTGATGGAAG
CAGCCGAAGATCAACTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCTTCTCTGTGTATGGAGTTCAGTCTGAAATTAGTTTGCAAGATTTTCAAGATGCTGAACCTG
ATATTCTTGTGCACTCTACTGCGGAAATTGTAGAGCGTTTTAGTGAGAAGGGTATTAGGTGCAATATTGCTCTTAAAGCTCTATGCAAAAAGAGGGGACTTCATGTTGAG
GACGCTATTTTGATCGGAGTTGATAGTCTTGGCATGGACGTGAGGGTATGTTTTGGGACAGAAGTACGGACTTTTCGATTTCCCTTTAAAATGCGAGCAACGTCTGAAGT
TGCAGCAGAGAAGCAGATTCAGCAACTCTTGTTCCCTCGATCTCGTCGTAAGAAATTACGAAGCCATGGGGATGGATTTAGAGATAGTAGCCTAAGATTTAACTTGGGAG
TGATGGCAACAAGCAATGGAAATGGAAATGGACAACAAATTGGTGGTAGTGGAGCTAGGCAAGAGTCTGGTGAAATGGAGCTAGATGCTCAAACTTTGTGGGCAAGAAAT
TTTGACTTGGAGCAAATGATGGAAGCTAGAGTTCAATTTTGTAGAGGCCAGCTGAAATTGTGCAATTGCTAA
Protein sequenceShow/hide protein sequence
GACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTTVEVNSNALLLFPATV
HSEPHEQVSWDEFQYVIDDYGDLYFEIFDSMNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDIPVDWGVPDVSSSLVHPVYF
AKCLNKVINMEYDKKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDGSSRRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPD
ILVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKMRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSSLRFNLGV
MATSNGNGNGQQIGGSGARQESGEMELDAQTLWARNFDLEQMMEARVQFCRGQLKLCNC