; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G03680 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G03680
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationChr4:2238293..2245323
RNA-Seq ExpressionCSPI04G03680
SyntenyCSPI04G03680
Gene Ontology termsNA
InterPro domainsIPR037119 - Haem oxygenase HugZ-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152092.1 uncharacterized protein At3g49140 isoform X2 [Cucumis sativus]1.0e-25699.78Show/hide
Query:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
        MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
Subjt:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRR VGDYSDVDSGYGDVAPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDY

Query:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
        DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
Subjt:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
        IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR

Query:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
Subjt:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

XP_008453943.1 PREDICTED: uncharacterized protein At3g49140 isoform X1 [Cucumis melo]2.0e-23993.05Show/hide
Query:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
        MA+AVASSLTFEGA CS SYAFTS WNRSSFDVCGRNKKFGSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCK  RNTELTA
Subjt:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE QYVTDDYGDLYFEIFDSVNMLEDR AHNPVNALIGMDMQMYESRR +GDYS VDSGYGDVAPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDY

Query:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
        DYIE VEADLANIPVDWGVPDVSS+VHPVYFAKCL KV+N+EYDRNMKHPSNGV+ILG LRP YADEESY+RRLF FEESEGYNTEWKGLEGETSNLE K
Subjt:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
        IDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQDFQDAEPDILLHST +ILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLG+DVRVC GTEVR
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR

Query:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        TFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLRS+GDGLRDTVSF
Subjt:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

XP_031740660.1 uncharacterized protein At3g49140 isoform X1 [Cucumis sativus]3.8e-25199.54Show/hide
Query:  EGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVN
        +GAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVN
Subjt:  EGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVN

Query:  SNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDYDYIEVVEADLA
        SNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRR VGDYSDVDSGYGDVAPFDYDYIEVVEADLA
Subjt:  SNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDYDYIEVVEADLA

Query:  NIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLY
        NIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLY
Subjt:  NIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLY

Query:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT
        RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT
Subjt:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT

Query:  SEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        SEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
Subjt:  SEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

XP_038898170.1 uncharacterized protein At3g49140 isoform X1 [Benincasa hispida]2.2e-23590.95Show/hide
Query:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
        M IAVAS+LTFEGA CS SYAFTSSWNRSSFDV GRNK+FGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSS+LTNKGYHPLEDLKVCK  RNTELTA
Subjt:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSW+EFQYV DDYGDLYFEIFDSVNMLEDR AHNPVNALIGMDMQMYESRRTVGDYS  DSGYGDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDY

Query:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWK-------GLEGE
        DYIEVVE DLA+IPVDWG PD SS+VHPVYFAKCL KVINMEYDR M HPSNGVSILGCLRPAYADEESY+RRLF+FEESEGYNTEWK       GLEGE
Subjt:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWK-------GLEGE

Query:  TSNLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRV
        T +LESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQ AEPDILLHSTAEI+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRV
Subjt:  TSNLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRV

Query:  CVGTEVRTFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        C GTEV+TFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
Subjt:  CVGTEVRTFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

XP_038898179.1 uncharacterized protein At3g49140 isoform X2 [Benincasa hispida]1.8e-23792.38Show/hide
Query:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
        M IAVAS+LTFEGA CS SYAFTSSWNRSSFDV GRNK+FGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSS+LTNKGYHPLEDLKVCK  RNTELTA
Subjt:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSW+EFQYV DDYGDLYFEIFDSVNMLEDR AHNPVNALIGMDMQMYESRRTVGDYS  DSGYGDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDY

Query:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
        DYIEVVE DLA+IPVDWG PD SS+VHPVYFAKCL KVINMEYDR M HPSNGVSILGCLRPAYADEESY+RRLF+FEESEGYNTEWKGLEGET +LESK
Subjt:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
        IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQ AEPDILLHSTAEI+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEV+
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR

Query:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        TFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
Subjt:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

TrEMBL top hitse value%identityAlignment
A0A0A0KW72 Uncharacterized protein5.0e-25799.78Show/hide
Query:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
        MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
Subjt:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRR VGDYSDVDSGYGDVAPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDY

Query:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
        DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
Subjt:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
        IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR

Query:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
Subjt:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

A0A1S3BY92 uncharacterized protein At3g49140 isoform X19.5e-24093.05Show/hide
Query:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
        MA+AVASSLTFEGA CS SYAFTS WNRSSFDVCGRNKKFGSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCK  RNTELTA
Subjt:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE QYVTDDYGDLYFEIFDSVNMLEDR AHNPVNALIGMDMQMYESRR +GDYS VDSGYGDVAPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDY

Query:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
        DYIE VEADLANIPVDWGVPDVSS+VHPVYFAKCL KV+N+EYDRNMKHPSNGV+ILG LRP YADEESY+RRLF FEESEGYNTEWKGLEGETSNLE K
Subjt:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
        IDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQDFQDAEPDILLHST +ILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLG+DVRVC GTEVR
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR

Query:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        TFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLRS+GDGLRDTVSF
Subjt:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

A0A1S3BYP0 uncharacterized protein At3g49140 isoform X29.2e-23593.1Show/hide
Query:  EGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVN
        EGA CS SYAFTS WNRSSFDVCGRNKKFGSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCK  RNTELTAAEVARTAVEVN
Subjt:  EGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVN

Query:  SNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDYDYIEVVEADLA
        SNALLLFPGTVHSEPHEQVSWDE QYVTDDYGDLYFEIFDSVNMLEDR AHNPVNALIGMDMQMYESRR +GDYS VDSGYGDVAPFDYDYIE VEADLA
Subjt:  SNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDYDYIEVVEADLA

Query:  NIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLY
        NIPVDWGVPDVSS+VHPVYFAKCL KV+N+EYDRNMKHPSNGV+ILG LRP YADEESY+RRLF FEESEGYNTEWKGLEGETSNLE KIDRSSQRSTLY
Subjt:  NIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLY

Query:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT
        RLEI+RIELFSVYGVQSEVSLQDFQDAEPDILLHST +ILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLG+DVRVC GTEVRTFRFPFKIRAT
Subjt:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT

Query:  SEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        SE AAEKQIQQLLFPRSRRKKLRS+GDGLRDTVSF
Subjt:  SEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

A0A5A7TTC0 Pentatricopeptide repeat (PPR) superfamily protein isoform 24.7e-22393.45Show/hide
Query:  GRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
        GRNKKFGSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCK  RNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
Subjt:  GRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDYDYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKC
         QYVT+DYGDLYFEIFDSVNMLEDR AHNPVNALIGMDMQMYESRR +GDYS VDSGYGDVAPFDYDYIE VEADLANIPVDWGVPDVSS+VHPVYFAKC
Subjt:  FQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDYDYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKC

Query:  LKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD
        L KV+N+EYDRNMKHPSNGV+ILGCLRP YADEESY+RRLF FEESEGYNTEWKGLEGETSNLE KIDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQD
Subjt:  LKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD

Query:  FQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLR
        FQDAEPDILLHST +ILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLG+DVRVC GTEVRTFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLR
Subjt:  FQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGLRDTVSF
        S+GDGLRDTVSF
Subjt:  SHGDGLRDTVSF

A0A6J1C800 uncharacterized protein At3g49140 isoform X14.1e-22788.57Show/hide
Query:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
        MAIAVASSLTFEGA CS SYAFTSSWNR S DV GRN  FGSTE HWLSKGRDL LSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKVCK  R+TELTA
Subjt:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYV DDYGDLYFEIFD+VNMLEDR AHNPVNALIGMDMQMYESRR VGDY+  DSG GDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDY

Query:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
        DYIEVVE DL++IPVDWGVPDVSS+VHPVYFAKCL KVINMEYD+ MKHPSNGVSILGCLRPA+ADEESYIRRLFYFE SEGY TEWKGL+GE  + ESK
Subjt:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
         D+SSQRSTLYRLEIMRIELFSVYGVQ+E+SLQDFQ+AEPDIL+HSTAEI+E F+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEVR
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR

Query:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        TFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLRSHGDG RD+VSF
Subjt:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

SwissProt top hitse value%identityAlignment
Q0WMN5 Uncharacterized protein At3g491403.1e-4629.05Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC--KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVN
        ++    A+Y DS  D         YHP E+++    ++  ++ L+ AE  RT +EVN+   L+  G++    HE + W +  Y+TD  G+LYF++ +  +
Subjt:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC--KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVN

Query:  MLED-RRAHNPVNALIGMD-MQMYESRRTVG----DYSDVDSGYGDVAPFD-------YDYIEVVE------------------ADLANIPVDWGVPDVS
        +++     +N V  ++G D M+M +    +G    D+   D   GD    D        +++ ++E                  +D      DW   +  
Subjt:  MLED-RRAHNPVNALIGMD-MQMYESRRTVG----DYSDVDSGYGDVAPFD-------YDYIEVVE------------------ADLANIPVDWGVPDVS

Query:  SMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETSNLESKIDRSSQR-STLY
           HP++FAK + +V + +    M  PS G++I G L     ++ S I ++L     +   N + + L           G+ S ++S  D  ++     Y
Subjt:  SMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETSNLESKIDRSSQR-STLY

Query:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRA
        +LE++RI+L +  G Q+EV ++D + A+PD + H++AEI+ R  E G K   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++ + RF F  RA
Subjt:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRA

Query:  TSEAAAEKQIQQLLFPRSRR
        TSE  AE QI++LLFP++ +
Subjt:  TSEAAAEKQIQQLLFPRSRR

Arabidopsis top hitse value%identityAlignment
AT3G49140.1 Pentatricopeptide repeat (PPR) superfamily protein2.2e-4729.05Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC--KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVN
        ++    A+Y DS  D         YHP E+++    ++  ++ L+ AE  RT +EVN+   L+  G++    HE + W +  Y+TD  G+LYF++ +  +
Subjt:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC--KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVN

Query:  MLED-RRAHNPVNALIGMD-MQMYESRRTVG----DYSDVDSGYGDVAPFD-------YDYIEVVE------------------ADLANIPVDWGVPDVS
        +++     +N V  ++G D M+M +    +G    D+   D   GD    D        +++ ++E                  +D      DW   +  
Subjt:  MLED-RRAHNPVNALIGMD-MQMYESRRTVG----DYSDVDSGYGDVAPFD-------YDYIEVVE------------------ADLANIPVDWGVPDVS

Query:  SMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETSNLESKIDRSSQR-STLY
           HP++FAK + +V + +    M  PS G++I G L     ++ S I ++L     +   N + + L           G+ S ++S  D  ++     Y
Subjt:  SMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETSNLESKIDRSSQR-STLY

Query:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRA
        +LE++RI+L +  G Q+EV ++D + A+PD + H++AEI+ R  E G K   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++ + RF F  RA
Subjt:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRA

Query:  TSEAAAEKQIQQLLFPRSRR
        TSE  AE QI++LLFP++ +
Subjt:  TSEAAAEKQIQQLLFPRSRR

AT3G59300.1 Pentatricopeptide repeat (PPR) superfamily protein1.9e-13656.06Show/hide
Query:  MAIAVASSLTFEGAPCSKSYA--FTSS--WNRSS------FDVCGRNK----------KFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKG
        M IA ASS +   + C +SY   F+SS  + R+S      FD CG              F  + FH  S G DL L+KVSVAADY DSVPDSS Y    G
Subjt:  MAIAVASSLTFEGAPCSKSYA--FTSS--WNRSS------FDVCGRNK----------KFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKG

Query:  YHPLEDLKVCKSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRR
        YHPLEDLK  K V+ T+L+A+EVART VE NS+A+L+FPG +H EPH+  SW EF+YV DDYGD++FEI D  N+LED  A NPV A  GMD+  YE+ R
Subjt:  YHPLEDLKVCKSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRR

Query:  TVGDYSDVDSGYGDVAPFDYDYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEES
           +Y+  D G  D   FD  Y E+++++  +IP+DWG+PD S+ VHP+YFAK L K I+M+YDR M +PSNGVSILGCLRPA+ DEESYIRRLF  E+ 
Subjt:  TVGDYSDVDSGYGDVAPFDYDYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEES

Query:  EGYNTEWKGLEGETSNLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAI
        + Y+ E +G +   ++  S+ D +   S+LYRLEI+ IEL S+YG +S +SLQDFQDAEPDIL+HST+ I+ERFN +GI  +IALKALCKK+GLH E+A 
Subjt:  EGYNTEWKGLEGETSNLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAI

Query:  LIGVDSLGMDVRVCVGTEVRTFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRD
        LI VDSLGMDVRV  G +V+T RFPFK RAT+E AAEK+I QLLFPRSRR+KL+ H + L+D
Subjt:  LIGVDSLGMDVRVCVGTEVRTFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRD

AT5G24060.1 Pentatricopeptide repeat (PPR) superfamily protein7.6e-4829.44Show/hide
Query:  SKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDL
        S G+ L  ++    A+Y  S  D         YHP ED++     K+  ++ L+  E ART +EVN    L+  G +    HE + W +  YVTD +G++
Subjt:  SKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDL

Query:  YFEI-----------------------FDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDYDYIEVVE---------ADLANI
        YF++                       FD++ M++D    +P     G++ ++ +    V D +  D   G+    D +++ V+E         +D    
Subjt:  YFEI-----------------------FDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDYDYIEVVE---------ADLANI

Query:  PVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETSNLESKIDRS
          DW   +     HP+YFA+ + +V + +    M  PS G++I G L P   ++ S I++             +E E     ++G+ GE  +    ++ S
Subjt:  PVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETSNLESKIDRS

Query:  SQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFR
              Y+LEI+RI+L +  G Q+EV ++D + A+PD++  ++  IL R  E G K   AL++LC +  G+  E+  LIG+DSLG D+R+C G ++ T R
Subjt:  SQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFR

Query:  FPFKIRATSEAAAEKQIQQLLFPRSRRK
        F F IRATSE  AE Q+++LLF  +  K
Subjt:  FPFKIRATSEAAAEKQIQQLLFPRSRRK

AT5G24060.2 Pentatricopeptide repeat (PPR) superfamily protein8.4e-4729.29Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEI----
        ++    A+Y  S  D         YHP ED++     K+  ++ L+  E ART +EVN    L+  G +    HE + W +  YVTD +G++YF++    
Subjt:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEI----

Query:  -------------------FDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDYDYIEVVE---------ADLANIPVDWGVPD
                           FD++ M++D    +P     G++ ++ +    V D +  D   G+    D +++ V+E         +D      DW   +
Subjt:  -------------------FDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDYDYIEVVE---------ADLANIPVDWGVPD

Query:  VSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLYR
             HP+YFA+ + +V + +    M  PS G++I G L P   ++ S I++             +E E     ++G+ GE  +    ++ S      Y+
Subjt:  VSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLYR

Query:  LEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT
        LEI+RI+L +  G Q+EV ++D + A+PD++  ++  IL R  E G K   AL++LC +  G+  E+  LIG+DSLG D+R+C G ++ T RF F IRAT
Subjt:  LEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT

Query:  SEAAAEKQIQQLLFPRSRRK
        SE  AE Q+++LLF  +  K
Subjt:  SEAAAEKQIQQLLFPRSRRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATTGCTGTAGCTTCTTCACTTACCTTTGAAGGGGCTCCTTGCTCGAAATCGTATGCATTCACAAGCAGTTGGAATAGATCTTCTTTTGACGTTTGTGGCAGAAA
TAAAAAATTTGGATCAACAGAATTTCATTGGTTGTCTAAGGGACGTGACCTTTGCTTGTCAAAAGTTTCAGTCGCTGCTGATTACCCAGATTCAGTTCCAGATTCTTCAA
GTTATTTAACTAACAAAGGTTATCATCCCCTTGAAGATCTAAAAGTTTGCAAAAGTGTACGGAATACAGAACTCACTGCTGCAGAAGTAGCAAGGACTGCCGTGGAGGTC
AATAGCAATGCTCTGCTGTTATTTCCTGGAACTGTGCACAGTGAACCACATGAACAAGTATCGTGGGATGAGTTTCAATATGTCACTGACGATTATGGAGATTTGTATTT
TGAAATTTTTGATAGTGTGAACATGTTAGAAGATCGGCGAGCACACAACCCTGTGAATGCCTTGATTGGAATGGACATGCAAATGTATGAGAGTAGGAGGACAGTTGGAG
ATTATAGTGATGTAGATAGTGGCTATGGTGATGTTGCTCCTTTTGATTATGATTATATTGAGGTAGTGGAAGCTGATTTAGCCAATATTCCAGTTGACTGGGGAGTTCCT
GATGTTTCTAGCATGGTTCATCCTGTATATTTTGCTAAGTGCTTGAAAAAGGTTATCAATATGGAATATGACAGAAACATGAAGCATCCTTCCAATGGGGTTTCCATATT
GGGATGTCTCAGACCTGCATATGCTGATGAAGAATCTTATATAAGAAGATTATTTTACTTCGAAGAAAGTGAAGGCTACAACACAGAATGGAAAGGTTTAGAAGGCGAAA
CCTCAAACTTGGAGTCCAAAATTGATAGAAGCAGTCAAAGATCCACTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCTTCTCTGTGTATGGAGTTCAGTCTGAAGTT
AGTTTGCAAGATTTTCAAGATGCTGAACCTGATATTCTTTTGCACTCTACTGCGGAAATTCTAGAGCGTTTTAATGAGAAGGGTATTAAGTGCAATATTGCCCTTAAAGC
TCTTTGCAAAAAGAGGGGTCTTCATGTTGAGGATGCTATTTTGATCGGAGTTGATAGTCTTGGCATGGATGTGAGAGTATGTGTTGGGACGGAAGTACGGACTTTTCGAT
TTCCCTTTAAAATCAGGGCAACGTCAGAAGCTGCAGCAGAGAAGCAGATTCAGCAACTCTTGTTCCCACGATCTCGTCGTAAAAAACTACGAAGCCATGGGGATGGATTG
AGAGATACTGTCAGTTTTTAG
mRNA sequenceShow/hide mRNA sequence
TCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCATGGCAATTGCTGTAGCTTCTTCACTTACCTTTGAAGGGGCTCCTTGCTCGAAATCGTATGCATTC
ACAAGCAGTTGGAATAGATCTTCTTTTGACGTTTGTGGCAGAAATAAAAAATTTGGATCAACAGAATTTCATTGGTTGTCTAAGGGACGTGACCTTTGCTTGTCAAAAGT
TTCAGTCGCTGCTGATTACCCAGATTCAGTTCCAGATTCTTCAAGTTATTTAACTAACAAAGGTTATCATCCCCTTGAAGATCTAAAAGTTTGCAAAAGTGTACGGAATA
CAGAACTCACTGCTGCAGAAGTAGCAAGGACTGCCGTGGAGGTCAATAGCAATGCTCTGCTGTTATTTCCTGGAACTGTGCACAGTGAACCACATGAACAAGTATCGTGG
GATGAGTTTCAATATGTCACTGACGATTATGGAGATTTGTATTTTGAAATTTTTGATAGTGTGAACATGTTAGAAGATCGGCGAGCACACAACCCTGTGAATGCCTTGAT
TGGAATGGACATGCAAATGTATGAGAGTAGGAGGACAGTTGGAGATTATAGTGATGTAGATAGTGGCTATGGTGATGTTGCTCCTTTTGATTATGATTATATTGAGGTAG
TGGAAGCTGATTTAGCCAATATTCCAGTTGACTGGGGAGTTCCTGATGTTTCTAGCATGGTTCATCCTGTATATTTTGCTAAGTGCTTGAAAAAGGTTATCAATATGGAA
TATGACAGAAACATGAAGCATCCTTCCAATGGGGTTTCCATATTGGGATGTCTCAGACCTGCATATGCTGATGAAGAATCTTATATAAGAAGATTATTTTACTTCGAAGA
AAGTGAAGGCTACAACACAGAATGGAAAGGTTTAGAAGGCGAAACCTCAAACTTGGAGTCCAAAATTGATAGAAGCAGTCAAAGATCCACTCTCTACAGGTTGGAGATAA
TGAGAATTGAGCTCTTCTCTGTGTATGGAGTTCAGTCTGAAGTTAGTTTGCAAGATTTTCAAGATGCTGAACCTGATATTCTTTTGCACTCTACTGCGGAAATTCTAGAG
CGTTTTAATGAGAAGGGTATTAAGTGCAATATTGCCCTTAAAGCTCTTTGCAAAAAGAGGGGTCTTCATGTTGAGGATGCTATTTTGATCGGAGTTGATAGTCTTGGCAT
GGATGTGAGAGTATGTGTTGGGACGGAAGTACGGACTTTTCGATTTCCCTTTAAAATCAGGGCAACGTCAGAAGCTGCAGCAGAGAAGCAGATTCAGCAACTCTTGTTCC
CACGATCTCGTCGTAAAAAACTACGAAGCCATGGGGATGGATTGAGAGATACTGTCAGTTTTTAGAATGCCCTCTGCGTTATTTTGACATTTGGTGGTTCCTAGAAATAA
ATTTCAAAGCTAAGGCGACTTCTTTTACTTAAATCCGTCATAGCATTAGTTTTTGTTTGAGAATTGTACGGTTCAAGTTGATAACTTGCCTCGTTTTTTATTGGATTCCT
TTGTCTGCTCTTATTTTTTCTTGGTATTACCCTCTTGACTTTCATATCCGAATTCCCTTGGTGTCACTGCAAGTTTTTTTTTTCCAATGCTGACGCAAAAATGTGTCAAT
ATATGTAACAAAACATGTCGACAGAGACTATGCCGACGCATGCATGGGTAAACACCATCACCAGAAGACAGTCAGCATACCTTCCGTGCATGCCATACATGCCAAAGTCT
TCGGTTGTTGGGAGAGAAGGTCGGCACACTGAAATTCTTGCAGTGTGTTTTCCTCCACGGGCACCTCTTGTAGATTACTTGATTAGTAGACTGCTAAAAGTAGTTGAAAA
AAAGAAGAATAAAAGCAAGCAGGGTCACATAATCGATTTCCGGAGATGAAAATGAAATGTCACCGTGTGTGATTTTGACTATGAGGACACATCCTTTTGCTGGATTTGTA
AGACTAGATATTTGAGACAATTGATAGCATGTCGTAAAATTGAGTAGCACAACAATGCATAAAGAACACACTGGTATTGAGAAAATATGAAATTGCTCTATAAAAAACTG
TATTATCTCCTTGGAG
Protein sequenceShow/hide protein sequence
MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEV
NSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRTVGDYSDVDSGYGDVAPFDYDYIEVVEADLANIPVDWGVP
DVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEV
SLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGL
RDTVSF