; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy4G003570 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy4G003570
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationGy14Chr4:2216523..2223180
RNA-Seq ExpressionCsGy4G003570
SyntenyCsGy4G003570
Gene Ontology termsNA
InterPro domainsIPR037119 - Haem oxygenase HugZ-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152092.1 uncharacterized protein At3g49140 isoform X2 [Cucumis sativus]0.0100Show/hide
Query:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
        MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
Subjt:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDY

Query:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
        DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
Subjt:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
        IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR

Query:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
Subjt:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

XP_008453943.1 PREDICTED: uncharacterized protein At3g49140 isoform X1 [Cucumis melo]2.54e-30593.27Show/hide
Query:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
        MA+AVASSLTFEGA CS SYAFTS WNRSSFDVCGRNKKFGSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCK  RNTELTA
Subjt:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE QYVTDDYGDLYFEIFDSVNMLEDR AHNPVNALIGMDMQMYESRRI+GDYS VDSGYGDVAPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDY

Query:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
        DYIE VEADLANIPVDWGVPDVSS+VHPVYFAKCL KV+N+EYDRNMKHPSNGV+ILG LRP YADEESY+RRLF FEESEGYNTEWKGLEGETSNLE K
Subjt:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
        IDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQDFQDAEPDILLHST +ILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLG+DVRVC GTEVR
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR

Query:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        TFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLRS+GDGLRDTVSF
Subjt:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

XP_008453945.1 PREDICTED: uncharacterized protein At3g49140 isoform X2 [Cucumis melo]4.89e-29993.33Show/hide
Query:  EGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVN
        EGA CS SYAFTS WNRSSFDVCGRNKKFGSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCK  RNTELTAAEVARTAVEVN
Subjt:  EGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVN

Query:  SNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVEADLA
        SNALLLFPGTVHSEPHEQVSWDE QYVTDDYGDLYFEIFDSVNMLEDR AHNPVNALIGMDMQMYESRRI+GDYS VDSGYGDVAPFDYDYIE VEADLA
Subjt:  SNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVEADLA

Query:  NIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLY
        NIPVDWGVPDVSS+VHPVYFAKCL KV+N+EYDRNMKHPSNGV+ILG LRP YADEESY+RRLF FEESEGYNTEWKGLEGETSNLE KIDRSSQRSTLY
Subjt:  NIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLY

Query:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT
        RLEI+RIELFSVYGVQSEVSLQDFQDAEPDILLHST +ILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLG+DVRVC GTEVRTFRFPFKIRAT
Subjt:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT

Query:  SEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        SE AAEKQIQQLLFPRSRRKKLRS+GDGLRDTVSF
Subjt:  SEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

XP_031740660.1 uncharacterized protein At3g49140 isoform X1 [Cucumis sativus]0.099.77Show/hide
Query:  EGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVN
        +GAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVN
Subjt:  EGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVN

Query:  SNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVEADLA
        SNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVEADLA
Subjt:  SNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVEADLA

Query:  NIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLY
        NIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLY
Subjt:  NIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLY

Query:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT
        RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT
Subjt:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT

Query:  SEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        SEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
Subjt:  SEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

XP_038898179.1 uncharacterized protein At3g49140 isoform X2 [Benincasa hispida]4.69e-30192.15Show/hide
Query:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
        M IAVAS+LTFEGA CS SYAFTSSWNRSSFDV GRNK+FGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSS+LTNKGYHPLEDLKVCK  RNTELTA
Subjt:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSW+EFQYV DDYGDLYFEIFDSVNMLEDR AHNPVNALIGMDMQMYESRR VGDYS  DSGYGDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDY

Query:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
        DYIEVVE DLA+IPVDWG PD SS+VHPVYFAKCL KVINMEYDR M HPSNGVSILGCLRPAYADEESY+RRLF+FEESEGYNTEWKGLEGET +LESK
Subjt:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
        IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQ AEPDILLHSTAEI+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEV+
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR

Query:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        TFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
Subjt:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

TrEMBL top hitse value%identityAlignment
A0A0A0KW72 Uncharacterized protein0.0100Show/hide
Query:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
        MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
Subjt:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDY

Query:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
        DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
Subjt:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
        IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR

Query:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
Subjt:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

A0A1S3BY92 uncharacterized protein At3g49140 isoform X11.23e-30593.27Show/hide
Query:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
        MA+AVASSLTFEGA CS SYAFTS WNRSSFDVCGRNKKFGSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCK  RNTELTA
Subjt:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE QYVTDDYGDLYFEIFDSVNMLEDR AHNPVNALIGMDMQMYESRRI+GDYS VDSGYGDVAPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDY

Query:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
        DYIE VEADLANIPVDWGVPDVSS+VHPVYFAKCL KV+N+EYDRNMKHPSNGV+ILG LRP YADEESY+RRLF FEESEGYNTEWKGLEGETSNLE K
Subjt:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
        IDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQDFQDAEPDILLHST +ILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLG+DVRVC GTEVR
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR

Query:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        TFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLRS+GDGLRDTVSF
Subjt:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

A0A1S3BYP0 uncharacterized protein At3g49140 isoform X22.37e-29993.33Show/hide
Query:  EGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVN
        EGA CS SYAFTS WNRSSFDVCGRNKKFGSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCK  RNTELTAAEVARTAVEVN
Subjt:  EGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVN

Query:  SNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVEADLA
        SNALLLFPGTVHSEPHEQVSWDE QYVTDDYGDLYFEIFDSVNMLEDR AHNPVNALIGMDMQMYESRRI+GDYS VDSGYGDVAPFDYDYIE VEADLA
Subjt:  SNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVEADLA

Query:  NIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLY
        NIPVDWGVPDVSS+VHPVYFAKCL KV+N+EYDRNMKHPSNGV+ILG LRP YADEESY+RRLF FEESEGYNTEWKGLEGETSNLE KIDRSSQRSTLY
Subjt:  NIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLY

Query:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT
        RLEI+RIELFSVYGVQSEVSLQDFQDAEPDILLHST +ILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLG+DVRVC GTEVRTFRFPFKIRAT
Subjt:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT

Query:  SEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        SE AAEKQIQQLLFPRSRRKKLRS+GDGLRDTVSF
Subjt:  SEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

A0A5A7TTC0 Pentatricopeptide repeat (PPR) superfamily protein isoform 21.65e-28493.69Show/hide
Query:  GRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
        GRNKKFGSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCK  RNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
Subjt:  GRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKC
         QYVT+DYGDLYFEIFDSVNMLEDR AHNPVNALIGMDMQMYESRRI+GDYS VDSGYGDVAPFDYDYIE VEADLANIPVDWGVPDVSS+VHPVYFAKC
Subjt:  FQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKC

Query:  LKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD
        L KV+N+EYDRNMKHPSNGV+ILGCLRP YADEESY+RRLF FEESEGYNTEWKGLEGETSNLE KIDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQD
Subjt:  LKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD

Query:  FQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLR
        FQDAEPDILLHST +ILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLG+DVRVC GTEVRTFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLR
Subjt:  FQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGLRDTVSF
        S+GDGLRDTVSF
Subjt:  SHGDGLRDTVSF

A0A6J1C800 uncharacterized protein At3g49140 isoform X14.98e-28988.79Show/hide
Query:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA
        MAIAVASSLTFEGA CS SYAFTSSWNR S DV GRN  FGSTE HWLSKGRDL LSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKVCK  R+TELTA
Subjt:  MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYV DDYGDLYFEIFD+VNMLEDR AHNPVNALIGMDMQMYESRRIVGDY+  DSG GDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDY

Query:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK
        DYIEVVE DL++IPVDWGVPDVSS+VHPVYFAKCL KVINMEYD+ MKHPSNGVSILGCLRPA+ADEESYIRRLFYFE SEGY TEWKGL+GE  + ESK
Subjt:  DYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESK

Query:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR
         D+SSQRSTLYRLEIMRIELFSVYGVQ+E+SLQDFQ+AEPDIL+HSTAEI+E F+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEVR
Subjt:  IDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVR

Query:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF
        TFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLRSHGDG RD+VSF
Subjt:  TFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF

SwissProt top hitse value%identityAlignment
Q0WMN5 Uncharacterized protein At3g491401.1e-4629.05Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC--KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVN
        ++    A+Y DS  D         YHP E+++    ++  ++ L+ AE  RT +EVN+   L+  G++    HE + W +  Y+TD  G+LYF++ +  +
Subjt:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC--KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVN

Query:  MLED-RRAHNPVNALIGMD-MQMYESRRIVG----DYSDVDSGYGDVAPFD-------YDYIEVVE------------------ADLANIPVDWGVPDVS
        +++     +N V  ++G D M+M +   ++G    D+   D   GD    D        +++ ++E                  +D      DW   +  
Subjt:  MLED-RRAHNPVNALIGMD-MQMYESRRIVG----DYSDVDSGYGDVAPFD-------YDYIEVVE------------------ADLANIPVDWGVPDVS

Query:  SMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETSNLESKIDRSSQR-STLY
           HP++FAK + +V + +    M  PS G++I G L     ++ S I ++L     +   N + + L           G+ S ++S  D  ++     Y
Subjt:  SMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETSNLESKIDRSSQR-STLY

Query:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRA
        +LE++RI+L +  G Q+EV ++D + A+PD + H++AEI+ R  E G K   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++ + RF F  RA
Subjt:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRA

Query:  TSEAAAEKQIQQLLFPRSRR
        TSE  AE QI++LLFP++ +
Subjt:  TSEAAAEKQIQQLLFPRSRR

Arabidopsis top hitse value%identityAlignment
AT3G49140.1 Pentatricopeptide repeat (PPR) superfamily protein7.6e-4829.05Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC--KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVN
        ++    A+Y DS  D         YHP E+++    ++  ++ L+ AE  RT +EVN+   L+  G++    HE + W +  Y+TD  G+LYF++ +  +
Subjt:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC--KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVN

Query:  MLED-RRAHNPVNALIGMD-MQMYESRRIVG----DYSDVDSGYGDVAPFD-------YDYIEVVE------------------ADLANIPVDWGVPDVS
        +++     +N V  ++G D M+M +   ++G    D+   D   GD    D        +++ ++E                  +D      DW   +  
Subjt:  MLED-RRAHNPVNALIGMD-MQMYESRRIVG----DYSDVDSGYGDVAPFD-------YDYIEVVE------------------ADLANIPVDWGVPDVS

Query:  SMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETSNLESKIDRSSQR-STLY
           HP++FAK + +V + +    M  PS G++I G L     ++ S I ++L     +   N + + L           G+ S ++S  D  ++     Y
Subjt:  SMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETSNLESKIDRSSQR-STLY

Query:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRA
        +LE++RI+L +  G Q+EV ++D + A+PD + H++AEI+ R  E G K   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++ + RF F  RA
Subjt:  RLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRA

Query:  TSEAAAEKQIQQLLFPRSRR
        TSE  AE QI++LLFP++ +
Subjt:  TSEAAAEKQIQQLLFPRSRR

AT3G59300.1 Pentatricopeptide repeat (PPR) superfamily protein2.5e-13656.06Show/hide
Query:  MAIAVASSLTFEGAPCSKSYA--FTSS--WNRSS------FDVCGRNK----------KFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKG
        M IA ASS +   + C +SY   F+SS  + R+S      FD CG              F  + FH  S G DL L+KVSVAADY DSVPDSS Y    G
Subjt:  MAIAVASSLTFEGAPCSKSYA--FTSS--WNRSS------FDVCGRNK----------KFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKG

Query:  YHPLEDLKVCKSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRR
        YHPLEDLK  K V+ T+L+A+EVART VE NS+A+L+FPG +H EPH+  SW EF+YV DDYGD++FEI D  N+LED  A NPV A  GMD+  YE+ R
Subjt:  YHPLEDLKVCKSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRR

Query:  IVGDYSDVDSGYGDVAPFDYDYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEES
           +Y+  D G  D   FD  Y E+++++  +IP+DWG+PD S+ VHP+YFAK L K I+M+YDR M +PSNGVSILGCLRPA+ DEESYIRRLF  E+ 
Subjt:  IVGDYSDVDSGYGDVAPFDYDYIEVVEADLANIPVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEES

Query:  EGYNTEWKGLEGETSNLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAI
        + Y+ E +G +   ++  S+ D +   S+LYRLEI+ IEL S+YG +S +SLQDFQDAEPDIL+HST+ I+ERFN +GI  +IALKALCKK+GLH E+A 
Subjt:  EGYNTEWKGLEGETSNLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAI

Query:  LIGVDSLGMDVRVCVGTEVRTFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRD
        LI VDSLGMDVRV  G +V+T RFPFK RAT+E AAEK+I QLLFPRSRR+KL+ H + L+D
Subjt:  LIGVDSLGMDVRVCVGTEVRTFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRD

AT5G24060.1 Pentatricopeptide repeat (PPR) superfamily protein9.9e-4829.44Show/hide
Query:  SKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDL
        S G+ L  ++    A+Y  S  D         YHP ED++     K+  ++ L+  E ART +EVN    L+  G +    HE + W +  YVTD +G++
Subjt:  SKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDL

Query:  YFEI-----------------------FDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVE---------ADLANI
        YF++                       FD++ M++D    +P     G++ ++ +    V D +  D   G+    D +++ V+E         +D    
Subjt:  YFEI-----------------------FDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVE---------ADLANI

Query:  PVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETSNLESKIDRS
          DW   +     HP+YFA+ + +V + +    M  PS G++I G L P   ++ S I++             +E E     ++G+ GE  +    ++ S
Subjt:  PVDWGVPDVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETSNLESKIDRS

Query:  SQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFR
              Y+LEI+RI+L +  G Q+EV ++D + A+PD++  ++  IL R  E G K   AL++LC +  G+  E+  LIG+DSLG D+R+C G ++ T R
Subjt:  SQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFR

Query:  FPFKIRATSEAAAEKQIQQLLFPRSRRK
        F F IRATSE  AE Q+++LLF  +  K
Subjt:  FPFKIRATSEAAAEKQIQQLLFPRSRRK

AT5G24060.2 Pentatricopeptide repeat (PPR) superfamily protein1.1e-4629.29Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEI----
        ++    A+Y  S  D         YHP ED++     K+  ++ L+  E ART +EVN    L+  G +    HE + W +  YVTD +G++YF++    
Subjt:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KSVRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEI----

Query:  -------------------FDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVE---------ADLANIPVDWGVPD
                           FD++ M++D    +P     G++ ++ +    V D +  D   G+    D +++ V+E         +D      DW   +
Subjt:  -------------------FDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVE---------ADLANIPVDWGVPD

Query:  VSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLYR
             HP+YFA+ + +V + +    M  PS G++I G L P   ++ S I++             +E E     ++G+ GE  +    ++ S      Y+
Subjt:  VSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLYR

Query:  LEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT
        LEI+RI+L +  G Q+EV ++D + A+PD++  ++  IL R  E G K   AL++LC +  G+  E+  LIG+DSLG D+R+C G ++ T RF F IRAT
Subjt:  LEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRAT

Query:  SEAAAEKQIQQLLFPRSRRK
        SE  AE Q+++LLF  +  K
Subjt:  SEAAAEKQIQQLLFPRSRRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATTGCTGTAGCTTCTTCACTTACCTTTGAAGGGGCTCCTTGCTCGAAATCGTACGCATTCACAAGCAGTTGGAATAGATCTTCTTTTGACGTTTGTGGCAGAAA
TAAAAAATTTGGATCAACAGAATTTCATTGGTTGTCTAAGGGACGTGACCTTTGCTTGTCAAAAGTTTCAGTCGCTGCTGATTACCCAGATTCAGTTCCAGATTCTTCAA
GTTATTTAACTAACAAAGGTTATCATCCCCTTGAAGATCTAAAAGTTTGCAAAAGTGTACGGAATACAGAACTCACTGCTGCAGAAGTAGCAAGGACTGCCGTGGAGGTC
AATAGCAATGCTCTGCTGTTATTTCCTGGAACTGTGCACAGTGAACCACATGAACAAGTATCGTGGGATGAGTTTCAATATGTCACTGACGATTATGGAGATTTGTATTT
TGAAATTTTTGATAGTGTGAACATGTTAGAAGATCGGCGAGCACACAACCCTGTGAATGCCTTGATTGGAATGGACATGCAAATGTATGAGAGTAGGAGGATAGTTGGAG
ATTATAGTGATGTAGATAGTGGCTATGGTGATGTTGCTCCTTTTGATTATGATTATATTGAGGTAGTGGAAGCTGATTTAGCCAATATTCCAGTTGACTGGGGAGTTCCT
GATGTTTCTAGCATGGTTCATCCTGTATATTTTGCTAAGTGCTTGAAAAAGGTTATCAATATGGAATATGACAGAAACATGAAGCATCCTTCCAATGGGGTTTCCATATT
GGGATGTCTCAGACCTGCATATGCTGATGAAGAATCTTATATAAGAAGATTATTTTACTTCGAAGAAAGTGAAGGCTACAACACAGAATGGAAAGGTTTAGAAGGCGAAA
CCTCAAACTTGGAGTCCAAAATTGATAGAAGCAGTCAAAGATCCACTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCTTCTCTGTGTATGGAGTTCAGTCTGAAGTT
AGTTTGCAAGATTTTCAAGATGCTGAACCTGATATTCTTTTGCACTCTACTGCGGAAATTCTAGAGCGTTTTAATGAGAAGGGTATTAAGTGCAATATTGCCCTTAAAGC
TCTTTGCAAAAAGAGGGGTCTTCATGTTGAGGATGCTATTTTGATCGGAGTTGATAGTCTTGGCATGGATGTGAGAGTATGTGTTGGGACGGAAGTACGGACTTTTCGAT
TTCCCTTTAAAATCAGGGCAACGTCAGAAGCTGCAGCAGAGAAGCAGATTCAGCAACTCTTGTTCCCACGATCTCGTCGTAAAAAACTACGAAGCCATGGGGATGGATTG
AGAGATACTGTCAGTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATATATATAAACAGAATTGAATATTGTTGGACCAATTTGTTATTTTGTACATTTATTAAACTGAGATTTCTGAGCACGTTGGGACTTGCGGCATCTTTCCCTTTTAGCTG
ACGTTTGAATCGTCTTTATGGCCAAATTCTGCTGCGATCTACTCGTAGAAGCCGATACCCATCACACCCAAATTCGATTTTGATCTCTCTCTCTCTCTCTCTCTCTCTCT
CTCTCTCTCTCTCTCTCTCATGGCAATTGCTGTAGCTTCTTCACTTACCTTTGAAGGGGCTCCTTGCTCGAAATCGTACGCATTCACAAGCAGTTGGAATAGATCTTCTT
TTGACGTTTGTGGCAGAAATAAAAAATTTGGATCAACAGAATTTCATTGGTTGTCTAAGGGACGTGACCTTTGCTTGTCAAAAGTTTCAGTCGCTGCTGATTACCCAGAT
TCAGTTCCAGATTCTTCAAGTTATTTAACTAACAAAGGTTATCATCCCCTTGAAGATCTAAAAGTTTGCAAAAGTGTACGGAATACAGAACTCACTGCTGCAGAAGTAGC
AAGGACTGCCGTGGAGGTCAATAGCAATGCTCTGCTGTTATTTCCTGGAACTGTGCACAGTGAACCACATGAACAAGTATCGTGGGATGAGTTTCAATATGTCACTGACG
ATTATGGAGATTTGTATTTTGAAATTTTTGATAGTGTGAACATGTTAGAAGATCGGCGAGCACACAACCCTGTGAATGCCTTGATTGGAATGGACATGCAAATGTATGAG
AGTAGGAGGATAGTTGGAGATTATAGTGATGTAGATAGTGGCTATGGTGATGTTGCTCCTTTTGATTATGATTATATTGAGGTAGTGGAAGCTGATTTAGCCAATATTCC
AGTTGACTGGGGAGTTCCTGATGTTTCTAGCATGGTTCATCCTGTATATTTTGCTAAGTGCTTGAAAAAGGTTATCAATATGGAATATGACAGAAACATGAAGCATCCTT
CCAATGGGGTTTCCATATTGGGATGTCTCAGACCTGCATATGCTGATGAAGAATCTTATATAAGAAGATTATTTTACTTCGAAGAAAGTGAAGGCTACAACACAGAATGG
AAAGGTTTAGAAGGCGAAACCTCAAACTTGGAGTCCAAAATTGATAGAAGCAGTCAAAGATCCACTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCTTCTCTGTGTA
TGGAGTTCAGTCTGAAGTTAGTTTGCAAGATTTTCAAGATGCTGAACCTGATATTCTTTTGCACTCTACTGCGGAAATTCTAGAGCGTTTTAATGAGAAGGGTATTAAGT
GCAATATTGCCCTTAAAGCTCTTTGCAAAAAGAGGGGTCTTCATGTTGAGGATGCTATTTTGATCGGAGTTGATAGTCTTGGCATGGATGTGAGAGTATGTGTTGGGACG
GAAGTACGGACTTTTCGATTTCCCTTTAAAATCAGGGCAACGTCAGAAGCTGCAGCAGAGAAGCAGATTCAGCAACTCTTGTTCCCACGATCTCGTCGTAAAAAACTACG
AAGCCATGGGGATGGATTGAGAGATACTGTCAGTTTTTAGAACGCCCTCTGCGTTATTTTGACATTTGGTGGTTCCTAGAAATAAATTTCAAAGCTAAGGCGACTTCTTT
TACTTAAATCCGTCATAGCATTAGTTTTTGTTTGAGAATTGTACGGTTCAAGTTGATAATTTGTCTCGTTTTTTATTGGATTTCTTTGTCTGCTCTTATTTTTTC
Protein sequenceShow/hide protein sequence
MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDVCGRNKKFGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEV
NSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIGMDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVEADLANIPVDWGVP
DVSSMVHPVYFAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETSNLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEV
SLQDFQDAEPDILLHSTAEILERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRATSEAAAEKQIQQLLFPRSRRKKLRSHGDGL
RDTVSF