; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0779 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0779
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationMC05:6458203..6499763
RNA-Seq ExpressionMC05g0779
SyntenyMC05g0779
Gene Ontology termsNA
InterPro domainsIPR037119 - Haem oxygenase HugZ-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152092.1 uncharacterized protein At3g49140 isoform X2 [Cucumis sativus]2.43e-28988.79Show/hide
Query:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA
        MAIAVASSLTFEGA CS SYAFTSSWNR S DV GRN  FGSTE HWLSKGRDL LSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKVCK  R+TELTA
Subjt:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYV DDYGDLYFEIFD+VNMLEDR AHNPVNALIGMDMQMYESRRIVGDY+  DSG GDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY

Query:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK
        DYIEVVE DL++IPVDWGVPDVSS+VHPVYFAKCL KVINMEYD+ MKHPSNGVSILGCLRPA+ADEESYIRRLFYFE SEGY TEWKGL+GE  + ESK
Subjt:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK

Query:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
         D+SSQRSTLYRLEIMRIELFSVYGVQ+E+SLQDFQ+AEPDIL+HSTAEI+E F+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEVR
Subjt:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF
        TFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLRSHGDG RD+VSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF

XP_008453943.1 PREDICTED: uncharacterized protein At3g49140 isoform X1 [Cucumis melo]8.64e-28386.32Show/hide
Query:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA
        MA+AVASSLTFEGA CSTSYAFTS WNR S DV GRN  FGSTE HWLSKGRDL  SKVSVAADYPDSVPDSSSY TN+GYHPLEDLKVCKRAR+TELTA
Subjt:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE QYV DDYGDLYFEIFD+VNMLEDRGAHNPVNALIGMDMQMYESRRI+GDY+A DSG GDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY

Query:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK
        DYIE VE DL++IPVDWGVPDVSSLVHPVYFAKCLNKV+N+EYD+ MKHPSNGV+ILG LRP +ADEESY+RRLF FE SEGY TEWKGL+GE  + E K
Subjt:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK

Query:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
         D+SSQRSTLYRLEI+RIELFSVYGVQ+E+SLQDFQ+AEPDIL+HST +I+E F+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEVR
Subjt:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF
        TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRS+GDG RD+VSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF

XP_022137915.1 uncharacterized protein At3g49140 isoform X1 [Momordica charantia]0.0100Show/hide
Query:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA
        MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA
Subjt:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY

Query:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK
        DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK
Subjt:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK

Query:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
        SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
Subjt:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF
        TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF

XP_038898170.1 uncharacterized protein At3g49140 isoform X1 [Benincasa hispida]1.79e-29589.4Show/hide
Query:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA
        M IAVAS+LTFEGACCSTSYAFTSSWNR S DVRGRN  FGSTE HWLSKGRDL LSKVSVAADYPDSVPDSSS+LTN+GYHPLEDLKVCKRAR+TELTA
Subjt:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSW+EFQYVIDDYGDLYFEIFD+VNMLEDRGAHNPVNALIGMDMQMYESRR VGDY+A DSG GDVVPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY

Query:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWK-------GLDGE
        DYIEVVETDL+DIPVDWG PD SSLVHPVYFAKCLNKVINMEYD+KM HPSNGVSILGCLRPA+ADEESY+RRLF+FE SEGY TEWK       GL+GE
Subjt:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWK-------GLDGE

Query:  ALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRV
         LS ESK D+SSQRSTLYRLEIMRIELFSVYGVQ+E+SLQDFQ AEPDIL+HSTAEI+E FSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRV
Subjt:  ALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRV

Query:  CFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF
        CFGTEV+TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDG RD+VSF
Subjt:  CFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF

XP_038898179.1 uncharacterized protein At3g49140 isoform X2 [Benincasa hispida]2.49e-29890.81Show/hide
Query:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA
        M IAVAS+LTFEGACCSTSYAFTSSWNR S DVRGRN  FGSTE HWLSKGRDL LSKVSVAADYPDSVPDSSS+LTN+GYHPLEDLKVCKRAR+TELTA
Subjt:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSW+EFQYVIDDYGDLYFEIFD+VNMLEDRGAHNPVNALIGMDMQMYESRR VGDY+A DSG GDVVPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY

Query:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK
        DYIEVVETDL+DIPVDWG PD SSLVHPVYFAKCLNKVINMEYD+KM HPSNGVSILGCLRPA+ADEESY+RRLF+FE SEGY TEWKGL+GE LS ESK
Subjt:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK

Query:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
         D+SSQRSTLYRLEIMRIELFSVYGVQ+E+SLQDFQ AEPDIL+HSTAEI+E FSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEV+
Subjt:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF
        TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDG RD+VSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF

TrEMBL top hitse value%identityAlignment
A0A0A0KW72 Uncharacterized protein1.18e-28988.79Show/hide
Query:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA
        MAIAVASSLTFEGA CS SYAFTSSWNR S DV GRN  FGSTE HWLSKGRDL LSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKVCK  R+TELTA
Subjt:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYV DDYGDLYFEIFD+VNMLEDR AHNPVNALIGMDMQMYESRRIVGDY+  DSG GDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY

Query:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK
        DYIEVVE DL++IPVDWGVPDVSS+VHPVYFAKCL KVINMEYD+ MKHPSNGVSILGCLRPA+ADEESYIRRLFYFE SEGY TEWKGL+GE  + ESK
Subjt:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK

Query:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
         D+SSQRSTLYRLEIMRIELFSVYGVQ+E+SLQDFQ+AEPDIL+HSTAEI+E F+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEVR
Subjt:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF
        TFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLRSHGDG RD+VSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF

A0A1S3BY92 uncharacterized protein At3g49140 isoform X14.18e-28386.32Show/hide
Query:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA
        MA+AVASSLTFEGA CSTSYAFTS WNR S DV GRN  FGSTE HWLSKGRDL  SKVSVAADYPDSVPDSSSY TN+GYHPLEDLKVCKRAR+TELTA
Subjt:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE QYV DDYGDLYFEIFD+VNMLEDRGAHNPVNALIGMDMQMYESRRI+GDY+A DSG GDV PFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY

Query:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK
        DYIE VE DL++IPVDWGVPDVSSLVHPVYFAKCLNKV+N+EYD+ MKHPSNGV+ILG LRP +ADEESY+RRLF FE SEGY TEWKGL+GE  + E K
Subjt:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK

Query:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
         D+SSQRSTLYRLEI+RIELFSVYGVQ+E+SLQDFQ+AEPDIL+HST +I+E F+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEVR
Subjt:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF
        TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRS+GDG RD+VSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF

A0A6J1C800 uncharacterized protein At3g49140 isoform X10.0100Show/hide
Query:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA
        MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA
Subjt:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY

Query:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK
        DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK
Subjt:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK

Query:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
        SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
Subjt:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF
        TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF

A0A6J1GXY6 uncharacterized protein At3g49140-like isoform X24.69e-28286.55Show/hide
Query:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA
        MAIAVASSLTFEGACCSTS+AFTS W+R S DVRGRNP+FGSTE HWLSKGRDL LSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKV KRAR+TELTA
Subjt:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHE+VSWDEFQYVIDDYGDLYFEIFD+ NMLEDRGAHNPV ALIGMD+QMYESR  VGDY A DS  GDV+PF +
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY

Query:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK
        DYIE VETDL+D PVDWGV DVSSLVHP+YFAKCLNKVINMEYD+KMKHPSNGVSILGCLRPA+ADEESYIRRLFYFE SEG+  EWK L+GE L FESK
Subjt:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK

Query:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
        SD+SSQRSTLYRLE MRIELFSVYGVQ+E+SLQDF++AEPDIL+HSTAEIVE F EKGIRCNIALKALCKK+GLHV+DA LIGVDSLGMDVRVCFG EVR
Subjt:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF
        T+RFPFK+RATSEVAAEKQIQQLLFPRSRRK+LRSHGDG  D+ SF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF

A0A6J1IUC5 uncharacterized protein At3g49140-like isoform X24.69e-28286.77Show/hide
Query:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA
        MAIAVASSLTFEGACCSTS+AFTS W+R S DVRGRNP+FGS+E HWLSKGRDL LSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKV KRAR+TELTA
Subjt:  MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTA

Query:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY
        AEVARTAVEVNSNALLLFPGTVHSEPHE+VSWDEFQYVIDDYGDLYFEIFD+ NMLEDRGAHNPV ALIGMD+QMYESR  VGDY A DS  GDV+PF +
Subjt:  AEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDY

Query:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK
        DYIE VETDL+D PVDWGV DVSSLVHP+YFAKCLNKVINMEYD+KMKHPSNGVSILGCLRPA+ADEESYIRRLFYFE SE + TEWK LDGE L FESK
Subjt:  DYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESK

Query:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
        SD+SSQRSTLYRLE MRIELFSVYGVQ+E+SLQDF++AEPDIL+HSTAEIVEHF EKGIRCNIALKALCKK+GLHV+DA LIGVDSLGMDVRVCFG EVR
Subjt:  SDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF
        T+RFPFK+RATSEVAAEKQIQQLLFPRSRRK+LRSHGDG  D+ SF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF

SwissProt top hitse value%identityAlignment
Q0WMN5 Uncharacterized protein At3g491401.9e-4828.24Show/hide
Query:  RDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC--KRARDTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEI
        + LR ++    A+Y DS  D         YHP E+++    +   D+ L+ AE  RT +EVN+   L+  G++    HE + W +  Y+ D  G+LYF++
Subjt:  RDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC--KRARDTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEI

Query:  FDNVNMLED-RGAHNPVNALIGMD-MQMYESRRIVG----DYNAPDSGNGDVVPFD-------YDYIEVVE------------------TDLSDIPVDWG
         ++ ++++     +N V  ++G D M+M +   ++G    D+   D  +GD    D        +++ ++E                  +D  +   DW 
Subjt:  FDNVNMLED-RGAHNPVNALIGMD-MQMYESRRIVG----DYNAPDSGNGDVVPFD-------YDYIEVVE------------------TDLSDIPVDWG

Query:  VPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGS------------EGYTTEWKGLDGEALSFESKSDKSSQ
          +     HP++FAK + +V + +    M  PS G++I G L     ++ S I++      S            +      K    E+    S+ +K+  
Subjt:  VPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGS------------EGYTTEWKGLDGEALSFESKSDKSSQ

Query:  RSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFP
            Y+LE++RI+L +  G QTE+ ++D ++A+PD + H++AEI+    E G +   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++ + RF 
Subjt:  RSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFP

Query:  FKIRATSEVAAEKQIQQLLFPRSRR
        F  RATSE  AE QI++LLFP++ +
Subjt:  FKIRATSEVAAEKQIQQLLFPRSRR

Arabidopsis top hitse value%identityAlignment
AT3G49140.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-4928.24Show/hide
Query:  RDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC--KRARDTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEI
        + LR ++    A+Y DS  D         YHP E+++    +   D+ L+ AE  RT +EVN+   L+  G++    HE + W +  Y+ D  G+LYF++
Subjt:  RDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC--KRARDTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEI

Query:  FDNVNMLED-RGAHNPVNALIGMD-MQMYESRRIVG----DYNAPDSGNGDVVPFD-------YDYIEVVE------------------TDLSDIPVDWG
         ++ ++++     +N V  ++G D M+M +   ++G    D+   D  +GD    D        +++ ++E                  +D  +   DW 
Subjt:  FDNVNMLED-RGAHNPVNALIGMD-MQMYESRRIVG----DYNAPDSGNGDVVPFD-------YDYIEVVE------------------TDLSDIPVDWG

Query:  VPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGS------------EGYTTEWKGLDGEALSFESKSDKSSQ
          +     HP++FAK + +V + +    M  PS G++I G L     ++ S I++      S            +      K    E+    S+ +K+  
Subjt:  VPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGS------------EGYTTEWKGLDGEALSFESKSDKSSQ

Query:  RSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFP
            Y+LE++RI+L +  G QTE+ ++D ++A+PD + H++AEI+    E G +   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++ + RF 
Subjt:  RSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFP

Query:  FKIRATSEVAAEKQIQQLLFPRSRR
        F  RATSE  AE QI++LLFP++ +
Subjt:  FKIRATSEVAAEKQIQQLLFPRSRR

AT3G59300.1 Pentatricopeptide repeat (PPR) superfamily protein1.9e-13956.68Show/hide
Query:  MAIAVASSLTFEGACCSTSYA--FTSS--WNRYS-----------------CDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQ
        M IA ASS +   + C  SY   F+SS  + R S                    R + P FGS   H  S G DL L+KVSVAADY DSVPDSS Y    
Subjt:  MAIAVASSLTFEGACCSTSYA--FTSS--WNRYS-----------------CDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQ

Query:  GYHPLEDLKVCKRARDTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESR
        GYHPLEDLK  KR ++T+L+A+EVART VE NS+A+L+FPG +H EPH+  SW EF+YVIDDYGD++FEI D+ N+LED GA NPV A  GMD+  YE+ 
Subjt:  GYHPLEDLKVCKRARDTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESR

Query:  RIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEG
        R   +YN  D GN D + FD  Y E+++++  DIP+DWG+PD S+ VHP+YFAK L+K I+M+YD+KM +PSNGVSILGCLRPAF DEESYIRRLF  E 
Subjt:  RIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDIPVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEG

Query:  SEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDA
         + Y+ E +G D    S  S+ D++   S+LYRLEI+ IEL S+YG ++ ISLQDFQ+AEPDILVHST+ I+E F+ +GI  +IALKALCKK+GLH E+A
Subjt:  SEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDA

Query:  ILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS
         LI VDSLGMDVRV  G +V+T RFPFK RAT+E+AAEK+I QLLFPRSRR+KL+ H +  +D+
Subjt:  ILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDS

AT5G24060.1 Pentatricopeptide repeat (PPR) superfamily protein9.0e-4929.98Show/hide
Query:  SKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC---KRARDTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDL
        S G+ LR ++    A+Y  S  D         YHP ED++     K   D+ L+  E ART +EVN    L+  G +    HE + W +  YV D +G++
Subjt:  SKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC---KRARDTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDL

Query:  YFEIFDNVNMLED--RGAHNPVNALIGMD-MQMYESRRI-------------------VGDYNAPDSGNGDVVPFDYDYIEVVE---------TDLSDIP
        YF++ +N ++++      +N V  ++G D M+M +   +                   V D N  D   G+    D +++ V+E         +D  +  
Subjt:  YFEIFDNVNMLED--RGAHNPVNALIGMD-MQMYESRRI-------------------VGDYNAPDSGNGDVVPFDYDYIEVVE---------TDLSDIP

Query:  VDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQ-------R
         DW   +     HP+YFA+ + +V + +    M  PS G++I G L P   ++ S I++      S G T + K  +     FE   +  S+       R
Subjt:  VDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQ-------R

Query:  STL--YRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRF
        + +  Y+LEI+RI+L +  G QTE+ ++D ++A+PD++  ++  I+    E G +   AL++LC +  G+  E+  LIG+DSLG D+R+C G ++ T RF
Subjt:  STL--YRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRF

Query:  PFKIRATSEVAAEKQIQQLLFPRSRRK
         F IRATSE  AE Q+++LLF  +  K
Subjt:  PFKIRATSEVAAEKQIQQLLFPRSRRK

AT5G24060.2 Pentatricopeptide repeat (PPR) superfamily protein9.9e-4829.86Show/hide
Query:  LRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC---KRARDTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIF
        LR ++    A+Y  S  D         YHP ED++     K   D+ L+  E ART +EVN    L+  G +    HE + W +  YV D +G++YF++ 
Subjt:  LRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC---KRARDTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIF

Query:  DNVNMLED--RGAHNPVNALIGMD-MQMYESRRI-------------------VGDYNAPDSGNGDVVPFDYDYIEVVE---------TDLSDIPVDWGV
        +N ++++      +N V  ++G D M+M +   +                   V D N  D   G+    D +++ V+E         +D  +   DW  
Subjt:  DNVNMLED--RGAHNPVNALIGMD-MQMYESRRI-------------------VGDYNAPDSGNGDVVPFDYDYIEVVE---------TDLSDIPVDWGV

Query:  PDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQ-------RSTL--
         +     HP+YFA+ + +V + +    M  PS G++I G L P   ++ S I++      S G T + K  +     FE   +  S+       R+ +  
Subjt:  PDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQ-------RSTL--

Query:  YRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIR
        Y+LEI+RI+L +  G QTE+ ++D ++A+PD++  ++  I+    E G +   AL++LC +  G+  E+  LIG+DSLG D+R+C G ++ T RF F IR
Subjt:  YRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIR

Query:  ATSEVAAEKQIQQLLFPRSRRK
        ATSE  AE Q+++LLF  +  K
Subjt:  ATSEVAAEKQIQQLLFPRSRRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATTGCTGTAGCTTCTTCACTTACCTTCGAAGGGGCCTGTTGCTCGACATCATATGCATTCACAAGCAGTTGGAATAGATATTCTTGTGATGTTCGTGGCAGAAA
CCCAATGTTTGGATCAACAGAATTACATTGGTTGTCTAAGGGACGTGACCTTCGCTTGTCAAAAGTTTCAGTTGCTGCAGATTATCCAGATTCAGTTCCAGATTCATCAA
GTTACTTGACTAACCAAGGTTATCATCCTCTTGAAGATCTAAAAGTTTGCAAAAGAGCACGGGACACTGAACTCACTGCAGCAGAAGTAGCAAGGACAGCTGTGGAGGTC
AATAGCAACGCTTTGCTATTATTTCCTGGAACTGTGCACAGTGAACCACACGAACAAGTATCTTGGGATGAGTTTCAATATGTTATCGACGATTATGGAGATTTGTATTT
TGAAATTTTTGACAATGTGAACATGTTAGAAGATCGTGGAGCACACAATCCTGTGAATGCTTTGATTGGAATGGACATGCAAATGTATGAGAGTAGGAGGATAGTTGGAG
ATTATAATGCGCCAGATAGTGGCAATGGTGATGTTGTTCCTTTTGATTATGACTATATTGAGGTAGTGGAAACTGATTTGTCTGATATTCCAGTTGACTGGGGAGTTCCA
GATGTTTCTAGCTTGGTTCATCCTGTATATTTTGCCAAGTGCTTGAATAAGGTTATCAATATGGAATACGATAAAAAGATGAAGCATCCTTCAAATGGAGTTTCCATTTT
GGGATGTCTCAGACCTGCATTTGCTGATGAAGAATCTTATATAAGAAGATTATTTTACTTTGAAGGAAGTGAAGGCTACACCACAGAATGGAAAGGTTTAGATGGTGAAG
CCTTGAGCTTCGAGTCCAAAAGTGATAAAAGCAGCCAAAGATCAACTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCTTCTCAGTGTATGGAGTTCAGACTGAAATC
AGTTTGCAAGATTTTCAAGAAGCTGAGCCTGATATTCTTGTGCACTCCACTGCGGAAATTGTAGAGCATTTTAGTGAGAAGGGTATTAGGTGTAATATTGCTCTTAAAGC
TCTTTGCAAAAAAAGAGGTCTTCATGTTGAGGACGCTATTTTGATAGGAGTCGATAGTCTTGGCATGGATGTGAGGGTATGTTTCGGGACAGAAGTACGGACTTTTCGAT
TTCCCTTCAAAATCCGGGCAACATCTGAAGTTGCAGCAGAGAAGCAAATTCAGCAACTCTTGTTCCCACGGTCTCGTCGTAAGAAATTACGAAGCCATGGAGATGGATTT
AGAGACAGTGTCAGTTTTTAG
mRNA sequenceShow/hide mRNA sequence
CTTCCACTTTCGTACTTCACAATTGCAATATCCTTTGTTCAATATTTTTGCACTCAATATTTCTTTTTCAGGTTTAATAAAAACAGAATCGACAGACCCATTTGTTATTT
TATACATTTATTAAACTGAGATTTCTGCACACGCTTTGACATCTTCCCTTTCCGCTGACATTCAAATCTCGATGGCCAATTCGTACTACAGGCTATACGTTGAGAACGAT
ACCTATCTCTGCCCGGTTCAACTTTGATCTCTCTCATGGCAATTGCTGTAGCTTCTTCACTTACCTTCGAAGGGGCCTGTTGCTCGACATCATATGCATTCACAAGCAGT
TGGAATAGATATTCTTGTGATGTTCGTGGCAGAAACCCAATGTTTGGATCAACAGAATTACATTGGTTGTCTAAGGGACGTGACCTTCGCTTGTCAAAAGTTTCAGTTGC
TGCAGATTATCCAGATTCAGTTCCAGATTCATCAAGTTACTTGACTAACCAAGGTTATCATCCTCTTGAAGATCTAAAAGTTTGCAAAAGAGCACGGGACACTGAACTCA
CTGCAGCAGAAGTAGCAAGGACAGCTGTGGAGGTCAATAGCAACGCTTTGCTATTATTTCCTGGAACTGTGCACAGTGAACCACACGAACAAGTATCTTGGGATGAGTTT
CAATATGTTATCGACGATTATGGAGATTTGTATTTTGAAATTTTTGACAATGTGAACATGTTAGAAGATCGTGGAGCACACAATCCTGTGAATGCTTTGATTGGAATGGA
CATGCAAATGTATGAGAGTAGGAGGATAGTTGGAGATTATAATGCGCCAGATAGTGGCAATGGTGATGTTGTTCCTTTTGATTATGACTATATTGAGGTAGTGGAAACTG
ATTTGTCTGATATTCCAGTTGACTGGGGAGTTCCAGATGTTTCTAGCTTGGTTCATCCTGTATATTTTGCCAAGTGCTTGAATAAGGTTATCAATATGGAATACGATAAA
AAGATGAAGCATCCTTCAAATGGAGTTTCCATTTTGGGATGTCTCAGACCTGCATTTGCTGATGAAGAATCTTATATAAGAAGATTATTTTACTTTGAAGGAAGTGAAGG
CTACACCACAGAATGGAAAGGTTTAGATGGTGAAGCCTTGAGCTTCGAGTCCAAAAGTGATAAAAGCAGCCAAAGATCAACTCTCTACAGGTTGGAGATAATGAGAATTG
AGCTCTTCTCAGTGTATGGAGTTCAGACTGAAATCAGTTTGCAAGATTTTCAAGAAGCTGAGCCTGATATTCTTGTGCACTCCACTGCGGAAATTGTAGAGCATTTTAGT
GAGAAGGGTATTAGGTGTAATATTGCTCTTAAAGCTCTTTGCAAAAAAAGAGGTCTTCATGTTGAGGACGCTATTTTGATAGGAGTCGATAGTCTTGGCATGGATGTGAG
GGTATGTTTCGGGACAGAAGTACGGACTTTTCGATTTCCCTTCAAAATCCGGGCAACATCTGAAGTTGCAGCAGAGAAGCAAATTCAGCAACTCTTGTTCCCACGGTCTC
GTCGTAAGAAATTACGAAGCCATGGAGATGGATTTAGAGACAGTGTCAGTTTTTAGAACACCCTGTGCATTATTGTGAGATTTGGAGGATCTTAGAAACAAATTTCAAAG
CTCAAGGTGACTTCTTCACTCAAATCTGTCACATTAGAGTCTTAGTTTCTGAATTGTACAGTTCAAGTTGTATAATGTATTCTCTGTTTATTACGGATTCTCTGTCTGCC
CTCAGTTGTATTTGGTACCCTCTTCACCTTCATATCTGAATTCCCTGGGGTGTCCTCCCCTCACCGGCTTTGAGAAATTTGTTGAGCTCTGTCATCAATCATCTTGTTCT
TCGAGATCTTTTATTGGCGCAAGCTTCTTTTCATTCAATAAAATTTTTATTATGCAATTAGTGAAATCCCGT
Protein sequenceShow/hide protein sequence
MAIAVASSLTFEGACCSTSYAFTSSWNRYSCDVRGRNPMFGSTELHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEV
NSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDIPVDWGVP
DVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEI
SLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGF
RDSVSF