; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS014392 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS014392
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationscaffold19:653113..676953
RNA-Seq ExpressionMS014392
SyntenyMS014392
Gene Ontology termsNA
InterPro domainsIPR037119 - Haem oxygenase HugZ-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152092.1 uncharacterized protein At3g49140 isoform X2 [Cucumis sativus]4.6e-21188.35Show/hide
Query:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE
        GRN  FGST+ HWLSKGRDL LSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKVCK  R+TELTAAEVARTAVEVN NALLLFPGTVHSEPHEQVSWDE
Subjt:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC
        FQYV DDYGDLYFEIFD+VNMLEDR AHNPVNALIGMDMQMYESRRIVGDY+  DSG GDV PFDYDYIEVVE DL++I VDWGVPDVSS+VHPVYFAKC
Subjt:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC

Query:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD
        L KVINMEYD+ MKHPSNGVSILGCLRPA+ADEESYIRRLFYFE SEGY TEWKGL+GE  + ESK D+SSQRSTLYRLEIMRIELFSVYGVQ+E+SLQD
Subjt:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD

Query:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQ+AEPDIL+HSTAEI+E F+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEVRTFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLR
Subjt:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGFRDSVSF
        SHGDG RD+VSF
Subjt:  SHGDGFRDSVSF

XP_022137915.1 uncharacterized protein At3g49140 isoform X1 [Momordica charantia]1.4e-23699.27Show/hide
Query:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE
        GRNPMFGST+LHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVN NALLLFPGTVHSEPHEQVSWDE
Subjt:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC
        FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDI VDWGVPDVSSLVHPVYFAKC
Subjt:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC

Query:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD
        LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD
Subjt:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD

Query:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
Subjt:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGFRDSVSF
        SHGDGFRDSVSF
Subjt:  SHGDGFRDSVSF

XP_031740660.1 uncharacterized protein At3g49140 isoform X1 [Cucumis sativus]4.6e-21188.35Show/hide
Query:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE
        GRN  FGST+ HWLSKGRDL LSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKVCK  R+TELTAAEVARTAVEVN NALLLFPGTVHSEPHEQVSWDE
Subjt:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC
        FQYV DDYGDLYFEIFD+VNMLEDR AHNPVNALIGMDMQMYESRRIVGDY+  DSG GDV PFDYDYIEVVE DL++I VDWGVPDVSS+VHPVYFAKC
Subjt:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC

Query:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD
        L KVINMEYD+ MKHPSNGVSILGCLRPA+ADEESYIRRLFYFE SEGY TEWKGL+GE  + ESK D+SSQRSTLYRLEIMRIELFSVYGVQ+E+SLQD
Subjt:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD

Query:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQ+AEPDIL+HSTAEI+E F+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEVRTFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLR
Subjt:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGFRDSVSF
        SHGDG RD+VSF
Subjt:  SHGDGFRDSVSF

XP_038898170.1 uncharacterized protein At3g49140 isoform X1 [Benincasa hispida]1.0e-21388.78Show/hide
Query:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE
        GRN  FGST+ HWLSKGRDL LSKVSVAADYPDSVPDSSS+LTN+GYHPLEDLKVCKRAR+TELTAAEVARTAVEVN NALLLFPGTVHSEPHEQVSW+E
Subjt:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC
        FQYVIDDYGDLYFEIFD+VNMLEDRGAHNPVNALIGMDMQMYESRR VGDY+A DSG GDVVPFDYDYIEVVETDL+DI VDWG PD SSLVHPVYFAKC
Subjt:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC

Query:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWK-------GLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQ
        LNKVINMEYD+KM HPSNGVSILGCLRPA+ADEESY+RRLF+FE SEGY TEWK       GL+GE LS ESK D+SSQRSTLYRLEIMRIELFSVYGVQ
Subjt:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWK-------GLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQ

Query:  TEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPR
        +E+SLQDFQ AEPDIL+HSTAEI+E FSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEV+TFRFPFKIRATSEVAAEKQIQQLLFPR
Subjt:  TEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPR

Query:  SRRKKLRSHGDGFRDSVSF
        SRRKKLRSHGDG RD+VSF
Subjt:  SRRKKLRSHGDGFRDSVSF

XP_038898179.1 uncharacterized protein At3g49140 isoform X2 [Benincasa hispida]8.2e-21690.29Show/hide
Query:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE
        GRN  FGST+ HWLSKGRDL LSKVSVAADYPDSVPDSSS+LTN+GYHPLEDLKVCKRAR+TELTAAEVARTAVEVN NALLLFPGTVHSEPHEQVSW+E
Subjt:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC
        FQYVIDDYGDLYFEIFD+VNMLEDRGAHNPVNALIGMDMQMYESRR VGDY+A DSG GDVVPFDYDYIEVVETDL+DI VDWG PD SSLVHPVYFAKC
Subjt:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC

Query:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD
        LNKVINMEYD+KM HPSNGVSILGCLRPA+ADEESY+RRLF+FE SEGY TEWKGL+GE LS ESK D+SSQRSTLYRLEIMRIELFSVYGVQ+E+SLQD
Subjt:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD

Query:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQ AEPDIL+HSTAEI+E FSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEV+TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
Subjt:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGFRDSVSF
        SHGDG RD+VSF
Subjt:  SHGDGFRDSVSF

TrEMBL top hitse value%identityAlignment
A0A0A0KW72 Uncharacterized protein2.2e-21188.35Show/hide
Query:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE
        GRN  FGST+ HWLSKGRDL LSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKVCK  R+TELTAAEVARTAVEVN NALLLFPGTVHSEPHEQVSWDE
Subjt:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC
        FQYV DDYGDLYFEIFD+VNMLEDR AHNPVNALIGMDMQMYESRRIVGDY+  DSG GDV PFDYDYIEVVE DL++I VDWGVPDVSS+VHPVYFAKC
Subjt:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC

Query:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD
        L KVINMEYD+ MKHPSNGVSILGCLRPA+ADEESYIRRLFYFE SEGY TEWKGL+GE  + ESK D+SSQRSTLYRLEIMRIELFSVYGVQ+E+SLQD
Subjt:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD

Query:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQ+AEPDIL+HSTAEI+E F+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEVRTFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLR
Subjt:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGFRDSVSF
        SHGDG RD+VSF
Subjt:  SHGDGFRDSVSF

A0A1S3BY92 uncharacterized protein At3g49140 isoform X13.7e-20685.92Show/hide
Query:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE
        GRN  FGST+ HWLSKGRDL  SKVSVAADYPDSVPDSSSY TN+GYHPLEDLKVCKRAR+TELTAAEVARTAVEVN NALLLFPGTVHSEPHEQVSWDE
Subjt:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC
         QYV DDYGDLYFEIFD+VNMLEDRGAHNPVNALIGMDMQMYESRRI+GDY+A DSG GDV PFDYDYIE VE DL++I VDWGVPDVSSLVHPVYFAKC
Subjt:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC

Query:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD
        LNKV+N+EYD+ MKHPSNGV+ILG LRP +ADEESY+RRLF FE SEGY TEWKGL+GE  + E K D+SSQRSTLYRLEI+RIELFSVYGVQ+E+SLQD
Subjt:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD

Query:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQ+AEPDIL+HST +I+E F+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
Subjt:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGFRDSVSF
        S+GDG RD+VSF
Subjt:  SHGDGFRDSVSF

A0A5A7TTC0 Pentatricopeptide repeat (PPR) superfamily protein isoform 27.5e-20785.92Show/hide
Query:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE
        GRN  FGST+ HWLSKGRDL  SKVSVAADYPDSVPDSSSY TN+GYHPLEDLKVCKRAR+TELTAAEVARTAVEVN NALLLFPGTVHSEPHEQVSWDE
Subjt:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC
         QYV +DYGDLYFEIFD+VNMLEDRGAHNPVNALIGMDMQMYESRRI+GDY+A DSG GDV PFDYDYIE VE DL++I VDWGVPDVSSLVHPVYFAKC
Subjt:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC

Query:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD
        LNKV+N+EYD+ MKHPSNGV+ILGCLRP +ADEESY+RRLF FE SEGY TEWKGL+GE  + E K D+SSQRSTLYRLEI+RIELFSVYGVQ+E+SLQD
Subjt:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD

Query:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQ+AEPDIL+HST +I+E F+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
Subjt:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGFRDSVSF
        S+GDG RD+VSF
Subjt:  SHGDGFRDSVSF

A0A5D3CYH3 Pentatricopeptide repeat (PPR) superfamily protein isoform 23.7e-20685.92Show/hide
Query:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE
        GRN  FGST+ HWLSKGRDL  SKVSVAADYPDSVPDSSSY TN+GYHPLEDLKVCKRAR+TELTAAEVARTAVEVN NALLLFPGTVHSEPHEQVSWDE
Subjt:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC
         QYV DDYGDLYFEIFD+VNMLEDRGAHNPVNALIGMDMQMYESRRI+GDY+A DSG GDV PFDYDYIE VE DL++I VDWGVPDVSSLVHPVYFAKC
Subjt:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC

Query:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD
        LNKV+N+EYD+ MKHPSNGV+ILG LRP +ADEESY+RRLF FE SEGY TEWKGL+GE  + E K D+SSQRSTLYRLEI+RIELFSVYGVQ+E+SLQD
Subjt:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD

Query:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQ+AEPDIL+HST +I+E F+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
Subjt:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGFRDSVSF
        S+GDG RD+VSF
Subjt:  SHGDGFRDSVSF

A0A6J1C800 uncharacterized protein At3g49140 isoform X16.9e-23799.27Show/hide
Query:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE
        GRNPMFGST+LHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVN NALLLFPGTVHSEPHEQVSWDE
Subjt:  GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC
        FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDI VDWGVPDVSSLVHPVYFAKC
Subjt:  FQYVIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKC

Query:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD
        LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD
Subjt:  LNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQD

Query:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
Subjt:  FQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGFRDSVSF
        SHGDGFRDSVSF
Subjt:  SHGDGFRDSVSF

SwissProt top hitse value%identityAlignment
Q0WMN5 Uncharacterized protein At3g491402.8e-4928.47Show/hide
Query:  RDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC--KRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEI
        + LR ++    A+Y DS  D         YHP E+++    +   D+ L+ AE  RT +EVN    L+  G++    HE + W +  Y+ D  G+LYF++
Subjt:  RDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC--KRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEI

Query:  FDNVNMLED-RGAHNPVNALIGMD-MQMYESRRIVG----DYNAPDSGNGDVVPFD-------YDYIEVVE------------------TDLSDILVDWG
         ++ ++++     +N V  ++G D M+M +   ++G    D+   D  +GD    D        +++ ++E                  +D  + L DW 
Subjt:  FDNVNMLED-RGAHNPVNALIGMD-MQMYESRRIVG----DYNAPDSGNGDVVPFD-------YDYIEVVE------------------TDLSDILVDWG

Query:  VPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGS------------EGYTTEWKGLDGEALSFESKSDKSSQ
          +     HP++FAK + +V + +    M  PS G++I G L     ++ S I++      S            +      K    E+    S+ +K+  
Subjt:  VPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGS------------EGYTTEWKGLDGEALSFESKSDKSSQ

Query:  RSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFP
            Y+LE++RI+L +  G QTE+ ++D ++A+PD + H++AEI+    E G +   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++ + RF 
Subjt:  RSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFP

Query:  FKIRATSEVAAEKQIQQLLFPRSRR
        F  RATSE  AE QI++LLFP++ +
Subjt:  FKIRATSEVAAEKQIQQLLFPRSRR

Arabidopsis top hitse value%identityAlignment
AT3G49140.1 Pentatricopeptide repeat (PPR) superfamily protein2.0e-5028.47Show/hide
Query:  RDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC--KRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEI
        + LR ++    A+Y DS  D         YHP E+++    +   D+ L+ AE  RT +EVN    L+  G++    HE + W +  Y+ D  G+LYF++
Subjt:  RDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC--KRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEI

Query:  FDNVNMLED-RGAHNPVNALIGMD-MQMYESRRIVG----DYNAPDSGNGDVVPFD-------YDYIEVVE------------------TDLSDILVDWG
         ++ ++++     +N V  ++G D M+M +   ++G    D+   D  +GD    D        +++ ++E                  +D  + L DW 
Subjt:  FDNVNMLED-RGAHNPVNALIGMD-MQMYESRRIVG----DYNAPDSGNGDVVPFD-------YDYIEVVE------------------TDLSDILVDWG

Query:  VPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGS------------EGYTTEWKGLDGEALSFESKSDKSSQ
          +     HP++FAK + +V + +    M  PS G++I G L     ++ S I++      S            +      K    E+    S+ +K+  
Subjt:  VPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGS------------EGYTTEWKGLDGEALSFESKSDKSSQ

Query:  RSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFP
            Y+LE++RI+L +  G QTE+ ++D ++A+PD + H++AEI+    E G +   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++ + RF 
Subjt:  RSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFP

Query:  FKIRATSEVAAEKQIQQLLFPRSRR
        F  RATSE  AE QI++LLFP++ +
Subjt:  FKIRATSEVAAEKQIQQLLFPRSRR

AT3G59300.1 Pentatricopeptide repeat (PPR) superfamily protein2.5e-13860.59Show/hide
Query:  PMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDEFQY
        P FGS   H  S G DL L+KVSVAADY DSVPDSS Y    GYHPLEDLK  KR ++T+L+A+EVART VE N +A+L+FPG +H EPH+  SW EF+Y
Subjt:  PMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDEFQY

Query:  VIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKCLNK
        VIDDYGD++FEI D+ N+LED GA NPV A  GMD+  YE+ R   +YN  D GN D + FD  Y E+++++  DI +DWG+PD S+ VHP+YFAK L+K
Subjt:  VIDDYGDLYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKCLNK

Query:  VINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQE
         I+M+YD+KM +PSNGVSILGCLRPAF DEESYIRRLF  E  + Y+ E +G D    S  S+ D++   S+LYRLEI+ IEL S+YG ++ ISLQDFQ+
Subjt:  VINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQE

Query:  AEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHG
        AEPDILVHST+ I+E F+ +GI  +IALKALCKK+GLH E+A LI VDSLGMDVRV  G +V+T RFPFK RAT+E+AAEK+I QLLFPRSRR+KL+ H 
Subjt:  AEPDILVHSTAEIVEHFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHG

Query:  DGFRDS
        +  +D+
Subjt:  DGFRDS

AT5G24060.1 Pentatricopeptide repeat (PPR) superfamily protein1.7e-4930.21Show/hide
Query:  SKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC---KRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDL
        S G+ LR ++    A+Y  S  D         YHP ED++     K   D+ L+  E ART +EVN    L+  G +    HE + W +  YV D +G++
Subjt:  SKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC---KRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDL

Query:  YFEIFDNVNMLED--RGAHNPVNALIGMD-MQMYESRRI-------------------VGDYNAPDSGNGDVVPFDYDYIEVVE---------TDLSDIL
        YF++ +N ++++      +N V  ++G D M+M +   +                   V D N  D   G+    D +++ V+E         +D  + L
Subjt:  YFEIFDNVNMLED--RGAHNPVNALIGMD-MQMYESRRI-------------------VGDYNAPDSGNGDVVPFDYDYIEVVE---------TDLSDIL

Query:  VDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQ-------R
         DW   +     HP+YFA+ + +V + +    M  PS G++I G L P   ++ S I++      S G T + K  +     FE   +  S+       R
Subjt:  VDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQ-------R

Query:  STL--YRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRF
        + +  Y+LEI+RI+L +  G QTE+ ++D ++A+PD++  ++  I+    E G +   AL++LC +  G+  E+  LIG+DSLG D+R+C G ++ T RF
Subjt:  STL--YRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRF

Query:  PFKIRATSEVAAEKQIQQLLFPRSRRK
         F IRATSE  AE Q+++LLF  +  K
Subjt:  PFKIRATSEVAAEKQIQQLLFPRSRRK

AT5G24060.2 Pentatricopeptide repeat (PPR) superfamily protein1.4e-4830.09Show/hide
Query:  LRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC---KRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIF
        LR ++    A+Y  S  D         YHP ED++     K   D+ L+  E ART +EVN    L+  G +    HE + W +  YV D +G++YF++ 
Subjt:  LRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVC---KRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIF

Query:  DNVNMLED--RGAHNPVNALIGMD-MQMYESRRI-------------------VGDYNAPDSGNGDVVPFDYDYIEVVE---------TDLSDILVDWGV
        +N ++++      +N V  ++G D M+M +   +                   V D N  D   G+    D +++ V+E         +D  + L DW  
Subjt:  DNVNMLED--RGAHNPVNALIGMD-MQMYESRRI-------------------VGDYNAPDSGNGDVVPFDYDYIEVVE---------TDLSDILVDWGV

Query:  PDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQ-------RSTL--
         +     HP+YFA+ + +V + +    M  PS G++I G L P   ++ S I++      S G T + K  +     FE   +  S+       R+ +  
Subjt:  PDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGVSILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQ-------RSTL--

Query:  YRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIR
        Y+LEI+RI+L +  G QTE+ ++D ++A+PD++  ++  I+    E G +   AL++LC +  G+  E+  LIG+DSLG D+R+C G ++ T RF F IR
Subjt:  YRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIR

Query:  ATSEVAAEKQIQQLLFPRSRRK
        ATSE  AE Q+++LLF  +  K
Subjt:  ATSEVAAEKQIQQLLFPRSRRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GGCAGAAACCCAATGTTTGGATCAACAAAATTACATTGGTTGTCTAAGGGACGTGACCTTCGCTTGTCAAAAGTTTCAGTTGCTGCAGATTATCCAGATTCAGTTCCAGA
TTCATCAAGTTACTTGACTAACCAAGGTTATCATCCTCTTGAAGATCTAAAAGTTTGCAAAAGAGCACGGGACACTGAACTCACTGCAGCAGAAGTAGCAAGGACAGCTG
TGGAGGTCAATGGCAACGCTTTGCTATTATTTCCTGGAACTGTGCACAGTGAACCACACGAACAAGTATCTTGGGATGAGTTTCAATATGTTATCGACGATTATGGAGAT
TTGTATTTTGAAATTTTTGACAATGTGAACATGTTAGAAGATCGTGGAGCACACAATCCTGTGAATGCTTTGATTGGAATGGACATGCAAATGTATGAGAGTAGGAGGAT
AGTTGGAGATTATAATGCGCCAGATAGTGGCAATGGTGATGTTGTTCCTTTTGATTATGACTATATTGAGGTAGTGGAAACTGATTTGTCTGATATTCTAGTTGACTGGG
GAGTTCCAGATGTTTCTAGCTTGGTTCATCCTGTATATTTTGCCAAGTGCTTGAATAAGGTTATCAATATGGAATACGATAAAAAGATGAAGCATCCTTCAAATGGAGTT
TCCATTTTGGGATGTCTCAGACCTGCATTTGCTGATGAAGAATCTTATATAAGAAGATTATTTTACTTTGAAGGAAGTGAAGGCTACACCACAGAATGGAAAGGTTTAGA
TGGTGAAGCCTTGAGCTTCGAGTCCAAAAGTGATAAAAGCAGCCAAAGATCAACTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCTTCTCAGTGTATGGAGTTCAGA
CTGAAATTAGTTTGCAAGATTTTCAAGAAGCTGAGCCTGATATTCTTGTGCACTCCACTGCGGAAATTGTAGAGCATTTTAGTGAGAAGGGTATTAGGTGTAATATTGCT
CTTAAAGCTCTTTGCAAAAAAAGGGGTCTTCATGTTGAGGACGCTATTTTGATAGGAGTCGATAGTCTTGGCATGGATGTGAGGGTATGTTTCGGGACAGAAGTACGGAC
TTTTCGATTTCCCTTCAAAATCCGGGCAACATCTGAAGTTGCAGCAGAGAAGCAAATTCAGCAACTCTTGTTCCCACGGTCTCGTCGTAAGAAATTACGAAGCCATGGAG
ATGGATTTAGAGACAGTGTCAGTTTT
mRNA sequenceShow/hide mRNA sequence
GGCAGAAACCCAATGTTTGGATCAACAAAATTACATTGGTTGTCTAAGGGACGTGACCTTCGCTTGTCAAAAGTTTCAGTTGCTGCAGATTATCCAGATTCAGTTCCAGA
TTCATCAAGTTACTTGACTAACCAAGGTTATCATCCTCTTGAAGATCTAAAAGTTTGCAAAAGAGCACGGGACACTGAACTCACTGCAGCAGAAGTAGCAAGGACAGCTG
TGGAGGTCAATGGCAACGCTTTGCTATTATTTCCTGGAACTGTGCACAGTGAACCACACGAACAAGTATCTTGGGATGAGTTTCAATATGTTATCGACGATTATGGAGAT
TTGTATTTTGAAATTTTTGACAATGTGAACATGTTAGAAGATCGTGGAGCACACAATCCTGTGAATGCTTTGATTGGAATGGACATGCAAATGTATGAGAGTAGGAGGAT
AGTTGGAGATTATAATGCGCCAGATAGTGGCAATGGTGATGTTGTTCCTTTTGATTATGACTATATTGAGGTAGTGGAAACTGATTTGTCTGATATTCTAGTTGACTGGG
GAGTTCCAGATGTTTCTAGCTTGGTTCATCCTGTATATTTTGCCAAGTGCTTGAATAAGGTTATCAATATGGAATACGATAAAAAGATGAAGCATCCTTCAAATGGAGTT
TCCATTTTGGGATGTCTCAGACCTGCATTTGCTGATGAAGAATCTTATATAAGAAGATTATTTTACTTTGAAGGAAGTGAAGGCTACACCACAGAATGGAAAGGTTTAGA
TGGTGAAGCCTTGAGCTTCGAGTCCAAAAGTGATAAAAGCAGCCAAAGATCAACTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCTTCTCAGTGTATGGAGTTCAGA
CTGAAATTAGTTTGCAAGATTTTCAAGAAGCTGAGCCTGATATTCTTGTGCACTCCACTGCGGAAATTGTAGAGCATTTTAGTGAGAAGGGTATTAGGTGTAATATTGCT
CTTAAAGCTCTTTGCAAAAAAAGGGGTCTTCATGTTGAGGACGCTATTTTGATAGGAGTCGATAGTCTTGGCATGGATGTGAGGGTATGTTTCGGGACAGAAGTACGGAC
TTTTCGATTTCCCTTCAAAATCCGGGCAACATCTGAAGTTGCAGCAGAGAAGCAAATTCAGCAACTCTTGTTCCCACGGTCTCGTCGTAAGAAATTACGAAGCCATGGAG
ATGGATTTAGAGACAGTGTCAGTTTT
Protein sequenceShow/hide protein sequence
GRNPMFGSTKLHWLSKGRDLRLSKVSVAADYPDSVPDSSSYLTNQGYHPLEDLKVCKRARDTELTAAEVARTAVEVNGNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGD
LYFEIFDNVNMLEDRGAHNPVNALIGMDMQMYESRRIVGDYNAPDSGNGDVVPFDYDYIEVVETDLSDILVDWGVPDVSSLVHPVYFAKCLNKVINMEYDKKMKHPSNGV
SILGCLRPAFADEESYIRRLFYFEGSEGYTTEWKGLDGEALSFESKSDKSSQRSTLYRLEIMRIELFSVYGVQTEISLQDFQEAEPDILVHSTAEIVEHFSEKGIRCNIA
LKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGFRDSVSF