; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10001823 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10001823
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationChr11:721701..730855
RNA-Seq ExpressionHG10001823
SyntenyHG10001823
Gene Ontology termsNA
InterPro domainsIPR037119 - Haem oxygenase HugZ-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044625.1 Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Cucumis melo var. makuwa]1.6e-21991.28Show/hide
Query:  TGRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWD
        +GRNKK GSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCKR RNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWD
Subjt:  TGRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWD

Query:  EFQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAK
        E QYV +DYGDLYFE+FDSVNMLEDRGAHNPVNALIGMDMQMYES R +GDYS  DSGYGDV PFDYDYIE VE+DLA+IPVDWGVP+VSSLVHPVYFAK
Subjt:  EFQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAK

Query:  CLNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQ
        CLNKV+NVEYDR MKHPSNGV+ILGCLRP YADEESY+RRLF FEESEGYNTEWKGLEGET +LE KIDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQ
Subjt:  CLNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQ

Query:  DFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKL
        DFQDAEPDILLHST +I+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKL
Subjt:  DFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKL

Query:  RSHGDGLRDTVSF
        RS+GDGLRDTVSF
Subjt:  RSHGDGLRDTVSF

XP_004152092.1 uncharacterized protein At3g49140 isoform X2 [Cucumis sativus]4.8e-22493.69Show/hide
Query:  GRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
        GRNKK GSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCK  RNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
Subjt:  GRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKC
        FQYV DDYGDLYFE+FDSVNMLEDR AHNPVNALIGMDMQMYES R VGDYS+ DSGYGDV PFDYDYIEVVE+DLA+IPVDWGVP+VSS+VHPVYFAKC
Subjt:  FQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKC

Query:  LNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD
        L KVIN+EYDR MKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGET +LESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD
Subjt:  LNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD

Query:  FQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQDAEPDILLHST EI+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEVRTFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLR
Subjt:  FQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGLRDTVSF
        SHGDGLRDTVSF
Subjt:  SHGDGLRDTVSF

XP_031740660.1 uncharacterized protein At3g49140 isoform X1 [Cucumis sativus]4.8e-22493.69Show/hide
Query:  GRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
        GRNKK GSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCK  RNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
Subjt:  GRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKC
        FQYV DDYGDLYFE+FDSVNMLEDR AHNPVNALIGMDMQMYES R VGDYS+ DSGYGDV PFDYDYIEVVE+DLA+IPVDWGVP+VSS+VHPVYFAKC
Subjt:  FQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKC

Query:  LNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD
        L KVIN+EYDR MKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGET +LESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD
Subjt:  LNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD

Query:  FQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQDAEPDILLHST EI+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEVRTFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLR
Subjt:  FQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGLRDTVSF
        SHGDGLRDTVSF
Subjt:  SHGDGLRDTVSF

XP_038898170.1 uncharacterized protein At3g49140 isoform X1 [Benincasa hispida]3.4e-22593.56Show/hide
Query:  GRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
        GRNK+ GSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSS+LTNKGYHPLEDLKVCKR RNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSW+E
Subjt:  GRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKC
        FQYVIDDYGDLYFE+FDSVNMLEDRGAHNPVNALIGMDMQMYES RTVGDYS ADSGYGDVVPFDYDYIEVVE+DLADIPVDWG P+ SSLVHPVYFAKC
Subjt:  FQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKC

Query:  LNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWK-------GLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQ
        LNKVIN+EYDR M HPSNGVSILGCLRPAYADEESY+RRLF+FEESEGYNTEWK       GLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQ
Subjt:  LNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWK-------GLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQ

Query:  SEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPR
        SEVSLQDFQ AEPDILLHST EIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEV+TFRFPFKIRATSEVAAEKQIQQLLFPR
Subjt:  SEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPR

Query:  SRRKKLRSHGDGLRDTVSF
        SRRKKLRSHGDGLRDTVSF
Subjt:  SRRKKLRSHGDGLRDTVSF

XP_038898179.1 uncharacterized protein At3g49140 isoform X2 [Benincasa hispida]2.7e-22795.15Show/hide
Query:  GRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
        GRNK+ GSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSS+LTNKGYHPLEDLKVCKR RNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSW+E
Subjt:  GRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKC
        FQYVIDDYGDLYFE+FDSVNMLEDRGAHNPVNALIGMDMQMYES RTVGDYS ADSGYGDVVPFDYDYIEVVE+DLADIPVDWG P+ SSLVHPVYFAKC
Subjt:  FQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKC

Query:  LNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD
        LNKVIN+EYDR M HPSNGVSILGCLRPAYADEESY+RRLF+FEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD
Subjt:  LNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD

Query:  FQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQ AEPDILLHST EIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEV+TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
Subjt:  FQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGLRDTVSF
        SHGDGLRDTVSF
Subjt:  SHGDGLRDTVSF

TrEMBL top hitse value%identityAlignment
A0A0A0KW72 Uncharacterized protein2.3e-22493.69Show/hide
Query:  GRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
        GRNKK GSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCK  RNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
Subjt:  GRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKC
        FQYV DDYGDLYFE+FDSVNMLEDR AHNPVNALIGMDMQMYES R VGDYS+ DSGYGDV PFDYDYIEVVE+DLA+IPVDWGVP+VSS+VHPVYFAKC
Subjt:  FQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKC

Query:  LNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD
        L KVIN+EYDR MKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGET +LESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD
Subjt:  LNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD

Query:  FQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQDAEPDILLHST EI+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEVRTFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLR
Subjt:  FQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGLRDTVSF
        SHGDGLRDTVSF
Subjt:  SHGDGLRDTVSF

A0A1S3BY92 uncharacterized protein At3g49140 isoform X15.1e-21991.5Show/hide
Query:  GRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
        GRNKK GSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCKR RNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
Subjt:  GRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKC
         QYV DDYGDLYFE+FDSVNMLEDRGAHNPVNALIGMDMQMYES R +GDYS  DSGYGDV PFDYDYIE VE+DLA+IPVDWGVP+VSSLVHPVYFAKC
Subjt:  FQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKC

Query:  LNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD
        LNKV+NVEYDR MKHPSNGV+ILG LRP YADEESY+RRLF FEESEGYNTEWKGLEGET +LE KIDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQD
Subjt:  LNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD

Query:  FQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQDAEPDILLHST +I+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
Subjt:  FQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGLRDTVSF
        S+GDGLRDTVSF
Subjt:  SHGDGLRDTVSF

A0A1S3BYP0 uncharacterized protein At3g49140 isoform X25.1e-21991.5Show/hide
Query:  GRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
        GRNKK GSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCKR RNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE
Subjt:  GRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDE

Query:  FQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKC
         QYV DDYGDLYFE+FDSVNMLEDRGAHNPVNALIGMDMQMYES R +GDYS  DSGYGDV PFDYDYIE VE+DLA+IPVDWGVP+VSSLVHPVYFAKC
Subjt:  FQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKC

Query:  LNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD
        LNKV+NVEYDR MKHPSNGV+ILG LRP YADEESY+RRLF FEESEGYNTEWKGLEGET +LE KIDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQD
Subjt:  LNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQD

Query:  FQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
        FQDAEPDILLHST +I+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR
Subjt:  FQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLR

Query:  SHGDGLRDTVSF
        S+GDGLRDTVSF
Subjt:  SHGDGLRDTVSF

A0A5A7TTC0 Pentatricopeptide repeat (PPR) superfamily protein isoform 27.8e-22091.28Show/hide
Query:  TGRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWD
        +GRNKK GSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCKR RNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWD
Subjt:  TGRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWD

Query:  EFQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAK
        E QYV +DYGDLYFE+FDSVNMLEDRGAHNPVNALIGMDMQMYES R +GDYS  DSGYGDV PFDYDYIE VE+DLA+IPVDWGVP+VSSLVHPVYFAK
Subjt:  EFQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAK

Query:  CLNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQ
        CLNKV+NVEYDR MKHPSNGV+ILGCLRP YADEESY+RRLF FEESEGYNTEWKGLEGET +LE KIDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQ
Subjt:  CLNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQ

Query:  DFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKL
        DFQDAEPDILLHST +I+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKL
Subjt:  DFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKL

Query:  RSHGDGLRDTVSF
        RS+GDGLRDTVSF
Subjt:  RSHGDGLRDTVSF

A0A5D3CYH3 Pentatricopeptide repeat (PPR) superfamily protein isoform 23.9e-21991.28Show/hide
Query:  TGRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWD
        +GRNKK GSTEFHWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKVCKR RNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWD
Subjt:  TGRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWD

Query:  EFQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAK
        E QYV DDYGDLYFE+FDSVNMLEDRGAHNPVNALIGMDMQMYES R +GDYS  DSGYGDV PFDYDYIE VE+DLA+IPVDWGVP+VSSLVHPVYFAK
Subjt:  EFQYVIDDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAK

Query:  CLNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQ
        CLNKV+NVEYDR MKHPSNGV+ILG LRP YADEESY+RRLF FEESEGYNTEWKGLEGET +LE KIDRSSQRSTLYRLEI+RIELFSVYGVQSEVSLQ
Subjt:  CLNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQ

Query:  DFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKL
        DFQDAEPDILLHST +I+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKL
Subjt:  DFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKL

Query:  RSHGDGLRDTVSF
        RS+GDGLRDTVSF
Subjt:  RSHGDGLRDTVSF

SwissProt top hitse value%identityAlignment
Q0WMN5 Uncharacterized protein At3g491402.2e-4629.93Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRN---TELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEVFDSV
        ++    A+Y DS  D         YHP E+++    P+N   + L+ AE  RT +EVN+   L+  G++    HE + W +  Y+ D  G+LYF+V +  
Subjt:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRN---TELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEVFDSV

Query:  NMLED-RGAHNPVNALIGMD-MQMYESWRTVG----DYSEADSGYGDVVPFD-------YDYIEVVE------------------SDLADIPVDWGVPEV
        ++++     +N V  ++G D M+M +    +G    D+   D   GD    D        +++ ++E                  SD  +   DW   E 
Subjt:  NMLED-RGAHNPVNALIGMD-MQMYESWRTVG----DYSEADSGYGDVVPFD-------YDYIEVVE------------------SDLADIPVDWGVPEV

Query:  SSLVHPVYFAKCLNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL------EGETLSLESKIDRSSQRS-----TL
            HP++FAK + +V + +    M  PS G++I G L     ++ S I ++L     +   N + + L        +    ES+ID S           
Subjt:  SSLVHPVYFAKCLNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL------EGETLSLESKIDRSSQRS-----TL

Query:  YRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIR
        Y+LE++RI+L +  G Q+EV ++D + A+PD + H++ EII R  E G +   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++ + RF F  R
Subjt:  YRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIR

Query:  ATSEVAAEKQIQQLLFPRSRR
        ATSE  AE QI++LLFP++ +
Subjt:  ATSEVAAEKQIQQLLFPRSRR

Arabidopsis top hitse value%identityAlignment
AT3G49140.1 Pentatricopeptide repeat (PPR) superfamily protein1.6e-4729.93Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRN---TELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEVFDSV
        ++    A+Y DS  D         YHP E+++    P+N   + L+ AE  RT +EVN+   L+  G++    HE + W +  Y+ D  G+LYF+V +  
Subjt:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRN---TELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEVFDSV

Query:  NMLED-RGAHNPVNALIGMD-MQMYESWRTVG----DYSEADSGYGDVVPFD-------YDYIEVVE------------------SDLADIPVDWGVPEV
        ++++     +N V  ++G D M+M +    +G    D+   D   GD    D        +++ ++E                  SD  +   DW   E 
Subjt:  NMLED-RGAHNPVNALIGMD-MQMYESWRTVG----DYSEADSGYGDVVPFD-------YDYIEVVE------------------SDLADIPVDWGVPEV

Query:  SSLVHPVYFAKCLNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL------EGETLSLESKIDRSSQRS-----TL
            HP++FAK + +V + +    M  PS G++I G L     ++ S I ++L     +   N + + L        +    ES+ID S           
Subjt:  SSLVHPVYFAKCLNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL------EGETLSLESKIDRSSQRS-----TL

Query:  YRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIR
        Y+LE++RI+L +  G Q+EV ++D + A+PD + H++ EII R  E G +   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++ + RF F  R
Subjt:  YRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIR

Query:  ATSEVAAEKQIQQLLFPRSRR
        ATSE  AE QI++LLFP++ +
Subjt:  ATSEVAAEKQIQQLLFPRSRR

AT3G59300.1 Pentatricopeptide repeat (PPR) superfamily protein1.8e-13660.8Show/hide
Query:  FHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGD
        FH  S G DL L+KVSVAADY DSVPDSS Y    GYHPLEDLK  KR + T+L+A+EVART VE NS+A+L+FPG +H EPH+  SW EF+YVIDDYGD
Subjt:  FHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGD

Query:  LYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKCLNKVINVEYD
        ++FE+ D  N+LED GA NPV A  GMD+  YE+ R   +Y+ +D G  D + FD  Y E+++S+  DIP+DWG+P+ S+ VHP+YFAK L+K I+++YD
Subjt:  LYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKCLNKVINVEYD

Query:  RMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILL
        R M +PSNGVSILGCLRPA+ DEESYIRRLF  E+ + Y+ E +G +    S  S+ D +   S+LYRLEI+ IEL S+YG +S +SLQDFQDAEPDIL+
Subjt:  RMMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILL

Query:  HSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRD
        HST+ IIERF+ +GI  +IALKALCKK+GLH E+A LI VDSLGMDVRV  G +V+T RFPFK RAT+E+AAEK+I QLLFPRSRR+KL+ H + L+D
Subjt:  HSTTEIIERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRD

AT5G24060.1 Pentatricopeptide repeat (PPR) superfamily protein2.2e-4929.91Show/hide
Query:  SKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDL
        S G+ L  ++    A+Y  S  D         YHP ED++     K P ++ L+  E ART +EVN    L+  G +    HE + W +  YV D +G++
Subjt:  SKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDL

Query:  YFEV-----------------------FDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVE---------SDLADI
        YF+V                       FD++ M++D    +P     G++ ++ +    V D ++ D   G+    D +++ V+E         SD  + 
Subjt:  YFEV-----------------------FDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVE---------SDLADI

Query:  PVDWGVPEVSSLVHPVYFAKCLNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETLSLESKIDRS
          DW   E     HP+YFA+ + +V + +    M  PS G++I G L P   ++ S I++             +E E     ++G+ GE  S    ++ S
Subjt:  PVDWGVPEVSSLVHPVYFAKCLNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETLSLESKIDRS

Query:  SQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFR
              Y+LEI+RI+L +  G Q+EV ++D + A+PD++  ++  I+ R  E G +   AL++LC +  G+  E+  LIG+DSLG D+R+C G ++ T R
Subjt:  SQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFR

Query:  FPFKIRATSEVAAEKQIQQLLFPRSRRK
        F F IRATSE  AE Q+++LLF  +  K
Subjt:  FPFKIRATSEVAAEKQIQQLLFPRSRRK

AT5G24060.2 Pentatricopeptide repeat (PPR) superfamily protein2.4e-4829.76Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEV----
        ++    A+Y  S  D         YHP ED++     K P ++ L+  E ART +EVN    L+  G +    HE + W +  YV D +G++YF+V    
Subjt:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVC---KRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEV----

Query:  -------------------FDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVE---------SDLADIPVDWGVPE
                           FD++ M++D    +P     G++ ++ +    V D ++ D   G+    D +++ V+E         SD  +   DW   E
Subjt:  -------------------FDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVE---------SDLADIPVDWGVPE

Query:  VSSLVHPVYFAKCLNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYR
             HP+YFA+ + +V + +    M  PS G++I G L P   ++ S I++             +E E     ++G+ GE  S    ++ S      Y+
Subjt:  VSSLVHPVYFAKCLNKVINVEYDRMMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYR

Query:  LEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRAT
        LEI+RI+L +  G Q+EV ++D + A+PD++  ++  I+ R  E G +   AL++LC +  G+  E+  LIG+DSLG D+R+C G ++ T RF F IRAT
Subjt:  LEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRAT

Query:  SEVAAEKQIQQLLFPRSRRK
        SE  AE Q+++LLF  +  K
Subjt:  SEVAAEKQIQQLLFPRSRRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGAGGTTAACTGGCAGAAATAAAAAAATTGGATCAACAGAATTTCATTGGTTGTCTAAGGGACGTGACCTTTGCTTGTCAAAAGTTTCAGTGGCTGCTGATTACCC
AGATTCAGTTCCAGATTCATCAAGTTATTTAACTAACAAAGGTTATCATCCTCTTGAAGATCTAAAAGTTTGCAAAAGACCACGAAATACTGAACTCACTGCTGCTGAAG
TAGCAAGGACGGCTGTGGAGGTCAATAGCAATGCTTTGCTGTTATTTCCTGGAACTGTGCACAGTGAACCACATGAACAAGTATCGTGGGATGAGTTTCAATATGTTATT
GACGATTATGGAGATTTGTATTTTGAAGTTTTTGATAGTGTGAACATGTTAGAAGATCGTGGAGCACACAATCCTGTGAATGCTTTGATCGGAATGGACATGCAAATGTA
TGAGAGTTGGAGGACAGTTGGAGATTATAGTGAGGCAGATAGTGGCTATGGTGATGTTGTTCCTTTTGATTATGATTATATTGAGGTAGTGGAATCTGATTTAGCTGATA
TTCCAGTTGACTGGGGAGTTCCAGAGGTTTCTAGCTTGGTTCATCCTGTATATTTTGCCAAGTGCTTGAATAAGGTTATCAATGTGGAATATGACAGAATGATGAAACAT
CCTTCAAATGGGGTTTCCATTTTGGGATGTCTCAGACCTGCATATGCTGATGAAGAATCTTATATAAGAAGATTATTTTACTTTGAAGAAAGTGAAGGCTACAACACAGA
ATGGAAAGGTTTAGAAGGTGAAACCTTGAGCTTGGAGTCCAAAATTGATAGAAGCAGCCAAAGATCTACTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCTTCTCTG
TGTATGGAGTTCAGTCTGAAGTTAGTTTGCAAGATTTTCAAGATGCTGAACCTGATATTCTTCTGCACTCTACTACGGAAATTATAGAGCGTTTTAGTGAGAAGGGTATT
AGGTGCAATATTGCCCTTAAAGCTCTTTGCAAAAAGAGGGGTCTTCATGTTGAGGATGCTATTTTGATCGGAGTTGATAGTCTTGGCATGGATGTGAGAGTATGCTTTGG
GACAGAAGTACGGACTTTTCGTTTTCCTTTTAAAATCCGGGCAACATCAGAAGTTGCAGCAGAGAAGCAGATCCAACAACTCTTGTTCCCACGATCTCGTCGTAAAAAAT
TACGAAGCCATGGGGATGGATTGAGAGATACTGTCAGTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGAGGTTAACTGGCAGAAATAAAAAAATTGGATCAACAGAATTTCATTGGTTGTCTAAGGGACGTGACCTTTGCTTGTCAAAAGTTTCAGTGGCTGCTGATTACCC
AGATTCAGTTCCAGATTCATCAAGTTATTTAACTAACAAAGGTTATCATCCTCTTGAAGATCTAAAAGTTTGCAAAAGACCACGAAATACTGAACTCACTGCTGCTGAAG
TAGCAAGGACGGCTGTGGAGGTCAATAGCAATGCTTTGCTGTTATTTCCTGGAACTGTGCACAGTGAACCACATGAACAAGTATCGTGGGATGAGTTTCAATATGTTATT
GACGATTATGGAGATTTGTATTTTGAAGTTTTTGATAGTGTGAACATGTTAGAAGATCGTGGAGCACACAATCCTGTGAATGCTTTGATCGGAATGGACATGCAAATGTA
TGAGAGTTGGAGGACAGTTGGAGATTATAGTGAGGCAGATAGTGGCTATGGTGATGTTGTTCCTTTTGATTATGATTATATTGAGGTAGTGGAATCTGATTTAGCTGATA
TTCCAGTTGACTGGGGAGTTCCAGAGGTTTCTAGCTTGGTTCATCCTGTATATTTTGCCAAGTGCTTGAATAAGGTTATCAATGTGGAATATGACAGAATGATGAAACAT
CCTTCAAATGGGGTTTCCATTTTGGGATGTCTCAGACCTGCATATGCTGATGAAGAATCTTATATAAGAAGATTATTTTACTTTGAAGAAAGTGAAGGCTACAACACAGA
ATGGAAAGGTTTAGAAGGTGAAACCTTGAGCTTGGAGTCCAAAATTGATAGAAGCAGCCAAAGATCTACTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCTTCTCTG
TGTATGGAGTTCAGTCTGAAGTTAGTTTGCAAGATTTTCAAGATGCTGAACCTGATATTCTTCTGCACTCTACTACGGAAATTATAGAGCGTTTTAGTGAGAAGGGTATT
AGGTGCAATATTGCCCTTAAAGCTCTTTGCAAAAAGAGGGGTCTTCATGTTGAGGATGCTATTTTGATCGGAGTTGATAGTCTTGGCATGGATGTGAGAGTATGCTTTGG
GACAGAAGTACGGACTTTTCGTTTTCCTTTTAAAATCCGGGCAACATCAGAAGTTGCAGCAGAGAAGCAGATCCAACAACTCTTGTTCCCACGATCTCGTCGTAAAAAAT
TACGAAGCCATGGGGATGGATTGAGAGATACTGTCAGTTTTTAG
Protein sequenceShow/hide protein sequence
MLRLTGRNKKIGSTEFHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKRPRNTELTAAEVARTAVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVI
DDYGDLYFEVFDSVNMLEDRGAHNPVNALIGMDMQMYESWRTVGDYSEADSGYGDVVPFDYDYIEVVESDLADIPVDWGVPEVSSLVHPVYFAKCLNKVINVEYDRMMKH
PSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTTEIIERFSEKGI
RCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDTVSF