; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007191 (gene) of Snake gourd v1 genome

Gene IDTan0007191
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationLG10:13612240..13628821
RNA-Seq ExpressionTan0007191
SyntenyTan0007191
Gene Ontology termsNA
InterPro domainsIPR037119 - Haem oxygenase HugZ-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152092.1 uncharacterized protein At3g49140 isoform X2 [Cucumis sativus]6.7e-23290.36Show/hide
Query:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA
        MAIAVASS TFEGA CS SYAFTSSWNRSSFDV GRN  FGSTE HWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKV K  RNTELTA
Subjt:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA

Query:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY
        AEVART VEVNSNALLLFPGTVHSEPHEQVSWDEFQYV DDYGDLYFEIFD VNMLEDR A NPVNALIG+DMQMYESRRI+GDY+  DSG GD+ PFDY
Subjt:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY

Query:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK
        DYIEV E DLA+IP DWGVPDVSS+VHPVYFAKCL KVINMEYDR MKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGET + ESK
Subjt:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK

Query:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
         DRSSQRSTLYRLEIMRIELFSVYGVQSE+SLQDFQDAEPD+L+HSTAEI+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEVR
Subjt:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF
        TFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLRSHGDGLRD VSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF

XP_022137915.1 uncharacterized protein At3g49140 isoform X1 [Momordica charantia]6.7e-24093.05Show/hide
Query:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA
        MAIAVASS TFEGACCSTSYAFTSSWNR S DVRGRNP+FGSTELHWLSKGRDL LSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKV KRAR+TELTA
Subjt:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA

Query:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY
        AEVART VEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFD VNMLEDRGA NPVNALIG+DMQMYESRRI+GDYNA DSGNGD+VPFDY
Subjt:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY

Query:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK
        DYIEV ETDL+DIP DWGVPDVSSLVHPVYFAKCLNKVINMEYD+KMKHPSNGVSILGCLRPA+ADEESYIRRLFYFE SEGY TEWKGL+GE LSFESK
Subjt:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK

Query:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
        SD+SSQRSTLYRLEIMRIELFSVYGVQ+EISLQDFQ+AEPD+LVHSTAEIVE FSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
Subjt:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF
        TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDG RD+VSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF

XP_022956049.1 uncharacterized protein At3g49140-like isoform X2 [Cucurbita moschata]5.9e-22888.34Show/hide
Query:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA
        MAIAVASS TFEGACCSTS+AFTS W+RSSFDVRGRNPIFGSTE HWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKV KRARNTELTA
Subjt:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA

Query:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY
        AEVART VEVNSNALLLFPGTVHSEPHE+VSWDEFQYVIDDYGDLYFEIFD  NMLEDRGA NPV ALIG+D+QMYES R +GDY AADS  GD++PF +
Subjt:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY

Query:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK
        DYIE  ETDLAD P DWGV DVSSLVHP+YFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEG+N EWK L GETL FESK
Subjt:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK

Query:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
        SDRSSQRSTLYRLE MRIELFSVYGVQSE+SLQDF+DAEPD+L+HSTAEIVERF EKGIRCNIALKALCKK+GLHV+DA LIGVDSLGMDVRVCFG EVR
Subjt:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF
        T+RFPFK+RATSEVAAEKQIQQLLFPRSRRK+LRSHGDG+ D  SF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF

XP_038898170.1 uncharacterized protein At3g49140 isoform X1 [Benincasa hispida]1.2e-23690.95Show/hide
Query:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA
        M IAVAS+ TFEGACCSTSYAFTSSWNRSSFDVRGRN  FGSTE HWLSKGRDLCLSKVSVAADYPDSVPDSSS+LTNKGYHPLEDLKV KRARNTELTA
Subjt:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA

Query:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY
        AEVART VEVNSNALLLFPGTVHSEPHEQVSW+EFQYVIDDYGDLYFEIFD VNMLEDRGA NPVNALIG+DMQMYESRR +GDY+AADSG GD+VPFDY
Subjt:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY

Query:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWK-------GLEGE
        DYIEV ETDLADIP DWG PD SSLVHPVYFAKCLNKVINMEYDRKM HPSNGVSILGCLRPAYADEESY+RRLF+FEESEGYNTEWK       GLEGE
Subjt:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWK-------GLEGE

Query:  TLSFESKSDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRV
        TLS ESK DRSSQRSTLYRLEIMRIELFSVYGVQSE+SLQDFQ AEPD+L+HSTAEI+ERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRV
Subjt:  TLSFESKSDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRV

Query:  CFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF
        CFGTEV+TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRD VSF
Subjt:  CFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF

XP_038898179.1 uncharacterized protein At3g49140 isoform X2 [Benincasa hispida]9.7e-23992.38Show/hide
Query:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA
        M IAVAS+ TFEGACCSTSYAFTSSWNRSSFDVRGRN  FGSTE HWLSKGRDLCLSKVSVAADYPDSVPDSSS+LTNKGYHPLEDLKV KRARNTELTA
Subjt:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA

Query:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY
        AEVART VEVNSNALLLFPGTVHSEPHEQVSW+EFQYVIDDYGDLYFEIFD VNMLEDRGA NPVNALIG+DMQMYESRR +GDY+AADSG GD+VPFDY
Subjt:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY

Query:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK
        DYIEV ETDLADIP DWG PD SSLVHPVYFAKCLNKVINMEYDRKM HPSNGVSILGCLRPAYADEESY+RRLF+FEESEGYNTEWKGLEGETLS ESK
Subjt:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK

Query:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
         DRSSQRSTLYRLEIMRIELFSVYGVQSE+SLQDFQ AEPD+L+HSTAEI+ERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEV+
Subjt:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF
        TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRD VSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF

TrEMBL top hitse value%identityAlignment
A0A0A0KW72 Uncharacterized protein3.3e-23290.36Show/hide
Query:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA
        MAIAVASS TFEGA CS SYAFTSSWNRSSFDV GRN  FGSTE HWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKV K  RNTELTA
Subjt:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA

Query:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY
        AEVART VEVNSNALLLFPGTVHSEPHEQVSWDEFQYV DDYGDLYFEIFD VNMLEDR A NPVNALIG+DMQMYESRRI+GDY+  DSG GD+ PFDY
Subjt:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY

Query:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK
        DYIEV E DLA+IP DWGVPDVSS+VHPVYFAKCL KVINMEYDR MKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGET + ESK
Subjt:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK

Query:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
         DRSSQRSTLYRLEIMRIELFSVYGVQSE+SLQDFQDAEPD+L+HSTAEI+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLGMDVRVC GTEVR
Subjt:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF
        TFRFPFKIRATSE AAEKQIQQLLFPRSRRKKLRSHGDGLRD VSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF

A0A1S3BY92 uncharacterized protein At3g49140 isoform X11.9e-22788.12Show/hide
Query:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA
        MA+AVASS TFEGA CSTSYAFTS WNRSSFDV GRN  FGSTE HWLSKGRDLC SKVSVAADYPDSVPDSSSY TNKGYHPLEDLKV KRARNTELTA
Subjt:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA

Query:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY
        AEVART VEVNSNALLLFPGTVHSEPHEQVSWDE QYV DDYGDLYFEIFD VNMLEDRGA NPVNALIG+DMQMYESRRI+GDY+A DSG GD+ PFDY
Subjt:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY

Query:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK
        DYIE  E DLA+IP DWGVPDVSSLVHPVYFAKCLNKV+N+EYDR MKHPSNGV+ILG LRP YADEESY+RRLF FEESEGYNTEWKGLEGET + E K
Subjt:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK

Query:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
         DRSSQRSTLYRLEI+RIELFSVYGVQSE+SLQDFQDAEPD+L+HST +I+ERF+EKGI+CNIALKALCKKRGLHVEDAILIGVDSLG+DVRVCFGTEVR
Subjt:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF
        TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRS+GDGLRD VSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF

A0A6J1C800 uncharacterized protein At3g49140 isoform X13.3e-24093.05Show/hide
Query:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA
        MAIAVASS TFEGACCSTSYAFTSSWNR S DVRGRNP+FGSTELHWLSKGRDL LSKVSVAADYPDSVPDSSSYLTN+GYHPLEDLKV KRAR+TELTA
Subjt:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA

Query:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY
        AEVART VEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFD VNMLEDRGA NPVNALIG+DMQMYESRRI+GDYNA DSGNGD+VPFDY
Subjt:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY

Query:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK
        DYIEV ETDL+DIP DWGVPDVSSLVHPVYFAKCLNKVINMEYD+KMKHPSNGVSILGCLRPA+ADEESYIRRLFYFE SEGY TEWKGL+GE LSFESK
Subjt:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK

Query:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
        SD+SSQRSTLYRLEIMRIELFSVYGVQ+EISLQDFQ+AEPD+LVHSTAEIVE FSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
Subjt:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF
        TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDG RD+VSF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF

A0A6J1GVQ2 uncharacterized protein At3g49140-like isoform X19.2e-22787.95Show/hide
Query:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVR--GRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTEL
        MAIAVASS TFEGACCSTS+AFTS W+RSSFDVR  GRNPIFGSTE HWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKV KRARNTEL
Subjt:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVR--GRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTEL

Query:  TAAEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPF
        TAAEVART VEVNSNALLLFPGTVHSEPHE+VSWDEFQYVIDDYGDLYFEIFD  NMLEDRGA NPV ALIG+D+QMYES R +GDY AADS  GD++PF
Subjt:  TAAEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPF

Query:  DYDYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFE
         +DYIE  ETDLAD P DWGV DVSSLVHP+YFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEG+N EWK L GETL FE
Subjt:  DYDYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFE

Query:  SKSDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTE
        SKSDRSSQRSTLYRLE MRIELFSVYGVQSE+SLQDF+DAEPD+L+HSTAEIVERF EKGIRCNIALKALCKK+GLHV+DA LIGVDSLGMDVRVCFG E
Subjt:  SKSDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTE

Query:  VRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF
        VRT+RFPFK+RATSEVAAEKQIQQLLFPRSRRK+LRSHGDG+ D  SF
Subjt:  VRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF

A0A6J1GXY6 uncharacterized protein At3g49140-like isoform X22.9e-22888.34Show/hide
Query:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA
        MAIAVASS TFEGACCSTS+AFTS W+RSSFDVRGRNPIFGSTE HWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKV KRARNTELTA
Subjt:  MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTA

Query:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY
        AEVART VEVNSNALLLFPGTVHSEPHE+VSWDEFQYVIDDYGDLYFEIFD  NMLEDRGA NPV ALIG+D+QMYES R +GDY AADS  GD++PF +
Subjt:  AEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDY

Query:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK
        DYIE  ETDLAD P DWGV DVSSLVHP+YFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEG+N EWK L GETL FESK
Subjt:  DYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESK

Query:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR
        SDRSSQRSTLYRLE MRIELFSVYGVQSE+SLQDF+DAEPD+L+HSTAEIVERF EKGIRCNIALKALCKK+GLHV+DA LIGVDSLGMDVRVCFG EVR
Subjt:  SDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVR

Query:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF
        T+RFPFK+RATSEVAAEKQIQQLLFPRSRRK+LRSHGDG+ D  SF
Subjt:  TFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDAVSF

SwissProt top hitse value%identityAlignment
Q0WMN5 Uncharacterized protein At3g491402.0e-4528.1Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLK--VHKRARNTELTAAEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVN
        ++    A+Y DS  D         YHP E+++  + +   ++ L+ AE  RT +EVN+   L+  G++    HE + W +  Y+ D  G+LYF++ +  +
Subjt:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLK--VHKRARNTELTAAEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVN

Query:  MLED-RGAPNPVNALIGID-MQMYESRRIIG----DYNAADSGNGDIVPFD-------YDYIEVAE------------------TDLADIPFDWGVPDVS
        +++      N V  ++G D M+M +   ++G    D+   D  +GD    D        +++ + E                  +D  +   DW   +  
Subjt:  MLED-RGAPNPVNALIGID-MQMYESRRIIG----DYNAADSGNGDIVPFD-------YDYIEVAE------------------TDLADIPFDWGVPDVS

Query:  SLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETLSFESKSDRSSQR-STLY
           HP++FAK + +V + +    M  PS G++I G L     ++ S I ++L     +   N + + L           G+    +S  D  ++     Y
Subjt:  SLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETLSFESKSDRSSQR-STLY

Query:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRA
        +LE++RI+L +  G Q+E+ ++D + A+PD + H++AEI+ R  E G +   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++ + RF F  RA
Subjt:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRA

Query:  TSEVAAEKQIQQLLFPRSRR
        TSE  AE QI++LLFP++ +
Subjt:  TSEVAAEKQIQQLLFPRSRR

Arabidopsis top hitse value%identityAlignment
AT3G49140.1 Pentatricopeptide repeat (PPR) superfamily protein1.4e-4628.1Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLK--VHKRARNTELTAAEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVN
        ++    A+Y DS  D         YHP E+++  + +   ++ L+ AE  RT +EVN+   L+  G++    HE + W +  Y+ D  G+LYF++ +  +
Subjt:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLK--VHKRARNTELTAAEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVN

Query:  MLED-RGAPNPVNALIGID-MQMYESRRIIG----DYNAADSGNGDIVPFD-------YDYIEVAE------------------TDLADIPFDWGVPDVS
        +++      N V  ++G D M+M +   ++G    D+   D  +GD    D        +++ + E                  +D  +   DW   +  
Subjt:  MLED-RGAPNPVNALIGID-MQMYESRRIIG----DYNAADSGNGDIVPFD-------YDYIEVAE------------------TDLADIPFDWGVPDVS

Query:  SLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETLSFESKSDRSSQR-STLY
           HP++FAK + +V + +    M  PS G++I G L     ++ S I ++L     +   N + + L           G+    +S  D  ++     Y
Subjt:  SLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYI-RRLFYFEESEGYNTEWKGL----------EGETLSFESKSDRSSQR-STLY

Query:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRA
        +LE++RI+L +  G Q+E+ ++D + A+PD + H++AEI+ R  E G +   ALK+LC +   +  E+  LIG+DSLG D+R+C G ++ + RF F  RA
Subjt:  RLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRA

Query:  TSEVAAEKQIQQLLFPRSRR
        TSE  AE QI++LLFP++ +
Subjt:  TSEVAAEKQIQQLLFPRSRR

AT3G59300.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-14157.54Show/hide
Query:  MAIAVASSFTFEGACCSTSYA--FTS-------------------SWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNK
        M IA ASSF+   + C  SY   F+S                   S N S    R + P FGS   H  S G DL L+KVSVAADY DSVPDSS Y    
Subjt:  MAIAVASSFTFEGACCSTSYA--FTS-------------------SWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNK

Query:  GYHPLEDLKVHKRARNTELTAAEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESR
        GYHPLEDLK  KR + T+L+A+EVARTTVE NS+A+L+FPG +H EPH+  SW EF+YVIDDYGD++FEI D  N+LED GA NPV A  G+D+  YE+ 
Subjt:  GYHPLEDLKVHKRARNTELTAAEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESR

Query:  RIIGDYNAADSGNGDIVPFDYDYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEE
        R   +YN +D GN D + FD  Y E+ +++  DIP DWG+PD S+ VHP+YFAK L+K I+M+YDRKM +PSNGVSILGCLRPA+ DEESYIRRLF  E+
Subjt:  RIIGDYNAADSGNGDIVPFDYDYIEVAETDLADIPFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEE

Query:  SEGYNTEWKGLEGETLSFESKSDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDA
         + Y+ E +G +    S  S+ D +   S+LYRLEI+ IEL S+YG +S ISLQDFQDAEPD+LVHST+ I+ERF+ +GI  +IALKALCKK+GLH E+A
Subjt:  SEGYNTEWKGLEGETLSFESKSDRSSQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDA

Query:  ILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDA
         LI VDSLGMDVRV  G +V+T RFPFK RAT+E+AAEK+I QLLFPRSRR+KL+ H + L+DA
Subjt:  ILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGLRDA

AT5G24060.1 Pentatricopeptide repeat (PPR) superfamily protein1.7e-4729.21Show/hide
Query:  SKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVH---KRARNTELTAAEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDL
        S G+ L  ++    A+Y  S  D         YHP ED++ +   K   ++ L+  E ART +EVN    L+  G +    HE + W +  YV D +G++
Subjt:  SKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVH---KRARNTELTAAEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDL

Query:  YFEI-----------------------FDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDYDYIEVAE---------TDLADI
        YF++                       FD + M++D    +P     GI+ ++ +    + D N  D   G+    D +++ V E         +D  + 
Subjt:  YFEI-----------------------FDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDYDYIEVAE---------TDLADI

Query:  PFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETLSFESKSDRS
          DW   +     HP+YFA+ + +V + +    M  PS G++I G L P   ++ S I++             +E E     ++G+ GE  S     + S
Subjt:  PFDWGVPDVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETLSFESKSDRS

Query:  SQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFR
              Y+LEI+RI+L +  G Q+E+ ++D + A+PDV+  ++  I+ R  E G +   AL++LC +  G+  E+  LIG+DSLG D+R+C G ++ T R
Subjt:  SQRSTLYRLEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFR

Query:  FPFKIRATSEVAAEKQIQQLLFPRSRRK
        F F IRATSE  AE Q+++LLF  +  K
Subjt:  FPFKIRATSEVAAEKQIQQLLFPRSRRK

AT5G24060.2 Pentatricopeptide repeat (PPR) superfamily protein1.9e-4629.05Show/hide
Query:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVH---KRARNTELTAAEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEI----
        ++    A+Y  S  D         YHP ED++ +   K   ++ L+  E ART +EVN    L+  G +    HE + W +  YV D +G++YF++    
Subjt:  SKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVH---KRARNTELTAAEVARTTVEVNSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEI----

Query:  -------------------FDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDYDYIEVAE---------TDLADIPFDWGVPD
                           FD + M++D    +P     GI+ ++ +    + D N  D   G+    D +++ V E         +D  +   DW   +
Subjt:  -------------------FDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDYDYIEVAE---------TDLADIPFDWGVPD

Query:  VSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETLSFESKSDRSSQRSTLYR
             HP+YFA+ + +V + +    M  PS G++I G L P   ++ S I++             +E E     ++G+ GE  S     + S      Y+
Subjt:  VSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLF---------YFEESEGYNTEWKGLEGETLSFESKSDRSSQRSTLYR

Query:  LEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRAT
        LEI+RI+L +  G Q+E+ ++D + A+PDV+  ++  I+ R  E G +   AL++LC +  G+  E+  LIG+DSLG D+R+C G ++ T RF F IRAT
Subjt:  LEIMRIELFSVYGVQSEISLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALC-KKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRAT

Query:  SEVAAEKQIQQLLFPRSRRK
        SE  AE Q+++LLF  +  K
Subjt:  SEVAAEKQIQQLLFPRSRRK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAATTGCTGTAGCTTCTTCATTTACCTTCGAAGGGGCTTGTTGCTCGACATCATATGCATTCACAAGCAGTTGGAATAGATCTTCTTTTGACGTTCGTGGCAGAAA
TCCAATATTTGGATCAACAGAATTACATTGGTTGTCTAAGGGACGTGACCTTTGCTTGTCAAAAGTTTCAGTTGCTGCTGATTACCCAGATTCAGTTCCAGATTCATCAA
GTTATTTGACTAACAAAGGTTATCATCCTCTTGAAGATCTAAAAGTTCACAAAAGAGCACGGAACACTGAACTCACTGCTGCAGAAGTTGCAAGGACAACTGTGGAGGTC
AATAGCAACGCTTTGCTATTATTTCCTGGAACTGTGCACAGTGAACCACACGAACAAGTATCGTGGGATGAGTTTCAATATGTTATTGACGATTATGGAGATTTGTATTT
TGAAATTTTTGACTGTGTGAACATGTTAGAAGATCGTGGAGCACCCAATCCTGTGAATGCTTTGATTGGAATAGACATGCAAATGTATGAGAGTAGGAGGATAATTGGAG
ATTATAATGCGGCAGATAGTGGCAATGGTGATATTGTTCCTTTTGATTATGACTATATTGAGGTAGCGGAAACCGATTTGGCTGATATCCCATTTGACTGGGGAGTTCCA
GATGTTTCTAGCTTGGTTCATCCCGTATATTTTGCCAAGTGCTTGAATAAGGTTATCAATATGGAATATGACAGAAAGATGAAGCATCCTTCAAATGGAGTTTCCATTTT
GGGATGTCTCAGACCTGCATATGCTGATGAAGAATCTTATATAAGAAGATTATTTTACTTTGAAGAAAGTGAAGGCTACAACACAGAATGGAAAGGTTTAGAAGGCGAAA
CCTTGAGCTTCGAGTCCAAAAGTGATAGAAGCAGCCAAAGATCCACTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCTTCTCTGTGTACGGAGTTCAGTCTGAAATT
AGTTTGCAAGATTTTCAAGATGCTGAACCTGATGTTCTTGTGCACTCTACTGCGGAAATTGTAGAGCGTTTTAGTGAGAAGGGTATTAGGTGCAATATTGCCCTTAAAGC
TCTTTGCAAAAAGAGGGGTCTTCATGTTGAGGACGCTATTCTGATTGGGGTCGATAGTCTTGGCATGGATGTGAGGGTATGTTTCGGAACGGAAGTACGGACTTTTCGAT
TTCCCTTTAAAATCCGGGCAACGTCTGAAGTTGCAGCAGAGAAGCAGATTCAACAACTCTTGTTCCCACGATCTCGTCGTAAAAAATTACGAAGCCATGGGGATGGATTG
AGAGATGCTGTGAGTTTTTAG
mRNA sequenceShow/hide mRNA sequence
GGCATGAAACCATGAACTTGTGTACTATTTTGGAAATAGAAACATGTGGTGTCTGTCTTATGCCTTTTCACTTTTGCATTAAAATTTTTTGCACTTGCCATTATTTTATA
GGTATATAAAAACAGAATTGATGGACCAATTTGTTATTTTCTACATTTATTAAACTGAGATTTCTGACTCTAAAAAAAAAAAAACCCAGAGATTTCTGTACACGTTTGGA
GATCTTCCCTTCTCGCTGAAGTTCGAATTTCGATGGCCAATTCGTACTTCGAACTATACGTAGAGACCGATACCCATCTCAGCTTGGTTCGACTTTGATATCTTTCTCTA
ATGGCAATTGCTGTAGCTTCTTCATTTACCTTCGAAGGGGCTTGTTGCTCGACATCATATGCATTCACAAGCAGTTGGAATAGATCTTCTTTTGACGTTCGTGGCAGAAA
TCCAATATTTGGATCAACAGAATTACATTGGTTGTCTAAGGGACGTGACCTTTGCTTGTCAAAAGTTTCAGTTGCTGCTGATTACCCAGATTCAGTTCCAGATTCATCAA
GTTATTTGACTAACAAAGGTTATCATCCTCTTGAAGATCTAAAAGTTCACAAAAGAGCACGGAACACTGAACTCACTGCTGCAGAAGTTGCAAGGACAACTGTGGAGGTC
AATAGCAACGCTTTGCTATTATTTCCTGGAACTGTGCACAGTGAACCACACGAACAAGTATCGTGGGATGAGTTTCAATATGTTATTGACGATTATGGAGATTTGTATTT
TGAAATTTTTGACTGTGTGAACATGTTAGAAGATCGTGGAGCACCCAATCCTGTGAATGCTTTGATTGGAATAGACATGCAAATGTATGAGAGTAGGAGGATAATTGGAG
ATTATAATGCGGCAGATAGTGGCAATGGTGATATTGTTCCTTTTGATTATGACTATATTGAGGTAGCGGAAACCGATTTGGCTGATATCCCATTTGACTGGGGAGTTCCA
GATGTTTCTAGCTTGGTTCATCCCGTATATTTTGCCAAGTGCTTGAATAAGGTTATCAATATGGAATATGACAGAAAGATGAAGCATCCTTCAAATGGAGTTTCCATTTT
GGGATGTCTCAGACCTGCATATGCTGATGAAGAATCTTATATAAGAAGATTATTTTACTTTGAAGAAAGTGAAGGCTACAACACAGAATGGAAAGGTTTAGAAGGCGAAA
CCTTGAGCTTCGAGTCCAAAAGTGATAGAAGCAGCCAAAGATCCACTCTCTACAGGTTGGAGATAATGAGAATTGAGCTCTTCTCTGTGTACGGAGTTCAGTCTGAAATT
AGTTTGCAAGATTTTCAAGATGCTGAACCTGATGTTCTTGTGCACTCTACTGCGGAAATTGTAGAGCGTTTTAGTGAGAAGGGTATTAGGTGCAATATTGCCCTTAAAGC
TCTTTGCAAAAAGAGGGGTCTTCATGTTGAGGACGCTATTCTGATTGGGGTCGATAGTCTTGGCATGGATGTGAGGGTATGTTTCGGAACGGAAGTACGGACTTTTCGAT
TTCCCTTTAAAATCCGGGCAACGTCTGAAGTTGCAGCAGAGAAGCAGATTCAACAACTCTTGTTCCCACGATCTCGTCGTAAAAAATTACGAAGCCATGGGGATGGATTG
AGAGATGCTGTGAGTTTTTAGAATACCCTGTGCATTATTTTAAGATTTGGAGGATCTTGGAAATAAATTTCAAAGCTTTAAGGGGACTTCTTTACTCAAATCTGTCATAA
TTGGTCTTAGTTTCAGAATTGTACAGTTCCAAGTTGATAATTTATCCTCTGTTTTTTTAATGGATTCCCTTGTTCGCCC
Protein sequenceShow/hide protein sequence
MAIAVASSFTFEGACCSTSYAFTSSWNRSSFDVRGRNPIFGSTELHWLSKGRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVHKRARNTELTAAEVARTTVEV
NSNALLLFPGTVHSEPHEQVSWDEFQYVIDDYGDLYFEIFDCVNMLEDRGAPNPVNALIGIDMQMYESRRIIGDYNAADSGNGDIVPFDYDYIEVAETDLADIPFDWGVP
DVSSLVHPVYFAKCLNKVINMEYDRKMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGLEGETLSFESKSDRSSQRSTLYRLEIMRIELFSVYGVQSEI
SLQDFQDAEPDVLVHSTAEIVERFSEKGIRCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCFGTEVRTFRFPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSHGDGL
RDAVSF