; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040248 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040248
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionP-loop containing nucleoside triphosphate hydrolases superfamily protein
Genome locationchr13:3246864..3249327
RNA-Seq ExpressionLag0040248
SyntenyLag0040248
Gene Ontology termsGO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR008978 - HSP20-like chaperone
IPR025723 - Anion-transporting ATPase-like domain
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR040612 - ArsA, HSP20-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446550.1 PREDICTED: uncharacterized protein At1g26090, chloroplastic [Cucumis melo]1.1e-21887.97Show/hide
Query:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSLLFS SFFGNPIPISIRTRT PC  R + ++ASK+T DVSSQNPTR+LTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVI NQDPTPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM
        NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEEL +LPGMDSIFS+LQLER +GFSGIM QRDQK KYDIVIYDG+CTEET+RM
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM

Query:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG
        IGATSK RLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGS L GRTS TDIWE LE +LEKGSSAF+EP +FSC++VMDPTSPASVQSALRYWG
Subjt:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG

Query:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS
        CTIQAGAQI GA AF SSH +AE+ A LKE FSPLSLAF+PQFSIGS VDWNTVL +ASSKGPRDL  SSKS TSSL+ PVKFDPGNKSVTLLMPGF KS
Subjt:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS

Query:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Subjt:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

XP_022956773.1 uncharacterized protein At1g26090, chloroplastic [Cucurbita moschata]1.7e-21988.86Show/hide
Query:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSLLFSASFFG+PIPISIRTRTAPCRRRS+AIEASKE  DVSSQN  RMLTFLGKGGSGKTTSAVFAA+HFALSGLRTCLVIHNQD TPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM
        NSPVECS NLSAVRLETTQMLLEPLKRLKQADSRLNMTQG LEG+VGEELGILPGMDSIFSVLQLER LG SGIM Q DQK KYDIV+YDGICTEET+RM
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM

Query:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG
        IGATSKARLYLKYLRSIAEKTDLGRLATPSI+RLVDEAM IS PGS LSGRTS TD W+ALER+LEKGSSA +EP RFSCF+VMDPTSPASV+SA RYWG
Subjt:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG

Query:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS
        CTIQAGAQISGAFA ISS LDAES ARLKE+FSPLSL FMPQ S+GSPVDWNTVL +ASSKGPR+L  SSKSH+S+L SPVKF+PGNKSVTLLMPGFEKS
Subjt:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS

Query:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        EI+LYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Subjt:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

XP_022979170.1 uncharacterized protein At1g26090, chloroplastic [Cucurbita maxima]3.2e-22189.53Show/hide
Query:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSLLFSASFFG+PIPISIRTRTAPCRRRS+AIEASKE  DVSSQN  RMLTFLGKGGSGKTTSAVFAA+HFALSGLRTCLVIHNQD TPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM
        +SPVECSHNLSAVRLETTQMLLEPLKRLKQADS LNMTQG LEGVVGEELGILPGMDSIFSVLQLER LGFSGIM Q DQK KYDIV+YDGICTEET+RM
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM

Query:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG
        IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAM+IS PGS LSGRTS TD W+ALE +LEKGSSA +EP RFSCF+VMDPTSPASV+SALRYWG
Subjt:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG

Query:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS
        CTIQAGAQISGAFA ISS LDAES ARLKE+F PL LAFMPQ S+GSPVDWNTVL +ASSKGPR+L  SSKSH+S+LLSPVKFDPGNKSVTLLMPGFEKS
Subjt:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS

Query:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        EI+LYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Subjt:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

XP_023529730.1 uncharacterized protein At1g26090, chloroplastic [Cucurbita pepo subsp. pepo]5.9e-22088.64Show/hide
Query:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSLLFSASFFG+PIPISIRTRTAPCRRRS+AIEASKE  DVSSQN  RMLTFLGKGGSGKTTSAVF A+HFALSGLRTCLVIHNQD TPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM
        NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQG LEG+VGEELGILPGMDSIFSVLQLER LG SGIM Q DQK KYDIV+YDGICTEET+RM
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM

Query:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG
        IGATSKARLYLKYLRSIAEKTDLGRLATPSI+RLVDEAM+IS PGS LSGRTS TD W+ALE +LEKGSSA +EP RFSCF+VMDPTSPASV+SA RYWG
Subjt:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG

Query:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS
        CTIQAGAQISGAFA ISS LDAES ARLKE+FSPL LAFMPQ S+GSPVDWNTVL +ASSKGPR+L  SSKSH+++L SPVKF+PGNKSVTLLMPGFEKS
Subjt:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS

Query:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        EI+LYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
Subjt:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

XP_038891424.1 uncharacterized protein At1g26090, chloroplastic [Benincasa hispida]1.3e-22290.42Show/hide
Query:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSL FSASFFGNPIPISIRTRTAPCR R +A++ASKE  DVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM
        NSPVECS NLSAVRLETTQMLLEPLKRLKQADSRLNMTQG+LEGVVGEELG+LPG DSIFS+LQLER LGFSGIM QRDQK KYD+VIYDGICTEET+RM
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM

Query:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG
        IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGS LS RTS TDIWEALE +LEKGSSAF+EP +FSCF+VMDPTSPASVQSALRYWG
Subjt:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG

Query:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLFS-SKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS
        CTIQAG QISGA AFISSHL AES A LKE FSPLSLAFMPQFS GS VDWNTVL +ASSKGPRDL S SKS TSSLLSPVKFDPGNKSVTLLMPGF KS
Subjt:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLFS-SKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS

Query:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAK TDR LVITMR
Subjt:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

TrEMBL top hitse value%identityAlignment
A0A1S3BET7 uncharacterized protein At1g26090, chloroplastic5.4e-21987.97Show/hide
Query:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSLLFS SFFGNPIPISIRTRT PC  R + ++ASK+T DVSSQNPTR+LTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVI NQDPTPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM
        NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEEL +LPGMDSIFS+LQLER +GFSGIM QRDQK KYDIVIYDG+CTEET+RM
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM

Query:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG
        IGATSK RLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGS L GRTS TDIWE LE +LEKGSSAF+EP +FSC++VMDPTSPASVQSALRYWG
Subjt:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG

Query:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS
        CTIQAGAQI GA AF SSH +AE+ A LKE FSPLSLAF+PQFSIGS VDWNTVL +ASSKGPRDL  SSKS TSSL+ PVKFDPGNKSVTLLMPGF KS
Subjt:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS

Query:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Subjt:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

A0A5A7STS2 ArsA_ATPase domain-containing protein2.1e-19989Show/hide
Query:  DVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGV
        DVSSQNPTR+LTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVI NQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGV
Subjt:  DVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGV

Query:  LEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSI
        LEGVVGEEL +LPGMDSIFS+LQLER +GFSGIM QRDQK KYDIVIYDG+CTEET+RMIGATSK RLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSI
Subjt:  LEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSI

Query:  SRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWGCTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMP
        SRPGS L GRTS TDIWE LE +LEKGSSAF+EP +FSC++VMDPTSPASVQSALRYWGCTIQAGAQI GA AF SSH +AE+ A LKE FSPLSLAF+P
Subjt:  SRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWGCTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMP

Query:  QFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKSEIKLYQYR-GGSELLVEAGDQRRVISLPKEIQGKVGGAKFT
        QFSIGS VDWNTVL +ASSKGPRDL  SSKS TSSL+ PVKFDPGNKSVTLLMPGF KSEIKLYQ R GGSELLVEAGDQRRVISLPKEIQGKVGGAKF 
Subjt:  QFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKSEIKLYQYR-GGSELLVEAGDQRRVISLPKEIQGKVGGAKFT

Query:  DRSLVITMR
        DRSLVITMR
Subjt:  DRSLVITMR

A0A6J1D944 uncharacterized protein At1g26090, chloroplastic9.9e-21385.21Show/hide
Query:  MASSLLFSASFFGNPIPISIRTRTAPC----RRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLD
        MASSLL+S SFFGNPIPIS+  RT       RRR+L +++SKE  D   Q PTR+LTFLGKGGSGKT+SAVFAAQHFAL+GLRTCLVIHNQDPT EYLLD
Subjt:  MASSLLFSASFFGNPIPISIRTRTAPC----RRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLD

Query:  CKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEE
        CKIGNSPVEC HNLSAVRLETTQMLLEPLK+L+QADSRLNMTQGVLEGVVGEELG+LPGMDS+FSVL LE+ LGFS  M QRD+K  YDIVIYDGI TEE
Subjt:  CKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEE

Query:  TMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSAL
        T+R++GA SKARLYLKY+RS AEKTDLGRLATPSILRLVDEAM ISRPGS LSGRTS TDIWEALER+LE+GSSAFSEPS+F CF+VMDPTSPASVQSAL
Subjt:  TMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSAL

Query:  RYWGCTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPG
        RYWGCTIQAGAQISGAFAFISSHLDAES++RLKE+FSPLSLAFMP+FSIGSPVDWNTVLH+ASSKGPRDL  SSKSH SSLLSPVKFDPGN+SVTL MPG
Subjt:  RYWGCTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPG

Query:  FEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        FEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Subjt:  FEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

A0A6J1GY43 uncharacterized protein At1g26090, chloroplastic8.4e-22088.86Show/hide
Query:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSLLFSASFFG+PIPISIRTRTAPCRRRS+AIEASKE  DVSSQN  RMLTFLGKGGSGKTTSAVFAA+HFALSGLRTCLVIHNQD TPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM
        NSPVECS NLSAVRLETTQMLLEPLKRLKQADSRLNMTQG LEG+VGEELGILPGMDSIFSVLQLER LG SGIM Q DQK KYDIV+YDGICTEET+RM
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM

Query:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG
        IGATSKARLYLKYLRSIAEKTDLGRLATPSI+RLVDEAM IS PGS LSGRTS TD W+ALER+LEKGSSA +EP RFSCF+VMDPTSPASV+SA RYWG
Subjt:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG

Query:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS
        CTIQAGAQISGAFA ISS LDAES ARLKE+FSPLSL FMPQ S+GSPVDWNTVL +ASSKGPR+L  SSKSH+S+L SPVKF+PGNKSVTLLMPGFEKS
Subjt:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS

Query:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        EI+LYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Subjt:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

A0A6J1ISG8 uncharacterized protein At1g26090, chloroplastic1.5e-22189.53Show/hide
Query:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG
        MASSLLFSASFFG+PIPISIRTRTAPCRRRS+AIEASKE  DVSSQN  RMLTFLGKGGSGKTTSAVFAA+HFALSGLRTCLVIHNQD TPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIG

Query:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM
        +SPVECSHNLSAVRLETTQMLLEPLKRLKQADS LNMTQG LEGVVGEELGILPGMDSIFSVLQLER LGFSGIM Q DQK KYDIV+YDGICTEET+RM
Subjt:  NSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRM

Query:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG
        IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAM+IS PGS LSGRTS TD W+ALE +LEKGSSA +EP RFSCF+VMDPTSPASV+SALRYWG
Subjt:  IGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWG

Query:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS
        CTIQAGAQISGAFA ISS LDAES ARLKE+F PL LAFMPQ S+GSPVDWNTVL +ASSKGPR+L  SSKSH+S+LLSPVKFDPGNKSVTLLMPGFEKS
Subjt:  CTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLF-SSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKS

Query:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        EI+LYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Subjt:  EIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

SwissProt top hitse value%identityAlignment
O66908 Putative arsenical pump-driving ATPase 11.2e-0525Show/hide
Query:  RMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDC---------KIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQG
        R++ F GKGG GKTT  + AA  + LS L   +++ + DP    L D          K    P++ + NL    ++  + +      + +    L  T G
Subjt:  RMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDC---------KIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQG

Query:  VLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVD
        + E ++ +EL ILPGM+ I S+L + +            ++  +D++I D   T E++R +   +  + Y+K  +    +  + ++A P++ R+ D
Subjt:  VLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVD

Q46366 Putative arsenical pump-driving ATPase2.2e-1523.11Show/hide
Query:  RMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEE
        R+LTF GKGG GKT+ +   A   +  G RT ++  +   +     + ++G  P +   NL A+ +     L +    +++  +R+ M QGV  GV+ +E
Subjt:  RMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEE

Query:  LGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLS
        + ILPGM+ +FS+L+++R    +G+         YD ++ D   T ET+R++         +K ++++  K  +  L+ P + ++ D+      P   + 
Subjt:  LGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLS

Query:  GRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVM--DPTSPASVQSALRY---WGCTIQ-------AGAQISGAFAFISSHLDAESIARLKEDFSPLSL
           S   +++ LE I E      ++  + +  LVM  +  S      AL Y   +G  +          AQ +  +      +  + +  ++E FSPL +
Subjt:  GRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVM--DPTSPASVQSALRY---WGCTIQ-------AGAQISGAFAFISSHLDAESIARLKEDFSPLSL

Query:  AFMPQFSIGSPVDWNTVLHNASSKGPRDLFSSKSHTSSLLS--PVKFDPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQG-KVG
          +  +      D   V   +      D++     +  +    P+KF        + +     + + +  +  G EL V+ G+QR++I+LP  + G + G
Subjt:  AFMPQFSIGSPVDWNTVLHNASSKGPRDLFSSKSHTSSLLS--PVKFDPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQG-KVG

Query:  GAKFTDRSLVI
         A F D+ L I
Subjt:  GAKFTDRSLVI

Q46465 Putative arsenical pump-driving ATPase2.2e-1522.97Show/hide
Query:  RMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEE
        R+LTF GKGG GKT+ +   A   +  G RT ++  +   +     + ++G  P +   NL A+ +     L E    +++  +R+ M QGV  GV+ +E
Subjt:  RMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEE

Query:  LGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLS
        + ILPGM+ +FS+L+++R    +G+         YD ++ D   T ET+R++              S+ +    G  A  ++ + +     +S+P S++S
Subjt:  LGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLS

Query:  GRTS----TTDIWEALERI---LEKGSSAFSEPSRFSCFLVM--DPTSPASVQSALRY---WGCTIQ-------AGAQISGAFAFISSHLDAESIARLKE
         + +      D  E+++++   LE      ++  + +  LVM  +  S      AL Y   +G  +          AQ +  +      +  + +  ++E
Subjt:  GRTS----TTDIWEALERI---LEKGSSAFSEPSRFSCFLVM--DPTSPASVQSALRY---WGCTIQ-------AGAQISGAFAFISSHLDAESIARLKE

Query:  DFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLFSSKSHTSSLLS--PVKFDPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKE
         FSPL +  +  +      D   V   +      D++     +  +    P+KF        + +     + + +  +  G EL V+ G+QR++I+LP  
Subjt:  DFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLFSSKSHTSSLLS--PVKFDPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKE

Query:  IQG-KVGGAKFTDRSLVI
        + G + G A F D+ L I
Subjt:  IQG-KVGGAKFTDRSLVI

Q55794 Putative arsenical pump-driving ATPase7.3e-1122.87Show/hide
Query:  RMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEE
        R++   GKGG GKT+ A       A  G +T ++  +   +     D ++G+ P     NL    L+    L      +K+  +++   +G L+GV  EE
Subjt:  RMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEE

Query:  LGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQ-L
        L ILPGMD IF +++++R             +  YD++I D   T   +R++        Y++  R       +     P +  L       S P  + +
Subjt:  LGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQ-L

Query:  SGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWGCTI-QAGAQISGAFAFISSHLDAESIARLK-----------EDFSPLS
               +  EALE++L       ++ ++ S  LV +P      +S   +   ++      +  A   +   +D     R K           ++F PL 
Subjt:  SGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWGCTI-QAGAQISGAFAFISSHLDAESIARLK-----------EDFSPLS

Query:  LAFMPQFSIGSPVDWNTVLHNASSKGPRDLFSSK-SHTSSLLSPVKFDPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQG-KVG
        +   P FS    +     L        +D   S+  +  + ++ V+    + S+ L +PG  K +I+L   + G EL V  G+ RR + LP+ +      
Subjt:  LAFMPQFSIGSPVDWNTVLHNASSKGPRDLFSSK-SHTSSLLSPVKFDPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQG-KVG

Query:  GAKFTDRSLVI
        GAK  D  L I
Subjt:  GAKFTDRSLVI

Q6DYE4 Uncharacterized protein At1g26090, chloroplastic1.0e-14260.39Show/hide
Query:  MASSLLFSASFFGNPIPISIRTRT-APCRRRSLAIEASKETADV------SSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEY
        + +S L  +S   N +PI +RT T +  R+R  A  A+  + DV      SSQ  T+ +TFLGKGGSGKTT+AVFAAQH+AL+GL TCLVIHNQDP+ E+
Subjt:  MASSLLFSASFFGNPIPISIRTRT-APCRRRSLAIEASKETADV------SSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEY

Query:  LLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVK-YDIVIYDGI
        LL  KIG SP   + NLS +RLETT+MLLEPLK+LKQAD+RLNMTQGVLEGVVGEELG+LPGMDSIFS+L+LERL+GF    T+++ K K +D++IYDGI
Subjt:  LLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVK-YDIVIYDGI

Query:  CTEETMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASV
         TEET+RMIG +SK RLY KYLRS+AEKTDLGRL +PSI+R VDE+M+I+   S   G TS   +W+ LER LE G+SA+ +P RF  FLVMDP +P SV
Subjt:  CTEETMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASV

Query:  QSALRYWGCTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLFS-SKSHTSSLLSPVKFDPGNKSVTL
        ++ALRYWGCT+QAG+ +SGAFA  SSHL ++     K DF PL  A        + +DW+ +L + ++   R+L S + SH +SL   V FD   K VTL
Subjt:  QSALRYWGCTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLFS-SKSHTSSLLSPVKFDPGNKSVTL

Query:  LMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
         MPGFEKSEIKLYQYRGGSELL+EAGDQRRVI LP +IQGKVGGAKF DRSL++TMR
Subjt:  LMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

Arabidopsis top hitse value%identityAlignment
AT1G26090.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein7.4e-14460.39Show/hide
Query:  MASSLLFSASFFGNPIPISIRTRT-APCRRRSLAIEASKETADV------SSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEY
        + +S L  +S   N +PI +RT T +  R+R  A  A+  + DV      SSQ  T+ +TFLGKGGSGKTT+AVFAAQH+AL+GL TCLVIHNQDP+ E+
Subjt:  MASSLLFSASFFGNPIPISIRTRT-APCRRRSLAIEASKETADV------SSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEY

Query:  LLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVK-YDIVIYDGI
        LL  KIG SP   + NLS +RLETT+MLLEPLK+LKQAD+RLNMTQGVLEGVVGEELG+LPGMDSIFS+L+LERL+GF    T+++ K K +D++IYDGI
Subjt:  LLDCKIGNSPVECSHNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVK-YDIVIYDGI

Query:  CTEETMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASV
         TEET+RMIG +SK RLY KYLRS+AEKTDLGRL +PSI+R VDE+M+I+   S   G TS   +W+ LER LE G+SA+ +P RF  FLVMDP +P SV
Subjt:  CTEETMRMIGATSKARLYLKYLRSIAEKTDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASV

Query:  QSALRYWGCTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLFS-SKSHTSSLLSPVKFDPGNKSVTL
        ++ALRYWGCT+QAG+ +SGAFA  SSHL ++     K DF PL  A        + +DW+ +L + ++   R+L S + SH +SL   V FD   K VTL
Subjt:  QSALRYWGCTIQAGAQISGAFAFISSHLDAESIARLKEDFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLFS-SKSHTSSLLSPVKFDPGNKSVTL

Query:  LMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
         MPGFEKSEIKLYQYRGGSELL+EAGDQRRVI LP +IQGKVGGAKF DRSL++TMR
Subjt:  LMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCTTCTCTGCTCTTCTCCGCTTCTTTCTTCGGGAACCCAATTCCCATTTCAATACGAACAAGAACAGCTCCATGTAGGAGAAGATCTCTGGCCATTGAAGCTTC
AAAAGAGACTGCGGACGTTTCTTCCCAGAACCCAACCAGGATGCTCACTTTTCTTGGCAAAGGCGGCTCGGGAAAGACCACTTCCGCGGTATTCGCCGCTCAGCACTTTG
CACTATCTGGACTTCGCACATGTCTGGTGATACATAATCAAGATCCTACTCCTGAGTATCTTCTGGATTGTAAAATAGGAAATTCTCCTGTTGAATGCAGTCACAACCTC
TCAGCTGTTAGGTTGGAAACCACTCAAATGCTTCTTGAACCTCTCAAACGGCTGAAGCAAGCTGATTCTCGTCTTAACATGACACAAGGAGTTCTTGAAGGGGTTGTTGG
TGAAGAGCTTGGGATACTTCCAGGAATGGATTCTATCTTTTCGGTACTTCAACTTGAGAGGCTTCTTGGGTTCTCAGGGATTATGACCCAAAGAGACCAAAAGGTTAAAT
ATGACATAGTAATATATGACGGTATCTGCACGGAAGAAACAATGAGGATGATTGGAGCAACCAGTAAAGCAAGGTTGTACTTAAAATATCTGAGGAGCATTGCTGAAAAA
ACTGATCTTGGGAGGTTGGCCACTCCTTCAATTCTGAGGCTTGTTGATGAAGCCATGAGTATAAGCAGGCCAGGCTCCCAACTCAGTGGTAGAACCAGTACAACTGATAT
TTGGGAAGCGCTGGAACGCATTTTAGAGAAAGGGTCTTCTGCATTTTCAGAGCCAAGTAGATTTAGCTGCTTTTTAGTGATGGATCCAACTAGTCCTGCCTCTGTTCAGT
CTGCATTACGGTACTGGGGTTGTACTATTCAAGCTGGTGCACAAATTTCTGGTGCATTTGCTTTCATTTCTTCACATTTGGATGCAGAATCCATTGCTAGATTGAAGGAG
GATTTTTCACCCTTATCTTTGGCCTTTATGCCACAGTTCTCAATTGGTTCCCCTGTAGATTGGAACACAGTTCTTCATAATGCATCAAGTAAAGGCCCGAGGGACCTTTT
TTCGTCAAAAAGCCACACCAGCAGTCTGCTATCACCTGTAAAATTTGATCCTGGAAACAAATCAGTTACACTTCTCATGCCAGGATTCGAGAAGTCAGAAATCAAGCTTT
ACCAGTATAGGGGAGGATCTGAGCTATTGGTGGAAGCTGGGGATCAGAGGCGTGTAATTTCTTTGCCTAAAGAAATTCAAGGGAAGGTGGGTGGTGCTAAGTTCACGGAC
AGAAGCCTTGTGATCACAATGCGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCTTCTCTGCTCTTCTCCGCTTCTTTCTTCGGGAACCCAATTCCCATTTCAATACGAACAAGAACAGCTCCATGTAGGAGAAGATCTCTGGCCATTGAAGCTTC
AAAAGAGACTGCGGACGTTTCTTCCCAGAACCCAACCAGGATGCTCACTTTTCTTGGCAAAGGCGGCTCGGGAAAGACCACTTCCGCGGTATTCGCCGCTCAGCACTTTG
CACTATCTGGACTTCGCACATGTCTGGTGATACATAATCAAGATCCTACTCCTGAGTATCTTCTGGATTGTAAAATAGGAAATTCTCCTGTTGAATGCAGTCACAACCTC
TCAGCTGTTAGGTTGGAAACCACTCAAATGCTTCTTGAACCTCTCAAACGGCTGAAGCAAGCTGATTCTCGTCTTAACATGACACAAGGAGTTCTTGAAGGGGTTGTTGG
TGAAGAGCTTGGGATACTTCCAGGAATGGATTCTATCTTTTCGGTACTTCAACTTGAGAGGCTTCTTGGGTTCTCAGGGATTATGACCCAAAGAGACCAAAAGGTTAAAT
ATGACATAGTAATATATGACGGTATCTGCACGGAAGAAACAATGAGGATGATTGGAGCAACCAGTAAAGCAAGGTTGTACTTAAAATATCTGAGGAGCATTGCTGAAAAA
ACTGATCTTGGGAGGTTGGCCACTCCTTCAATTCTGAGGCTTGTTGATGAAGCCATGAGTATAAGCAGGCCAGGCTCCCAACTCAGTGGTAGAACCAGTACAACTGATAT
TTGGGAAGCGCTGGAACGCATTTTAGAGAAAGGGTCTTCTGCATTTTCAGAGCCAAGTAGATTTAGCTGCTTTTTAGTGATGGATCCAACTAGTCCTGCCTCTGTTCAGT
CTGCATTACGGTACTGGGGTTGTACTATTCAAGCTGGTGCACAAATTTCTGGTGCATTTGCTTTCATTTCTTCACATTTGGATGCAGAATCCATTGCTAGATTGAAGGAG
GATTTTTCACCCTTATCTTTGGCCTTTATGCCACAGTTCTCAATTGGTTCCCCTGTAGATTGGAACACAGTTCTTCATAATGCATCAAGTAAAGGCCCGAGGGACCTTTT
TTCGTCAAAAAGCCACACCAGCAGTCTGCTATCACCTGTAAAATTTGATCCTGGAAACAAATCAGTTACACTTCTCATGCCAGGATTCGAGAAGTCAGAAATCAAGCTTT
ACCAGTATAGGGGAGGATCTGAGCTATTGGTGGAAGCTGGGGATCAGAGGCGTGTAATTTCTTTGCCTAAAGAAATTCAAGGGAAGGTGGGTGGTGCTAAGTTCACGGAC
AGAAGCCTTGTGATCACAATGCGTTGA
Protein sequenceShow/hide protein sequence
MASSLLFSASFFGNPIPISIRTRTAPCRRRSLAIEASKETADVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGLRTCLVIHNQDPTPEYLLDCKIGNSPVECSHNL
SAVRLETTQMLLEPLKRLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERLLGFSGIMTQRDQKVKYDIVIYDGICTEETMRMIGATSKARLYLKYLRSIAEK
TDLGRLATPSILRLVDEAMSISRPGSQLSGRTSTTDIWEALERILEKGSSAFSEPSRFSCFLVMDPTSPASVQSALRYWGCTIQAGAQISGAFAFISSHLDAESIARLKE
DFSPLSLAFMPQFSIGSPVDWNTVLHNASSKGPRDLFSSKSHTSSLLSPVKFDPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTD
RSLVITMR