; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016523 (gene) of Snake gourd v1 genome

Gene IDTan0016523
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionP-loop containing nucleoside triphosphate hydrolases superfamily protein
Genome locationLG04:7809196..7811565
RNA-Seq ExpressionTan0016523
SyntenyTan0016523
Gene Ontology termsNA
InterPro domainsIPR008978 - HSP20-like chaperone
IPR025723 - Anion-transporting ATPase-like domain
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR040612 - ArsA, HSP20-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446550.1 PREDICTED: uncharacterized protein At1g26090, chloroplastic [Cucumis melo]2.7e-21285.27Show/hide
Query:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG
        MASSLLFS SFFGNPIPISIRT T PC  + + ++ASK+  +VSSQNPTR+LTFLGKGGSGKTTSAVFAAQHFALSG RTCL I NQDPTPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG

Query:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM
        NSPVEC HNLSAVRLETTQMLLEPLK LKQADSRLNMTQGVLEGVVGEEL +LPGMDSIFS+LQLERF+GFSGIM QRDQK KYDIVIYDG+CTEET+RM
Subjt:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM

Query:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC
        IGATSK RLYLKYL+SIAEKTDLGRLATPSILR VDEAMSIS PGS L  RTSTDIWE LE +LEK SSAF+E  KFSC+IVMDPTSPASVQSALRYWGC
Subjt:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC

Query:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE
        TIQAGAQI GA A  SS  +AE+ A LKE FSPLSLAF+PQFS GS VDWNTVLRDASSKGPRDLLS+SK+ TSSL+ PVKF+PGNKSVTLLMPGF KSE
Subjt:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE

Query:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Subjt:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

XP_022956773.1 uncharacterized protein At1g26090, chloroplastic [Cucurbita moschata]1.0e-21987.95Show/hide
Query:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG
        MASSLLFSASFFG+PIPISIRT TAPCRR+S+A+EASKE+T+VSSQN  RMLTFLGKGGSGKTTSAVFAA+HFALSG RTCL IHNQD TPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG

Query:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM
        NSPVEC  NLSAVRLETTQMLLEPLK LKQADSRLNMTQG LEG+VGEELGILPGMDSIFSVLQLERFLG SGIMAQ DQK KYDIV+YDGICTEET+RM
Subjt:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM

Query:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC
        IGATSKARLYLKYL+SIAEKTDLGRLATPSI+R VDEAM IS PGS LS RTSTD W+ALERMLEK SSA +E  +FSCFIVMDPTSPASV+SA RYWGC
Subjt:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC

Query:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE
        TIQAGAQISGAFASISS LDAES ARLKENFSPLSL FMPQ S GSPVDWNTVL DASSKGPR+LLS+SK+H+S+L SPVKFNPGNKSVTLLMPGFEKSE
Subjt:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE

Query:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        I+LYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Subjt:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

XP_022979170.1 uncharacterized protein At1g26090, chloroplastic [Cucurbita maxima]1.6e-22088.39Show/hide
Query:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG
        MASSLLFSASFFG+PIPISIRT TAPCRR+S+A+EASKE+T+VSSQN  RMLTFLGKGGSGKTTSAVFAA+HFALSG RTCL IHNQD TPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG

Query:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM
        +SPVEC HNLSAVRLETTQMLLEPLK LKQADS LNMTQG LEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQ DQKAKYDIV+YDGICTEET+RM
Subjt:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM

Query:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC
        IGATSKARLYLKYL+SIAEKTDLGRLATPSILR VDEAM+IS PGS LS RTSTD W+ALE MLEK SSA +E  +FSCFIVMDPTSPASV+SALRYWGC
Subjt:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC

Query:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE
        TIQAGAQISGAFASISS LDAES ARLKENF PL LAFMPQ S GSPVDWNTVL DASSKGPR+LLS+SK+H+S+LLSPVKF+PGNKSVTLLMPGFEKSE
Subjt:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE

Query:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        I+LYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Subjt:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

XP_023529730.1 uncharacterized protein At1g26090, chloroplastic [Cucurbita pepo subsp. pepo]1.6e-22087.95Show/hide
Query:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG
        MASSLLFSASFFG+PIPISIRT TAPCRR+S+A+EASKE+T+VSSQN  RMLTFLGKGGSGKTTSAVF A+HFALSG RTCL IHNQD TPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG

Query:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM
        NSPVEC HNLSAVRLETTQMLLEPLK LKQADSRLNMTQG LEG+VGEELGILPGMDSIFSVLQLERFLG SGIMAQ DQKAKYDIV+YDGICTEET+RM
Subjt:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM

Query:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC
        IGATSKARLYLKYL+SIAEKTDLGRLATPSI+R VDEAM+IS PGS LS RTSTD W+ALE MLEK SSA +E  +FSCFIVMDPTSPASV+SA RYWGC
Subjt:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC

Query:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE
        TIQAGAQISGAFASISS LDAES ARLKENFSPL LAFMPQ S GSPVDWNTVL DASSKGPR+LLS+SK+H+++L SPVKFNPGNKSVTLLMPGFEKSE
Subjt:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE

Query:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        I+LYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
Subjt:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

XP_038891424.1 uncharacterized protein At1g26090, chloroplastic [Benincasa hispida]1.3e-21988.84Show/hide
Query:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG
        MASSL FSASFFGNPIPISIRT TAPCR + +A++ASKEIT+VSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSG RTCL IHNQDPTPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG

Query:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM
        NSPVEC  NLSAVRLETTQMLLEPLK LKQADSRLNMTQG+LEGVVGEELG+LPG DSIFS+LQLERFLGFSGIM QRDQK KYD+VIYDGICTEET+RM
Subjt:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM

Query:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC
        IGATSKARLYLKYL+SIAEKTDLGRLATPSILR VDEAMSIS PGS LS RTSTDIWEALE +LEK SSAF+E  KFSCFIVMDPTSPASVQSALRYWGC
Subjt:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC

Query:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE
        TIQAG QISGA A ISS L AES A LKE FSPLSLAFMPQFS+GS VDWNTVLRDASSKGPRDLLS SK+ TSSLLSPVKF+PGNKSVTLLMPGF KSE
Subjt:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE

Query:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAK TDR LVITMR
Subjt:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

TrEMBL top hitse value%identityAlignment
A0A1S3BET7 uncharacterized protein At1g26090, chloroplastic1.3e-21285.27Show/hide
Query:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG
        MASSLLFS SFFGNPIPISIRT T PC  + + ++ASK+  +VSSQNPTR+LTFLGKGGSGKTTSAVFAAQHFALSG RTCL I NQDPTPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG

Query:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM
        NSPVEC HNLSAVRLETTQMLLEPLK LKQADSRLNMTQGVLEGVVGEEL +LPGMDSIFS+LQLERF+GFSGIM QRDQK KYDIVIYDG+CTEET+RM
Subjt:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM

Query:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC
        IGATSK RLYLKYL+SIAEKTDLGRLATPSILR VDEAMSIS PGS L  RTSTDIWE LE +LEK SSAF+E  KFSC+IVMDPTSPASVQSALRYWGC
Subjt:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC

Query:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE
        TIQAGAQI GA A  SS  +AE+ A LKE FSPLSLAF+PQFS GS VDWNTVLRDASSKGPRDLLS+SK+ TSSL+ PVKF+PGNKSVTLLMPGF KSE
Subjt:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE

Query:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Subjt:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

A0A5A7STS2 ArsA_ATPase domain-containing protein7.1e-19586.76Show/hide
Query:  NVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGV
        +VSSQNPTR+LTFLGKGGSGKTTSAVFAAQHFALSG RTCL I NQDPTPEYLLDCKIGNSPVEC HNLSAVRLETTQMLLEPLK LKQADSRLNMTQGV
Subjt:  NVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGV

Query:  LEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSI
        LEGVVGEEL +LPGMDSIFS+LQLERF+GFSGIM QRDQK KYDIVIYDG+CTEET+RMIGATSK RLYLKYL+SIAEKTDLGRLATPSILR VDEAMSI
Subjt:  LEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSI

Query:  SGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQ
        S PGS L  RTSTDIWE LE +LEK SSAF+E  KFSC+IVMDPTSPASVQSALRYWGCTIQAGAQI GA A  SS  +AE+ A LKE FSPLSLAF+PQ
Subjt:  SGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQ

Query:  FSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYR-GGSELLVEAGDQRRVISLPKEIQGKVGGAKFTD
        FS GS VDWNTVLRDASSKGPRDLLS+SK+ TSSL+ PVKF+PGNKSVTLLMPGF KSEIKLYQ R GGSELLVEAGDQRRVISLPKEIQGKVGGAKF D
Subjt:  FSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYR-GGSELLVEAGDQRRVISLPKEIQGKVGGAKFTD

Query:  RSLVITMR
        RSLVITMR
Subjt:  RSLVITMR

A0A6J1D944 uncharacterized protein At1g26090, chloroplastic3.5e-21084.96Show/hide
Query:  MASSLLFSASFFGNPIPIS--IRTTTAPC--RRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLD
        MASSLL+S SFFGNPIPIS  IRT  A    RR++L V++SKEI +   Q PTR+LTFLGKGGSGKT+SAVFAAQHFAL+G RTCL IHNQDPT EYLLD
Subjt:  MASSLLFSASFFGNPIPIS--IRTTTAPC--RRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLD

Query:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEE
        CKIGNSPVECGHNLSAVRLETTQMLLEPLK L+QADSRLNMTQGVLEGVVGEELG+LPGMDS+FSVL LE+FLGFS  MAQRD+KA YDIVIYDGI TEE
Subjt:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEE

Query:  TVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALR
        T+R++GA SKARLYLKY++S AEKTDLGRLATPSILR VDEAM IS PGS LS RTSTDIWEALERMLE+ SSAFSE SKF CFIVMDPTSPASVQSALR
Subjt:  TVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALR

Query:  YWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGF
        YWGCTIQAGAQISGAFA ISS LDAES++RLKENFSPLSLAFMP+FS GSPVDWNTVL DASSKGPRDLLS+SK+H SSLLSPVKF+PGN+SVTL MPGF
Subjt:  YWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGF

Query:  EKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        EKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Subjt:  EKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

A0A6J1GY43 uncharacterized protein At1g26090, chloroplastic4.9e-22087.95Show/hide
Query:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG
        MASSLLFSASFFG+PIPISIRT TAPCRR+S+A+EASKE+T+VSSQN  RMLTFLGKGGSGKTTSAVFAA+HFALSG RTCL IHNQD TPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG

Query:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM
        NSPVEC  NLSAVRLETTQMLLEPLK LKQADSRLNMTQG LEG+VGEELGILPGMDSIFSVLQLERFLG SGIMAQ DQK KYDIV+YDGICTEET+RM
Subjt:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM

Query:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC
        IGATSKARLYLKYL+SIAEKTDLGRLATPSI+R VDEAM IS PGS LS RTSTD W+ALERMLEK SSA +E  +FSCFIVMDPTSPASV+SA RYWGC
Subjt:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC

Query:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE
        TIQAGAQISGAFASISS LDAES ARLKENFSPLSL FMPQ S GSPVDWNTVL DASSKGPR+LLS+SK+H+S+L SPVKFNPGNKSVTLLMPGFEKSE
Subjt:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE

Query:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        I+LYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Subjt:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

A0A6J1ISG8 uncharacterized protein At1g26090, chloroplastic7.6e-22188.39Show/hide
Query:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG
        MASSLLFSASFFG+PIPISIRT TAPCRR+S+A+EASKE+T+VSSQN  RMLTFLGKGGSGKTTSAVFAA+HFALSG RTCL IHNQD TPEYLLDCKIG
Subjt:  MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIG

Query:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM
        +SPVEC HNLSAVRLETTQMLLEPLK LKQADS LNMTQG LEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQ DQKAKYDIV+YDGICTEET+RM
Subjt:  NSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRM

Query:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC
        IGATSKARLYLKYL+SIAEKTDLGRLATPSILR VDEAM+IS PGS LS RTSTD W+ALE MLEK SSA +E  +FSCFIVMDPTSPASV+SALRYWGC
Subjt:  IGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGC

Query:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE
        TIQAGAQISGAFASISS LDAES ARLKENF PL LAFMPQ S GSPVDWNTVL DASSKGPR+LLS+SK+H+S+LLSPVKF+PGNKSVTLLMPGFEKSE
Subjt:  TIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSE

Query:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        I+LYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITMR
Subjt:  IKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

SwissProt top hitse value%identityAlignment
O50593 Arsenical pump-driving ATPase7.8e-0526.06Show/hide
Query:  QNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNS--PVECGHNLSAVRLETTQ-------MLLEPLKWLKQADSRLN
        QN    L F GKGG GKT+ +   A H A  G R  L   +       + D  IGN+  PV     LSA+ ++  +        +++P+K L   D  +N
Subjt:  QNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNS--PVECGHNLSAVRLETTQ-------MLLEPLKWLKQADSRLN

Query:  MTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRMI
             L G    E+                 F  F+G++       ++D +I+D   T  T+R++
Subjt:  MTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRMI

Q46366 Putative arsenical pump-driving ATPase1.1e-1422.41Show/hide
Query:  RMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEE
        R+LTF GKGG GKT+ +   A   +  G RT +   +   +     + ++G  P +   NL A+ +     L +    +++  +R+ M QGV  GV+ +E
Subjt:  RMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEE

Query:  LGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLS
        + ILPGM+ +FS+L+++R+               YD ++ D   T ET+R++              S+ +    G  A  ++ +++     +S P S +S
Subjt:  LGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLS

Query:  SRTS-----TDIWEALERM---LEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQ
         + +      D  E+++++   LE      ++  K +  +VM+     S++  +R        G ++      ++  LDA+  +   E +  +   ++ +
Subjt:  SRTS-----TDIWEALERM---LEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQ

Query:  FSSG-SPVDWNTV-LRDASSKGPRDLL--------STSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQG
           G SP+    + + D    G + L          T  +       P+KF        + +     + + +  +  G EL V+ G+QR++I+LP  + G
Subjt:  FSSG-SPVDWNTV-LRDASSKGPRDLL--------STSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQG

Query:  -KVGGAKFTDRSLVI
         + G A F D+ L I
Subjt:  -KVGGAKFTDRSLVI

Q46465 Putative arsenical pump-driving ATPase4.9e-1522.65Show/hide
Query:  RMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEE
        R+LTF GKGG GKT+ +   A   +  G RT +   +   +     + ++G  P +   NL A+ +     L E    +++  +R+ M QGV  GV+ +E
Subjt:  RMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEE

Query:  LGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLS
        + ILPGM+ +FS+L+++R+               YD ++ D   T ET+R++              S+ +    G  A  ++ +++     +S P S +S
Subjt:  LGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLS

Query:  SRTS-----TDIWEALERM---LEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQ
         + +      D  E+++++   LE      ++  K +  +VM+     S++  +R        G ++      ++  LDA+  +   E +  +   ++ +
Subjt:  SRTS-----TDIWEALERM---LEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQ

Query:  FSSG-SPVDWNTV-LRDASSKGPRDLL--------STSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQG
           G SP+    + + D    G + L          T  +       P+KF        + +     + + +  +  G EL V+ G+QR++I+LP  + G
Subjt:  FSSG-SPVDWNTV-LRDASSKGPRDLL--------STSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQG

Query:  -KVGGAKFTDRSLVI
         + G A F D+ L I
Subjt:  -KVGGAKFTDRSLVI

Q55794 Putative arsenical pump-driving ATPase1.5e-1122.25Show/hide
Query:  RMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEE
        R++   GKGG GKT+ A       A  G +T +   +   +     D ++G+ P     NL    L+    L      +K+  +++   +G L+GV  EE
Subjt:  RMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEE

Query:  LGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLS
        L ILPGMD IF +++++R             +A YD++I D   T   +R++        Y++      +   +        LR + E +     G  L 
Subjt:  LGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLS

Query:  SRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGCTI-QAGAQISGAFASISSQLDAESIARLK-----------ENFSPLSLA
         +   D        +E      ++ ++ S  +V +P      +S   +   ++      +  A   +   +D     R K           +NF PL + 
Subjt:  SRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGCTI-QAGAQISGAFASISSQLDAESIARLK-----------ENFSPLSLA

Query:  FMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQG-KVGGA
          P FS    +     L        +D   +   +  + ++ V+ +  + S+ L +PG  K +I+L   + G EL V  G+ RR + LP+ +      GA
Subjt:  FMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQG-KVGGA

Query:  KFTDRSLVI
        K  D  L I
Subjt:  KFTDRSLVI

Q6DYE4 Uncharacterized protein At1g26090, chloroplastic6.3e-14057.89Show/hide
Query:  MASSLLFSASFFGNPIPISIRTTTAPCRRKS----LAVEASKEITNV---SSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEY
        + +S L  +S   N +PI +RT T    RK     +A  +S+++ +    SSQ  T+ +TFLGKGGSGKTT+AVFAAQH+AL+G  TCL IHNQDP+ E+
Subjt:  MASSLLFSASFFGNPIPISIRTTTAPCRRKS----LAVEASKEITNV---SSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEY

Query:  LLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAK-YDIVIYDGI
        LL  KIG SP     NLS +RLETT+MLLEPLK LKQAD+RLNMTQGVLEGVVGEELG+LPGMDSIFS+L+LER +GF     +++ K K +D++IYDGI
Subjt:  LLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAK-YDIVIYDGI

Query:  CTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQ
         TEET+RMIG +SK RLY KYL+S+AEKTDLGRL +PSI+RFVDE+M+I+   SP    TS  +W+ LER LE  +SA+ +  +F  F+VMDP +P SV+
Subjt:  CTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQ

Query:  SALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLL
        +ALRYWGCT+QAG+ +SGAFA  SS L ++     K +F PL  A      + + +DW+ +L D ++   R+LLS + +H +SL   V F+   K VTL 
Subjt:  SALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLL

Query:  MPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        MPGFEKSEIKLYQYRGGSELL+EAGDQRRVI LP +IQGKVGGAKF DRSL++TMR
Subjt:  MPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR

Arabidopsis top hitse value%identityAlignment
AT1G26090.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein4.5e-14157.89Show/hide
Query:  MASSLLFSASFFGNPIPISIRTTTAPCRRKS----LAVEASKEITNV---SSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEY
        + +S L  +S   N +PI +RT T    RK     +A  +S+++ +    SSQ  T+ +TFLGKGGSGKTT+AVFAAQH+AL+G  TCL IHNQDP+ E+
Subjt:  MASSLLFSASFFGNPIPISIRTTTAPCRRKS----LAVEASKEITNV---SSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEY

Query:  LLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAK-YDIVIYDGI
        LL  KIG SP     NLS +RLETT+MLLEPLK LKQAD+RLNMTQGVLEGVVGEELG+LPGMDSIFS+L+LER +GF     +++ K K +D++IYDGI
Subjt:  LLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAK-YDIVIYDGI

Query:  CTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQ
         TEET+RMIG +SK RLY KYL+S+AEKTDLGRL +PSI+RFVDE+M+I+   SP    TS  +W+ LER LE  +SA+ +  +F  F+VMDP +P SV+
Subjt:  CTEETVRMIGATSKARLYLKYLKSIAEKTDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQ

Query:  SALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLL
        +ALRYWGCT+QAG+ +SGAFA  SS L ++     K +F PL  A      + + +DW+ +L D ++   R+LLS + +H +SL   V F+   K VTL 
Subjt:  SALRYWGCTIQAGAQISGAFASISSQLDAESIARLKENFSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLL

Query:  MPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR
        MPGFEKSEIKLYQYRGGSELL+EAGDQRRVI LP +IQGKVGGAKF DRSL++TMR
Subjt:  MPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTDRSLVITMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCGTCTCTGCTCTTCTCTGCTTCTTTCTTCGGGAACCCAATTCCCATTTCAATACGAACAACAACAGCTCCGTGTAGGAGAAAATCTCTGGCCGTTGAAGCTTC
AAAAGAGATTACGAACGTTTCTTCTCAGAACCCAACAAGGATGCTCACTTTTCTTGGCAAAGGCGGCTCGGGAAAGACCACTTCGGCGGTATTCGCCGCTCAGCACTTTG
CATTGTCTGGATTTCGCACATGCCTGGCGATACATAATCAAGATCCTACTCCTGAGTACCTTCTGGATTGTAAAATTGGGAATTCTCCTGTTGAATGCGGTCACAACCTC
TCGGCTGTTAGGTTGGAAACCACTCAAATGCTTCTTGAACCTCTCAAATGGCTAAAGCAAGCTGATTCTCGTCTTAATATGACACAAGGAGTTCTTGAAGGGGTCGTTGG
AGAAGAGCTTGGGATACTTCCAGGAATGGATTCTATCTTTTCGGTACTTCAACTTGAGAGATTTCTTGGGTTCTCAGGGATTATGGCCCAAAGAGACCAAAAAGCTAAAT
ATGACATAGTAATATATGACGGTATCTGCACTGAAGAAACAGTAAGGATGATTGGAGCAACCAGTAAAGCAAGGTTGTACTTAAAATATCTGAAGAGCATTGCTGAAAAA
ACTGATCTTGGGAGGTTGGCTACTCCTTCAATTTTGAGGTTTGTTGATGAAGCCATGAGTATAAGCGGGCCAGGCTCCCCTCTCAGTAGTAGAACCAGCACTGATATATG
GGAGGCACTTGAACGCATGTTAGAGAAAGCATCTTCCGCATTTTCAGAGCAAAGTAAATTTAGCTGCTTCATAGTGATGGATCCAACTAGTCCTGCCTCTGTTCAATCTG
CATTACGGTACTGGGGCTGTACTATTCAAGCTGGTGCACAAATTTCTGGTGCATTTGCTTCCATTTCTTCACAATTGGATGCAGAATCCATTGCTAGATTGAAGGAAAAT
TTTTCACCCTTATCTTTGGCCTTTATGCCACAGTTCTCAAGTGGTTCCCCTGTAGATTGGAACACAGTTCTTCGCGATGCATCAAGTAAAGGCCCGAGGGACCTTCTTTC
TACGTCAAAAAACCACACCAGCAGTCTGCTATCACCCGTAAAATTCAATCCTGGAAACAAATCGGTTACACTTCTCATGCCAGGCTTCGAGAAGTCTGAAATCAAGCTTT
ACCAGTATAGGGGAGGGTCTGAGCTATTGGTGGAAGCTGGTGATCAGAGGCGTGTAATTTCTTTGCCTAAAGAAATTCAAGGGAAGGTGGGTGGTGCCAAGTTCACGGAT
AGAAGTCTTGTGATCACAATGCGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCGTCTCTGCTCTTCTCTGCTTCTTTCTTCGGGAACCCAATTCCCATTTCAATACGAACAACAACAGCTCCGTGTAGGAGAAAATCTCTGGCCGTTGAAGCTTC
AAAAGAGATTACGAACGTTTCTTCTCAGAACCCAACAAGGATGCTCACTTTTCTTGGCAAAGGCGGCTCGGGAAAGACCACTTCGGCGGTATTCGCCGCTCAGCACTTTG
CATTGTCTGGATTTCGCACATGCCTGGCGATACATAATCAAGATCCTACTCCTGAGTACCTTCTGGATTGTAAAATTGGGAATTCTCCTGTTGAATGCGGTCACAACCTC
TCGGCTGTTAGGTTGGAAACCACTCAAATGCTTCTTGAACCTCTCAAATGGCTAAAGCAAGCTGATTCTCGTCTTAATATGACACAAGGAGTTCTTGAAGGGGTCGTTGG
AGAAGAGCTTGGGATACTTCCAGGAATGGATTCTATCTTTTCGGTACTTCAACTTGAGAGATTTCTTGGGTTCTCAGGGATTATGGCCCAAAGAGACCAAAAAGCTAAAT
ATGACATAGTAATATATGACGGTATCTGCACTGAAGAAACAGTAAGGATGATTGGAGCAACCAGTAAAGCAAGGTTGTACTTAAAATATCTGAAGAGCATTGCTGAAAAA
ACTGATCTTGGGAGGTTGGCTACTCCTTCAATTTTGAGGTTTGTTGATGAAGCCATGAGTATAAGCGGGCCAGGCTCCCCTCTCAGTAGTAGAACCAGCACTGATATATG
GGAGGCACTTGAACGCATGTTAGAGAAAGCATCTTCCGCATTTTCAGAGCAAAGTAAATTTAGCTGCTTCATAGTGATGGATCCAACTAGTCCTGCCTCTGTTCAATCTG
CATTACGGTACTGGGGCTGTACTATTCAAGCTGGTGCACAAATTTCTGGTGCATTTGCTTCCATTTCTTCACAATTGGATGCAGAATCCATTGCTAGATTGAAGGAAAAT
TTTTCACCCTTATCTTTGGCCTTTATGCCACAGTTCTCAAGTGGTTCCCCTGTAGATTGGAACACAGTTCTTCGCGATGCATCAAGTAAAGGCCCGAGGGACCTTCTTTC
TACGTCAAAAAACCACACCAGCAGTCTGCTATCACCCGTAAAATTCAATCCTGGAAACAAATCGGTTACACTTCTCATGCCAGGCTTCGAGAAGTCTGAAATCAAGCTTT
ACCAGTATAGGGGAGGGTCTGAGCTATTGGTGGAAGCTGGTGATCAGAGGCGTGTAATTTCTTTGCCTAAAGAAATTCAAGGGAAGGTGGGTGGTGCCAAGTTCACGGAT
AGAAGTCTTGTGATCACAATGCGTTGA
Protein sequenceShow/hide protein sequence
MASSLLFSASFFGNPIPISIRTTTAPCRRKSLAVEASKEITNVSSQNPTRMLTFLGKGGSGKTTSAVFAAQHFALSGFRTCLAIHNQDPTPEYLLDCKIGNSPVECGHNL
SAVRLETTQMLLEPLKWLKQADSRLNMTQGVLEGVVGEELGILPGMDSIFSVLQLERFLGFSGIMAQRDQKAKYDIVIYDGICTEETVRMIGATSKARLYLKYLKSIAEK
TDLGRLATPSILRFVDEAMSISGPGSPLSSRTSTDIWEALERMLEKASSAFSEQSKFSCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFASISSQLDAESIARLKEN
FSPLSLAFMPQFSSGSPVDWNTVLRDASSKGPRDLLSTSKNHTSSLLSPVKFNPGNKSVTLLMPGFEKSEIKLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFTD
RSLVITMR