; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G016110 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G016110
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionP-loop containing nucleoside triphosphate hydrolases superfamily protein
Genome locationCmo_Chr04:8223189..8226008
RNA-Seq ExpressionCmoCh04G016110
SyntenyCmoCh04G016110
Gene Ontology termsNA
InterPro domainsIPR008978 - HSP20-like chaperone
IPR025723 - Anion-transporting ATPase-like domain
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR040612 - ArsA, HSP20-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601324.1 putative protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]2.2e-21798.99Show/hide
Query:  MLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEEL
        MLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNSPVECS NLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEEL
Subjt:  MLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEEL

Query:  GILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSG
        GILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAM ISSPGSHLSG
Subjt:  GILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSG

Query:  RTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGCTIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDW
        RTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGCTIQAGAQISGAFASISSGLDAESAARLKENFSPLSL FMPQISVGSPVDW
Subjt:  RTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGCTIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDW

Query:  NTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
        NTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITM
Subjt:  NTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM

KAG7032107.1 putative protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma]3.7e-23395.31Show/hide
Query:  MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG
        MASSLLFSASFFGHPIPISIRTRT PCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG
Subjt:  MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG

Query:  NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM
        NSPVECS NLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM
Subjt:  NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM

Query:  IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC
        IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAM ISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC
Subjt:  IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC

Query:  TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLM-PGFEKS
        TIQAGAQISGAFASISSGLDAESAARLKENFSPLSL FMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKS +     GF + 
Subjt:  TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLM-PGFEKS

Query:  EIRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
              YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITM
Subjt:  EIRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM

XP_022956773.1 uncharacterized protein At1g26090, chloroplastic [Cucurbita moschata]2.4e-248100Show/hide
Query:  MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG
        MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG
Subjt:  MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG

Query:  NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM
        NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM
Subjt:  NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM

Query:  IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC
        IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC
Subjt:  IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC

Query:  TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSE
        TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSE
Subjt:  TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSE

Query:  IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
        IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
Subjt:  IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM

XP_022979170.1 uncharacterized protein At1g26090, chloroplastic [Cucurbita maxima]2.9e-23896.42Show/hide
Query:  MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG
        MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG
Subjt:  MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG

Query:  NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM
        +SPVECS NLSAVRLETTQMLLEPLKRLKQADS LNMTQGTLEG+VGEELGILPGMDSIFSVLQLERFLG SGIMAQTDQK KYDIVVYDGICTEETIRM
Subjt:  NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM

Query:  IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC
        IGATSKARLYLKYLRSIAEKTDLGRLATPSI+RLVDEAM ISSPGSHLSGRTSTDTWQALE MLEKGSSAIAEPRRFSCFIVMDPTSPASVKSA RYWGC
Subjt:  IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC

Query:  TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSE
        TIQAGAQISGAFASISSGLDAESAARLKENF PL L FMPQISVGSPVDWNTVL DASSKGPRNLLSSSKSHSSNL SPVKF+PGNKSVTLLMPGFEKSE
Subjt:  TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSE

Query:  IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
        IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
Subjt:  IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM

XP_023529730.1 uncharacterized protein At1g26090, chloroplastic [Cucurbita pepo subsp. pepo]5.2e-24397.99Show/hide
Query:  MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG
        MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVF ARHFALSGLRTCLVIHNQDATPEYLLDCKIG
Subjt:  MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG

Query:  NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM
        NSPVECS NLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQK KYDIVVYDGICTEETIRM
Subjt:  NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM

Query:  IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC
        IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAM ISSPGSHLSGRTSTDTWQALE MLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC
Subjt:  IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC

Query:  TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSE
        TIQAGAQISGAFASISSGLDAESAARLKENFSPL L FMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHS+NLPSPVKFNPGNKSVTLLMPGFEKSE
Subjt:  TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSE

Query:  IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
        IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKF DRSLVITM
Subjt:  IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM

TrEMBL top hitse value%identityAlignment
A0A1S3BET7 uncharacterized protein At1g26090, chloroplastic1.2e-20883.67Show/hide
Query:  MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG
        MASSLLFS SFFG+PIPISIRTRT PC  R + ++ASK+  DVSSQN  R+LTFLGKGGSGKTTSAVFAA+HFALSGLRTCLVI NQD TPEYLLDCKIG
Subjt:  MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG

Query:  NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM
        NSPVECS NLSAVRLETTQMLLEPLKRLKQADSRLNMTQG LEG+VGEEL +LPGMDSIFS+LQLERF+G SGIM Q DQK KYDIV+YDG+CTEETIRM
Subjt:  NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM

Query:  IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC
        IGATSK RLYLKYLRSIAEKTDLGRLATPSI+RLVDEAM IS PGSHL GRTSTD W+ LE +LEKGSSA AEPR+FSC+IVMDPTSPASV+SA RYWGC
Subjt:  IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC

Query:  TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSE
        TIQAGAQI GA A  SS  +AE++A LKE FSPLSL F+PQ S+GS VDWNTVL DASSKGPR+LLSSSKS +S+L  PVKF+PGNKSVTLLMPGF KSE
Subjt:  TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSE

Query:  IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
        I+LYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
Subjt:  IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM

A0A5A7STS2 ArsA_ATPase domain-containing protein3.2e-19084.77Show/hide
Query:  DVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGT
        DVSSQN  R+LTFLGKGGSGKTTSAVFAA+HFALSGLRTCLVI NQD TPEYLLDCKIGNSPVECS NLSAVRLETTQMLLEPLKRLKQADSRLNMTQG 
Subjt:  DVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGT

Query:  LEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKI
        LEG+VGEEL +LPGMDSIFS+LQLERF+G SGIM Q DQK KYDIV+YDG+CTEETIRMIGATSK RLYLKYLRSIAEKTDLGRLATPSI+RLVDEAM I
Subjt:  LEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKI

Query:  SSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGCTIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQ
        S PGSHL GRTSTD W+ LE +LEKGSSA AEPR+FSC+IVMDPTSPASV+SA RYWGCTIQAGAQI GA A  SS  +AE++A LKE FSPLSL F+PQ
Subjt:  SSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGCTIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQ

Query:  ISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSEIRLYQYR-GGSELLVEAGDQRRVISLPKEIQGKVGGAKFMD
         S+GS VDWNTVL DASSKGPR+LLSSSKS +S+L  PVKF+PGNKSVTLLMPGF KSEI+LYQ R GGSELLVEAGDQRRVISLPKEIQGKVGGAKFMD
Subjt:  ISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSEIRLYQYR-GGSELLVEAGDQRRVISLPKEIQGKVGGAKFMD

Query:  RSLVITM
        RSLVITM
Subjt:  RSLVITM

A0A6J1D944 uncharacterized protein At1g26090, chloroplastic5.3e-20180.71Show/hide
Query:  MASSLLFSASFFGHPIPISIRTRTAPC----RRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLD
        MASSLL+S SFFG+PIPIS+  RT       RRR++ +++SKE+ D   Q   R+LTFLGKGGSGKT+SAVFAA+HFAL+GLRTCLVIHNQD T EYLLD
Subjt:  MASSLLFSASFFGHPIPISIRTRTAPC----RRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLD

Query:  CKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEE
        CKIGNSPVEC  NLSAVRLETTQMLLEPLK+L+QADSRLNMTQG LEG+VGEELG+LPGMDS+FSVL LE+FLG S  MAQ D+K  YDIV+YDGI TEE
Subjt:  CKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEE

Query:  TIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASR
        TIR++GA SKARLYLKY+RS AEKTDLGRLATPSI+RLVDEAM IS PGSHLSGRTSTD W+ALERMLE+GSSA +EP +F CFIVMDPTSPASV+SA R
Subjt:  TIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASR

Query:  YWGCTIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGF
        YWGCTIQAGAQISGAFA ISS LDAES +RLKENFSPLSL FMP+ S+GSPVDWNTVL DASSKGPR+LLSSSKSH S+L SPVKF+PGN+SVTL MPGF
Subjt:  YWGCTIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGF

Query:  EKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
        EKSEI+LYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
Subjt:  EKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM

A0A6J1GY43 uncharacterized protein At1g26090, chloroplastic1.2e-248100Show/hide
Query:  MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG
        MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG
Subjt:  MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG

Query:  NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM
        NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM
Subjt:  NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM

Query:  IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC
        IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC
Subjt:  IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC

Query:  TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSE
        TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSE
Subjt:  TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSE

Query:  IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
        IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
Subjt:  IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM

A0A6J1ISG8 uncharacterized protein At1g26090, chloroplastic1.4e-23896.42Show/hide
Query:  MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG
        MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG
Subjt:  MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIG

Query:  NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM
        +SPVECS NLSAVRLETTQMLLEPLKRLKQADS LNMTQGTLEG+VGEELGILPGMDSIFSVLQLERFLG SGIMAQTDQK KYDIVVYDGICTEETIRM
Subjt:  NSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRM

Query:  IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC
        IGATSKARLYLKYLRSIAEKTDLGRLATPSI+RLVDEAM ISSPGSHLSGRTSTDTWQALE MLEKGSSAIAEPRRFSCFIVMDPTSPASVKSA RYWGC
Subjt:  IGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGC

Query:  TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSE
        TIQAGAQISGAFASISSGLDAESAARLKENF PL L FMPQISVGSPVDWNTVL DASSKGPRNLLSSSKSHSSNL SPVKF+PGNKSVTLLMPGFEKSE
Subjt:  TIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSE

Query:  IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
        IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
Subjt:  IRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM

SwissProt top hitse value%identityAlignment
O50593 Arsenical pump-driving ATPase1.2e-0526.67Show/hide
Query:  QNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNS--PVECSRNLSAVRLETTQ-------MLLEPLKRLKQADSRLN
        QN P  L F GKGG GKT+ +   A H A  G R  LV  +  +    + D  IGN+  PV     LSA+ ++  +        +++P+K L   D  +N
Subjt:  QNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNS--PVECSRNLSAVRLETTQ-------MLLEPLKRLKQADSRLN

Query:  MTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRMI
             L G    E+                 F   +G++       ++D +++D   T  TIR++
Subjt:  MTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRMI

Q46366 Putative arsenical pump-driving ATPase5.5e-1422.67Show/hide
Query:  RMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEE
        R+LTF GKGG GKT+ +   A   +  G RT ++  +   +     + ++G  P +   NL A+ +     L +    +++  +R+ M QG + G++ +E
Subjt:  RMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEE

Query:  LGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLS
        + ILPGM+ +FS+L+++R+   +G+         YD +V D   T ET+R++              S+ +    G  A  ++ + +     +S P S +S
Subjt:  LGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLS

Query:  GRTS-----TDTWQALERM---LEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGCTIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQ
         + +      D  ++++++   LE     + +  + +  +VM+     S+K   R        G ++      ++  LDA+  +   E +  +   ++ +
Subjt:  GRTS-----TDTWQALERM---LEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGCTIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQ

Query:  ISVG-SPVDWNTV-LLDASSKGPRNL-LSSSKSHSSNLPS-------PVKFNPGNKSVTLLMPGFEKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQG
        I  G SP+    + + D    G ++L + +   +    PS       P+KF        + +     + + +  +  G EL V+ G+QR++I+LP  + G
Subjt:  ISVG-SPVDWNTV-LLDASSKGPRNL-LSSSKSHSSNLPS-------PVKFNPGNKSVTLLMPGFEKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQG

Query:  -KVGGAKFMDRSLVITMPL
         + G A F D+ L I   L
Subjt:  -KVGGAKFMDRSLVITMPL

Q46465 Putative arsenical pump-driving ATPase3.2e-1422.91Show/hide
Query:  RMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEE
        R+LTF GKGG GKT+ +   A   +  G RT ++  +   +     + ++G  P +   NL A+ +     L E    +++  +R+ M QG + G++ +E
Subjt:  RMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEE

Query:  LGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLS
        + ILPGM+ +FS+L+++R+   +G+         YD +V D   T ET+R++              S+ +    G  A  ++ + +     +S P S +S
Subjt:  LGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLS

Query:  GRTS-----TDTWQALERM---LEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGCTIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQ
         + +      D  ++++++   LE     + +  + +  +VM+     S+K   R        G ++      ++  LDA+  +   E +  +   ++ +
Subjt:  GRTS-----TDTWQALERM---LEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGCTIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQ

Query:  ISVG-SPVDWNTV-LLDASSKGPRNL-LSSSKSHSSNLPS-------PVKFNPGNKSVTLLMPGFEKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQG
        I  G SP+    + + D    G ++L + +   +    PS       P+KF        + +     + + +  +  G EL V+ G+QR++I+LP  + G
Subjt:  ISVG-SPVDWNTV-LLDASSKGPRNL-LSSSKSHSSNLPS-------PVKFNPGNKSVTLLMPGFEKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQG

Query:  -KVGGAKFMDRSLVITMPL
         + G A F D+ L I   L
Subjt:  -KVGGAKFMDRSLVITMPL

Q55794 Putative arsenical pump-driving ATPase2.7e-0821.27Show/hide
Query:  RMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEE
        R++   GKGG GKT+ A       A  G +T ++  +   +     D ++G+ P     NL    L+    L      +K+  +++   +G L+G+  EE
Subjt:  RMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEE

Query:  LGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLS
        L ILPGMD IF +++++R             +  YD+++ D   T   +R++        Y++      +   +        +R + E +     G  L 
Subjt:  LGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLS

Query:  GRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGCTI-QAGAQISGAFASISSGLDAESAARLK-----------ENFSPLSLT
         +   D        +E     + +  + S  +V +P      +S   +   ++      +  A   +   +D     R K           +NF PL + 
Subjt:  GRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGCTI-QAGAQISGAFASISSGLDAESAARLK-----------ENFSPLSLT

Query:  FMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQG-KVGGA
          P  S    +     L        ++   S   +  N  + V+ +  + S+ L +PG  K +I+L   + G EL V  G+ RR + LP+ +      GA
Subjt:  FMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQG-KVGGA

Query:  KFMDRSLVI
        K  D  L I
Subjt:  KFMDRSLVI

Q6DYE4 Uncharacterized protein At1g26090, chloroplastic6.7e-13757.58Show/hide
Query:  MASSLLFSASFFGHPIPISIRTRTAPCRRRS----MAIEASKEVTDV---SSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEY
        + +S L  +S   + +PI +RT T    R+     +A  +S++V D    SSQ   + +TFLGKGGSGKTT+AVFAA+H+AL+GL TCLVIHNQD + E+
Subjt:  MASSLLFSASFFGHPIPISIRTRTAPCRRRS----MAIEASKEVTDV---SSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEY

Query:  LLDCKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPK-YDIVVYDGI
        LL  KIG SP   + NLS +RLETT+MLLEPLK+LKQAD+RLNMTQG LEG+VGEELG+LPGMDSIFS+L+LER +G      + + K K +D+++YDGI
Subjt:  LLDCKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPK-YDIVVYDGI

Query:  CTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVK
         TEET+RMIG +SK RLY KYLRS+AEKTDLGRL +PSIMR VDE+M I+S  S   G TS   W  LER LE G+SA  +P RF  F+VMDP +P SVK
Subjt:  CTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVK

Query:  SASRYWGCTIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLL
        +A RYWGCT+QAG+ +SGAFA  SS L ++     K +F PL           + +DW+ +LLD ++   R LLS + SH ++L   V F+   K VTL 
Subjt:  SASRYWGCTIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLL

Query:  MPGFEKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
        MPGFEKSEI+LYQYRGGSELL+EAGDQRRVI LP +IQGKVGGAKF+DRSL++TM
Subjt:  MPGFEKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM

Arabidopsis top hitse value%identityAlignment
AT1G26090.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein4.8e-13857.58Show/hide
Query:  MASSLLFSASFFGHPIPISIRTRTAPCRRRS----MAIEASKEVTDV---SSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEY
        + +S L  +S   + +PI +RT T    R+     +A  +S++V D    SSQ   + +TFLGKGGSGKTT+AVFAA+H+AL+GL TCLVIHNQD + E+
Subjt:  MASSLLFSASFFGHPIPISIRTRTAPCRRRS----MAIEASKEVTDV---SSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEY

Query:  LLDCKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPK-YDIVVYDGI
        LL  KIG SP   + NLS +RLETT+MLLEPLK+LKQAD+RLNMTQG LEG+VGEELG+LPGMDSIFS+L+LER +G      + + K K +D+++YDGI
Subjt:  LLDCKIGNSPVECSRNLSAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPK-YDIVVYDGI

Query:  CTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVK
         TEET+RMIG +SK RLY KYLRS+AEKTDLGRL +PSIMR VDE+M I+S  S   G TS   W  LER LE G+SA  +P RF  F+VMDP +P SVK
Subjt:  CTEETIRMIGATSKARLYLKYLRSIAEKTDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVK

Query:  SASRYWGCTIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLL
        +A RYWGCT+QAG+ +SGAFA  SS L ++     K +F PL           + +DW+ +LLD ++   R LLS + SH ++L   V F+   K VTL 
Subjt:  SASRYWGCTIQAGAQISGAFASISSGLDAESAARLKENFSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLL

Query:  MPGFEKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM
        MPGFEKSEI+LYQYRGGSELL+EAGDQRRVI LP +IQGKVGGAKF+DRSL++TM
Subjt:  MPGFEKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGTCGTCTCTGCTCTTCTCTGCTTCTTTCTTCGGGCACCCAATTCCCATCTCAATACGAACAAGAACAGCTCCATGTAGAAGAAGATCTATGGCCATTGAGGCTTC
AAAAGAGGTTACGGACGTTTCTTCTCAAAACCAACCCAGGATGCTCACTTTTCTTGGCAAAGGCGGCTCAGGAAAGACTACTTCCGCAGTATTCGCCGCTCGGCACTTTG
CATTGTCTGGACTGCGCACATGCCTGGTGATACATAATCAAGACGCTACTCCTGAGTATCTTCTTGATTGCAAAATTGGAAATTCTCCCGTTGAATGCAGTCGCAACCTT
TCAGCTGTTAGGTTGGAAACCACTCAAATGCTTCTTGAACCTCTTAAACGGCTAAAGCAAGCTGATTCCCGTCTTAATATGACACAAGGAACTCTTGAAGGGATTGTTGG
AGAAGAGCTTGGGATACTCCCAGGAATGGATTCTATCTTTTCAGTACTTCAGCTTGAGAGATTTCTTGGGTTATCAGGTATTATGGCCCAAACAGACCAAAAACCTAAAT
ATGACATAGTAGTATATGACGGTATCTGCACCGAAGAAACGATAAGGATGATCGGAGCAACCAGTAAAGCAAGGTTGTACTTAAAATATCTGAGGAGTATTGCTGAAAAA
ACTGATCTTGGGAGGTTGGCTACTCCTTCAATTATGAGACTTGTTGATGAAGCCATGAAAATAAGCAGCCCAGGCTCCCATCTCAGTGGTAGAACCAGTACCGATACATG
GCAGGCACTGGAACGCATGTTAGAGAAAGGATCTTCTGCAATTGCAGAGCCAAGAAGATTTAGCTGCTTCATAGTGATGGACCCAACTAGTCCTGCCTCTGTGAAGTCTG
CATCACGGTACTGGGGTTGCACTATTCAAGCTGGTGCACAAATTTCTGGTGCATTTGCTTCCATTTCTTCAGGCTTGGATGCAGAATCAGCTGCTAGATTGAAGGAGAAT
TTTTCACCCTTGTCTTTGACCTTTATGCCACAGATCTCAGTTGGTTCCCCAGTAGACTGGAACACAGTTCTTCTTGATGCATCAAGTAAAGGCCCGAGGAACCTTCTTTC
TTCGTCCAAAAGCCACTCCAGCAATCTGCCATCACCTGTAAAATTCAATCCTGGAAACAAATCGGTTACACTTCTCATGCCAGGCTTTGAGAAGTCAGAAATCAGGCTTT
ACCAGTATAGGGGAGGATCTGAGCTATTGGTGGAAGCTGGGGATCAGAGGCGTGTAATTTCTCTGCCTAAAGAAATTCAAGGGAAGGTGGGTGGTGCCAAGTTCATGGAC
AGAAGTCTTGTGATCACGATGCCGTTAAGTTACAGGTATGATATGATTGCTGAAATTAAATTAGGATGA
mRNA sequenceShow/hide mRNA sequence
CGATAGCGGCAGAGCGGGAAGTGCCTAAATGGCGTTGCCAGAAGAACAGCGTTAGCTACTTCACATGCTTATGTTTGGCTATGCCTATATAAACGCAGCCGGCGACTCCA
CCCACCTACGCGAACATTGTTGCAGGTTCCGATATCCATGGCGTCGTCTCTGCTCTTCTCTGCTTCTTTCTTCGGGCACCCAATTCCCATCTCAATACGAACAAGAACAG
CTCCATGTAGAAGAAGATCTATGGCCATTGAGGCTTCAAAAGAGGTTACGGACGTTTCTTCTCAAAACCAACCCAGGATGCTCACTTTTCTTGGCAAAGGCGGCTCAGGA
AAGACTACTTCCGCAGTATTCGCCGCTCGGCACTTTGCATTGTCTGGACTGCGCACATGCCTGGTGATACATAATCAAGACGCTACTCCTGAGTATCTTCTTGATTGCAA
AATTGGAAATTCTCCCGTTGAATGCAGTCGCAACCTTTCAGCTGTTAGGTTGGAAACCACTCAAATGCTTCTTGAACCTCTTAAACGGCTAAAGCAAGCTGATTCCCGTC
TTAATATGACACAAGGAACTCTTGAAGGGATTGTTGGAGAAGAGCTTGGGATACTCCCAGGAATGGATTCTATCTTTTCAGTACTTCAGCTTGAGAGATTTCTTGGGTTA
TCAGGTATTATGGCCCAAACAGACCAAAAACCTAAATATGACATAGTAGTATATGACGGTATCTGCACCGAAGAAACGATAAGGATGATCGGAGCAACCAGTAAAGCAAG
GTTGTACTTAAAATATCTGAGGAGTATTGCTGAAAAAACTGATCTTGGGAGGTTGGCTACTCCTTCAATTATGAGACTTGTTGATGAAGCCATGAAAATAAGCAGCCCAG
GCTCCCATCTCAGTGGTAGAACCAGTACCGATACATGGCAGGCACTGGAACGCATGTTAGAGAAAGGATCTTCTGCAATTGCAGAGCCAAGAAGATTTAGCTGCTTCATA
GTGATGGACCCAACTAGTCCTGCCTCTGTGAAGTCTGCATCACGGTACTGGGGTTGCACTATTCAAGCTGGTGCACAAATTTCTGGTGCATTTGCTTCCATTTCTTCAGG
CTTGGATGCAGAATCAGCTGCTAGATTGAAGGAGAATTTTTCACCCTTGTCTTTGACCTTTATGCCACAGATCTCAGTTGGTTCCCCAGTAGACTGGAACACAGTTCTTC
TTGATGCATCAAGTAAAGGCCCGAGGAACCTTCTTTCTTCGTCCAAAAGCCACTCCAGCAATCTGCCATCACCTGTAAAATTCAATCCTGGAAACAAATCGGTTACACTT
CTCATGCCAGGCTTTGAGAAGTCAGAAATCAGGCTTTACCAGTATAGGGGAGGATCTGAGCTATTGGTGGAAGCTGGGGATCAGAGGCGTGTAATTTCTCTGCCTAAAGA
AATTCAAGGGAAGGTGGGTGGTGCCAAGTTCATGGACAGAAGTCTTGTGATCACGATGCCGTTAAGTTACAGGTATGATATGATTGCTGAAATTAAATTAGGATGA
Protein sequenceShow/hide protein sequence
MASSLLFSASFFGHPIPISIRTRTAPCRRRSMAIEASKEVTDVSSQNQPRMLTFLGKGGSGKTTSAVFAARHFALSGLRTCLVIHNQDATPEYLLDCKIGNSPVECSRNL
SAVRLETTQMLLEPLKRLKQADSRLNMTQGTLEGIVGEELGILPGMDSIFSVLQLERFLGLSGIMAQTDQKPKYDIVVYDGICTEETIRMIGATSKARLYLKYLRSIAEK
TDLGRLATPSIMRLVDEAMKISSPGSHLSGRTSTDTWQALERMLEKGSSAIAEPRRFSCFIVMDPTSPASVKSASRYWGCTIQAGAQISGAFASISSGLDAESAARLKEN
FSPLSLTFMPQISVGSPVDWNTVLLDASSKGPRNLLSSSKSHSSNLPSPVKFNPGNKSVTLLMPGFEKSEIRLYQYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMD
RSLVITMPLSYRYDMIAEIKLG