; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC09g1556 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC09g1556
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionP-loop containing nucleoside triphosphate hydrolases superfamily protein
Genome locationMC09:21284640..21290846
RNA-Seq ExpressionMC09g1556
SyntenyMC09g1556
Gene Ontology termsNA
InterPro domainsIPR008978 - HSP20-like chaperone
IPR025723 - Anion-transporting ATPase-like domain
IPR027417 - P-loop containing nucleoside triphosphate hydrolase
IPR040612 - ArsA, HSP20-like domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008446550.1 PREDICTED: uncharacterized protein At1g26090, chloroplastic [Cucumis melo]1.02e-25480.39Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD
        MASSLL+S SFFGNPIPIS+  RT   ++    R + +Q+SK+ MD   Q PTRLLTFLGKGGSGKT+SAVFAAQHFAL+GLRTCLVI NQDPT EYLLD
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD

Query:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE
        CKIGNSPVEC HNLSAVRLETTQMLLEPLK+L+QADSRLNMTQGVLEGVVGEEL VLPGMDS+FS+L LE+F+GFS  M QRD+K  YDIVIYDG+ TEE
Subjt:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE

Query:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR
        TIR++GA SK RLYLKY+RS AEKTDLGRLATPSILRLVDEAM ISRPGSHL GRTSTDIWE LE +LE+GSSAF+EP KF C+IVMDPTSPASVQSALR
Subjt:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR

Query:  YWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGF
        YWGCTIQAGAQI GA AF SSH +AE+ + LKE FSPLSLAF+P+FSIGS VDWNTVL DASSKGPRDLLSSSKS  SSL+ PVKFDPGN+SVTL MPGF
Subjt:  YWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGF

Query:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
         KSEIKLYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

XP_022150659.1 uncharacterized protein At1g26090, chloroplastic [Momordica charantia]0.098.46Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKI
        MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKI
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKI

Query:  GNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIR
        GNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIR
Subjt:  GNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIR

Query:  IMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWG
        IMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWG
Subjt:  IMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWG

Query:  CTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKS
        CTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKS
Subjt:  CTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKS

Query:  EIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        EIKLYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

XP_022956773.1 uncharacterized protein At1g26090, chloroplastic [Cucurbita moschata]6.84e-25379.52Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD
        MASSLL+S SFFG+PIPIS+  RT       RRR++ +++SKE+ D   Q   R+LTFLGKGGSGKT+SAVFAA+HFAL+GLRTCLVIHNQD T EYLLD
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD

Query:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE
        CKIGNSPVEC  NLSAVRLETTQMLLEPLK+L+QADSRLNMTQG LEG+VGEELG+LPGMDS+FSVL LE+FLG S  MAQ D+K  YDIV+YDGI TEE
Subjt:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE

Query:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR
        TIR++GA SKARLYLKY+RS AEKTDLGRLATPSI+RLVDEAM IS PGSHLSGRTSTD W+ALERMLE+GSSA +EP +F CFIVMDPTSPASV+SA R
Subjt:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR

Query:  YWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGF
        YWGCTIQAGAQISGAFA ISS LDAES +RLKENFSPLSL FMP+ S+GSPVDWNTVL DASSKGPR+LLSSSKSH S+L SPVKF+PGN+SVTL MPGF
Subjt:  YWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGF

Query:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        EKSEI+LYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

XP_022979170.1 uncharacterized protein At1g26090, chloroplastic [Cucurbita maxima]8.76e-25680.39Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD
        MASSLL+S SFFG+PIPIS+  RT       RRR++ +++SKE+ D   Q   R+LTFLGKGGSGKT+SAVFAA+HFAL+GLRTCLVIHNQD T EYLLD
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD

Query:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE
        CKIG+SPVEC HNLSAVRLETTQMLLEPLK+L+QADS LNMTQG LEGVVGEELG+LPGMDS+FSVL LE+FLGFS  MAQ D+KA YDIV+YDGI TEE
Subjt:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE

Query:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR
        TIR++GA SKARLYLKY+RS AEKTDLGRLATPSILRLVDEAM IS PGSHLSGRTSTD W+ALE MLE+GSSA +EP +F CFIVMDPTSPASV+SALR
Subjt:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR

Query:  YWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGF
        YWGCTIQAGAQISGAFA ISS LDAES +RLKENF PL LAFMP+ S+GSPVDWNTVL DASSKGPR+LLSSSKSH S+LLSPVKFDPGN+SVTL MPGF
Subjt:  YWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGF

Query:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        EKSEI+LYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

XP_038891424.1 uncharacterized protein At1g26090, chloroplastic [Benincasa hispida]3.51e-25481.26Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD
        MASSL +S SFFGNPIPIS+  RT       R R + +Q+SKEI D   Q PTR+LTFLGKGGSGKT+SAVFAAQHFAL+GLRTCLVIHNQDPT EYLLD
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD

Query:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE
        CKIGNSPVEC  NLSAVRLETTQMLLEPLK+L+QADSRLNMTQG+LEGVVGEELGVLPG DS+FS+L LE+FLGFS  M QRD+K  YD+VIYDGI TEE
Subjt:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE

Query:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR
        TIR++GA SKARLYLKY+RS AEKTDLGRLATPSILRLVDEAM ISRPGSHLS RTSTDIWEALE +LE+GSSAF+EP KF CFIVMDPTSPASVQSALR
Subjt:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR

Query:  YWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGF
        YWGCTIQAG QISGA AFISSHL AES + LKE FSPLSLAFMP+FS GS VDWNTVL DASSKGPRDLLS SKS  SSLLSPVKFDPGN+SVTL MPGF
Subjt:  YWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGF

Query:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
         KSEIKLYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAK  DR LVITMR
Subjt:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

TrEMBL top hitse value%identityAlignment
A0A1S3BET7 uncharacterized protein At1g26090, chloroplastic4.93e-25580.39Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD
        MASSLL+S SFFGNPIPIS+  RT   ++    R + +Q+SK+ MD   Q PTRLLTFLGKGGSGKT+SAVFAAQHFAL+GLRTCLVI NQDPT EYLLD
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD

Query:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE
        CKIGNSPVEC HNLSAVRLETTQMLLEPLK+L+QADSRLNMTQGVLEGVVGEEL VLPGMDS+FS+L LE+F+GFS  M QRD+K  YDIVIYDG+ TEE
Subjt:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE

Query:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR
        TIR++GA SK RLYLKY+RS AEKTDLGRLATPSILRLVDEAM ISRPGSHL GRTSTDIWE LE +LE+GSSAF+EP KF C+IVMDPTSPASVQSALR
Subjt:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR

Query:  YWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGF
        YWGCTIQAGAQI GA AF SSH +AE+ + LKE FSPLSLAF+P+FSIGS VDWNTVL DASSKGPRDLLSSSKS  SSL+ PVKFDPGN+SVTL MPGF
Subjt:  YWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGF

Query:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
         KSEIKLYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

A0A5A7STS2 ArsA_ATPase domain-containing protein1.43e-24283.33Show/hide
Query:  EIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGV
        ++  Q PTRLLTFLGKGGSGKT+SAVFAAQHFAL+GLRTCLVI NQDPT EYLLDCKIGNSPVEC HNLSAVRLETTQMLLEPLK+L+QADSRLNMTQGV
Subjt:  EIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGV

Query:  LEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGI
        LEGVVGEEL VLPGMDS+FS+L LE+F+GFS  M QRD+K  YDIVIYDG+ TEETIR++GA SK RLYLKY+RS AEKTDLGRLATPSILRLVDEAM I
Subjt:  LEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGI

Query:  SRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPK
        SRPGSHL GRTSTDIWE LE +LE+GSSAF+EP KF C+IVMDPTSPASVQSALRYWGCTIQAGAQI GA AF SSH +AE+ + LKE FSPLSLAF+P+
Subjt:  SRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPK

Query:  FSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVG
        FSIGS VDWNTVL DASSKGPRDLLSSSKS  SSL+ PVKFDPGN+SVTL MPGF KSEIKLYQARS      GGSELLVEAGDQRRVISLPKEIQGKVG
Subjt:  FSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVG

Query:  GAKFMDRSLVITMR
        GAKFMDRSLVITMR
Subjt:  GAKFMDRSLVITMR

A0A6J1D944 uncharacterized protein At1g26090, chloroplastic0.098.46Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKI
        MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKI
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKI

Query:  GNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIR
        GNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIR
Subjt:  GNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIR

Query:  IMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWG
        IMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWG
Subjt:  IMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWG

Query:  CTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKS
        CTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKS
Subjt:  CTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKS

Query:  EIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        EIKLYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

A0A6J1GY43 uncharacterized protein At1g26090, chloroplastic3.31e-25379.52Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD
        MASSLL+S SFFG+PIPIS+  RT       RRR++ +++SKE+ D   Q   R+LTFLGKGGSGKT+SAVFAA+HFAL+GLRTCLVIHNQD T EYLLD
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD

Query:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE
        CKIGNSPVEC  NLSAVRLETTQMLLEPLK+L+QADSRLNMTQG LEG+VGEELG+LPGMDS+FSVL LE+FLG S  MAQ D+K  YDIV+YDGI TEE
Subjt:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE

Query:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR
        TIR++GA SKARLYLKY+RS AEKTDLGRLATPSI+RLVDEAM IS PGSHLSGRTSTD W+ALERMLE+GSSA +EP +F CFIVMDPTSPASV+SA R
Subjt:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR

Query:  YWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGF
        YWGCTIQAGAQISGAFA ISS LDAES +RLKENFSPLSL FMP+ S+GSPVDWNTVL DASSKGPR+LLSSSKSH S+L SPVKF+PGN+SVTL MPGF
Subjt:  YWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGF

Query:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        EKSEI+LYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

A0A6J1ISG8 uncharacterized protein At1g26090, chloroplastic4.24e-25680.39Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD
        MASSLL+S SFFG+PIPIS+  RT       RRR++ +++SKE+ D   Q   R+LTFLGKGGSGKT+SAVFAA+HFAL+GLRTCLVIHNQD T EYLLD
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMD---QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLD

Query:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE
        CKIG+SPVEC HNLSAVRLETTQMLLEPLK+L+QADS LNMTQG LEGVVGEELG+LPGMDS+FSVL LE+FLGFS  MAQ D+KA YDIV+YDGI TEE
Subjt:  CKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEE

Query:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR
        TIR++GA SKARLYLKY+RS AEKTDLGRLATPSILRLVDEAM IS PGSHLSGRTSTD W+ALE MLE+GSSA +EP +F CFIVMDPTSPASV+SALR
Subjt:  TIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALR

Query:  YWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGF
        YWGCTIQAGAQISGAFA ISS LDAES +RLKENF PL LAFMP+ S+GSPVDWNTVL DASSKGPR+LLSSSKSH S+LLSPVKFDPGN+SVTL MPGF
Subjt:  YWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGF

Query:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        EKSEI+LYQ       YRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
Subjt:  EKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

SwissProt top hitse value%identityAlignment
O52027 Putative arsenical pump-driving ATPase1.1e-0628.99Show/hide
Query:  SKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGH-NLSAVRLETTQMLLE----PLKQLRQADSR
        ++E++  + TR L F GKGG GK++ A   A   A AG  T +V  +       + +  +G+ P   G  NL A R++  + L E     L  +R+    
Subjt:  SKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGH-NLSAVRLETTQMLLE----PLKQLRQADSR

Query:  LNMTQGVLEGVVG--EELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIM
         + TQ  +E  V   EE    P  + + +   LEKF+ + E       +  YDIV++D   T  T+R++
Subjt:  LNMTQGVLEGVVG--EELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIM

Q46366 Putative arsenical pump-driving ATPase2.1e-1322.7Show/hide
Query:  RLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEE
        R+LTF GKGG GKTS +   A   +  G RT ++  +   +     + ++G  P +   NL A+ +     L +    +++  +R+ M QGV  GV+ +E
Subjt:  RLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEE

Query:  LGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLS
        + +LPGM+ +FS+L ++++               YD ++ D   T ET+R++              S  +    G  A  ++ + +     +S+P S +S
Subjt:  LGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLS

Query:  GRTS-----TDIWEALERM---LERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPK
         + +      D  E+++++   LE      ++  K    +VM+     S++  +R        G ++      ++  LDA+  S   E +  +   ++ +
Subjt:  GRTS-----TDIWEALERM---LERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPK

Query:  FSIG-SPVDWNTV-LHDASSKGPRDLL--------SSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKSEIKLYQARSFFL-LYRGGSELLVEAGDQRRVI
           G SP+    + ++D    G + L          +  S +     P+KF        +      + ++KL  A    + ++  G EL V+ G+QR++I
Subjt:  FSIG-SPVDWNTV-LHDASSKGPRDLL--------SSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKSEIKLYQARSFFL-LYRGGSELLVEAGDQRRVI

Query:  SLPKEIQG-KVGGAKFMDRSLVI
        +LP  + G + G A F D+ L I
Subjt:  SLPKEIQG-KVGGAKFMDRSLVI

Q46465 Putative arsenical pump-driving ATPase5.5e-1422.9Show/hide
Query:  RLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEE
        R+LTF GKGG GKTS +   A   +  G RT ++  +   +     + ++G  P +   NL A+ +     L E    +++  +R+ M QGV  GV+ +E
Subjt:  RLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEE

Query:  LGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLS
        + +LPGM+ +FS+L ++++               YD ++ D   T ET+R++              S  +    G  A  ++ + +     +S+P S +S
Subjt:  LGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLS

Query:  GRTS-----TDIWEALERM---LERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPK
         + +      D  E+++++   LE      ++  K    +VM+     S++  +R        G ++      ++  LDA+  S   E +  +   ++ +
Subjt:  GRTS-----TDIWEALERM---LERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPK

Query:  FSIG-SPVDWNTV-LHDASSKGPRDLL--------SSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKSEIKLYQARSFFL------LYRGGSELLVEAGD
           G SP+    + ++D    G + L          +  S +     P+KF               K +I   Q +  F       ++  G EL V+ G+
Subjt:  FSIG-SPVDWNTV-LHDASSKGPRDLL--------SSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKSEIKLYQARSFFL------LYRGGSELLVEAGD

Query:  QRRVISLPKEIQG-KVGGAKFMDRSLVI
        QR++I+LP  + G + G A F D+ L I
Subjt:  QRRVISLPKEIQG-KVGGAKFMDRSLVI

Q55794 Putative arsenical pump-driving ATPase1.1e-1121.39Show/hide
Query:  RLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEE
        R++   GKGG GKTS A       A  G +T ++  +   +     D ++G+ P     NL    L+    L      +++  +++   +G L+GV  EE
Subjt:  RLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEE

Query:  LGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLS
        L +LPGMD +F ++           M +   +A+YD++I D   T   +R++        Y++      +   +     P +  L     G S P   + 
Subjt:  LGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLS

Query:  GRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTI-QAGAQISGAFAFISSHLDAESVSRLK-----------ENFSPLSLA
             + +E +E +        ++ ++    +V +P      +S   +   ++      +  A   +   +D     R K           +NF PL + 
Subjt:  GRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTI-QAGAQISGAFAFISSHLDAESVSRLK-----------ENFSPLSLA

Query:  FMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQ
          P FS    +     L        +D   S   +  + ++ V+    + S+ L++PG  K +I+L +          G EL V  G+ RR + LP+ + 
Subjt:  FMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQ

Query:  G-KVGGAKFMDRSLVI
             GAK  D  L I
Subjt:  G-KVGGAKFMDRSLVI

Q6DYE4 Uncharacterized protein At1g26090, chloroplastic1.5e-14157.63Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRA--LPVQSSKEIMD------QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTS
        + +S L  +S   N +PI   +RT   + +R+RRA  +   SS+++ D      QK T+ +TFLGKGGSGKT++AVFAAQH+ALAGL TCLVIHNQDP++
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRA--LPVQSSKEIMD------QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTS

Query:  EYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKAN-YDIVIYD
        E+LL  KIG SP     NLS +RLETT+MLLEPLKQL+QAD+RLNMTQGVLEGVVGEELGVLPGMDS+FS+L LE+ +GF     +++ K   +D++IYD
Subjt:  EYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKAN-YDIVIYD

Query:  GISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPAS
        GISTEET+R++G +SK RLY KY+RS AEKTDLGRL +PSI+R VDE+M I+   S   G TS  +W+ LER LE G+SA+ +P +F  F+VMDP +P S
Subjt:  GISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPAS

Query:  VQSALRYWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVT
        V++ALRYWGCT+QAG+ +SGAFA  SSHL ++     K +F PL  A        + +DW+ +L D ++   R+LLS + SH +SL   V FD   + VT
Subjt:  VQSALRYWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVT

Query:  LFMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        LFMPGFEKSEIKLYQ       YRGGSELL+EAGDQRRVI LP +IQGKVGGAKF+DRSL++TMR
Subjt:  LFMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR

Arabidopsis top hitse value%identityAlignment
AT1G26090.1 P-loop containing nucleoside triphosphate hydrolases superfamily protein1.1e-14257.63Show/hide
Query:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRA--LPVQSSKEIMD------QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTS
        + +S L  +S   N +PI   +RT   + +R+RRA  +   SS+++ D      QK T+ +TFLGKGGSGKT++AVFAAQH+ALAGL TCLVIHNQDP++
Subjt:  MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRA--LPVQSSKEIMD------QKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTS

Query:  EYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKAN-YDIVIYD
        E+LL  KIG SP     NLS +RLETT+MLLEPLKQL+QAD+RLNMTQGVLEGVVGEELGVLPGMDS+FS+L LE+ +GF     +++ K   +D++IYD
Subjt:  EYLLDCKIGNSPVECGHNLSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKAN-YDIVIYD

Query:  GISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPAS
        GISTEET+R++G +SK RLY KY+RS AEKTDLGRL +PSI+R VDE+M I+   S   G TS  +W+ LER LE G+SA+ +P +F  F+VMDP +P S
Subjt:  GISTEETIRIMGAASKARLYLKYMRSAAEKTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPAS

Query:  VQSALRYWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVT
        V++ALRYWGCT+QAG+ +SGAFA  SSHL ++     K +F PL  A        + +DW+ +L D ++   R+LLS + SH +SL   V FD   + VT
Subjt:  VQSALRYWGCTIQAGAQISGAFAFISSHLDAESVSRLKENFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVT

Query:  LFMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR
        LFMPGFEKSEIKLYQ       YRGGSELL+EAGDQRRVI LP +IQGKVGGAKF+DRSL++TMR
Subjt:  LFMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGKVGGAKFMDRSLVITMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCGTCTCTGCTATATTCCACTTCTTTCTTTGGAAACCCAATTCCCATTTCAATGCCAATTCGAACCGGAAGAGCAGCATCTACTCGCAGGAGAAGAGCTCTGCC
AGTCCAGTCTTCCAAAGAGATTATGGACCAGAAACCAACCAGGCTGCTCACTTTTCTTGGCAAAGGCGGCTCGGGGAAGACCTCTTCAGCGGTATTCGCCGCTCAGCACT
TTGCATTGGCTGGACTGCGGACGTGTCTGGTGATACATAATCAAGACCCTACGTCTGAGTATCTTCTGGATTGTAAAATTGGGAATTCTCCCGTCGAATGCGGTCACAAC
CTCTCAGCTGTTAGGTTGGAAACCACTCAAATGCTTCTTGAACCTCTCAAACAGCTAAGGCAAGCAGATTCTCGTCTTAACATGACACAAGGAGTTCTTGAAGGGGTGGT
TGGAGAAGAGCTTGGAGTACTTCCAGGAATGGATTCTGTCTTCTCGGTACTTCTACTTGAGAAATTTCTTGGGTTCTCAGAGAATATGGCCCAAAGAGACCGAAAAGCTA
ACTATGACATAGTAATATATGACGGTATCAGCACCGAGGAAACAATAAGGATCATGGGAGCGGCCAGTAAAGCGAGGTTGTACCTAAAATATATGAGGAGCGCTGCTGAA
AAAACCGATCTTGGGAGATTGGCTACTCCTTCAATTTTGAGGCTTGTTGATGAAGCCATGGGTATAAGCAGGCCAGGCTCCCATCTCAGTGGTAGAACCAGTACAGATAT
ATGGGAGGCACTGGAACGCATGTTAGAGAGAGGGTCTTCTGCATTTTCAGAGCCAAGTAAATTTGGCTGCTTTATAGTGATGGATCCAACTAGTCCTGCCTCTGTTCAGT
CTGCATTACGGTACTGGGGTTGCACTATTCAAGCTGGTGCACAAATTTCTGGTGCATTTGCTTTCATTTCTTCACACCTGGATGCAGAATCCGTTTCTAGGTTGAAGGAG
AACTTTTCCCCTTTATCTTTGGCCTTTATGCCAAAGTTCTCAATTGGTTCCCCTGTAGATTGGAACACAGTTCTGCATGATGCATCAAGTAAAGGCCCGAGAGACCTTCT
GTCTTCGTCAAAAAGCCATATCAGCAGTCTGCTATCACCTGTAAAATTCGATCCTGGAAACAGATCGGTTACACTTTTCATGCCAGGCTTCGAGAAGTCAGAAATCAAGC
TTTACCAGGCACGTTCATTCTTTCTATTATATAGGGGAGGGTCTGAGCTGTTGGTAGAAGCTGGAGATCAGAGGCGTGTCATTTCTCTGCCTAAAGAAATTCAAGGGAAG
GTGGGTGGTGCCAAGTTCATGGACAGAAGTCTTGTGATCACAATGCGTTGA
mRNA sequenceShow/hide mRNA sequence
AGGTAGCCGAGCCGGAGCGGGAAGTGCGGAAATGGAGGTGATATAAAGCGAAATTCCTCCACCTGTTCCAGTCCAGTTACCTATCAGGACATTGCTCTATGGCTTCGTCT
CTGCTATATTCCACTTCTTTCTTTGGAAACCCAATTCCCATTTCAATGCCAATTCGAACCGGAAGAGCAGCATCTACTCGCAGGAGAAGAGCTCTGCCAGTCCAGTCTTC
CAAAGAGATTATGGACCAGAAACCAACCAGGCTGCTCACTTTTCTTGGCAAAGGCGGCTCGGGGAAGACCTCTTCAGCGGTATTCGCCGCTCAGCACTTTGCATTGGCTG
GACTGCGGACGTGTCTGGTGATACATAATCAAGACCCTACGTCTGAGTATCTTCTGGATTGTAAAATTGGGAATTCTCCCGTCGAATGCGGTCACAACCTCTCAGCTGTT
AGGTTGGAAACCACTCAAATGCTTCTTGAACCTCTCAAACAGCTAAGGCAAGCAGATTCTCGTCTTAACATGACACAAGGAGTTCTTGAAGGGGTGGTTGGAGAAGAGCT
TGGAGTACTTCCAGGAATGGATTCTGTCTTCTCGGTACTTCTACTTGAGAAATTTCTTGGGTTCTCAGAGAATATGGCCCAAAGAGACCGAAAAGCTAACTATGACATAG
TAATATATGACGGTATCAGCACCGAGGAAACAATAAGGATCATGGGAGCGGCCAGTAAAGCGAGGTTGTACCTAAAATATATGAGGAGCGCTGCTGAAAAAACCGATCTT
GGGAGATTGGCTACTCCTTCAATTTTGAGGCTTGTTGATGAAGCCATGGGTATAAGCAGGCCAGGCTCCCATCTCAGTGGTAGAACCAGTACAGATATATGGGAGGCACT
GGAACGCATGTTAGAGAGAGGGTCTTCTGCATTTTCAGAGCCAAGTAAATTTGGCTGCTTTATAGTGATGGATCCAACTAGTCCTGCCTCTGTTCAGTCTGCATTACGGT
ACTGGGGTTGCACTATTCAAGCTGGTGCACAAATTTCTGGTGCATTTGCTTTCATTTCTTCACACCTGGATGCAGAATCCGTTTCTAGGTTGAAGGAGAACTTTTCCCCT
TTATCTTTGGCCTTTATGCCAAAGTTCTCAATTGGTTCCCCTGTAGATTGGAACACAGTTCTGCATGATGCATCAAGTAAAGGCCCGAGAGACCTTCTGTCTTCGTCAAA
AAGCCATATCAGCAGTCTGCTATCACCTGTAAAATTCGATCCTGGAAACAGATCGGTTACACTTTTCATGCCAGGCTTCGAGAAGTCAGAAATCAAGCTTTACCAGGCAC
GTTCATTCTTTCTATTATATAGGGGAGGGTCTGAGCTGTTGGTAGAAGCTGGAGATCAGAGGCGTGTCATTTCTCTGCCTAAAGAAATTCAAGGGAAGGTGGGTGGTGCC
AAGTTCATGGACAGAAGTCTTGTGATCACAATGCGTTGACGTAATTAATCACAAAAATCCATTTGGCTCCCATTTCAGATCTTGAGCATTTCTCTTCAACATTCTGACAC
CATCAATGGTGATGGAAAAAATTCCTGGCTAGGGGGCTCAACTAGGGCGGAGAATTTGCAATTTGACCATGGTTGGTCTGAAATTAAGGATAAAGAGAGAACTGTATATT
TCCAATTCTATATTTCTGATTCATTTCTATTTGACTGTCTATTCTTGAATTGAATTCAGTATTTTCTTGCTTTTTAATTCCATACTGTACAGCGCACAGAAGCTACCAGA
CTGAGTACATAATCTCTCGCATCTTTCCAGTAGATTGGTAAATATACAATGTCCAATGCGAATGATCTTCTGTAGTAGCATTGAGTTACATGTAGTATATGATTGCTGAA
ATTAAATTGGGATAATAATGAACTTGAAGAAAAAAATTCAGGCTTCTTGCTTATTGTCGTCTCCTATCTGTGTTGTTTGAACTCGGCTCCTCAATTCTTCAAAAGCCTTC
ATGCTCTCCGCATCGAAGCCTCCTGCAAACTGGAAACATATAGAATCCAATGTCAGCTTTGCAGGTTTCGTTTATCAGTTAGAACTTGATTGGAAACAAGATTATTCAAC
TTTGAAATGCAGTTGCTCGTCTGATGATGGCTTGTAACTATTGTTTTCTTTTTGAAGAATGAAGAGAGGCAATATGTACCTGATGAACACGGAAAGCAAACCGTACGCCT
TGTAAGACCAAAAACTCTCTGTTCGGCTCCTTCTCCAGTTCGTTCATTGCATCAAGTTCTGATGCTACACGCTTCATGTATTTCCTTGCTAATTGTACAGAAGACAGCTT
AATCTGCACTTGTTAAAGTATGATTACATTACACATTTTGAGACATTTATGGCCGCATATGAACATACTCCATATTTGCAAGTTCCAGTACTTTTGCTAACAAATGCTAA
CTATGCTCAAAATAAAAATTTCTTGATAACGAAAAGATTGAACTTTTCAAGCTATACCTTTCCAACAACTCCTGTGTCTGACAACCAATCAACTGGAATTCCAAACTCTC
GATAACGCGAGATAGCCATGTCTCTTGTGCGTAGGAGTGCATAGACACTCTGCTCAACCCTGTCAAAAGACGAGAACAAAAACAATGGTTGGAGATGACAACATAATTAG
AGACGGGGATATGTTCTCAATATTTGAAAATGGCAGCACACAGTAATGCATATGCATTAGTTCGAAAGCAGTGACACAGTAAATTTTTTGACAAATTTTTTGACAGTTGA
TCATATCAAAATTTACAAGAATCATGAGATGGTGATTGAACTACTTAACATGCCAAAGATGAGTATTTCCAAAATGGTTGACGTTCCATCACAAATTCTCTTCCAGGAAC
CGCGTCGTTGTAAGGGAATACTTACTTTTCAAGCAAGGAGTACATTTTCTTTAAAGCTGCTTCACATTGGAGTTTAGGGTCATCAACAAACGTGGTGATCCGCTTCTCCA
ACTTCATTAGGTCTTGATATTCAAAAGACGCCTCTCTCAATGCATCTGCTTTACCTTCTGGCCAATCAAAGTGCTTGAGCACTGCCCTTTCATCAACCTTAATCAAAGGC
AGAAAAGAAAGAAACTAGTAAACATAAAGGTCTAAGGAACTGAGTTCAGACTTGAGGACTAGCTGGATTACAGCAATGCACAACAAAAACCACTTCTTTCAATTTATTTT
CAACCTATTCAGAATGATAGCCATTAGCCAGTCCTTTTCTTCTGCATCAAAATTCAATGCATATTGGAGTTCAGTAAATAAAATGACCGTCTTACCAAGAATGATAGCTC
TTCATCTAACCAATTAACGAAGGCCACAACATCCTCGATATTAGAGAATGTAGCTGCTCGTACTTCTGCCGCTAATGACATGACAAAATCACCTTGAGTTTCAACATCAG
CTTTTACCTGGATTTAGTCAACAAAGTTACCAACTGGCAAATCACTATTAATGACAAAGATGAGAAGATAATGCAAAGGCAAAAGCAAAAGGATGATCTACTAACCGCTA
AGAGGAATGATGATCTATTCTCAATCTCCCCAATCATGTTACTTCTGGCATCAGATACATTAGATGATGTAGAAGAAAGTAAAGGAGTATCCTTCTTTGCTTCTCGTTTC
ATCAATGTCTGATAGAATTCAACTAACTCAGGCGCTCTGTGAACCTTATCACCACCTACACCCTTAGACAAGCTTCCTGGAGGAGGAGGCGGTCGAGGTGGTCCACCAGC
TGGAGGCAGTGGTGGGGCACCAGGAGGTGGAGGTGGTAGAGGTGGAGCAGCTGGTACTCCACCCTGAGGATTGGGATTTGTACTTACAGAAGCACCTGCAGATGGTTTTG
GTGGTGGCTTAGGTGTCCGTGGAGATCGCTTCTCAATCTCTGCTAGCTTCATCCTACTGATAGCTGGAGACTCTGTTGTCTTATTTTCACCAGATGGATCAGCAGAATCA
CTAGATACAACTGGTTTCTCCTTTATTAGAGAAAGTTTTGGCGGCAAAACTGCAGGTTTCTCTCTCTCGGTCTTACCTTTAAATCCAGAGTTCAAATTTGAATTTGAAAT
ATTGCCAAACCTTTCTGCTCTTGCCTGATCGGCCCTTTCCTTAATTTGCTTCTCCCTTGCTAATGCCAACTTATGTCGGTCTTTGTATGCTGGATATTTCTCATCTAACA
CTCCATCAACAGATTTAGACATCAGTTGGAATGATGTTGCCACTGAATTCAAGGAGTCACTAGAAGGTGTTTGTGTTCTGATATTAGGGAGATTCGGAGTGCCCGGAGAG
TCGGGAGTTTCCTGTTCCATCGTACCAAAGGTGGTGATTGCGACACTATCACTAGCATTTCTGAGCATCAACATTTCTAATGGACCCCTTTGCTTCTGACTCATGCTCAT
CCTGCTTGGAGAACCCCCAGAAAAGGATCTGGCTGGTGATGAAACAACACTAGAATCATCTTTGCTTTTACCACCCCATTTCTTCAACTTCTGGAGCAAGCTGGGTTTCT
TACTGAGACTACTATATCTACTAAAGGAACTATCTATTGAAGCATTGTCAAAATCCTCACTTCCAGGAGAAGATGGTTGGGAGAAGTTGCTTTCAAGATCTGTGTCCCCT
TGTCCACGTTCTGATCCAGCATACTCCAACATGAGCTGCTTAGCCTTCTCCTGAGATTTTGGGCTTAAATTCTTGTTGAGGTCACGAGCTGATACTTTTCCAGTAGGAGC
CTGGTAATTGCGGAGTTCATACCTTAAGCATGCATTGACCCATCGAAGGTACACTAATTCTTCAACTTCACTGAACCTGTTCATCTGAAGTCCTTCAACTTGCTTCATTA
AGTCCTCATTTGCATGCCTTAAATTGT
Protein sequenceShow/hide protein sequence
MASSLLYSTSFFGNPIPISMPIRTGRAASTRRRRALPVQSSKEIMDQKPTRLLTFLGKGGSGKTSSAVFAAQHFALAGLRTCLVIHNQDPTSEYLLDCKIGNSPVECGHN
LSAVRLETTQMLLEPLKQLRQADSRLNMTQGVLEGVVGEELGVLPGMDSVFSVLLLEKFLGFSENMAQRDRKANYDIVIYDGISTEETIRIMGAASKARLYLKYMRSAAE
KTDLGRLATPSILRLVDEAMGISRPGSHLSGRTSTDIWEALERMLERGSSAFSEPSKFGCFIVMDPTSPASVQSALRYWGCTIQAGAQISGAFAFISSHLDAESVSRLKE
NFSPLSLAFMPKFSIGSPVDWNTVLHDASSKGPRDLLSSSKSHISSLLSPVKFDPGNRSVTLFMPGFEKSEIKLYQARSFFLLYRGGSELLVEAGDQRRVISLPKEIQGK
VGGAKFMDRSLVITMR